Is there a mistake in your own data? Why does scaffold_2 appear twice? Aren't we only supposed to consider the first 4 columns?
$ awk '{if($4>c[$1]||($4==c[$1] && ($3-$2)>(b[$1]-a[$1]))){a[$1]=$2;b[$1]=$3;c[$1]=$4;d[$1]=$5" "$6}}END{OFS=" ";for(i in c){print i,a[i],b[i],c[i],d[i]}}' a.txt
scaffold_1 56237 56754 702 - scaffold_26_37694_44393
scaffold_2 56435 56754 425 - scaffold_122_71161_78289
scaffold_3 87488 97748 505 + scaffold_3_441083_455311
scaffold_4 99161 99656 743 + scaffold_429_32312_46414
scaffold_5 99296 99522 302 + scaffold_384_42367_55456
scaffold_6 99322 99788 697 - scaffold_167_387886_401156
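For readability, here is the same selection logic spelled out as a small sketch: for each name in column 1, keep the row with the largest column 4, and on a tie keep the row with the longer interval ($3 - $2). The function name best_per_scaffold and the array names are my own, not from the thread. Two deliberate differences from the one-liner above, both assumptions on my part: a first-seen check so that rows whose column 4 is zero or negative are not silently dropped, and a final sort, because awk's for (i in c) does not guarantee any output order.

```shell
# Hypothetical helper: reads whitespace-separated rows from files or stdin.
best_per_scaffold() {
    awk '
    {
        key = $1
        span = $3 - $2
        # Take this row if the key is new, if column 4 is larger, or if
        # column 4 ties and the interval ($3 - $2) is longer.
        if (!(key in score) ||
            $4 + 0 > score[key] + 0 ||
            ($4 + 0 == score[key] + 0 && span > best_span[key])) {
            first[key] = $2
            last[key] = $3
            score[key] = $4          # keep the original text of column 4
            best_span[key] = span
            rest[key] = $5 " " $6
        }
    }
    END {
        for (key in score)
            print key, first[key], last[key], score[key], rest[key]
    }' "$@" | sort -t_ -k2,2n        # order by the number after "scaffold_"
}
```

Call it as best_per_scaffold a.txt. The sort key assumes every name in column 1 has the form scaffold_N; drop the sort if your names differ.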