- 论坛徽章:
- 1
|
本帖最后由 56836430 于 2015-08-24 18:07 编辑
有许多文件命名为:1.txt, 2.txt, 3.txt..... 10000.txt
文件的格式均为
>Ath|AT1G06290.1 pacid=19658332 transcript=AT1G06290.1 locus=AT1G06290 ID=AT1G06290.1.TAIR10 annot-version=TAIR10
-----------------------------------------------MSDNRALRRAHVL
ANHILQ--SNPPSSN---PS----LSRELCLQYSPPELNE-SYGFDVKEMRKLLDGHNVV
DRDWIYGLMMQSNLFNRKERGGKIFVSPDYNQTMEQQREITMKRIWYLLENGVFKGWLTE
TGPEAEL-RKLALLEVCGIYDHSVSIKVGVHFFLWGNAVKFFGTKRHHEKWLKNTEDYVV
KGCFAMTELGHGSNVRGIETVTTYDPKTEEFVINTPCESAQKYWIGGAANHATHTIVFSQ
>Ath|AT1G06310.1 pacid=19653225 transcript=AT1G06310.1 locus=AT1G06310 ID=AT1G06310.1.TAIR10 annot-version=TAIR10
-----------------------------------------------MSENVELRRAHIL
ANHILR--SPRPSSN---PS----LTPEVCFQYSPPELNE-SYGFEVKEMRKLLDGHNLE
ERDWLYGLMMQSNLFNPKQRGGQIFVSPDYNQTMEQQRQISMKRIFYLLEKGVFQGWLTE
TGPEAEL-KKFALYEVCGIYDYSLSAKLGVHFLLWGNAVKFFGTKRHHEKWLKDTEDYVV
KGCFAMTELGHGTNVRGIETVTTYDPTTEEFVINTPCESAQKYWIGEAANHANHAIVISQ
L
>lcl|Cyc_c38218_g1_i1_m.97294 unnamed protein product
---------------------------------------------MENSENRIARRTAIL
AAHFPN--SNTSNGI---SS----LHRSPCLRYYPPETASGKLSFDINAMRELMDGHNIE
DRDEIFKLIISSDVFCPRMVAGQVYVIPDYNKPMEHQREMTLKRILYLLEKGIFKGWLTG
TTIEQKM-RRFAIVECLGMYDHSLALKLGVHF-LWGDVLRSLGTKQHQEKFLRDSEEYIV
KGSFAMTELGHGSNVRGIETMATYDPSTQEFIINTPCETAQKYWIGGVVNHATHAIVFSQ
---------------------------------------------------------------------------------------------------------------
目的是想将文件名添加到每个文件的">"后面,并将以>开头的行空格后面的内容删掉。例如文件1.txt
>1|Ath|AT1G06290.1
-----------------------------------------------MSDNRALRRAHVL
ANHILQ--SNPPSSN---PS----LSRELCLQYSPPELNE-SYGFDVKEMRKLLDGHNVV
DRDWIYGLMMQSNLFNRKERGGKIFVSPDYNQTMEQQREITMKRIWYLLENGVFKGWLTE
TGPEAEL-RKLALLEVCGIYDHSVSIKVGVHFFLWGNAVKFFGTKRHHEKWLKNTEDYVV
KGCFAMTELGHGSNVRGIETVTTYDPKTEEFVINTPCESAQKYWIGGAANHATHTIVFSQ
>1|Ath|AT1G06310.1
-----------------------------------------------MSENVELRRAHIL
ANHILR--SPRPSSN---PS----LTPEVCFQYSPPELNE-SYGFEVKEMRKLLDGHNLE
ERDWLYGLMMQSNLFNPKQRGGQIFVSPDYNQTMEQQRQISMKRIFYLLEKGVFQGWLTE
TGPEAEL-KKFALYEVCGIYDYSLSAKLGVHFLLWGNAVKFFGTKRHHEKWLKDTEDYVV
KGCFAMTELGHGTNVRGIETVTTYDPTTEEFVINTPCESAQKYWIGEAANHANHAIVISQ
L
>1|lcl|Cyc_c38218_g1_i1_m.97294
---------------------------------------------MENSENRIARRTAIL
AAHFPN--SNTSNGI---SS----LHRSPCLRYYPPETASGKLSFDINAMRELMDGHNIE
DRDEIFKLIISSDVFCPRMVAGQVYVIPDYNKPMEHQREMTLKRILYLLEKGIFKGWLTG
TTIEQKM-RRFAIVECLGMYDHSLALKLGVHF-LWGDVLRSLGTKQHQEKFLRDSEEYIV
KGSFAMTELGHGSNVRGIETMATYDPSTQEFIINTPCETAQKYWIGGVVNHATHAIVFSQ
文件2.txt输出结果为:
>2|Ath|AT1G06290.1
-----------------------------------------------MSDNRALRRAHVL
ANHILQ--SNPPSSN---PS----LSRELCLQYSPPELNE-SYGFDVKEMRKLLDGHNVV
DRDWIYGLMMQSNLFNRKERGGKIFVSPDYNQTMEQQREITMKRIWYLLENGVFKGWLTE
TGPEAEL-RKLALLEVCGIYDHSVSIKVGVHFFLWGNAVKFFGTKRHHEKWLKNTEDYVV
KGCFAMTELGHGSNVRGIETVTTYDPKTEEFVINTPCESAQKYWIGGAANHATHTIVFSQ
>2|Ath|AT1G06310.1
-----------------------------------------------MSENVELRRAHIL
ANHILR--SPRPSSN---PS----LTPEVCFQYSPPELNE-SYGFEVKEMRKLLDGHNLE
ERDWLYGLMMQSNLFNPKQRGGQIFVSPDYNQTMEQQRQISMKRIFYLLEKGVFQGWLTE
TGPEAEL-KKFALYEVCGIYDYSLSAKLGVHFLLWGNAVKFFGTKRHHEKWLKDTEDYVV
KGCFAMTELGHGTNVRGIETVTTYDPTTEEFVINTPCESAQKYWIGEAANHANHAIVISQ
L
>2|lcl|Cyc_c38218_g1_i1_m.97294
---------------------------------------------MENSENRIARRTAIL
AAHFPN--SNTSNGI---SS----LHRSPCLRYYPPETASGKLSFDINAMRELMDGHNIE
DRDEIFKLIISSDVFCPRMVAGQVYVIPDYNKPMEHQREMTLKRILYLLEKGIFKGWLTG
TTIEQKM-RRFAIVECLGMYDHSLALKLGVHF-LWGDVLRSLGTKQHQEKFLRDSEEYIV
KGSFAMTELGHGSNVRGIETMATYDPSTQEFIINTPCETAQKYWIGGVVNHATHAIVFSQ
。。。。。。
求帮忙求帮忙! |
|