- 论坛徽章:
- 0
|
两道题,问了N多人,没结果,再问一下看看
//我将yuxh老大与lightspeed老大的程序运行比较如下:
//用yuxh老大的将10000个字符的data运行了一下,结果如下:
- Verted Repeat:
- Repeat:AACACAGGGA, Size: 10, Start Positioins:8416, 8471
- Repeat:TGATATATCA, Size: 10, Start Positioins:5523, 5588
- Repeat:TCGGTATTCAA, Size: 11, Start Positioins:1538, 1819
- Repeat:CCACGCTGAT, Size: 10, Start Positioins:1760, 2065
- Repeat:AGACTTTACC, Size: 10, Start Positioins:3304, 3860
- Repeat:CTCTTGTGCTTA, Size: 12, Start Positioins:4218, 4991
- Repeat:AAAACCACTGA, Size: 11, Start Positioins:3877, 4742
- Repeat:CCTCGATTGG, Size: 10, Start Positioins:601, 1599
- Repeat:TAGACGGCGC, Size: 10, Start Positioins:4364, 5496
- Repeat:CCCTGAAAAGC, Size: 11, Start Positioins:6395, 7680
- Repeat:ACAAGTCAAGG, Size: 11, Start Positioins:7969, 9299
- Repeat:CCTGTTTACC, Size: 10, Start Positioins:2056, 3714
- Repeat:TGAGCATATA, Size: 10, Start Positioins:6182, 7983
- Repeat:GCATAAAGAG, Size: 10, Start Positioins:1518, 3322
- Repeat:AGAAGGAATC, Size: 10, Start Positioins:5744, 7817
- Repeat:CCGGCCGTCA, Size: 10, Start Positioins:2697, 4773
- Repeat:AGACTCAACT, Size: 10, Start Positioins:5283, 8031
- Repeat:GATCGGTGGA, Size: 10, Start Positioins:25, 3005
- Repeat:GGTCCTGATAGCG, Size: 13, Start Positioins:1044, 4279
- Repeat:GTAAGTGTTT, Size: 10, Start Positioins:540, 3976
- Repeat:GTAAGCTAGG, Size: 10, Start Positioins:4008, 7636
- Repeat:GTTTCAGATT, Size: 10, Start Positioins:3509, 7617
- Repeat:AGATCGCCAC, Size: 10, Start Positioins:2382, 6775
- Repeat:CCGCAGCGCA, Size: 10, Start Positioins:1121, 5636
- Repeat:CGATTGGAAT, Size: 10, Start Positioins:1602, 6167
- Repeat:CACTCATGTA, Size: 10, Start Positioins:821, 5531
- Repeat:CTCGGATAGC, Size: 10, Start Positioins:2709, 9155
- Repeat:CTCGAGCCAG, Size: 10, Start Positioins:1467, 8255
- Repeat:ATTGCGACGA, Size: 10, Start Positioins:1265, 8589
- Repeat:GAAGTGGGCG, Size: 10, Start Positioins:1848, 9573
- Repeat:TTAATGCAAA, Size: 10, Start Positioins:1748, 9597
- Repeat:CGTGATTCTG, Size: 10, Start Positioins:89, 8530
- Repeat:TGCCTGTATG, Size: 10, Start Positioins:961, 9957
- Inverted Repeat:
- Repeat:ACCTTCCTTGACT, Size: 13, Start Positioins:3226, 6709
- Repeat:AACCTCGATTGG, Size: 12, Start Positioins:599, 9117
- Repeat:ATCGGAAGTC, Size: 10, Start Positioins:3140, 6039
- Repeat:TAGAGGTTGC, Size: 10, Start Positioins:4228, 4520
- Repeat:GGTACAATGC, Size: 10, Start Positioins:1698, 6789
- Repeat:GATGGCTCTGG, Size: 11, Start Positioins:3467, 4314
- Repeat:TTTCAGATTG, Size: 10, Start Positioins:3510, 3857
- Repeat:GTTGCCGATG, Size: 10, Start Positioins:2671, 4515
- Repeat:TAAATGTAGACC, Size: 12, Start Positioins:1072, 5938
- Repeat:GCCGCGGCAG, Size: 10, Start Positioins:2588, 4366
- Repeat:GCGTCTCCTT, Size: 10, Start Positioins:2871, 4048
- Repeat:CGGATTAGGAC, Size: 11, Start Positioins:1624, 5157
- Repeat:TGAATGTCTTAC, Size: 12, Start Positioins:2631, 3970
- Repeat:TGGATTTGAT, Size: 10, Start Positioins:438, 5949
- Repeat:GCGTGATTCTGC, Size: 12, Start Positioins:88, 6202
- Repeat:GGCTTATTAGCG, Size: 12, Start Positioins:1614, 4549
- Repeat:TGGGATCGAA, Size: 10, Start Positioins:1921, 4010
- Repeat:ATAGGAACCT, Size: 10, Start Positioins:2948, 2961
- Repeat:GACAATGGTC, Size: 10, Start Positioins:223, 5337
- Repeat:GTCGCGTGATT, Size: 11, Start Positioins:85, 5420
- Repeat:ACGTGGACTCG, Size: 11, Start Positioins:1530, 3932
- Repeat:CCAGAGAGAGG, Size: 11, Start Positioins:628, 3821
- Repeat:ATGGTAGGCTT, Size: 11, Start Positioins:104, 4258
- Repeat:TAATTCTAAC, Size: 10, Start Positioins:1652, 2338
- Repeat:TACTTAGCCAA, Size: 11, Start Positioins:1158, 2243
- Repeat:TCGACAGTAA, Size: 10, Start Positioins:565, 2138
- Repeat:ATGTCCGTGG, Size: 10, Start Positioins:509, 959
- Repeat:TAGGCTTAAA, Size: 10, Start Positioins:108, 1282
- real 0m1.382s
- user 0m0.010s
- sys 0m0.020s
复制代码
//我使用同样的数据集,也就是yuxh老大程序中产生的10000个随机字符,用lightspeed老大的程序运行结果如下:
- $ time ./1 datafile
- ------------------Repeat Match Line# 1 --------------------
- Repeat: GGTCCTGATAGCG, Size: 13, Start Positions: 1045,4280
- Repeat: CTCTTGTGCTTA, Size: 12, Start Positions: 4219,4992
- Repeat: TCGGTATTCAA, Size: 11, Start Positions: 1539,1820
- Repeat: AAAACCACTGA, Size: 11, Start Positions: 3878,4743
- Repeat: CCCTGAAAAGC, Size: 11, Start Positions: 6396,7681
- Repeat: ACAAGTCAAGG, Size: 11, Start Positions: 7970,9300
- Repeat: ATTGCGACGA, Size: 10, Start Positions: 1266,8590
- Repeat: CTCGAGCCAG, Size: 10, Start Positions: 1468,8256
- Repeat: GCATAAAGAG, Size: 10, Start Positions: 1519,3323
- Repeat: CGATTGGAAT, Size: 10, Start Positions: 1603,6168
- Repeat: TTAATGCAAA, Size: 10, Start Positions: 1749,9598
- Repeat: CCACGCTGAT, Size: 10, Start Positions: 1761,2066
- Repeat: GAAGTGGGCG, Size: 10, Start Positions: 1849,9574
- Repeat: AGATCGCCAC, Size: 10, Start Positions: 2383,6776
- Repeat: CCGGCCGTCA, Size: 10, Start Positions: 2698,4774
- Repeat: CTCGGATAGC, Size: 10, Start Positions: 2710,9156
- Repeat: AGACTTTACC, Size: 10, Start Positions: 3305,3861
- Repeat: GTTTCAGATT, Size: 10, Start Positions: 3510,7618
- Repeat: GTAAGCTAGG, Size: 10, Start Positions: 4009,7637
- Repeat: TAGACGGCGC, Size: 10, Start Positions: 4365,5497
- Repeat: AGACTCAACT, Size: 10, Start Positions: 5284,8032
- Repeat: TGATATATCA, Size: 10, Start Positions: 5524,5589
- Repeat: AGAAGGAATC, Size: 10, Start Positions: 5745,7818
- Repeat: TGAGCATATA, Size: 10, Start Positions: 6183,7984
- real 0m14.341s
- user 0m14.140s
- sys 0m0.050s
复制代码
//不得不佩服两位老大结果一样,呵呵,只是yuxh老大的位置计算的前了一位。不过时间上可是C明显要快,尤其当数据集大的时候。
//不过,还是不明白,为什么重复的片段大多集中在10-13呢,连14都很少见,奇怪。 |
|