Table III. Summary of TGICL merged super assembly validation using 454 sequence data.
Parameter | MIRA+Newbler | TGICL+Newbler | MIRA+TGICL | MIRA+TGICL+Newbler |
Contigs (100 bp or greater)a | 20,918 | 20,639 | 32,654 | 34,314 |
Singletons (100 bp or greater)b | 27,781 | 18,981 | 17,854 | 17,020 |
Total contigs/singletons (100 bp or greater) | 48,699 | 39,620 | 50,508 | 51,334 |
Assembly size (Mb) | 54.3 | 45.3 | 54.2 | 58.2 |
Large contigs (1,000 bp or greater) | 19,367 | 17,007 | 19,226 | 20,986 |
Maximum contig length (bp) | 16,338 | 15,659 | 16,598 | 16,333 |
Average contig length (bp) | 1,115 | 1,144 | 1,072 | 1,133.2 |
N50 (bp) | 1,494 | 1,552 | 1,406 | 1,517 |
Contigs with significant hits (%)c | 40,874 (83.9) | 26,933 (68.0) | 41,556 (82.3) | 42,488 (82.8) |
Contigs showing 80% or greater coverage (%)d | 12,896 (26.5) | 11,993 (30.3) | 12,270 (24.3) | 13,572 (26.4) |
Soybean protein hits (%)e | 20,954 (45.2) | 17,560 (37.9) | 21,188 (45.7) | 20,987 (45.2) |
Soybean proteins with 80% or greater coverage (%)f | 10,485 (22.6) | 10,314 (22.2) | 10,275 (22.2) | 10,780 (23.2) |
No. of 454 reads mapped (%) | 1,740,301 (90.1) | 1,815,449 (94.0) | 1,745,431 (90.4) | 1,840,977 (95.3) |
No. of 454 reads uniquely mapped (%) | 1,214,991 (62.9) | 1,401,916 (72.6) | 1,262,790 (65.4) | 1,179,585 (61.1) |
Contigs generated by TGICL in merged super assembly.
Contigs of primary assembly not assembled by TGICL in merged super assembly.
Contigs showing significant hits (E ≤ 1e-5) with soybean proteins.
Contigs showing 80% or greater coverage of soybean proteins.
Unique soybean proteins to which contigs show significant hits (E ≤ 1e-5).
Unique soybean proteins to which contigs show 80% or greater coverage.