Table 2. Assembly and gene prediction statistics for the draft genomes of all recognized non-encapsulated Trichinella taxa*.
Description | T4.1 ISS13 | T4.2 ISS588 | T4.3 ISS176 | T4.4 ISS470 | T4.5 ISS141 | T10 ISS1980 | T11 ISS1029 |
---|---|---|---|---|---|---|---|
Country of origin | Russia | Russia | Kazakhstan | USA | Australia | Thailand | Zimbabwe |
Host of origin | Raccoon | Brown rat | Tawny eagle | Black vulture | Spotted quoll | Human | Nile crocodile |
Genome size (bp) | 49,202,366 | 48,147,010 | 49,171,591 | 48,479,966 | 46,056,875 | 46,871,975 | 50,937,231 |
Number of scaffolds; contigs | 7,287; 8,136 | 7,547; 7,647 | 6,600; 7,483 | 6,287; 7,079 | 1,381; 2,571 | 2,552; 3,122 | 11,275; 12,675 |
N50 (bp) | 235,426 | 112,255 | 287,133 | 234,172 | 167,180 | 222,396 | 205,645 |
N90 (bp) | 60,266 | 9,250 | 69,797 | 50,288 | 50,779 | 64,723 | 8,776 |
Genome GC content (%) | 32.61 | 32.58 | 32.57 | 32.69 | 32.46 | 32.7 | 32.87 |
Coding GC content (%) | 42.64 | 42.39 | 42.44 | 42.62 | 42.24 | 42.25 | 42.41 |
Exonic proportion; including introns (%) | 33.63; 72.68 | 34.73; 71.61 | 34.24; 73.39 | 29.84; 61.96 | 33.73; 67.98 | 35.96; 76.49 | 33.92; 72.45 |
Number of putative coding genes | 12,699 | 13,754 | 12,462 | 14,708 | 11,006 | 11,854 | 14,933 |
Mean gene size (bp) | 2,955 | 2,620 | 3,053 | 2,071 | 2,944 | 3,169 | 2,591 |
Mean CDS length (bp) | 1,041 | 1,006 | 1,052 | 994 | 1,122 | 1,133 | 933 |
Mean exon count per gene | 6.58 | 6.22 | 6.66 | 5.91 | 6.64 | 6.92 | 5.87 |
Mean exon length (bp) | 210.71 | 207.52 | 217.94 | 169.78 | 222.84 | 217.53 | 209.41 |
Mean intron length (bp) | 281.73 | 255.09 | 283.56 | 217.87 | 260.03 | 281.51 | 280.34 |
Total length of coding sequences | 25,932,768 | 21,637,764 | 25,191,644 | 15,904,360 | 22,611,694 | 24,407,202 | 26,665,876 |
Repetitive sequences (%) | 18.01 | 18.41 | 17.73 | 17.71 | 16.12 | 14.47 | 20.99 |
CEG completeness: complete; partial (%) | 96.77; 97.58 | 96.37; 97.58 | 96.77; 97.58 | 97.18; 97.58 | 95.97; 97.58 | 96.37; 97.58 | 97.18; 97.58 |
CDS, coding DNA sequence; CEG, core essential gene; ISS, Istituto Superiore di Sanità.
International Trichinella Reference Center ( http://www.iss.it/site/Trichinella/) ISS codes are indicated.
*T4=T. pseudospiralis (including five distinct populations: T4.1–T4.5); T10=T. papuae; T11=T. zimbabwensis.