Skip to main content
. 2020 Apr 29;7:131. doi: 10.1038/s41597-020-0470-2

Online-only Table 1.

Comparison of the two O. granulata genome assemblies.

Entries This study IRGC Acc. No. 10211711
Genome sequencing Source country China India
Genome size (Mb) * 792 785
Sequencing technology Illumina; Hi-C Illumina; PacBio
Raw Illumina data (Gb) 133.38 105.157
Sequence coverage (×)** 167 131
Raw Hi-C/PacBio data (Gb) 109.41 16.615
Sequence coverage (×)** 137 21
Assembly statistics Assembly size (Mb) 736.66 776.96
Whole-genome coverage (%) 93 98.1
Contig N50 (kb) 43.9 262.05
Contig number (#) 29,963 4,618
Scaffold N50 (kb) 916.3 262.05
Scaffold number (#) 2,393 4,618
Largest scaffold (Mb) 4.04 1.59
Length of anchored scaffolds (Mb) 723.2
Anchoring rate (%) 98.2
GC content (%) 45.87 46.32
Gene annotation Gene number (#) 40,131 40,116
Functionally annotated gene number (#) 34,436 33,901
Complete BUSCO (%) 96.53 95
Total gene length (Mb) 125.53 102.74
Average gene length (bp) 3,152 2,561.19
Total CDS length (bp) 35.78 40.61
Average CDS length (bp) 892 1,012.28
Number of exons (#) 165,272 162,369
Average exon length (bp) 283 250.1
Average exons per gene 4.1 4.05
Total intron length (Mb) 78.84 62.14
Number of introns (#) 125,141 122,253
Average intron length (bp) 630 508
ncRNA annotation tRNA length (bp) 75,160 82,079
rRNA length (bp) 133,694 99,297
miRNA length (bp) 29,471 30,787
Repeat sequence annotation Total repeat length (Mb) 456.567 528.04
Repeat percentage (%) 61.98 67.96
DNA transposon length (bp) 72,407,795 68,393,246
LINE length (bp) 1,459,061 7,169,231
SINE length (bp) 125,812 59,741
LTR length (bp) 374,444,649 460,976,797
Copia (bp) 41,126,935 54,901,814
Gypsy (bp) 278,699,663 407,036,517
Others (bp) 54,618,051 28,158,459
Other length (bp) 8,130,028 1,696,213

*The genome size was estimated by the k-mer method;

**The genome size was estimated to be 800 Mb.