2018 Oct 8;5:180200. doi: 10.1038/sdata.2018.200

Table 3. Sequence datasets produced in this study.

| Dataset | Number of sequences | Core genes (a), only ‘Complete’: CVG (Vertebrata BUSCO) | Core genes (a), including ‘Fragmented’: CVG (Vertebrata BUSCO) | Core genes (a), ‘Missing’: CVG (Vertebrata BUSCO) | N50 contig length (bp) | Data Records |
|---|---|---|---|---|---|---|
| Transcriptome assembly - all libraries | 1,081,614 | 218 (2407) | 232 (2524) | 1 (62) | 992 | Data Citation 2 |
| Transcriptome assembly - caudal | 498,477 | 226 (2423) | 229 (2496) | 4 (90) | 1,679 | Data Citation 5 |
| Transcriptome assembly - cloaca | 377,609 | 224 (2371) | 230 (2476) | 3 (110) | 1,744 | Data Citation 6 |
| Transcriptome assembly - trunk | 448,394 | 225 (2389) | 232 (2489) | 1 (97) | 1,560 | Data Citation 7 |
| Transcriptome assembly - head | 342,765 | 221 (2361) | 228 (2454) | 5 (132) | 1,889 | Data Citation 8 |
| Protein-coding assembly - all libraries | 167,783 | 218 (2401) | 232 (2514) | 1 (72) | 1,782 | Data Citation 3 |
| Non-redundant peptides - all libraries | 79,083 | 219 (2400) | 233 (2514) | 0 (72) | N/A | Data Citation 4 |

(a) See the existing literature (ref. 16) for the definitions of ‘complete’, ‘fragmented’ and ‘missing’ in ortholog detection by BUSCO. The CVG set consists of 233 orthologs in total, while Vertebrata BUSCO has 2,586 orthologs.
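
The counts in Table 3 can be turned into completeness percentages with simple arithmetic. The following is a minimal sketch, not part of the original study: it assumes that the "Including ‘Fragmented’" column counts orthologs detected as either complete or fragmented, so that this count plus the ‘Missing’ count equals the reference set size (233 for CVG, 2,586 for Vertebrata BUSCO). The names `TABLE3`, `percent_detected` and `is_consistent` are hypothetical and chosen only for illustration.

```python
# Reference set sizes, as stated in the table footnote.
CVG_TOTAL = 233
BUSCO_TOTAL = 2586

# Per dataset: (detected including 'Fragmented', 'Missing'), copied from Table 3.
TABLE3 = {
    "Transcriptome assembly - all libraries": {"cvg": (232, 1), "busco": (2524, 62)},
    "Transcriptome assembly - caudal": {"cvg": (229, 4), "busco": (2496, 90)},
    "Transcriptome assembly - cloaca": {"cvg": (230, 3), "busco": (2476, 110)},
    "Transcriptome assembly - trunk": {"cvg": (232, 1), "busco": (2489, 97)},
    "Transcriptome assembly - head": {"cvg": (228, 5), "busco": (2454, 132)},
    "Protein-coding assembly - all libraries": {"cvg": (232, 1), "busco": (2514, 72)},
    "Non-redundant peptides - all libraries": {"cvg": (233, 0), "busco": (2514, 72)},
}


def percent_detected(detected: int, total: int) -> float:
    """Share of the reference ortholog set that was detected, in percent."""
    return 100.0 * detected / total


def is_consistent(detected: int, missing: int, total: int) -> bool:
    """Assumed relation: detected (complete or fragmented) + missing == total."""
    return detected + missing == total


for name, counts in TABLE3.items():
    cvg_detected, cvg_missing = counts["cvg"]
    busco_detected, busco_missing = counts["busco"]
    # Sanity-check each row against the reference set sizes.
    assert is_consistent(cvg_detected, cvg_missing, CVG_TOTAL), name
    assert is_consistent(busco_detected, busco_missing, BUSCO_TOTAL), name
    print(
        f"{name}: CVG {percent_detected(cvg_detected, CVG_TOTAL):.1f}%, "
        f"Vertebrata BUSCO {percent_detected(busco_detected, BUSCO_TOTAL):.1f}%"
    )
```

Every row of the table satisfies the assumed relation, which supports reading the "Including ‘Fragmented’" column as the sum of complete and fragmented detections.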