Table 3. Sequence datasets produced in this study.
Dataset | Number of sequences |
Number of core genes
a
CVG (Vertebrata BUSCO) |
N50 contig length (bp) | Data Records | ||
---|---|---|---|---|---|---|
Only ‘Complete’ | Including ‘Fragmented’ | ‘Missing’ | ||||
Transcriptome assembly - all libraries | 1,081,614 | 218 (2407) | 232 (2524) | 1 (62) | 992 | Data Citation 2 |
Transcriptome assembly - caudal | 498,477 | 226 (2423) | 229 (2496) | 4 (90) | 1,679 | Data Citation 5 |
Transcriptome assembly - cloaca | 377,609 | 224 (2371) | 230 (2476) | 3 (110) | 1,744 | Data Citation 6 |
Transcriptome assembly -trunk | 448,394 | 225 (2389) | 232 (2489) | 1 (97) | 1,560 | Data Citation 7 |
Transcriptome assembly - head | 342,765 | 221 (2361) | 228 (2454) | 5 (132) | 1,889 | Data Citation 8 |
Protein-coding assembly - all libraries | 167,783 | 218 (2401) | 232 (2514) | 1 (72) | 1,782 | Data Citation 3 |
Non-redundant peptides - all libraries | 79,083 | 219 (2400) | 233 (2514) | 0 (72) | N/A | Data Citation 4 |
aSee the existing literature16 for the definitions of ‘complete’, ‘fragmented’ and ‘missing’ in ortholog detection by BUSCO. CVGs consists of 233 orthologs in total, while Vertebrata BUSCO has 2,586 orthologs.