Table 1. Transcriptome summary of the adult stage of T. multiceps and detailed bioinformatics annotations.
Raw sequences and assembly statistics | |
Raw reads | 28,320,027 |
Clean reads | 27,447,770, each 90 bp in length |
GC content | 49.04% |
Contigs (≥300 bp) (mean length; max; N50) | 53,568 (974 bp; 11,875 bp; 1,268 bp) |
Unigenes (≥300 bp) (mean length; max; N50) | 31,282 (920 bp; 11,875 bp; 1,206 bp) |
Bioinformatics annotations of Tm unigenes | |
Gene annotation against animal proteins of Nr | 17,618 (56.3%) |
Gene annotation against Drosophila proteins of Nr | 5,925 (18.9%) |
Gene annotation against UniProtKB/Swiss-Prot | 14,350 (45.9%) |
Gene annotation against UniProtKB/TrEMBL | 16,286 (52.1%) |
Gene annotation against COG | 6,653 (21.3%), 24 categories |
Gene annotation against KEGG | 11,645 (37.3%), 213 pathways |
All unigenes matching Nr, UniProtKB, COG, KEGG | 17,768 (56.8%) |
Gene annotation against InterPro | 25,457 (81.38%), 4,562 domains/families |
Gene annotation against Pfam | 12,909 (41.27%), 3,396 domains/families |
Predicted coding sequence (CDS) | 20,896 (66.8%) |
All annotated unigenes | 26,110 (83.47%) |
Unigenes matching all seven databases | 5,509 (17.61%) |
GO annotation for Nr protein hits | 4,706 (15.04%), 2,360 GO terms, 48 sub-categories |
Biological process | 2,315 (1,578 GO terms), 27 sub-categories |
Cellular component | 3,354 (270 GO terms), 10 sub-categories |
Molecular function | 2,809 (512 GO terms), 11 sub-categories |
Tm, T. multiceps.
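
The percentages in Table 1 appear to be expressed relative to the 31,282 unigenes; the table does not state this, so it is an assumption here. The minimal sketch below recomputes a few of them under that assumption and checks that the GO term and sub-category counts of the three ontologies add up to the reported totals. The helper name `pct` is hypothetical and used only for illustration.

```python
# Assumption: every percentage in Table 1 uses the unigene count as denominator.
TOTAL_UNIGENES = 31282


def pct(count, ndigits=2, total=TOTAL_UNIGENES):
    """Return count as a percentage of total, rounded to ndigits decimals."""
    return round(100 * count / total, ndigits)


# Counts copied from Table 1.
annotated = {
    "Nr (animal proteins)": 17618,
    "InterPro": 25457,
    "Pfam": 12909,
    "All annotated unigenes": 26110,
}

for name, count in annotated.items():
    print(f"{name}: {count} ({pct(count)}%)")

# GO terms and sub-categories per ontology
# (biological process, cellular component, molecular function).
go_terms = [1578, 270, 512]
go_subcategories = [27, 10, 11]
print("GO terms:", sum(go_terms), "sub-categories:", sum(go_subcategories))
```

Under this assumption the recomputed values agree with the table at the precision it reports (one or two decimal places, depending on the row).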