2015 Aug 4;4:36. doi: 10.1186/s13742-015-0075-4

Table 1.

Transcriptome assembly and annotation statistics for F. arisanus

Number of read pairs used in assembly (SRA accession number)
Larvae (SRA: SRX689040) 53 174 809
Pupae (SRA: SRX689038) 54 026 754
Adult male (SRA: SRX689037) 53 724 417
Adult female (SRA: SRX689041) 49 823 168
Total 210 749 148
Normalized read pairs (in silico normalization) 12 214 054
Unfiltered assembly
Number of unigenes 57577
N50 unigene length (longest transcript/unigene) (bp) 2162
Sum longest transcript/unigene (Mb) 52.23
Number of transcripts 86118
N50 transcript length (bp) 3174
Sum transcript length (Mb) 117.14
Transcripts per unigene 1.50
GC % 40.45
Filtered de novo assembly
Number of unigenes 8307
N50 unigene length (longest transcript/unigene) (bp) 4751
Sum longest transcript/unigene (Mb) 27.13
Number of transcripts 15346
N50 transcript length (bp) 4570
Sum transcript length (Mb) 50.62
Isoforms per unigene 1.85
GC % 41.37
N50 protein length (amino acids) 282
Number of proteins with complete ORF (%) 11115 (72.4)
Annotation statistics
Number of proteins with Pfam domains identified 11978
Number of proteins with gene ontology terms 9938
Number of proteins with gene names 14600
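
The derived values in Table 1 (the read-pair total, transcripts or isoforms per unigene, and the percentage of proteins with a complete ORF) can be rechecked from the raw counts. The sketch below does that in Python using only the figures listed in the table. The helper names (`per_unigene`, `n50`) are illustrative and not from the source. The `n50` function follows the common definition of N50 (the length L at which sequences of length L or longer make up at least half of the total assembled length); the table does not state which tool produced its N50 values, so this is an assumption, and the percentage denominator (the 15346 filtered transcripts) is likewise inferred from the numbers rather than stated.

```python
# Counts copied from Table 1; names below are illustrative, not from the source.
read_pairs = {
    "larvae": 53_174_809,
    "pupae": 54_026_754,
    "adult_male": 53_724_417,
    "adult_female": 49_823_168,
}
total_read_pairs = sum(read_pairs.values())


def per_unigene(transcripts: int, unigenes: int) -> float:
    """Mean number of transcripts (isoforms) per unigene, rounded to 2 decimals."""
    return round(transcripts / unigenes, 2)


unfiltered_ratio = per_unigene(86_118, 57_577)  # unfiltered assembly
filtered_ratio = per_unigene(15_346, 8_307)     # filtered de novo assembly

# Assumption: the percentage is taken over the 15346 filtered transcripts.
complete_orf_pct = round(100 * 11_115 / 15_346, 1)


def n50(lengths):
    """Common N50 definition: the length L such that sequences of
    length >= L account for at least half of the summed length."""
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half:
            return length
    raise ValueError("lengths must be non-empty")


print(total_read_pairs, unfiltered_ratio, filtered_ratio, complete_orf_pct)
```

With the table's counts, the recomputed total and ratios agree with the reported 210 749 148, 1.50, 1.85 and 72.4. The N50 values themselves cannot be rechecked without the individual sequence lengths, which the table does not provide.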