Table 1.
Summary for the S. shirakii transcriptome after the Illumina sequencing.
Parameters | Adult | Larvae | All |
---|---|---|---|
Raw reads | 58,063,258 | 62,093,046 | 120,156,304 |
Total clean reads | 51,299,516 | 54,109,362 | 105,408,878 |
Total Nucleotides (bp) | 4,616,956,440 | 4,869,842,580 | - |
Q20 1 percentage (%) | 97.45% | 97.42% | - |
N percentage (%) | 0 | 0 | - |
Total length of contigs (nt) | 39,306,387 | 54,595,446 | - |
Total length of unigenes (nt) | 35,186,305 | 49,464,707 | - |
Number of unigenes | 69,189 | 93,948 | 74,657 |
Mean size of contigs (nt) | 290 | 292 | - |
Mean size of unigenes (nt) | 509 | 527 | 680 |
N50 2 of contigs (nt) | 428 | 445 | |
N50 2 of unigenes (nt) | 722 | 800 | 1057 |
GC content (%) | 47.93% | 45.31% | - |
1 The percentage of sequences at a sequencing error rate of less than 1%; 2 N50 is used to measure the continuity of the assembly; the greater the value, the better the assembly is. The calculation method is as follows: first, we ranked the transcripts according to their length in descending order, then accumulated one by one to 50% of the total length of all transcripts; the length of the last accumulated transcript is N50.