Table 1. Summary of quality of RNA-Seq original sequencing data.
Raw reads column lists each read in the FASTQ format file by four lines and a unit, and counts the number of sequences in each file. Clean reads column lists the filtered sequencing data of raw reads, and the subsequent bioinformation analysis is based on clean reads. Clean bases column lists the number of clean reads multiply the length, and G represents the data size of clean reads. Q20 column listed the percentages of bases with phred quality score greater than 20. Q30 column listed the percentages of bases with phred quality score greater than 20. GC content column lists the total number of bases G and C as a percentage of the total number of bases. Total mapped column lists sequence count statistics that can be mapped to a reference sequence.
Sample name | Raw reads | Clean reads | Cleanbases | Q20(%) | Q30(%) | GC content(%) | Total mapped(%) |
---|---|---|---|---|---|---|---|
larva_1 | 45,921,768 | 45,216,478 | 6.78G | 97.84 | 93.51 | 39.10 | 89.14% |
larva_2 | 45,488,706 | 44,719,458 | 6.71G | 98.15 | 94.41 | 39.17 | 87.93% |
larva_3 | 46,968,766 | 43,815,018 | 6.57G | 98.20 | 94.52 | 39.61 | 87.88% |
pupa_1 | 51,547,360 | 50,918,894 | 7.64G | 97.82 | 93.42 | 40.09 | 89.11% |
pupa_2 | 49,213,402 | 48,731,238 | 7.31G | 97.85 | 93.58 | 40.66 | 86.42% |
pupa_3 | 47,589,126 | 45,987,494 | 6.9G | 97.88 | 93.68 | 41.13 | 87.12% |
female_1 | 65,486,770 | 64,749,568 | 9.71G | 97.47 | 92.46 | 32.48 | 91.53% |
female_2 | 54,512,618 | 51,662,048 | 7.75G | 97.20 | 92.14 | 34.17 | 87.69% |
female_3 | 49,813,910 | 48,435,084 | 7.27G | 97.31 | 92.31 | 34.11 | 88.84% |
male_1 | 62,939,496 | 60,971,604 | 9.15G | 97.51 | 92.88 | 38.99 | 84.75% |
male_2 | 69,968,690 | 69,071,824 | 10.36G | 97.53 | 92.94 | 38.33 | 85.44% |
male_3 | 52,728,692 | 51,704,548 | 7.76G | 97.66 | 93.00 | 34.71 | 90.75% |