Skip to main content
. 2020 Nov 2;8:e10223. doi: 10.7717/peerj.10223

Table 1. Summary of quality of RNA-Seq original sequencing data.

Raw reads column lists each read in the FASTQ format file by four lines and a unit, and counts the number of sequences in each file. Clean reads column lists the filtered sequencing data of raw reads, and the subsequent bioinformation analysis is based on clean reads. Clean bases column lists the number of clean reads multiply the length, and G represents the data size of clean reads. Q20 column listed the percentages of bases with phred quality score greater than 20. Q30 column listed the percentages of bases with phred quality score greater than 20. GC content column lists the total number of bases G and C as a percentage of the total number of bases. Total mapped column lists sequence count statistics that can be mapped to a reference sequence.

Sample name Raw reads Clean reads Cleanbases Q20(%) Q30(%) GC content(%) Total mapped(%)
larva_1 45,921,768 45,216,478 6.78G 97.84 93.51 39.10 89.14%
larva_2 45,488,706 44,719,458 6.71G 98.15 94.41 39.17 87.93%
larva_3 46,968,766 43,815,018 6.57G 98.20 94.52 39.61 87.88%
pupa_1 51,547,360 50,918,894 7.64G 97.82 93.42 40.09 89.11%
pupa_2 49,213,402 48,731,238 7.31G 97.85 93.58 40.66 86.42%
pupa_3 47,589,126 45,987,494 6.9G 97.88 93.68 41.13 87.12%
female_1 65,486,770 64,749,568 9.71G 97.47 92.46 32.48 91.53%
female_2 54,512,618 51,662,048 7.75G 97.20 92.14 34.17 87.69%
female_3 49,813,910 48,435,084 7.27G 97.31 92.31 34.11 88.84%
male_1 62,939,496 60,971,604 9.15G 97.51 92.88 38.99 84.75%
male_2 69,968,690 69,071,824 10.36G 97.53 92.94 38.33 85.44%
male_3 52,728,692 51,704,548 7.76G 97.66 93.00 34.71 90.75%