Table 1.
Sample | Raw reads | Raw bases | Clean reads | Clean bases | Valid ratio (base) | Q30 (%) | GC content (%) |
---|---|---|---|---|---|---|---|
Sample_C | 53,207,368 | 6,650,921,000 | 52,542,956 | 6,566,343,431 | 98.72% | 97.04% | 48.00% |
Sample_DC | 53,253,970 | 6,656,746,250 | 52,626,414 | 6,576,806,657 | 98.79% | 97.11% | 48.50% |
Sample_DD | 53,253,770 | 6,656,721,250 | 52,780,196 | 6,596,137,284 | 99.08% | 97.38% | 49.00% |
The statistical data here are the results of read1 + read2. C: normal mouse liver tissue; DC: liver tissue of the diabetic mice; DD: liver tissue of the diabetic treated mice. Raw reads: the total number of raw reads was counted in each sample; Raw bases: the total number of bases was counted in each sample sequenced; Clean reads: the calculation method was the same as that of raw reads, except the statistical files were the filtered sequencing data. The subsequent bioinformatics analysis was based on clean reads. Clean bases: the total number of bases was counted in the clean reads, and the results were translated to values in G; Q30: the number of bases in the original data was calculated with Phred values greater than 30% GC content: the percentage of the total number of bases G and C in the original data was calculated as a percentage of the total number of bases.