. 2018 Aug 7;9:1051. doi: 10.3389/fphys.2018.01051

Table 1.

Before and after quality of pre-treatment data statistics.

Sample	Raw reads	Raw bases	Clean reads	Clean bases	Valid ratio (base)	Q30 (%)	GC content (%)
Sample_C	53,207,368	6,650,921,000	52,542,956	6,566,343,431	98.72%	97.04%	48.00%
Sample_DC	53,253,970	6,656,746,250	52,626,414	6,576,806,657	98.79%	97.11%	48.50%
Sample_DD	53,253,770	6,656,721,250	52,780,196	6,596,137,284	99.08%	97.38%	49.00%

The statistical data here are the results of read1 + read2. C: normal mouse liver tissue; DC: liver tissue of the diabetic mice; DD: liver tissue of the diabetic treated mice. Raw reads: the total number of raw reads was counted in each sample; Raw bases: the total number of bases was counted in each sample sequenced; Clean reads: the calculation method was the same as that of raw reads, except the statistical files were the filtered sequencing data. The subsequent bioinformatics analysis was based on clean reads. Clean bases: the total number of bases was counted in the clean reads, and the results were translated to values in G; Q30: the number of bases in the original data was calculated with Phred values greater than 30% GC content: the percentage of the total number of bases G and C in the original data was calculated as a percentage of the total number of bases.