Skip to main content
. 2022 May 9;13:879337. doi: 10.3389/fimmu.2022.879337

Table 1.

After filtering the original data, checking the sequencing error rate, and checking the GC content distribution, we obtain clean reads for subsequent analysis. The data is summarized as shown in the table below.

Sample Library Raw_reads clean_reads Clean_bases Error_rate Q20 Q30 GC_pct
tVaS1 FRAS202156226-1r 43614846 42016796 6.3G 0.03 97.94 93.92 37.1
tVaS2 FRAS202156227-1r 45933166 44134864 6.62G 0.03 97.58 93.11 37.7
tVaS3 FRAS202156228-1r 45061154 43428666 6.51G 0.02 98.07 94.26 37.71
tVaR1 FRAS202156223-1r 43504158 42181696 6.33G 0.03 97.97 93.96 35.83
tVaR2 FRAS202156224-1r 44238968 42270438 6.34G 0.03 97.72 93.37 37.54
tVaR3 FRAS202156225-1r 46739216 43630738 6.54G 0.03 97.79 93.57 37.48
tCon1 FRAS202156229-1r 44446884 42744414 6.41G 0.02 98.04 94.16 38.45
tCon2 FRAS202156230-1r 45099750 43562032 6.53G 0.03 97.95 94 37.95
tCon3 FRAS202156231-1r 47643512 45841434 6.88G 0.03 98.01 94.12 38.14

Sample: Sample name.

Library: Library number.

Raw_reads: The number of reads in the raw data.

Clean_reads: The number of reads after filtering the original data.

Clean_bases: The number of bases after filtering the original data (clean base=clean reads*150bp).

Error_rate: The overall sequencing error rate of the data.

Q20: The percentage of bases with a Phred value greater than 20 to the total bases.

Q30: The percentage of bases with a Phred value greater than 30 to the total bases.

GC_pct: the percentage of G and C in the four bases in clean reads.