Abstract
Multiplex quantitative polymerase chain reaction (qPCR) has found an increasing range of applications. The construction of a reliable and dynamic mathematical model for multiplex qPCR that analyzes the effects of interactions between variables is therefore especially important. This work aimed to analyze the effects of interactions between variables through response surface method (RSM) for uni- and multiplex qPCR, and further optimize the parameters by constructing two mathematical models via RSM and back-propagation neural network-genetic algorithm (BPNN-GA) respectively. The statistical analysis showed that Mg2+ was the most important factor for both uni- and multiplex qPCR. Dynamic models of uni- and multiplex qPCR could be constructed using both RSM and BPNN-GA methods. But RSM was better than BPNN-GA on prediction performance in terms of the mean absolute error (MAE), the mean square error (MSE) and the Coefficient of Determination (R2). Ultimately, optimal parameters of uni- and multiplex qPCR were determined by RSM.
Introduction
Real-time quantitative PCR (qPCR) can quantitatively analyze a reaction template (nucleic acid) via the real-time continuous monitoring of the fluorescence signal generated from each cycle of the PCR amplification process. This technique has the advantages of being highly specific, highly sensitive, reproducible, accurately quantifiable, and highly automatable[1, 2]. Therefore, real-time PCR has been widely applied in fields such as molecular diagnostics, life sciences, agriculture, medicine, and food science[3–5]. Despite this broad application, there are still difficulties in the practical implementation of the technique, especially in multiplex qPCR systems. Due to the addition of multiple pairs of primers and probes, changes of factors such as annealing temperature, elongation temperature, and number of cycles can result in non-specific products and different effects on amplification. Furthermore, there is a lack of uniformity in data interpretation, and standardization needs to be improved[6]. These issues are mainly caused by the external factors that affect qPCR amplification and the practical difficulties in controlling these factors.
Prior studies[7–9]have attempted to optimize various qPCR parameters to ensure accuracy and stability. However, most of these studies employed single-factor tests when optimizing qPCR systems[9]. Although these studies determined the effects of individual factors on qPCR amplification, they required many and repeated tests and overlooked the effects of parameter interactions. The determined parameters may not have been optimized yet.
The recent studies to optimize qPCR parameters have focused on uniplex qPCR. The wide range of application for multiplex qPCR has brought increased attention to improving its amplification efficiency and optimizing the parameters. The construction of a reliable and dynamic mathematical model for multiplex qPCR that can be used to analyze the effects of interactions between various parameters is urgently needed. Response surface methodology (RSM) uses statistical and mathematical analysis to design experiments and involves a combination of experimental design techniques, including Plackett-Burman (PB) design, central composite design (CCD), Box-Behnken design, and others. RSM can evaluate the effects of variables on test results (yield) by analyzing experimental data. Additionally, RSM can analyze the interactions between various factors, and can be used to construct a mathematical model applicable for determining optimal conditions or ranges for a desired response[10]. RSM overcomes the disadvantages of single-factor tests, including the time required, the limited number of factors investigated, and the production of unreliable conclusions[11]. RSM has the advantages of being easy to implement and allowing investigation of the interplay of different factors. For these reasons, this methodology is commonly used in various fields such as pharmacy, architectural science, biology, agriculture, and microbiology[12–14].
As an alternative to RSM, artificial neural networks (ANNs)[15] are an integral component of artificial intelligence that can be applied for data analysis and prediction. As typical ANNs, back-propagation neural networks (BPNN) optimize and monitor the performance of neural networks under learning rules. BPNN can approximate any continuous function and have robust non-linear mapping capabilities. Genetic algorithms (GA) simulate the principle of survival of the fittest in nature and search for the most globally optimized combination of parameters for a given system. These algorithms can be used for discontinuous, indistinguishable, random, or highly nonlinear target functions. Therefore, the prediction models of ANNs are developed to analyze the obtained experimental data, while GA is utilized to optimize experimental parameters for the above well-established models. An increasing number of studies have focused on optimizing experimental conditions via ANNs and RSM in recent years[16–18]. However, there has been little work on the optimization of qPCR systems using these two methods. A few studies addressed that, as a predictive tool, ANNs can theoretically be used to understand complex systems and assist in optimizing conditions for biological experiments and related techniques such as PCR[19].
This study is based on previously developed AllGlo multiplex PCR systems for respiratory syncytial virus (RSV), influenza virus (INF), and human metapneumovirus (HMPV). CCD is used to design experiments with different concentrations of primers, probes, DNA polymerase, Mg2+, and dNTPs in uni- and multiplex qPCR systems, based on the experimental results, the interplay of the tested factors and their effects on cycle threshold (Ct) values of uni- and multiplex qPCR are analyzed and discussed by RSM. Subsequently, prediction models for the tested factors and Ct values are constructed via RSM and BPNN-GA respectively. The prediction performance of these two models is then evaluated using the coefficient of determination (R2), the mean absolute error (MAE), and the mean square error (MSE)[20–22]. The model resulting in better prediction performance is further tested for condition optimization. The optimal conditions for uni- and multiplex qPCR of the three viruses are then determined.
Materials and methods
Preparation of qPCR templates
E. coli DH5α samples containing RSV, INF, or HMPV target gene plasmids were cultured separately at 37°C under 5% CO2. A single colony was then transferred with an inoculation needle to Luria-Bertani (LB) culture and shaken overnight at 200 rpm/min at 37°C. Plasmid extraction was performed using a TaKaRa plasmid extraction kit following the manufacturer’s protocol (Takara Biomedical Technology Co., Ltd., Dalian, China, lot number 9760). The extracted plasmids were dissolved in 50 μL of eluent. A Nanodrop 2000 spectrophotometer was used to measure the A value of plasmid DNA at 260 nm/280 nm. Based on the measured copy number, the plasmids for the three viruses were separately diluted, mixed at equal ratios to 104copies/mL, and stored at -20°C for later use. The repeated re-configuration of the template due to insufficient storage and material preparation could lead to biased results[23]. In this study, the amount of template needed was calculated and prepared prior to experiments, and the templates were stored in aliquots to increase test stability.
Primers and probes
The genetic sequences of the three viruses were downloaded from the GenBank database. The primer picking tool and Oligo 6.22 from the NCBI database were used for comparison and optimization. Conservative segments with high homogeneity were selected for primer and probe designs. NCBI Blast was used to test the specificity of the primer and probe segments. Primers were synthesized by Intragen Trading (Shanghai) Co., Ltd., China, and probes were synthesized by Shanghai Yiyue Biotechnology Co., Ltd. (Table 1).
Table 1. Primers and probes used for AllGloqPCR.
Gene IDa | Primes and probes | Sequences(5’-3’) | Size(bp) |
---|---|---|---|
JX131645.1 | RSV-F | GCACCGCCAAGACACTAGAA | 179 |
RSV-R | GTGGTTTGCCGAGGCTATGA | ||
RSV-P | JUP(VIC)- GGA CCT GGG ACA CTC TCA ATC ATC T -JUPb | ||
KC731523.1 | HMPV-F | GGGAGCAAAGCAGAAAGTTTGT | 128 |
HMPV-R | TTGCACAGACACATGCCCTA | ||
HMPV-P | NEP(CY5)-GCT TAT GGA GCT GGT CAA ACA CTG C-NEPb | ||
L25072.1 | INF-F | ACACCATCTGTGTGGGCTAC | 136 |
INF-R | CCGTTCAGACTGCAGAGCTT | ||
INF-P | MAR(FAM)- CTC TAC AGA CAC TGT TGA CAC AGT ACT AG -MARb |
aGenBank.
bJUP, NEP and MAR are three kinds of different AllGlo probe fluorochromes, and correspond to the currently used VIC, CY5 and FAM fluorescence, respectively[24].
Experimental design
In this study, methods for uniplexqPCR are given in supplementary S1 Text. The concentrations of primer (Factor A), probe (Factor B), DNA polymerase (Factor C), Mg2+ (Factor D), and dNTPs (Factor E) were selected as independent variables and subjected to five levels of design using RSM-CCD. According to the recommended concentrations of these variables given by the VazymeLAmp® DNA Polymerase PCR kit (Vazyme Biotech Co., Ltd, Nanjing, China), baseline levels and ranges were confirmed. The coded values and actual values of the selected RSM design factors are provided in supplementary S1 Table. This study used a 50-test-point second-order RSM. The experiments for uni- and multiplex qPCR of the three viruses (RSV, INF, and HMPV) with the same designs; the uniplex qPCR designs are provided in S2 Table, and the multiplex qPCR designs are given in S3 Table.
Quantitative PCR amplification
qPCR was performed in a final volume of 50 μL, which included 5 μL of 10 ×VazymeLAmp® Buffer (Mg2+-free), 0.5 μL of ROX Reference Dye II (50 ×) *3, and 4 μL of mix template. Additionally, MgCl2 (25 mmol/L), dNTP (10 mmol/L), primers (10 μmol/L), probes (10 μmol/L), and LAmpTM DNA Polymerase (5 U/μL) were added in the concentrations provided in S2 and S3 Tables (VazymeLAmp® DNA Polymerase, Vazyme). An ABI 7500 Real-Time PCR System was used for amplification. A two-step method was implemented that included pre-denaturation at 95°C for 30 s, followed by 40 cycles of denaturation at 95°C for 5 s and elongation at 60°C for 32 s. Three parallel repeats were conducted for each test, and the averaged Ct value from the three repeats was taken as the result.
RSM
The averaged Ct value resulting from the three parallel tests was treated as the response value (Y). Design-Expert.V8.0.6 was used for RSM analysis of the test data. The statistical significance of the RSM-based model (model I) was checked by analysis of variance (ANOVA). The variables were treated as continuous random factors. The complete CCD matrix for the experimental Ct values of uni- and multiplex qPCR is described in S2 and S3 Tables. Optimized conditions for Ct values were obtained in combination with 3D response surface graphs (Fig 1 and Fig 2, S1 and S2 Figs).
Fig 1. RSM-3D contour graphs showing groups of two interacting factors for multiplex qPCR of RSV.
(A: primers, B: probes, D: Mg2+, E: dNTPs).a: The effects of interaction between primers and Mg2+ on the Ct value; b: The effects of the interaction between probes and Mg2+ on the Ct value; c: The effects of the interaction between Mg2+ and dNTPs on the Ct value. Graphs illustrate that the smaller Ct value can be obtained when the Mg2+ concentration is at a median level within the test range and the concentrations of primers and probes are higher, but the associated cost should be considered; in this study, the concentrations of these primers and probes are kept below 0.32 mmol/L.
Fig 2. RSM-3D contour graph showing interactions between parameters for multiplex qPCR of INF and HMPV.
(A: primers, B: probe, D: Mg2+, E: dNTPs)a: The effects of the interaction between Mg2+and primers on the Ct value for multiplex qPCR of INF; b: The effects of interaction between primers and Mg2+ on the Ct value for multiplex qPCR of HMPV; c: The effects of the interaction between probes and Mg2+ on the Ct value multiplex qPCR of HMPV; d: The effects of the interaction between Mg2+and dNTPs on the Ct value for multiplex qPCR of HMPV.
The mathematical expression used by model I to describe the response value, Y (Ct value), versus the five factors studied (A, B, C, D, and E) can be written as follows[25]:
(1) |
Y is the response value (Ct value), Xi and Xj denote the coded levels of the independent variables, and β0 is the intercept; i, j, βi, βj, and βij are coefficients, ε is the test error, and k represents the number of independent factors. In the following descriptions, Y1, Y2, and Y3 represent the corresponding multiplex qPCR RSM polynomial equations for RSV, INF, and HMPV, respectively.
BPNN-GA
A 3-layer BPNN-GA comprised of input, hidden layer, and output layers was used to construct a mathematical model (model II), Concentrations of primers, probes, DNA polymerase, Mg2+, and dNTPs were used as the 5 inputs, and the Ct value was the output. Library (neuralnet) of R language was used for analysis, and we scaled Ct values to lie within the range (-)2.378-(+)2.378 in BPNN-GA, while the activation functions were sigmoid for all the models, a five-fold cross-validation was employed to select the number of nodes on the hidden layer and to train the model[26].
Optimization of uni- and multiplex qPCR conditions for the three viruses (RSV, INF, and HMPV) was performed using the trained model obtained using five-fold cross-validation with GA. Optimized conditions were obtained along with the corresponding predictive Ct values.
Model evaluation and validation
The Coefficient of Determination (R2), the mean absolute error (MAE) and the mean square error (MSE) were monitored to evaluate and compare the stabilities and prediction performances of the model I and model II. R2 is applicable to the training data set, and the larger R2 indicates that the more percent of the variance in the response variable can be explained by the explanatory variables[27]. And the smaller the MAE and MSE means the better the model[28]. The model that performed better was selected as a predictive model for condition optimization. Validation was conducted under the predictive optimal operation conditions subsequently.
The equations representing the evaluation indices used for model performance are provided below[27]:
(2) |
(3) |
(4) |
Yi,p is the value predicted by the model, Yi,e is the experimental Ct value, Ya is the averaged experimental Ct value, and n is the number of data points.
Results
RSM
The effects of operating variables were investigated according to the statistical analysis of CCD. According to variance analysis (Table 2), the F and P values of the corresponding multiplex qPCR models for the three viruses were F = 6.61 and P<0.0001, F = 3.67 and P = 0.0009, and F = 7.89 and P<0.0001; the P values were all less than 0.05, indicating that all the RSM models of multiples qPCR were statistically significant; meanwhile, the statistical results on factors showed that there was evident that at least one of the 5 predictors had an effect on the response, such as the statistical results on factors of RSV, the P value of DNA polymerase and Mg2+ were 0.0452 and <0.0001 respectively, demonstrated DNA polymerase and Mg2+were statistically significant for Ct value in multiplex qPCR of RSV, in addition, the P value of primers, probes, and other factors were >0.5, they had no significant effect on Ct value; the statistical results on factors of INF and HMPV showed the same conclusion. Variance analysis of uniplex qPCR was given in S4 Table.
Table 2. Analysis of variance (ANOVA) for the Ct values of multiplex qPCR using RSM-CCD.
Source | Dfb | RSVa | INFa | HMPVa | ||||||
---|---|---|---|---|---|---|---|---|---|---|
SSb | Fb | P | SSb | Fb | P | SSb | Fb | P | ||
Model | 20 | 195.630 | 6.610 | <0.0001*** | 89.860 | 3.670 | 0.0009** | 90.320 | 7.890 | <0.0001*** |
A | 1 |
2.48 | 2.02 | 0.1834 | 3.66 | 2.99 | 0.0949 | 4.30 | 7.52 | 0.0105* |
B | 1 | 2.83 | 3.35 | 0.1561 | 2.16 | 1.76 | 0.1950 | 0.98 | 1.71 | 0.2017 |
C | 1 | 5.86 | 5.89 | 0.0452* | 3.97 | 3.24 | 0.0828 | 0.011 | 0.019 | 0.8904 |
D | 1 | 114.79 | 82.64 | <0.0001*** | 33.38 | 27.23 | <0.0001*** | 51.94 | 90.75 | <0.0001*** |
E | 1 | 0.094 | 0.024 | 0.7922 | 6.08 | 4.96 | 0.0342* | 1.10 | 1.93 | 0.1756 |
AB | 1 | 1.47 | 1.07 | 0.3022 | 6.71 | 5.48 | 0.0266* | 0.15 | 0.26 | 0.6127 |
AC | 1 | 1.92 | 1.45 | 0.2402 | 3.84 | 3.13 | 0.0878 | 4.28 | 7.47 | 0.0107* |
AD | 1 | 0.70 | 0.44 | 0.4736 | 4.38 | 3.58 | 0.0690 | 2.36 | 4.13 | 0.0518 |
AE | 1 | 0.92 | 0.61 | 0.4143 | 1.56 | 1.27 | 0.2687 | 0.12 | 0.21 | 0.6496 |
BC | 1 | 0.17 | 0.34 | 0.7272 | 0.95 | 0.78 | 0.3853 | 0.76 | 1.33 | 0.2578 |
BD | 1 | 1.36 | 1.79 | 0.3214 | 0.70 | 0.57 | 0.4547 | 0.094 | 0.16 | 0.6886 |
BE | 1 | 5.16 | 4.36 | 0.0592 | 10.72 | 8.75 | 0.0062** | 4.489×10−3 | 7.844×10−3 | 0.9301 |
CD | 1 | 0.63 | 0.38 | 0.4968 | 2.52 | 2.06 | 0.1628 | 0.28 | 0.50 | 0.4869 |
CE | 1 | 3.55 | 4.22 | 0.1143 | 1.80 | 1.47 | 0.2361 | 0.063 | 0.11 | 0.7430 |
DE | 1 | 0.15 | 0.042 | 0.7429 | 2.95 | 2.40 | 0.1322 | 1.48 | 2.58 | 0.1196 |
A2 | 1 | 2.808×10−3 | 0.088 | 0.9637 | 0.49 | 0.40 | 0.5328 | 0.061 | 0.11 | 0.7458 |
B2 | 1 | 6.068×10−3 | 0.028 | 0.9467 | 1.405×10−7 | 1.146×10−7 | 0.9997 | 5.920×10−3 | 0.010 | 0.9197 |
C2 | 1 | 1.23 | 2.30 | 0.3460 | 0.94 | 0.77 | 0.3877 | 0.022 | 0.038 | 0.8460 |
D2 | 1 | 49.71 | 21.76 | <0.0001*** | 3.08 | 2.51 | 0.1243 | 21.32 | 37.25 | <0.0001*** |
E2 | 1 | 0.028 | 0.17 | 0.8866 | 0.11 | 0.088 | 0.7695 | 8.105×10−3 | 0.014 | 0.9061 |
Residual | 28 | 37.370 | 34.320 | 16.020 | ||||||
Lack of Fit | 21 | 37.320 | 277.800 | <0.0001*** | 34.220 | 113.830 | <0.0001*** | 15.960 | 84.010 | <0.0001*** |
Pure Error | 7 | 0.0450 | 0.100 | 0.063 | ||||||
Cor Total | 48 | 233.000 | 124.190 | 106.340 | ||||||
Adequate pression | 14.275 | 8.9560 | 14.392 |
*p−value <0.05
**p−value <0.01
***p−value <0.001.
aRSV, HMPV, INF are three virus used in this study.
bDf: Degree of freedom; SS: Sum of Squares; F, F-value.
(5) |
(6) |
(7) |
Eqs (5)–(7) represent the multiplex qPCR polynomial model for the three viruses (RSV, INF and HMPV). Coefficients that had no statistical significance are eliminated. As seen from polynomial equations of multiplex qPCR for three viruses, the Mg2+ all affected significantly the Ct values, thus, the most important factor under the conditions used for these experiments was the concentration of Mg2+.
The RSM-3D plot used for RSV multiplex qPCR is illustrated in Fig 1, and the RSM-3D plot other viruses of multiplex qPCR is set out in Fig 2. The x- and y-axes represent the two interacting factors, and the obtained Ct value is expressed on the z-axis. The color, height, and contour of the curves indicate different response levels. Interacting factors are described as AD, BD, and DE. As can be seen from Figures, Mg2+ is the most important parameter influencing the Ct value. The RSM-3D plots used for uniplexqPCR are shown in S1 and S2 Figs.
BPNN-GA
The fitting error and predictiveerrorof five-fold cross-validation of model II for multiplex qPCR produced by training neurons in the hidden layer are provided in Table 3. The fitting error and predictive error are the bases used to select the number of nodes in the hidden layer. When fitting error and predictive error were relatively small, and took the minimum prediction error as the primary condition, the corresponding number of hidden-layer nodes was treated as the training model. Such as RSV multiplex qPCR, the fitting error had improvement in performance by increasing the number of hidden layer, whereas, the performance of predictive error showed negative effect for 3, 5 neurons, respectively, so the number of neurous was 4 as final model of RSV multiplex qPCR. At the same, the number of nodes selected for INF and HMPV multiplex qPCR was 4, 3, respectively. And the fitting error and predictive error of five-fold cross-validation of model II for uniplex qPCR were given in S5 Table.
Table 3. Five-fold cross-validation of model II for multiplex qPCR.
Neurons | RSVa | INFa | HMPVa | |||
---|---|---|---|---|---|---|
Error of fit () | Error of prediction () | Error of fit () | Error of prediction () | Error of fit () | Error of prediction () | |
1 | 0.099±0.063 | 0.048±0.023 | 0.120±0.098 | 0.075±0.132 | 0.079±0.12 | 0.057±0.082 |
2 | 0.056±0.058 | 0.055±0.023 | 0.077±0.089 | 0.079±0.077 | 0.040±0.078 | 0.022±0.069 |
3 | 0.035±0.035 | 0.039±0.023 | 0.056±0.072 | 0.043±0.080 | 0.023±0.055 | 0.016±0.044 |
4 | 0.042±0.049 | 0.034±0.041 | 0.051±0.079 | 0.042±0.059 | 0.022±0.024 | 0.020±0.052 |
5 | 0.031±0.050 | 0.040±0.049 | 0.046±0.064 | 0.046±0.06 | 0.024±0.058 | 0.016±0.049 |
aRSV, HMPV, INF are three virus used in this study.
RSM versus BPNN-GA
In this study, RSM and BPNN-GA were used for data analysis. Corresponding mathematical models were constructed and referred to as model I (RSM) and model II (BPNN-GA).
The R2 values for model II were closer to 1 than for model I in both uni- and multiplex PCR for all three viruses, The MAE and MSE of model II was much smaller than those of model Ifor both uni- and multiplex qPCR of the three viruses (Table 4, S6 Table). Though R2 of model II are all larger than RSM, compared with RSM, ANN's predictive Ct values are more deviated from the actual situation, especially in HMPV and INF for multiplex PCR (Table 5, S7 Table), therefore, we believe that in the PCR system optimization, RSM is more appropriate than ANN. Hence, model I was selected as the final prediction model.
Table 4. Performance numbers of BPNN-GA versus RSM models for multiplex qPCR.
parameter | RSVa | INFa | HMPVa | |||
---|---|---|---|---|---|---|
Model I | Model II | Model I | Model II | Model I | Model II | |
R2 | 0.847 | 0.980 | 0.746 | 0.940 | 0.864 | 0.976 |
MAEb | 0.633 | 0.0007 | 0.613 | 0.0009 | 0.483 | 0.0003 |
MSEb | 0.762 | 0.0015 | 0.700 | 0.0021 | 0.327 | 0.0008 |
aRSV, HMPV, INF are three virus used in this study.
bMAE means the mean absolute error; MSE means the mean square error.
Table 5. Optimization conditions and predictive Ct value of model I and model II for multiplex qPCR.
Factors | RSVb | INFb | HMPVb | |||
---|---|---|---|---|---|---|
Model I | Model II | Model I | Model II | Model I | Model II | |
Aa(μM) | 0.290 | 0.319 | 0.230 | 0.319 | 0.320 | 0.164 |
Ba(μM) | 0.310 | 0.319 | 0.260 | 0.319 | 0.320 | 0.320 |
Ca(Uc/μL) | 0.040 | 0.081 | 0.050 | 0.319 | 0.030 | 0.081 |
Da(mM) | 2.550 | 0.172 | 2.350 | 0.319 | 0.800 | 0.195 |
Ea(mM) | 0.160 | 0.319 | 0.090 | 0.144 | 0.080 | 0.082 |
Predictive Ct value | 21.485 | 8.3974 | 19.908 | 17.4232 | 24.362 | -9.5411 |
aA: primers, B: probes, C: DNA polymerase, D: Mg2+, E: dNTPs.
bRSV, HMPV, INF are three virus used in this study.
cU: active unit of enzyme.
Validation
After comparing the stabilities and prediction performances of models I and II, model I was selected for condition optimization. The optimized conditions were subjected to validation tests, three parallel repeats were performed and relative errors (shown as Eq ((8)) between validated Ct values and predictive Ct values for multiplex qPCR for RSV, INF and HMPV are 3.351%, 13.744%, 4.550% respectively. The relative errors of the validated Ct values of uniplex qPCR for RSV, INF and HMPV are 6.748%, 1.520%, 4.590% respectively. The validation results indicate a greater relative error for multiplex qPCR of INF; the error rates for multiplex qPCR of the other two viruses were within an acceptable range.
(8) |
Relative Error: Actual relative error, generally given as a percentage; Ctp: predictive Ct value under optimal conditions; Cte: experimental Ct value under optimal conditions
Discussion
As a detection technique, qPCR has significant advantages[1]. However, the number of factors influencing the process is relatively high, and the interactions between these factors are complicated, especially in multiplex qPCR systems. Due to the addition of multiple pairs of primers and probes, changes to factors such as annealing temperature, elongation temperature, and number of cycles can result in non-specific products and different effects on amplification, which leads to difficulties in comparing results of qPCR.
Considering the important effects of different primers on uni- and multiplex qPCR[29], we considered the primer melting temperature (Tm) and GC percentage as well as the homogeneity of the primers and the target nucleic acid sequence when designing primers. Having close Tmvalues can improve the amplification of product in multiplex qPCR[30–32]. The uni- and multiplex qPCR mathematical models built using RSM for the three viruses produced statistically significant results. Additionally, adequate precision measures the signal to noise ratio, and this ratio greater than 4 is desirable. The ratios for the models were all greater than 4 (Table 2, S4 Table), indicating models are sufficiently precisionand can be used to guide experiments[33].
As seen from Polynomial equations of uni- and multiplex qPCR for three viruses, the Mg2+ all significantly affected the Ct value. Meanwhile, Fig 1 also indicate magnesium ion is the most important parameter influencing the Ct value. Thus, the most important factor under the conditions used for these experiments is the concentration of magnesium ion.
To our knowledge[34], dNTP and DNA polymerase are competing reagents in PCR systems. Free magnesium is necessary for DNA polymerase to exert its biological activity; the formation of bonds between dNTP and DNA also requires magnesium ion. Furthermore, in multiplex PCR with a constant dNTP concentration, the amplification product increases as the concentration of Mg2+ increases, and non-specific bands are eliminated. However, when an excessive concentration of Mg2+ is used, amplification slows down and may even stop, which is speculated to result from a suppression effect[35].
Multiple factors affecting multiplex PCR amplification are consistent with those observed in this study. Mg2+ significantly influenced the formation of the PCR amplification product; at appropriate concentrations, positive effects on the amount of PCR product were observed, but suppression occurred at excessively high concentrations. Under the experimental conditions used, changes in dNTP concentration did not significantly affect PCR amplification, which could be due to the range of dNTP concentration was too narrow in this study.
Interestingly, Mg2+ had a positive effect on uniplex PCR but a negative effect on multiplex qPCR of RSV in this study, which are concordant with prior work[32], DNase concentration differentially affects the amplification of uni- and multiplex PCR; an increase in DNase concentration in uniplex PCR leads to an increase in non-specific product but positively affects multiplex PCR amplification.
Nonetheless, model I was relatively better than model II under the experimental conditions used in this study, its prediction function was still unsatisfactory. This could be related to the limitations of RSM, such as the ability to only build quadratic polynomial functions[36] and the inability to portray the relationship between multiplex qPCR factors. Although model II was better at solving non-linear functions[37], it produced over-fittings[28].
As a result, model I was selected for condition optimization. However, in multiplex qPCR, unlike the concentrations of template-specific primers and probes, the same concentrations must be used for DNA polymerase, Mg2+, and dNTPs. Therefore, further adjustments to multiplex qPCR conditions based on parameter optimization results are necessary. For multiplex qPCR, combined with RSM-3D plots, the smaller Ct values were obtained at greater concentrations of primers and probes. However, considering the increase in non-specificity when primer and probe concentrations are too high and the differences in the optimized conditions for the primers used for different viruses[32], the primers and probes concentrations suggested by the RSM models for each virus were determined to be the optimized condition for each virus. DNA polymerase and Mg2+ behaved similarly within a median range of concentrations (0.04–0.08 mmol/L and 2.0–2.5 μmol/L respectively), which could be sufficient for all three viruses. Considering the increase in non-specificity triggered by a high concentration of Mg2+[29], the concentration used was set at 2.35 mmol/L. dNTPs did not significantly influence the response value. The final and cost-effective optimal conditions to be used for uni- and multiplex qPCR are given in Table 6 and S8 Table.
Table 6. Optimal conditions for multiplex qPCR.
Factors | RSVb | INFb | HMPVb |
---|---|---|---|
Aa(μmol/L) | 0.290 | 0.230 | 0.320 |
Ba(μmol/L) | 0.310 | 0.260 | 0.320 |
Ca(Uc/μL) | 0.050 | 0.050 | 0.050 |
Da(mmol/L) | 2.350 | 2.350 | 2.350 |
Ea(mmol/L) | 0.160 | 0.160 | 0.160 |
aA: primers, B: probes, C: DNA polymerase, D: Mg2+, E: dNTPs.
bRSV, HMPV, INF are three virus used in this study.
cU:active unit of enzyme.
Conclusions
Optimization conditions of uni- and multiplex qPCR were predicted by RSM. By operating designed experiments, the effects between different variations were investigated. Furthermore, two mathematical models were constructed via RSM and BPNN-GA. The statistical results of RSM indicated that Mg2+ was the most important factor in this study. And interestingly, the same factor played different role in different reaction system. Though the R2of GA-BPNNs are all larger than RSM, the predictive Ct value of qPCR is too different from the actual value, therefore, we believe that in the PCR system optimization, RSM is more appropriate than ANN, perhaps because its internal relationship, so RSM was performed better in modeling both uni- and multiplex qPCR.
Supporting information
(A: primers, B: probe, D: Mg2+, E: dNTPs) a: The effects of interaction between primers and Mg2+ on the Ct value; b: The effects of the interaction between probes and Mg2+on the Ct value; c: The effects of the interaction between Mg2+and dNTPs on the Ct value.
(TIF)
(A: primers, B: probe, C: DNA polymerase, D: Mg2+)a: The effects of interaction between primers and Mg2+ on the Ct value for uniplex qPCR of INF; b: The effects of the interaction between probes and Mg2+ on the Ct value for uniplex qPCR of INF; c: The effects of the interaction between Mg2+and DNA polymerase on the Ct value for uniplex qPCR of INF; d: The effects of the interaction between Mg2+and DNA polymerase on the Ct value for uniplex qPCR of HMPV.
(TIF)
(PDF)
(PDF)
(PDF)
(PDF)
(PDF)
(PDF)
(PDF)
(PDF)
(PDF)
Data Availability
All relevant data are within the paper and its Supporting Information files.
Funding Statement
This work was supported by Natural Science Foundation of Zhejiang Province (LY17H190001) grant to DJY, http://www.zjnsf.gov.cn/, Health and Family Planning Commission of Zhejiang Province (2015ZDA025) grant to DJY, http://www.zjwjw.gov.cn/, and Health and Family Planning Commission of Hangzhou Manicipality grant (2018ZD001) to DJY, http://wsjsw.hangzhou.gov.cn/. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Denipitiya DT, Chandrasekharan NV, Abeyewickreme W, Hartskeerl CM, Hartskeerl RA, Jiffrey AM, et al. Application of a real time Polymerase Chain Reaction (PCR) assay for the early diagnosis of human leptospirosis in Sri Lanka. Biologicals: journal of the International Association of Biological Standardization. 2016;44(6):497–502. [DOI] [PubMed] [Google Scholar]
- 2.Klein D. Quantification using real-time PCR technology: applications and limitations. Trends in molecular medicine. 2002;8(6):257–60. [DOI] [PubMed] [Google Scholar]
- 3.Dubljanin E, Colovic Calovski I, Vujcic I, Dzamic A, Arendrup MC, Petersen RF, et al. Clinical evaluation of a T. rubrum-specific polymerase chain reaction and pandermatophyte polymerase chain reaction in the diagnosis of suspected onychomycosis in 183 Serbian patients. The British journal of dermatology. 2014;171(6):1593–5. 10.1111/bjd.13168 [DOI] [PubMed] [Google Scholar]
- 4.Jartti T, Soderlund-Venermo M, Hedman K, Ruuskanen O, Makela MJ. New molecular virus detection methods and their clinical value in lower respiratory tract infections in children. Paediatric respiratory reviews. 2013;14(1):38–45. 10.1016/j.prrv.2012.04.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Kim JY, Lee JL. Multipurpose assessment for the quantification of Vibrio spp. and total bacteria in fish and seawater using multiplex real-time polymerase chain reaction. Journal of the science of food and agriculture. 2014;94(13):2807–17. 10.1002/jsfa.6699 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Johnson G, Nour AA, Nolan T, Huggett J, Bustin S. Minimum information necessary for quantitative real-time PCR experiments. Methods in molecular biology. 2014;1160:5–17. 10.1007/978-1-4939-0733-5_2 [DOI] [PubMed] [Google Scholar]
- 7.Rychlik W, Spencer WJ, Rhoads RE. Optimization of the annealing temperature for DNA amplification in vitro. Nucleic acids research. 1990;18(21):6409–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Wadle S, Lehnert M, Rubenwolf S, Zengerle R, von Stetten F. Real-time PCR probe optimization using design of experiments approach. Biomolecular detection and quantification. 2016;7:1–8. 10.1016/j.bdq.2015.12.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Noda AA, Rodriguez I, Mondeja B, Fernandeze C. Design, optimization and evaluation of a polymerase chain reaction for detection of borrelia spp. Advances in clinical and experimental medicine: official organ Wroclaw Medical University. 2013;22(5):639–53. [PubMed] [Google Scholar]
- 10.Bezerra MA, Santelli RE, Oliveira EP, Villar LS, Escaleira LA. Response surface methodology (RSM) as a tool for optimization in analytical chemistry. Talanta. 2008;76(5):965–77. 10.1016/j.talanta.2008.05.019 [DOI] [PubMed] [Google Scholar]
- 11.Hwang SH, Kwon SH, Wang Z, Kim TH, Kang YH, Lee JY, et al. Optimization of extraction parameters of PTP1beta (protein tyrosine phosphatase 1beta), inhibitory polyphenols, and anthocyanins from Zea mays L. using response surface methodology (RSM). BMC complementary and alternative medicine. 2016;16(1):317 10.1186/s12906-016-1296-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Hallenbeck PC, Grogger M, Mraz M, Veverka D. The use of Design of Experiments and Response Surface Methodology to optimize biomass and lipid production by the oleaginous marine green alga, Nannochloropsis gaditana in response to light intensity, inoculum size and CO2. Bioresource technology. 2015;184:161–8. 10.1016/j.biortech.2014.09.022 [DOI] [PubMed] [Google Scholar]
- 13.Lim CW, Kim BH, Kim IH, Lee MW. Modeling and optimization of phospholipase A(1)-catalyzed hydrolysis of phosphatidylcholine using response surface methodology for lysophosphatidylcholine production. Biotechnology progress. 2015;31(1):35–41. 10.1002/btpr.2009 [DOI] [PubMed] [Google Scholar]
- 14.Taylor MJ, Haralambieva IH, Vierkant RA, Ovsyannikova IG, Poland GA. Response surface methodology to determine optimal measles-specific cytokine responses in human peripheral blood mononuclear cells. Journal of immunological methods. 2012;382(1–2):220–3. 10.1016/j.jim.2012.06.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Ayer T, Chen Q, Burnside ES. Artificial neural networks in mammography interpretation and diagnostic decision making. Computational and mathematical methods in medicine. 2013;2013:832509 10.1155/2013/832509 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Ebrahimzadeh H, Tavassoli N, Sadeghi O, Amini MM. Optimization of solid-phase extraction using artificial neural networks and response surface methodology in combination with experimental design for determination of gold by atomic absorption spectrometry in industrial wastewater samples. Talanta. 2012;97:211–7. 10.1016/j.talanta.2012.04.019 [DOI] [PubMed] [Google Scholar]
- 17.Mohamed MS, Tan JS, Mohamad R, Mokhtar MN, Ariff AB. Comparative analyses of response surface methodology and artificial neural network on medium optimization for Tetraselmis sp. FTC209 grown under mixotrophic condition. The Scientific World Journal. 2013;2013:948940 10.1155/2013/948940 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Pan H, Zhang Q, Cui K, Chen G, Liu X, Wang L. Optimization of extraction of linarin from Flos chrysanthemi indici by response surface methodology and artificial neural network. 2017;40(9):2062–70. 10.1002/jssc.201601259 [DOI] [PubMed] [Google Scholar]
- 19.Li Y, Du X, Yuan Q, Lv X. Development and validation of a new PCR optimization method by combining experimental design and artificial neural network. Applied biochemistry and biotechnology. 2010;160(1):269–79. 10.1007/s12010-009-8581-4 [DOI] [PubMed] [Google Scholar]
- 20.Buzaev IV, Plechev VV, Nikolaeva IE, Galimova RM. Artificial intelligence: Neural network model as the multidisciplinary team member in clinical decision support to avoid medical mistakes. Chronic diseases and translational medicine. 2016;2(3):166–72. 10.1016/j.cdtm.2016.09.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Dil EA, Ghaedi M, Asfaram A, Mehrabi F, Bazrafshan AA, Ghaedi AM. Trace determination of safranin O dye using ultrasound assisted dispersive solid-phase micro extraction: Artificial neural network-genetic algorithm and response surface methodology. Ultrasonics sonochemistry. 2016;33:129–40. 10.1016/j.ultsonch.2016.04.031 [DOI] [PubMed] [Google Scholar]
- 22.Sarve A, Sonawane SS, Varma MN. Ultrasound assisted biodiesel production from sesame (Sesamum indicum L.) oil using barium hydroxide as a heterogeneous catalyst: Comparative assessment of prediction abilities between response surface methodology (RSM) and artificial neural network (ANN). Ultrasonics sonochemistry. 2015;26:218–28. 10.1016/j.ultsonch.2015.01.013 [DOI] [PubMed] [Google Scholar]
- 23.Bustin SA, Benes V, Garson JA, Hellemans J, Huggett J, Kubista M, et al. The MIQE guidelines: minimum information for publication of quantitative real-time PCR experiments. Clinical chemistry. 2009;55(4):611–22. 10.1373/clinchem.2008.112797 [DOI] [PubMed] [Google Scholar]
- 24.Chen Y, He H, Pan P, He S, Dong X, Chen Y, et al. Rapid and combined detection of Mycoplasma pneumoniae, Epstein-Barr virus and human cytomegalovirus using AllGlo quadruplex quantitative PCR.J Med Microbiol.2016;65(7):590–5. 10.1099/jmm.0.000266 [DOI] [PubMed] [Google Scholar]
- 25.Giordano PC, Martinez HD, Iglesias AA, Beccaria AJ, Goicoechea HC. Application of response surface methodology and artificial neural networks for optimization of recombinant Oryza sativa non-symbiotic hemoglobin 1 production by Escherichia coli in medium containing byproduct glycerol. Bioresource technology. 2010;101(19):7537–44. [DOI] [PubMed] [Google Scholar]
- 26.Arab MM, Yadollahi A, Shojaeiyan A, Ahmadi H. Artificial Neural Network Genetic Algorithm As Powerful Tool to Predict and Optimize In vitro Proliferation Mineral Medium for G × N15 Rootstock. Frontiers in Plant Science. 2016;7:1526 10.3389/fpls.2016.01526 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Ross T. Indices for performance evaluation of predictive models in food microbiology. The Journal of applied bacteriology. 1996;81(5):501–8. [DOI] [PubMed] [Google Scholar]
- 28.Coelho S F., da Conceição Saraiva T.L., Dumont de Lima e Silva G., Teles de Faria J., Grijó Pitangui C., Aliakbarian B., Perego P., Converti A. Batch growth of Kluyveromyces lactis cells from deproteinized whey: Response surface methodology versus Artificial neural network—Genetic algorithm approach. Biochemical Engineering Journal.2016;109:305–311. [Google Scholar]
- 29.Qu W, Shen Z, Zhao D, Yang Y, Zhang C. MFEprimer: multiple factor evaluation of the specificity of PCR primers. Bioinformatics (Oxford, England). 2009;25(2):276–8. [DOI] [PubMed] [Google Scholar]
- 30.Elnifro EM, Ashshi AM, Cooper RJ, Klapper PE. Multiplex PCR: optimization and application in diagnostic virology. Clinical microbiology reviews. 2000;13(4):559–70. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Chavali S, Mahajan A, Tabassum R, Maiti S, Bharadwaj D. Oligonucleotide properties determination and primer designing: a critical examination of predictions. Bioinformatics (Oxford, England). 2005;21(20):3918–25. [DOI] [PubMed] [Google Scholar]
- 32.Schoske R, Vallone PM, Ruitberg CM, Butler JM. Multiplex PCR design strategy used for the simultaneous amplification of 10 Y chromosome short tandem repeat (STR) loci. Analytical and bioanalytical chemistry. 2003;375(3):333–43. 10.1007/s00216-002-1683-2 [DOI] [PubMed] [Google Scholar]
- 33.Hamouda HI, Nassar HN, Madian HR, Abu Amr SS, El-Gendy N. Response Surface Optimization of Bioethanol Production from Sugarcane Molasses by Pichia veronae Strain HSC-22. Biotechnology research international. 2015;2015:905792 10.1155/2015/905792 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Vremera T, Iancu LS, Logigan C, Nastase E, Miftode E, Lunca C, et al. Optimization of triplex real time PCR for detecting Staphylococcus aureus mecA, pvl and nuc genes. Roumanian archives of microbiology and immunology. 2011;70(2):69–73. [PubMed] [Google Scholar]
- 35.Henegariu O, Heerema NA, Dlouhy SR, Vance GH, Vogt PH. Multiplex PCR: critical parameters and step-by-step protocol. BioTechniques. 1997;23(3):504–11. [DOI] [PubMed] [Google Scholar]
- 36.Shanmugaprakash M, Sivakumar V. Development of experimental design approach and ANN-based models for determination of Cr(VI) ions uptake rate from aqueous solution onto the solid biodiesel waste residue. Bioresource technology. 2013;148:550–9. 10.1016/j.biortech.2013.08.149 [DOI] [PubMed] [Google Scholar]
- 37.Li Y, Abbaspour MR, Grootendorst PV, Rauth AM, Wu XY. Optimization of controlled release nanoparticle formulation of verapamil hydrochloride using artificial neural networks with genetic algorithm and response surface methodology. European journal of pharmaceutics and biopharmaceutics: official journal of Arbeitsgemeinschaft fur Pharmazeutische Verfahrenstechnik eV. 2015;94:170–9. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
(A: primers, B: probe, D: Mg2+, E: dNTPs) a: The effects of interaction between primers and Mg2+ on the Ct value; b: The effects of the interaction between probes and Mg2+on the Ct value; c: The effects of the interaction between Mg2+and dNTPs on the Ct value.
(TIF)
(A: primers, B: probe, C: DNA polymerase, D: Mg2+)a: The effects of interaction between primers and Mg2+ on the Ct value for uniplex qPCR of INF; b: The effects of the interaction between probes and Mg2+ on the Ct value for uniplex qPCR of INF; c: The effects of the interaction between Mg2+and DNA polymerase on the Ct value for uniplex qPCR of INF; d: The effects of the interaction between Mg2+and DNA polymerase on the Ct value for uniplex qPCR of HMPV.
(TIF)
(PDF)
(PDF)
(PDF)
(PDF)
(PDF)
(PDF)
(PDF)
(PDF)
(PDF)
Data Availability Statement
All relevant data are within the paper and its Supporting Information files.