Skip to main content
Biomolecular Detection and Quantification logoLink to Biomolecular Detection and Quantification
. 2018 Oct 16;16:1–4. doi: 10.1016/j.bdq.2018.08.001

Algorithms for automated detection of hook effect-bearing amplification curves

Michał Burdukiewicz a, Andrej-Nikolai Spiess b, Konstantin A Blagodatskikh c, Werner Lehmann d, Peter Schierack e, Stefan Rödiger e,
PMCID: PMC6287529  PMID: 30560061

Abstract

Amplification curves from quantitative Real-Time PCR experiments typically exhibit a sigmoidal shape. They can roughly be divided into a ground or baseline phase, an exponential amplification phase, a linear phase and finally a plateau phase, where in the latter, the PCR product concentration no longer increases. Nevertheless, in some cases the plateau phase displays a negative trend, e.g. in hydrolysis probe assays. This cycle-to-cycle fluorescence decrease is commonly referred to in the literature as the hook effect. Other detection chemistries also exhibit this negative trend, however the underlying molecular mechanisms are different.

In this study we present two approaches to automatically detect hook effect-like curvatures based on linear (hookreg) and nonlinear regression (hookregNL). As the hook effect is typical for qPCR data, both algorithms can be employed for the automated identification of regular structured qPCR curves. Therefore, our algorithms streamline quality control, but can also be used for assay optimization or machine learning.

Keywords: Hook effect, QPCR, TaqMan, EvaGreen, SybrGreen, Linear, Nonlinear, RDML, Automation

1. Introduction

Key elements of any PCR assay are the primers since they control the sensitivity and specificity of the reaction [1]. Not less important is a stable binding of probes to the amplicon for the generation of a meaningful amplification curve signal in quantitative real-time PCR (qPCR). For hybridization probes, a phenomenon observed in late cycles is the competition between amplicon strands and the probes, which may reduce the fluorescent signal considerably [2]. This so-called hook effect is often observed at high template concentrations that are typical in later cycles [3], and where the single strands of the amplicons re-anneal faster than the probes with the amplicons. For detection chemistries other than hybridization probes, such as hydrolysis probes, hook effects have not been described. Other reasons such as the nuclease activity of the polymerase may also contribute to the decreasing probe signal [4], however it is more likely that this results in an earlier plateau phase.

Several experimental conditions can be adjusted to minimize the hook effect in qPCR, such as optimizing the DNA template, probe or MgCl2 concentrations, reducing the cycle number or conducting asymmetric PCR [3]. Although the decrease in fluorescence does not affect the efficiency or specificity of amplification and target detection [3], a low fluorescence signal can entail limitations to the sensitivity of the assay, including melting curve and data analysis. For example, it was reported that the hook effect is challenging genotyping and forensic applications [2], [5], [6].

Although the hook effect has been reported mainly for HybProbes and other specific detection chemistries [4], [7], interestingly, it can also be encountered in amplification reactions with intercalating dye chemistries (Fig. 1A, D) as well as hydrolysis probes (TaqMan™; Fig. 1E, F). For the latter, hook effects are evident in the non-baselined raw data (data not shown). Here, the mechanistic basis for decreasing fluorescence in late cycles are hitherto unknown, however photobleaching or requenching may pose possibilities.

Fig. 1.

Fig. 1

Overview of the data included in hookreg.rdml. The amplification curves exhibit different curvatures, including sigmoidal shapes, curves with hook effect-like shapes and negative qPCR reactions. For fluorescence readout, EvaGreen, SYTO-13 and TaqMan probes were employed. Black amplification curves display no hook effect, while red amplification curves do, as identified by our analysis pipeline. (A) A01-A12: SYTO-13, (B) B01-D08: hydrolysis probes, (C) D09-F08: SybrGreen I, (D) F09-F10: EvaGreen, (E) F11-H02: hydrolysis probes, (F) H03-H12: hydrolysis probes. Single plots for all amplification experiments are shown in Supplemental Fig. 1. RFU, relative fluorescence units.

On the software side, Nolan et al. [8] clearly demonstrated that a wrong baseline setting can lead to up- or downward sloping baseline estimates, resulting in the plateaus sloping to the opposite direction. Here, it should be noted that most qPCR systems apply a fitted trendline to a number of early cycles in the fluorescence baseline.

Algorithms to detect hook effect-like curve structures in the tail region can be employed for quality control. For example, a deviation from a sigmoidal model can be indicative for a failed experiment, and these amplification curves should undergo a more optimized analysis procedure or even be excluded from further analysis. A plethora of software packages for the analysis of qPCR data have been published [9], but none of these appear to describe algorithms that can be used to detect hook effects. This study revised existing software and demonstrates two approaches to detect hook effects in qPCR curvatures. One algorithm is based on maximum peak finding and linear regression (hookreg), while the other is based on fitting a non-linear six-parameter model to acquire a coefficient for the slope of the plateau phase (hookregNL).

2. Materials and methods

The following algorithmic steps describe both the hookreg() and hookregNL() methods that are applied to raw fluorescence data without baseline correction. If done correctly, these are also applicable to baseline-corrected values. The input values xi and yi stand for the cycles and cycle dependent fluorescence intensity, respectively. The return values are the slope estimate and its p-value, the slope's confidence interval and a logical decision if a hook-effect is present. hookregNL() uses core functionality from the qpcR package [10], [11]. Both algorithms are implemented in the PCRedux package v. 0.2.6-1 (https://cran.r-project.org/package=PCRedux) of the R statistical computing language (https://www.r-project.org).

hookreg():

  • Find x0 = cycle at max(yi).

  • Fit a linear model yi = β0 + β1xi + ε from x0 to xn, where n = length of x (number of cycles), when at least five cycles can be employed.

  • Calculate the p-value for estimated slope βˆ1 as well as its 99.75% confidence interval CI.

  • If p < 0.0025 (one-sided t-test), then a hook-effect is defined as being present, because the slope of βˆ1 is significantly different from 0 (no slope) and negative. If both CI0.125% and CI99.875% bounds are negative, we can be 99.75% certain that 0 (no slope) is not included in the CI of βˆ1.

A visual representation of the algorithm is shown in Fig. 2.

Fig. 2.

Fig. 2

Hook effect analysis of amplification curves from different samples. The amplification plots are shown without background subtraction. The samples were taken from the hookreg.rdml data set (see Supplemental Files). (A) Overview of the amplification curves. Black: Amplification curve with a hook effect-like curvature. Red: Amplification curve with a standard plateau. Green: Negative amplification reaction. (B) All values from x0 to xn in the ROI (green) are used for ordinary linear regression. Presence of a hook effect was tested by the slope of the fit (p < 0.005) or a negative 99.5% confidence interval. (C) Amplification curves with no hook effect, no meaningful regression results from the lack of at least 5 cycles after x0. (D) Negative amplification curves get discarded by a logical decision. RFU, relative fluorescence units; normalized RFU, relative fluorescence units normalized to the 99th percentile. β0, intercept; β1, slope; P, p-value; CI, confidence interval; NA, missing value.

hookregNL():

  • Remove the first 5 (or more) cycles x1, x2 … x5 from the data to minimize slope effects in the baseline region.

  • Fit a six-parameter log-logistic model yi = c + kxi + (d − c)/(1 + exp(b(log(xi) − log(e))))f by nonlinear least-squares. This model has the following parameters: c: lower asymptote (baseline); d: upper asymptote (plateau); k: linear slope; b: sigmoidal slope; e: point-of-inflection (if f = 1); f: asymmetry around point-of-inflection.

  • Calculate the 99.75% confidence interval CI for the estimated slope parameter kˆ.

  • If both CI0.125% and CI99.875% bounds are negative, then a hook-effect is defined as being present, because a slope of 0 (no slope) is not included in CI.

Confidence intervals instead of slope estimates for both functions are employed to deliver significance to the plateau phase's fluorescence decrease.

For algorithm evaluation, we compiled a data set consisting of sigmoidal qPCR curves with no hook effect, non-sigmoidal curves (negative control), curves with no plateau phase, curves with a slight negative trend (hook effect-like) and curves with pronounced hook effect (see also Supplement Fig. 1, Supplement Table 1). The raw data were compiled from the boggy data set [12], the testdat data set [11], the C127EGHP data set [13], a whole genome amplification experiment [14] and from an in-house BRCA1 gene quantification experiment. The raw data were rated by the humanrater() function either as hook effect-bearing (“y”) or not (“n”), as described in the Supplementary Information of [13]. As reproducible research is an important aspect in science [15], [16], the amplification curve data (Fig. 1) were combined and made available as an RDML file [17] that was used for all analyses.

3. Results and discussion

In general, the presence of hook effects should be checked during the analysis of qPCR experiments. For example, when fitting models that utilize all data points, the Cq value is highly influenced by the magnitude of the plateau phase [18], which will be estimated in a four-parameter sigmoidal model approximately as the mean of all plateau phase values. Hence, the stronger the hook effect, the lower the plateau's mean. In contrast, methods that do not include the hook region fluorescence values and fit only the exponential region [19], [20], [21] or are parameter-free (e.g. splines), will not be influenced by the presence of a downward-sloped plateau. Using both algorithms in combination for quality control is feasible for the identification of regular structured qPCR curves prior to Cq value and qPCR efficiency estimation. In the presence of a hook-like plateau, the user is encouraged to check that (i) the baseline setting is correct, (ii) the assay is optimized and (iii) a quantitation algorithm independent of plateau phase values is employed.

In this work, we introduce the hookreg() and hookregNL() algorithms to identify hook effect-bearing qPCR amplification curves. The first, hookreg(), is based on standard linear regression and can be easily implemented in less sophisticated statistical software packages or spread-sheet applications. The second, hookregNL(), is based on the fitting of a six-parameter sigmoidal model, available in statistical software packages such as qpcR for the R statistical computing language [10], [11], but also definable in the “Nonlinear Regression” menu of many softwares.

Both algorithms are implemented in the PCRedux package, and their output provides the fit parameters as well as corresponding p-values, the cycle when the hook presumably starts (hookreg()), the confidence intervals, and finally the decision about the presence or absence of a hook effect (Supplement Section 4). hookreg() makes some assumptions about the data, e.g. at least five additional data points (5 cycles) from the maximum are required for a linear fit, so that curves with less points will be ignored. This may lead to false negative results. The hookregNL() algorithm makes the assumption that a sigmoid model can be fitted. A few cycles (by default 5, but extendable) of the baseline region need to be removed prior to fitting to avoid false positives from a downward sloping baseline (Fig. 1). Both algorithms were compared and gauged against a human classification of the hookreg.rdml data set and displayed different performances, resulting in false negative and false positive decisions in some cases (Supplement Table 2 and Supplement Table 3).

Interestingly, both algorithms complemented each other and complete agreement was observed (Table 1) when they were combined by a logical statement (see Supplement Section 5). Essentially, both algorithms aim to detect negative trends (hook effect or hook effect-like curvatures) in the tail region. In the hookreg.rdml data set we achieved 100% sensitivity and 97% specificity with the combined approach. In addition, we observed cases where the negative trend was not immediately evident to the human rater, but the statistical estimates obtained from hookreg() and hookregNL() indicated a significant downward slope. Since both the hookreg() and hookregNL() functions report the steepness of the slope (Supplement Tables 2 and 3), users can decide if the data have a hook effect-like curvature. Corroborating their high performance, we observed no false negative classification by both algorithms within the hookreg.rdml data set.

Table 1.

Analysis of the performance of both algorithms. The performance measures were determined with the performance() function of the PCRedux package. See Supplement Section 5.

hookreg hookregNL combined
Sensitivity 0.90 0.33 1.00
Specificity 0.97 1.00 0.97
False positive rate 0.03 0.00 0.03
False negative rate 0.10 0.67 0.00
Accuracy 0.96 0.85 0.98
True positive 19 7 21
True negative 73 75 73
False positive 2 0 2
False negative 2 14 0

Finally, a graphical user interface for both algorithms was introduced to the rdmlEdit() function of the RDML R package [17], which is available as a web server or can be deployed locally (see Supplement Section 6). This graphical user interface enables testing qPCR curves for hook effects and marks these, depending on the result.

Supplemental files

Funding

This work was funded by the Federal Ministry of Education and Research (BMBF) InnoProfile-Transfer-Project 03IPT611X and in part by the Gesundheitscampus Brandenburg “digilog: Digitale und analoge Begleiter für eine alternde Bevölkerung”, Brandenburg Ministry for Science, Research and Culture (MWFK).

Conflicts of interest

Werner Lehmann is a shareholder and employee of Attomol GmbH. The other authors have no conflicts of interest to declare.

Acknowledgments

We thank Franziska Dinter (Brandenburg University of Technology Cottbus–Senftenberg) for reevaluation of the amplification curve data and Maria Tokarenko (Evrogen JSC) for wet lab experiments conduction.

Footnotes

Appendix A

Supplementary data associated with this article can be found, in the online version, at https://doi.org/10.1016/j.bdq.2018.08.001.

Appendix A. Supplementary data

The following are the supplementary data to this article:

mmc1.pdf (511.7KB, pdf)

References

  • 1.Bustin S., Huggett J. qPCR primer design revisited. Biomolecular Detection and Quantification. 2017;14:19–28. doi: 10.1016/j.bdq.2017.11.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Barratt K., Mackay J.F. Improving real-time PCR genotyping assays by asymmetric amplification. J. Clin. Microbiol. 2002;40(4):1571–1572. doi: 10.1128/JCM.40.4.1571-1572.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Isaac P.G. Essentials of nucleic acid analysis: a robust approach. Ann. Bot. 2009;104(2):vi. [Google Scholar]
  • 4.Mackay I.M. Horizon Scientific Press; 2007. Real-time PCR in Microbiology: From Diagnosis to Characterization. google-Books-ID: WKs13RhcEJAC. [Google Scholar]
  • 5.Martinez-Serra J., Robles J., Nicolas A., Gutierrez A., Ros T., Amat J.C., Alemany R., Noguera A., Besalduch J., Abello A., Vogler O. Fluorescence resonance energy transfer-based real-time polymerase chain reaction method without DNA extraction for the genotyping of F5, F2, F12, MTHFR, and HFE. J. Blood Med. 2014;99 doi: 10.2147/JBM.S64976. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Whittle M.R., Sumita D.R. Quadruplex real-time PCR for forensic DNA quantitation. Forensic Sci. Int.: Genet. Suppl. Ser. 2008;1(1):86–88. [Google Scholar]
  • 7.Savidge T., Pothulakis C. vol. 34. Academic Press; 2004. (Microbial Imaging). google-Books-ID: tnQI8mjr9joC. [Google Scholar]
  • 8.Nolan T., Hands R.E., Bustin S.A. Quantification of mRNA using real-time RT-PCR. Nat. Protoc. 2006;1:1559. doi: 10.1038/nprot.2006.236. [DOI] [PubMed] [Google Scholar]
  • 9.Pabinger S., Rödiger S., Kriegner A., Vierlinger K., Weinhäusel A. A survey of tools for the analysis of quantitative PCR (qPCR) data. Biomol. Detect. Quantif. 2014;1(1):23–33. doi: 10.1016/j.bdq.2014.08.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Ritz C., Spiess A.-N. qpcR: an R package for sigmoidal model selection in quantitative real-time polymerase chain reaction analysis. Bioinformatics. 2008;24(13):1549–1551. doi: 10.1093/bioinformatics/btn227. [DOI] [PubMed] [Google Scholar]
  • 11.Spiess A.-N., Feig C., Ritz C. Highly accurate sigmoidal fitting of real-time PCR data by introducing a parameter for asymmetry. BMC Bioinform. 2008;9(1):221. doi: 10.1186/1471-2105-9-221. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Boggy G.J., Woolf P.J. A mechanistic model of PCR for accurate quantification of quantitative PCR data. PLoS ONE. 2010;5(8):e12355. doi: 10.1371/journal.pone.0012355. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Rödiger S., Burdukiewicz M., Schierack P. chipPCR: an R package to pre-process raw data of amplification curves. Bioinformatics. 2015;31(17):2900–2902. doi: 10.1093/bioinformatics/btv205. [DOI] [PubMed] [Google Scholar]
  • 14.Blagodatskikh K.A., Kramarov V.M., Barsova E.V., Garkovenko A.V., Shcherbo D.S., Shelenkov A.A., Ustinova V.V., Tokarenko M.R., Baker S.C., Kramarova T.V., Ignatov K.B. Improved DOP-PCR (iDOP-PCR): a robust and simple WGA method for efficient amplification of low copy number genomic DNA. PLOS ONE. 2017;12(9):e0184507. doi: 10.1371/journal.pone.0184507. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Bustin S.A. The reproducibility of biomedical research: sleepers awake! Biomol. Detect. Quantif. 2014;2:35–42. doi: 10.1016/j.bdq.2015.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Rödiger S., Burdukiewicz M., Blagodatskikh K.A., Schierack P. R as an environment for the reproducible analysis of DNA amplification experiments. R J. 2015;7(2):127–150. http://journal.r-project.org/archive/2015-1/RJ-2015-1.pdf [Google Scholar]
  • 17.Rödiger S., Burdukiewicz M., Spiess A.-N., Blagodatskikh K. Enabling reproducible real-time quantitative PCR research: the RDML package. Bioinformatics. 2017 doi: 10.1093/bioinformatics/btx528. [DOI] [PubMed] [Google Scholar]
  • 18.Spiess A.N., Rodiger S., Burdukiewicz M., Volksdorf T., Tellinghuisen J. System-specific periodicity in quantitative real-time polymerase chain reaction data questions threshold-based quantitation. Sci. Rep. 2016;6:38951. doi: 10.1038/srep38951. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Tichopad A., Dilger M., Schwarz G., Pfaffl M.W. Standardized determination of real-time PCR efficiency from a single reaction set-up. Nucleic Acids Res. 2003;31(20):e122. doi: 10.1093/nar/gng122. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Zhao S., Fernald R.D. Comprehensive algorithm for quantitative real-time polymerase chain reaction. J. Comput. Biol. 2005;12(8):1047–1064. doi: 10.1089/cmb.2005.12.1047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Ramakers C., Ruijter J.M., Deprez R.H., Moorman A.F. Assumption-free analysis of quantitative real-time polymerase chain reaction (PCR) data. Neurosci. Lett. 2003;339(1):62–66. doi: 10.1016/s0304-3940(02)01423-4. [DOI] [PubMed] [Google Scholar]
  • 22.Rödiger S., Friedrichsmeier T., Kapat P., Michalke M. RKWard: a comprehensive graphical user interface and integrated development environment for statistical analysis with R. J. Stat. Softw. 2012;49(9):1–34. https://www.jstatsoft.org/article/view/v049i09/v49i09.pdf [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

mmc1.pdf (511.7KB, pdf)

Articles from Biomolecular Detection and Quantification are provided here courtesy of Elsevier

RESOURCES