Abstract
Background
Linear regression of efficiency (LRE) introduced a new paradigm for real-time qPCR that enables large-scale absolute quantification by eliminating the need for standard curves. Developed through the application of sigmoidal mathematics to SYBR Green I-based assays, target quantity is derived directly from fluorescence readings within the central region of an amplification profile. However, a major challenge of implementing LRE quantification is the labor intensive nature of the analysis.
Findings
Utilizing the extensive resources that are available for developing Java-based software, the LRE Analyzer was written using the NetBeans IDE, and is built on top of the modular architecture and windowing system provided by the NetBeans Platform. This fully featured desktop application determines the number of target molecules within a sample with little or no intervention by the user, in addition to providing extensive database capabilities. MS Excel is used to import data, allowing LRE quantification to be conducted with any real-time PCR instrument that provides access to the raw fluorescence readings. An extensive help set also provides an in-depth introduction to LRE, in addition to guidelines on how to implement LRE quantification.
Conclusions
The LRE Analyzer provides the automated analysis and data storage capabilities required by large-scale qPCR projects wanting to exploit the many advantages of absolute quantification. Foremost is the universal perspective afforded by absolute quantification, which among other attributes, provides the ability to directly compare quantitative data produced by different assays and/or instruments. Furthermore, absolute quantification has important implications for gene expression profiling in that it provides the foundation for comparing transcript quantities produced by any gene with any other gene, within and between samples.
Introduction
Real-time qPCR has provided the foundation for a plethora of applications in basic research, biomedical diagnostics and pathogen detection [1], [2], [3]. Nevertheless, the relative quantification upon which conventional qPCR methodologies are based has prevented the full potential of real-time qPCR from being realized. Foremost is the difficulty of implementing absolute quantification, due to the necessity of constructing target-specific standard curves [4]. This makes absolute quantification impractical for large-scale applications that require quantification of more than a handful of targets.
Originating from the application of sigmoidal mathematics to model PCR amplification, linear regression of efficiency (LRE) provides an alternative approach to real-time qPCR, in which absolute quantification can be conducted without standard curves [5], [6], [7]. In addition to enabling large-scale absolute quantification, LRE provides quality control capabilities not possible with conventional methods. Finally, extensive testing has demonstrated the ability to achieve absolute accuracies of ±15–30%, even down to a single target molecule [7].
Despite the exceptional capabilities of LRE, attempts to manually implement data analysis using MS Excel quickly became untenable. This in turn prompted attempts to develop software for automated analysis, which led to the production of a small Java program that automated LRE quantification [5]. Unfortunately, this program was limited to analysis of one amplification profile at a time, and provided no ability to store data. Taking advantage of the extensive resources that are freely available for developing Java-based software, it was possible to extend this simple Java program into a fully featured desktop application. Called the LRE Analyzer, this program provides the automated data analysis and database capabilities required for implementing large-scale qPCR applications.
Methods
Implementation
The LRE Analyzer was written in Java using the NetBeans IDE (http://netbeans.org), utilizing the modular architecture and windowing system provided by the NetBeans Platform. The object database DB4O (http://db4o.com) is used for data storage and JExcel (http://jexcelapi.sourceforge.net) is used for data import and export. The program source code and installation files have been published as an open source project at Google Code (http://code.google.com/p/lreqpcr) under a GNU GPL license, in addition to a website that provides supporting information (https://sites.google.com/site/lreqpcr). The program has been tested extensively using the MS Windows XP operating system and has been confirmed to run on the Mac OS X and Unix operating systems (with JRE 1.6 installed).
Installation
The LRE Analyzer can be installed by downloading the files provided at the open source project website (http://code.google.com/p/lreqpcr), which also includes demonstration database files.
cDNA quantifications
RNA extraction and reverse transcription were conducted as previously described [5], [7]. Data presented in the demonstration databases were generated using an Applied Biosystems 7500 instrument (normal ramping), Qiagen QuantiTect in a 10 µl reaction volume containing 500 nM of primers, in 96 well white plates (ABgene) sealed with MicroAmp film (Applied Biosystems), and amplified using a cycling regime of 15 min activation at 95°C, followed by 50 cycles of 95°C −10 s, 65°C −120 s. Limiting dilution assays (LDA) were conduced as previously described [5], [7].
Results
Databases and data structures
The LRE Analyzer stores data in three independent databases, maintained as files with distinct extensions: *.exp, *.cal and *.amp respectively (Figure 1). An important extension of LRE over conventional qPCR methods is construction of an average profile from their respective replicate profiles (i.e. technical replicates), generated by averaging, for each cycle, the fluorescence readings from each of the replicate profiles. This can greatly increase the precision of the fluorescence readings, which is essential for some instruments. Note that average profiles are automatically constructed during data import and are the primary working unit of the LRE Analyzer. However, an important qualification is that the replicate profiles must be tightly clustered. A prominent exception to this requirement occurs when target quantities fall below 10 molecules per reaction, due to the impact of Poisson distribution [7]. In such situations, target quantity must be determined by averaging the quantities produced by each individual replicate profile.
Another concept central to LRE quantification is referred to as “reaction setup”, which encompasses all of the factors impacting the optics of an assay. These include the reaction vessel and closure, the enzyme formulation and the optical characteristics of the instrument, which as a whole determine the fluorescence intensity of an assay. A key aspect of implementing absolute quantification is optical calibration in which the fluorescence intensity of an assay is quantified by amplification of a known quantity of lambda gDNA, generating what is called an optical calibration factor (OCF) that is unique to a specific reaction setup [5], [7].
Note that the LRE Analyzer help set provides detailed descriptions of how LRE quantification is conducted, along with guidelines on how to implement LRE-based assays. Demonstration database files are also provided to assist in illustrating how the LRE Analyzer functions, along with insights into the capabilities of LRE quantification.
User Interface
The user interface is organized into three panels, each containing windows that provide functions for viewing and/or editing of data from each of the three LRE database types (Figure 2). The explorer panel contains windows that allow creation, opening and closing of database files. Once a database file is opened, data is presented as a tree, such as profiles within a run, along with displaying information about the data element within its label (Figure 3). When a data element within an explorer window is selected, the corresponding editor window appears within the editor panel, displaying information about the selected data element, much of which can be edited (Figure 4). The sorting panel makes it possible for profiles to be organized by either amplicon or sample, allowing profiles generated across multiple runs to be viewed/edited or exported as a group (Figure 5).
Workflow
Following completion of a run, fluorescence readings are exported into an Excel workbook. The LRE Analyzer provides Excel templates for manual data import (Figure 6), along with support for importing data from Applied Biosystems 7500 and Stratagene Mx3000p instruments (support for additional platforms could be added depending on demand and available resources). The primary requirement for data import is to link each amplification profile to the sample and amplicon used to generate it, and to designate which profiles are calibration profiles. The data is then read into the LRE Analyzer, which conducts automated analysis for each profile, including automated retrieval of amplicon size from an amplicon database. Although it is possible to manually adjust the analysis by adjusting the LRE window, testing has shown that automated analysis can routinely generate quantitative accuracies in the ±15–30% range. Target quantities can then be exported into an Excel workbook sorted by run, sample or amplicon (Figure 7).
Automated LRE window selection
Implementing a simple strategy
In addition to conducting the mathematical calculations required for LRE quantification [5], [7], devising a method for automated LRE window selection was essential for the general efficacy and reliability of the LRE Analyzer. As described in Figure 8, the LRE window consists of the cycles that are included in the LRE analysis of a profile. The basic strategy was to select the first cycle of the LRE window (referred to as the “start cycle”), which defines the bottom of the LRE window, and to start with a default window size of three cycles. The LRE window is then expanded by adding cycles to the top of the window, until a cycle is encountered that exceeds a specified threshold based on conformity to the LRE model.
Optical read precision is critical for start cycle selection
Although a primary objective is to maximize the size of the LRE window by placing the start cycle as early in a profile as possible, optical read precision becomes a major limitation. This is because LRE analysis is based on determining cycle efficiency (EC), which is calculated by dividing the cycle fluorescence (FC) by the fluorescence reading produced by the preceding cycle (FC-1):
Due to this ratio-based determination, the accuracy of EC determination is dependent on the precision of the fluorescence readings, which is generally referred to as “read precision”. The accuracy of cycle efficiency determination can thus be dramatically compromised when reaction fluorescence is below the lower limit of the instrument's optical capacity. Large differences in this lower limit between different instruments, combined with the arbitrary nature of the fluorescence units used in real-time PCR, presented major challenges for automated selection of the start cycle.
This prompted a default implementation based on a simple, albeit suboptimal method of designating the start cycle as the first cycle below C1/2. However, although this approach can be reasonably reliable using a number of reaction setups, an alternative method was developed that allows the lower limit of the LRE window to be manually specified. Based on entry of a “minimum FC”, the start cycle is set to the cycle following the first cycle that produces a FC greater than this minimum (i.e. the cycle from which the start cycle EC denominator is taken). The “LRE Window Selection Parameters” panel within the Profile Editor window allows the minimal FC and F0 threshold to be adjusted manually (Figure 9).
During early implementation of the LRE window selection parameters, it became apparent that a method for assessing the overall quantitative precision could be useful. The approach taken was based on the variance of target quantities generated by technical replicates; that is, the CV of the F0 values produced by replicate profiles. Referred to as the “Av Repl-Fo CV”, averaging the quantitative variances generated by all the replicate reaction sets within a run, provided such a general assessment. This not only proved to be useful for selecting an optimal minimum FC, but also for assessing the overall performance of an assay. Although beyond the scope of this study, this has revealed, among other things, large differences in instrument performance, due primarily to differences in the optical precision they produce. A simple but generally effective method is to lower the minimum FC until the average replicate F0 CV reaches a minimum, although this should only be taken as a general guideline, as exceptions have been encountered.
Defining the top of the LRE window via the F0 threshold
A major source of quantitative error discovered during early attempts to apply sigmoidal mathematics to PCR using nonlinear regression analysis (sigmoidal curve fitting or SCF) were distortions within the upper region of a profile [9]. In order to maximize quantitative accuracy, it was found essential to exclude such aberrant cycles from the analysis. The recursive nature of LRE analysis provided a simple method for identifying such aberrant cycles [5], which are apparent in both the FC and LRE plots (Figure 10).
An important objective for setting the upper limit of the LRE window was thus to avoid inclusion of such aberrant cycles. An objective method came from taking the F0 value generated by the cycle immediately above the LRE window, and comparing it with the average F0 value generated by the cycles within the LRE window. If the difference is below a specified value, defined as the F0 threshold (Figure 9), the LRE window is expanded to include this next cycle and LRE analysis is repeated. This process is continued until a cycle is encountered that exceeds the F0 threshold. The Tabular Summary located within the profile editing panel provides a numerical perspective on the process (Figure 11).
Although an F0 threshold of 6% has generally been found to be effective, it should be noted that increasing the F0 threshold can lead to susceptibility to another form of kinetic distortion, referred to as “profile collapse” (Figure 12). In contrast to plateau drift that can generate underestimations of Emax, inclusion of collapsed cycles overestimates Emax, which in turn generates an underestimation of target quantity.
Another form of aberrant kinetics
Another form of kinetic distortion found to be produced by some commercial enzyme formulations, is referred to as “profile arcing” (Figure 13). Such profiles do not conform well to the LRE model and can thus generate unreliable quantifications. Enzyme formations that have been found to be effective for LRE analysis include Qiagen QuantiTect, Agilent Brilliant II, and Invitrogen Platinum SYBR, although many others are likely to also be effective. Finally, it should be noted that although extensive testing has shown SYBR Green I to be effective for LRE-based absolute quantification, other detection chemistries many not be as effective.
An example taken from the demonstration databases
Generated during a study investigating the reproducibility of LRE quantification for gene expression profiling, the datasets provided in the demonstration databases focus on assessing the reproducibility of reverse transcription and run-to-run quantitative variances. Consisting of three runs conducted over a 3-day period, three Arabidopsis reference genes (12240, 46630 and GAPDH) were quantified within each of three replicate reverse transcriptase (RT) preparations (i.e. made with the same RNA sample) using three replicate PCR reactions for each quantitative determination. Figure 14 compares target quantities generated across both runs and RT preparations for each of the three targets that produced an overall run-to-run average CV of ±9.5% (i.e. across all three targets), which is substantially below that reported previously (±15–30%) [5], [7], likely due to the exceptional optical precision produced by the Applied Biosystems 7500 instrument used to conduct these runs.
Of potentially greater significance is that these datasets reveal RT variances of 13.0%, 10.1% and 9.5%, respectively, for each of three targets (average 10.9%). This indicates that these RT reactions generated variances below the variance of LRE quantification (i.e. error of measurement), which in turn demonstrates the high level of repeatability that can be achieved with reverse transcription. It is also important to note that these target quantities presented here were generated using the automated analysis provided by the LRE Analyzer, which requires little or no user intervention. Figure 14 also provides an example of a key attribute of absolute quantification, which is the ability to assess quantitative accuracy using limiting dilution assay (LDA), a method that provides the ability to conduct absolute quantification independent of the kinetic and optical parameters on which real-time qPCR is based [5], [7]. This indicated that for two of the targets, LRE quantification agreed within 12% of that generated by LDA, again consistent with that previously reported [5], [7].
Finally, the LRE Analyzer provides an open source platform that facilitates data storage and exchange. For example, the LRE databases could be published as supplementary data that in turn would allow access to raw data in an organized, easy to access form.
Discussion
One of the most striking features of using the LRE Analyzer is the ease of evaluating large amounts of data generated over multiple runs. Central to this capability is the universal perspective provided by absolute quantification, which allows target quantities to be directly compared not only across different runs, but also across different assays and/or instruments. In contrast, the relative quantification upon which conventional qPCR methods rely, generates target quantities based on a single point that defines the position of a profile. Called the “quantitative cycle” (Cq) [10], a major limitation of this approach is that Cq is assay specific. Thus, in order to directly compare Cq values, some form of normalization is required, such as conducting run normalization using external standards [11], or normalization to a reference gene(s) [12].
Absolute quantification eliminates the need for such post-run data processing, in that absolute quantification normalizes assay-specific differences, such that quantities expressed as the number of target molecules become independent of assay implementation. An important implication of this principle is that assay performance can be defined in terms of quantitative accuracy; that is, how well a quantitative determination correlates with the actual number of target molecules within a sample [7].
The ease of conducting absolute quantification provided by LRE, combined with the automated data processing capability of the LRE Analyzer, could thus greatly enhance the utility and reliability of real-time qPCR. In addition to circumventing many of the limitations associated with conventional methods, the universal perspective provided by absolute quantification also provides the foundation for effective standardization of qPCR that, for example, could be achieved through the establishment of performance benchmarks based on quantitative accuracy [5], [7].
Acknowledgments
I am indebted to Don Stewart for conducting the qPCR analysis contained within demonstration databases, to Michel Cusson for reviewing the manuscript, and I thank Pamela Cheers for editorial assistance.
Footnotes
Competing Interests: The author has declared that no competing interests exist.
Funding: The research was supported in part by a grant to RGR from the Canadian Biotechnology Strategy. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. No additional external funding were received for this study.
References
- 1.VanGuilder HD, Vrana KE, Freeman WM. Twenty-five years of quantitative PCR for gene expression analysis. BioTechniques. 2008;44:619–626. doi: 10.2144/000112776. [DOI] [PubMed] [Google Scholar]
- 2.Murphy J, Bustin SA. Reliability of real-time reverse-transcription PCR in clinical diagnostics: gold standard or substandard? Expert Rev Mol Diagn. 2009;9:187–197. doi: 10.1586/14737159.9.2.187. [DOI] [PubMed] [Google Scholar]
- 3.Pfaffl MW. The ongoing evolution of qPCR. Methods. 2010;50:215–216. doi: 10.1016/j.ymeth.2010.02.005. [DOI] [PubMed] [Google Scholar]
- 4.Rutledge RG, Côté C. Mathematics of quantitative kinetic PCR and the application of standard curves. Nucleic Acids Res. 2003;31:e93. doi: 10.1093/nar/gng093. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Rutledge RG, Stewart D. A kinetic-based sigmoidal model for the polymerase chain reaction and its application to high-capacity absolute quantitative real-time PCR. BMC Biotechnol. 2008;8:47. doi: 10.1186/1472-6750-8-47. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Rutledge RG, Stewart D. Critical evaluation of methods used to determine amplification efficiency refutes the exponential character of real-time PCR. BMC Mol Biol. 2008;9:96. doi: 10.1186/1471-2199-9-96. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Rutledge RG, Stewart D. Assessing the performance capabilities of LRE-based assays for absolute quantitative real-time PCR. PLoS One. 2010;5:e9731. doi: 10.1371/journal.pone.0009731. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Lefever S, Hellemans J, Pattyn F, Przybylski DR, Taylor C, et al. RDML: structured language and reporting guidelines for real-time quantitative PCR data. Nucleic Acids Res. 2009;37:2065–2069. doi: 10.1093/nar/gkp056. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Rutledge RG. Sigmoidal curve-fitting redefines quantitative real-time PCR with the prospective of developing automated high-throughput applications. Nucleic Acids Res. 2004;32:e178. doi: 10.1093/nar/gnh177. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Bustin SA, Benes V, Garson JA, Hellemans J, Huggett J, et al. The MIQE guidelines: minimum information for publication of quantitative real-time PCR experiments. Clin Chem. 2009;55:611–622. doi: 10.1373/clinchem.2008.112797. [DOI] [PubMed] [Google Scholar]
- 11.Vermeulen J, Pattyn F, De Preter K, Vercruysse L, Derveaux S, et al. External oligonucleotide standards enable cross laboratory comparison and exchange of real-time quantitative PCR data. Nucleic Acids Res. 2009;37:e138. doi: 10.1093/nar/gkp721. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Gutierrez L, Mauriat M, Guenin S, Pelloux J, Lefebvre JF, et al. The lack of a systematic validation of reference genes: a serious pitfall undervalued in reverse transcription-polymerase chain reaction (RT-PCR) analysis in plants. Plant Biotechnol J. 2008;6:609–618. doi: 10.1111/j.1467-7652.2008.00346.x. [DOI] [PubMed] [Google Scholar]