Abstract
Quantitative PCR (qPCR) is used for the determination of gene copy number (GCN). GCNs contribute to human disorders, and characterize copy number variation (CNV). The single laboratory method validations of duplex qPCR assays with hydrolysis probes on CYP21A1P and CYP21A2 genes, residing a CNV (RCCX CNV) and related to congenital adrenal hyperplasia, were performed using 46 human genomic DNA samples. We also performed the verifications on 5 qPCR assays for the genetic elements of RCCX CNV; C4A, C4B, CNV breakpoint, HERV-K(C4) CNV deletion and insertion alleles. Precision of each qPCR assay was under 1.01 CV%. Accuracy (relative error) ranged from 4.96±4.08% to 9.91±8.93%. Accuracy was not tightly linked to precision, but was significantly correlated with the efficiency of normalization using the RPPH1 internal reference gene (Spearman’s ρ: 0.793–0.940, p>0.0001), ambiguity (ρ = 0.671, p = 0.029) and misclassification (ρ = 0.769, p = 0.009). A strong genomic matrix effect was observed, and target-singleplex (one target gene in one assay) qPCR was able to appropriately differentiate 2 GCN from 3 GCN at best. The analysis of all GCNs from the 7 qPCR assays using a multiplex approach increased the resolution of differentiation, and produced 98% of GCNs unambiguously, and all of which were in 100% concordance with GCNs measured by Southern blot, MLPA and aCGH. We conclude that the use of an internal (in one assay with the target gene) reference gene, the use of allele-specific primers or probes, and the multiplex approach (in one assay or different assays) are crucial for GCN determination using qPCR or other methods.
Introduction
Quantitative PCR (qPCR) was originally developed for virus quantification [1], and has recently attracted more attention owing to the SARS-CoV-2 pandemic. It is most often used for the quantification of mRNA levels [2], but the gene copy number (GCN) determination in diploid genomes has gained benefit from it for a long time [3]. GCN is the number of repeats of a gene in one or two sets of chromosomes. The vast majority of the genes occurs twice in a diploid genome, but the copy numbers of some genes can differ from two. GCN is a non-negative whole number, and the “integer GCN” term is used when this characteristic is emphasized. Variations in GCN contributes to both rare genetic disorders and common diseases in humans [4].
The qPCR for human GCN determination can be distinguished from qPCR for gene expression by some key features: 1.) Genomic DNA is the template. The template complexity, which can reduce the performance of qPCR [5], is much greater in genomic DNA than in total mRNA of a particular tissue: The haploid human genome consists of 3.1 billion base pairs, and millions of base pairs differ between two random haploid chromosome sets [6], while a few hundred genes account for 50% of transcripts in most human tissues [7] covering only a couple of hundred thousand base pairs in total length.
2.) Limit of detection (LOD) is not crucial. The absolute copy number of a target gene in a DNA sample is proportional to the absolute number of haploid chromosome sets, which can be approximately calculated from the mass of genomic DNA in the sample. The absolute number of haploid chromosome sets can be more accurately determined by the quantitative measurement of a reference gene, which invariably occurs once in each haploid chromosome set. The ratio of absolute copy numbers of a target gene and a reference gene in the DNA sample of a subject is identical to the ratio of the copies of target and reference genes in two haploid chromosome sets of a diploid cell. The ratios of the target and reference genes is not conditional on the amount of genomic DNA in a sample, and the GCN of a target gene is easily calculated from this ratio since the GCN of a reference gene is always two in a diploid cell. Therefore, the amount of genomic DNA in a measurement also does not influence GCN (in theory), and can be chosen to be conveniently above the limit of detection (LOD).
3.) The differentiation between greater consecutive GCNs is difficult. The quantification cycle (Cq) is determined by qPCR to characterize the absolute copy number of a gene in reality. Cq is proportional to a relatively short DNA sequence specific to a target or a reference gene, and GCN is calculated from Cqs related to the target and reference genes. GCN determined by qPCR can be called “measured GCN”, and can be a positive real number (for example, a rational number), not necessarily a non-negative whole number. The relationship between Cq and GCN can be described by the equation: Cq(target gene)-Cq(reference gene) = -((log2(GCN)/log2(2))-1. The reference gene Cq is constant in theory, and therefore only the target gene Cq determine GCN. The theoretical difference between two target gene Cqs derived from two consecutive GCNs will approach zero if GCN approaches infinity. This means that the theoretical difference of two target gene Cqs is ∞ between 0 and 1 GCN, ΔCq = 1 between 2 and 1 GCNs, ΔCq = 0.585 between 3 and 2 GCNs, ΔCq = 0.415 between 4 and 3 GCNs, ΔCq = 0.322 between 5 and 4 GCNs, and so on. Therefore, it becomes more and more difficult to differentiate the greater consecutive GCN, which presents the key problem of qPCR as well as other molecular biology methods for GCN determination.
4.) The inaccurately measured GCNs can be easily identified in the majority of cases. Ambiguity is the state of a measured GCN which is not close enough to an integer GCN to assign unequivocally the measured GCN to the integer GCN. The measured GCN is a continuous variable, and therefore a measured GCN can be about halfway between two integer GCNs, which clearly indicates the inaccuracy of the particular measurement. The distribution of several measured GCNs derived from the same integer GCN approaches a normal distribution, resulting in the majority of the measured GCNs around the real integer GCN (unambiguous GCNs), some measured GCNs between the real GCN and an adjacent integer GCN (ambiguous GCNs) and a few measured GCNs around the adjacent integer GCNs (misclassified GCNs).
Returning to the key problem, there are two techniques allowing GCN methods to extend beyond this limitation: 1.) The use of allele-specific primers or probes. Paralogous genes or gene variants are the copies of the same gene (or very similar ones), which have high DNA sequence similarity, and are located at the different loci of a haploid chromosome set. There are very often sequence differences between the paralogous gene variants which can be targeted in an allele-specific way, and the total GCN of the gene variants can be decomposed into smaller GCNs. 2.) The use of several probes (in one or different assays) for the same target gene. The integer GCN of a target gene can be estimated from the measured GCNs of more DNA sequences that are parts of the target gene, which can make the estimation more reliable. This estimation of the integer GCN can be based on the personal decision of an operator [8], a simple mathematical measure such as arithmetic mean [9] or a more complex statistical method such as a classifier.
The above-mentioned features and techniques are well illustrated by RCCX copy number variation (CNV, S1 Table). RCCX CNV usually consists of 1–3 tandem repeats of a DNA segment on one chromosome, and each DNA segment harbors 2 complete genes, complement component 4 (C4) and steroid 21-hydroxylase (CYP21) [10, 11]. Therefore, the copy numbers of the RCCX CNV segment, C4 and CYP21 are identical, and no exception to this has been described yet. Both genes have 2–2 paralogous gene variants. C4 genes are sorted into C4A and C4B genes differing in 5 nucleotides in exon 26. The CYP21 genes sort into a functioning gene (CYP21A2) and a pseudogene (CYP21A1P) having several sequence differences. CYP21A2 contributes to the steroidogenesis of adrenal glands, and its mutations cause congenital adrenal hyperplasia (CAH) [12]. An additional sequence difference of C4 genes derived from a virus insertion in intron 9 (called human endogenous retrovirus K (HERV-K(C4) CNV)), and a RCCX CNV breakpoint, where two CNV segments are joined, are currently being researched [13]. Multiplex ligation-dependent probe amplification (MLPA) for the GCNs of CYP21 genes is commercially available, and is recognized as an appropriate methods in the genetic testing of CAH [14]. MLPA uses multiple CYP21A1P- and CYP21A2-specific probes, and special statistical methods followed by the final evaluation of integer GCNs by the operator.
The GCNs in RCCX CNV are also determined by qPCR based on allele-specific primers or probes [13, 15–17]. However, these qPCR assays are singleplex for target genes (target-singleplex) in stark contrast to MLPA, and none of their published documentations completely meets the requirements of “minimum information for publication of quantitative real-time PCR experiments” (MIQE) [18, 19]. In this study we therefore aimed to simultaneously assess the performance of 7 qPCR assays for the GCN determination of the genetic elements of RCCX CNV according to MIQE. Verifications were performed on C4A, C4B, HERV-K(C4) CNV deletion, HERV-K(C4) CNV insertion, and RCCX CNV breakpoint qPCR assays in the current study because some information has been published on these assay performances (S2 Table), whereas single laboratory method validations [20] were completed on CYP21 qPCR assays. Furthermore, our goals were to examine the optimal laboratory strategy of qPCR for GCN determination in general, and the fit-for-purpose of qPCR assay for CYP21A2 GCN determination in the genetic testing of CAH.
Materials and methods
DNA samples
The qPCR validation and verification processes were completely in accordance with MIQE (S1 File). The current research was conducted with the approval by the National Scientific and Ethical Committee, Medical Research Council of Hungary (TUKEB, ETT), approval number 4457/2012/EKU. Written informed consent was given by all of the study subjects. Genomic DNA samples were extracted from whole blood of 10 healthy subjects, 11 patients with CAH and 5 patients with non-functioning adrenal incidentaloma (NFAI) using a Qiagen QIAcube instrument (S3 Table) with Qiagen QIAamp DNA blood mini kit or a Roche DNA isolation kit for mammalian blood (S1 File). Genomic DNA samples were also purchased from the International Histocompatibility Working Group (IHWG). Purchased DNA samples derive from 18 healthy subjects of the HapMap European reference population (CEU) [21] and 2 HLA homozygous cell lines, COX and QBL. Purchased DNA samples were isolated by IHWG using 5-Prime ArchivePure DNA cell/tissue kit, and their nominal concentrations were 100 ng/μl. DNA samples of COX and QBL were applied in a 1:1 mixture. An SD039 reference DNA sample for MLPA with a nominal concentration of 10 ng/μl, which is included in MRC Holland SALSA MLPA probe mix P050 CAH, was also used. The concentration and purity of all DNA stock solutions were determined by a Thermo Fisher Scientific (TFS) NanoDrop 2000 spectrophotometer and a TFS Qubit 1 fluorometer with Qubit dsDNA high sensitivity assay kit, and DNA integrity was checked on 0.7 m/V% agarose gels.
Positive controls, samples for calibration curves and study groups
DNA working solutions with 5 ng/μl DNA concentration were separately diluted from the stock solutions of the DNA samples for 3 replicate measurements, except for ones for positive controls (more than 3 separately diluted working solutions) and for the calibration curve (a series of dilutions). DNA samples derived from our own subjects were divided based on DNA quality into “good quality” (A260/A280>1.8 and A260/A230>2.0 and no sign of DNA degradation) and “bad quality” study groups (n = 10). The SD039 reference sample for MLPA was assigned to the “good quality” group (n = 17). The DNA samples purchased from IHWG were labeled as the “population” group (n = 19).
Measurements
The applied custom primers (Integrated DNA Technologies) and hydrolysis probes (produced by TFS) for C4 genes [16], CYP21 genes [15], HERV-K(C4) CNV alleles and RCCX CNV breakpoint [13] (S4 Table) have been previously published. All these hydrolysis probes had a 5’-fluorochrome, 6-carboxyfluorescein (FAM) reporter, and a 3’-nonfluorescent quencher and a 3’-minor groove binder. The mix for qPCR measurements (S5 Table) also contained ribonuclease P RNA component H1 (RPPH1) internal reference gene included in TFS TaqMan human RNase P copy number reference assay with a probe labeled with 2′-chloro-7′-phenyl-1,4-dichloro-6-carboxy-fluorescein (VIC), and TFS TaqMan fast advanced master mix (FAMM). The concentrations of custom primers and probes were slightly different in different qPCR assays (S5 Table). The same qPCR profile according to the manual of FAMM was used for all 7 qPCR assays. The qPCR measurements were carried on a TFS QuantStudio 7 qPCR instrument (QS7) with TFS QuantStudio software v1.2 except for some experiments with SYBR Green and robustness experiments (see more later). The three DNA working solutions of a DNA sample were separately measured for a target gene on different days except for ones for positive controls. We measured 3 replicates from the same and 3 replicates from the different DNA working solutions of a DNA sample for positive control on each of the three days. The quantification threshold of the RPPH1 reference gene was always 0.1 for relative quantification. The quantification thresholds for the replicate measurements of each target gene were tuned in a way that the average of relative errors (REs) of measurements (based on a preliminary calculation) in a replicate measurement of “good quality” and “population” study group equaled approximately zero (S6 Table). Therefore, all measurements of these study groups (n = 36) were taken into account, instead of the use of arbitrarily selected reference samples and ΔΔCt method. There was no need for the further normalization or correction of the Cq values, and there was no Cq optimization for precision, calibration curve or the variances of RE.
The melting curve analyses were performed with Bioline SensiFAST SYBR Lo-ROX kit on GS7. The micro-capillary electrophoreses of duplex qPCR reactions were carried out by Agilent Bioanalyzer 2100 instrument with Agilent Bioanalyzer high sensitivity DNA kit. MRC Holland SALSA MLPA EK1 reagent kit and probe mix P050 CAH kit on TFS ProFlex PCR and TFS 3130 capillary electrophoresis instruments were used for the measurements of MLPA, and MRC Holland Coffalyser software v140721 for the calculation of MLPA results. Robustness experiments were performed using a Roche LightCycler 1.0 instrument with Roche LightCycler FastStart DNA master SYBR green I reagent, QS7 with TFS TaqMan universal master mix II without uracil-N-glycosylase (UMM2) or a TFS 7500 Fast qPCR instrument (7500F) with FAMM. All qPCR reagents in robustness experiments were used according to the manuals of the manufacturers.
Calculation and statistics
Non-specific PCR products were in silico predicted by Primer Blast [22], and the secondary structures of PCR products were in silico determined by UNAFold [23]. The limit of detection was estimated according to Hubaux and Vos [24], and statistical metrics were used based on MIQE (S6 Table). Statistical analyses were performed with R v4.0.2 [25] and SPSS v26. Normal distribution was tested by Shapiro–Wilk (SW) test. Fisher’s exact test (FE), Student’s t-test, Wilcoxon test, ANOVA with Tukey post-hoc test, Kruskal-Wallis (KW) test with Dunn post-hoc test, Pearson’s correlation, Spearman’s rank correlation and Levene’s test were used for basic statistics. Tests were two-tailed, p-values were corrected with the false discovery rate (FDR) method, and p<0.05 was considered as statistically significant. A linear mixed-effect model of the R package lme4 [26] was applied to the analyses of slopes of calibration curves. Integer GCNs were estimated from measured GCNs by a machine learning classifier, the linear discriminant analysis (LDA).
Results
Parameters of genomic DNA samples
DNA concentration, A260/A280, A260/A230 were determined, and DNA integrity was also tested. The genomic DNA samples purchased from IHWG were assigned to the “population” study group (n = 19). The DNA samples of the subjects, enrolled by us, were divided based on DNA quality into “good quality” (A260/A280≥1.8 and A260/A230≥2.0 and no sign of DNA degradation) and “bad quality” study groups (n = 10). A reference sample (SD039) was also assigned to the “good quality” group (n = 17). The mean and SD belonging to the A260/A280 and A260/A230 quality parameters of the stock solutions of genomic DNA sample were 1.848±0.043 and 2.064±0.483 in the „good quality”, 1.817±0.049 and 1.899±0.182 in the „population”, and 1.787±0.046 and 2.152±0.089 in the „bad quality” study group (S1 File). The “population” group included 5 DNA samples of 19 which had slightly lower A260/A280 values than 1.8. However, all samples were sample intact in the “population” and “good quality” groups. The DNA samples in the “bad quality” study group were partially degraded. The concentrations of DNA stock solutions measured by Qubit were significantly lower (Wilcoxon test: p<0.0001) than those by Nanodrop (S1 Fig) agreeing with a previous findings [27]. Nevertheless, genomic DNA concentrations theoretically affect neither precision nor accuracy in qPCR as long as DNA concentration is in the linear range of the measurement.
Analytical specificity
Putative non-specific PCR products were found only at the primer pair of HERV-K(C4) CNV insertion target element using Primer-BLAST (S7 Table). Non-specific PCR product was observed only at the primer pairs of C4A and C4B target genes by melting curve analyses (S2 Fig), and the micro-capillary electrophoresis of duplex qPCR reactions confirmed this finding. Nevertheless, the same PCR product is amplified from both C4A and C4B gene variants, and allele-specific hydrolysis probes have also discriminated between the different target sequences of the mixed PCR products generated by the same primer pair in a previous study [28]. A couple of nucleotide differences in a target sequence can completely block the binding of the non-specific probe, and a non-specific PCR product can bind a specific probe by chance with very low probability. Therefore, the non-specific PCR product does not necessarily distort the quantitative PCR performance of C4 assays.
Matrix effect of genomic DNA
The Cq values (S1 File) of the RPPH1 reference gene from different assays could be compared, because the concentration of RPPH1 reagents, qPCR running parameters and the quantification threshold for RPPH1 were identical in all assays. There were no significant differences between the RPPH1 Cqs in replicate measurements (SW FDR: p = 0.676–0.768; ANOVA: p = 0.632; Tukey: p = 0.717–1.000) (S3 Fig), but significant differences (SW FDR: p = 0.002–0.958, 3 significant ones out of 46; KW: p<0.0001; Dunn: 39.7% significant pairs) were observed between DNA samples (Fig 1). The matrix effect of genomic DNA (sample-to-sample variation) could cause the differing Cq means of different samples, because the causes, derived from the quality and quantity of DNA solutions, could be ruled out; the DNA quality was completely checked, the DNA extractions were performed in 2 independent laboratories using different methods, the concentrations of DNA stock solutions were double-checked with 2 different methods, and Cqs were measured from 3 independent diluted working solution series.
Linearity, PCR efficiency and analytical sensitivity
Separate calibration curves were measured using 3 different genomic DNA samples in each assay, and 5 DNA samples were selected in total 1.) to ensure the GCNs of the target genes were as diverse as possible, and 2.) to measure each sample at least 3 times (S4 and S5 Figs). The effects of samples and different assays on the regression slopes were examined by a linear mixed-effects model (S8 Table). Samples had twice as large an effect on the standard deviation (SD) of slopes than the assays. The effect of samples on slope deviations is more likely to arise from the matrix effect of genomic DNA (template complexity) than from the deviated PCR inhibition of different samples because there is no difference in the deviation of Cqs from the lines of calibration curves between low and high DNA concentrations.
The average PCR efficiencies in different assays were around 1 (S9 Table), and there were no significant differences between them (SW FDR: p = 0.857, ANOVA: p = 0.333, Tukey: p = 0.356–1.000 for target genes, SW FDR: p = 0.687–0.983, ANOVA: p = 0.604, Tukey: p = 0.627–1.000 for reference genes). The PCR efficiencies of target and RPPH1 pairs from the same assays and DNA samples showed a strong and significant correlation (Spearman’s ρ = 0.7426, p = 0.0001) (S6 Fig), indicating that RPPH1 effectively compensated for the matrix effect on PCR efficiency. Estimated LODs were around the theoretical limit, which seems overestimated (S9 Table). Nevertheless, the lowest dilution of calibration curves contained approximately 400–1200 copies of genomic template depending on the GCN, and all 63 measurements on this dilution produced adequate Cqs and GCNs, indicating that the lowest applied concentration was well above LOD, in agreement with a previous study [29].
Precision
Repeatability and reproducibility were assessed from the same and separate dilutions of positive control samples, and both of them showed low and quite homogenous coefficient of variation % (CV%) values throughout the assays (S10 Table). Reproducibility values were also assessed in “good quality”, “population” and “bad quality” study groups producing comparable results, and the highest pooled CV% was 1.01.
Gene copy numbers and their concordance
The measured GCNs between ±0.3 of an integer GCN were considered as unambiguous. The ambiguity of GCNs is usually defined by a customarily applied fixed limit, which increases the ambiguous GCNs at higher integer GCNs. For instance, a 10% difference between the integer and measured GCNs means an unambiguous result at a GCN of 2 (1.8 or 2.2 measured GCNs), but an ambiguous one at a GCN of 4 (3.6 or 4.4 measured GCNs) using a ±0.3 fixed limit. The majority of the average measured GCNs of samples in “good quality” and “population” study groups (Fig 2, S1 File) were unambiguous except for the GCNs of the HERV-K(C4) CNV insertion assay (S11 Table). Measured GCNs in the “bad quality” group were markedly ambiguous in CYP21A1P and CYP21A2 assays, implying a high sensitivity of these assays to DNA quality.
There is no reference method or certified reference material for GCN determinations in RCCX CNV. However, MLPA is most often used for CYP21 genes in the genetic testing of CAH [14]. Furthermore, the GCNs of C4A and C4B in the CEU human reference population [30, 31] and the full RCCX CNV DNA sequences [32, 33] and GCNs [13] of COX and QBL HLA homozygous cell lines have been determined (S1 File). The GCNs were fully concordant in SD039 and COX-QBL samples, and C4A, C4B and RCCX CNV breakpoint GCNs were reasonably concordant with the previous results of Southern blot and array comparative genome hybridization (CGH) (S12 Table). CYP21A1P and CYP21A2 GCNs were also suitably concordant with GCNs determined by MLPA (S7 Fig). The precisions of MLPA probes ranged between 12.60–38.62 CV% (S13 Table). The ambiguity was significantly higher for the CYP21A2 MLPA probe, which recognizes the same insertion allele of a 8 bp genetic variant than CYP21A2 qPCR assay, compared to the ambiguity of qPCR (35% vs 11% ambiguous GCNs, FE FDR: p = 0.024) in spite of appropriate quality controls of MLPA (S8 Fig). The reproducibility of MLPA was also assessed with the same dilutions of positive control samples in the same way as performed in qPCR assays; The reproducibility of GCNs based on positive controls for β-defensin loci are 6.25 CV% for qPCR and 2.88 CV% for MLPA in a previous study [34], whereas they were 5.08 CV% for CYP21A1P qPCR, 3.04 CV% for CYP21A2 qPCR, 4.84 CV% for CYP21A1P MLPA and 7.52 CV% for CYP21A2 MLPA in the current study.
Expected gene copy numbers, estimated integer gene copy numbers and consistency
The larger deviation (above 0.4) of a particular measured GCN from the expected GCN calculated by the multiple linear regression of other GCNs in the same sample highlighted the inconsistent results (S9 Fig, S1 File). Integer GCNs were estimated from the measured GCNs by LDA. Total C4, total CYP21, total HERV-K(C4) CNV GCNs in addition to RCCX CNV breakpoint GCN plus 2 were all equal, and used for the estimation of total integer GCNs. The integer GCN of a paralogous gene was identical to total GCN, where the GCN of the other paralogous gene was 0. The integer GCNs of a paralogous gene pair having both GCNs larger than 0 were estimated from the measured GCNs of the paralogous gene pair and RCCX CNV breakpoint (Fig 3).
The estimations on integer GCNs were unambiguous in 3 cases (1.94%) based on the cross-validation and probabilities of LDA, while the majority of estimations (N = 150, 98.06%) passed the cross-validation (S1 File). All unambiguously estimated integer GCNs were in 100% of concordance with the integer GCNs measured by Southern blot, MLPA and array CGH (N = 137), suggesting that the probabilities and cross-validation of LDA accurately indicated the ambiguity at estimated integer GCNs. Furthermore, all unambiguously estimated integer GCNs showed 100% consistency with each other.
Estimated accuracy
The RE of measurements in study groups sometimes significantly deviated from a normal distribution and from each other (Fig 4). However, there was no such significant difference (SW FDR: 0.055–0.892; ANOVA: 0.552–0.972) in REs grouped by replicate measurements (S10 Fig). Significant differences were between REs in the same assay grouped by GCNs (S11 Fig), although a clear tendency could not be observed. The REs grouped by DNA samples (Fig 5) reflected a significant matrix effect of genomic DNA (SW FDR: p = 0.060–0.987; ANOVA: p<0.0001; Tukey: 21.5% significant pairs), but the matrix effect on accuracy seemed to show a lesser extent than that on RPPH1 Cqs.
The means and SDs belonging to the absolute values of average REs of samples in the assays were between 4.96±4.08% and 9.91±8.93% (S12 Fig). The distributions of average REs fitted to normal distributions. The normal distribution of a particular assay, characterized by mean and SD, was assumed around every integer GCNs of the particular assay to estimate the ambiguity and misclassification rates. The estimated ambiguity and misclassification rates of assays (Table 1) were in accordance with observed ambiguity and concordance (S11 and S12 Tables). The estimated ambiguities were fairly moderate at a GCN of 2 in the assays with better performance, whereas the estimated misclassifications were sufficiently low at a GCN of 3. The ambiguity and misclassification rates were also estimated based on the normal distributions calculated by the average REs of each GCN in a particular assay (S14 Table).
Table 1. Estimated ambiguity and misclassification rates of different qPCR assays for RCCX CNV at different gene copy numbers (GCNs).
C4A assay | C4B assay | CYP21A1P assay | CYP21A2 assay | HERV-K(C4) CNV deletion assay | HERV-K(C4) CNV insertion assay | RCCX CNV breakpoint assay | |
---|---|---|---|---|---|---|---|
ambiguity at 1 GCN |
0.02% | 0.01% | 0.15% | 0.03% | >0.01% | 2.57% | >0.01% |
ambiguity at 2 GCN |
6.16% | 4.94% | 11.25% | 7.07% | 2.78% | 26.47% | 2.05% |
ambiguity at 3 GCN |
21.27% | 19.01% | 29.00% | 22.83% | 14.24% | 45.71% | 12.23% |
ambiguity at 4 GCN |
35.00% | 32.58% | 42.74% | 36.6% | 27.12% | 57.71% | 24.65% |
misclassification at 1 GCN | >0.01% | >0.01% | >0.01% | >0.01% | >0.01% | >0.01% | >0.01% |
misclassification at 2 GCN | >0.01% | >0.01% | 0.02% | >0.01% | >0.01% | 0.93% | >0.01% |
misclassification at 3 GCN | 0.36% | 0.23% | 1.37% | 0.49% | 0.06% | 8.28% | 0.03% |
misclassification at 4 GCN | 2.92% | 2.19% | 6.41% | 3.50% | 1.02% | 19.32% | 0.68% |
Estimations were performed based on the normal distributions (characterized by mean and standard deviation) of average relative errors of samples.
Robustness of CYP21A1P and CYP21A2 assays
The robustness was screened through precision in CYP21 genes, which was reasonably similar to the general setup except when using different qPCR reagents and instruments (S15 Table). Performance was assessed with UMM2 and a 7500F (S13–S16 Figs and S16–S20 Tables). The estimated ambiguity and misclassification rates with UMM2 were similar to the better ones of the assays measured by the general setup, despite the higher imprecision and worse linearity. The PCR efficiencies between CYP21 target and RPPH1 showed a higher correlation (S17 Fig) with UMM2 than with 7500F. Normalized root-mean-square error (NRMSE) is a common measure of the differences between a predicted value and a measured one. NRMSE was used to characterize how efficiently the Cqs of a target gene follows the Cqs of the RPPH1. The reproducibility values of target and RPPH1 were significantly correlated from sample to sample in all CYP21 assays, as well as NRMSE and accuracy, but there was no correlation between precision and accuracy (S21 Table). The same relationships could be observed in other assays (S22 Table). Moreover, the SDs of accuracies (the SDs of the average REs of samples) in the assays from the default and robustness experiments of “good quality” and “population” study groups were significantly correlated with observed ambiguities and misclassifications (Spearman’s ρ = 0.671, p = 0.029 and ρ = 0.769, p = 0.009).
Discussion
Despite the observation that the performance parameters of 7 different qPCR assays in the current study vary, the number of assays was large enough and the conditions of measurements were homogenous enough to draw generalized conclusions. The precision based on Cqs and calibration curve parameters were not strongly linked to the accuracy of GCNs. The HERV-K(C4) CNV insertion assay showed similarly good precision and calibration curve parameters to other assays, but its accuracy (RE) had a high SD. In contrast, worse precision and calibration curves were observed in the assays of CYP21 genes measured with UMM2, but their accuracies were similar to the assays with high performance. The accuracy is directly associated with ambiguity and misclassification, and therefore accuracy should be considered as the key performance parameter of qPCR for GCN. The matrix effect of genomic DNA observed at RPPH1 Cqs, calibration curves, and even accuracy to a lesser extent, seemed to have a great impact on the performance of qPCR for GCN. The higher effectiveness of the normalization to the RPPH1 reference gene was correlated with the higher accuracy, and the normalization probably contributed to the reduced matrix effect on accuracy. If ambiguity of around 5% and misclassification under 1% are considered acceptable, the methodological limit of a singleplex qPCR assay for a target gene with 3 replicates proved to be a GCN of 2 for ambiguity and a GCN of 3 for misclassification in general. A lower methodological limit of GCN [35], and a similar one [36] have been described in the literature of qPCR. At any rate, the higher range of integer GCNs occurring in patients or a study population often exceeds the methodological GCN limit of target-singleplex qPCR, producing enough ambiguous GCNs and misclassifications that we should question the fit-for-purpose at higher GCNs.
The usage of multiple target sequences from a multiplex assay [37] or separate assays [38] can overcome the GCN limitation of the target-singleplex method. The measured GCN of the multiple target genetic elements were classified using LDA as a multiplex approach. The majority of integer GCNs (98.06%) estimated from measured GCNs passed the cross-validation of LDA and were qualified as unambiguous. Furthermore, all these unambiguously estimated integer GCNs were in 100% concordance with the integer GCN estimations from MLPA and the findings of Southern blot and array CGH from previous studies [30, 31]. LDA was highly effective, even using the ambiguous data of qPCR assays with lower accuracy, such as HERV-K(C4) CNV insertion assay and DNA samples with bad quality. The low sample size of a GCN class limits the performance of the classifier, so it may be worth using a reference set enriched with samples having rarer GCNs. The multiplex approach could render qPCR for the genetic elements of RCCX CNV very effective, however, it would be interesting to see how this multiplex approach for qPCR tackles a target region with higher average GCN than that found in RCCX CNV.
Several studies [29, 34, 39–41] contrast one molecular biology method for GCN with another, often having the ambition to pronounce one of them more advanced or suitable. However, the final conclusions of these studies are controversial, and it is difficult to draw a general conclusion because: 1.) The performances are compared using only a few metrics. GCNs and their concordance between the examined methods are the typical levels of comparison. Methodological differences can hamper the comparison at multiple levels; for instance, the measurement from a single replicate does not allow for the use of many standard performance metrics such as CV%. 2.) Performance metrics are poorly assessed or documented. This is well-illustrated by qPCR experiments where usually only a fraction of performance metrics required by MIQE are published. Ambiguity between measured and integer GCNs is seldom stated explicitly, but it can be assessed in the majority of GCN determination methods, and the performance of a method can be inferred from it. 3.) Study conclusions are drawn from one or a couple of assays. The performance of a GCN method is limited from the direction of high performance due to theoretical or practical reasons, but it can be mediocre to any extent owing to the poor design or execution. Therefore, an assay of a particular method with higher performance characterizes better the particular method than one with lower performance. For instance, the reproducibility of GCN in MLPA for CAH in the current study was a little bit worse than the reproducibility in MLPA for β-defensin in a previous article [34], and the MLPA results for β-defensin more aptly characterize MLPA in general. 4.) The performance of a GCN method is only compared to other methods, not to the fitness for purpose. The target-singleplex performance of MLPA for the same genetic variant also detected by CYP21A2 qPCR assay were lower, but an inappropriate fit-for-purpose does not ensue from this. The real strength of MLPA is the multiplex approach, which provides appropriate final integer GCN results, and the relatively simple procedure. In addition, the MLPA for the genetic test of CAH has some potential to identify CAH mutations and chimeric CYP21 gene variants in the same assay [42].
Southern blot was the first GCN determination method for RCCX CNV [43], and uses a multiplex approach because the unlabeled genomic DNA fragment pattern bound to the membrane can be examined with several probes for the elements of RCCX CNV in succession. The disadvantages of Southern blot include high labor intensity, high time demand, and only semi-quantitative GCN results based on a human operator’s evaluation [44], decreasing its suitability for the genetic test of CAH. Array CGH is a high-throughput method, but its high labor intensity, high time demand and high cost do not fit to the needs of CAH laboratories, where the vast majority of array CGH results would not be used. Other GCN determination methods based on the multiple elements of RCCX CNV such as the paralog ratio test and the high resolution melting PCR have been described [30, 45]. Digital PCR has been used for the GCN determination of C4 paralogous gene variants and HERV-K(C4) CNV [46], but the methodological performance of the digital PCR has not been evaluated on the genetic elements of RCCX CNV yet.
The CYP21A2 qPCR assay in the current study produces a reasonable number of ambiguous results at a GCN of 2, and the measurements of ambiguous GCNs can be conveniently repeated because the labor intensity and time demand of qPCR is low. Lower GCNs (0 and 1) are frequently examined for genetic testing, and misclassification for these GCNs is very low. A GCN of 3 seldom has to be examined, and therefore, the high ambiguity at a GCN of 3 is acceptable, as is the low level of misclassification. Furthermore, the analysis of CAH mutations has to be performed in the genetic testing of CAH, and this should correspond to GCNs, indicating the possible misclassification at a GCN of 3 [47]. Overall, therefore, we suggest that, the target-singleplex CYP21A2 qPCR assay fits for the purpose of the genetic testing of CAH.
Supporting information
Acknowledgments
We are indebted to Mark Eyre for English proofreading. We thank Prof. Barna Vasarhelyi for ensuring an inspiring research environment. Otto Darvasi passed away before the submission of the final version of this manuscript. Marton Doleschall accepts responsibility for the integrity and validity of the data collected and analyzed.
Data Availability
All relevant data are within the manuscript and its Supporting Information files. The minimal data set will be also available in Zenodo database (DOI: 10.5281/zenodo.6780358) after the publication of the current paper.
Funding Statement
The current research was supported by Semmelweis Science and Innovation Fund to MD (STIA-KF-17) and Hungarian Scientific Research Fund to AP (K125231). MD was supported by Janos Bolyai Research Scholarship from the Hungarian Academy of Sciences, and the UNKP-19-4 New National Excellence Program of the Ministry for Innovation and Technology. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Abbott MA, Poiesz BJ, Byrne BC, Kwok S, Sninsky JJ, et al. (1988) Enzymatic gene amplification: qualitative and quantitative methods for detecting proviral DNA amplified in vitro. J Infect Dis 158: 1158–1169. doi: 10.1093/infdis/158.6.1158 [DOI] [PubMed] [Google Scholar]
- 2.VanGuilder HD, Vrana KE, Freeman WM (2008) Twenty-five years of quantitative PCR for gene expression analysis. Biotechniques 44: 619–626. doi: 10.2144/000112776 [DOI] [PubMed] [Google Scholar]
- 3.Gelmini S, Orlando C, Sestini R, Vona G, Pinzani P, et al. (1997) Quantitative polymerase chain reaction-based homogeneous assay with fluorogenic probes to measure c-erbB-2 oncogene amplification. Clin Chem 43: 752–758. [PubMed] [Google Scholar]
- 4.Fanciulli M, Petretto E, Aitman TJ (2010) Gene copy number variation and common human disease. Clin Genet 77: 201–213. doi: 10.1111/j.1399-0004.2009.01342.x [DOI] [PubMed] [Google Scholar]
- 5.Yao Y, Nellaker C, Karlsson H (2006) Evaluation of minor groove binding probe and Taqman probe PCR assays: Influence of mismatches and template complexity on quantification. Mol Cell Probes 20: 311–316. doi: 10.1016/j.mcp.2006.03.003 [DOI] [PubMed] [Google Scholar]
- 6.Lupski JR, Belmont JW, Boerwinkle E, Gibbs RA (2011) Clan genomics and the complex architecture of human disease. Cell 147: 32–43. doi: 10.1016/j.cell.2011.09.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Mele M, Ferreira PG, Reverter F, DeLuca DS, Monlong J, et al. (2015) Human genomics. The human transcriptome across tissues and individuals. Science 348: 660–665. doi: 10.1126/science.aaa0355 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Concolino P, Mello E, Toscano V, Ameglio F, Zuppi C, et al. (2009) Multiplex ligation-dependent probe amplification (MLPA) assay for the detection of CYP21A2 gene deletions/duplications in congenital adrenal hyperplasia: first technical report. Clin Chim Acta 402: 164–170. doi: 10.1016/j.cca.2009.01.008 [DOI] [PubMed] [Google Scholar]
- 9.Walker S, Janyakhantikul S, Armour JA (2009) Multiplex Paralogue Ratio Tests for accurate measurement of multiallelic CNVs. Genomics 93: 98–103. doi: 10.1016/j.ygeno.2008.09.004 [DOI] [PubMed] [Google Scholar]
- 10.Banlaki Z, Szabo JA, Szilagyi A, Patocs A, Prohaszka Z, et al. (2013) Intraspecific evolution of human RCCX copy number variation traced by haplotypes of the CYP21A2 gene. Genome Biol Evol 5: 98–112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Handsaker RE, Van Doren V, Berman JR, Genovese G, Kashin S, et al. (2015) Large multiallelic copy number variations in humans. Nat Genet 47: 296–303. doi: 10.1038/ng.3200 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.El-Maouche D, Arlt W, Merke DP (2017) Congenital adrenal hyperplasia. Lancet 390: 2194–2210. doi: 10.1016/S0140-6736(17)31431-9 [DOI] [PubMed] [Google Scholar]
- 13.Wu YL, Savelli SL, Yang Y, Zhou B, Rovin BH, et al. (2007) Sensitive and specific real-time polymerase chain reaction assays to accurately determine copy number variations (CNVs) of human complement C4A, C4B, C4-long, C4-short, and RCCX modules: elucidation of C4 CNVs in 50 consanguineous subjects with defined HLA genotypes. J Immunol 179: 3012–3025. doi: 10.4049/jimmunol.179.5.3012 [DOI] [PubMed] [Google Scholar]
- 14.Baumgartner-Parzer S, Witsch-Baumgartner M, Hoeppner W (2020) EMQN best practice guidelines for molecular genetic testing and reporting of 21-hydroxylase deficiency. Eur J Hum Genet. doi: 10.1038/s41431-020-0653-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Szabo JA, Szilagyi A, Doleschall Z, Patocs A, Farkas H, et al. (2013) Both positive and negative selection pressures contribute to the polymorphism pattern of the duplicated human CYP21A2 gene. PLoS One 8: e81977. doi: 10.1371/journal.pone.0081977 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Szilagyi A, Blasko B, Szilassy D, Fust G, Sasvari-Szekely M, et al. (2006) Real-time PCR quantification of human complement C4A and C4B genes. BMC Genet 7: 1. doi: 10.1186/1471-2156-7-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Parajes S, Quinterio C, Dominguez F, Loidi L (2007) A simple and robust quantitative PCR assay to determine CYP21A2 gene dose in the diagnosis of 21-hydroxylase deficiency. Clin Chem 53: 1577–1584. doi: 10.1373/clinchem.2007.087361 [DOI] [PubMed] [Google Scholar]
- 18.Bustin SA, Benes V, Garson JA, Hellemans J, Huggett J, et al. (2009) The MIQE guidelines: minimum information for publication of quantitative real-time PCR experiments. Clin Chem 55: 611–622. doi: 10.1373/clinchem.2008.112797 [DOI] [PubMed] [Google Scholar]
- 19.Bustin S, Nolan T (2017) Talking the talk, but not walking the walk: RT-qPCR as a paradigm for the lack of reproducibility in molecular research. Eur J Clin Invest 47: 756–774. doi: 10.1111/eci.12801 [DOI] [PubMed] [Google Scholar]
- 20.Theodorsson E (2012) Validation and verification of measurement methods in clinical chemistry. Bioanalysis 4: 305–320. doi: 10.4155/bio.11.311 [DOI] [PubMed] [Google Scholar]
- 21.The International HapMap Project (2003) The International HapMap Project. Nature 426: 789–796. doi: 10.1038/nature02168 [DOI] [PubMed] [Google Scholar]
- 22.Ye J, Coulouris G, Zaretskaya I, Cutcutache I, Rozen S, et al. (2012) Primer-BLAST: a tool to design target-specific primers for polymerase chain reaction. BMC Bioinformatics 13: 134. doi: 10.1186/1471-2105-13-134 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Markham NR, Zuker M (2008) UNAFold: software for nucleic acid folding and hybridization. Methods Mol Biol 453: 3–31. doi: 10.1007/978-1-60327-429-6_1 [DOI] [PubMed] [Google Scholar]
- 24.Hubaux A, Vos G (1970) Decision and detection limits for linear calibration curves. Anal Chem 42: 849–855. [Google Scholar]
- 25.R Core Team (2019) R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. [Google Scholar]
- 26.Bates D, Machler M, Bolker BM, Walker SC (2015) Fitting Linear Mixed-Effects Models Using lme4. Journal of Statistical Software 67: 1–48. [Google Scholar]
- 27.Nakayama Y, Yamaguchi H, Einaga N, Esumi M (2016) Pitfalls of DNA Quantification Using DNA-Binding Fluorescent Dyes and Suggested Solutions. PLoS One 11: e0150528. doi: 10.1371/journal.pone.0150528 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Lee LG, Connell CR, Bloch W (1993) Allelic discrimination by nick-translation PCR with fluorogenic probes. Nucleic Acids Res 21: 3761–3766. doi: 10.1093/nar/21.16.3761 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Leung EKY, Agolini E, Pei X, Melis R, McMillin GA, et al. (2017) Validation of an Extensive CYP2D6 Assay Panel Based on Invader and TaqMan Copy Number Assays. Journal of Applied Laboratory Medicine 1: 471–482. doi: 10.1373/jalm.2016.021923 [DOI] [PubMed] [Google Scholar]
- 30.Fernando MM, Boteva L, Morris DL, Zhou B, Wu YL, et al. (2010) Assessment of complement C4 gene copy number using the paralog ratio test. Hum Mutat 31: 866–874. doi: 10.1002/humu.21259 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Conrad DF, Pinto D, Redon R, Feuk L, Gokcumen O, et al. (2010) Origins and functional impact of copy number variation in the human genome. Nature 464: 704–712. doi: 10.1038/nature08516 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Horton R, Wilming L, Rand V, Lovering RC, Bruford EA, et al. (2004) Gene map of the extended human MHC. Nat Rev Genet 5: 889–899. doi: 10.1038/nrg1489 [DOI] [PubMed] [Google Scholar]
- 33.Norman PJ, Norberg SJ, Guethlein LA, Nemat-Gorgani N, Royce T, et al. (2017) Sequences of 95 human MHC haplotypes reveal extreme coding variation in genes other than highly polymorphic HLA class I and II. Genome Res 27: 813–823. doi: 10.1101/gr.213538.116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Perne A, Zhang X, Lehmann L, Groth M, Stuber F, et al. (2009) Comparison of multiplex ligation-dependent probe amplification and real-time PCR accuracy for gene copy number quantification using the beta-defensin locus. Biotechniques 47: 1023–1028. doi: 10.2144/000113300 [DOI] [PubMed] [Google Scholar]
- 35.Bubner B, Gase K, Baldwin IT (2004) Two-fold differences are the detection limit for determining transgene copy numbers in plants by real-time PCR. BMC Biotechnol 4: 14. doi: 10.1186/1472-6750-4-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Weaver S, Dube S, Mir A, Qin J, Sun G, et al. (2010) Taking qPCR to a higher level: Analysis of CNV reveals the power of high throughput qPCR to enhance quantitative resolution. Methods 50: 271–276. doi: 10.1016/j.ymeth.2010.01.003 [DOI] [PubMed] [Google Scholar]
- 37.Jiang W, Johnson C, Simecek N, Lopez-Alvarez MR, Di D, et al. (2016) qKAT: a high-throughput qPCR method for KIR gene copy number and haplotype determination. Genome Med 8: 99. doi: 10.1186/s13073-016-0358-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Hollox EJ, Detering JC, Dehnugara T (2009) An integrated approach for measuring copy number variation at the FCGR3 (CD16) locus. Hum Mutat 30: 477–484. doi: 10.1002/humu.20911 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Whale AS, Huggett JF, Cowen S, Speirs V, Shaw J, et al. (2012) Comparison of microfluidic digital PCR and conventional quantitative PCR for measuring copy number variation. Nucleic Acids Res 40: e82. doi: 10.1093/nar/gks203 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Cantsilieris S, Western PS, Baird PN, White SJ (2014) Technical considerations for genotyping multi-allelic copy number variation (CNV), in regions of segmental duplication. BMC Genomics 15: 329. doi: 10.1186/1471-2164-15-329 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Fode P, Jespersgaard C, Hardwick RJ, Bogle H, Theisen M, et al. (2011) Determination of beta-defensin genomic copy number in different populations: a comparison of three methods. PLoS One 6: e16768. doi: 10.1371/journal.pone.0016768 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Chen W, Xu Z, Sullivan A, Finkielstain GP, Van Ryzin C, et al. (2012) Junction site analysis of chimeric CYP21A1P/CYP21A2 genes in 21-hydroxylase deficiency. Clin Chem 58: 421–430. doi: 10.1373/clinchem.2011.174037 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Chung EK, Yang Y, Rennebohm RM, Lokki ML, Higgins GC, et al. (2002) Genetic sophistication of human complement components C4A and C4B and RP-C4-CYP21-TNX (RCCX) modules in the major histocompatibility complex. Am J Hum Genet 71: 823–837. doi: 10.1086/342777 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Cantsilieris S, Baird PN, White SJ (2013) Molecular methods for genotyping complex copy number polymorphisms. Genomics 101: 86–93. doi: 10.1016/j.ygeno.2012.10.004 [DOI] [PubMed] [Google Scholar]
- 45.Jaimes-Bernal CP, Trujillo M, Marquez FJ, Caruz A (2020) Complement C4 Gene Copy Number Variation Genotyping by High Resolution Melting PCR. Int J Mol Sci 21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Sekar A, Bialas AR, de Rivera H, Davis A, Hammond TR, et al. (2016) Schizophrenia risk from complex variation of complement component 4. Nature 530: 177–183. doi: 10.1038/nature16549 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Doleschall M, Luczay A, Koncz K, Hadzsiev K, Erhardt E, et al. (2017) A unique haplotype of RCCX copy number variation: from the clinics of congenital adrenal hyperplasia to evolutionary genetics. Eur J Hum Genet 25: 702–710. doi: 10.1038/ejhg.2017.38 [DOI] [PMC free article] [PubMed] [Google Scholar]