Abstract
Most proviruses persisting in people living with HIV (PWH) on antiretroviral therapy (ART) are defective. However, rarer intact proviruses almost always reinitiate viral rebound if ART stops. Therefore, assessing therapies to prevent viral rebound hinges on specifically quantifying intact proviruses. We evaluated the same samples from 10 male PWH on ART using the two-probe intact proviral DNA assay (IPDA) and near full length (nfl) Q4PCR. Both assays admitted similar ratios of intact to total HIV DNA, but IPDA found ~40-fold more intact proviruses. Neither assay suggested defective proviruses decay over 10 years. However, the mean intact half-lives were different: 108 months for IPDA and 65 months for Q4PCR. To reconcile this difference, we modeled additional longitudinal IPDA data and showed that decelerating intact decay could arise from very long-lived intact proviruses and/or misclassified defective proviruses: slowly decaying defective proviruses that are intact in IPDA probe locations (estimated up to 5%, in agreement with sequence library based predictions). The model also demonstrates how misclassification can lead to underestimated efficacy of therapies that exclusively reduce intact proviruses. We conclude that sensitive multi-probe assays combined with specific nfl-verified assays would be optimal to document absolute and changing levels of intact HIV proviruses.
Subject terms: HIV infections, Computational models
Quantifying intact proviruses is key to understanding decreases in HIV reservoirs but results can differ depending on the method. To balance sensitivity and specificity of two assays, the authors use mathematical models and measurements of intact and defective proviruses to assess how misclassification can impact estimates of natural and therapeutic reservoir reduction.
Introduction
Although HIV infection can be suppressed by antiretroviral therapy (ART), latent HIV-1 proviruses persist in the genomes of long-lived CD4+ T cells in people living with HIV1,2. Most latent proviruses are defective and cannot replicate3–7. However, a small population of intact proviruses are responsible for rebound viremia upon ART interruption8–10. Therefore, precise measurements of intact HIV levels are essential to evaluating clinical interventions aimed at altering the dynamics of viral rebound, establishing ART-free HIV remission, and ultimately curing HIV11–15.
Reservoir size and dynamics have been estimated by a variety of different methods, all with their own advantages and disadvantages in terms of speed, throughput, sensitivity, and specificity to rebound-competent sequences. Quantitative viral outgrowth assays (QVOAs) are the gold standard for identification of rebound-competent sequences, because they measure the number of cells that can be reactivated in vitro to produce infectious virus4,16. Multi-probe HIV DNA ddPCR approaches are rapid, scalable, and not biased by in vitro cellular reactivation17–20. These assays designate proviral DNA as intact or defective based on hybridization of PCR-amplified proviral DNA to probes in two to five relatively conserved locations on the HIV-1 genome. For example, the intact proviral DNA assay (IPDA)17 targets the HIV-1 Packaging Signal (Ψ) and the Envelope/Rev Responsive Element (RRE). If both regions are amplified within a single provirus, the sequence is said to be intact, and if only one of the two probes is positive the provirus is determined to be defective17. Q4PCR performs both quantitative PCR with four different DNA probes and near full-length (nfl) sequence analysis19. This approach is far more labor intensive than ddPCR and the relatively inefficient long range amplification admits lower absolute measure of reservoir size compared to ddPCR21. However, nfl sequence verification ensures specificity to identify intact sequences19,22. Other techniques include single cell omics23–26, flow cytometry approaches27,28, enzyme-linked ImmunoSpot29, and reporter cell-based methods30.
Most HIV reservoir studies have considered one decade of ART. QVOAs performed on longitudinal samples from that time frame produced a consensus half-life of 4–5 years for the replication-competent latent reservoir31,32. On samples from similar time scales, IPDA and Q4PCR also found that intact proviruses decay with a 4–5-year half-lives whereas defective proviral half-lives were much longer with estimates ranging from 100 months to no decay17,18,33–35 (akin to past estimates of total HIV DNA decay36,37).
In a recent study of longitudinal samples collected up to 20 years after ART initiation, IPDA showed a deceleration in intact HIV decay after 7 years38. The authors hypothesized that the decelerating decay could arise from shifts in the reservoir toward longer lived HIV-infected CD4 + T cells defined by less-transcriptional activation and/or a more naïve in phenotype. Other recent work shows intact proviruses measured by IPDA may continuously decline but also can decelerate and even increase in some individuals after 10 years of therapy39.
In addition to understanding long-term dynamics, evaluating therapeutic reductions of HIV reservoirs involves measuring small numbers of intact HIV proviruses, and/or very small changes in these numbers. For these contexts, we quantitatively clarify how assays with excellent but imperfect specificity for intact proviruses might behave. We developed a mathematical model inclusive of the possibility that some observed intact sequences contain defects outside probe regions such that they are truly defective. Using this model, we highlight how misclassification of defective proviruses could result in underestimates of reservoir reduction in interventional studies.
Results
Intact and defective HIV proviral DNA using two assays
To directly compare Q4PCR19 and IPDA17 we applied both assays to longitudinal samples obtained from 10 people living with HIV-1 (PWH) on suppressive ART over a period of up to 10 years (Fig. 1A, B, data provided in Supplementary Data 1)19. The samples were obtained at time points ranging from 4.5 to 123 months after initiation of ART with a median of 3 (range = [2–4]) samples per participant. IPDA was performed by Accelevir on the same DNA samples used for Q4PCR. Samples with DNA quality or probe amplification issues from IPDA were excluded from further analysis.
Across all samples, IPDA and Q4PCR reported a median of 228 and 6 intact and 2553 and 87 defective proviral copies per million CD4+ T cells, respectively (Fig. 1B). The difference is consistent with prior reports that Q4PCR under-reports HIV DNA due to sequence loss during nfl amplification22. In particular, paired IPDA estimates of intact levels were a median of 42-fold greater than Q4PCR, with a wide range [0.2–39000] including a few time points for which Q4PCR estimates were higher.
Some associations were observed among and between assays for HIV DNA levels (correlation coefficients for all comparisons are tabulated in Supplementary Fig. 1). Across all participants and time points, levels of intact and defective proviruses were weakly correlated between Q4PCR and IPDA (Spearman p = 0.08 and 0.07 for intact and defective, respectively). Estimates of numbers of intact and defective proviruses were not correlated within each assay, potentially because intact levels change over time whereas defective do not (Spearman p > 0.13 for both). However, at early time points (<12 months on ART) intact reservoir measurements correlated much more strongly between assays (ρ = 0.8, p = 0.004; Fig. 1D).
Ratio of intact to total proviruses agrees between assays at early time points
Because the absolute sizes of both intact and defective HIV levels for IPDA were higher, we tested whether the ratio of intact to total HIV DNA could be a conserved quantity for both assays. We found the median ratio was slightly higher for IPDA, but the distributions across all time points were not significantly different, suggesting the ratio of intact to total may agree across assays (Fig. 1E). However, the ratio of intact to total was significantly correlated for early time points but not for all time points (Fig. 1F).
Estimated reservoir decay using two assays
To assess whether trends for reservoir clearance would agree despite absolute differences in proviral levels, we estimated the decay rate for intact and defective proviruses quantified by each assay. To account for within-individual correlations and different numbers of samples per individual, we used a log-linear mixed effects model. We also accounted for known biphasic decay in proviral levels34 by only modeling measurements taken after 6 months of ART (Fig. 2A). The model provides a population average for initial size and decay rate with 95% confidence intervals on these estimates (Fig. 2B).
The mean estimated reservoir decay rates indicate that defective proviruses (by either assay) tend not decay (Fig. 2B). Intact decay rates were on average below zero, with average half-lives of 65 and 108 months for Q4PCR and IPDA, respectively. (Note, the estimate of Q4PCR intact half-life here differs slightly from a published estimate using a different analysis method and some of the same data19). However, IPDA estimates had more uncertainty such that confidence intervals crossed zero, meaning inclusive of no decay (or increases). Our data at 10 years was sparse, but when we reperformed the estimation excluding data after 6 years of ART, estimates were not qualitatively different.
Detailed examination of prior estimates indicates some intact IPDA half-lives are reported after excluding non-decaying participants (Supplementary Table 1). Therefore, it is not completely clear whether our results here agree with past observations.
Defective sequences can be misclassified by multi-probe assays and increase observations of intact proviruses
Multi-probe assays that quantify intact proviruses inherently have imperfect specificity. We hypothesized that this imperfect specificity could influence observed levels of intact proviruses in natural and therapeutic settings. To study this hypothesis, we envisioned a system with 3 categories of proviruses: intact, defective, and proviruses which are intact in all IPDA probe locations yet have a defect elsewhere. We term this key third group “misclassified defective” proviruses (Fig. 3A). We encoded this hypothesis into a mathematical model by assuming true intact and defective sequences each have a half-life (or decay rates and ). Then, a small fraction of defective sequences can contribute to observed intact sequences as (Fig. 3B).
Given this model, even if we assume true intact sequences continuously, once then the decay rate of observed intact sequences will flatten and converge to the decay rate of misclassified defective proviruses, which decay slowly or not at all. This idea is consistent with the observations that IPDA and Q4PCR measurements diverge with time on ART (Fig. 1). In addition, any therapy that selectively reduces intact proviruses might be underestimated because of the inclusion of misclassified defective proviruses into the observed intact pool. We quantitively explore these phenomena in the next sections.
Misclassified defective sequences might have small and potentially less important defects compared to the general pool of defective proviruses. Therefore, to determine whether the half-life of misclassified defectives differs from the overall pool of defective proviruses, we estimated the half-life of proviruses that were defective by Q4PCR but also amplified by Packaging Signal (Ψ) and the envelope Rev Responsive Element (RRE) probes in Q4PCR. The half-life of these PS+ env/RRE+ defective proviruses, which would have been observed as intact by IPDA, was 180 months and not significantly different from overall defective decay rates estimated from either IPDA or Q4PCR (Supplementary Figure 2). This finding helps to simplify the model by letting .
The model quantitatively reproduces independent IPDA observations from a 20-year cohort and estimates the misclassification fraction
Misclassification might explain some of the differences between decay rates observed for Q4PCR and IPDA in Fig. 2. However, our data is relatively sparse. Therefore, we assessed whether this model could explain a published larger and longer-duration data set in which intact reservoir deceleration was observed38. A successful fit (total R-squared = 0.9) was achieved simultaneously for the experimental levels of observed intact (Fig. 4A) and defective (Fig. 4B) proviruses over time, suggesting this model could reasonably approximate these data.
By fitting this model, we derived average estimates for each of the model parameters in Fig. 3B. We estimated a defective half-life of 320 months and a true intact half-life of 46 months. Both values agree with those using QVOA or nfl sequencing. The estimated misclassification fraction was = 0.05, meaning that, across this population, roughly 5% of defective proviruses may contain defects outside the IPDA probe locations—a result in relatively good agreement with prior in silico sequence analyses17,18. With this misclassification fraction and the average initial sizes in the study, we estimate 27% of observed intact proviruses would be misclassified defectives at the initial time points which is nearly identical to the 30% originally reported for the IPDA17.
Biological deceleration of intact proviral decay is not mutually exclusive with misclassification
Decelerating decay of intact proviruses could reflect the dynamics of one (or more) longer lived or dividing populations of CD4+ T cells. Our model does not rule out this hypothesis. To examine how additional phases of proviral decay influence our results, we modified our model to include a second population of long-lived intact proviruses (, with a half-life of 18.7 years based on the observed long term decay rate of intact proviruses38, Fig. 4C). We assumed this population is a subset of all intact proviruses, varied its initial value at 1/30, 1/7, and 1/3 of the original intact proviruses, and re-estimated the misclassification fraction needed to generate observed intact proviral levels. For instance, assuming 1/3 of intact proviruses have an 18.7 year half-life, the model can re-generate observed intact levels correctly provided the misclassification fraction is accordingly reduced from 5% to 1% (Fig. 4D).
Misclassification results in varying degrees of underestimation for theoretical therapies that specifically reduce intact proviruses
Some proposed HIV cure therapies, such as shock and kill40,41 or targeted gene therapy42, might only affect intact proviruses. If intact proviruses are reduced, but misclassified defective proviruses are not, then observed reductions in intact proviruses could be underestimated. To determine theoretical consequences of misclassification, we simulated therapies that reduced true intact proviruses by 2–100 fold. We varied the initial fraction of intact proviruses from 1/100 to 1/2 and the misclassification fraction from 0.5%–8%. Across these simulations, observed intact proviruses did not drop equivalently to true intact proviruses (Fig. 5A, B). Indeed, the simulated true therapeutic efficacy (fold reduction in true intact) was always underestimated by the simulated observed efficacy (fold reduction in observed intact). The misclassification fraction can influence the relationship between true and observed efficacy (Fig. 5C). For low true efficacy (<10 fold reduction), any misclassification fraction could result in near agreement between true and observed efficacy. For higher true efficacy (100-fold reduction) a 0.5% misclassification could still admit agreement between true and observed, depending on initial intact fraction. However, an 8% misclassification admitted at best a 10-fold observed efficacy given a 100-fold true efficacy. In some instances, efficacy could be severely underestimated. The initial fraction of true intact proviruses, a quantity that ranged across individuals with a median of approximately 1/10 (Fig. 1E), plays a large role in how therapeutic efficacy will be observed (Fig. 5D). Regardless of misclassification fraction, if small numbers of intact proviruses disappear, they are not noticed in comparison to the more common misclassified defective proviruses. For example, if intact proviruses are 1/2 of all proviruses, therapeutic efficacy is nearly correctly observed for low efficacy therapies and any misclassification fraction. Yet if 1/1000 of proviruses are intact, even a 0.5% misclassification rate results in an observed therapeutic efficacy of 2, regardless of its true value.
Balancing sensitivity and specificity in a theoretical analysis of intact provirus assays with additional probe regions
Increasing the number of primer-probe target regions could improve specificity to true intact proviruses and might therefore improve agreement between observed and true intact sequences. Along with nfl sequencing, Q4PCR simultaneously counts proviruses that are intact at 2, 3, and 4 probe regions22,43, giving us a window into how measurements change when more probes are used to determine intactness. Unfortunately, as the number of probes increased the sensitivity decreased from 4037 to 1044 to 98 proviruses that were 2, 3 or 4 probe positive, respectively. We estimated the mean and 95% confidence interval for the ratio of intact to total proviruses from all participants and all time points for each probe set from our data (Fig. 6A). The percentage of intact proviruses increases with increasing numbers of probes, but the total number of proviruses decreases. Therefore, the statistical uncertainty around the intact fraction increases.
Then, using our mathematical model, we simulated observed intact sequences using theoretical assays including 2, 3, or 4 probes assuming each probe has similar conservation levels as the original IPDA primer-probe sets (Methods)43. The theoretical 3 probe assay would decrease misclassification from approximately 5% to 1% and the average (dashed red line Fig. 6B) was projected to have reasonable agreement between observed and true intact provirus levels for approximately 10 years of therapy including near the limit of detection of 10 copies per million CD4+ T cells. The 1.5% rate for the theoretical 3 region assay agreed with an in silico study of more than 1000 sequences and a different published 3 region assay18, supporting the plausibility of the initial model estimate of 5% for IPDA as well as the theoretical probe calculation. The theoretical 4 probe assay could lower misclassification further to 0.1%, which was projected to have reasonable agreement between observed and true levels throughout a 20-year study (dashed purple line Fig. 6B). However, when we add uncertainty based on the Q4PCR data in Fig. 6A, the loss of sensitivity for the 3 and 4 probe assays is substantial. The wide confidence intervals mean that changes in intact proviral DNA could also be hard to reliably detect. Therefore, the increase in sensitivity might not compensate for the gains in specificity.
Discussion
A cure for HIV would benefit millions living with HIV14,15. In addition to incremental progress toward cure, any intervention that decreases HIV reservoir size might improve an individual’s HIV management by reducing HIV rebound probability after ART pauses44–48 and could also potentially impact chronic immune activation49. Many interventions have been tested, but most do not significantly reduce HIV reservoirs50. Going forward, individual interventions that produce small reductions in the intact reservoir might be combined to obtain clinically significant results14,18. Therefore, specific assays are crucial to sensitive detection of reservoir reduction and enabling progress towards HIV cure.
Though simple and sensitive, total HIV DNA does not directly measure the rebound competent reservoir because most proviruses are defective2,4. On the other hand, quantitative viral outgrowth assays (QVOA) are difficult and potentially biased by in vitro stimulation requirements31, but specifically count proviruses that could reinitiate infection. Emerging single-cell multi-omics techniques exquisitely illuminate reservoir cell and proviral DNA properties but are also highly complex and their utility for dynamic quantification is not yet well studied23–25. Based on their commercial availability and simplicity, assays like the intact proviral DNA assay (IPDA) that use two or more PCR probes to define intact HIV sequences have been rapidly adopted as go-to assays for HIV reservoir size in clinical studies17,18,35,38,51. Although more challenging than IPDA, assays combining probes with nfl sequencing (e.g., Q4PCR19,22) are another emerging technique that reproducibly define intact and defective HIV proviruses.
After applying IPDA and Q4PCR to longitudinal clinical samples from 10 people living with HIV on suppressive ART for 10 years, we found IPDA data admitted a mean intact half-life of 108 months, not significantly distinguishable from defective sequences in these participants, whereas Q4PCR data admitted a mean half-life of 65 months, with confidence intervals not overlapping with zero31,32. In addition, IPDA and Q4PCR estimates of intact proviral levels were correlated in observations from the first year of therapy but this correlation disappeared at later time points.
For comparison, in 7 prior studies that used IPDA to measure intact reservoir decay (detailed review in Supplementary Table 1), mean half-lives ranged from 19-84 months31,32. But, throughout these studies, reported means often excluded instances of outlier participants who had long (>100 month) half-lives. Various decay estimation methods were applied, and ART adherence is difficult to confirm but could influence decay. But, evidently, cases of non-decaying intact proviral reservoirs have appeared in the IPDA literature. Further, a relatively large and long-term study (81 participants, some with >12 years on suppressive ART) observed a deceleration in intact decay rate with time on ART38.
We considered whether some of these findings could be reconciled by considering a third category of HIV proviruses: those which are intact at both IPDA probe locations yet defective elsewhere. These proviruses were acknowledged in original IPDA publications, but their implications for reservoir decay measurements have not been explored17,52. Using a mathematical model, we showed that observed deceleration could emerge as true intact sequences decay and what remains in the observed intact pool is slower decaying misclassified defective proviruses. By fitting the model to intact and defective levels, we estimated an average misclassification fraction in this data set was 5%, which generally agrees with estimates derived from libraries of proviral sequences18.
Notably, our work does not rule out the possibility that intact proviral decay slows over time.
It may be that cells containing intact proviruses persist and undergo clonal expansion and thereby contribute to decelerating decay or even increases of intact proviruses in individuals on long-term therapy39. Our simulations including an additional long-lived compartment could also generate observed data, provided the estimated misclassification rate was lowered. Additional prolonged longitudinal studies with more specific assays such as QVOA or nfl sequencing will illuminate novel decay dynamics and the biology of the latent reservoir after many years of ART.
Notwithstanding changes in long-term biological decay, we also showed that misclassification leads to underestimates of efficacy from therapies that specifically target intact proviruses. This phenomenon is particularly important to consider for trial participants having low fractions of intact:total HIV DNA.
Assays adding additional primer-probe target regions could improve specificity, but as usual a balance between specificity and sensitivity emerges. Q4PCR includes sequence information but underestimates absolute levels of intact proviruses relative to IPDA21,43. For now, 2 probes appear to achieve a pragmatic balance, and if next generation multi-probe assays are designed, they should be optimized using proviral libraries to limit misclassification (specificity) and maximize amplification (sensitivity).
In epidemiology, it is well known that diagnostic assays with excellent specificities (>90%) will still create a high number of false positives when applied to a low-prevalence population for a given disease. We suggest that this phenomenon is relevant for applying the IPDA to HIV DNA.
Our modeling is inherently theoretical, and limitations include that the misclassification rate itself is assumed to be constant across individuals, which may not be reasonable given the massive diversity of HIV and the clonality of HIV reservoirs9,10,19,53–57. Although we focused on IPDA overestimation of intact sequences, in some unusual cases IPDA apparently underestimated replication competent sequences because of probe mismatches to intact sequences58.
In conclusion, multi-probe ddPCR assays are a pragmatic middle ground with excellent sensitivity and high but imperfect specificity for quantifying intact proviruses. Going forward, care must be taken so that misclassified defective sequences that ultimately emerge from imperfect specificity do not obscure changes in intact reservoir sizes. When it is crucial to detect changes during interventional studies, assays that include additional probes and/or sequence verification are a natural solution.
Methods
Human participants
This study was approved by the institutional review board of the NIH National Institute of Allergy and Infectious Diseases and registered on ClinicalTrials.gov: NCT00039689. All study participants provided written informed consent, and leukapheresis products were collected in accordance with the protocol. All study participants were of male sex based on self-reporting. 5 participants were White, 3 Black and 2 Unknown/Hispanic-Latino. Median age at last study time-point 42 years (range 27–61). 8 participants initiated antiretroviral therapy after chronic infection, 2 participants initiated antiretroviral therapy after early infection. Study participants were enrolled at the Clinical Center, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD 20892, USA. Characteristics of study participants including sex and gender can be found in Table S1 of Ref. 19.
Q4PCR measurements
Q4PCR was performed as previously described21,39. Briefly, genomic DNA from 1 to 5 million total CD4 T cells was isolated using the Gentra Puregene cell kit (Qiagen) or phenol-chloroform, and the DNA concentration was measured using a Qubit high-sensitivity kit (Thermo Fisher Scientific). Next, an outer PCR (nfl1) was performed on genomic DNA at a single-copy dilution using outer PCR primers BLOuterF (5′-AAATCTCTAGCAGTGGCGCCCGAACAG-3′) and BLOuterR (5′-TGAGGGATCTCTAGTTACCAGAGTC-3′)37. Undiluted 1 µl aliquots of the nfl1 PCR product were subjected to a Q4PCR reaction using a combination of four primer/probe sets that target conserved regions in the HIV-1 genome. Each primer/probe set consists of a forward and reverse primer pair as well as a fluorescently labeled internal hydrolysis probe as previously described:21 PS forward, 5′-TCTCTCGACGCAGGACTC-3′; PS reverse, 5′-TCTAGCCTCCGCTAGTCAAA-3′; PS probe, 5′-/Cy5/TTTGGCGTA/ TAO/CTCACCAGTCGCC-3′/IAbRQSp (Integrated DNA Technologies); env forward, 5′-AGTGGTGCAGAGAGAAAAAAGAGC-3′; env reverse, 5′-GTCTGGCCTGTACCGTCAGC-3′; env probe, 5′-/VIC/CCTTGGGTTCTTGGGA-3′/MGB (Thermo Fisher Scientific); gag forward, 5′-ATGTTTTCAGCATTATCAGAAGGA-3′; gag reverse, 5′- TGCTTGATGTCCCCCCACT-3′; gag probe, 5′-/6-FAM/CCACCCC- AC/ZEN/AAGATTTAAACACCATGCTAA-3′/IABkFQ (Integrated DNA Technologies); and pol forward, 5′-GCACTTTAAATTTTCCCATTAGTCCTA-3′; pol reverse, 5′-CAAATTTCTACTAATGCTTTTATTTTTTC-3′; pol probe, 5′-/NED/AAGCCAGGAATGGA-TGGCC-3′/MGB (Thermo Fisher Scientific).
Each Q4PCR reaction was performed in a 10 µl total reaction volume containing 5 µl TaqMan universal PCR master mix containing Rox (catalog no. 4304437; Applied Biosystems), 1 µl diluted genomic DNA, nuclease-free water, and the following primer and probe concentrations: PS, 675 nM forward and reverse primers with 187.5 nM PS internal probe; env, 90 nM forward and reverse primers with 25 nM env internal probe; gag, 337.5 nM forward and reverse primers with 93.75 nM gag internal probe; and pol, 675 nM forward and reverse primers with 187.5 nM pol internal probe.
qPCR conditions were 94 °C for 10 min, 40 cycles of 94 °C for 15 s, and 60 °C for 60 s. All qPCRs were performed in a 384-well plate format using the Applied Biosystem QuantStudio 6 or 7 Flex real-time PCR system. qPCR data analysis was performed as previously described21 using ThermoFisher Design and Analysis Software 2.4.3.
Generally, samples showing reactivity with two or more of the four qPCR probes were selected for a nested PCR (nfl2). This focused approach detects the subset of defective proviruses that are positive for at least 2 of the 4 oligonucleotide probes which in turn leads to an underestimation of the absolute number of defective proviruses. The nfl2 reaction was performed on undiluted 1 µl aliquots of the nfl1 PCR product. Reactions were performed in a 20 µl reaction volume using Platinum Taq high-fidelity polymerase (Thermo Fisher Scientific) and PCR primers 275 F (5′-ACAGGGACCTGAAAGCGAAAG-3′) and 280 R (5′-CTAGTTACCAGAGTCACACAACAGACG-3′)37 at a concentration of 800 nM. Library preparation and sequencing were performed as previously described21. Sampling time points without recovery of intact sequences were determined at lower limit of detection for statistical analyses. Limit of detection is calculated as half of the proviral frequency assuming 1 intact sequence out of the highest input of sampled cells.
IPDA measurements
Aliquots of genomic DNA samples that were used for Q4PCR measurements were sent to Accelevir Diagnostics who performed IPDA measurements as previously described51. DNA quality was determined by spectrophotometry. DNA samples that did not pass company quality control or showed probe amplification issues were flagged by Accelevir Diagnostics and were excluded from further analysis.
Estimating decay half-lives from longitudinal measurements of population sizes
To estimate decay half-lives, we apply a log-linear mixed effects model implemented using the statsmodels package in Python. Using the data in Supplementary Data 1, we restricted to time points after 6 months of ART to avoid early rapid (biphasic) decays as observed previously34,53. We then estimate the decay rate () and the initial value (y-intercept) as well as confidence intervals on the decay rate for each assay (Q4PCR and IPDA) and each proviral type (intact and defective). We then calculate half-lives using the decay rate , where positive decay rates indicate proviral increases such that the half-life would be formally infinite, and a doubling time would need to be defined instead.
Simulating reservoirs including misclassified defective proviruses
To simulate HIV reservoirs including misclassified defective proviruses, we developed a simple mathematical model in which the dynamics of true intact and defective proviruses each follow exponential decays with initial sizes and decay rates ( and ) drawn for each individual . Thus,
1 |
and similarly, . The probability that a putatively intact sequence is truly defective, that is the chance that the sequence is intact for all probes yet has a defect elsewhere, is modeled with the misclassification rate that is the same for all individuals. Thus, the observed intact sequences in the -th individual follows
2 |
and similarly, .
Fitting the misclassification model to experimental data
We used a least-squares approach and SciPy’s optimize package to fit to both intact and defective HIV proviral levels simultaneously against the misclassification model. The score function to be optimized was the root-mean-squared expression
3 |
where and are log10 transformed experimental observations from each individual. Fitting on the log-scale implies that noise is log-normally distributed and means score accuracy is balanced across the two data types despite their large differences on the absolute scale. Five parameters were estimated: the initial levels of intact and defective proviruses, the two half-lives, and the misclassification rate. Identifiability was confirmed by an additional analysis using population nonlinear mixed effects modeling in Monolix59 which admitted similar estimates for parameters as well as a lack of correlations among population parameters and residual squared errors below 50%.
Simulating reservoirs including additional decay phase for intact proviruses
Based on the possibility that intact proviruses do not decay indefinitely with the same rate, we re-simulated observed intact proviral levels using a model with multiple phases of true intact proviral decay. In this case
4 |
Where I2 and θI2 represent the initial size (computed as a fraction of I1) and the decay rate of the second phase, respectively. For correspondence with the initial measurement of the second phase decay, we imputed an 18.7 year half-life for θI2. Observed intact levels are again calculated using Eq. (2) Because of the extra contribution to the true intact pool at later times, the as I2 grows, the misclassification fraction f must be reduced to maintain a match to observed data.
Calculating loss of sensitivity from Q4PCR with multiple probes
Q4PCR determines proviral intactness with both near full length sequencing and 2-4 probes. However, the requirement to bind to more probes results in a loss of sequences. For 2, 3, and 4, probes there were 270:3676, 187:857, and 49:49 intact to defective sequences quantified. Many participants had missing time points and time points with zero intact sequences. Therefore, we estimated a coarse average ratio of intact to total sequences and a 95% binomial confidence interval on this ratio where is the total number of sequences. This interval was then applied to model output to demonstrate how increasing probes raises specificity but at a cost to sensitivity (Fig. 6).
Extrapolating misclassification rates to assays including more ddPCR probes
The probability of a misclassification can be restated as the probability that a defective primer probe at region ensures a completely defective HIV sequence. With this definition, the probability of finding a defective sequence which is intact at both sites of a two-probe assay but defective elsewhere is
5 |
If we assume both probes have similar probabilities, i.e., , then we can input our estimated average misclassification rate 5% and find 78%–that a defective read at each target region is likely to properly call a defective sequence in roughly 3/4 HIV-1 proviruses. Therefore, if it were possible to develop assays with probes having the same level of conservation, the total misclassification rate would be . When we simulate 3 and 4 probe assays, the misclassification fractions are accordingly = 1% and 0.2%, respectively. Using these misclassification fractions, we simulated the observed intact proviruses using Eq. (2), and added confidence intervals calculated from the Q4PCR data.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Supplementary information
Acknowledgements
The authors are extremely grateful to the study participants who devoted their time and bodies to our research and the Rockefeller University Hospital Research support office and nursing staff for help conducting the study. We also thank A Chakraborty for early reading and comments. This work was supported by the National Institutes of Health (K25 AI155224 to D.B.R., K23 A137522 to M.J.P., UM1 AI100663 and R01AI129795 to M.C.N.); the UW/Fred Hutch Center for AIDS Research (P30 AI027757). the Einstein-Rockefeller-CUNY Center for AIDS Research (1P30 AI124414-01A1); and the Robertson Fund. C.G. was supported by the HJH-Foundation, the Robert S. Wennett postdoctoral fellowship, the Shapiro-Silverberg Fund for the Advancement of Translational Research, and the National Center for Advancing Translational Sciences (UL1 TR001866). L.B.C. is supported by Delaney AIDS Research Enterprise (DARE) Collaboratory (UM1AI126611) and REACH: Research Enterprise to Advance a Cure for HIV (UM1AI164565). M.C.N. is a Howard Hughes Medical Institute (HHMI) Investigator, and this article is subject to HHMI’s Open Access to Publications policy.
Author contributions
M.C.N., L.B.C., C.G., and D.B.R. conceived the project. D.B.R. performed all modeling, analysis, and generated all figures. T.Y.O. and M.J.P. provided data. M.C.N., S.G.D., and J.T.S. supervised the project. D.B.R. and M.C.N. wrote the manuscript and all authors revised.
Peer review
Peer review information
Nature Communications thanks Eugenio Montini and Linos Vandekerckhove for their contribution to the peer review of this work. A peer review file is available.
Data availability
All data used to generate all figures is available in Supplementary Data 1. Viral sequences have been deposited in GenBank with the accession codes OQ948507-OQ953743.
Code availability
All custom plotting scripts and code used for modeling and to generate all figures are freely available at https://github.com/dbrvs/IPDAmodel/releases/tag/v1.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-023-39837-z.
References
- 1.Finzi D, et al. Latent infection of CD4+ T cells provides a mechanism for lifelong persistence of HIV-1, even in patients on effective combination therapy. Nat. Med. 1999;5:512–517. doi: 10.1038/8394. [DOI] [PubMed] [Google Scholar]
- 2.Chun TW, et al. Quantification of latent tissue reservoirs and total body viral load in HIV-1 infection. Nature. 1997;387:183–188. doi: 10.1038/387183a0. [DOI] [PubMed] [Google Scholar]
- 3.Piantadosi A, Humes D, Chohan B, McClelland RS, Overbaugh J. Analysis of the percentage of human immunodeficiency virus type 1 sequences that are hypermutated and markers of disease progression in a longitudinal cohort, including one individual with a partially defective Vif. J. Virol. 2009;83:7805–7814. doi: 10.1128/JVI.00280-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Ho Y-C, et al. Replication-competent noninduced proviruses in the latent reservoir increase barrier to HIV-1 cure. Cell. 2013;155:540–551. doi: 10.1016/j.cell.2013.09.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Bruner KM, et al. Defective proviruses rapidly accumulate during acute HIV-1 infection. Nat. Med. 2016;22:1043–1049. doi: 10.1038/nm.4156. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Pollack RA, et al. Defective HIV-1 proviruses are expressed and can be recognized by cytotoxic T lymphocytes, which shape the proviral landscape. Cell Host Microbe. 2017;21:494–506.e4. doi: 10.1016/j.chom.2017.03.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Imamichi H, et al. Defective HIV-1 proviruses produce viral proteins. Proc. Natl Acad. Sci. USA. 2020;5:201917876–3710. doi: 10.1073/pnas.1917876117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Joos B, et al. HIV rebounds from latently infected cells, rather than from continuing low-level replication. Proc. Natl Acad. Sci. USA. 2008;105:16725–16730. doi: 10.1073/pnas.0804192105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Vibholm LK, et al. Characterization of intact proviruses in blood and lymph node from HIV-infected individuals undergoing analytical treatment interruption. J. Virol. 2019;93:183–11. doi: 10.1128/JVI.01920-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Cole B, et al. Extensive characterization of HIV-1 reservoirs reveals links to plasma viremia before and during analytical treatment interruption. Cell Rep. 2022;39:110739. doi: 10.1016/j.celrep.2022.110739. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Cohn LB, Chomont N, Deeks SG. The biology of the HIV-1 latent reservoir and implications for cure strategies. Cell Host Microbe. 2020;27:519–530. doi: 10.1016/j.chom.2020.03.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Davenport MP, et al. Functional cure of HIV: the scale of the challenge. Nat. Rev. Immunol. 2019;19:45–54. doi: 10.1038/s41577-018-0085-4. [DOI] [PubMed] [Google Scholar]
- 13.Peluso MJ, Deeks SG, McCune JM. HIV “cure”€: A shot in the arm? EBioMedicine. 2019;42:3–5. doi: 10.1016/j.ebiom.2019.04.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Richman DD, et al. The challenge of finding a cure for HIV infection. Science. 2009;323:1304–1307. doi: 10.1126/science.1165706. [DOI] [PubMed] [Google Scholar]
- 15.Ndung’u T, McCune JM, Deeks SG. Why and where an HIV cure is needed and how it might be achieved. Nature. 2019;576:1–9. doi: 10.1038/s41586-019-1841-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Dhummakupt A, et al. Differences in inducibility of the latent HIV reservoir in perinatal and adult infection. JCI Insight. 2020;5:e134105. doi: 10.1172/jci.insight.134105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Bruner KM, et al. A quantitative approach for measuring the reservoir of latent HIV-1 proviruses. Nature. 2019;566:1–19. doi: 10.1038/s41586-019-0898-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Levy CN, et al. A highly multiplexed droplet digital PCR assay to measure the intact HIV-1 proviral reservoir. Cell Rep. Med. 2021;2:100243. doi: 10.1016/j.xcrm.2021.100243. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Cho A, et al. Longitudinal clonal dynamics of HIV-1 latent reservoirs measured by combination quadruplex polymerase chain reaction and sequencing. Proc. Natl Acad. Sci. USA. 2022;119:e2117630119. doi: 10.1073/pnas.2117630119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Cassidy NAJ, et al. HIV reservoir quantification using cross-subtype multiplex ddPCR. Iscience. 2022;25:103615. doi: 10.1016/j.isci.2021.103615. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.White JA, et al. Measuring the latent reservoir for HIV-1: Quantification bias in near full-length genome sequencing methods. PLoS Pathog. 2022;18:e1010845. doi: 10.1371/journal.ppat.1010845. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Gaebler C, et al. Sequence evaluation and comparative analysis of novel assays for intact proviral HIV-1 DNA. J. Virol. 2021;95:e01986–20. doi: 10.1128/JVI.01986-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Lian X, et al. Signatures of immune selection in intact and defective proviruses distinguish HIV-1 elite controllers. Sci. Transl. Med. 2021;13:eabl4097. doi: 10.1126/scitranslmed.abl4097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Einkauf KB, et al. Parallel analysis of transcription, integration, and sequence of single HIV-1 proviruses. Cell. 2022;185:266–282.e15. doi: 10.1016/j.cell.2021.12.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Collora JA, et al. Single-cell multiomics reveals persistence of HIV-1 in expanded cytotoxic T cell clones. Immunity. 2022;55:1013–1031.e7. doi: 10.1016/j.immuni.2022.03.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Sun, W. et al. Phenotypic signatures of immune selection in HIV-1 reservoir cells. Nature 1–9 10.1038/s41586-022-05538-8 (2023). [DOI] [PMC free article] [PubMed]
- 27.Pardons M, et al. Single-cell characterization and quantification of translation-competent viral reservoirs in treated and untreated HIV infection. PLoS Pathog. 2019;15:e1007619. doi: 10.1371/journal.ppat.1007619. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Cole, B. et al. In-depth single-cell analysis of translation- competent HIV-1 reservoirs identifies cellular sources of plasma viremia. Nat. Commun. 1–13 10.1038/s41467-021-24080-1 (2021). [DOI] [PMC free article] [PubMed]
- 29.Puertas MC, et al. VIP-SPOT: an innovative assay to quantify the productive HIV-1 reservoir in the monitoring of cure strategies. mBio. 2021;12:e0056021. doi: 10.1128/mBio.00560-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Sanyal A, et al. Novel assay reveals a large, inducible, replication-competent HIV-1 reservoir in resting CD4+ T cells. Nat. Med. 2017;23:885–889. doi: 10.1038/nm.4347. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Siliciano JD, et al. Long-term follow-up studies confirm the stability of the latent reservoir for HIV-1 in resting CD4+ T cells. Nat. Med. 2003;9:727–728. doi: 10.1038/nm880. [DOI] [PubMed] [Google Scholar]
- 32.Crooks AM, et al. Precise quantitation of the latent HIV-1 reservoir: implications for eradication strategies. J. Infect. Dis. 2015;212:1361–1365. doi: 10.1093/infdis/jiv218. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Gandhi RT, et al. Selective decay of Intact HIV-1 Proviral DNA on Antiretroviral Therapy. J. Infect. Dis. 2021;223:225–233. doi: 10.1093/infdis/jiaa532. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.White JA, et al. Complex decay dynamics of HIV virions, intact and defective proviruses, and 2LTR circles following initiation of antiretroviral therapy. Proc. Natl Acad. Sci. 2022;119:e2120326119. doi: 10.1073/pnas.2120326119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Antar AAR, et al. Longitudinal study reveals HIV-1-infected CD4+ T cell dynamics during long-term antiretroviral therapy. J. Clin. Investig. 2020;2:4629. doi: 10.1172/JCI135953. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Golob JL, et al. HIV DNA levels and decay in a cohort of 111 long-term virally suppressed patients. AIDS. 2018;32:2113–2118. doi: 10.1097/QAD.0000000000001948. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Besson GJ, et al. HIV-1 DNA decay dynamics in blood during more than a decade of suppressive antiretroviral therapy. Clin. Infect. Dis. 2014;59:1312–1321. doi: 10.1093/cid/ciu585. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Peluso MJ, et al. Differential decay of intact and defective proviral DNA in HIV-1–infected individuals on suppressive antiretroviral therapy. JCI Insight. 2020;5:23–14. doi: 10.1172/jci.insight.132997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Gandhi RT, et al. Varied patterns of decay of intact HIV-1 proviruses over two decades of art. J. Infect. Dis. 2023;227:1376–1380. doi: 10.1093/infdis/jiad039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Nixon CC, et al. Systemic HIV and SIV latency reversal via non-canonical NF-κB signalling in vivo. Nature. 2020;5:1–28. doi: 10.1038/s41586-020-1951-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Grossman Z, et al. ‘Rinse and replace’: boosting T cell turnover to reduce HIV-1 reservoirs. Trends Immunol. 2020;41:466–480. doi: 10.1016/j.it.2020.04.003. [DOI] [PubMed] [Google Scholar]
- 42.Roychoudhury P, et al. Viral diversity is an obligate consideration in CRISPR/Cas9 designs for targeting the HIV reservoir. BMC Biol. 2018;16:75. doi: 10.1186/s12915-018-0544-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Gaebler C, et al. Combination of quadruplex qPCR and next-generation sequencing for qualitative and quantitative analysis of the HIV-1 latent reservoir. J. Exp. Med. 2019;216:2253–2264. doi: 10.1084/jem.20190896. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Shelton EM, Reeves DB, Ignacio RAB. Initiation of antiretroviral therapy during primary hiv infection: effects on the latent HIV reservoir, including on analytic treatment interruptions. AIDS Rev. 2020;23:28–39. doi: 10.24875/AIDSRev.20000001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Gaebler, C. et al. Prolonged viral suppression with anti-HIV-1 antibody therapy. Nature 1–7 10.1038/s41586-022-04597-1 (2022). [DOI] [PMC free article] [PubMed]
- 46.Li JZ, et al. The size of the expressed HIV reservoir predicts timing of viral rebound after treatment interruption. AIDS. 2016;30:343–353. doi: 10.1097/QAD.0000000000000953. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Conway JM, Meily P, Li JZ, Perelson AS. Unified model of short- and long-term HIV viral rebound for clinical trial planning. J. R. Soc. Interface. 2021;18:20201015. doi: 10.1098/rsif.2020.1015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Hill AL, Rosenbloom DIS, Fu F, Nowak MA, Siliciano RF. Predicting the outcomes of treatment to eradicate the latent reservoir for HIV-1. Proc. Natl Acad. Sci. USA. 2014;111:13475–13480. doi: 10.1073/pnas.1406663111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Turner CT, et al. Persistent T cell repertoire perturbation and T cell activation in HIV after long term treatment. Front. Immunol. 2021;12:634489. doi: 10.3389/fimmu.2021.634489. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Margolis DM, et al. Curing HIV: seeking to target and clear persistent infection. Cell. 2020;181:1–18. doi: 10.1016/j.cell.2020.03.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Simonetti FR, et al. Intact proviral DNA assay analysis of large cohorts of people with HIV provides a benchmark for the frequency and composition of persistent proviral DNA. Proc. Natl Acad. Sci. 2020;117:18692–18700. doi: 10.1073/pnas.2006816117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Falcinelli SD, et al. Longitudinal dynamics of intact HIV proviral DNA and outgrowth virus frequencies in a cohort of individuals receiving antiretroviral therapy. J. Infect. Dis. 2020;224:92–100. doi: 10.1093/infdis/jiaa718. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Reeves DB, et al. A majority of HIV persistence during antiretroviral therapy is due to infected cell proliferation. Nat. Commun. 2018;9:49–16. doi: 10.1038/s41467-018-06843-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Cohn, L. B. et al. Clonal CD4+ T cells in the HIV-1 latent reservoir display a distinct gene profile upon reactivation. Nat. Med. 1–12 10.1038/s41591-018-0017-7 (2018). [DOI] [PMC free article] [PubMed]
- 55.Wang Z, et al. Expanded cellular clones carrying replication-competent HIV-1 persist, wax, and wane. Proc. Natl Acad. Sci. 2018;115:E2575–E2584. doi: 10.1073/pnas.1720665115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Cohn LB, et al. HIV-1 integration landscape during latent and active infection. Cell. 2015;160:420–432. doi: 10.1016/j.cell.2015.01.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Maldarelli F, et al. HIV latency. Specific HIV integration sites are linked to clonal expansion and persistence of infected cells. Science. 2014;345:179–183. doi: 10.1126/science.1254194. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Kinloch NN, et al. HIV-1 diversity considerations in the application of the Intact Proviral DNA Assay (IPDA) Nat. Commun. 2021;12:165. doi: 10.1038/s41467-020-20442-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Kuhn E, Lavielle M. Maximum likelihood estimation in nonlinear mixed effects models. Computational Stat. Data Anal. 2005;49:1020–1038. doi: 10.1016/j.csda.2004.07.002. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All data used to generate all figures is available in Supplementary Data 1. Viral sequences have been deposited in GenBank with the accession codes OQ948507-OQ953743.
All custom plotting scripts and code used for modeling and to generate all figures are freely available at https://github.com/dbrvs/IPDAmodel/releases/tag/v1.