Summary
HIV-1-infected cells that persist despite antiretroviral therapy (ART) are frequently considered “transcriptionally silent,” but active viral gene expression may occur in some cells, challenging the concept of viral latency. Applying an assay for profiling the transcriptional activity and the chromosomal locations of individual proviruses, we describe a global genomic and epigenetic map of transcriptionally active and silent proviral species and evaluate their longitudinal evolution in persons receiving suppressive ART. Using genome-wide epigenetic reference data, we show that proviral transcriptional activity is associated with activating epigenetic chromatin features in linear proximity of integration sites and in their inter- and intrachromosomal contact regions. Transcriptionally active proviruses were actively selected against during prolonged ART; however, this pattern was violated by large clones of virally infected cells that may outcompete negative selection forces through elevated intrinsic proliferative activity. Our results suggest that transcriptionally active proviruses are dynamically evolving under selection pressure by host factors.
Keywords: HIV reservoir, proviruses, chromosomal integration site, HIV RNA transcription, antiretroviral treatment, epigenetics
Graphical abstract
Highlights
-
•
A multidimensional assay for HIV-1 reservoir cell profiling is presented (PRIP-seq)
-
•
Transcriptionally active HIV-1 proviruses are actively selected against during ART
-
•
Large transcriptionally active proviral clones resist negative host selection forces
-
•
Epigenetic signals in linear and 3D chromatin contacts influence HIV-1 transcription
PRIP-seq is a multidimensional single-cell assay that simultaneously captures the proviral sequence, the corresponding chromosomal integration site, and the expression of HIV-1 RNA in single virally infected cells and allows for the global mapping of transcriptionally active and silent proviruses in patients receiving suppressive antiretroviral therapy.
Introduction
Continuously ongoing, high-level viral replication represents the hallmark of untreated HIV-1 infection. Antiretroviral therapy (ART) inhibits active viral replication, but infected cells harboring chromosomally integrated proviruses persist life long and can drive rebound viremia after treatment interruption, necessitating indefinite ART (Chun et al., 1997; Finzi et al., 1997; Wong et al., 1997). Such residual HIV-1 proviruses have traditionally been regarded as “transcriptionally silent,” but proviral gene expression is not affected by currently available antiretroviral agents and might be ongoing in some cells (Pollack et al., 2017; Wiegand et al., 2017; Yukl et al., 2018). Viral gene expression may represent a key and possibly the only transcriptional feature that distinguishes HIV-1-infected cells from uninfected counterparts; moreover, proviral transcriptional activity can selectively expose HIV-1-infected cells to recognition by adaptive immune responses and increase cell-intrinsic viral cytopathic effects. For these reasons, proviral gene expression might influence the persistence, longevity, and fate of virally infected cells during ongoing suppressive ART.
Footprints of host immune selection mechanisms can be readily detected in untreated people living with HIV-1 (PLHIV) through the emergence of escape mutations in cytotoxic T cell epitopes and in antibody contact regions. However, after initiation of ART, viral mutational escape is effectively blocked, reducing the probability that selection mechanisms can be inferred from viral sequence variations (Antar et al., 2020); instead, selection forces influencing HIV-1-infected cells during suppressive ART may become visible using assays that capture the transcriptional activity and/or the chromosomal location of proviruses. Until recently, HIV-1-infected cells were typically evaluated by PCR assays designed to amplify small segments of HIV-1 DNA, an approach that did not provide information beyond the simple presence or absence of the amplified proviral region (Bruner et al., 2019). Subsequent technical advances permitted the analysis of individual proviral sequences through single-template, near full-length HIV-1 sequencing; in combination with bioinformatic tools designed to discriminate between genome-intact and defective proviruses, this method allowed for the profiling of proviral landscapes at single-genome resolution, frequently revealing clusters of sequence-identical proviral species derived from clonal proliferation of infected cells (Hiener et al., 2017; Ho et al., 2013; Lee et al., 2017; Lorenzi et al., 2016). When preceded by whole-genome amplification, single-genome near full-length proviral sequencing can be performed in conjunction with corresponding chromosomal integration site (IS) analysis (Einkauf et al., 2019; Huang et al., 2021; Patro et al., 2019). This technique, referred to as matched integration site and proviral sequencing (MIP-seq), recently demonstrated a highly disproportionate accumulation of genome-intact proviruses from elite controllers in heterochromatin regions, likely reflecting immune-mediated selection mechanisms that preferentially eliminate proviruses integrated in chromosomal regions more permissive to viral gene expression (Jiang et al., 2020). These studies indirectly supported the assumption that proviral transcriptional activity may increase the vulnerability of HIV-1-infected cells to host immune activity but failed to provide a direct assessment of the transcriptional behavior of individual proviruses. A simultaneous ex vivo analysis of the transcriptional activity and chromosomal location of proviruses in participant-derived cells would greatly facilitate the understanding of mechanisms that govern proviral gene expression, influence proviral susceptibility to host immune recognition, and determine immune selection of HIV-1-infected cells during ART.
In this study, we developed an assay for the high-resolution analysis of individual HIV-1-infected cells, designed to simultaneously capture the transcriptional activity, the sequence, and the chromosomal IS of single HIV-1 proviruses. In combination with genome-wide assessments of epigenetic chromatin features from reference data of primary CD4 T cells, this analysis enabled us to measure the frequency of transcriptionally active proviruses, annotate their corresponding chromosomal locations with surrounding epigenetic chromatin features, and profile their longitudinal evolution during continuous ART.
Results
Simultaneous analysis of HIV-1 RNA, HIV-1 DNA, and proviral chromosomal integration sites from individual proviral species
We adapted a previously described protocol for simultaneous genome and transcriptome analysis (G&T-seq) (Macaulay et al., 2015) to develop an assay designed for the parallel interrogation of HIV-1 RNA, the HIV-1 chromosomal IS, and the corresponding proviral sequence of individual HIV-1-infected cells, here termed parallel HIV-1 RNA, integration site, and proviral sequencing (PRIP-seq) (Figures 1A, S1A, and S1B). For this purpose, peripheral blood mononuclear cells (PBMC) purified from persons living with HIV-1 were diluted to single HIV-1-infected cells based on Poisson distribution statistics, lysed, and exposed to biotin-labeled primers designed to anneal to HIV-1 RNA. Subsequently, viral RNA transcripts were physically separated using ultrasensitive immunomagnetic enrichment technologies specifically adjusted for the isolation of low-abundance nucleic acids. Viral RNA was then reverse transcribed, followed by cDNA amplification using a Smart-seq2 protocol (Picelli et al., 2014). Next, viral cDNA copies were quantified using droplet digital PCR with primers annealing to the HIV-1 long LTR region and to more elongated HIV-1 RNA transcripts, including those containing HIV-1 pol, nef, tat-rev, and poly-A sequences (Yukl et al., 2018). In parallel, corresponding genomic DNA from each cell lysate was subjected to phi29-catalyzed multiple displacement amplification (MDA), enabling subsequent simultaneous analysis of near full-length proviral sequences and their respective chromosomal IS using a previously described protocol (Einkauf et al., 2019). Briefly, MDA products were split and separately subjected to near full-length HIV-1 amplification as well as amplification of 3′-LTR viral/host junctions by integration site loop amplification (ISLA) assays (Wagner et al., 2014); resulting amplification products were sequenced on the Illumina MiSeq platform. Technical evaluations of this assay platform with defined copy numbers of HIV-1 RNA transcripts spiked into background populations of HIV-1-uninfected cells revealed that as few as 1–5 viral RNA copies could be detected, and there was a 50% probability of detecting 10 HIV-1 RNA copies. Moreover, there was a strong correlation between input HIV-1 RNA copy numbers and the postamplification cDNA copy numbers, indicating that postamplification viral cDNA copies provide a quantitative estimate of the original number of viral RNA molecules present in HIV-1-infected cells (Figures S1C and S1D). Our protocol also reliably excluded contamination of viral RNA with genomic HIV-1 DNA (Figure S1E).
We applied this complex experimental workflow to PBMC samples that were longitudinally collected from 5 ART-treated PLHIV (P1–P3, P5, P6); in one study participant (P4), a single cross-sectional PBMC sample collected after long-term ART was analyzed (Figure S2). Collectively, we analyzed a total of 1,270 single HIV-1 proviruses from these study participants, of which 147 encoded genome-intact HIV-1; the remaining proviruses displayed various structural sequence defects precluding active viral replication (Figure 1B; Table S1). The chromosomal IS was successfully identified in a total of 701 of these proviral sequences; failure to determine the proviral chromosomal location in the remaining cells was frequently due to the absence of ISLA primer annealing sites in proviruses with large deletions. IS coordinates were most frequently located in introns of highly expressed genes, consistent with previous results (Battivelli et al., 2018; Maldarelli et al., 2014; Schröder et al., 2002); moreover, among proviruses integrated in genes, opposite orientation relative to the host gene was approximately twice as common as concordant/same orientation (Figures 1C and S1F).
Among all 1,270 analyzed proviruses, we noted that HIV-1 RNA transcription varied profoundly: HIV-1 long LTR transcripts were detected in 429 (33.8%) of all 1,270 analyzed HIV-1 proviruses when all sequences were considered, and in 296 (31.1%) of 953 analyzed proviruses when sequences from aviremic time points were selectively evaluated (Figure 1D). Because structural sequence defects in the viral promoter region may compromise the ability of proviruses to express HIV-1 RNA, we used MDA products to amplify and sequence the viral core promoter region in a subset of HIV-1-infected cells; restricting the analysis to proviruses with confirmed intact promoter regions did not markedly affect the relative proportion of transcriptionally active proviruses (Figure 1D). To explore the transcriptional activity of proviruses in the context of epigenetic and architectural features of human genome organization, we annotated all IS coordinates with genome-wide data derived from assays to characterize host gene expression (RNA-seq), chromatin accessibility (ATAC-seq), and three-dimensional (3D) chromosomal contacts (in situ Hi-C). RNA-seq and ATAC-seq data were previously generated from primary total CD4 T cells of three ART-treated PLHIV (Einkauf et al., 2019); corresponding Hi-C data from the same three study participants were generated using an in situ Hi-C protocol described previously (Díaz et al., 2018) (Figure S1G). In addition, we used ChIP-seq data corresponding to activating and inhibitory histone modifications from primary total CD4 T cells included in the ROADMAP database (Kundaje et al., 2015); genome-wide cytosine methylation data from the iMethyl database (Komaki et al., 2018) were also evaluated to determine epigenetic cytosine methylation in defined regions upstream of HIV-1 promoters. Collectively, these data permitted a comprehensive investigation of the sequence and transcriptional activity of individual proviruses, combined with an analysis of their corresponding chromosomal locations relative to a diverse range of transcriptional, genomic, and epigenetic host features.
Reduced transcriptional activity of HIV-1 proviruses in nongenic regions
We initially investigated the transcriptional behavior of HIV-1 proviruses located in nongenic regions, which are typically disfavored for proviral integration because of host factors that bias chromosomal insertion sites toward active transcription units in the human genome (Achuthan et al., 2018; Ciuffi et al., 2005). Among all 701 proviruses for which chromosomal IS were identified, 117 (17%) were located in such nongenic/pseudogenic locations (Figure 1E); intact proviruses were more frequently located in nongenic regions relative to defective proviruses (32/122 intact versus 85/579 defective, p = 0.003). Notably, HIV-1 proviruses integrated in intergenic regions of the human genome displayed significantly lower transcriptional activity (Figures 1E–1G); this was particularly true for the small number (n = 34) of HIV-1 proviruses integrated in repetitive satellite or microsatellite DNA, of which the vast majority (94%) were completely transcriptionally silent, consistent with a state of “deep viral latency” (Jordan et al., 2003; Lewinski et al., 2005). Because intergenic DNA is not generally resistant to transcription and RNA complementary to nongenic and satellite DNA is detectable in many species (Smurova and De Wulf, 2018), we explored mechanisms that might explain the repressed transcriptional activity of proviruses integrated in nongenic DNA. As expected, we noted that RNA-seq reads surrounding nongenic locations of integrated HIV-1 DNA were significantly diminished, relative to genic sites harboring integrated HIV-1 DNA (Figure S3A). A similar finding was made for ChIP-seq reads corresponding to activating histone features (H3K27ac, H3K4me1, and H3K4me3); however, no difference was noted for ChIP-seq reads related to inhibitory histone marks (H3K27me3 and H3K9me3) (Figures S3B and S3C). Compared with those in genes, nongenic proviral IS were also in regions with significantly reduced chromatin accessibility, as determined by ATAC-seq (Figure S3D). Interestingly, an alignment of HIV-1 IS to spatial 3D chromatin organization features, evaluated by genome-wide in situ Hi-C, demonstrated that nongenic chromosomal regions harboring HIV-1 IS displayed significantly increased chromosomal distances to frequently interacting regions (FIREs) (Schmitt et al., 2016) and to topologically associated domains (TADs) (Beagan and Phillips-Cremins, 2020) (Figures S3E and S3F); moreover, the nongenic regions containing integrated HIV-1 DNA had reduced numbers of 3D intra- and interchromosomal chromatin contacts (Figures S3G–S3H). Together, these results demonstrate that proviruses in nongenic locations have significantly weaker viral transcriptional activity, likely because of nonpermissive genomic and epigenetic chromatin features at chromosomal IS.
Epigenetic features associated with transcriptional activity of HIV-1 proviruses
To dissect global features associated with the transcriptional activity of proviruses, we analyzed epigenetic characteristics in DNA regions linearly surrounding IS in genic and nongenic regions (Figures 2A–2D and S4A–S4D). Although positioning of IS relative to the most proximal host transcriptional start sites (TSSs) had no noticeable influence on proviral transcription (Figures S4A and S4B), we observed that ATAC-seq reads and ChIP-seq reads corresponding to activating histone modifications (H3K4me1, H3K4me3, and H3K27ac) in linear proximity of HIV-1 DNA IS were significantly higher for transcriptionally active proviruses (Figures 2B, 2C, and S4C); this was true when all proviruses from all time points were considered and when proviruses in genic locations or from aviremic time points were selectively analyzed. Proviral transcriptional silence, on the other hand, was associated with hypermethylated cytosine residues in genomic DNA upstream of the 5′-LTR junction of proviral IS (Figure 2D); inhibitory H3K27me3-specific ChIP-seq marks appeared unrelated to proviral transcriptional behavior (Figure S4D). In a subsequent analysis, we investigated proviral transcriptional activity relative to genomic and epigenetic features in intra- and interchromosomal 3D contact regions, determined by in situ Hi-C (Figures 2E–2I). The total number of significant 3D contacts tended to be higher in regions harboring IS of transcriptionally active proviruses, and their chromosomal distances to FIREs were reduced, suggesting that proviral gene expression is facilitated by integration into more interactive chromosomal regions (Figures 2F, 2G, S4E, and S4F). Moreover, there were trends for higher levels of activating histone-specific ChIP-seq reads (Figures 2H and S4H) and ATAC/RNA-seq reads (Figures 2I and S4G) in 3D contact regions of IS of transcriptionally active proviruses. Hypothesizing that combinations of chromatin features in linear, intrachromosomal, and interchromosomal contact regions of chromosomal IS may influence proviral transcriptional behavior, we calculated the sum of reads corresponding to epigenetic features in all three of these chromatin compartments for each individual proviral IS; this composite marker may better approximate the totality of epigenetic signals influencing a given provirus in the 3D spatial configuration of the genome. Collectively, these analyses supported the assumption that combined epigenetic features in linear and 3D contact regions of IS can jointly regulate the transcriptional activity of HIV-1 DNA (Figures 2J, 2K, and S4I). We also observed some evidence for privileged interchromosomal interactions between genomic positions of transcriptionally active proviruses, possibly reflecting the localization of transcriptionally active proviral species at highly permissive and interactive chromatin regions on the outer surface of chromosomal territories (“transcription factories”) (Edelman and Fraser, 2012) (Figure S4J); however, signs of such interchromosomal “transcriptional interactomes” (Schoenfelder et al., 2010) were not observed genome wide, possibly because of the disproportionate overrepresentation of the small, gene-rich chromosomes in 3D interchromosomal interactions (Maass et al., 2019). Collectively, these data suggest that the relative position of proviruses in the nuclear 3D chromatin architecture can influence viral transcription through combined ensemble effects of cis and trans epigenetic signals and that HIV-1 can take advantage of the full complexity of the human genome and epigenome in its 3D spatial configuration to fine-tune viral transcriptional behavior.
Together, these results identify several genomic and epigenetic features permitting discrimination of transcriptionally active proviruses across a large pool of proviral species isolated from a heterogeneous group of participants and sampling time points; however, the discriminatory effect size of each of these parameters, when assessed as isolated variables, was relatively weak. We subsequently considered a combinatorial analysis of genomic and epigenetic parameters for which significant associations with transcriptional activity were noted in univariable comparisons; these investigations indicated a progressive increase in transcriptional activity among proviruses stratified according to combinations of distinct IS features (Figure 2L). Consistent with this observation, a formal multivariable logistic regression model demonstrated that a combinatorial analysis of genomic and epigenetic IS characteristics facilitated discrimination of transcriptionally active proviruses (Figures 2M–2O) and supported positive associations between activating epigenetic chromatin features and proviral transcriptional activity. However, effects in this model varied considerably with the study subject and were critically influenced by the sampling time point, with longer durations of ART being generally associated with more limited proviral gene expression (Figure 2O). Therefore, the regulation of proviral transcriptional behavior may follow distinct evolutionary pathways in individual subjects, requiring participant-specific longitudinal analyses for further clarification.
Longitudinal selection of proviral sequences with weaker transcriptional activity
To investigate the time-dependent evolution of proviruses during suppressive ART, we evaluated the trajectory of HIV-1-infected cells around the time of ART initiation and during the ensuing 9–12 years of therapy in P1 and P2; in P3, data from year 1 and year 9 after ART commencement were available for longitudinal investigation. The frequency of all proviruses tended to decline after ART initiation in these three participants (Figures S5A–S5L). However, this decrease was substantially more pronounced for transcriptionally active proviruses, categorized by the detection of any HIV-1 RNA transcript, high-level HIV-1 RNA expression (>10,000 postamplification copies), or elongated HIV-1 RNA transcripts (Figures 3A–3C, S5A, S5E, and S5I). The accumulation of transcriptionally silent proviruses over time coincided with a gradual increase in the frequency of proviral IS in nongenic and satellite DNA (Figure 3D); however, the progressive proportional reduction in HIV-1 RNA-expressing cells was similarly visible among proviruses integrated in genic chromosomal regions (Figures 3B, S5B, S5F, and S5J) and resulted in profound alterations in the composition of the proviral landscape, with a disproportionate overrepresentation of transcriptionally silent proviruses after prolonged ART (Figure 3E). This pattern was consistent with low proportions of transcriptionally active proviruses detected cross-sectionally in study person 4 after 15 years of ART (Figures S5M–S5O). Notably, the longitudinal decline of HIV-1 RNA-expressing proviruses was more obvious for intact HIV-1 DNA compared with defective proviral species, leading to a markedly biased proviral IS profile with almost completely undetectable transcriptionally active intact proviruses after long-term ART (Figures 3F–3H). By contrast, the contribution of transcriptionally silent intact as well as transcriptionally silent defective proviruses to the total pool of HIV-1-infected cells remained relatively stable or expanded over the entire observation period (Figure 3H). Large clones of proviral sequences were detected in two of these study participants after several years of continuous ART. In P2, we noted a large genome-intact clone integrated in the ZNF140 gene and a second clone of near full-length proviruses with a premature stop codon (PSC) in gag integrated in pericentromeric satellite DNA on chromosome 16; both of these proviral clones showed no or minimal transcriptional activity (Figure 5). In P3, we noticed the evolution of two clones of intact proviruses, integrated in centromeric satellite DNA on chromosome 10 and on chromosome Y, respectively; no members of these two clones expressed detectable HIV-1 RNA (Figure 5). Together, these observations suggest a progressive selection of proviruses with lower transcriptional activity and chromosomal integration into nongenic or satellite DNA locations during prolonged ART.
Persistence of large clones of transcriptionally active proviruses
In contrast to P1–P3, study participant 5 displayed a biphasic evolution of HIV-1-infected cells, with an initial decline occurring during the first year of ART and a subsequent increase during prolonged ART (Figures 3I–3K). A closer investigation demonstrated that this longitudinal pattern was primarily driven by large clones of virally infected cells (Figures 3K–3M) that became visible after prolonged ART durations and frequently expressed viral long LTR HIV-1 RNA and, to a lesser extent, elongated HIV-1 RNA transcripts; in at least two cases, these large, transcriptionally active clones of cells encoded genome-intact HIV-1 (Figure 5). IS of these large, transcriptionally active proviruses were located at chromosomal positions with strong activating epigenetic support (Figure 6). By contrast, among proviruses with IS detected only once (derived from smaller clones or nonclonal HIV-1-infected cells), the proportion of proviruses expressing HIV-1 RNA declined over time (Figure 3J); this longitudinal decrease was most visible in the context of elongated HIV-1 RNA transcripts. A similar pattern was noted in P6, who had detectable low-level plasma viremia (>20 but <100 copies/mL) despite prolonged treatment with ART, in the absence of notable proviral resistance mutations against antiretroviral drugs. In this person, we generally noted a high (>50%) proportion of transcriptionally active proviruses (Figures 3N–3P). Nevertheless, there was a trend for a longitudinal decline of proviruses that were detected once and expressed elongated and/or high-level viral RNA (Figure 3O). By contrast, no such decrease was observed for large proviral clones that remained transcriptionally active and were integrated in chromosomal locations with strong activating chromatin features (Figures 3P–3R, 5, and 6); two of these transcriptionally active clones encoded genome-intact HIV-1 DNA (Figure 5). Collectively, the longitudinal trajectories in P5 and P6 resulted in a proviral landscape characterized by a gradual decline of HIV-1 RNA-expressing proviruses originating from small clones or nonclonal HIV-1-infected cells, coupled with a parallel expansion of large proviral clones frequently displaying strong proviral transcriptional activity. These results are generally consistent with a progressive selection advantage for proviruses with lower transcriptional activity during extended ART. However, large transcriptionally active proviral clones in P5 and P6 appeared to violate this evolutionary pattern and persisted/expanded despite strong expression of viral RNA, possibly because they can outcompete negative host selection forces through enhanced cell proliferation.
Longitudinal evolution of proviral integration site features
To better understand longitudinal selection mechanisms underlying changes in proviral transcriptional activity, we analyzed proviral chromosomal IS features over time in P1, P2, and P5, in whom PRIP-seq data were obtained at relatively similar time points (viremia, 1 year after ART initiation, and 9–12 years after ART initiation). Aligning IS coordinates to genome-wide cytosine methylation data in reference CD4 T cells (Komaki et al., 2018), we noted a progressive longitudinal accumulation of genic proviruses integrated in chromosomal regions with hypermethylated cytosine residues in host DNA upstream of the proviral 5′-LTR promoter, suggesting a role of epigenetic cytosine methylation in silencing proviral transcriptional activity during prolonged ART (Figures 4A and 4B). Although progressive transcriptional silencing was similarly visible for genic proviruses in same and opposite configurations to the most proximal host TSS (Figures S5P and S5Q), we noted marked differences in the longitudinal evolution of IS features for proviruses in the same versus opposite orientations to the nearest host TSS. Although chromosomal distances between proviral IS and the nearest same-oriented TSSs tended to increase over time, this trend was less apparent for chromosomal distances to the nearest opposite-oriented TSSs (Figure 4C). Moreover, using RNA-seq data from primary CD4 T cells from reference datasets (Einkauf et al., 2019), we noted that the host gene expression intensity at the nearest same-oriented TSSs remained stable over time, whereas host transcriptional activity at the nearest opposite-oriented TSSs increased, most notably in the context of TSSs in convergent orientation to the provirus (Figure 4D). In addition, among proviruses with the same orientation as the nearest host TSS, ATAC-seq and activating ChIP-seq reads in linear proximity of regions harboring integrated HIV-1 DNA decreased significantly over time (Figures 4E–4G); by contrast, there was a trend for increasing levels of ATAC-seq and activating ChIP-seq reads in linear proximity of IS for proviruses in opposite orientation to the nearest host TSS (Figures 4H–4J). Together, these data suggest that progressive selection of more transcriptionally silent proviruses over time is partly achieved by two complementary mechanisms: (1) selection forces that promote preferential persistence of proviruses with relatively increased distance to same-oriented host TSSs; transcriptional silencing of these proviruses is likely due to deprived proviral access to activating epigenetic chromatin signals and to host transcriptional machinery; and (2) host factors that select for proviruses in relative proximity to highly expressed host TSSs in opposite orientation; proviral transcriptional repression in this context may be achieved through transcriptional interference from host gene expression (Han et al., 2008; Lenasi et al., 2008).
Transcriptional activity of clonal HIV-1 proviruses
Clusters of clonal, sequence-identical HIV-1 proviruses were detected in all six study subjects, although their frequencies varied considerably; these sequence clusters (n = 42 in total) result from clonal proliferation of virally infected cells (Bui et al., 2017; Cohn et al., 2015; Hiener et al., 2017; Hosmane et al., 2017; Lee et al., 2017) (Figure 5). Particularly, we noted multiple large clones integrated in nongenic/pseudogenic DNA that evolved after prolonged ART and displayed no or minimal signs of transcriptional activity, specifically in P1–P4. Such clusters of sequence-identical proviruses included a clone with a large deletion integrated in centromeric satellite DNA (P1), a clone with a PSC in pericentromeric satellite DNA (P2), two genome-intact clones in centromeric satellite DNA (P3), and a clone with a 5′-LTR defect in a pseudogenic region (P4). Moreover, one large genome-intact clone integrated in the ZNF140 gene on chromosome 4 was observed after 12 years of treatment in P2; this clone was also completely transcriptionally silent across all 8 member sequences (Figure 5). By contrast, proviral clones with transcriptionally active members in P5 and P6 were frequently integrated in chromosomal locations surrounded by exceptionally strong activating epigenetic chromatin signals in their immediate chromatin proximity, particularly with regard to H3K4me1-specific ChIP-seq reads (Figures 5 and 6A–6E). Similarly, high levels of RNA-seq, ATAC-seq, and activating ChIP-seq signals were observed for transcriptionally active clones when sums of epigenetic sequencing reads in linear proximity and in 3D interchromosomal contact regions of IS were evaluated as composite readouts (Figures 6F–6H). Notably, one large transcriptionally active intact clone was detected in a nongenic region in P6; the IS of this clone was again located in immediate proximity to activating histone modifications, although there were no detectable RNA-seq reads at this nongenic site (Figures 5 and 6B).
The persistence and expansion of large, transcriptionally active proviral clones during long-term ART appeared to violate the longitudinal trajectory of proviral sequences detected once, for which we documented a progressive accumulation of transcriptionally silent proviruses with features of deeper transcriptional latency over time; this raises the question of how transcriptionally active proviruses can resist negative host selection mechanisms. Notably, genes harboring transcriptionally active clonal proviruses were frequently involved in the regulation of cell proliferation and oncogenesis, consistent with previous findings (Maldarelli et al., 2014; Simonetti et al., 2016; Wagner et al., 2014). This was particularly true for a large, transcriptionally active proviral clone integrated into the KDM2A gene (Figures 5 and 6A), which encodes a lysine-specific demethylase that can enhance cell-autonomous proliferation through downregulation of TET2 (Chen et al., 2017b); disruption of TET2 has previously been associated with enhanced clonal proliferation of CAR-encoding T cells (Fraietta et al., 2018). An oncogenic role was also reported for other genes harboring transcriptionally active proviral clones, such as CFAP299/C4orf22 (Kumar et al., 2014), USP34 (Scholtysik et al., 2015), and GON4L (Agarwal et al., 2016) (Figures 5 and 6A). Retroviral integration into such cancer/proliferation-associated genes may result in splicing-induced chimeric host/viral RNA, leading to fusion proteins that can stimulate cell-autonomous clonal proliferation (Cesana et al., 2017; Liu et al., 2020). For other transcriptionally active clonal proviruses, oncogenic functions of genes harboring the proviral IS were less obvious, and proliferation of the corresponding HIV-1-infected cells may be driven by antigen-specific effects (Mendoza et al., 2020; Simonetti et al., 2021). Independently of the mechanisms that drive proliferation of clonal HIV-1-infected cells, we propose that transcriptionally active proviral clones can persist long-term during ART when negative selection forces are outperformed by elevated cell turnover rates that effectively replenish and expand the clonal pool size.
Responses of individual proviruses to the latency-reversing agents PMA/ionomycin
Latency-reversing agents (LRAs) have been developed for the purpose of enhancing proviral transcriptional activity, with the intention of increasing their susceptibility to host immune factors; however, the influence of such LRAs on transcriptional activity may vary among individual proviruses and has rarely been assessed in single proviruses. To address this, we used the PRIP-seq platform to evaluate the transcriptional activity of HIV-1-infected cells in the presence or absence of stimulation with phorbol 12-myristate 13-acetate (PMA) and ionomycin, activators of protein kinase C (PKC) that are frequently considered the most powerful inducers of HIV-1 gene expression and commonly used as the positive control in HIV-1 reactivation assays. Briefly, participant-derived CD4 T cells were diluted to single HIV-1-infected cells, stimulated with PMA/ionomycin or with control medium for 12 h, and subsequently subjected to PRIP-seq assays. Overall, we assayed a total of 222 proviruses from two different ART-treated participants (P4 and P7); 97 cells were analyzed after stimulation with PMA and 125 cells after treatment with medium alone (Table S2). We observed no significant differences between the proportion of cells producing detectable levels of HIV-1 long LTR or elongated transcripts in stimulated versus nonstimulated cells (Figure 7A); however, among transcriptionally active proviruses, the per-cell levels of any HIV-1 RNA and of elongated HIV-1 RNA were higher after stimulation (Figures 7B and 7C). Remarkably, though, 79.4% of analyzed HIV-1-infected cells failed to produce detectable HIV-1 RNA despite stimulation with PMA/ionomycin; such stimulation-refractory proviruses had IS located in closer proximity to ChIP-seq peaks related to the inhibitory histone modifications H3K9me3 and H3K27me3 (Figure 7D), suggesting that only a subset of proviruses with favorable epigenetic chromosomal IS features can respond to PMA/ionomycin. Notably, for 4 clusters of clonal proviruses, we detected at least one proviral species in both stimulated and unstimulated conditions, allowing for a direct comparison of the transcriptional activity of proviruses with shared IS in the presence or absence of stimulation with PMA/ionomycin (Figures 7E and 7F). In this paired analysis, we observed that proviruses with detectable HIV-1 RNA at baseline uniformly showed higher per-cell levels of viral transcripts after stimulation (Figure 7F). By contrast, a member of a proviral clone with an intact HIV-1 promoter integrated in a pseudogenic position (Chr5: 177901756) that did not produce HIV-1 RNA in the absence of stimulation also remained transcriptionally silent after stimulation (Figure 7F); 5 transcriptionally silent members of this clone were also detected in direct ex vivo assessments (P4, Figure 5). Together, these results support the hypothesis that chromosomal location critically determines the susceptibility of proviruses to LRAs; PMA/ionomycin stimulation appears to enhance per-cell levels of viral RNA transcripts but is unable to effectively overcome epigenetic transcriptional blocks and “deep” viral latency in proviruses that are transcriptionally silent at baseline. Future studies will be necessary to determine viral reactivation after stimulation with alternative LRAs.
Discussion
Although HIV-1 integration in chromosomal DNA is heavily biased toward active host transcription units, the virus can integrate almost anywhere in the human genome, leading to a diverse portfolio of proviral species that differ profoundly with regard to chromosomal location and transcriptional activity (Chen et al., 2017a; Vansant et al., 2020). Seeding such a variegated reservoir of proviral sequences that exist in distinct transcriptional activation states may offer the highest flexibility to resist host immune activity and can be viewed as a probabilistic bet-hedging strategy designed to maximize viral persistence in unpredictable environments (Rouzine et al., 2015). However, how proviral transcriptional behavior and chromosomal location synergize in determining the evolutionary fate and the persistence or elimination of HIV-1-infected cells during ART is unclear. Here, we used an assay to simultaneously profile the transcriptional behavior, the proviral sequence, and the chromosomal position of HIV-1 proviruses in unmanipulated participant-derived cells, permitting the global mapping of transcriptionally active and silent proviruses relative to genome-wide genomic and epigenetic chromatin features and the identification of proviral species that are under evolutionary selection pressure during ART in vivo. We note that instead of being “stable” and “transcriptionally silent,” viral reservoir cells are frequently transcriptionally active and dynamically evolving in response to host-dependent selection forces.
Remarkably, our work suggests positive selection of proviruses with lower transcriptional activity during prolonged ART, arguably due to preferential elimination of transcriptionally active proviruses with higher vulnerability to host immune activity. These footprints of host selection strongly suggest that HIV-1-infected cells are subject to host immune surveillance during ART, likely because proviral gene expression can be sensed by host immune factors. In contrast to immune selection effects during untreated HIV-1 infection that can be readily detected by accumulation of viral sequence variations, the selection footprints reported here are indirectly inferred through complex profiling of individual HIV-1-infected cells; identifying the specific immune responses that drive such selection processes will represent a major challenge for future studies. Of note, we repeatedly noticed that large proviral clones can resist evolutionary selection processes that otherwise promote longitudinal accumulation of proviral sequences with progressively decreasing transcriptional activity. Although long-term persistence of transcriptionally active proviruses has also been suggested by other investigators (Halvas et al., 2020; Simonetti et al., 2016), our results suggest that they represent an exception of the rule and are facilitated by two coinciding factors: the presence of strong activating epigenetic signals in linear and 3D contact regions of their IS and an elevated rate of cell turnover that can antagonize negative host selection mechanisms through compensatory repopulation and expansion of the clonal pool size. In addition, upregulation of antiapoptosis and cell survival markers during clonal proliferation of T cells may protect large cell clones against antiviral host immune responses (Cohn et al., 2018; Kuo et al., 2018; Ren et al., 2020). It is possible that these intact, transcriptionally active proviruses play a critical role for driving viral rebound in case of treatment interruptions; future studies will be necessary to address this (Aamer et al., 2020; Cole et al., 2021).
Manipulating the transcriptional behavior of proviruses to enhance their susceptibility to immune-mediated elimination, colloquially referred to as the “shock and kill” approach, has been evaluated in multiple clinical trials (Archin et al., 2012; Rasmussen et al., 2014; Søgaard et al., 2015). Our data suggest that pharmacological increases of proviral transcriptional activity may augment proviral vulnerability to host selection mechanisms, but such effects may have remained invisible in previous clinical trials that primarily used quantitative viral reservoir measurements in bulk CD4 T cells as study endpoints. Indeed, it remains possible that LRAs or immunotherapeutic interventions intensify or accelerate naturally occurring immune selection of proviruses and promote a reservoir structure characterized by features of deeper latency and lower transcriptional activity. These more subtle selection effects may result in a proviral reservoir landscape that is more easily controlled by antiviral host immunity and less likely to fuel rebound viremia; ultimately, such a modulated reservoir structure may allow for a peaceful coexistence between HIV-1 and the host and enable drug-free control of HIV-1 infection (Jiang et al., 2020; Lian et al., 2021). Future experiments on the PRIP-seq platform may help to better dissect the effects of LRAs on single HIV-1-infected cells in samples from clinical trial participants and to detect specific susceptibilities and vulnerabilities of HIV-1 reservoir cells.
Limitations of the study
A limitation of our work is that transcriptional profiles of individual proviruses were related to chromatin features from reference datasets, and not from single-cell assessments in infected cells. Technical progress in multidimensional single-cell profiling may in the future permit to address this point; in fact, recent studies have started to evaluate lentiviral chromosomal IS coordinates within ATAC-seq reads from single infected cells (Wang et al., 2020). In addition, the transcriptional activity of proviruses is known to be burst driven (Singh et al., 2010; Skupsky et al., 2010), and it is likely that transcriptional noise may have influenced our findings, although our work identifies clear contributions of the genomic environment to the transcriptional regulation of HIV-1 proviruses.
STAR★Methods
Key resources table
REAGENT or RESOURCE | SOURCE | IDENTIFIER |
---|---|---|
Antibodies | ||
Mouse antihuman CD4 (clone RPA-T4) | BioLegend | Cat#300518; RRID: AB_314086 |
Mouse antihuman CD3 (clone OKT3) | BioLegend | Cat#317332; RRID: AB_2561943 |
Mouse antihuman CD45RO (clone UCHL1) | BioLegend | Cat#304236; RRID: AB_2562107 |
Mouse antihuman CCR7 (clone G043H7) | BioLegend | Cat#353216; RRID: AB_10916386 |
Biological samples | ||
PBMC samples from study participants living with HIV | Massachusetts General Hospital | https://www.massgeneral.org/ |
Chemicals, peptides, and recombinant proteins | ||
Buffer RLT Plus | Qiagen | Cat#1053393 |
Invitrogen Dynabeads MyOne Streptavidin C1 | ThermoFisher Scientific | Cat#65002 |
Invitrogen SUPERase⋅In RNase Inhibitor | ThermoFisher Scientific | Cat#AM2694 |
Invitrogen dNTP mix (10mM each) | ThermoFisher Scientific | Cat#18427088 |
AMPure XP beads | Beckman Coulter | Cat#A63882 |
5M NaCl | ThermoFisher Scientific | Cat#AM9760G |
10M NaOH | Millipore Sigma | Cat#72068 |
0.5M EDTA (pH 8.0) | Promega | Cat#V4231 |
UltraPure 1M Tris-HCI Buffer (pH 7.5) | ThermoFisher Scientific | Cat#15567027 |
1M MgCl2 | ThermoFisher Scientific | Cat#AM9530G |
2M KCl | ThermoFisher Scientific | Cat#AM9640G |
1M DTT | Millipore Sigma | Cat#646563 |
TWEEN 20 (50% Solution) | ThermoFisher Scientific | Cat#003005 |
Betaine solution (5M) | Millipore Sigma | Cat#B0300 |
BioLegend Cell Activation Cocktail (without Brefeldin A) | BioLegend | Cat#423302 |
Recombinant IL-2 | NIH AIDS Reagent program | www.hivreagentprogram.org |
AZT | NIH AIDS Reagent Program | www.hivreagentprogram.org |
Critical commercial assays | ||
DNeasy Blood and Tissue Kit | Qiagen | Cat#69504 |
ddPCR Supermix for Probes (No dUTP) | Bio-Rad | Cat#1863024 |
Invitrogen SuperScript II Reverse Transcriptase kit | ThermoFisher Scientific | Cat#18064022 |
KAPA HiFi HotStart ReadyMix | Roche | Cat#7958935001 |
REPLI-g Single Cell Kit | Qiagen | Cat#150345 |
REPLI-g Advanced Single Cell Kit | Qiagen | Cat#150365 |
Invitrogen Platinum Taq DNA Polymerase High Fidelity | ThermoFisher Scientific | Cat#11304102 |
Stemcell EasySep Human CD4+ T Cell Isolation Kit | Stemcell Technologies | Cat#17952 |
PicoPure RNA Isolation Kit | Applied Biosystems | Cat#0204 |
Deposited data | ||
Ensembl (v86) | Ensembl | http://oct2016.archive.ensembl.org/index.html |
UCSC Genome Browser | UCSC | http://genome.ucsc.edu |
GENCODE (v32) | GENCODE | https://www.gencodegenes.org/human/release_32.html |
Roadmap database | Kundaje et al., 2015 | http://www.roadmapepigenomics.org/ |
ENCODE database | (ENCODE Project Consortium, 2012) | https://www.encodeproject.org/ |
iMethyl database | Komaki et al., 2018 | http://imethyl.iwate-megabank.org/ |
Hi-C data from CD4 T cells | This study | GEO ID: GSE168337 |
ATAC-Seq and RNA-Seq data from CD4 T cells | Jiang et al., 2020 | GEO ID: GSE144334 |
Oligonucleotides | ||
See Table S3 for List of Primers/Probes | Millipore Sigma/IDT/Qiagen | N/A |
Quantitative Synthetic Human immunodeficiency virus 1 (HIV-1) RNA | ATCC | Cat#VR-3245S |
Software and algorithms | ||
QuantaSoft software | Bio-Rad | Cat#1864011 |
Ultracycler v1.0 | Seed and Wang, personal communication | https://dnacore.mgh.harvard.edu/new-cgi-bin/site/pages/viral_genome_sequencing_pages/viral_genome_sequencing_data.jsp |
Automated in-house proviral intactness bioinformatic pipeline in Python | Lee et al., 2017 | https://github.com/BWH-Lichterfeld-Lab/Intactness-Pipeline |
Los Alamos National Laboratory (LANL) HIV Sequence Database Hypermut 2.0 | Rose and Korber, 2000 | https://www.hiv.lanl.gov/content/sequence/HYPERMUT/background.html |
ProSeq-IT | Shao et al., 2020 | https://psd.cancer.gov/tools/pvs_annot.php |
MUSCLE | Edgar, 2004 | http://www.drive5.com/muscle/ |
Geneious Prime 2021.0.3 | Biomatters | https://www.geneious.com/download/ |
bwa-mem | Li and Durbin, 2009 | http://maq.sourceforge.net/ |
RepeatMasker | Institute for Systems Biology | http://www.repeatmasker.org/ |
RSEM (v1.2.22) | Li and Dewey, 2011 | http://deweylab.github.io/RSEM/ |
STAR aligner software (2.5.1b) | ENCODE | https://www.encodeproject.org/software/star/ |
Prism | Graphpad, https://www.graphpad.com/scientific-software/prism | version 8.2.1 |
R | R Core Team and R Foundation for Statistical Computing, https://www.r-project.org | version 3.5.3 |
FastQC | Babraham Bioinformatics, https://www.bioinformatics.babraham.ac.uk | version 0.11.9 |
Samtools | Genome Research Limited, http://www.htslib.org | version 1.14 |
MACS2 | https://github.com/macs3-project/MACS | version 2.1.1.20160309 |
Recombinant Identification Program | Los Alamos National Laboratory, https://www.hiv.lanl.gov/content/sequence/RIP/RIP.html | |
Bowtie2 | http://bowtie-bio.sourceforge.net/bowtie2/index.shtml | version 2.2.9 |
Homer | http://homer.ucsd.edu/homer/interactions/ | version 4.10.3 |
FitHiC2 | https://bioconductor.org/packages/release/bioc/html/FitHiC.html | version 1.20.0 |
FIREcaller | https://github.com/yycunc/FIREcaller | version 1.40 |
Python | Python Software Foundation, https://www.python.org/ | version 3.9 |
Scikit-learn | https://scikit-learn.org/ | Version 0.24.0 |
Biorender | https://biorender.com | |
Other | ||
QX200 Droplet Digital PCR System | Bio-Rad | https://www.bio-rad.com/en-us/life-science/digital-pcr/qx200-droplet-digital-pcr-system |
C1000 Touch Thermal Cycler with 96-Well Fast Reaction Module | Bio-Rad | Cat#1851196 |
Quantify One and ChemiDoc MP Image Lab | Bio-Rad | https://www.bio-rad.com/en-us/product/chemidoc-mp-imaging-system |
ThermoMixer C | Eppendorf | Cat#5382000023 |
96-Well PCR Post Magnet Low Elution Plate | Permagen | Cat#LE400 |
DynaMag-96 Side Skirted Magnet | ThermoFisher Scientific | Cat#12027 |
DynaMag-2 Magnet | ThermoFisher Scientific | Cat#12321D |
Illumina MiSeq performed by MGH CCIB DNA Core facility | Illumina/MGH CCIB DNA Core | https://dnacore.mgh.harvard.edu/new-cgi-bin/site/pages/index.jsp |
FACS Aria Cell Sorter | BD Biosciences | https://www.bdbiosciences.com/en-us/products/instruments/flow-cytometers/research-cell-sorters/bd-facsaria-iii |
NextSeq 500 Instrument | Illumina | https://www.illumina.com/systems/sequencing-platforms/nextseq.html |
Resource availability
Lead contact
Further information and requests for resources and reagents should be directed to and will be fulfilled by the lead contact, Mathias Lichterfeld (mlichterfeld@partners.org).
Materials availability
This study did not generate new unique reagents.
Experimental model and subject details
Study participants
HIV-1-infected study participants were recruited at the Massachusetts General Hospital in Boston, MA. PBMC samples were obtained according to protocols approved by the Institutional Review Board. Clinical characteristics of study participants are summarized in Figure S2. Demographic characteristics of the study patients are as follows:
Participant | Age at time of study | Sex |
---|---|---|
P1 | 54 yrs | M |
P2 | 74 yrs | M |
P3 | 56 yrs | undetermined |
P4 | 65 yrs | M |
P5 | 54 yrs | M |
P6 | 62 yrs | F |
P7 | 60 yrs | M |
Method details
HIV-1 DNA quantification by droplet digital PCR
PBMC isolated according to standard protocols were subjected to DNA extraction using commercial kits (Qiagen DNeasy Blood and Tissue Kit, #69504). We amplified total HIV-1 DNA using droplet digital PCR (Bio-Rad), using primers and probes described previously (Lee et al., 2017) (127 bp 5’-LTR-gag amplicon; HXB2 coordinates 684-810). PCR was performed using the following program: 95°C for 10 min, 45 cycles of 94°C for 30s and 60°C for 1 min, 98°C for 10 min. The droplets were subsequently read by a QX200 droplet reader and data were analyzed using QuantaSoft software (Bio-Rad).
Separation of genomic DNA and HIV-1 RNA
Cryopreserved PBMCs were thawed, washed, diluted in 96-well plates to single HIV-1-infected cells according to ddPCR results, so that one virally-infected cell was present in approximately 30% of wells, and lysed with Buffer RLT Plus (Qiagen, #1053393). Subsequently, cell lysates were incubated on a thermal mixer (Eppendorf ThermoMixer C, #5382000023) at 1200 rpm for 20 min at room temperature with complexes of magnetic streptavidin beads (Invitrogen Dynabeads MyOne Streptavidin C1, #65002) linked to biotinylated primers targeting defined regions (poly-A, tat-rev, nef, pol, long LTR) of HIV-1 RNA; primer sequences were previously described (Yukl et al., 2018) and modified to permit subsequent amplification with a Smart-seq2 protocol (Picelli et al., 2014). Primer sequences are listed in Table S3. After primer annealing, viral RNA was magnetically separated from genomic DNA according to the G&T-Seq protocol described previously (Bronner and Lorenz, 2019; Macaulay et al., 2016). Briefly, the magnetic beads bound to HIV-1 RNA were immobilized to the bottom of each well with an external magnet and the supernatant containing the genomic DNA fraction was transferred to a new plate. The beads were washed twice in a buffer containing 50 mM Tris-HCl, 7.5 mM KCl, 3 mM MgCl2, 10 mM DTT and 0.5% Tween 20; after each wash, the supernatants from each well were collected and added to the corresponding genomic DNA solutions. In a modification of the RNA-binding process for participants 2, 3, 5 and 6, biotinylated poly-A, nef, pol and long LTR primers were added sequentially in 10-minute intervals, rather than simultaneously, since this facilitated the capture and later detection of elongated HIV-1 RNA species. In selected cases, these experiments were performed using an RNA standard (ATCC, VR-3245SD) with known viral copy numbers spiked into a background population of 10,000 cells from an HIV-1-negative person for technical validation of the assay workflow.
HIV-1 cDNA synthesis, amplification, and detection
The bead-bound HIV-1 RNA was subjected to reverse transcription with SuperScript II Reverse Transcriptase (Invitrogen, #18064022) using a master mix containing 10 mM dNTPs, 6 mM MgCl2, 1 M betaine, 5 mM DTT, 1 U/μL SUPERase⋅In RNase Inhibitor (Invitrogen, #AM2694), 2 μM template-switching oligonucleotide (5’-AAGCAGTGGTATCAACGCAGAGTACATrGrG+G-3’) and 20 U/μL reverse transcriptase in SuperScript II RT First-Strand Buffer. This mix was incubated on a ThermoMixer at 1500 rpm for 60 min at 42 °C, 30 min at 50 °C and 10 min at 60 °C. Subsequently, cDNA was amplified by PCR with KAPA HiFi HotStart ReadyMix (Roche, #7958935001) and 300 nM ISPCR primer (5’-AAGCAGTGGTATCAACGCAGAGT-3’) using a modified Smart-seq2 protocol, as described in the G&T-Seq procedure (Macaulay et al., 2016); for participants 2, 3, 5 and 6, HIV-1-specific primers for poly-A, nef, tat-rev, pol and long LTR viral transcripts (Yukl et al., 2018) were added after the initial 10 Smart-seq2 amplification cycles to complete an additional 25 amplification cycles. Amplified cDNA was subjected to absolute quantification using ddPCR with primers and probes targeting different regions of HIV-1 transcripts, as previously described (Yukl et al., 2018). All primer sequences are listed in Table S3.
Whole genome amplification
Isolated gDNA was incubated with AMPure XP magnetic beads (Beckman Coulter, #A63882) and immobilized at the side of each well with an external magnet. After several washes with 80% ethanol, each well was subjected to multiple displacement amplification (MDA) with phi29 polymerase (Qiagen REPLI-g Single Cell Kit, #150345 or REPLI-g Advanced Single Cell Kit, #150365) for 4 hours (Kit #150345) or for 2 hours (Kit #150365), per the manufacturer’s protocol for amplification of genomic DNA from single eukaryotic cells. After this unbiased whole genome amplification (Burtt, 2011), DNA was again incubated with AMPure XP beads and washed and eluted in water. gDNA from each well was split and separately subjected to viral sequencing and integration site analysis, as described below.
HIV-1 near full-genome sequencing
DNA resulting from whole-genome amplification reactions was subjected to HIV-1 near full-genome amplification using a near full-length amplicon as well as a nonmultiplexed overlapping 5-amplicon approach, as described before (Einkauf et al., 2019). Additionally, amplification of the HIV-1 core promoter and enhancer region (Shukla et al., 2020) was performed, using one of two nested PCRs: one with primer sequences corresponding to HXB2 coordinates 350-372 and 642-661 (first round) and 367-385 and 626-643 (second round); or another nested PCR with primer sequences corresponding to HXB2 coordinates 24-50 and 936-962 (first round) and 76-100 and 797-818 (second round). All primer sequences are listed in Table S3. These promoter PCRs were performed with the following program: 94 °C for 2 min, 30 cycles of 94 °C for 15 s, 60 °C for 30 s, 68 °C for 1 min. PCR products were visualized by agarose gel electrophoresis (Quantify One and ChemiDoc MP Image Lab, BioRad). All near full-length or 5-amplicon positive PCR products, as well as a large number of proviruses with evident major deletions, were subjected to Illumina MiSeq sequencing at the MGH DNA Core facility. Resulting short reads were de novo assembled using Ultracycler v1.0 and aligned to HXB2 to identify large deleterious deletions (<8000bp of the amplicon aligned to HXB2), out-of-frame indels, premature/lethal stop codons, internal inversions, or packaging signal deletions (≥15 bp insertions and/or deletions relative to HXB2), using an automated in-house pipeline written in Python programming language (https://github.com/BWH-Lichterfeld-Lab/Intactness-Pipeline), consistent with prior studies (Hiener et al., 2017; Lee et al., 2017, 2019; Pinzone et al., 2019). Presence/absence of APOBEC-3G/3F-associated hypermutations was determined using the Los Alamos National Laboratory (LANL) HIV Sequence Database Hypermut 2.0 (Rose and Korber, 2000) program; hypermutated proviruses that also contained other structural defects were categorized as hypermutated when binning proviruses according to proviral genomic defects. Viral sequences that lacked all mutations listed above were classified as “genome-intact” sequences. Proviral sequences identified as “genome-intact” with this algorithm were also classified as “genome-intact” using an alternative analysis procedure (Shao et al., 2020). Sequence alignments were performed using MUSCLE (Edgar, 2004) and Geneious Prime 2021.0.3 (geneious.com). Phylogenetic distances between sequences were examined using UPGMA trees in Geneious Prime. Viral sequences were considered clonal if they had identical integration sites and 3 or fewer mismatches between proviral sequences; single nucleotide variations in primer binding sites were not considered for clonality analysis.
Integration site analysis
Integration sites associated with each viral sequence were obtained using integration site loop amplification (ISLA), using a protocol previously described (Wagner et al., 2014) and DNA produced by whole-genome amplification as template. Resulting PCR products were subjected to next-generation sequencing using Illumina MiSeq. MiSeq paired-end FASTQ files were demultiplexed; small reads (142 bp) were then aligned simultaneously to human reference genome GRCh38 and HIV-1 reference genome HXB2 using bwa-mem (Li and Durbin, 2009). Biocomputational identification of integration sites was performed according to previously-described procedures (Wagner et al., 2014): Briefly, chimeric reads containing both human and HIV-1 sequences were evaluated for mapping quality based on (i) HIV-1 coordinates mapping to the terminal nucleotides of the viral genome, (ii) absolute counts of chimeric reads, and (iii) depth of sequencing coverage in the host genome adjacent to the viral integration site. The final list of integration sites and corresponding chromosomal annotations was obtained using Ensembl (v86, www.ensembl.org), the UCSC Genome Browser (www.genome.ucsc.edu) and GENCODE (v25, www.gencodegenes.org). Repetitive genomic sequences harboring HIV-1 integration sites were identified using RepeatMasker (www.repeatmasker.org). Cells in which multiple HIV-1 integration sites were detected, either as a result of multiple HIV-1-infected cells being present in one limiting-dilution well or multiple HIV-1 proviruses being integrated into a single cell, were excluded from analysis; the latter possibility is unlikely (Josefsson et al., 2011).
HIV-1 reactivation assays
To evaluate HIV-1 RNA expression in response to stimulation, we isolated CD4+ T cells from participant-derived PBMC (Stemcell EasySep™ Human CD4+ T Cell Isolation Kit, #17952), followed by dilution to single HIV-1-infected cells for incubation with 81 nM PMA and 1.34 μM ionomycin (Biolegend Cell Activation Cocktail (without Brefeldin A), #423302) with 20 U/μL recombinant IL-2 and 400 nM AZT; simultaneously, unstimulated autologous cells were plated in the same way for control purposes. After 12 hours of incubation, stimulated and unstimulated CD4+ T cells were subjected to the PRIP-Seq protocol for analysis of HIV-1 integration sites, proviral sequences and HIV-1 RNA expression from individual HIV-1-infected cells.
Evaluation of epigenetic and chromosomal contact features
PBMC from three ART-treated HIV-1 participants were used for parallel analysis of CD4 T cells by RNA-Seq, ATAC-Seq, and Hi-C, as described below. ChIP-Seq data were obtained from primary memory CD4 T cells included in the ROADMAP database (Kundaje et al., 2015) or from activated CD4 T cells included in the ENCODE project (ENCODE Project Consortium, 2012). Cytosine methylation data were obtained from the iMethyl database (Komaki et al., 2018).
Cell sorting
Briefly, total PBMC were stained with monoclonal antibodies to CD4 (clone RPA-T4, Biolegend, #300518), CD3 (clone OKT3, Biolegend, #317332), CD45RO (clone UCHL1, Biolegend, #304236) and CCR7 (clone G043H7, Biolegend, #353216). Afterwards, cells were washed and CD45RO+ CCR7+ (central-memory) and CD45RO+ CCR7- (effector-memory) and CD3+ CD4+ (total) CD4+ T cells were sorted in a specifically designated biosafety cabinet (Baker Hood), using a FACS Aria cell sorter (BD Biosciences) at 70 pounds per square inch. Cell sorting was performed by the Ragon Institute Imaging Core Facility at MGH and resulted in isolation of lymphocytes with the defined phenotypic characteristics of >95% purity.
RNA-Seq
RNA was extracted from total CD4+ T-cell populations using a PicoPure RNA Isolation Kit (Applied Biosystems, #0204). RNA-Seq libraries were generated as previously described (Trombetta et al., 2014). Briefly, whole transcriptome amplification and tagmentation-based library preparation was performed using Smart-seq2, followed by sequencing on a NextSeq 500 Instrument (Illumina). The quantification of transcript abundance was conducted using RSEM software (v1.2.22; (Li and Dewey, 2011) supported by STAR aligner software (STAR 2.5.1b) and aligned to the hg38 human genome. Transcripts per million (TPM) values were then normalized among all samples using the upper quantile normalization method. Integration site features were calculated individually using each of the three RNA-Seq datasets from the three assayed participants; medians or averages of the three datasets were used for statistical analyses.
ATAC-Seq
A previously-described protocol with some modifications (Corces et al., 2016, 2017) was used. Briefly, 20,000 total CD4 T cells were centrifuged at 1500 rpm for 10 min at 4°C in a precooled fixed-angle centrifuge. All supernatant was removed and a modified transposase mixture (including 25 μl of 2x TD buffer, 1.5 μl of TDE1, 0.5 μl of 1% digitonin, 16.5 μl of PBS, 6.5 μl of nuclease-free water) was added to the cells and incubated in a heat block at 37°C for 30 min. Transposed DNA was purified using a ChIP DNA Clean & Concentrator Kit (Zymo Research, #D5205) and eluted DNA fragments were used to amplify libraries. The libraries were quantified using an Agilent Bioanalyzer 2100 and the Q-Qubit™ dsDNA High Sensitivity Assay Kit (Invitrogen, #Q33230). All Fast-ATAC libraries were sequenced using paired-end, single-index sequencing on a NextSeq 500/550 instrument with v2.5 Kits (75 Cycles). The quality of reads was assessed using FastQC (https://www.bioinformatics.babraham.ac.uk). Low quality DNA end fragments and sequencing adapters were trimmed using Trimmomatic (http://www.usadellab.org). Sequencing reads were then aligned to the human reference genome hg38 using a short-read aligner (Bowtie2, http://bowtie-bio.sourceforge.net/bowtie2/index.shtml) with the nondefault parameters “X2000”, “nonmixed” and “nondiscordant”. Reads from mitochondrial DNA were removed using Samtools (http://www.htslib.org). Peak calls were made using MACS2 with the callpeak command (https://pypi.python.org/pypi/MACS2), with a threshold for peak calling set to FDR-adjusted p<0.05. Integration site features were calculated individually using each of the three ATAC-Seq datasets from the three assayed participants; medians or averages of the three datasets were used for statistical analyses.
Hi-C
We performed in situ Hi-C on sorted total CD4 T cells, central-memory and effector-memory CD4 T cells as previously described with minor modifications (Díaz et al., 2018). Briefly, one million cells were crosslinked in 1% formaldehyde (Sigma, #F8775-25ML) for 10 min, followed by quenching with glycine (Sigma, #50046-50G). Cell nuclei were permeabilized with 0.4% sodium dodecyl sulphate (SDS) solution and chromatin was digested using 100 U of MboI (NEB, #R0147L). Overhangs generated by the restriction enzyme were filled using Klenow DNA polymerase (NEB, #M0210L) and a mix of dNTPs including biotin-14-dCTP (Invitrogen, #19518018). DNA fragments were then ligated using T4 DNA ligase (NEB, #M0202). After reversion of crosslinking, cellular proteins were digested with Proteinase K (NEB, #P8107), followed by DNA extraction using Phenol-Chloroform-Isoamyl alcohol. The extracted DNA was subjected to digestion of contaminating RNA with RNase A (ThermoFisher, #EN0531). Samples were sheared using a Covaris E220 instrument with a target size of 300-500 base pairs (140W peak incident power, each 67s, 10% duty, 200 cycles/burst) at the MGH NextSeq Core facility. Biotinylated fragments were isolated with Dynabeads MyOne Streptavidin C1 beads (Invitrogen, #65002). DNA linked to the Dynabeads was end-repaired using the End Prep Enzyme Mix and subsequently used for library construction with the NEBNext Ultra DNA Library Prep Kit for Illumina Sequencing (NEB, #E7370S). Final amplification of the libraries was done in 4 parallel reactions per sample according to the following program: 98 °C for 1 min, (98 °C for 10 s, 65 °C for 75 s, ramping 1.5 °C/s) repeated 12–20 times, 65 °C for 5 min, 4 °C. The indexed samples were processed individually and double size-selected using SPRISelect beads (Beckman Coulter, #B23318). Final Hi-C libraries were quantified using the Qubit dsDNA HS assay kit (Invitrogen, #Q33230) and a High Sensitivity D1000 kit (Agilent, #5067-5585) on a Tapestation 4200 (Agilent). The library was then sequenced on an Illumina NextSeq 500/550 (2x80 bp paired-end; NextSeq 500/550 High Output kit v2.5-150 cycles). HomerTools in the software suite HOMER (version 4.10.3) was used to truncate raw sequencing reads at the restriction enzyme cutting site (Heinz et al., 2010), followed by aligning reads to the human reference genome (GRCh38) with Bowtie2 (version 2.3.4.3) (Langmead and Salzberg, 2012). The genome was tiled into bins with defined resolutions using the default method in HOMER to normalize the total read counts in each region. TADs were determined using default utilities from HOMER. Independently, we called significant inter-chromosomal and intrachromosomal interactions using FitHiC2 with Knight-Ruiz matrix balancing (Knight and Ruiz, 2013) and FDR-adjusted p-values < 0.05 as a cutoff (Ay et al., 2014; Kaul et al., 2020). Frequently interacting regions (FIREs) were called using FIREcaller (version 1.10) (Crowley et al., 2021). For data analysis, reads from autologous central-memory, effector-memory and total CD4 T cells were combined to generate a pooled dataset with overrepresentation of reads from memory cells, to account for the fact that most HIV-1-infected cells are included in the memory cell compartment (Chomont et al., 2009). The numbers of total unique sequencing reads in these pooled datasets in the three assayed participants were 1,424,507,007; 1,685,072,487; and 2,400,898,418, respectively. Integration site features were calculated individually using each of the three Hi-C datasets; medians or averages of the three datasets were used for statistical analysis.
Quantification and statistical analysis
Statistics
Data are presented as circos plots, pie charts, bar charts, line graphs and box and whisker plots, showing the median, the 25% and 75% percentiles, and maximum/minimum values. Differences were tested for statistical significance using Mann-Whitney U tests (two-tailed), Fisher’s exact tests (two-tailed) or G-tests, as appropriate. Longitudinal correlations were evaluated using Kendall’s rank correlation coefficient (τ). p-values of <0.05 were considered significant. Analyses were performed using Prism (GraphPad Software, Inc.), Python (Python Software Foundation), and R (R Foundation for Statistical Computing (R Development Core Team, 2019)).
Logistic regression model
The Scikit-learn module (Pedregosa et al., 2012) in Python was used to develop a machine learning model to predict the transcriptional activity of HIV-1 integration sites. Briefly, the entire dataset of proviral species was randomly split into a training set with 80% of observations and a test set with 20% of observations, with each set containing the same proportion of observations from each participant and timepoint and the same proportion of RNA+ and RNA- proviruses. A logistic regression model was then trained on the training dataset with HIV-1 Long LTR RNA detection as the target variable. During training, stratified 10-fold cross-validation was used to determine the optimal data standardization and regularization strategies, and ultimately the model with the highest mean cross-validated area under the receiver operating characteristic curve (ROC AUC) was selected for evaluation with the test dataset.
Acknowledgments
M.L. is supported by NIH grants AI117841, AI120008, AI130005, DK120387, AI152979, and AI155233 and by the American Foundation for AIDS Research (#110181-69-RGCV). X.G.Y. is supported by NIH grants AI155171, AI116228, AI078799, HL134539, and DA047034 and the Bill and Melinda Gates Foundation (INV-002703). M.L. and X.G.Y. are members of the DARE, ERASE, and BEAT-HIV Martin Delaney Collaboratories (UM1 AI164560, AI164562, and AI164570).
Author contributions
Whole-genome amplification and HIV-1 sequencing, K.B.E., M.R.O., E.M.P., W.S., G.T.G., K.W.S., and J.E.B.; IS analysis, K.B.E., M.R.O., E.M.P., C.J., X.L., and J.E.B.; analysis of HIV-1 RNA transcripts, K.B.E., M.R.O., W.S., E.M.P., G.T.G., amd J.E.B.; RNA-seq, ATAC-seq, and Hi-C, X.S.; bioinformatics analysis, C.G.; modeling and machine learning, M.R.O. and C.G.; biological specimen contribution, E.S.R.; data interpretation, analysis, and presentation, K.B.E., M.R.O., W.S., C.G., X.G.Y., and M.L.; preparation and writing of manuscript, K.B.E., M.R.O., W.S., C.G., and M.L.; research idea, K.B.E., M.R.O., S.Y., X.G.Y., and M.L.; study supervision, M.L.
Declaration of interests
The authors declare no competing interests.
Inclusion and diversity
We worked to ensure ethnic or other types of diversity in the recruitment of human subjects. One or more of the authors of this paper self-identifies as a member of the LGBTQ+ community.
Published: January 6, 2022
Footnotes
Supplemental information can be found online at https://doi.org/10.1016/j.cell.2021.12.011.
Supplemental information
Data and code availability
-
•
This paper does not report original code.
-
•
Data were deposited to Gene Expression Omnibus (GEO) with the following accession numbers: RNA-Seq and ATAC-Seq: GSE144334, Hi-C: GSE168337
-
•
Proviral integration sites and their transcriptional activity are listed in Tables S1 and S2.
-
•
Proviral sequences: Due to study participant confidentiality concerns, viral sequencing data cannot be publicly released but will be made available to investigators upon reasonable request and after signing a coded tissue agreement.
-
•
Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.
References
- Aamer H.A., McClure J., Ko D., Maenza J., Collier A.C., Coombs R.W., Mullins J.I., Frenkel L.M. Cells producing residual viremia during antiretroviral treatment appear to contribute to rebound viremia following interruption of treatment. PLOS Pathog. 2020;16 doi: 10.1371/journal.ppat.1008791. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Achuthan V., Perreira J.M., Sowd G.A., Puray-Chavez M., McDougall W.M., Paulucci-Holthauzen A., Wu X., Fadel H.J., Poeschla E.M., Multani A.S., et al. Capsid-CPSF6 interaction licenses nuclear HIV-1 trafficking to sites of viral DNA integration. Cell Host Microbe. 2018;24:392–404. doi: 10.1016/j.chom.2018.08.002. e8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Agarwal N., Dancik G.M., Goodspeed A., Costello J.C., Owens C., Duex J.E., Theodorescu D. GON4L drives cancer growth through a YY1-androgen receptor-CD24 axis. Cancer Res. 2016;76:5175–5185. doi: 10.1158/0008-5472.CAN-16-1099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Amemiya H.M., Kundaje A., Boyle A.P. The ENCODE blacklist: identification of problematic regions of the genome. Sci. Rep. 2019;9:9354. doi: 10.1038/s41598-019-45839-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Antar A.A., Jenike K.M., Jang S., Rigau D.N., Reeves D.B., Hoh R., Krone M.R., Keruly J.C., Moore R.D., Schiffer J.T., et al. Longitudinal study reveals HIV-1-infected CD4+ T cell dynamics during long-term antiretroviral therapy. J. Clin. Invest. 2020;130:3543–3559. doi: 10.1172/JCI135953. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Archin N.M., Liberty A.L., Kashuba A.D., Choudhary S.K., Kuruc J.D., Crooks A.M., Parker D.C., Anderson E.M., Kearney M.F., Strain M.C., et al. Administration of vorinostat disrupts HIV-1 latency in patients on antiretroviral therapy. Nature. 2012;487:482–485. doi: 10.1038/nature11286. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ay F., Bailey T.L., Noble W.S. Statistical confidence estimation for Hi–C data reveals regulatory chromatin contacts. Genome Res. 2014;24:999–1011. doi: 10.1101/gr.160374.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Battivelli E., Dahabieh M.S., Abdel-Mohsen M., Svensson J.P., Tojal Da Silva I., Cohn L.B., Gramatica A., Deeks S., Greene W.C., Pillai S.K., et al. Distinct chromatin functional states correlate with HIV latency reactivation in infected primary CD4(+) T cells. Elife. 2018;7 doi: 10.7554/eLife.34655. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beagan J.A., Phillips-Cremins J.E. On the existence and functionality of topologically associating domains. Nat. Genet. 2020;52:8–16. doi: 10.1038/s41588-019-0561-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bronner I.F., Lorenz S. Combined genome and transcriptome (G&T) sequencing of single cells. Methods Mol. Biol. 2019;1979:319–362. doi: 10.1007/978-1-4939-9240-9_20. [DOI] [PubMed] [Google Scholar]
- Bruner K.M., Wang Z., Simonetti F.R., Bender A.M., Kwon K.J., SenGupta S., Fray E.J., Beg S.A., Antar A.A.R., Jenike K.M., et al. A quantitative approach for measuring the reservoir of latent HIV-1 proviruses. Nature. 2019;566:120–125. doi: 10.1038/s41586-019-0898-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bui J.K., Sobolewski M.D., Keele B.F., Spindler J., Musick A., Wiegand A., Luke B.T., Shao W., Hughes S.H., Coffin J.M., et al. Proviruses with identical sequences comprise a large fraction of the replication-competent HIV reservoir. PLoS Pathog. 2017;13 doi: 10.1371/journal.ppat.1006283. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burtt N.P. Whole-genome amplification using Phi29 DNA polymerase. Cold Spring Harb. Protoc. 2011;2011 doi: 10.1101/pdb.prot5552. pdb.prot5552. [DOI] [PubMed] [Google Scholar]
- Cesana D., Santoni de Sio F.R., Rudilosso L., Gallina P., Calabria A., Beretta S., Merelli I., Bruzzesi E., Passerini L., Nozza S., et al. HIV-1-mediated insertional activation of STAT5B and BACH2 trigger viral reservoir in T regulatory cells. Nat. Commun. 2017;8:498. doi: 10.1038/s41467-017-00609-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen H.C., Martinez J.P., Zorita E., Meyerhans A., Filion G.J. Position effects influence HIV latency reversal. Nat. Struct. Mol. Biol. 2017;24:47–54. doi: 10.1038/nsmb.3328. [DOI] [PubMed] [Google Scholar]
- Chen J.Y., Luo C.W., Lai Y.S., Wu C.C., Hung W.C. Lysine demethylase KDM2A inhibits TET2 to promote DNA methylation and silencing of tumor suppressor genes in breast cancer. Oncogenesis. 2017;6:e369. doi: 10.1038/oncsis.2017.71. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chomont N., El-Far M., Ancuta P., Trautmann L., Procopio F.A., Yassine-Diab B., Boucher G., Boulassel M.R., Ghattas G., Brenchley J.M., et al. HIV reservoir size and persistence are driven by T cell survival and homeostatic proliferation. Nat. Med. 2009;15:893–900. doi: 10.1038/nm.1972. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chun T.W., Carruth L., Finzi D., Shen X., DiGiuseppe J.A., Taylor H., Hermankova M., Chadwick K., Margolick J., Quinn T.C., et al. Quantification of latent tissue reservoirs and total body viral load in HIV-1 infection. Nature. 1997;387:183–188. doi: 10.1038/387183a0. [DOI] [PubMed] [Google Scholar]
- Ciuffi A., Llano M., Poeschla E., Hoffmann C., Leipzig J., Shinn P., Ecker J.R., Bushman F. A role for LEDGF/p75 in targeting HIV DNA integration. Nat. Med. 2005;11:1287–1289. doi: 10.1038/nm1329. [DOI] [PubMed] [Google Scholar]
- Cohn L.B., da Silva I.T., Valieris R., Huang A.S., Lorenzi J.C.C., Cohen Y.Z., Pai J.A., Butler A.L., Caskey M., Jankovic M., et al. Clonal CD4(+) T cells in the HIV-1 latent reservoir display a distinct gene profile upon reactivation. Nat. Med. 2018;24:604–609. doi: 10.1038/s41591-018-0017-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cohn L.B., Silva I.T., Oliveira T.Y., Rosales R.A., Parrish E.H., Learn G.H., Hahn B.H., Czartoski J.L., McElrath M.J., Lehmann C., et al. HIV-1 integration landscape during latent and active infection. Cell. 2015;160:420–432. doi: 10.1016/j.cell.2015.01.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cole B., Lambrechts L., Gantner P., Noppe Y., Bonine N., Witkowski W., Chen L., Palmer S., Mullins J.I., Chomont N., et al. In-depth single-cell analysis of translation-competent HIV-1 reservoirs identifies cellular sources of plasma viremia. Nat. Commun. 2021;12:3727. doi: 10.1038/s41467-021-24080-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Corces M.R., Buenrostro J.D., Wu B., Greenside P.G., Chan S.M., Koenig J.L., Snyder M.P., Pritchard J.K., Kundaje A., Greenleaf W.J., et al. Lineage-specific and single-cell chromatin accessibility charts human hematopoiesis and leukemia evolution. Nat. Genet. 2016;48:1193–1203. doi: 10.1038/ng.3646. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Corces M.R., Trevino A.E., Hamilton E.G., Greenside P.G., Sinnott-Armstrong N.A., Vesuna S., Satpathy A.T., Rubin A.J., Montine K.S., Wu B., et al. An improved ATAC-seq protocol reduces background and enables interrogation of frozen tissues. Nat. Methods. 2017;14:959–962. doi: 10.1038/nmeth.4396. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Crowley C., Yang Y., Qiu Y., Hu B., Abnousi A., Lipiński J., Plewczyński D., Wu D., Won H., Ren B., et al. FIREcaller: detecting frequently interacting regions from Hi–C data. Comput. Struct. Biotechnol. J. 2021;19:355–362. doi: 10.1016/j.csbj.2020.12.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Díaz N., Kruse K., Erdmann T., Staiger A.M., Ott G., Lenz G., Vaquerizas J.M. Chromatin conformation analysis of primary patient tissue using a low input Hi-C method. Nat. Commun. 2018;9:4938. doi: 10.1038/s41467-018-06961-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Edelman L.B., Fraser P. Transcription factories: genetic programming in three dimensions. Curr. Opin. Genet. Dev. 2012;22:110–114. doi: 10.1016/j.gde.2012.01.010. [DOI] [PubMed] [Google Scholar]
- Edgar R.C. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Einkauf K.B., Lee G.Q., Gao C., Sharaf R., Sun X., Hua S., Chen S.M., Jiang C., Lian X., Chowdhury F.Z., et al. Intact HIV-1 proviruses accumulate at distinct chromosomal positions during prolonged antiretroviral therapy. J. Clin. Invest. 2019;129:988–998. doi: 10.1172/JCI124291. [DOI] [PMC free article] [PubMed] [Google Scholar]
- ENCODE Project Consortium An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489:57–74. doi: 10.1038/nature11247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Finzi D., Hermankova M., Pierson T., Carruth L.M., Buck C., Chaisson R.E., Quinn T.C., Chadwick K., Margolick J., Brookmeyer R., et al. Identification of a reservoir for HIV-1 in patients on highly active antiretroviral therapy. Science. 1997;278:1295–1300. doi: 10.1126/science.278.5341.1295. [DOI] [PubMed] [Google Scholar]
- Fraietta J.A., Nobles C.L., Sammons M.A., Lundh S., Carty S.A., Reich T.J., Cogdill A.P., Morrissette J.J.D., DeNizio J.E., Reddy S., et al. Disruption of TET2 promotes the therapeutic efficacy of CD19-targeted T cells. Nature. 2018;558:307–312. doi: 10.1038/s41586-018-0178-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Halvas E.K., Joseph K.W., Brandt L.D., Guo S., Sobolewski M.D., Jacobs J.L., Tumiotto C., Bui J.K., Cyktor J.C., Keele B.F., et al. HIV-1 viremia not suppressible by antiretroviral therapy can originate from large T cell clones producing infectious virus. J. Clin. Invest. 2020;130:5847–5857. doi: 10.1172/JCI138099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Han Y., Lin Y.B., An W., Xu J., Yang H.C., O’Connell K., Dordai D., Boeke J.D., Siliciano J.D., Siliciano R.F. Orientation-dependent regulation of integrated HIV-1 expression by host gene transcriptional readthrough. Cell Host Microbe. 2008;4:134–146. doi: 10.1016/j.chom.2008.06.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Heinz S., Benner C., Spann N., Bertolino E., Lin Y.C., Laslo P., Cheng J.X., Murre C., Singh H., Glass C.K. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol. Cell. 2010;38:576–589. doi: 10.1016/j.molcel.2010.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hiener B., Horsburgh B.A., Eden J.S., Barton K., Schlub T.E., Lee E., von Stockenstrom S., Odevall L., Milush J.M., Liegler T., et al. Identification of genetically intact HIV-1 proviruses in specific CD4(+) T cells from effectively treated participants. Cell Rep. 2017;21:813–822. doi: 10.1016/j.celrep.2017.09.081. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ho Y.C., Shan L., Hosmane N.N., Wang J., Laskey S.B., Rosenbloom D.I., Lai J., Blankson J.N., Siliciano J.D., Siliciano R.F. Replication-competent noninduced proviruses in the latent reservoir increase barrier to HIV-1 cure. Cell. 2013;155:540–551. doi: 10.1016/j.cell.2013.09.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hosmane N.N., Kwon K.J., Bruner K.M., Capoferri A.A., Beg S., Rosenbloom D.I., Keele B.F., Ho Y.C., Siliciano J.D., Siliciano R.F. Proliferation of latently infected CD4(+) T cells carrying replication-competent HIV-1: potential role in latent reservoir dynamics. J. Exp. Med. 2017;214:959–972. doi: 10.1084/jem.20170193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huang A.S., Ramos V., Oliveira T.Y., Gaebler C., Jankovic M., Nussenzweig M.C., Cohn L.B. Integration features of intact latent HIV-1 in CD4+ T cell clones contribute to viral persistence. J. Exp. Med. 2021;218 doi: 10.1084/jem.20211427. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jiang C., Lian X., Gao C., Sun X., Einkauf K.B., Chevalier J.M., Chen S.M.Y., Hua S., Rhee B., Chang K., et al. Distinct viral reservoirs in individuals with spontaneous control of HIV-1. Nature. 2020;585:261–267. doi: 10.1038/s41586-020-2651-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jordan A., Bisgrove D., Verdin E. HIV reproducibly establishes a latent infection after acute infection of T cells in vitro. EMBO J. 2003;22:1868–1877. doi: 10.1093/emboj/cdg188. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Josefsson L., King M.S., Makitalo B., Brännström J., Shao W., Maldarelli F., Kearney M.F., Hu W.S., Chen J., Gaines H., et al. Majority of CD4+ T cells from peripheral blood of HIV-1-infected individuals contain only one HIV DNA molecule. Proc. Natl. Acad. Sci. USA. 2011;108:11199–11204. doi: 10.1073/pnas.1107729108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kaul A., Bhattacharyya S., Ay F. Identifying statistically significant chromatin contacts from Hi–C data with FitHiC2. Nat. Protoc. 2020;15:991–1012. doi: 10.1038/s41596-019-0273-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Knight P.A., Ruiz D. A fast algorithm for matrix balancing. IMA J. Numer. Anal. 2013;33:1029–1047. [Google Scholar]
- Komaki S., Shiwa Y., Furukawa R., Hachiya T., Ohmomo H., Otomo R., Satoh M., Hitomi J., Sobue K., Sasaki M., et al. iMETHYL: an integrative database of human DNA methylation, gene expression, and genomic variation. Hum. Genome Var. 2018;5:18008. doi: 10.1038/hgv.2018.8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kumar A., Rao A., Bhavani S., Newberg J.Y., Murphy R.F. Automated analysis of immunohistochemistry images identifies candidate location biomarkers for cancers. Proc. Natl. Acad. Sci. USA. 2014;111:18249–18254. doi: 10.1073/pnas.1415120112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kundaje A., Meuleman W., Ernst J., Bilenky M., Yen A., Heravi-Moussavi A., Kheradpour P., Zhang Z., Wang J., et al. Integrative analysis of 111 reference human epigenomes. Nature. 2015;518:317–330. doi: 10.1038/nature14248. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kuo H.H., Ahmad R., Lee G.Q., Gao C., Chen H.R., Ouyang Z., Szucs M.J., Kim D., Tsibris A., Chun T.W., et al. Anti-apoptotic protein BIRC5 maintains survival of HIV-1-infected CD4(+) T cells. Immunity. 2018;48:1183–1194. doi: 10.1016/j.immuni.2018.04.004. e5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Langmead B., Salzberg S.L. Fast gapped-read alignment with Bowtie 2. Nat. Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee G.Q., Orlova-Fink N., Einkauf K., Chowdhury F.Z., Sun X., Harrington S., Kuo H.H., Hua S., Chen H.R., Ouyang Z., et al. Clonal expansion of genome-intact HIV-1 in functionally polarized Th1 CD4+ T cells. J. Clin. Invest. 2017;127:2689–2696. doi: 10.1172/JCI93289. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee G.Q., Reddy K., Einkauf K.B., Gounder K., Chevalier J.M., Dong K.L., Walker B.D., Yu X.G., Ndung'u T., Lichterfeld M. HIV-1 DNA sequence diversity and evolution during acute subtype C infection. Nat. Commun. 2019;10:2737. doi: 10.1038/s41467-019-10659-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lenasi T., Contreras X., Peterlin B.M. Transcriptional interference antagonizes proviral gene expression to promote HIV latency. Cell Host Microbe. 2008;4:123–133. doi: 10.1016/j.chom.2008.05.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lewinski M.K., Bisgrove D., Shinn P., Chen H., Hoffmann C., Hannenhalli S., Verdin E., Berry C.C., Ecker J.R., Bushman F.D. Genome-wide analysis of chromosomal features repressing human immunodeficiency virus transcription. J. Virol. 2005;79:6610–6619. doi: 10.1128/JVI.79.11.6610-6619.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li B., Dewey C.N. RSEM: accurate transcript quantification from RNA-seq data with or without a reference genome. BMC Bioinform. 2011;12:323. doi: 10.1186/1471-2105-12-323. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li H., Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lian X., Gao C., Sun X., Jiang C., Einkauf K.B., Seiger K.W., Chevalier J., Yuki Y., Martin M., Hoh R., et al. Signatures of immune selection in intact and defective proviruses distinguish HIV-1 elite controllers. Sci. Transl. Med. 2021;13:1–15. doi: 10.1126/scitranslmed.abl4097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu R., Yeh Y.J., Varabyou A., Collora J.A., Sherrill-Mix S., Talbot C.C., Jr., Mehta S., Albrecht K., Hao H., Zhang H., et al. Single-cell transcriptional landscapes reveal HIV-1-driven aberrant host gene transcription as a potential therapeutic target. Sci. Transl. Med. 2020;12 doi: 10.1126/scitranslmed.aaz0802. eaaz0802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lorenzi J.C., Cohen Y.Z., Cohn L.B., Kreider E.F., Barton J.P., Learn G.H., Oliveira T., Lavine C.L., Horwitz J.A., Settler A., et al. Paired quantitative and qualitative assessment of the replication-competent HIV-1 reservoir and comparison with integrated proviral DNA. Proc. Natl. Acad. Sci. USA. 2016;113:E7908–E7916. doi: 10.1073/pnas.1617789113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maass P.G., Barutcu A.R., Rinn J.L. Interchromosomal interactions: a genomic love story of kissing chromosomes. J. Cell Biol. 2019;218:27–38. doi: 10.1083/jcb.201806052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Macaulay I.C., Haerty W., Kumar P., Li Y.I., Hu T.X., Teng M.J., Goolam M., Saurat N., Coupland P., Shirley L.M., et al. G&T-seq: parallel sequencing of single-cell genomes and transcriptomes. Nat. Methods. 2015;12:519–522. doi: 10.1038/nmeth.3370. [DOI] [PubMed] [Google Scholar]
- Macaulay I.C., Teng M.J., Haerty W., Kumar P., Ponting C.P., Voet T. Separation and parallel sequencing of the genomes and transcriptomes of single cells using G&T-seq. Nat. Protoc. 2016;11:2081–2103. doi: 10.1038/nprot.2016.138. [DOI] [PubMed] [Google Scholar]
- Maldarelli F., Wu X., Su L., Simonetti F.R., Shao W., Hill S., Spindler J., Ferris A.L., Mellors J.W., Kearney M.F., et al. HIV latency. Specific HIV integration sites are linked to clonal expansion and persistence of infected cells. Science. 2014;345:179–183. doi: 10.1126/science.1254194. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mendoza P., Jackson J.R., Oliveira T.Y., Gaebler C., Ramos V., Caskey M., Jankovic M., Nussenzweig M.C., Cohn L.B. Antigen-responsive CD4+ T cell clones contribute to the HIV-1 latent reservoir. J. Exp. Med. 2020;217 doi: 10.1084/jem.20200051. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Patro S.C., Brandt L.D., Bale M.J., Halvas E.K., Joseph K.W., Shao W., Wu X., Guo S., Murrell B., Wiegand A., et al. Combined HIV-1 sequence and integration site analysis informs viral dynamics and allows reconstruction of replicating viral ancestors. Proc. Natl. Acad. Sci. USA. 2019;116:25891–25899. doi: 10.1073/pnas.1910334116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pedregosa F., Varoquaux G., Gramfort A., Michel V., Thirion B., Grisel O., Blondel M., Prettenhofer P., Weiss R., Dubourg V., et al. Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 2012;12:2825–2830. [Google Scholar]
- Picelli S., Faridani O.R., Björklund A.K., Winberg G., Sagasser S., Sandberg R. Full-length RNA-seq from single cells using Smart-seq2. Nat. Protoc. 2014;9:171–181. doi: 10.1038/nprot.2014.006. [DOI] [PubMed] [Google Scholar]
- Pinzone M.R., VanBelzen D.J., Weissman S., Bertuccio M.P., Cannon L., Venanzi-Rullo E., Migueles S., Jones R.B., Mota T., Joseph S.B., et al. Longitudinal HIV sequencing reveals reservoir expression leading to decay which is obscured by clonal expansion. Nat. Commun. 2019;10:728. doi: 10.1038/s41467-019-08431-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pollack R.A., Jones R.B., Pertea M., Bruner K.M., Martin A.R., Thomas A.S., Capoferri A.A., Beg S.A., Huang S.H., Karandish S., et al. Defective HIV-1 proviruses are expressed and can be recognized by cytotoxic T lymphocytes, which shape the proviral landscape. Cell Host Microbe. 2017;21:494–506. doi: 10.1016/j.chom.2017.03.008. e4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- R Development Core Team . R Foundation for Statistical Computing; 2019. R: A language and environment for statistical computing.http://www.R-project.org [Google Scholar]
- Rasmussen T.A., Tolstrup M., Brinkmann C.R., Olesen R., Erikstrup C., Solomon A., Winckelmann A., Palmer S., Dinarello C., Buzon M., et al. Panobinostat, a histone deacetylase inhibitor, for latent-virus reactivation in HIV-infected patients on suppressive antiretroviral therapy: a phase 1/2, single group, clinical trial. Lancet HIV. 2014;1:e13–e21. doi: 10.1016/S2352-3018(14)70014-1. [DOI] [PubMed] [Google Scholar]
- Ren Y., Huang S.H., Patel S., Alberto W.D.C., Magat D., Ahimovic D., Macedo A.B., Durga R., Chan D., Zale E., et al. BCL-2 antagonism sensitizes cytotoxic T cell-resistant HIV reservoirs to elimination ex vivo. J. Clin. Invest. 2020;130:2542–2559. doi: 10.1172/JCI132374. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rose P.P., Korber B.T. Detecting hypermutations in viral sequences with an emphasis on G → A hypermutation. Bioinformatics. 2000;16:400–401. doi: 10.1093/bioinformatics/16.4.400. [DOI] [PubMed] [Google Scholar]
- Rouzine I.M., Weinberger A.D., Weinberger L.S. An evolutionary role for HIV latency in enhancing viral transmission. Cell. 2015;160:1002–1012. doi: 10.1016/j.cell.2015.02.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schmitt A.D., Hu M., Jung I., Xu Z., Qiu Y., Tan C.L., Li Y., Lin S., Lin Y., Barr C.L., et al. A compendium of chromatin contact maps reveals spatially active regions in the human genome. Cell Rep. 2016;17:2042–2059. doi: 10.1016/j.celrep.2016.10.061. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schoenfelder S., Clay I., Fraser P. The transcriptional interactome: gene expression in 3D. Curr. Opin. Genet. Dev. 2010;20:127–133. doi: 10.1016/j.gde.2010.02.002. [DOI] [PubMed] [Google Scholar]
- Scholtysik R., Kreuz M., Hummel M., Rosolowski M., Szczepanowski M., Klapper W., Loeffler M., Trümper L., Siebert R., Küppers R., et al. Characterization of genomic imbalances in diffuse large B-cell lymphoma by detailed SNP-chip analysis. Int. J. Cancer. 2015;136:1033–1042. doi: 10.1002/ijc.29072. [DOI] [PubMed] [Google Scholar]
- Schröder A.R., Shinn P., Chen H., Berry C., Ecker J.R., Bushman F. HIV-1 integration in the human genome favors active genes and local hotspots. Cell. 2002;110:521–529. doi: 10.1016/s0092-8674(02)00864-4. [DOI] [PubMed] [Google Scholar]
- Shao W., Boltz V.F., Hattori J., Bale M.J., Maldarelli F., Coffin J.M., Kearney M.F. Short communication: HIV-DRLink: a tool for reporting linked HIV-1 drug resistance mutations in large single-genome data sets using the Stanford HIV database. AIDS Res. Hum. Retroviruses. 2020;36:942–947. doi: 10.1089/aid.2020.0109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shukla A., Ramirez N.P., D'Orso I. HIV-1 proviral transcription and latency in the new era. Viruses. 2020;12:555. doi: 10.3390/v12050555. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Simonetti F.R., Sobolewski M.D., Fyne E., Shao W., Spindler J., Hattori J., Anderson E.M., Watters S.A., Hill S., Wu X., et al. Clonally expanded CD4+ T cells can produce infectious HIV-1 in vivo. Proc. Natl. Acad. Sci. USA. 2016;113:1883–1888. doi: 10.1073/pnas.1522675113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Simonetti F.R., Zhang H., Soroosh G.P., Duan J., Rhodehouse K., Hill A.L., Beg S.A., McCormick K., Raymond H.E., Nobles C.L., et al. Antigen-driven clonal selection shapes the persistence of HIV-1-infected CD4+ T cells in vivo. J. Clin. Invest. 2021;131 doi: 10.1172/JCI145254. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Singh A., Razooky B., Cox C.D., Simpson M.L., Weinberger L.S. Transcriptional bursting from the HIV-1 promoter is a significant source of stochastic noise in HIV-1 gene expression. Biophys. J. 2010;98:L32–L34. doi: 10.1016/j.bpj.2010.03.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Skupsky R., Burnett J.C., Foley J.E., Schaffer D.V., Arkin A.P. HIV promoter integration site primarily modulates transcriptional burst size rather than frequency. PLOS Comput. Biol. 2010;6 doi: 10.1371/journal.pcbi.1000952. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smurova K., De Wulf P. Centromere and pericentromere transcription: roles and regulation. in sickness and in health. Front. Genet. 2018;9:674. doi: 10.3389/fgene.2018.00674. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Søgaard O.S., Graversen M.E., Leth S., Olesen R., Brinkmann C.R., Nissen S.K., Kjaer A.S., Schleimann M.H., Denton P.W., Hey-Cunningham W.J., et al. The depsipeptide Romidepsin reverses HIV-1 latency in vivo. PLoS Pathog. 2015;11 doi: 10.1371/journal.ppat.1005142. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Trombetta J.J., Gennert D., Lu D., Satija R., Shalek A.K., Regev A. Preparation of single-cell RNA-seq libraries for next generation sequencing. Curr. Protoc. Mol. Biol. 2014;107 doi: 10.1002/0471142727.mb0422s107. 4.22.1–4.22.17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vansant G., Chen H.C., Zorita E., Trejbalová K., Miklík D., Filion G., Debyser Z. The chromatin landscape at the HIV-1 provirus integration site determines viral expression. Nucleic Acids Res. 2020;48:7801–7817. doi: 10.1093/nar/gkaa536. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wagner T.A., McLaughlin S., Garg K., Cheung C.Y., Larsen B.B., Styrchak S., Huang H.C., Edlefsen P.T., Mullins J.I., Frenkel L.M. HIV latency. Proliferation of cells with HIV integrated into cancer genes contributes to persistent infection. Science. 2014;345:570–573. doi: 10.1126/science.1256304. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang W., Fasolino M., Cattau B., Goldman N., Kong W., Frederick M.A., McCright S.J., Kiani K., Fraietta J.A., Vahedi G. Joint profiling of chromatin accessibility and CAR-T integration site analysis at population and single-cell levels. Proc. Natl. Acad. Sci. USA. 2020;117:5442–5452. doi: 10.1073/pnas.1919259117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wiegand A., Spindler J., Hong F.F., Shao W., Cyktor J.C., Cillo A.R., Halvas E.K., Coffin J.M., Mellors J.W., Kearney M.F. Single-cell analysis of HIV-1 transcriptional activity reveals expression of proviruses in expanded clones during ART. Proc. Natl. Acad. Sci. USA. 2017;114:E3659–E3668. doi: 10.1073/pnas.1617961114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wong J.K., Hezareh M., Günthard H.F., Havlir D.V., Ignacio C.C., Spina C.A., Richman D.D. Recovery of replication-competent HIV despite prolonged suppression of plasma viremia. Science. 1997;278:1291–1295. doi: 10.1126/science.278.5341.1291. [DOI] [PubMed] [Google Scholar]
- Yukl S.A., Kaiser P., Kim P., Telwatte S., Joshi S.K., Vu M., Lampiris H., Wong J.K. HIV latency in isolated patient CD4(+) T cells may be due to blocks in HIV transcriptional elongation, completion, and splicing. Sci. Transl. Med. 2018;10 doi: 10.1126/scitranslmed.aap9927. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
-
•
This paper does not report original code.
-
•
Data were deposited to Gene Expression Omnibus (GEO) with the following accession numbers: RNA-Seq and ATAC-Seq: GSE144334, Hi-C: GSE168337
-
•
Proviral integration sites and their transcriptional activity are listed in Tables S1 and S2.
-
•
Proviral sequences: Due to study participant confidentiality concerns, viral sequencing data cannot be publicly released but will be made available to investigators upon reasonable request and after signing a coded tissue agreement.
-
•
Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.