Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 2020 Jun 29;48(14):7801–7817. doi: 10.1093/nar/gkaa536

The chromatin landscape at the HIV-1 provirus integration site determines viral expression

Gerlinde Vansant 1,2, Heng-Chang Chen 2,2, Eduard Zorita 3, Katerina Trejbalová 4, Dalibor Miklík 5, Guillaume Filion 6,7, Zeger Debyser 8,
PMCID: PMC7641320  PMID: 32597987

Abstract

HIV-1 persists lifelong in memory cells of the immune system as latent provirus that rebounds upon treatment interruption. Therefore, the latent reservoir is the main target for an HIV cure. Here, we studied the direct link between integration site and transcription using LEDGINs and Barcoded HIV-ensembles (B-HIVE). LEDGINs are antivirals that inhibit the interaction between HIV-1 integrase and the chromatin-tethering factor LEDGF/p75. They were used as a tool to retarget integration, while the effect on HIV expression was measured with B-HIVE. B-HIVE tracks insert-specific HIV expression by tagging a unique barcode in the HIV genome. We confirmed that LEDGINs retarget integration out of gene-dense and actively transcribed regions. The distance to H3K36me3, the marker recognized by LEDGF/p75, clearly increased. LEDGIN treatment reduced viral RNA expression and increased the proportion of silent provirus. Finally, silent proviruses obtained after LEDGIN treatment were located further away from epigenetic marks associated with active transcription. Interestingly, proximity to enhancers stimulated transcription irrespective of LEDGIN treatment, while the distance to H3K36me3 only changed after treatment with LEDGINs. The fact that proximity to these markers are associated with RNA expression support the direct link between provirus integration site and viral expression.

INTRODUCTION

During infection with the human immunodeficiency virus type 1 (HIV-1), a DNA copy of the viral genome is inserted as provirus in a host chromosome by the viral integrase. Part of the proviruses enter a silent mode without gene expression rendering the virus invisible for the host immune system. These latent HIV proviruses mainly reside in long lived memory CD4+ T cells, that form a latent reservoir with an estimated half-life of 44 months (1–3). Although combination antiretroviral therapy (cART) suppresses plasma viral loads to undetectable levels, interruption of treatment allows latent proviruses to rebound (4,5). Therefore, cART is required lifelong and the latent reservoir is considered as the major barrier to cure HIV infection (6).

The predominant HIV-1 cure strategies aim to eliminate the latent reservoir (6,7). In order to achieve such cure, it is essential to understand the molecular determinants of HIV-1 latency. The impact of the chromatin environment on gene expression is only partially understood. Studies have shown that epigenetic silencing can contribute to HIV-1 latency; acetylated histones are generally associated with active provirus, while tri-methylation of H3K27 and H3K9 is associated with HIV silencing (8–13). Moreover, CpG methylation is known to maintain latency and resistance to reactivation (14,15). Less studied is the role of HIV-1 integration in the host genome and more specifically the impact of integration site selection on proviral gene expression. In 2001, the Verdin lab first postulated that the integration site affects transcription after observing highly variable expression levels between different transduced Jurkat clones (16). The Bushman lab reported that low-expressing proviruses integrate more often in genomic regions devoid of protein-coding genes and centromeric heterochromatin (17) and are less associated with DNase I sensitive sites, CpG islands and GC rich regions (18). Orientation of the provirus in the host gene was also suggested to play a role in HIV-1 expression (19–21).

The discovery of LEDGF/p75 as host factor of HIV integrase (IN) (22) led to a better understanding of the role of integration site selection. The integration step is catalyzed by IN (for a review see (23)). Lens-epithelium derived growth factor (LEDGF/p75), a transcriptional co-activator, tethers the viral pre-integration complex to transcriptionally active regions (24,25). LEDGF/p75 binds IN via its C-terminal Integrase Binding Domain (IBD) (22,26) and interacts with chromatin via its PWWP domain (27–29). The protein is known to recognize the H3K36me3 mark on nucleosomes (27–29). Depletion of LEDGF/p75 indeed shifts integration out of transcription units (30,31). In 2010 small molecule inhibitors of the interaction between IN and LEDGF/p75 were developed (32–37). These compounds, referred to as LEDGINs, are inhibitors of the well-defined interface between the IBD of LEDGF/p75 and the HIV-1 IN catalytic core (32–34,38). Potent LEDGINs with nanomolar activity inhibit HIV-1 IN allosterically (33,36–37). Later, it was found that LEDGINs also affect late stage replication steps; they disturb particle maturation by stimulating IN oligomerization, resulting in defective progeny virus (39–42). Vranckx et al. demonstrated that residual integrants after LEDGIN treatment are retargeted out of active genes to sites that are less transcriptionally active (31). At the same time, Chen et al. claimed that proviruses that integrate in proximity to endogenous enhancers display higher expression compared to those which are far from enhancers based on studies with barcoded HIV (B-HIVE) (43). Moreover, latent HIV was inserted ∼2-fold further away from endogenous enhancers than non-latent HIV (43). A similar conclusion was made by Miklik et al. who found that the proximity to active regulatory elements, particularly enhancers, correlates with stable proviral expression (44).

In the present study, we combined LEDGINs and B-HIVE to investigate the effect of integration site selection on HIV-1 transcription. B-HIVE is a technique that tags individual lentiviral vectors with a unique barcode of 20 random nucleotides (43,45). By simultaneously tracking each barcode in DNA and RNA of the same infected cells, transcription of proviruses can be correlated to the corresponding integration site (43,45). We here investigated the impact of LEDGIN-mediated retargeting on the chromosomal preference, epigenetic signature and viral expression of the residual provirus.

After treatment with LEDGINs, the distribution of HIV-1 integrants was less selective for gene-dense chromosomes (e.g. chromosomes 16, 17 and 19). LEDGINs retargeted the provirus towards silent genes and intergenic regions. They reduced overall RNA expression and increased the proportion of barcoded proviruses without expression. Finally, ‘no-expression’ sites obtained after treatment with LEDGINs were located further away from several epigenetic features associated with active transcription (H3K36me3, H3K79me2/3, H3K27ac, H3K4me1, RNAPII, H3K4me3, Med1, CBP). Interestingly, the distance of silent provirus to H3K36me3, the mark recognized by LEDGF/p75 increased after treatment with LEDGINs. In contrast, H3K27ac, H3K4me1, Med1 and CBP representative for (super-) enhancers, stimulated transcription regardless of LEDGIN-mediated retargeting. In conclusion, we show that the site of HIV-1 integration affects transcription by a combination of both general and LEDGF/p75-specific effects.

MATERIALS AND METHODS

Cell culture

SupT1 (provided by the National Institutes of Health reagent program, NIH, Bethesda, MD, USA) (46) and Jurkat cells (obtained from the cell collection of the Center for Genomic Regulation, CRG, Barcelona, Spain) were cultured in RPMI medium (GIBCO BRL) with 10% (v/v) fetal bovine serum (FBS, GIBCO) and 0.01% (v/v) gentamicin (GIBCO). HEK293T cells (generous gift from O. Danos, Evry, France) (47) were cultured in Dulbecco modified Eagle's medium (DMEM, GIBCO, Dublin, Ireland) with 5% (v/v) fetal bovine serum (FBS, GIBCO) and 0.01% (v/v) gentamicin (GIBCO). Cells were cultured at 37°C in a humidified atmosphere containing 5% CO2. All cells were verified to be mycoplasma free.

Vector production

Linear polyetylenimine (PEI, Polysciences) was used to co-transfect HEK293T cells with the barcoded transfer plasmid pHCC1, Δ8.91 packaging plasmid and a vesicular stomatitis virus G (VSV-G) protein encoding plasmid to produce VSV-G-pseudotyped vectors. The barcoded plasmid pHCC1 was produced as described before (43). Six hours post transfection, cells were washed twice with phosphate buffered saline (PBS) to remove the excess of plasmid. Supernatant was collected 72 h post-transfection and filtered through a 0.45 μm pore membrane (Merck, Overijse, Belgium). The vector was concentrated using a Vivaspin with a 15–50 kDa cut-off column (Merck) and washed three times with PBS. Next, the vector was treated with 100 U/ml DNase (Roche Diagnostics, Vilvoorde, Belgium) for 1 h at 37°C and stored at −80°C.

Transduction, DNA and mRNA purification

40 000 SupT1 and Jurkat cells were transduced with barcoded vector in the presence of different concentrations of LEDGIN CX014442 (33) (6.25, 15.62 and 31.25 μM) in a 96-well plate. The plates were centrifuged for 90 min at 1000 G. Twenty four hours after transduction, cells were washed twice with PBS and resuspended in medium containing LEDGIN CX014442 at the same concentrations of 6.25, 15.62 or 31.25 μM in a 96-well plate. GFP positive and negative cells were either sorted at day 4 and cultivated for 2 weeks, or directly cultivated for 2 weeks without sorting. While cultivating the cells, medium without LEDGINs was used. Sorting was performed on the S3e Cell Sorter (Bio Rad, Temse, Belgium). Genomic DNA and total RNA was isolated with an AllPrep DNA/RNA Mini Kit (Qiagen). The mRNA fraction was isolated from total RNA with an Oligotex mRNA Mini Kit (Qiagen).

LEDGIN CX014442

LEDGIN CX014442 has been characterized previously using different techniques and viruses (33). Here we used a replication-deficient vector and calculated an IC50 of 6.5 μM based on inhibition of integrated copies in single round experiments. Thus the concentrations of 6.25, 15.62 and 31.25 μM correspond approximately to 1×, 2.5× and 5× IC50, respectively.

Library preparation and sequencing

All DNA and RNA sequencing libraries were prepared as described before (43). DNA library preparation was performed via inverse PCR. mRNA was reverse transcribed to cDNA and amplified via barcode specific PCR. DNA integration site samples were sequenced with 76-bp paired-end Illumina next generation sequencing. RNA expression and DNA normalization samples were sequenced as 50-bp single reads on Illumina HiSeq.

Flow cytometry and cell sorting

GFP expression was measured to determine transduction efficiency using a Guava® Easycyte 5HT flow cytometer (Merck, Overijse, Belgium) with a 488 nm, 50 mW laser and 525/30 nm band pass filters. Before flow cytometry analysis, cells were fixed for 15 min at room temperature in 2% paraformaldehyde. GFP positive and negative cells were sorted 4 days after transduction by using the Bio-Rad S3™ cell sorter (Bio-Rad, Temse, Belgium) with a 488 nm, 100 mW laser. Prior to sorting, cells were washed and resuspended in PBS.

qPCR

Total integrated HIV DNA was quantified using a nested real-time Alu-LTR qPCR (48). The first round PCR reaction mix consisted of 5 μl of genomic DNA, 10 μl of iQ supermix (Bio rad, Temse, Belgium), 0.5 μl of each primer (20 μM, Alu FW: TCCCAGCTACTGGGGAGGCTGAGG, Alu RV: TGCTGGGATTACAGGCGTGAG and HIV-1 LTR FW: GCTAACTAGGGAACCCACTGCTTA) and 3.5 μl of water. Cycling conditions for the first round PCR were 95°C for 10 min, followed by 15 cycles of 95°C for 30 s, 60°C for 40 s and 72°C for 3.5 min. 5 μl of the first-round product was added to a second round PCR mix containing 10 μl of iQ supermix, 0.5 μl of forward and reverse primer (20 μM, HIV-1 LTR FW: AGCTTGCCTTGAGTGCTTCAA, HIV-1 LTR RV: TGACTAAAAGGGTCTGAGGGATCT), 1 μl of probe (5 μM, 5′-FAM-TTACCAGAGTCACACAACAGACGGGCA-TAMRA-3′) and 3 μl of water. The second round PCR was performed in a LightCycler 480 (Roche Life Science, Vilvoorde, Belgium) for 5 min at 95°C, followed by 45 cycles of 95°C for 15 s, 60°C for 30 s and 72°C for 1 min. In parallel, a CCR5 qPCR was performed as previously described (49) to normalize for total input DNA. All samples were run at least in duplicate. Data were analyzed using the provided LightCycler 480 software.

Bisulfite cytosine methylation analysis

SupT1 cells were transduced with OGH vector in the presence of varying concentrations of LEDGIN CX014442 (ranging between 3.12 and 25 μM). Two weeks post transduction, genomic DNA was extracted with the QIAamp DNA mini kit (Qiagen). 1 μg of gDNA was used for bisulfite conversion using the Epitect Plus DNA Bisulfite Kit (Qiagen) and eluted in a total of 20 μl of water. Bisulfite-treated DNA was amplified by PCR specific for the 5′ LTR in a 50 μl reaction mixture. PCR was performed with ∼75 ng of genomic DNA. Following primers were used: FW GGtAGAAtTAtAtAttAGGGttAGGGGTt, RV CACCCATCTCTCTCCTTCTAaCCTC. The sense primers contained T and the antisense primers A instead of C in positions complementary to non-methylable C (i.e. C in CpG dinucleotides). The reaction mixture contained 50 mM Tris–HCl (pH 9.2), 2.5 mM MgCl2, each dNTP at 200 μM, 320 nM each primer and 1 U of Platinum Taq Polymerase (Invitrogen), two mg of HotStart-IT Binding Protein (Affymetrix), and 1.5 μl of bisulfite-treated template DNA. Forty cycles were run under following conditions: 94°C for 20 s, 58°C for 50 s, and 72°C for 60 s. At least three primary PCRs were performed for each sample to exclude amplification of one template molecule. Non-converted DNA did not provide bands. Several non-template controls were included in each bisulfite PCR reaction. Amplification products were cloned in the pGEM-T-EasyVector System (Promega, Madison, WI, USA) and sequenced. Analysis was performed using the Quma (Quantification tool for Methylation Analysis) software (http://quma.cdb.riken.jp/). Only PCR clones with at least 95% conversion of cytosines outside CpGs were taken into account. When more converted molecules with identical sequences were obtained, only one was used for calculation of the methylated CpG percentage to minimize the bias originating from the preferential amplification of one molecule.

RNA sequencing

We extracted total RNA from SupT1 and Jurkat cells, both untreated cells and cells cultured in the presence of 31.25 μM of LEDGIN CX014442, using the Aurum™ total RNA mini kit (Bio Rad). Sequencing libraries were prepared with the 3′ mRNA-seq library prep kit (Lexogen) and sequenced on Illumina HiSeq 4000.

ChIP-sequencing

We generated ChIP-seq samples from untreated SupT1 and Jurkat cells, and from Jurkat cells cultured in the presence of 31.25 μM of LEDGIN CX014442 by using the Magna ChIP™ A/G Chromatin Immunoprecipitation Kit (Merck Millipore). Immunoprecipitations were performed with ChIP-grade antibodies against H3K36me3 (ab9050, Abcam) and H3K27ac (ab4729, Abcam). Samples were sequenced on Illumina HiSeq 4000.

Gene enrichment analysis

We applied the function enrichKEGG involved in the R package clusterProfiler (50) to assay the enrichment Kyoto Encyclopedia of Genes and Genomes (KEGG) categories from less frequently HIV-targeted genes and genes harboring non-RNA expressing provirus with the false discovery rate (FDR) control represented by adjusted P-values. An adjusted P-value takes multiple statistic tests (individual P-values) into account in one entire dataset (51,52). We used the Benjamini–Hochberg procedure (53) to compute adjusted P-values for each enriched KEGG pathway in given gene sets. The Gene Ratio was calculated by taking the number of unique genes overlapping with those involved in a specific KEGG pathway (value k) divided by the number of unique genes overlapping with those in the collection of tested KEGG pathways (value n), based on the definition in the R package clusterProfiler. The equation for calculating Gene Ratio can thus be written as follows: GeneRatio = k/n. The output genes (value k) from the enriched pathways with significant adjusted P-values were annotated aside the node shown in the cnetplots.

Quantification and statistical analysis

GraphPad Prism version 7.00 was used for statistical analysis (GraphPad Software, La Jolla, CA, USA, www.graphpad.com). Differences in distribution of integration sites across chromosomes and different genome categories were assessed by the Chi square test. The non-parametric Kruskal–Wallis with Dunn's multiple comparison test was used to compare distances of integration sites to certain features. In case other tests were used, the information is specified in the main text and figure legends.

Bioinformatic analyses

ChIP-seq reads were mapped on GRCh37/hg19 with BWA-MEM with default parameters and a minimum mapping quality of 20. The targets were identified with Zerone v1.0 (54) with options ‘–list-output’ and ‘–confidence 0.99’.

Sequencing reads from Jurkat and SupT1 mRNA were mapped to Ensembl cDNA assembly GRCh37 (release 75) with kallisto (55) 46 with options ‘–single’ (single-end mode), ‘–bias’ (sequence bias correction, ‘–s300’ (fragment length 300 nucleotides) and ‘–l100’ (s.d. 100 nucleotides). The counts of the different isoforms were summed, thus generating a total count per gene copy in transcripts per million.

Identification of barcodes and HIV integration sites was performed as described before (43). The human genome was partitioned into six types: active genes, silent genes (genes refer to only protein-coding genes), active promoters, enhancers, intergenic regions and repeats. Active genes were defined as the 60% most expressed. Active promoters were defined as the regions spanning 5000 bp centered on the transcription start sites of active genes. Insertions were considered to be in the vicinity of an enhancer if their mapped location was within 2500 bp of a H3K27ac-enriched region. Insertions close to enhancers were in the enhancer category even if they were inserted inside a gene or a promoter. Genomic regions with bwa mappability score <20 were considered repeats and repeat classes were determined by classifying the raw FASTQ sequence with RepeatMasker. The rest of the genome was classified as intergenic.

Data and code availability

All data-processing steps are documented in a Docker virtual machine available at https://github.com/ezorita/bhive. The datasets generated during this study are deposited at Gene Expression Omnibus (GEO), series GSE135295. Additional ChIP-Seq data for H3K36me3, H3K79me3, H3K27ac, H3K4me1, RNAPII, H3K4me3, H3K9me3 (GSE65687) (56), H3K79me2 (GSE60104) (57), Med1 (GSE59657) (58) and CBP (GSE17954) (59) were downloaded from GEO.

RESULTS

LEDGINs reduce the chromosomal bias of HIV integration

We have previously shown that LEDGIN treatment inhibits HIV-1 integration and that residual integrants are more often in a transcriptionally inactive state that is refractory to reactivation (31). Still, it is not clear whether the nature of the chromatin landscape surrounding the insertion site of those retargeted viruses can explain their latent state. We thus applied the B-HIVE technology (43,45) to track individual barcoded viruses retargeted by LEDGINs on a genome-wide scale.

In this study, we adapted the transduction process used in the B-HIVE technology (Figure 1A). The improved method allows us to shorten the cultivation time needed to acquire sufficient genomic material and to retrieve at least four times more high-confidence insertion sites in the condition without LEDGIN compared to the previous method (43,45). Briefly, we transduced 40 000 SupT1 and Jurkat T cells with barcoded vector expressing GFP driven by the LTR promoter in the presence of 6.25, 15.62 or 31.25 μM of LEDGIN CX014442 in a 96-well plate (Figure 1A). Transduced cells were collected two weeks post transduction to map insert-specific expression. Inhibition by LEDGIN CX014442 was confirmed by flow cytometry (Figure 1B) and Alu-LTR qPCR (Figure 1C) 14 days post transduction in both SupT1 and Jurkat cells. Both the number of integrated copies and the percentage of GFP positive cells decreased with increasing concentration of LEDGIN CX014442. While on average 10 000 insertion sites were retrieved in the control samples, only 500 were obtained in cells treated with 31.25 μM of LEDGIN (Figure 1D).

Figure 1.

Figure 1.

LEDGINs inhibit integration and reduce chromosomal preference. (A) 40 000 Jurkat or SupT1 cells were transduced in a 96-well plate with barcoded HIV in the presence of varying concentrations of LEDGIN CX014442. Cells were cultured for two weeks before DNA and RNA were extracted. (B) The percentage of GFP positive cells two weeks post transduction of SupT1 (black) and Jurkat (gray) cells in the presence of LEDGIN CX014442. Bars represent the average of flow cytometry measurement in duplicate with standard deviation. (C) The number of integrated copies two weeks post transduction, as determined by Alu-LTR nested qPCR on gDNA that was normalized to CCR5. Bars represent the average of qPCR in duplicate with standard deviation. (D) The number of retrieved integration sites in SupT1 and Jurkat cells. (E, G) Relative number of mapped insertions/Mb plotted for each chromosome in SupT1 (E) and Jukat (G) cells whereby the sum of all chromosomes per condition is 100. Different colors represent different concentrations of LEDGINs added during transduction. (F, H) XY-plot showing the relative number of mapped insertions/Mb (y-axis, log2 scale) over the gene density of each chromosome (x-axis, log2 scale) in SupT1 (F) and Jurkat (H) cells. Thus, each value on the x-axis corresponds to a certain chromosome (see also Supplementary Table S1 for gene densities) and to four y-values, one for each condition. The lines are the result of regression analysis (see also Supplementary Table S2). Two experiments were performed in Jurkat cells and two in SupT1 cells. Results are shown for one representative experiment, referred to as experiment A, in which SupT1 and Jurkat cells were transduced in parallel. GFP; Green Fluorescent Protein, Mb; megabase.

HIV-1 prefers to integrate in chromosomes 16, 17 and 19 as these chromosomes have a high gene density (24). The obtained chromosomal distribution of HIV-1 integration was in line with previous studies (24,60) and reproducible in both SupT1 (Figure 1E) and Jurkat cells (Figure 1G). Treatment with LEDGIN CX014442 during infection significantly altered the chromosomal distribution as determined by the Chi-square test (P< 0.0001). The relative number of mapped insertions/megabase (Mb) decreased in chromosome 19 and to a lesser extent in chromosome 16 and 17 upon addition of LEDGIN CX014442 (Figure 1E and G, Supplementary Table S1). In other less gene-dense chromosomes more variability was observed (Supplementary Table S1). The chromosomal distribution of integration sites positively correlated with gene density of the chromosomes (Figure 1F and H). The average slope of the regression line dropped from 0.58 and 0.77 insertions per gene in Jurkat and SupT1 controls, respectively, to 0.31 and 0.36 in Jurkat and SupT1 cells treated with 31.25 μM of CX014442 (Jurkat P = 0.0172, SupT1 P< 0.0001, ANCOVA; Supplementary Table S2). Altogether, these results show that LEDGINs retarget integration away from gene dense regions.

LEDGIN treatment retargets insertion sites towards silent genes and intergenic regions

We have previously shown that the genomic context and more specifically enhancer elements influence HIV-1 gene expression (43,44). In order to investigate whether LEDGINs can alter HIV-1 expression by retargeting insertion sites towards genomic regions that disfavor HIV expression, we first analyzed integration sites relative to some genomic features. We partitioned the genome into four types including active (protein-coding) genes (AG), silent (protein-coding) genes (SG), regulatory elements (RE) and intergenic regions (IR) based on SupT1 and Jurkat cell mRNA sequencing. Although the gene expression profile differed between SupT1 and Jurkat cells, treatment with 31.25 μM of CX014442 did not affect gene expression (Supplementary Figure S1). Since the proportion of promoter regions in the genome is limited, regions of active promoters and enhancers were combined as regulatory elements in this study. After calculating the proportion of integration sites in each category, we found that more proviruses were retargeted to SG and IR upon addition of increasing concentrations of LEDGIN CX014442 in SupT1 and Jurkat cells (P< 0.0001, Chi square test) (Figure 2). We further characterized the IR containing HIV integration sites and found that short and long interspersed nuclear elements (SINE and LINE), and retrotransposons were most frequently targeted. Addition of LEDGINs did not significantly alter the distribution of HIV within different types of IR (Supplementary Figure S2).

Figure 2.

Figure 2.

LEDGIN treatment retargets integration away from active genes. Integration sites retrieved in each condition in SupT1 (A) and Jurkat (B) cells were divided in four genomic categories and plotted as relative proportions: silent genes (SG), intergenic regions (IR), regulatory elements containing enhancers and promoters (RE) and active genes (AG).

Next, we evaluated the effect of LEDGIN treatment on the type of genes that were targeted by applying the R package clusterProfiler on enriched pathways based on Kyoto Encyclopedia of Genes and Genomes (KEGG) (50,61–63). We retrieved the HIV-targeted genes lost after increasing the concentration of LEDGINs in SupT1 and Jurkat cells, referred to as ‘less frequently targeted genes’ (Supplemental results, Figures S3 and S4, Tables S3 and S4). The majority of these genes are highly transcribed genes (Supplementary Figures S3A and S4A). The endogenous gene expression of less frequently targeted genes after treatment with 31.25 μM of CX014442 was most affected. However, the change in endogenous gene expression was not consistent among genes involved in the same pathway (Supplementary Figures S3E, G and S4E, G).

LEDGIN treatment increases the distance of integration sites to H3K36me3

Next, we plotted the distance of integration sites to certain epigenetic features (overview of features in Supplementary Table S5). Of all features analyzed, integration occurred closest to H3K36me3, the epigenetic histone modification recognized by LEDGF/p75 (27,28) and a marker for actively transcribed genes (64). Integration located on average at 1 kb distance from H3K36me3 in control conditions, indicating that this is an important target for HIV integration. LEDGIN treatment significantly increased the distance to H3K36me3 in both SupT1 and Jurkat cells (P< 0.0001, Kruskal–Wallis test) (Figure 3A). We also observed a LEDGIN-mediated increase in distance to H3K79me3 and me2 (ranging between 15 and 100 kb in control conditions) (Figure 3B and C) that are associated with gene bodies. H3K79me3 is bound by Hepatoma-derived growth factor-related protein 2 (HRP-2) (65), a paralogue of LEDGF/p75 that can take over its targeting function (66) and is inhibited by LEDGINs as well (67). The distance to H3K27ac and H3K4me1 (Figure 3D and E), both associated with enhancers (68), was not significantly altered. Finally, LEDGINs slightly increased the distance to RNAPII and H3K4me3 (Figure 3F and G), associated with transcription start sites (TSS) and promoters (69), while integration occurred somewhat closer to the silent chromatin marks H3K9me3 (Figure 3H) and to a lesser extent to H3K27me3 (data not shown). All analysis were performed using online available ChIP-seq data from untreated Jurkat cells. We obtained similar results when using ChIP-seq data that we generated in house in LEDGIN treated Jurkat cells and in SupT1 cells (Supplementary Figure S5).

Figure 3.

Figure 3.

LEDGIN treatment retargets integration away from H3K36me3 in SupT1 and Jurkat cells. The median distance in base pairs (bp) between the integration site and certain features is plotted for two experiments in SupT1 cells and three in Jurkat cells. Panels A-H plot the distance to: (A) H3K36me3, (B) H3K79me3, (C) H3K79me2, (D) H3K27ac, (E) H3K4me1, (F) RNAPII, (G) H3K4me3 and (H) H3K9me3 (See also Supplementary Table S3 for explanation of different markers). Statistical significance was calculated by the Kruskal–Wallis test, * P< 0.05, ** P< 0.01, *** P< 0.001, **** P< 0.0001. RNAPII; RNA polymerase II.

We compared our results with sequencing results in LEDGF/p75 depleted SupT1 and Jurkat cells (Supplementary Figures S6 and S7). Although depletion of LEDGF/p75 in both cell lines shifted integration in a qualitatively similar manner to treatment with LEDGINs, 31.25 μM of CX014442 shifted integration more than depletion of LEDGF/p75. Overall, these data confirm that LEDGINs retarget integration out of active genes, towards silent genes and intergenic regions, implying a manifest role of LEDGF/p75 in selecting gene regions for transcription.

As methylation of the HIV 5′ LTR restricts HIV reactivation and contributes to latency as shown in cell lines and in patient samples (14,15), we next investigated the effect of LEDGINs on CpG methylation. We performed bisulfite cytosine methylation analysis on DNA sequences from cells treated with varying concentrations of LEDGIN during transduction (Supplementary Figure S8). 1% of sequenced CpG dinucleotides in the 5′ LTR of the control sample was methylated. Methylation significantly increased up to 3.3% when treated with 12.5 μM of CX014442 (P = 0.014, Chi-square test). However, there was no clear dose-response effect since at a concentration of 25 μM LEDGIN no further increase in methylation was detected.

LEDGIN treatment reduces viral RNA expression

LEDGIN treatment was previously shown to increase HIV-1 latency using different reporter viruses in both cell lines and primary CD4+ T cells (31,48). In this study, we calculated RNA expression levels by using the B-HIVE method that is not limited to averaged readouts. mRNA was extracted from cells transduced in the presence of varying concentrations of LEDGIN CX014442 and reverse transcribed to cDNA. We were able to calculate expression levels of individual proviruses by normalizing cDNA counts of each barcode to the corresponding number of DNA counts. LEDGINs significantly reduced the median RNA expression per DNA copy in Jurkat cells (P< 0.0001, Kruskal–Wallis test) (Figure 4A). These results were confirmed in three independent experiments both in SupT1 and Jurkat cells (Figure 4B). We also retrieved many barcodes in the DNA without RNA expression. LEDGINs increased the percentage of these ‘no RNA’ barcodes up to 25% of total barcodes (P< 0.0001, Chi square test) (Figure 4C). We assume that these results indicate an overall reduction in HIV-1 RNA expression. Although we cannot entirely exclude the possibility that some of these non-expressing barcodes result from sequencing errors, we used these non-expressing barcodes to further characterize silent provirus. Interestingly, analysis of the integration landscape per chromosome revealed that some highly expressing barcodes persisted even after treatment with 31.25 μM of LEDGIN CX014442 (e.g. in chromosome 2, Figure 4D), while in some chromosomes LEDGINs reduced all expression (e.g. in chromosome 19, Figure 4E). Both high and low expressors were observed in several chromosomes and results varied between different experiments, indicating that this is not a chromosome specific effect.

Figure 4.

Figure 4.

LEDGIN treatment reduces RNA expression. Two weeks post transduction in the presence of varying concentrations of LEDGIN CX014442, mRNA was extracted and reverse transcribed to cDNA to determine RNA expression. (A) Expression in Jurkat cells (experiment B) was determined using the B-HIVE method. The expression score is calculated for each unique barcode (dot) by normalizing the barcode counts in the RNA to the corresponding DNA counts. Error bars represent median and interquartile range. (B) Median RNA expression scores for two independent experiments in Jurkat cells (experiment A and B) and one in SupT cells (experiment A). (C) The percentage of DNA barcodes that did not show RNA expression was calculated relative to the total number of barcodes found in each condition in SupT1 and Jurkat cells. (D) Expression landscape of barcodes integrated in chromosome 2. Each bar represents an integration site and the height of the bar corresponds to the expression level. (E) Expression landscape of barcodes integrated in chromosome 19. Statistical significance was calculated by the Kruskal–Wallis test, **** P< 0.0001, *P< 0.05.

Additionally, a B-HIVE experiment was performed in GFP-sorted SupT1 cells. Four days post transduction, GFP positive (+) and negative (–) cells were sorted and cultivated to perform DNA integration site sequencing. As expected LEDGINs reduced the total number of integration sites, although the relative proportion of integration sites retrieved in the GFP(–) population increased (Supplementary Figure S9A). Interestingly, the genomic distribution of integration sites from GFP(+), GFP(–) and unsorted SupT1 cells was comparable. Integration sites from all conditions, also from GFP(-) cells, were mostly found in active genes in the absence of LEDGINs (Supplementary Figure S9B). Yet, integrations obtained in GFP(–) cells seemed less associated with regulatory elements. LEDGINs had a similar retargeting effect in all conditions regardless of the sorting. Integration in active genes was reduced, while more provirus was found in intergenic regions and silent genes.

In conclusion, all methods show a reduction of RNA expression per provirus after LEDGIN treatment. Sorting of cells based on GFP expression provided no added value for further analysis of retargeting effects.

HIV expression is influenced by both general and LEDGF/p75-specific epigenetic features

One major goal of this study was to investigate the link between integration sites and HIV-1 transcription. By using B-HIVE, we indeed show that LEDGINs retarget integration and reduce viral RNA expression. The following key question was to find out where these silent, non-RNA expressing proviruses are located. Therefore, we calculated the distances of these ‘no RNA’ barcoded proviruses (RNA expression = 0) to some epigenetic features and compared these barcodes with their transcriptionally active counterparts in each condition. The Kruskal-Wallis test was used to evaluate whether the distance of the ‘no RNA’ barcodes was altered compared to ‘all’ barcodes (Supplementary Figure S10). The median data for all three experiments in SupT1 and Jurkat cells revealed a good reproducibility of these distance plots (Figure 5). Interestingly, H3K36me3 affected barcode expression only after treatment with LEDGINs (Figure 5A). Although the distance to H3K36me3 did not change in cells without or with 6.25 μM of LEDGIN, the distance of DNA barcodes without RNA to all barcodes increased up to maximum eight and 15-fold when treating cells with 15.62 and 31.25 μM of CX014442 (Figure 5A, Supplementary Table S6). This result proves that LEDGF/p75 is a default determinant of integration site selection and that LEDGIN-mediated retargeting away from H3K36me3, as shown in Figure 3, negatively affects HIV-1 RNA expression. Non expressing proviruses were also 2- to 6-fold more separated from H3K79me3 and me2, associated with gene bodies, in cells treated with 15.62 and 31.25 μM of LEDGIN, respectively (Figure 5B and C). On the other hand, the median distance of ‘no RNA’ sites to the enhancer marker H3K27ac was approximately two times larger compared to ‘all’ sites in each condition, with or without LEDGINs (Figure 5D, Supplementary Table S4). These results indicate that enhancers affect transcription as reported (43,44). A similar, although less pronounced, effect was observed for another enhancer marker H3K4me1 (Figure 5E, Supplementary Table S6). RNAPII and H3K4me3, enriched near active promoters (69), stimulated transcription also in the absence of LEDGINs (Figure 5F and G, Supplementary Table S6). Treatment with LEDGIN CX014442 increased the distances between ‘all’ sites and ‘no RNA’ sites even more for all of these features. RNA expression was not affected by the distance to the silent chromatin mark H3K9me3 (69) (Figure 5H, Supplementary Table S6). We also investigated two markers associated with super-enhancers: Mediator 1 (Med1) and CREB-binding protein (CBP) (Figure 6) (70). As shown in Figure 6A and B, LEDGINs did not significantly change the distance of integration to the super-enhancer markers. Still, ‘no RNA’ provirus was located at increased distance to super-enhancers (1.3- to 7-fold increase compared to ‘all’ sites) (Figure 6C and D).

Figure 5.

Figure 5.

Silent barcoded provirus is targeted away from epigenetic features associated with active transcription. The distance in base pairs (bp) of integration sites to certain features was determined for either ‘all’ retrieved insertion sites or the ‘no RNA’ sites in each condition (0, 6.25, 15.62 or 31.25 μM of CX014442). The median distance (bp) is plotted on the y-axis for two independent experiments in Jurkat cells (experiment A blue, experiment B black) and one experiment in SupT1 cells (experiment A green). Panels A-H plot the distance to: (A) H3K36me3, (B) H3K79me3, (C) H3K79me2, (D) H3K27ac, (E) H3K4me1, (F) RNAPII, (G) H3K4me3 and (H) H3K9me3. Statistical significance was calculated by the Kruskal–Wallis test, * P< 0.05, ** P< 0.01, *** P< 0.001, **** P< 0.0001. bp; base pairs, RNAPII; RNA polymerase II.

Figure 6.

Figure 6.

Effect of LEDGINs on super-enhancer markers Med1 and CBP. (A, B) Integration site analysis in SupT1 and Jurkat cells determined the distance in base pairs of HIV provirus to super-enhancer markers Med1 (A) and CBP (B) in each condition (0, 6.25, 15.62 or 31.25 μM of CX014442). The median distance (bp) is plotted on the y-axis for two independent experiments in Jurkat cells (experiment A blue, experiment B black) and two in SupT1 cells (experiment A green, experiment B gray). (C, D) The distance in base pairs (bp) of integration sites to Med1 (C) and CBP (D) was determined for ‘no RNA’ sites in each condition (0, 6.25, 15.62 and 31.25 μM of CX014442) and plotted next to ‘all sites’. The median distance (bp) is plotted on the y-axis for two independent experiments in Jurkat cells and one experiment in SupT1 cells. Statistical significance was calculated by the Kruskal–Wallis test, *P< 0.05, **P< 0.01, ***P< 0.001, ****P< 0.0001. bp; base pairs, Med1; Mediator 1, CBP; CREB-binding protein.

Next, we further characterized the non-RNA expressing proviruses by looking at the distribution across genic/non-genic regions and targeted genes. LEDGIN treatment retargeted integration away from active genes towards intergenic regions and silent genes. Interestingly, the observed retargeting effect seen for silent provirus (Supplementary Figure S11) was more pronounced compared to expressing proviruses (Figure 2). Secondly, we evaluated genes harboring non-RNA expressing provirus via the enrichment analysis based on KEGG categories (Supplemental results, Figure S12 and S13, Tables S7 and S8). Both in SupT1 and in Jurkat cells, we did not observe a direct link between endogenous gene expression and proviruses that do not transcribe RNA (Supplementary Figures S12 and 13).

In conclusion, these results indicate that the chromatin landscape at the site of integration determines HIV-1 RNA expression, independently of endogenous gene expression. Transcription depends on multiple determinants, among which H3K36me3, the recognition marker for LEDGF/p75, and enhancers, characterized by H3K27ac and H3K4me1, stand out. The positive correlation with proximity to (super-) enhancers is independent of LEDGF/p75, possibly explaining the presence of few residual high expressors after LEDGIN treatment. Indeed, opposite to non-expressing barcodes, the 10% highest expressing barcodes were found closer to H3K27ac and H3K4me1 (Supplementary Figure S14D and E), and to Med1 and CBP (data not shown). In line with results from Figure 5, high expressing sites were also closer to RNAPII and H3K4me3 (Supplementary Figure S14F and G). Of note, results obtained with high expressors were more variable due to a lower number of barcodes, especially in LEDGIN treated cells.

DISCUSSION

The latent reservoir is the major hurdle for curing HIV and hence the main target for cure strategies (7,71). The extensively studied ‘shock-and-kill’ strategy aims to reactivate latent provirus followed by killing of reactivated cells by viral cytopathic effects or immune clearance (72,73). As such, this strategy attempts to eradicate the entire reservoir. This method is confounded by the high complexity of the latent reservoir and insufficient potency of presently available latency reversing agents (LRA) (74–76). It was shown that <5% of the reservoir reactivates upon stimulation (13). Moreover, reactivation of latent provirus is influenced by the site of integration indicating that a combination of multiple LRAs would be required (43). More recently, a strategy called ‘block-and-lock’ was proposed, that aims to permanently lock HIV provirus in a silent state unable to resume viral replication upon cART interruption (7,71). This latent state may be maintained by blocking HIV transcription with an inhibitor of trans-activator of transcription (Tat) (77–79). Alternatively, in order to permanently silence HIV, it might be feasible to retarget provirus to sites that are less susceptible to reactivation as shown with LEDGINs (31,48,80). Whatever strategy used, a better understanding of the role of the integration site in HIV latency is required. Here we used LEDGINs as a tool to study the effect of the chromatin landscape on viral RNA expression by retargeting HIV-1 integration. We used barcoded HIV-1 vectors to link single proviruses with chromatin features and transcription.

First, we confirmed that LEDGIN treatment during infection inhibits HIV-1 integration and retargets residual integrants. In line with previous results (31) the LEDGIN CX014442 retargeted integration out of active genes towards silent genes and intergenic regions (Figures 2 and 3). Integration frequency correlated with the gene density of chromosomes in agreement with previous results, but the chromosomal distribution was strikingly less selective for gene-dense chromosomes upon addition of LEDGIN (Figure 1, Supplementary Tables S1 and S2). LEDGINs profoundly affected the distance of HIV-1 provirus to H3K36me3, the recognition mark of LEDGF/p75 (27–29) (Figure 3): the distance increased from 1 kb on average in control samples to >10 kb in cells treated with 31.25 μM of CX014442. These results indicate that LEDGF/p75 is the predominant determinant of integration site selection. Although depletion of LEDGF/p75 altered the integration pattern in an analogous manner, the retargeting effect of 31.25 μM of LEDGIN CX014442 was stronger (Supplementary Figures S6 and S7). The stronger potency of LEDGIN compared to LEDGF/p75 depletion was documented before (31). Either high concentrations of LEDGINs reduce IN binding to LEDGF/p75 more than RNAi-mediated depletion of LEDGF/p75, or inhibition of the interaction between IN and HRP-2 plays a role in the overall phenotype. Hepatoma-derived growth factor-related protein 2 (HRP-2) (65), a paralogue of LEDGF/p75 that can take over its targeting function (66), is inhibited by LEDGINs as well (67). Of note, HRP-2 binds to H3K79me3, a feature found in transcribed genes (81–83) that was less favored for integration upon addition of LEDGINs. The fact that the distance to H3K79me3 was >70 kb in control samples and that knockdown of LEDGF/p75 affected the distance to this marker as well (Supplementary Figures S6 and S7), suggests that the role of HRP-2 may be rather small. Intriguingly, LEDGINs did not significantly alter the distance of provirus to the main enhancer markers H3K27ac and H3K4me1. Additionally, LEDGINs had a minor effect on the distance of HIV-1 provirus to RNAPII, H3K4me3 and H3K9me3, found at transcription start sites, promoters and transcriptionally silent regions, respectively (69). Finally, we investigated whether LEDGINs affect CpG methylation in the viral LTR promoter, as methylation was described to contribute to viral latency (14,15). In these experiments, LEDGINs had no major effect on methylation (Supplementary Figure S8). However, since in this experiment cells were not sorted based on GFP expression and only 10 to 20 sequences per condition were analyzed, these results might not be entirely representative for the latent population obtained after treatment with LEDGINs.

Next, we investigated the effect of LEDGIN-mediated retargeting on RNA expression. In agreement with our previous work (31), we showed that LEDGIN treatment reduced RNA expression per provirus in SupT1 and Jurkat cells (Figure 4). Consistently, a higher proportion of barcodes without RNA expression was detected in cells treated with LEDGINs. We investigated whether these silent barcodes are linked to certain chromatin features and found epigenetic markers that influenced HIV-1 transcription either in a LEDGF/p75-dependent or independent manner. After treatment with LEDGINs, silent barcodes were located at increased distance from H3K36me3, H3K79me2 and H3K79me3 compared to their transcriptionally active counterparts (Figure 5). These results indicate that inhibition of the interaction between HIV-1 IN and LEDGF/p75 affects HIV-1 RNA expression. Whether the interaction of LEDGF/p75 with nucleosomes directly activates HIV transcription remains unknown. Alternatively, they may target the provirus to chromatin that supports active transcription. Interestingly, ‘no RNA’ barcodes were located further away from markers associated with (super-) enhancers (H3K27ac, H3K4me1, Med1 and CBP) in all conditions, regardless of the presence of LEDGINs (Figures 5 and 6). This result is in agreement with previous findings (43,44). The influence of H3K27ac on expression was more pronounced than that of H3K4me1, possibly because H3K27ac is specifically associated with active enhancers, while H3K4me1 is also associated with poised enhancers (68,84). Finally, some other markers associated with active transcription like RNAPII and H3K4me3 were disfavored by silent viruses, also in the absence of LEDGINs. Although the median RNA expression decreased, some highly expressing barcodes persisted even in the presence of LEDGIN CX014442. The 10% highest RNA expressing barcodes were found closer to enhancers and to a lesser extent closer to RNAPII and H3K4me3 (Supplementary Figure S14). Altogether, these results prove (i) that the chromatin environment at the site of integration affects HIV transcription, (ii) that LEDGINs reduce transcription by retargeting provirus and that (iii) high RNA expression from residual integrants is due to their proximity to (super-) enhancers.

Based on these results we propose a model including LEDGF/p75-dependent and independent chromatin features determining HIV-1 RNA expression (Figure 7). LEDGF/p75 tethers HIV-1 to H3K36me3 in transcriptionally active regions. HIV-1 also integrates in proximity to enhancers in a LEDGF/p75 independent manner. Integration in these regions is associated with high HIV RNA expression. LEDGIN treatment retargets integration away from nucleosomes tagged with H3K36me3 by inhibiting the interaction between HIV-1 IN and LEDGF/p75. This LEDGIN-mediated retargeting negatively affects RNA expression. However, not all provirus is retargeted away from active regions by LEDGINs; some viruses still integrate in active regions or near enhancers, explaining the few high expressing barcodes persisting after treatment with LEDGINs. Those high expressors are associated with (super-) enhancers, RNAPII and promoters independently of LEDGF/p75. It is not clear whether integration near enhancers happens stochastically or is virus-induced. In fact, integration site selection has many determinants including nuclear import, host factors such as cleavage and polyadenylation specificity factor 6 (CPSF6) (85) and LEDGF/p75, and integrase recognition domains (86). After the pre-integration complex enters the nucleus through nuclear pore complexes (NPC), HIV integrates preferentially in the nuclear periphery in active chromatin close to the nuclear pore (87–93). Depletion of several NPC associated proteins (Nup98, Nup153, Transportin-3, RanBP2 and Tpr) hampers integration in gene dense regions (93–95). Interestingly, several studies reported that enhancers are frequently associated with nuclear pores (96–99). CPSF6 is an HIV cofactor that promotes nuclear entry via interaction with HIV capsid (85,100–102). Depletion and knockout of CPSF6 decreased integration in active genes (103–106). Additionally, HIV IN itself might affect integration site selection as it shows a weak preference for a conserved sequence logo at the site of integration (107–109). Moreover, HIV IN was shown to directly interact with chromatin via interaction with H4 amino-terminal tails (110).

Figure 7.

Figure 7.

Model for HIV-1 integration and transcription. (A) HIV-1 integration is mainly determined by LEDGF/p75 binding to H3K36me3 in active genes. HIV-1 also integrates near enhancer regions characterized by H3K27ac and H3K4me1. Integration in these areas is associated with high RNA expression. (B) LEDGIN treatment retargets integration away from actively transcribed regions to silent genes and intergenic regions, resulting in provirus with lower RNA expression. However, some LEDGF/p75-independent integrations might still occur near regulatory elements, explaining few high residual expressors even in the presence of LEDGINs.

In terms of HIV cure, this model suggests that the use of LEDGINs in a ‘block-and-lock’ cure strategy will affect residual HIV-1 expression but may not be sufficient to achieve complete repression of HIV-1 transcription. The present study hints at additionally blocking enhancer-stimulated transcription. Therefore, a combination of multiple latency promoting factors and transcription inhibitors might be required to silence all provirus. In addition, it might be interesting to target other integration site determinants like cofactors of nuclear import or chromatin remodelers to achieve a full remission. In future research, the role of HIV integration sites should also be investigated in clinically relevant models such as patient-derived cells. In addition to the B-HIVE method, it might be interesting to evaluate proviral sequences as data by Einkauf et al. suggest that intact proviral sequences in genomic regions associated with latency are selected over time (21).

DATA AVAILABILITY

The datasets generated during this study are deposited at Gene Expression Omnibus (GEO), series GSE135295.

Supplementary Material

gkaa536_Supplemental_Files

ACKNOWLEDGEMENTS

We would like to thank A. Bruggemans, M. Carlon, F. Christ and J. Janssens for critical reading of the manuscript. We are grateful to B. Van Remoortel and P. Van de Velde (Molecular Virology and Gene Therapy, KU Leuven) for the technical assistance and to K. Jacobs (Center for cancer biology, VIB-KU Leuven) for helping with fluorescence sorting. We would like to thank the Genomics Core facility of the Center for Genomic Regulation (CRG, Barcelona) for the support of Illumina sequencing. LEDGINs were synthesized by Cistim/CD3 (courtesy of Dr A. Marchand).

Author contributions: Z.D. and H.C. conceived the study. G.V., H.C., K.T., D.M. and Z.D. designed experiments. G.V., H.C. and K.T. conducted experiments. H.C. and E.Z. performed bioinformatic analysis. G.V., H.C., E.Z., K.T., D.M., G.F. and Z.D. analyzed and interpreted data. G.V., H.C., K.T. and Z.D. prepared the manuscript. All authors read and approved the final manuscript.

Notes

Present address: Heng-Chang Chen, Institut de Génétique Humaine, Laboratoire de Virologie Moléculaire, UMR9002, CNRS, Université de Montpellier, Montpellier, France.

Present address: Guillaume Filion, Department of Biological Sciences, University of Toronto Scarborough, Canada.

Contributor Information

Gerlinde Vansant, Laboratory for Molecular Virology and Gene Therapy, Department of Pharmaceutical and Pharmacological Sciences, KU Leuven, Leuven, Flanders, Belgium.

Heng-Chang Chen, Center for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Catalunya, Spain.

Eduard Zorita, Center for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Catalunya, Spain.

Katerina Trejbalová, Institute of Molecular Genetics, Czech Academy of Sciences, Videnska, Prague, Czech Republic.

Dalibor Miklík, Institute of Molecular Genetics, Czech Academy of Sciences, Videnska, Prague, Czech Republic.

Guillaume Filion, Center for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Catalunya, Spain; University Pompeu Fabra, Barcelona, Catalunya, Spain.

Zeger Debyser, Laboratory for Molecular Virology and Gene Therapy, Department of Pharmaceutical and Pharmacological Sciences, KU Leuven, Leuven, Flanders, Belgium.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

FUNDING

Flemish Fund for Scientific Research (FWO; Fonds voor Wetenschappelijk Onderzoek); Academische Stichting Leuven [to GV]; Research at KU Leuven received financial support from the FWO; KU Leuven Research Council [OT; OT/13/098]; HIV-ERA EURECA [IWT-SBO-EURECA, ZL345530]; KU Leuven Interdisciplinary Research (IDO) program [IDO/12/008]; Belgian IAP Belvir [ZKC4893-P7/45-P]; FWO-SBO-Saphir. The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication. Funding for open access charge: Fonds Wetenschappelijk Onderzoek.

Conflict of interest statement. None declared.

REFERENCES

  • 1. Finzi D., Hermankova M., Pierson T., Carruth L.M., Buck C., Chaisson R.E., Quinn T.C., Chadwick K., Margolick J., Brookmeyer R. et al.. Identification of a reservoir for HIV-1 in patients on highly active antiretroviral therapy. Science. 1997; 278:1295–1300. [DOI] [PubMed] [Google Scholar]
  • 2. Finzi D., Blankson J., Siliciano J.D., Margolick J.B., Chadwick K., Pierson T., Smith K., Lisziewicz J., Lori F., Flexner C. et al.. Latent infection of CD4+ T cells provides a mechanism for lifelong persistence of HIV-1, even in patients on effective combination therapy. Nat. Med. 1999; 5:512. [DOI] [PubMed] [Google Scholar]
  • 3. Chomont N., El-Far M., Ancuta P., Trautmann L., Procopio F.A., Yassine-Diab B., Boucher G., Boulassel M.-R., Ghattas G., Brenchley J.M. et al.. HIV reservoir size and persistence are driven by T cell survival and homeostatic proliferation. Nat. Med. 2009; 15:893. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Chun T.-W., Stuyver L., Mizell S.B., Ehler L.A., Mican J.A.M., Baseler M., Lloyd A.L., Nowak M.A., Fauci A.S.. Presence of an inducible HIV-1 latent reservoir during highly active antiretroviral therapy. Proc. Natl. Acad. Sci. U.S.A. 1997; 94:13193–13197. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Chun T.-W., Carruth L., Finzi D., Shen X., DiGiuseppe J.A., Taylor H., Hermankova M., Chadwick K., Margolick J., Quinn T.C. et al.. Quantification of latent tissue reservoirs and total body viral load in HIV-1 infection. Nature. 1997; 387:183–188. [DOI] [PubMed] [Google Scholar]
  • 6. Deeks S.G., Lewin S.R., Ross A.L., Ananworanich J., Benkirane M., Cannon P., Chomont N., Douek D., Lifson J.D., Lo Y.-R. et al.. International AIDS society global scientific strategy: towards an HIV cure 2016. Nat. Med. 2016; 22:839. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Darcis G., Van Driessche B., Van Lint C.. HIV latency: should we shock or lock?. Trends Immunol. 2017; 38:217–228. [DOI] [PubMed] [Google Scholar]
  • 8. Nguyen K., Das B., Dobrowolski C., Karn J.. Multiple histone lysine methyltransferases are required for the establishment and maintenance of HIV-1 latency. MBio. 2017; 8:e00133-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Gallastegui E., Millán-Zambrano G., Terme J.-M., Chávez S., Jordan A.. Chromatin reassembly factors are involved in transcriptional interference promoting HIV latency. J. Virol. 2011; 85:3187–3202. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Pearson R., Kim Y.K., Hokello J., Lassen K., Friedman J., Tyagi M., Karn J.. Epigenetic silencing of Human Immunodeficiency Virus (HIV) transcription by formation of restrictive chromatin structures at the viral long terminal repeat drives the progressive entry of HIV into latency. J. Virol. 2008; 82:12291–12303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Matsuda Y., Kobayashi-Ishihara M., Fujikawa D., Ishida T., Watanabe T., Yamagishi M.. Epigenetic heterogeneity in HIV-1 latency establishment. Sci. Rep. 2015; 5:7701. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Tyagi M., Pearson R.J., Karn J.. Establishment of HIV latency in primary CD4+ cells is due to epigenetic transcriptional silencing and P-TEFb restriction. J. Virol. 2010; 84:6425–6437. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Battivelli E., Dahabieh M.S., Abdel-Mohsen M., Svensson J.P., Tojal Da Silva I., Cohn L.B., Gramatica A., Deeks S., Greene W.C., Pillai S.K. et al.. Chromatin functional states correlate with HIV latency reversal in infected primary CD4+ T cells. Elife. 2018; 7:e34655. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Trejbalová K., Kovářová D., Blažková J., Machala L., Jilich D., Weber J., Kučerová D., Vencálek O., Hirsch I., Hejnar J.. Development of 5′ LTR DNA methylation of latent HIV-1 provirus in cell line models and in long-term-infected individuals. Clin. Epigenet. 2016; 8:19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Blazkova J., Trejbalova K., Gondois-Rey F., Halfon P., Philibert P., Guiguen A., Verdin E., Olive D., Van Lint C., Hejnar J. et al.. CpG methylation controls reactivation of HIV from latency. PLoS Pathog. 2009; 5:e1000554. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Jordan A., Defechereux P., Verdin E.. The site of HIV-1 integration in the human genome determines basal transcriptional activity and response to Tat transactivation. EMBO J. 2001; 20:1726–1738. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Lewinski M.K., Bisgrove D., Shinn P., Chen H., Hoffmann C., Hannenhalli S., Verdin E., Berry C.C., Ecker J.R., Bushman F.D.. Genome-wide analysis of chromosomal features repressing human immunodeficiency virus transcription. J. Virol. 2005; 79:6610–6619. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Brady T., Agosto L.M., Malani N., Berry C.C., O’Doherty U., Bushman F.. HIV integration site distributions in resting and activated CD4+ T cells infected in culture. AIDS. 2009; 23:1461–1471. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Han Y., Lin Y.B., An W., Xu J., Yang H.-C., O’Connell K., Dordai D., Boeke J.D., Siliciano J.D., Siliciano R.F.. Orientation-dependent regulation of integrated HIV-1 expression by host gene transcriptional read through. Cell Host Microbe. 2008; 4:134–146. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Shan L., Yang H.C., Rabi S.A., Bravo H.C., Shroff N.S., Irizarry R.A.. Influence of host gene transcription level and orientation on HIV-1 latency in a primary-cell model. J. Virol. 2011; 85:5384–5393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Einkauf K.B., Lee G.Q., Gao C., Sharaf R., Sun X., Hua S., Chen S.M. Y., Jiang C., Lian X., Chowdhury F.Z. et al.. Intact HIV-1 proviruses accumulate at distinct chromosomal positions during prolonged antiretroviral therapy. J. Clin. Invest. 2019; 129:988–998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Cherepanov P., Maertens G., Proost P., Devreese B., Van Beeumen J., Engelborghs Y., De Clercq E., Debyser Z.. HIV-1 integrase forms stable tetramers and associates with LEDGF/P75 protein in human cells. J. Biol. Chem. 2003; 278:372–381. [DOI] [PubMed] [Google Scholar]
  • 23. Lesbats P., Engelman A.N., Cherepanov P. Retroviral DNA integration. Chem. Rev. 2016; 116:12730–12757. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Schroder A.R., Shinn P., Chen H., Berry C., Ecker J.R., Bushman F.. HIV-1 integration in the human genome favors active genes and local hotspots. Cell. 2002; 110:521–529. [DOI] [PubMed] [Google Scholar]
  • 25. Shun M.-C., Raghavendra N.K., Vandegraaff N., Daigle J.E., Hughes S., Kellam P., Cherepanov P., Engelman A.. LEDGF/P75 functions downstream from preintegration complex formation to effect gene-specific HIV-1 integration. Genes Dev. 2007; 21:1767–1778. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Cherepanov P., Ambrosio A.L.B., Rahman S., Ellenberger T., Engelman A.. Structural basis for the recognition between HIV-1 integrase and transcriptional coactivator P75. Proc. Natl. Acad. Sci. U.S.A. 2005; 102:17308–17313. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Eidahl J.O., Crowe B.L., North J.A., McKee C.J., Shkriabai N., Feng L., Plumb M., Graham R.L., Gorelick R.J., Hess S. et al.. Structural basis for high-affinity binding of LEDGF PWWP to mononucleosomes. Nucleic Acids Res. 2013; 41:3924–3936. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Pradeepa M.M., Sutherland H.G., Ule J., Grimes G.R., Bickmore W.A.. Psip1/Ledgf P52 binds methylated histone H3K36 and splicing factors and contributes to the regulation of alternative splicing. PLoS Genet. 2012; 8:e1002717. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. van Nuland R., van Schaik F.M., Simonis M., van Heesch S., Cuppen E., Boelens R., Timmers H.M., van Ingen H.. Nucleosomal DNA binding drives the recognition of H3K36-methylated nucleosomes by the PSIP1-PWWP domain. Epigenet. Chromatin. 2013; 6:12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Ciuffi A., Llano M., Poeschla E., Hoffmann C., Leipzig J., Shinn P., Ecker J.R., Bushman F.. A role for LEDGF/P75 in targeting HIV DNA integration. Nat. Med. 2005; 11:1287. [DOI] [PubMed] [Google Scholar]
  • 31. Vranckx L.S., Demeulemeester J., Saleh S., Boll A., Vansant G., Schrijvers R., Weydert C., Battivelli E., Verdin E., Cereseto A. et al.. LEDGIN-mediated inhibition of Integrase–LEDGF/P75 interaction reduces reactivation of residual latent HIV. EBioMedicine. 2016; 8:248–264. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Christ F., Voet A., Marchand A., Nicolet S., Desimmie B.A., Marchand D., Bardiot D., Van der Veken N.J., Van Remoortel B., Strelkov S.V. et al.. Rational design of Small-molecule inhibitors of the LEDGF/P75-integrase interaction and HIV replication. Nat. Chem. Biol. 2010; 6:442–448. [DOI] [PubMed] [Google Scholar]
  • 33. Christ F., Shaw S., Demeulemeester J., Desimmie B.A., Marchand A., Butler S., Smets W., Chaltin P., Westby M., Debyser Z. et al.. Small-molecule inhibitors of the LEDGF/P75 binding site of integrase block HIV replication and modulate integrase multimerization. Antimicrob. Agents Chemother. 2012; 56:4365–4374. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Demeulemeester J., Chaltin P., Marchand A., De Maeyer M., Debyser Z., Christ F.. LEDGINs, non-catalytic site inhibitors of HIV-1 Integrase: a patent review (2006 – 2014). Expert Opin. Ther. Pat. 2014; 24:609–632. [DOI] [PubMed] [Google Scholar]
  • 35. Al-Mawsawi L.Q., Neamati N.. Allosteric inhibitor development targeting HIV-1 integrase. ChemMedChem. 2011; 6:228–241. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Tsiang M., Jones G.S., Niedziela-Majka A., Kan E., Lansdon E.B., Huang W., Hung M., Samuel D., Novikov N., Xu Y. et al.. New class of HIV-1 integrase (IN) inhibitors with a dual mode of action. J. Biol. Chem. 2012; 287:21189–21203. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Kessl J.J., Jena N., Koh Y., Taskent-Sezgin H., Slaughter A., Feng L., de Silva S., Wu L., Le Grice S.F.J., Engelman A. et al.. Multimode, cooperative mechanism of action of allosteric HIV-1 integrase inhibitors. J. Biol. Chem. 2012; 287:16801–16811. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Cherepanov P., Sun Z.-Y.J., Rahman S., Maertens G., Wagner G., Engelman A.. Solution structure of the HIV-1 integrase-binding domain in LEDGF/P75. Nat. Struct. Mol. Biol. 2005; 12:526. [DOI] [PubMed] [Google Scholar]
  • 39. Desimmie B.A., Schrijvers R., Demeulemeester J., Borrenberghs D., Weydert C., Thys W., Vets S., Van Remoortel B., Hofkens J., De Rijck J. et al.. LEDGINs inhibit late stage HIV-1 replication by modulating integrase multimerization in the virions. Retrovirology. 2013; 10:57. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Jurado K.A., Wang H., Slaughter A., Feng L., Kessl J.J., Koh Y., Wang W., Ballandras-Colas A., Patel P.A., Fuchs J.R. et al.. Allosteric integrase inhibitor potency is determined through the inhibition of HIV-1 particle maturation. Proc. Natl. Acad. Sci. U.S.A. 2013; 110:8690–8695. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Balakrishnan M., Yant S.R., Tsai L., O’Sullivan C., Bam R.A., Tsai A., Niedziela-Majka A., Stray K.M., Sakowicz R., Cihlar T.. Non-catalytic site HIV-1 integrase inhibitors disrupt core maturation and induce a reverse transcription block in target cells. PLoS One. 2013; 8:e74163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Le Rouzic E., Bonnard D., Chasset S., Bruneau J.-M., Chevreuil F., Le Strat F., Nguyen J., Beauvoir R., Amadori C., Brias J. et al.. Dual inhibition of HIV-1 replication by Integrase-LEDGF allosteric inhibitors is predominant at the post-integration stage. Retrovirology. 2013; 10:144. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Chen H.-C., Martinez J.P., Zorita E., Meyerhans A., Filion G.J. Position effects influence HIV latency reversal. Nat. Struct. Mol. Biol. 2017; 24:47–54. [DOI] [PubMed] [Google Scholar]
  • 44. Miklík D., Šenigl F., Hejnar J.. Proviruses with long-term stable expression accumulate in transcriptionally active chromatin close to the gene regulatory elements: comparison of ASLV-, HIV- and MLV-derived VECTORS. Viruses. 2018; 10:116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Chen H.-C., Zorita E., Filion G.J.. Using barcoded HIV ensembles (B-HIVE) for single provirus transcriptomics. Curr. Protoc. Mol. Biol. 2018; 122:e56. [DOI] [PubMed] [Google Scholar]
  • 46. Ablashi D.V, Berneman Z.N., Kramarsky B., Whitman J., Asano Y., Pearson G.R.. Human Herpesvirus-7 (HHV-7): current status. Clin. Diagn. Virol. 1995; 4:1–13. [DOI] [PubMed] [Google Scholar]
  • 47. Vincent-Lacaze N., Snyder R.O., Gluzman R., Bohl D., Lagarde C., Danos O.. Structure of adeno-associated virus vector DNA following transduction of the skeletal muscle. J. Virol. 1999; 73:1949–1955. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Vansant G., Vranckx L.S., Zurnic I., Van Looveren D., Van de Velde P., Nobles C., Gijsbers R., Christ F., Debyser Z.. Impact of LEDGIN treatment during virus production on residual HIV-1 transcription. Retrovirology. 2019; 16:8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Zhang L., Lewin S.R., Markowitz M., Lin H.H., Skulsky E., Karanicolas R.. Measuring recent thymic emigrants in blood of normal and HIV-1-infected individuals before and after effective therapy. J. Exp. Med. 1999; 190:725–732. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Yu G., Wang L.-G., Han Y., He Q.-Y.. ClusterProfiler: an R package for comparing biological themes among gene clusters. Omi. A J. Integr. Biol. 2012; 16:284–287. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Rosenthal R., Rubin D.B.. Ensemble-adjusted P values. Psychol. Bull. 1983; 94:540–541. [Google Scholar]
  • 52. Wright S.P. Adjusted P-values for simultaneous inference. Biometrics. 1992; 48:1005–1013. [Google Scholar]
  • 53. Benjamini Y., Hochberg Y.. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B. 1995; 57:289–300. [Google Scholar]
  • 54. Cuscó P., Filion G.J.. Zerone: a ChIP-Seq discretizer for multiple replicates with built-in quality control. Bioinformatics. 2016; 32:2896–2902. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. Bray N.L., Pimentel H., Melsted P., Pachter L. Near-optimal probabilistic RNA-Seq quantification. Nat. Biotechnol. 2016; 34:525–527. [DOI] [PubMed] [Google Scholar]
  • 56. Reeder J.E., Kwak Y.-T., McNamara R.P., Forst C.V., D’Orso I.. HIV Tat controls RNA polymerase II and the epigenetic landscape to transcriptionally reprogram target immune cells. Elife. 2015; 4:e08955. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Orlando D.A., Chen M.W., Brown V.E., Solanki S., Choi Y.J., Olson E.R., Fritz C.C., Bradner J.E., Guenther M.G.. Quantitative ChIP-Seq normalization reveals global modulation of the epigenome. Cell Rep. 2014; 9:1163–1170. [DOI] [PubMed] [Google Scholar]
  • 58. Mansour M.R., Abraham B.J., Anders L., Berezovskaya A., Gutierrez A., Durbin A.D., Etchin J., Lawton L., Sallan S.E., Silverman L.B. et al.. Oncogene regulation. An oncogenic super-enhancer formed through somatic mutation of a noncoding intergenic element. Science. 2014; 346:1373–1377. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. Hollenhorst P.C., Chandler K.J., Poulsen R.L., Johnson W.E., Speck N.A., Graves B.J.. DNA specificity determinants associate with distinct transcription factor functions. PLoS Genet. 2009; 5:e1000778. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60. Wang G.P., Ciuffi A., Leipzig J., Berry C.C., Bushman F.D. HIV integration site selection: analysis by massively parallel pyrosequencing reveals association with epigenetic modifications. Genome Res. 2007; 17:1186–1194. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61. Kanehisa M., Goto S.. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000; 28:27–30. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62. Kanehisa M., Sato Y., Furumichi M., Morishima K., Tanabe M.. New approach for understanding genome variations in KEGG. Nucleic Acids Res. 2018; 47:D590–D595. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63. Kanehisa M. Toward understanding the origin and evolution of cellular organisms. Protein Sci. 2019; 28:1947–1951. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64. Bannister A.J., Schneider R., Myers F.A., Thorne A.W., Crane-Robinson C., Kouzarides T.. Spatial distribution of Di- and Tri-Methyl lysine 36 of histone H3 at active genes. J. Biol. Chem. 2005; 280:17732–17736. [DOI] [PubMed] [Google Scholar]
  • 65. Wu H., Zeng H., Lam R., Tempel W., Amaya M.F., Xu C., Dombrovski L., Qiu W., Wang Y., Min J.. Structural and histone binding ability characterizations of human PWWP domains. PLoS One. 2011; 6:e18919. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66. Schrijvers R., Vets S., De Rijck J., Malani N., Bushman F.D., Debyser Z., Gijsbers R.. HRP-2 determines HIV-1 integration site selection in LEDGF/P75 depleted cells. Retrovirology. 2012; 9:84. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67. Schrijvers R., De Rijck J., Demeulemeester J., Adachi N., Vets S., Ronen K., Christ F., Bushman F.D., Debyser Z., Gijsbers R.. LEDGF/P75-Independent HIV-1 replication demonstrates a role for HRP-2 and remains sensitive to inhibition by LEDGINs. PLoS Pathog. 2012; 8:e1002558. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68. Zentner G.E., Tesar P.J., Scacheri P.C.. Epigenetic signatures distinguish multiple classes of enhancers with distinct cellular functions. Genome Res. 2011; 21:1273–1283. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69. Mellor J., Dudek P., Clynes D.. A glimpse into the epigenetic landscape of gene regulation. Curr. Opin. Genet. Dev. 2008; 18:116–122. [DOI] [PubMed] [Google Scholar]
  • 70. Pott S., Lieb J.D.. What are super-enhancers?. Nat. Genet. 2014; 47:8. [DOI] [PubMed] [Google Scholar]
  • 71. Delannoy A., Poirier M., Bell B. Cat and Mouse: HIV transcription in latency, immune evasion and cure/remission strategies. Viruses. 2019; 11:269. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72. Schwartz C., Bouchat S., Marban C., Gautier V., Van Lint C., Rohr O., Le Douce V.. On the way to find a cure: purging latent HIV-1 reservoirs. Biochem. Pharmacol. 2017; 146:10–22. [DOI] [PubMed] [Google Scholar]
  • 73. Kim Y., Anderson J.L., Lewin S.R.. Getting the “Kill” into “Shock and Kill”: Strategies to eliminate latent HIV. Cell Host Microbe. 2018; 23:14–26. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74. Abner E., Jordan A.. HIV “Shock and Kill” therapy: in need of revision. Antiviral Res. 2019; 166:19–34. [DOI] [PubMed] [Google Scholar]
  • 75. Bullen C.K., Laird G.M., Durand C.M., Siliciano J.D., Siliciano R.F.. Novel Ex vivo approaches distinguish effective and ineffective single agents for reversing HIV-1 latency in vivo. Nat. Med. 2014; 20:425–429. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76. Ho Y.C., Shan L., Hosmane N.N., Wang J., Laskey S.B., Rosenbloom D.I.. Replication-competent noninduced proviruses in the latent reservoir increase barrier to HIV-1 cure. Cell. 2013; 155:540–551. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77. Mousseau G., Mediouni S., Valente S.T.. Targeting HIV transcription: the quest for a functional cure. Curr. Top. Microbiol. Immunol. 2015; 389:121–145. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78. Mousseau G., Kessing C.F., Fromentin R., Trautmann L., Chomont N., Valente S.T.. The tat inhibitor didehydro-cortistatin a prevents HIV-1 reactivation from latency. MBio. 2015; 6:e00465-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79. Kessing C.F., Nixon C.C., Li C., Tsai P.M., Takata H., Mousseau G., Ho P.T., Honeycutt J.B., Fallahi M., Trautmann L. et al.. In vivo suppression of HIV rebound by didehydro-cortistatin A, a “Block-and-Lock” strategy for HIV-1 cure. Cell Rep. 2017; 21:600–611. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80. Debyser Z., Vansant G., Bruggemans A., Janssens J., Christ F.. Insight in HIV integration site selection provides a block-and-lock strategy for a functional cure of HIV infection. Viruses. 2018; 11:12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81. Wang Z., Zang C., Rosenfeld J.A., Schones D.E., Barski A., Cuddapah S., Cui K., Roh T.-Y., Peng W., Zhang M.Q. et al.. Combinatorial patterns of histone acetylations and methylations in the human genome. Nat. Genet. 2008; 40:897–903. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82. Huff J.T., Plocik A.M., Guthrie C., Yamamoto K.R.. Reciprocal intronic and exonic histone modification regions in humans. Nat. Struct. Mol. Biol. 2010; 17:1495. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83. Farooq Z., Banday S., Pandita T.K., Altaf M.. The many faces of histone H3K79 methylation. Mutat. Res. Rev. Mutat. Res. 2016; 768:46–52. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84. Creyghton M.P., Cheng A.W., Welstead G.G., Kooistra T., Carey B.W., Steine E.J., Hanna J., Lodato M.A., Frampton G.M., Sharp P.A. et al.. Histone H3K27ac separates active from poised enhancers and predicts developmental state. Proc. Natl. Acad. Sci. U.S.A. 2010; 107:21931–21936. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85. Lee K., Ambrose Z., Martin T.D., Oztop I., Mulky A., Julias J.G., Vandegraaff N., Baumann J.G., Wang R., Yuen W. et al.. Flexible use of nuclear import pathways by HIV-1. Cell Host Microbe. 2010; 7:221–233. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86. Demeulemeester J., De Rijck J., Gijsbers R., Debyser Z.. Retroviral integration: site matters: mechanisms and consequences of retroviral integration site selection. Bioessays. 2015; 37:1202–1214. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87. Wong R.W., Mamede J.I., Hope T.J.. Impact of nucleoporin-mediated chromatin localization and nuclear architecture on HIV integration site selection. J. Virol. 2015; 89:9702–9705. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88. Lusic M., Siliciano R.F.. Nuclear landscape of HIV-1 infection and integration. Nat. Rev. Microbiol. 2016; 15:69. [DOI] [PubMed] [Google Scholar]
  • 89. Burdick R.C., Hu W.-S., Pathak V.K.. Nuclear import of APOBEC3F-Labeled HIV-1 preintegration complexes. Proc. Natl. Acad. Sci. U.S.A. 2013; 110:E4780–E4789. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90. Di Primio C., Quercioli V., Allouch A., Gijsbers R., Christ F., Debyser Z., Arosio D., Cereseto A.. Single-cell imaging of HIV-1 provirus (SCIP). Proc. Natl. Acad. Sci. U.S.A. 2013; 110:5636–5641. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91. Marini B., Kertesz-Farkas A., Ali H., Lucic B., Lisek K., Manganaro L. Nuclear architecture dictates HIV-1 integration site selection. Nature. 2015; 521:227–231. [DOI] [PubMed] [Google Scholar]
  • 92. Albanese A., Arosio D., Terreni M., Cereseto A.. HIV-1 pre-integration complexes selectively target decondensed chromatin in the nuclear periphery. PLoS One. 2008; 3:e2413. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93. Lelek M., Casartelli N., Pellin D., Rizzi E., Souque P., Severgnini M., Di Serio C., Fricke T., Diaz-Griffero F., Zimmer C. et al.. Chromatin organization at the nuclear pore favours HIV replication. Nat. Commun. 2015; 6:6483. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94. Di Nunzio F., Fricke T., Miccio A., Valle-Casuso J.C., Perez P., Souque P., Rizzi E., Severgnini M., Mavilio F., Charneau P. et al.. Nup153 and Nup98 Bind the HIV-1 core and contribute to the early steps of HIV-1 replication. Virology. 2013; 440:8–18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95. Ocwieja K.E., Brady T.L., Ronen K., Huegel A., Roth S.L., Schaller T., James L.C., Towers G.J., Young J.A.T., Chanda S.K. et al.. HIV integration targeting: a pathway involving transportin-3 and the nuclear pore protein RanBP2. PLoS Pathog. 2011; 7:e1001313. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96. Pascual-Garcia P., Capelson M.. Nuclear pores in genome architecture and enhancer function. Curr. Opin. Cell Biol. 2019; 58:126–133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97. Pascual-Garcia P., Debo B., Aleman J.R., Talamas J.A., Lan Y., Nguyen N.H., Won K.J., Capelson M.. Metazoan nuclear pores provide a scaffold for poised genes and mediate induced enhancer-promoter contacts. Mol. Cell. 2017; 66:63–76. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98. Ibarra A., Benner C., Tyagi S., Cool J., Hetzer M.W.. Nucleoporin-mediated regulation of cell identity genes. Genes Dev. 2016; 30:2253–2258. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99. Liu X., Zhang Y., Chen Y., Li M., Zhou F., Li K., Cao H., Ni M., Liu Y., Gu Z. et al.. In situ capture of chromatin interactions by biotinylated DCas9. Cell. 2017; 170:1028–1043. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100. Price A.J., Fletcher A.J., Schaller T., Elliott T., Lee K., KewalRamani V.N., Chin J.W., Towers G.J., James L.C.. CPSF6 defines a conserved capsid interface that modulates HIV-1 replication. PLOS Pathog. 2012; 8:e1002896. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101. Matreyek K.A., Yücel S.S., Li X., Engelman A.. Nucleoporin NUP153 phenylalanine-glycine motifs engage a common binding pocket within the HIV-1 capsid protein to mediate lentiviral infectivity. PLoS Pathog. 2013; 9:e1003693. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102. Bhattacharya A., Alam S.L., Fricke T., Zadrozny K., Sedzicki J., Taylor A.B., Demeler B., Pornillos O., Ganser-Pornillos B.K., Diaz-Griffero F. et al.. Structural basis of HIV-1 capsid recognition by PF74 and CPSF6. Proc. Natl. Acad. Sci. U.S.A. 2014; 111:18625–18630. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103. Chin C.R., Perreira J.M., Savidis G., Portmann J.M., Aker A.M., Feeley E.M., Smith M.C., Brass A.L.. Direct visualization of HIV-1 replication intermediates shows that capsid and CPSF6 modulate HIV-1 intra-nuclear invasion and integration. Cell Rep. 2015; 13:1717–1731. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104. Rasheedi S., Shun M.-C., Serrao E., Sowd G.A., Qian J., Hao C., Dasgupta T., Engelman A.N., Skowronski J.. The cleavage and polyadenylation specificity factor 6 (CPSF6) subunit of the capsid-recruited pre-messenger RNA cleavage factor I (CFIm) complex mediates HIV-1 integration into genes. J. Biol. Chem. 2016; 291:11809–11819. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105. Sowd G.A., Serrao E., Wang H., Wang W., Fadel H.J., Poeschla E.M., Engelman A.N.. A critical role for alternative polyadenylation factor CPSF6 in targeting HIV-1 integration to transcriptionally active chromatin. Proc. Natl. Acad. Sci. U.S.A. 2016; 113:E1054–E1063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106. Achuthan V., Perreira J.M., Sowd G.A., Puray-Chavez M., McDougall W.M., Paulucci-Holthauzen A., Wu X., Fadel H.J., Poeschla E.M., Multani A.S. et al.. Capsid-CPSF6 interaction licenses nuclear HIV-1 trafficking to sites of viral DNA integration. Cell Host Microbe. 2018; 24:392–404. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107. Holman A.G., Coffin J.M.. Symmetrical base preferences surrounding HIV-1, avian Sarcoma/Leukosis virus, and murine leukemia virus integration sites. Proc. Natl. Acad. Sci. U.S.A. 2005; 102:6103–6107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 108. Wu X., Li Y., Crise B., Burgess S.M., Munroe D.J.. Weak palindromic consensus sequences are a common feature found at the integration target sites of many retroviruses. J. Virol. 2005; 79:5211–5214. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 109. Berry C., Hannenhalli S., Leipzig J., Bushman F.D.. Selection of target sites for mobile DNA integration in the human genome. PLoS Comput. Biol. 2006; 2:e157. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 110. Benleulmi M.S., Matysiak J., Robert X., Miskey C., Mauro E., Lapaillerie D., Lesbats P., Chaignepain S., Henriquez D.R., Calmels C. et al.. Modulation of the functional association between the HIV-1 intasome and the nucleosome by histone amino-terminal tails. Retrovirology. 2017; 14:54. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

gkaa536_Supplemental_Files

Data Availability Statement

All data-processing steps are documented in a Docker virtual machine available at https://github.com/ezorita/bhive. The datasets generated during this study are deposited at Gene Expression Omnibus (GEO), series GSE135295. Additional ChIP-Seq data for H3K36me3, H3K79me3, H3K27ac, H3K4me1, RNAPII, H3K4me3, H3K9me3 (GSE65687) (56), H3K79me2 (GSE60104) (57), Med1 (GSE59657) (58) and CBP (GSE17954) (59) were downloaded from GEO.

The datasets generated during this study are deposited at Gene Expression Omnibus (GEO), series GSE135295.


Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES