ABSTRACT
The ability of Epstein-Barr virus (EBV) to switch between latent and lytic infection is key to its long-term persistence, yet the molecular mechanisms behind this switch remain unclear. To investigate transcriptional events during the latent-to-lytic switch, we utilized Precision nuclear Run On followed by deep Sequencing (PRO-Seq) to map cellular RNA polymerase (Pol) activity to single-nucleotide resolution on the host and EBV genome in three different models of EBV latency and reactivation. In latently infected Mutu-I Burkitt lymphoma (BL) cells, Pol activity was enriched at the Qp promoter, the EBER region, and the BHLF1/LF3 transcripts. Upon reactivation with phorbol ester and sodium butyrate, early-phase Pol activity occurred bidirectionally at CTCF sites within the LMP-2A, EBER-1, and RPMS1 loci. PRO-Seq analysis of Akata cells reactivated from latency with anti-IgG and a lymphoblastoid cell line (LCL) reactivated with small molecule C60 showed a similar pattern of early bidirectional transcription initiating around CTCF binding sites, although the specific CTCF sites and viral genes were different for each latency model. The functional importance of CTCF binding, transcription, and reactivation was confirmed using an EBV mutant lacking the LMP-2A CTCF binding site. This virus was unable to reactivate and had disrupted Pol activity at multiple CTCF binding sites relative to the wild-type (WT) virus. Overall, these data suggest that CTCF regulates the viral early transcripts during reactivation from latency. These activities likely help maintain the accessibility of the viral genome to initiate productive replication.
IMPORTANCE The ability of EBV to switch between latent and lytic infection is key to its long-term persistence in memory B cells, and its ability to persist in proliferating cells is strongly linked to oncogenesis. During latency, most viral genes are epigenetically silenced, and the virus must overcome this repression to reactivate lytic replication. Reactivation occurs once the immediate early (IE) EBV lytic genes are expressed. However, the molecular mechanisms behind the switch from the latent transcriptional program to begin transcription of the IE genes remain unknown. In this study, we mapped RNA Pol positioning and activity during latency and reactivation. Unexpectedly, Pol activity accumulated at distinct regions characteristic of transcription initiation on the EBV genome previously shown to be associated with CTCF. We propose that CTCF binding at these regions retains Pol to maintain a stable latent chromosome conformation and a rapid response to various reactivation signals.
KEYWORDS: CCCTC binding factor, CTCF, Epstein-Barr virus, herpesviruses, latent infection, transcription, transcriptional regulation
INTRODUCTION
Epstein-Barr virus (EBV) is a gammaherpesvirus that establishes a persistent infection in 95% of adults worldwide. EBV maintains this persistence through a biphasic life cycle in which it switches between latent and lytic phases. The virus establishes latency in B cells where it exists as a circular episome, and the majority of genes are believed to be epigenetically silenced via repressive chromatin and DNA methylation (1). EBV infection of B cells leads to their transformation into immortalized lymphoblastoid cell lines (LCLs) and is strongly associated with B-cell cancers such as Burkitt’s lymphoma and Hodgkin’s lymphoma (2).
The latent transcriptional program is highly restricted compared to the lytic program, with expression limited to a combination of six EBNA genes, two LMP genes, and multiple noncoding RNAs (3). The expression pattern of the EBNA and LMP genes determines the classification of the four distinct latency types (0 to III), with 0 being the most restricted (no viral protein expression) and III the least restricted (3). EBNA expression is regulated by selection from the Qp, Cp, and Wp promoters (4). This promoter selection is mediated by the chromatin architecture of the EBV episome, which is dependent on host CCCTC binding factor (CTCF) (5). CTCF is a multifunctional DNA binding protein with roles in transcriptional activation and repression, chromatin loop and boundary formation, and promoter-enhancer blocking activity (6). The EBV genome contains multiple CTCF binding sites (7), and a number of these sites have been linked to the maintenance of the latent chromatin state (8–10).
Reactivation is known to be induced by stimulation of the B cell receptor (BCR) (11) and various stress-related signals (12) such as activation of the NLRP3 inflammasome (13). These events lead to activation of the viral Zp and Rp promoters to drive transcription of the immediate early (IE) EBV genes, BZLF1 and BRLF1, which, in turn, induce early lytic gene expression (14). Multiple cellular transcription factors have been shown to be involved in this switch, including HIF-1α, which promotes expression of BZLF1 (15). Others, such as MYC/MAX (16), STAT3 (17), and TRIM28 (18), are negative regulators, and their depletion promotes BZFL1 expression. However, the full molecular mechanisms as to how EBV overcomes the repressive latent state to promote transcription of the lytic genes remain to be fully understood.
EBV relies on cellular RNA polymerase II (Pol II) for transcription, although it also uses RNA polymerase III (Pol III) to transcribe Epstein-Barr virus-encoded small RNA 1 and 2 (EBER1 and -2) (19). Pol II contains an extra C-terminal domain (CTD), and the phosphorylation pattern of the CTD changes during functional switches between transcription initiation, elongation, and termination (20). EBV is known to modulate Pol II transcription elongation by promoting serine 2 CTD phosphorylation to drive B-cell immortalization. The switch from type I to III latency is regulated by Pol II pausing at the Cp promoter through association with the negative elongation factor (NELF), which facilitates recruitment of the positive transcription elongation factor (pTEFb). pTEFb maintains CTD serine 2 phosphorylation, leading to strong expression of EBV immortalization genes (21). It is not yet known if RNA polymerase pausing-elongation regulates the latent-lytic switch.
Pausing of Pol II at promoter regions, known as promoter-proximal pausing (PPP), is a mechanism known to be used by herpesviruses to regulate transcription. The alphaherpesvirus herpes simplex virus 1 (HSV-1) utilizes PPP to regulate temporal gene expression during lytic replication (22), as does the betaherpesvirus human cytomegalovirus (HCMV) (23). Kaposi’s sarcoma-associated herpesvirus (KSHV), another gammaherpesvirus, exploits PPP to regulate the latent-lytic switch. During latency, NELF is recruited to pause Pol II on lytic KSHV gene promoters, allowing for prompt expression upon stimulation of reactivation (24).
The aim of this study was to investigate changes in Pol II activity and positioning during the transition from a restricted latent state to the transcriptionally active lytic state. We hypothesized that Pol II would remain paused on the promoters of the IE genes during latency and that upon reactivation, Pol II would rapidly elongate into gene bodies. To address this hypothesis, we used Precision nuclear Run On followed by deep Sequencing (PRO-Seq) (25) to map cellular Pol activity to single-nucleotide resolution on the EBV genome during latency and reactivation. As PRO-Seq does not distinguish between Pol II or Pol III, we refer to all identified activity as just Pol.
In contrast to the hypothesis posed, we found no evidence of PPP on IE genes during EBV latency or reactivation. Instead, Pol activity during latency and reactivation was enriched at regions on the EBV genome that were often unassociated with known viral transcripts or promoters but correlated with CTCF binding sites. Using an EBV mutant lacking a CTCF binding site, we confirmed the functional importance of CTCF binding on transcription and subsequent reactivation.
RESULTS
Distribution of active polymerase on the EBV episome in latent and reactivated Mutu-I cells.
To investigate the latent-lytic EBV transcriptional switch, we performed PRO-Seq on latently infected B cells at various stages after reactivation. We first focused on the robust reactivation of Mutu-I Burkitt lymphoma (BL) cells treated with neutralizing antibody (NAb)-tetradecanoyl phorbol acetate (TPA). Nuclei from Mutu-I cells were harvested at 1, 4, and 12 h postreactivation. The extent of RNA Pol activity during latency was first established by visualization of the data across the EBV Mutu-I episome (GenBank accession no. KC207814 [26]) using the Integrative Genomics Viewer (IGV) (27). Low levels of transcriptional activity were detected across the entire EBV genome with more substantial activity at the EBERs and the BHLF1/LF3 transcripts located at the lytic origins of replication (OriLyt) (Fig. 1, top panel). We utilized the software package dREG (Detection of regulatory elements using GRO-Seq and other run-on and sequencing assays) to help identify transcription initiation regions (TIRs) based on patterns of elongating Pol identified by PRO-Seq (28). An overview of the dREG output is shown below the PRO-Seq tracks. Peaks with a false-discovery rate (FDR)-corrected P value of ≤0.05 are highlighted in green. The computed TIRs during latency were detected at the EBERs, both OriLyts, the Qp promoter, and the LMP region. Except for LMP, these TIRs were consistent with the expected transcriptional pattern of type 1 latency.
FIG 1.
Distribution of active polymerase on the EBV episome in latent and reactivated Mutu-I cells. IGV view of the distribution of PRO-Seq reads along the EBV episome in latent Mutu-I cells and at 1, 4, and 12 h postreactivation with NaB-TPA (mean of 2 to 3 biological replicates, scaled relative to Drosophila spike-in). dREG-called peaks from PRO-Seq, characteristic of transcription initiation regions (TIRs) (28), are shown below each PRO-Seq panel. Significant dREG peaks are highlighted in green (false-discovery rate [FDR]-corrected P ≤ 0.05). Peaks of Pol enrichment at 1 and 4 h postreactivation are highlighted.
We next assessed the PRO-Seq transcriptional profile of EBV after reactivation. At 1 h postreactivation, increased Pol activity relative to the latent profile developed only at the LMP-2A, RPMS1, and EBER regions, shown in highlighted regions (Fig. 1, bottom panels). At 4 h postreactivation, increased activity was limited to these same regions. By 12 h, Pol activity increased across the entire EBV genome, although predominant peaks remained at the LMP-2A, RPMS1, and EBER regions. Interestingly, most of the dREG-predicted TIRS were active during both latency and reactivation, and very few new TIRS emerged during reactivation. However, reactivation correlated with higher levels of Pol activity at these regions. The lack of Pol enrichment at the BZLF1/BRLF1 was notable; however, dREG did predict a TIR at this site, but it did not pass the FDR threshold. Expression of these genes by 24 h postreactivation was confirmed by reverse transcriptase quantitative PCR (RT-qPCR) (see Fig. S1A in the supplemental material), indicating that a lack of activity at this region was not due to inefficient reactivation.
Distinct patterns of transcriptional enrichment during EBV reactivation are associated with CTCF binding sites in Mutu-I cells.
In experimental support of the dREG computed TIRs, we used data from a previous Assay for Transposase-accessible Chromatin Sequencing (ATAC-Seq) experiment (GEO accession no. GSE172476) in 2 subpopulations of latent Mutu-I cells, MUN14 and M14, and aligned this to the dREG data (Fig. 2A). TIRs at the EBERs, BHLF1, Qp, OriLyt(R), and LMP-2A showed an alignment to ATAC-Seq peaks, indicating that these predicted TIRs are located at sites of accessible chromatin. As TIRs discovered by the dREG package have been shown to be associated with sites of transcription factor binding (28), we next asked whether the TIRs identified on the EBV episome were linked to transcription factor binding sites. We utilized the EBV portal database (7) to align transcription factor chromatin immunoprecipitation sequencing (ChIP-Seq) data to the Mutu-I latent and reactivated dREG-called peaks generated from the PRO-Seq data. The most striking observation was that for virtually every dREG site, a CTCF peak also aligned (Fig. 2A).
FIG 2.
PRO-Seq identifies distinct patterns of Pol activity on the EBV episome during reactivation associated with CTCF binding sites. (A) Alignment of dREG-called peaks in latent and reactivated Mutu-I cells, ATAC-Seq data from 2 subpopulations of latent Mutu-I cells, and CTCF ChIP-Seq. (B to F) High-resolution IGV views of PRO-Seq tracks from the EBV genome aligned with dREG-called TIRs and regions of CTCF binding at the EBER-Cp region (B), OriLyt (L)-Qp region (C), RPMS1-OriLyt(R) region (D), LMP-2A exon 1 region (E), and BZLF1-BRLF1 region (F). The data shown are means of 2 to 3 biological replicates, scaled relative to Drosophila spike-in. In genome annotation, black bars indicate known transcription termination signals (TTSs), with coding regions shown in red.
Though not every CTCF site aligned with a TIR, the strong correlation warranted further investigation. We therefore examined some of the TIRs at higher resolution using IGV. Commonly, peaks of Pol activity were visible next to or over sites of CTCF binding and often on both strands of the genome. This is shown in detail at the EBER region in Fig. 2B, with extensive Pol activity on both strands over and adjacent to the CTCF site during reactivation. The CTCF site upstream of the Cp promoter is visible in the same figure, with bidirectional Pol activity over the CTCF site upon reactivation. The BHLF1/OriLyt(L)/Qp region is shown in Fig. 2C, with 3 CTCF sites coinciding with enriched Pol activity. Further examples of extensive Pol activity adjacent to and flanking CTCF sites during reactivation are shown for the RPMS1/OriLyt(R) region (Fig. 2D) and the LMP-2A region (Fig. 2E). The CTCF binding site at BZLF1 is shown in Fig. 2F. An enrichment of Pol activity can be seen over this site at 12 h postreactivation, suggesting it is more relevant later in reactivation than other CTCF binding sites.
Viewing the data at this high resolution also indicated that the PRO-Seq peaks and predicted TIRs did not map near known EBV gene transcription start sites (TSSs) or promoters, suggesting that promoter-proximal pausing of Pol was not a major feature of latency or reactivation. For example, the increased activity associated with the EBERs was found upstream of EBER-1 on both strands of the genome in an area encoding no known transcripts (Fig. 2B). A further example was noted at LMP-2A, where the increase in Pol occurred over the exon downstream of the start codon, with additional activity appearing later in reactivation 150 bp (on the opposite strand) and 500 bp downstream of the initial peak (Fig. 2E). The lytic EBV transcriptome is known to be more complex than the GenBank annotation (29, 30). However, even when novel lytic transcripts (using transcript information from O’Grady et al. [29]) were included in the alignment, the majority of PRO-Seq peaks did not align to canonical or novel TSSs (Fig. S2A to F). Taken together, the PRO-Seq data have indicated that enrichment of Pol activity during EBV reactivation is inconsistent with promoter-proximal pausing upstream of known EBV genes and is instead associated with CTCF binding sites.
Pol enrichment at CTCF binding sites is a consistent feature of EBV reactivation in multiple cell types.
To assess whether these TIRs were specific to Pol activity in Mutu-I cells and the NaB/TPA method of reactivation, we repeated the experiment using another EBV-positive Burkitt’s lymphoma cell line, Akata, reactivated with anti-IgG. It should be noted that the number of EBV genomes in Akata cells is lower than in Mutu-I cells, causing the Akata PRO-Seq data to comprise approximately 10-fold-lower normalized EBV-aligned reads (Tables S1 and S2). As a result, it was difficult to visualize distinct changes in PRO-Seq activity on IGV until 12 h postreactivation, when transcription was more robust (Fig. S3A). We confirmed this was not due to a failure to reactivate because elevated BZLF1 expression was detectable by RT-qPCR at just 4 h postreactivation (Fig. S1B). However, using dREG, we were able to identify relatively enriched regions of Pol activity, which again showed a strong association with CTCF binding sites throughout latency and reactivation (Fig. 3A).
FIG 3.
Sites of Pol enrichment during latency and reactivation in multiple models of EBV reactivation align with CTCF binding sites. (A) Summary of dREG-called peaks in latent and reactivated Mutu-I cells, Akata cells, and LCLs, aligned to CTCF and RAD 21 binding sites. Hatched areas indicated regions that were consistent in all cell lines. Significant dREG peaks are highlighted in green (false-discovery rate [FDR]-corrected P ≤ 0.05). For Akata and Mutu-I cells, all reactivated time point tracks are overlaid. (B to F) Volcano plots showing the DeSeq2-calculated fold changes of PRO-Seq reads for each EBV gene relative to latency at Mutu-I 1 h postreactivation (B), Mutu-1 4 h postreactivation (C), Mutu-I 12 h postreactivation (D), Akata 1 h postreactivation (E), Akata 4 h postreactivation (F), and Akata 12 h postreactivation (G).
Despite both systems sharing type 1 latency, some variations in the locations of the significant dREG-called TIRs were observed between Mutu-I and Akata cells. However, most of these cell type-specific peaks also aligned near CTCF binding sites. Similar to Mutu-I, positions of most dREG-predicted TIRs in Akata were unchanged from latency, even by 12 h, when transcriptional activity was extensive. Interestingly, a peak at the promoter of the transcriptional activator, BRLF1, appeared in Akata during reactivation (Fig. 3A, middle panel). Transcription at this promoter generates a bicistronic mRNA containing BZLF1 and BRLF1, and these genes are separated by a CTCF site. The early appearance of a peak in the region containing the major lytic transcriptional activators in Akata and not Mutu-I suggests that these cells may differ in their kinetics of reactivation. This was supported by the DeSeq fold change quantification of EBV gene reads, with more viral genes significantly increasing in Pol activity at 1 h and 4 h postreactivation, including BRLF1 and BZLF1, in Akata cells relative to Mutu-I (Fig. 3B to G). Supporting what was visible on the PRO-Seq tracks (Fig. 1), significant increases in Mutu-I at 1 and 4 h postreactivation were mostly limited to genes in the RPMS1-LMP region (Fig. 3B and C), and BZLF1 and BRLF1 were only significantly increased in Pol activity at 12 h postreactivation (Fig. 3D).
We next used PRO-Seq to investigate reactivation in another reactivation model, type III latency LCLs reactivated with the small molecule C60 (31). Cells were treated with either dimethyl sulfoxide (DMSO) (in which the genome remains in a latent state) or C60 treatment to induce reactivation for 24 h and then PRO-Seq. After 24 h of treatment with C60 to induce reactivation, lytic gene activity increased relative to cellular control gene GUSB (Fig. 4B to F, grey bars). No viral genes had statistically more active Pol relative to DMSO treatment (Appendix S2). However, consistent with the other cell types studied, the dREG-predicted TIRs aligned with CTCF binding sites (Fig. 3A, bottom panel), except for a peak at BcLF1, which was also found in Akata cells. As with both Mutu-I and Akata cells, clusters of TIRs were identified at the EBERs, Orilyt(L), and RPMS1 throughout latency and reactivation. A peak was called at the LMP region but did not reach the FDR significance threshold. An overview of dREG-called TIR peaks from all three cell types aligned to CTCF binding sites is shown in Fig. 3, with hatched areas highlighting regions that were consistent in all models. CTCF commonly functions in complex with cohesin. Therefore, a track containing ChIP for the cohesin subunit RAD21 was also included, and peak binding aligned with dREG peaks at the EBER and LMP regions.
FIG 4.
An EBV mutant with a mutated CTCF binding at LMP does not reactivate from latency. (A) Schematic diagram of the deletion in the ΔCTCF EBV mutant. (B to F) RT-qPCR for EBV genes BZLF1 (B), BMRF1 (C), LMP-2A (D), EBER-1 (E), and EBER-2 (F) after induction of reactivation with the small molecule C60 at 12, 24, and 48 h posttreatment of LCL WT or ΔCTCF. Statistical significance was determined by Student’s t test.
CTCF-cohesin rearrangements occur during EBV reactivation.
Using three different models of EBV reactivation, we have shown that Pol activity during reactivation was enriched at CTCF binding sites. However, it remained unknown whether CTCF is directly involved or if increased transcriptional activity was an indirect result of genome accessibility. To investigate the role of CTCF binding on the EBV episome during early reactivation, ChIP-qPCR was performed on latent (untreated) and 4-h-reactivated Mutu-I and Akata cells. Pol II ChIP confirmed an increase in Pol II on the EBV genome during reactivation (Fig. 5A and D). At 4 h postreactivation in Mutu-I cells, there was a decrease in CTCF binding at all 3 sites examined on the EBV genome (LMP-CTCF, Cp-CTCF, and Qp-CTCF) (Fig. 5B), suggesting that CTCF binding decreases upon reactivation. However, ChIP-qPCR in Akata cells gave contrasting results and indicated increased CTCF at the LMP-CTCF and Qp-CTCF binding sites but was unchanged at the Cp-CTCF site (Fig. 5E).
FIG 5.
CTCF-cohesin binding rearrangements during EBV reactivation. ChIP-qPCR at CTCF binding sites on the EBV and human genomes in latent and reactivated Mutu-I cells using antibodies indicated. (A to G) Mutu-I Pol II (A), Mutu-I CTCF (4 h) (B), Mutu-I RAD21 (C), Akata Pol II (D), Akata CTCF (E), Akata RAD21 (F), and Mutu-I CTCF (G) (24 h).
ChIP-qPCR for the cohesin subunit RAD21 in Mutu-I cells revealed an increase in binding at 4h postreactivation at the LMP region (Fig. 5C) but was unchanged in Akata cells (Fig. 5F). This suggested that the CTCF-cohesin complex is dynamic and rearranged upon reactivation, but the details of this process are likely dependent on cell type and reactivation stimulus. Our PRO-Seq data indicated Mutu-I cells have slower reactivation kinetics than Akata cells, and repeating ChIP-qPCR for CTCF in Mutu-I cells at 24 h postreactivation revealed CTCF binding increased relative to 4 h (Fig. 5G). This further supports the conclusion that CTCF rearrangements are linked to the kinetics of reactivation.
A mutation that disrupts CTCF binding at LMP alters transcriptional activity and precludes reactivation.
To further investigate the functional role of CTCF during EBV reactivation, we utilized the LCL EBV mutant (ΔCTCF), which contains an 18-bp deletion at the LMP-2A CTCF binding site (Fig. 4A), resulting in a disruption of CTCF binding (9). The ability of this mutant to reactivate was first assessed using RT-qPCR for viral genes after treatment with C60. The ΔCTCF mutant was found to have a limited ability to reactivate from latency, with very low expression of the lytic genes BZLF1 and BMRF1, even by 48 h postreactivation when the WT exhibited robust expression (Fig. 4B and C). In contrast to WT, ΔCTCF was also unable to express LMP-2A throughout latency and reactivation (Fig. 4D) and showed dysregulated expression of both EBER-1 and EBER-2 (Fig. 4E and F). These results implicate the importance of CTCF binding in the regulation of EBV transcription because disruption of only a single CTCF site precluded expression of lytic mRNAs and reactivation. Therefore, we used PRO-Seq to investigate transcriptional activity in detail on the ΔCTCF mutant 24 h after treatment with either DMSO or C60 compared to the previously studied WT LCL. Compared to the PRO-seq profiles of Mutu-I and Akata cells shown previously, higher activity across the LCL genome (Fig. S3B) was expected because these cells maintain type III latency, the most transcriptionally active of all EBV latency types (3).
Initially focusing on latency (DMSO treatment), the most striking difference between ΔCTCF and WT was a decrease in transcriptional activity (approximately 2.7× [Table S2]) across the mutant genome. To better illustrate the differences between the two transcriptional profiles, a heatmap of the PRO-seq data was generated to visualize the density of reads on both DNA strands (Fig. 6A). This heatmap confirmed the initial PRO-Seq analysis and indicated that ΔCTCF had a lower level of Pol activity across most of the genome relative to WT, although specific sites of the mutant genome had higher activity than WT. Most noticeable of these was an increase in activity at ~11 kb on the ΔCTCF genome (near the Cp promoter), continuing ~30 kb downstream through the W repeat region. Visualization of the PRO-Seq data at this region using IGV (with adjusted scaling to equate total viral reads between viruses) revealed strong peaks on the ΔCTCF genome at the Cp promoter and over the CTCF site just upstream of Cp (Fig. 6B). The EBERs also increased in activity in ΔCTCF relative to WT and are flanked by a CTCF site (Fig. 6B). There were differences in activity at the Qp promoter (50 kb), with ΔCTCF exhibiting a reduction in several prominent peaks visible in WT (Fig. 6C). Further differences were seen at ~139 kb (RPMS1); in particular, Pol activity increased over and adjacent to the CTCF site in the mutant, primarily on the minus strand where no genes map (Fig. 6D). A clear shift in Pol enrichment between the viruses was seen at the LMP region. The WT displayed strong peaks of bidirectional transcription over the CTCF site at LMP-2A (site of the deletion in the mutant) (Fig. 6E). ΔCTCF had a reduction in activity at this site with a marked increase downstream over LMP-2B.
FIG 6.
Mutation of CTCF binding site at LMP alters transcriptional activity at other CTCF binding sites during latency. (A) Heatmap of Pol density on both strands of the viral genome in 50-bp bins. The background level was determined for each virus as the mean of each 50-bp window, and regions in red indicate sites of Pol enrichment above this level. (B to E) High-resolution IGV views of PRO-Seq tracks from EBV genome regions with disrupted activity in the ΔCTCF mutant during latency, aligned to CTCF binding sites at the EBER-Cp region (B), Qp region (C), RPMS1 region (D), or LMP region (E).
After 24 h of treatment with C60 to induce reactivation, specific sites of transcriptional enrichment were identified by fold change analysis across the viral genome and indicated differences between WT and ΔCTCF at the Cp, LMP, BHRF1, and BZLF1 regions (Fig. S3B). Closer analysis of the Cp region of WT revealed 3 peaks of increased Pol activity after C60 treatment at BCRF1, over the CTCF binding site, and at the Cp promoter (Fig. 7A). The peak activity aligning with BCRF1 and the Cp promoter was located downstream of the start codon rather than directly over promoter regions. Spline interpolation analysis indicated that the peak increases directly over the CTCF binding site Cp promoter were statistically significant (in highlighted regions) after reactivation in WT (Fig. 7A). In ΔCTCF, although there was a significant increase at the Cp promoter, there was no change the CTCF binding site and a complete lack of activity at BCRF1. These differences in Pol activity at the Cp region correlated with altered Pol activity throughout the W repeats, which was heavily restricted in WT but highly active in ΔCTCF (Fig. S3B).
FIG 7.
A mutation of the CTCF binding site at LMP changes transcriptional activity at other CTCF binding sites during stimulation of reactivation. (A to D) Spline interpolation of PRO-Seq reads from EBV genome regions with disrupted activity in the ΔCTCF mutant during reactivation, aligned to CTCF binding sites at the BCRF1-Cp region (A), LMP-2A region (B), BHLF1-BHRF1 region (C), and BRLF1-BZLF1 region (D). The shaded area shows confidence of fit. Highlighted regions indicate regions of statistical significance between DMSO and C60 treatment.
At LMP-2A, transcriptional activity was significantly increased over the CTCF binding site after reactivation in WT (Fig. 7B). However, in ΔCTCF, there was no activity at this site in either DMSO and C60 treatment and, as previously noted, a subsequent peak at LMP-2B, but this was unchanged after C60 treatment. At BHRF1, approximately 500 bp downstream of the CTCF site, and which flanks the highly active BHLF1 transcript at OriLyt(L), C60 treatment led to significant increases of Pol activity at the BHRF1 promoter region in WT (Fig. 7C). Such differences were less striking in ΔCTCF and did not reach statistical significance. Activity at the BZLF1 region was also altered between viruses with a significant increase in Pol activity in WT after C60 treatment in the gene body of BZLF1, ~700 bp downstream of the CTCF binding site (Fig. 7D). In ΔCTCF, this increase in Pol activity appeared to shift a further ~900 bp downstream, over the gene body of BZLF2.
In summary, these analyses have indicated that peaks of transcriptional activity occur adjacent to or over multiple CTCF sites during LCL latency. Aberration of a single CTCF binding site at the LMP region disrupts transcriptional activity at other CTCF sites during LCL latency, and this phenotype correlates with an inability to reactivate from latency. These data support the previous findings in Mutu-I and Akata cells and provide functional evidence of the importance of CTCF binding in the control of EBV transcription during latency and reactivation.
CTCF is known to regulate three-dimensional (3D) genome interactions, and Hi-C analyses for both latent LCLs and Mutu-I cells have recently been reported (32). We therefore aligned Hi-C DNA interactomes of LCLs and Mutu-I with our ProSeq data (Fig. 8). This overlay reveals that the sites of transcriptional enrichment identified here by PRO-Seq in both Mutu-I and LCLs aligned with intragenomic contact sites identified in the Hi-C study (Fig. 8) (32). The interaction site deleted in ΔCTCF (LMP1/2 region) is highlighted (9) and clearly shows that this region makes multiple contacts with other regions of the viral genome, especially the EBER-oriP region. In LCLs, additional contacts are detected with Qp, BZLF1, and RPMS1, suggesting the CTCF interactome on the viral genome provides coordinated control of viral reactivation.
FIG 8.
Alignment of PRO-Seq and previously obtained Hi-C data. Significant chromatin loops identified from Hi-C in Morgan et al. (32) were aligned to PRO-Seq data from latent and reactivated Mutu-I cells (A) and latent and reactivated WT and ΔCTCF LCLs (B). The orange loop highlights an interaction found to be lost in ΔCTCF by Chen et al. (9).
DISCUSSION
We initially speculated that EBV regulates the latent-to-lytic switch at the elongation step of transcription via release of Pol II from PPP. However, PRO-Seq data from EBV Mutu-I latency (Fig. 1A) showed no evidence of PPP on any EBV genes normally expressed during productive infection. Instead, the data revealed a background level of transcriptional activity across the entire viral genome, indicating that transcription during latency is not entirely silent, a notion that is also becoming more apparent in studies of other herpesviruses such as herpes simplex virus 1 (HSV-1) (33). We should note that PRO-Seq only measures transcriptional activity, and the ultimate fate of a transcribed RNA depends on its stability, processing, interactions, polyadenylation, and export to the cytoplasm. Nevertheless, peak regions of activity revealed in the PRO-Seq profiles were consistent with what was expected in type I latency (34, 35).
After Mutu-I reactivation, regions of increased Pol activity did emerge but were rarely located at promoters of viral genes, even after alignment to novel transcript data. These transcriptional increases were predominantly located on both strands of the genome around the EBERs, RPMS1, and LMP-2A, which are all traditionally latency associated. It was notable that the IE transcriptional activators BZLF1 and BRLF1 did not exhibit significant increases in Pol activity until 12 h postreactivation in Mutu-I cells. A previous nascent transcriptome study involving NaB-induced reactivation of HH514-16 BL highlighted that there are multiple cellular transcriptional events that occur before BZLF1 expression (36). These observations are consistent with our data, which also identified extensive transcriptional activity that occurs prior to robust BZLF1 expression.
EBV reactivation may potentially be analogous to biphasic HSV-1 reactivation, which begins with widespread transcriptional derepression across the viral genome, allowing for accumulation of the IE transcriptional activator VP16 (37). It is therefore possible that the high levels of activity during EBV reactivation are sites of transcription initiation, as indicated by computation using dREG, and that final gene expression is regulated through other co- or posttranscriptional mechanisms, leading to BZLF1 accumulation. This is supported by previous findings indicating that the EBV transcriptome is highly complex, with abundant antisense transcription, multiple splice junctions, alternative promoter usage, and readthrough transcription (29, 38).
dREG-predicted TIRs can also indicate sites of enhancer RNA (eRNA) initiation, and therefore, lytic gene transcription could be stimulated through long-range interactions between these potential enhancer regions and the IE gene promoters (39). We also cannot rule out that the peaks represent Pol III activity because PRO-Seq clearly detected the strong Pol III EBER transcription and dREG has been shown to detect Pol III TIRs (28). This may also explain the apparent lack of PPP, as Pol III has a greatly reduced pause time relative to Pol II (40).
We discovered a strong correlation between Pol activity and CTCF binding sites on the EBV genome during both latency and reactivation. It was notable that all cell lines used in this study had well-defined PRO-Seq peaks during latency at the EBERs and BHLF1, both of which are located adjacent to CTCF binding sites. Upon reactivation, sites with enriched transcriptional activity, characteristic of initiation, were consistently located adjacent to or over CTCF binding sites in all experiments. CTCF has previously been linked to transcription initiation in the human genome (41), and promoter-proximal CTCF-CTCF loops have been shown to be directly associated with promoter-enhancer contacts to regulate transcription (42). Recent studies have also indicated that active transcription is required to maintain these enhancer-promoter contacts (43).
CTCF is known to be important for maintenance of the epigenetic state during EBV latency (8–10), and so, it is perhaps unsurprising that it is also involved in reactivation. Our ChIP data indicated that CTCF binding is altered during reactivation; however, this contrasts with a previous study that stated CTCF binding is unchanged during EBV reactivation (44). A possible explanation for this difference is that we looked at an earlier time point, 4 h, compared to 24 h in the previous study. Indeed, ChIP-qPCR at 24 h postreactivation in Mutu-I cells showed a recovery in CTCF binding at both the LMP and Cp CTCF loci (Fig. 5G). It is also likely linked to the cell/virus background and method of reactivation as the previous study used Akata cells, which we found to give contrasting results to Mutu-I. In addition, the previous study primarily focused on the BZLF1 CTCF site, which we found to align less well with altered transcriptional activity during reactivation compared to other CTCF sites.
CTCF is involved in maintaining the distinct chromatin loops found in different EBV latency stages (5) and therefore its role in reactivation is likely linked to alteration of viral genome conformation. CTCF function in loop formation involves cohesin, which forms a ring structure around the DNA (45). ChIP for the cohesin subunit, RAD21, revealed peaks at the EBER and LMP regions (Fig. 3A), and these regions are known to physically interact in DNA loops (7). Interestingly, Pol activity at both regions was increased during reactivation in all 3 of the cell lines used in the current study. ChIP-qPCR for RAD21 in Mutu-I cells revealed an increase in binding at 4 h postreactivation at the LMP region, contrasting with the CTCF result. This is possibly due to the unstable nature of the CTCF-cohesin complex (46). Overall, these data suggest that CTCF-cohesin binding is dynamic and rearranges upon reactivation, but the details of this process are linked to cell type, reactivation method, and the subsequent kinetics of reactivation.
The functional link between CTCF binding and reactivation was confirmed using a ΔCTCF (at LMP locus) EBV mutant. Aside from an overall lower level of Pol activity, the differences in transcriptional activity between WT and ΔCTCF were mostly confined to alterations at or near CTCF binding sites during latency. These differences correlated with ΔCTCF being unable to reactivate and express lytic viral mRNAs. The ΔCTCF virus has already been shown to lose an interaction upstream of the OriP region, which is flanked by two CTCF sites at the EBERs and Cp (9). Interestingly, PRO-Seq of the ΔCTCF virus showed that transcription was dysregulated at these regions relative to WT, suggesting that the transcriptional differences resulted from CTCF-mediated variation in genome conformation. Another ΔCTCF EBV mutant, with a deleted binding site at Qp, also displayed altered transcriptional patterns (8). Hi-C analysis of both latent LCLs and Mutu-I cells has indicated that disrupting PARP1-mediated stabilization of CTCF on the viral genome leads to alternative genome conformations (32). In addition, viral gene expression was shown to be dysregulated in these LCLs, and the regions with altered genome conformations strongly correlated with the dysregulated viral genes. Notably, the sites of transcriptional enrichment identified here by PRO-Seq in both Mutu-I and LCLs aligned with intragenomic contact sites identified in the Hi-C study (Fig. 8) (32). Although some sites with transcriptional changes do not loop with LMP, it is apparent that the 3D structure of the EBV genome is complex, increasing the likelihood that disruption of certain interactions may disrupt others indirectly. Taken together, these studies support a link between transcription, CTCF binding, and the structure of the EBV genome.
In summary, our data support the hypothesis that active transcription is the key driver in regulation of the chromatin state of a genome (47). We suggest a model in which the higher-order structure of the viral genome is organized through CTCF-mediated binding. This structure is likely maintained by the active transcription we identified at CTCF binding regions. Upon reactivation, CTCF-cohesin binding is rearranged, thus altering the 3D structure of the viral genome, leading to increased accessibility of Pol and other factors. These rearrangements and alterations in Pol activity are required before BZFL1 is expressed.
MATERIALS AND METHODS
Cells.
EBV-positive Burkitt’s lymphoma cell line Mutu-I and Akata were maintained in RPMI 1640 containing 12% fetal bovine serum (FBS) and antibiotics (penicillin and streptomycin). Mutu-I cells were obtained from Jeffrey Sample, Penn State University, Hershey Medical School, Pennsylvania. Akata was obtained from Elena Mattia, University of Rome, Sapienza, Italy. The EBV WT and ΔCTCF LCLs have been previously described in reference 9. Drosophila melanogaster S2 (ATCC) cells were grown in Schneider’s medium (Lonza) containing 10% FBS and maintained at 23°C.
EBV reactivation.
For PRO-seq, Mutu-I cells were treated with NaB (1 mM) and TPA (20 ng/mL) for 0, 1, 4, or 12 h, Akata cells were treated with 10 μg/mL of goat anti-human IgG (Sigma; catalog no. I1886) for 0, 1, 4, or 12 h, and the EBV WT and ΔCTCF LCLs were treated with DMSO or C60 (5 μM) for 24 h. For ChIP assays, Mutu-I cells were untreated or treated with NaB (1 mM) and TPA (20 ng/mL) for 4 or 24 h, and Akata cells were untreated or treated with 10 μg/mL of goat anti-human IgG for 4 h. For RT-qPCR, the EBV WT and ΔCTCF LCLs were treated with DMSO or C60 (5 μM) for 12, 24, and 48 h.
Nucleus isolation.
Nuclei were isolated following methods previously described (48). In brief, cells were washed 2× with ice-cold phosphate-buffered saline (PBS) and then incubated for 10 min on ice with swelling buffer (10 mM Tris-HCl [pH 7.5], 10% glycerol, 3 mM CaCl2, 2 mM, MgCl22, 0.5 mM dithiothreitol [DTT], protease inhibitors [Roche], and 4 U/mL RNase inhibitors [RNaseOUT; Thermo Fisher]). Cells were scraped from the plate, pelleted via centrifugation at 600 × g for 10 min (4°C), resuspended in lysis buffer (swelling buffer plus 0.5% Igepal), and then incubated on ice for 20 min to release nuclei. Nuclei were pelleted by centrifugation at 1,500 × g for 5 min (4°C) and washed 2× with lysis buffer, followed by a final wash in storage buffer (50 mM Tris-HCl [pH 8.0], 25% glycerol, 5 mM Mg acetate, 0.1 mM EDTA, and 5 mM DTT). Nuclei were resuspended in storage buffer, flash-frozen in LN2, and stored at −80°C.
Precision nuclear run-on assay.
The nuclear run-on assays were performed in triplicate as described in references 25, 48, and 49 for Mutu-I and Akata experiments and duplicate for the LCL experiment. Frozen nuclei were thawed on ice and S2 nuclei spiked in to Mutu-I, Akata, and LCL cells at a ratio of 1:1,000. An equal volume of run-on buffer (10 mM Tris-HCl [pH 8.0]; 5 mM MgCl2; 1 mM DTT; 300 mM KCl; 20 μM biotin-11-ATP, -UTP, -GTP, and -CTP; 0.4 U/mL RNase inhibitor [RNaseOUT]; and 1% Sarkosyl) was added to thawed nuclei. Nuclear run-on was performed under constant shaking for 3 min on a vortex shaker at 37°C. The reaction was terminated by the addition of TRIzol LS (Thermo Fisher) and vortexed to homogenize.
Library preparation.
RNA was extracted from run-on nuclei via TRIzol extraction. Limited hydrolysis of RNA was done by the addition of 0.2N NaOH and incubation on ice for 20 min. Unincorporated nucleotides were removed using a P-30 column (Bio-Rad). Biotinylated RNA was purified using streptavidin M280 Dynabeads (Thermo Fisher) after a series of washes. Bead-bound RNA was washed 2× in ice-cold high-salt wash (50 mM Tris-HCl [pH 7.4], 2 M NaCl, 0.5% Triton X-100, 0.4 U/mL RNase inhibitor [RNaseOUT Thermo Fisher]), 2× in ice-cold medium-salt wash (10 mM Tris-HCl [pH 7.4], 300 mM NaCl, 0.1% Triton X-100, and 0.4 U/mL RNase inhibitor [RNaseOUT]) and once in ice-cold low-salt wash (5 mM Tris-HCl [pH 7.4], 0.1% Triton X-100, and 0.4 U/mL RNase inhibitor [RNaseOUT]). RNA was eluted from beads with two TRIzol extractions.
The 3′-RNA adapter with a 5′ phosphate (5′-Phos) and 3′ inverted dT (InvdT) (5′-Phos-GAUCGUCGGACUGUAGAACUCUGAAC-3′-InvdT) was ligated to the 3′ end of the RNA using T4 RNA ligase I (NEB). RNA was purified as described above by binding to streptavidin beads and washing, and TRIzol was extracted followed by removal of the 5′ cap with 10 U of 5′-pyrophosphohydrolase (RppH) (NEB) and 5′ end repair with T4 PNK (NEB). The 5′-RNA adapter (5′-CCUUGGCACCCGAGAAUUCCA-3′) was ligated to the 5′ end of the RNA using T4 RNA ligase I (NEB). RNA was purified as above by binding to streptavidin beads, washing, and TRIzol extraction.
RNA was reverse transcribed with SuperScript III reverse transcriptase (Thermo Fisher) using the RNA PCR primer 1 (5′-AATGATACGGCGACCACCGAGATCTACACGTTCAGAGTTCTACAGTCCGA-3′). The cDNA was PCR amplified with Phusion high-fidelity DNA polymerase (NEB) using barcoded Illumina PCR index primers. Libraries were purified on an 8% polyacrylamide-Tris-borate-EDTA (TBE) gel and assessed on a fragment analyzer (Bioanalyzer) to confirm the correct library size. One library replicate of Mutu-I 1 h reactivated did not meet quality requirements and was not analyzed. Libraries were sequenced on an Illumina NextSeq 500, performed by the GeneLab at Louisiana State University School of Veterinary Medicine or by the Biotechnology Resource Center (BRC) Genomics facility (RRID SCR_021727) at the Cornell Institute of Biotechnology.
Read processing.
FastQ files were processed using the PRO-Seq pipeline developed by Danko and colleagues at Cornell University (https://github.com/Danko-Lab/utils/tree/master/proseq). The pipeline first trims adapters and aligns reads to a concatenated genome file containing hg38, dm3, EBV genomes, and rRNA. The EBV genome used was dependent on the experiment. (Mutu-1, GenBank accession no. KC207814; Akata, GenBank accession no. KC207813; and LCL, GenBank accession no. NC_007605). SeqMonk software (50) was used to align the output BAM files to the drosophila, EBV, and Hg38 genomes. For viral genes that overlap on the same strand, assignment of the originating gene was not possible. Read counts in these instances were therefore assigned to all overlapping genes. Drosophila spike-in normalization was used to account for variation in sequencing depth between libraries. Libraries were normalized relative to the library with the largest Drosophila read count, with the scale factor set to 1. The total Drosophila reads for all libraries were divided from the largest number to determine the relative normalization scale factor. The IGV genome browser (27) was used for visualization of the output BW files (containing only the 3′ final read position). Individual reads for each nucleotide across the EBV genome were extracted from the BW files using multiBigWigSummary from deepTools (51) and normalized as described above. The resulting TXT files were used for analysis in SeqMonk and converted to BedGraph for IGV visualization for figure production.
Fold change analysis.
Fold change analysis was performed using the R package DeSeq2 (52). DeSeq2 for EBV genes was performed incorporating Hg38 genes to account for the library size correction step. Full viral gene DeSeq2 fold change and P values are given in Appendix S2 in the supplemental material.
dREG analysis.
The PRO-Seq output BW files were merged for each replicate using the mergeBigWigs script from the Danko PRO-Seq pipeline (https://github.com/Danko-Lab/utils/tree/master/proseq). The merged BW files were then imported into the dREG web-based server, freely available at https://dreg.dnasequence.org/, and dREG analysis was performed (28). To identify overlapping regions between cell lines, peaks were aligned to a consensus reference genome (GenBank accession no. NC_007605). A full list of dREG significant peaks and scores is given in Appendix S1.
ChIP-Seq and ATAC-Seq data.
WIG files of ChIP-Seq data from the EBV genome were downloaded from the EBV portal (7), freely available at https://ebv.wistar.upenn.edu/. ATAC-Seq data are available from the NCBI GEO database under accession no. GSE172476.
ChIP-qPCR assays.
ChIP-qPCR assays were performed as described previously (53). Quantification of precipitated DNA was determined using real-time PCR and the threshold cycle (ΔΔCT) method for relative quantitation (ABI 7900HT Fast real-time PCR system). Rabbit IgG (Cell Signaling; catalog no. 2729S), rabbit anti-CTCF (EMD Millipore; catalog no. 07-729), and rabbit anti-Rad21 (Abcam; catalog no. ab992) were used in ChIP assays. Primers for ChIP assays are listed in Table S3.
RT-qPCR.
The EBV WT and ΔCTCF LCLs treated with DMSO or lytic activating compound C60 (31) were harvested at indicated time points. Total RNA was isolated using the RNeasy kit (Qiagen) and then treated with DNase I by using the DNase treatment and removal kit from Ambion. Reverse transcription was carried out on equal amounts of DNase-treated RNA using SuperScript IV reverse transcriptase (Invitrogen), random priming mix (New England Biolabs), and RNase inhibitor (New England Biolabs) following the manufacturer’s instructions. qPCR was performed with Power SYBR green 2× PCR mastermix with primers (Table S4). Data were normalized to the gene GUSB.
Data availability.
The PRO-Seq data are publicly available on the GEO database under the accession numbers GSE188584 and GSE211247.
ACKNOWLEDGMENTS
Portions of this research were conducted with the high-performance SuperMic supercomputer provided by Louisiana State University (http://www.hpc.lsu.edu).
We thank Le Yan for his help in maintaining the PRO-Seq pipeline. We thank Thaya Stoufflet and Vladimir Chouljenko at the LSU SVM GeneLab for their help with the NextSeq 500 sequencing. We also thank Claire Birkenheuer for all her assistance and discussion with the PRO-Seq technique.
This work was supported by NIH grants R01 AI 141968 and R21 AI 148926 to J.D.B. and R01 DE017336 and R01 CA093606 to P.M.L.
Footnotes
Supplemental material is available online only.
Contributor Information
Joel D. Baines, Email: jdb11@cornell.edu.
Felicia Goodrum, The University of Arizona.
REFERENCES
- 1.Hammerschmidt W. 2015. The epigenetic life cycle of Epstein–Barr virus. Curr Top Microbiol Immunol 390:103–117. 10.1007/978-3-319-22822-8_6. [DOI] [PubMed] [Google Scholar]
- 2.Klein E, Kis LL, Klein G. 2007. Epstein–Barr virus infection in humans: from harmless to life endangering virus–lymphocyte interactions. Oncogene 26:1297–1305. 10.1038/sj.onc.1210240. [DOI] [PubMed] [Google Scholar]
- 3.Kang MS, Kieff E. 2015. Epstein–Barr virus latent genes. Exp Mol Med 47:e131. 10.1038/emm.2014.84. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Minarovits J. 2006. Epigenotypes of latent herpesvirus genomes, p 61–80. In Doerfler W, Böhm P (ed), DNA methylation: development, genetic disease and cancer. Springer, Berlin, Heidelberg, Germany. [Google Scholar]
- 5.Tempera I, Klichinsky M, Lieberman PM. 2011. EBV latency types adopt alternative chromatin conformations. PLoS Pathog 7:e1002180. 10.1371/journal.ppat.1002180. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Phillips JE, Corces VG. 2009. CTCF: master weaver of the genome. Cell 137:1194–1211. 10.1016/j.cell.2009.06.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Arvey A, Tempera I, Tsai K, Chen H-S, Tikhmyanova N, Klichinsky M, Leslie C, Lieberman PM. 2012. An atlas of the Epstein-Barr Virus transcriptome and epigenome reveals host-virus regulatory interactions. Cell Host Microbe 12:233–245. 10.1016/j.chom.2012.06.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Tempera I, Wiedmer A, Dheekollu J, Lieberman PM. 2010. CTCF prevents the epigenetic drift of EBV latency promoter Qp. PLoS Pathog 6:e1001048. 10.1371/journal.ppat.1001048. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Chen H-S, Martin KA, Lu F, Lupey LN, Mueller JM, Lieberman PM, Tempera I. 2014. Epigenetic deregulation of the LMP1/LMP2 locus of Epstein-Barr virus by mutation of a single CTCF-cohesin binding site. J Virol 88:1703–1713. 10.1128/JVI.02209-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Lupey-Green LN, Caruso LB, Madzo J, Martin KA, Tan Y, Hulse M, Tempera I. 2018. PARP1 stabilizes CTCF binding and chromatin structure to maintain Epstein-Barr virus latency type. J Virol 92:e00755-18. 10.1128/JVI.00755-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Reusch JA, Nawandar DM, Wright KL, Kenney SC, Mertz JE. 2015. Cellular differentiation regulator BLIMP1 induces Epstein-Barr virus lytic reactivation in epithelial and B cells by activating transcription from both the R and Z promoters. J Virol 89:1731–1743. 10.1128/JVI.02781-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.McKenzie J, El-Guindy A. 2015. Epstein-Barr virus lytic cycle reactivation, p 237–261. In Münz C (ed), Epstein Barr virus volume 2: one herpes virus: many diseases. Springer International Publishing, Cham, Switzerland. [Google Scholar]
- 13.Burton EM, Goldbach-Mansky R, Bhaduri-McIntosh S. 2020. A promiscuous inflammasome sparks replication of a common tumor virus. Proc Natl Acad Sci USA 117:1722–1730. 10.1073/pnas.1919133117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Kenney SC, Mertz JE. 2014. Regulation of the latent-lytic switch in Epstein–Barr virus. Semin Cancer Biol 26:60–68. 10.1016/j.semcancer.2014.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Kraus RJ, Yu X, Cordes B-LA, Sathiamoorthi S, Iempridee T, Nawandar DM, Ma S, Romero-Masters JC, McChesney KG, Lin Z, Makielski KR, Lee DL, Lambert PF, Johannsen EC, Kenney SC, Mertz JE. 2017. Hypoxia-inducible factor-1α plays roles in Epstein-Barr virus’s natural life cycle and tumorigenesis by inducing lytic infection through direct binding to the immediate-early BZLF1 gene promoter. PLoS Pathog 13:e1006404. 10.1371/journal.ppat.1006404. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Guo R, Jiang C, Zhang Y, Govande A, Trudeau SJ, Chen F, Fry CJ, Puri R, Wolinsky E, Schineller M, Frost TC, Gebre M, Zhao B, Giulino-Roth L, Doench JG, Teng M, Gewurz BE. 2020. MYC controls the Epstein-Barr virus lytic switch. Mol Cell 78:653–669.e8. 10.1016/j.molcel.2020.03.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Hill ER, Koganti S, Zhi J, Megyola C, Freeman AF, Palendira U, Tangye SG, Farrell PJ, Bhaduri-McIntosh S. 2013. Signal transducer and activator of transcription 3 limits Epstein-Barr virus lytic activation in B lymphocytes. J Virol 87:11438–11446. 10.1128/JVI.01762-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Li X, Burton EM, Koganti S, Zhi J, Doyle F, Tenenbaum SA, Horn B, Bhaduri-McIntosh S. 2018. KRAB-ZFP repressors enforce quiescence of oncogenic human herpesviruses. J Virol 92:e00298-18. 10.1128/JVI.00298-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Arrand JR, Rymo L. 1982. Characterization of the major Epstein-Barr virus-specific RNA in Burkitt lymphoma-derived cells. J Virol 41:376–389. 10.1128/JVI.41.2.376-389.1982. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Buratowski S. 2009. Progression through the RNA polymerase II CTD cycle. Mol Cell 36:541–546. 10.1016/j.molcel.2009.10.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Palermo RD, Webb HM, West MJ. 2011. RNA polymerase II stalling promotes nucleosome occlusion and pTEFb recruitment to drive immortalization by Epstein-Barr virus. PLoS Pathog 7:e1002334. 10.1371/journal.ppat.1002334. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Birkenheuer CH, Baines JD. 2020. RNA polymerase II promoter-proximal pausing and release to elongation are key steps regulating herpes simplex virus 1 transcription. J Virol 94:e02035-19. 10.1128/JVI.02035-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Parida M, Nilson KA, Li M, Ball CB, Fuchs HA, Lawson CK, Luse DS, Meier JL, Price DH. 2019. Nucleotide resolution comparison of transcription of human cytomegalovirus and host genomes reveals universal use of RNA polymerase II elongation control driven by dissimilar core promoter elements. mBio 10:e02047-18. 10.1128/mBio.02047-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Toth Z, Brulois KF, Wong LY, Lee HR, Chung B, Jung JU. 2012. Negative elongation factor-mediated suppression of RNA polymerase II elongation of Kaposi’s sarcoma-associated herpesvirus lytic gene expression. J Virol 86:9696–9707. 10.1128/JVI.01012-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Kwak H, Fuda NJ, Core LJ, Lis JT. 2013. Precise maps of RNA polymerase reveal how promoters direct initiation and pausing. Science 339:950–953. 10.1126/science.1229386. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Lin Z, Wang X, Strong MJ, Concha M, Baddoo M, Xu G, Baribault C, Fewell C, Hulme W, Hedges D, Taylor CM, Flemington EK. 2013. Whole-genome sequencing of the Akata and Mutu Epstein-Barr virus strains. J Virol 87:1172–1182. 10.1128/JVI.02517-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Thorvaldsdóttir H, Robinson JT, Mesirov JP. 2013. Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief Bioinform 14:178–192. 10.1093/bib/bbs017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Wang Z, Chu T, Choate LA, Danko CG. 2019. Identification of regulatory elements from nascent transcription using dREG. Genome Res 29:293–303. 10.1101/gr.238279.118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.O'Grady T, Wang X, Höner Zu Bentrup K, Baddoo M, Concha M, Flemington EK. 2016. Global transcript structure resolution of high gene density genomes through multi-platform data integration. Nucleic Acids Res 44:e145. 10.1093/nar/gkw629. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Fülöp Á, Torma G, Moldován N, Szenthe K, Bánáti F, Almsarrhad IAA. 2022. Integrative profiling of Epstein–Barr virus transcriptome using a multiplatform approach. Virol J 19:7. 10.1186/s12985-021-01734-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Tikhmyanova N, Schultz DC, Lee T, Salvino JM, Lieberman PM. 2014. Identification of a new class of small molecules that efficiently reactivate latent Epstein–Barr virus. ACS Chem Biol 9:785–795. 10.1021/cb4006326. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Morgan SM, Tanizawa H, Caruso LB, Hulse M, Kossenkov A, Madzo J, Keith K, Tan Y, Boyle S, Lieberman PM, Tempera I. 2022. The three-dimensional structure of Epstein-Barr virus genome varies by latency type and is regulated by PARP1 enzymatic activity. Nat Commun 13:187. 10.1038/s41467-021-27894-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Singh N, Tscharke DC, Goodrum F. 2020. Herpes simplex virus latency is noisier the closer we look. J Virol 94:e01701-19. 10.1128/JVI.01701-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Lin Z, Xu G, Deng N, Taylor C, Zhu D, Flemington EK. 2010. Quantitative and qualitative RNA-Seq-based evaluation of Epstein-Barr virus transcription in type i latency Burkitt’s lymphoma cells. J Virol 84:13053–13058. 10.1128/JVI.01521-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Price AM, Luftig MA. 2015. To be or not IIb: a multi-step process for Epstein-Barr virus latency establishment and consequences for B cell tumorigenesis. PLoS Pathog 11:e1004656. 10.1371/journal.ppat.1004656. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Frey TR, Brathwaite J, Li X, Burgula S, Akinyemi IA, Agarwal S. 2020. Nascent transcriptomics reveal cellular prolytic factors upregulated upstream of the latent-to-lytic switch protein of Epstein-Barr virus. J Virol 94:e01966-19. 10.1128/JVI.01966-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Kim JY, Mandarino A, Chao MV, Mohr I, Wilson AC. 2012. Transient reversal of episome silencing precedes VP16-dependent transcription during reactivation of latent HSV-1 in neurons. PLoS Pathog 8:e1002540. 10.1371/journal.ppat.1002540. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.O'Grady T, Cao S, Strong MJ, Concha M, Wang X, Splinter Bondurant S, Adams M, Baddoo M, Srivastav SK, Lin Z, Fewell C, Yin Q, Flemington EK. 2014. Global bidirectional transcription of the Epstein-Barr virus genome during reactivation. J Virol 88:1604–1616. 10.1128/JVI.02989-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Higgs DR. 2020. Enhancer–promoter interactions and transcription. Nat Genet 52:470–471. 10.1038/s41588-020-0620-7. [DOI] [PubMed] [Google Scholar]
- 40.Zhang J, Cavallaro M, Hebenstreit D. 2021. Timing RNA polymerase pausing with TV-PRO-seq. Cell Rep Methods 1:100083. 10.1016/j.crmeth.2021.100083. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Chernukhin I, Shamsuddin S, Kang SY, Bergström R, Kwon Y-W, Yu W, Whitehead J, Mukhopadhyay R, Docquier F, Farrar D, Morrison I, Vigneron M, Wu S-Y, Chiang C-M, Loukinov D, Lobanenkov V, Ohlsson R, Klenova E. 2007. CTCF interacts with and recruits the largest subunit of RNA polymerase II to CTCF target sites genome-wide. Mol Cell Biol 27:1631–1648. 10.1128/MCB.01993-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Kubo N, Ishii H, Xiong X, Bianco S, Meitinger F, Hu R, Hocker JD, Conte M, Gorkin D, Yu M, Li B, Dixon JR, Hu M, Nicodemi M, Zhao H, Ren B. 2021. Promoter-proximal CTCF binding promotes distal enhancer-dependent gene activation. Nat Struct Mol Biol 28:152–161. 10.1038/s41594-020-00539-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Barshad G, Lewis JJ, Chivu AG, Abuhashem A, Krietenstein N, Rice EJ, Rando OJ, Hadjantonakis AK, Danko CG. 2022. RNA polymerase II and PARP1 shape enhancer-promoter contacts. bioRxiv. 10.1101/2022.07.07.499190. [DOI] [PMC free article] [PubMed]
- 44.Lupey-Green LN, Moquin SA, Martin KA, McDevitt SM, Hulse M, Caruso LB, Pomerantz RT, Miranda JL, Tempera I. 2017. PARP1 restricts Epstein Barr Virus lytic reactivation by binding the BZLF1 promoter. Virology 507:220–230. 10.1016/j.virol.2017.04.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Rowley MJ, Corces VG. 2018. Organizational principles of 3D genome architecture. Nat Rev Genet 19:789–800. 10.1038/s41576-018-0060-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Hansen AS, Pustova I, Cattoglio C, Tjian R, Darzacq X. 2017. CTCF and cohesin regulate chromatin loop stability with distinct dynamics. Elife 6:e25776. 10.7554/eLife.25776. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Wang Z, Chivu AG, Choate LA, Rice EJ, Miller DC, Chu T, Chou S-P, Kingsley NB, Petersen JL, Finno CJ, Bellone RR, Antczak DF, Lis JT, Danko CG. 2022. Prediction of histone post-translational modification patterns based on nascent transcription data. Nat Genet 54:295–305. 10.1038/s41588-022-01026-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Birkenheuer CH, Danko CG, Baines JD. 2018. Herpes simplex virus 1 dramatically alters loading and positioning of RNA polymerase II on host genes early in infection. J Virol 92:e02184-17. 10.1128/JVI.02184-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Mahat DB, Kwak H, Booth GT, Jonkers IH, Danko CG, Patel RK, Waters CT, Munson K, Core LJ, Lis JT. 2016. Base-pair-resolution genome-wide mapping of active RNA polymerases using precision nuclear run-on (PRO-seq). Nat Protoc 11:1455–1476. 10.1038/nprot.2016.086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Babraham Institute. 2020. (Version 1.48.0). https://www.bioinformatics.babraham.ac.uk/projects/seqmonk/.
- 51.Ramírez F, Ryan DP, Grüning B, Bhardwaj V, Kilpert F, Richter AS, Heyne S, Dündar F, Manke T. 2016. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Res 44:W160–W165. 10.1093/nar/gkw257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Love MI, Huber W, Anders S. 2014. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15:550. 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Chau CM, Lieberman PM. 2004. Dynamic chromatin boundaries delineate a latency control region of Epstein-Barr virus. J Virol 78:12308–12319. 10.1128/JVI.78.22.12308-12319.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Appendix S1. Download jvi.01894-22-s0001.xlsx, XLSX file, 0.02 MB (22KB, xlsx)
Appendix S2. Download jvi.01894-22-s0002.xlsx, XLSX file, 0.08 MB (83.6KB, xlsx)
Fig. S1 to S3 and Tables S1 to S4. Download jvi.01894-22-s0003.pdf, PDF file, 3.9 MB (3.9MB, pdf)
Data Availability Statement
The PRO-Seq data are publicly available on the GEO database under the accession numbers GSE188584 and GSE211247.








