Abstract
Caviid gammaherpesvirus 1 (CaGHV-1), formerly known as the guinea pig herpes-like virus, is an oncogenic gammaherpesvirus with a sequenced genome but an as-yet uncharacterized transcriptome. Using nanopore long-read RNA sequencing, we annotated the CaGHV-1 genome and constructed a detailed transcriptomic atlas. Our findings reveal diverse viral mRNAs and non-coding RNAs, along with mapped promoter elements for each viral gene. We demonstrated that the CaGHV-1 RTA lytic cycle transcription factor activates its own promoter, similar to KSHV, and that the CaGHV-1 ORF50 promoter responds to RTA proteins from other gammaherpesviruses, highlighting the evolutionary conservation of RTA-mediated transcriptional mechanisms. Additionally, our analysis uncovered extensive transcriptional overlap within the viral genome, suggesting a role in regulating global gene expression. Given its tumorigenic properties, broad host range, and non-human pathogenicity, this work establishes CaGHV-1 as a promising small animal model for investigating human gammaherpesvirus pathogenesis.
Keywords: guinea pig herpes-like virus, Caviid gammaherpesvirus 1 (CaGHV-1), gammaherpesvirus, transcriptome, long-read sequencing, nanopore sequencing, direct RNA sequencing, luciferase reporter assay
INTRODUCTION
Gammaherpesviruses are a subfamily of herpesviruses that establish lifelong latency in their hosts by infecting lymphocytes. They are medically significant due to their association with various cancers, including lymphomas as well as endothelial and epithelial tumors (1, 2). Epstein-Barr virus (EBV), one of the most well-known human gammaherpesviruses, is associated with infectious mononucleosis and various malignancies (3). Another human gammaherpesvirus is Kaposi’s sarcoma-associated herpesvirus (KSHV), responsible for Kaposi’s sarcoma, primary effusion lymphoma, and a subset of multicentric Castleman diseases, particularly in immunocompromised individuals such as those with AIDS (4). Current research predominantly relies on the murine model, specifically the murine gammaherpesvirus 4 strain 68 (MHV68), for studying the pathogenesis of human gammaherpesviruses in vivo (5). While MHV68 has been valuable for understanding the key viral and host determinants of gammaherpesvirus infection, it is constrained by its limited sequence homology and significant physiological differences compared to human gammaherpesviruses (6, 7). Non-human primate models, such as those employing Rhesus rhadinovirus (RRV) and Retroperitoneal fibromatosis-associated herpesvirus (RFHV), offer more relevant biological insights into human gammaherpesvirus pathogenesis, but their use is limited due to the expenses of maintaining the animals, ethical concerns, the need for specialized facilities, and the complexity of conducting long-term studies in these models (8, 9). These limitations highlight the need for an alternative, affordable small animal model to study human gammaherpesvirus infection and pathogenesis.
Caviid gammaherpesvirus 1 (CaGHV-1), first identified in 1969, was recently sequenced and subsequently classified as a rhadinovirus within the gammaherpesvirus subfamily (10, 11). Previously, CaGHV-1 was referred to as the guinea pig herpes-like virus (GPHLV); however, this term is misleading, as it refers to an actual herpesvirus rather than a “herpes-like” virus. To address this, Stanfield and colleagues proposed renaming it Caviid gammaherpesvirus 1 (11). This revised name will be used consistently throughout the manuscript. The genome of CaGHV-1 spans 103,374 base pairs, with a GC content of 35.4% (11). It encodes 75 predicted open reading frames (ORFs), the majority of which are homologous to genes found in human gammaherpesviruses, such as EBV and KSHV. However, detailed transcriptomic data and the precise mapping of the viral genes are still lacking.
The introduction of long-read RNA sequencing (lrRNA-Seq) technology has revolutionized viral transcriptomics, providing a more comprehensive and accurate mapping of viral RNAs. This approach has led to the discovery of previously unidentified RNA molecules and offered deeper insights into gene expression patterns and regulatory mechanisms (12). It is increasingly evident that viral transcriptomes exhibit a complexity far beyond what was previously understood (13–16). The transcriptomes of several herpesviruses (17–20), including gammaherpesviruses (21–23), have been uncovered using this technique alone or in combination with short-read sequencing.
In this study, we present a comprehensive transcriptomic analysis of CaGHV-1, including the identification of cis-regulatory elements that may control the expression of viral genes. Additionally, we report the first functional evaluation of the transcriptional activity of the CaGHV-1 ORF50-encoded protein, which shares homology with RTA, the lytic cycle inducer in gammaherpesviruses.
RESULTS
General attributes of the transcriptomic analysis of CaGHV-1
To characterize the CaGHV-1 transcriptome, we analyzed the poly(A)-selected RNA fraction from infected cells using direct RNA sequencing (dRNA-Seq) and direct cDNA sequencing (dcDNA-Seq) on the ONT PromethION platform. The sequencing reads were aligned to the viral genome (OQ679822.1) using the minimap2 software. For transcriptome annotation, we utilized the LoRTIA toolkit developed in our laboratory (24). The dcDNA-Seq samples yielded a total of 71,651,155 reads, of which 5,260,508 were identified as viral reads. The dRNA-Seq generated 9,833,670 reads, which included 1,208,372 viral reads. The average read length of the dcDNA-Seq reads was 575.33 nt, while the dRNA-Seq reads had an average length of 1,013.17 nt (Supplemental Table 1; Figures 1 A–C). The LoRTIA software validates the quality of poly(A) sequences and sequencing adapters, while filtering out incorrect transcription start sites (TSSs), transcription end sites (TESs), and introns arising from RNA degradation, erroneous reverse transcription, artefactual PCR amplification, or sequencing mispriming. To enhance the accuracy of transcript annotations generated by the LoRTIA program, stricter filtering criteria were implemented: TSSs and TESs were considered valid only if supported by at least three dcDNA-Seq samples and one dRNA-Seq sample, while introns were identified solely based on dRNA-Seq data and subsequently verified using dcDNA-Seq data. In this study, we present only the canonical transcripts, totaling 278 (Figure 1D), with an average length of 4283.104 nt.
Identification of promoters and transcription start sites of the virus
The initiator sequence at TSSs (Py A N U/A) in the CaGHV-1 genome was found to exhibit lower conservation compared to standard eukaryotic sequences (25). Notably, G/A nucleotides predominated at the TSS, with G nucleotides being especially frequent at the +1 position and T/C nucleotides primarily occupying the −1 position preceding the TSS (Figure 2A). This enrichment of G nucleotides has also been observed in the initiator element (Inr) of the VP5 promoter in herpes simplex virus type 1 (HSV-1) (26–28). Additionally, this Inr motif was identified in several other herpesviruses, including Epstein-Barr virus (EBV) and bovine herpesvirus 1 (BoHV-1). This enrichment of G nucleotides has also been observed in the initiator element (Inr) of the VP5 promoter in herpes simplex virus type 1 (HSV-1) (27) and identified in other herpesviruses, including Epstein-Barr virus (EBV), bovine herpesvirus 1 (BoHV-1) (22, 26, 28). We identified 92 potential TATA boxes in the CaGHV-1 genome, with an average distance of 31.80 nucleotides upstream of the TSSs. Additionally, we found 18 putative CAAT boxes, averaging 112.33 nucleotides upstream, and 5 potential GC boxes approximately 49.2 nucleotides upstream of the TSSs (Figures 2B and 2C). Promoter elements within the −20 to −40 region are notably enriched in T/A nucleotides (Supplemental Table 2). Most of these promoter elements contain a TATA box sequence with the TATTWAA motif, which was previously detected in KSHV (29). This motif plays a key role in initiating the transcription of late genes. This analysis led to the annotation of 162 canonical transcripts. The TSS corresponded to the PAN non-coding RNA (ncRNA), supported by 1,661,506 reads, indicating an exceptionally high transcriptional state. The second most abundant TSS was associated with ORF67, with 65,034 reads. Several viral transcripts, such as the mRNAs of ORF75, ORF52, ORF59, ORF25, and ORF26, also displayed a substantial number of transcript ends.
Identification of polyadenylation signals and transcription end sites of the virus
In this work, we annotated 140 canonical TESs, 131 of which were associated with polyadenylation signals (PAS), with an average distance of 25.94 nucleotides between the TESs and their corresponding poly(A) signals (Supplemental Table 3). The TES environment is defined by A/C cleavage sites and a preference for U/G-rich downstream elements, consistent with eukaryotic transcription termination consensus sequences. Notably, poly(A) signals are predominantly located within the 50-nucleotide upstream region (Figures 3A and B). Furthermore, we mapped TES positions across the entire viral genome using dcDNA-seq, which was validated with dRNA-Seq (Figure 3C).
Introns and splice junctions in the viral transcripts
To explore the splicing landscape of the viral transcriptome, we analyzed our dRNA-Seq data and identified 56 introns, all validated by dcDNA-Seq (Supplemental Table 4). We annotated a higher number of spliced transcripts (79) attributed to the occurrence of alternative splicing events (Supplemental Table 5A). These spliced transcripts map to several genomic regions, including GPHLV-11; ORF20-ORF21; ORF29a-ORF29b; ORF31-ORF29b; ORF38-ORF40; ORF44-ORF50 (encompassing ORFs 45–47, 48, and G3-G4); ORF54-ORF57; ORF63-ORF67; ORF72-ORF73; and ORF75 (Figure 4). We note that we detected a substantially higher number (280 versus 56) of introns in the dcDNA-Seq samples, most of which are likely artifacts of cDNA sequencing methodologies, potentially caused by errors during reverse transcription or second-strand DNA synthesis. To test the reliability of our dRNA-Seq results, we conducted parallel sequencing of mpox virus (a virus lacking splicing). Our results validated the accuracy of the applied sequencing and bioinformatics methods, as no false splice sites were detected in the mpox transcripts. This conclusion was further supported by obtaining the same results with respect to the introns using both the NAGATA (30) and the LoRTIA software (24) for splicing detection. This finding suggests that all spliced transcripts, including those of low abundance, are indeed of biological origin; however, many may represent mere transcriptional noise without functional significance.
We detected splicing events in both ncRNAs and mRNAs of CaGHV-1. In mRNAs, most introns were positioned in 5’-UTRs and 3’-UTRs. However, we identified introns in the coding regions of two genes (ORF50 and ORF57), where splicing resulted in different amino acid compositions at the N-terminal regions of the encoded proteins compared to the non-spliced transcripts. The intron structures of ORF50 and ORF57 are conserved and match those found in the homologous genes of KSHV. Furthermore, we found that ORF29 is composed of two exons, ORF29a and ORF29b, separated by a 3,093-nucleotide intron containing four genes (ORF30, ORF31, ORF32, and ORF33) oriented oppositely to the ORF29 gene. This intron arrangement is conserved in the homologous genes of PRV, HSV, KSHV, and EBV, except in alphaherpesviruses, where the intron contains only two genes (31–33). We identified several intron-containing ncRNAs in the G4-G5, ORF63-64, and OriLyt-R regions. The G4-G5 region, previously described only in KSHV, produces RNAs categorized as non-coding, despite containing small ORFs with no annotated function. In the ORF63–64 region, the antisense ncRNAs and the antisense segments of complex transcripts undergo splicing. Additionally, we detected an intron in a non-coding raRNA mapped to the OriLyt-R region (Figure 4).
Monocistronic viral mRNAs with canonical ORFs
In our analysis, we identified 54 canonical viral mRNAs, which were defined as the most abundant transcript variant for each viral gene (Figure 4 and Supplemental Table 5B). This approach ensures that we capture the predominant functional outputs of the genome. Notably, one of the most prevalent viral transcript named ORF17.5 was located within ORF17 and featured a 5’-truncated ORF, exhibiting greater abundance than its full-length counterpart. The orthologous genes likewise express this embedded gene in all three herpesvirus subfamilies. Furthermore, we detected a transcript overlapping the genomic junction, beginning at the end of the G12 gene and ending within the GPHLV-11 gene.
Multigenic CaGHV-1 transcripts
We identified 60 polycistronic viral mRNAs that encode two or more co-oriented ORFs and 108 complex transcripts containing two or more ORFs, with at least one positioned in an antiparallel orientation. The substantial number of complex viral transcripts indicates a high level of transcriptional complexity within the CaGHV-1 genome, primarily due to the presence of overlapping transcripts and alternative splicing events (Figure 4 and Supplemental Table 5B). The longest polycistronic transcript measured 13,834 nucleotides, spanning seven genes within the ORF4-ORF10 region. While in KSHV, downstream genes within the ORF72-ORF71 and ORF35-36-37 transcripts have been shown to be translated via mechanisms such as termination-reinitiation, involving the utilization of upstream open reading frames (uORFs) (34), no similar mechanisms have been observed in other multigenic herpesviral transcripts. We detected three uORFs upstream of the ORF35 gene in CaGHV-1, but they are not as close to the ATG as in KSHV (Supplemental Figure 1). The longest complex transcript, measuring 22,936 nucleotides, shared a promoter with the PAN ncRNA and encompassed 16 genes spanning from ORF17 to ORF33.
Non-coding viral transcripts
Non-coding RNAs comprise intergenic transcripts (limited to PAN), antisense transcripts, and likely numerous long 5’-UTR variants of mRNAs. A distinct class of ncRNAs is the replication origin-associated RNAs (raRNAs). Similar to KSHV, PAN is the most abundant ncRNA in the CaGHV-1 genome. Additionally, we identified 44 antisense RNAs (asRNAs), 33 of which are located within single genes, while 10 overlap two genes and 1 overlaps three genes (Figure 4 and Supplemental Table 5B). For dRNA-Seq annotation, we utilized the NAGATA software developed by the Depledge laboratory (30). TATA boxes were identified in 26 antisense transcripts, 15 of which contained the TATTWAA sequence characteristic of late gene promoters in the KSHV virus. The average length of ncRNAs is 758.78 nt, with the shortest being 104 nt within ORF16 and the longest, 3,788 nt, located within ORF63–64. It is noteworthy that several asRNAs were detected in the ORF63-ORF64 and ORF75 regions, both in spliced and unspliced forms. Similar asRNAs have also been described in a closely related virus, Murine gammaherpesvirus 68 (35). Polycistronic transcripts with significant distances between their TESs and ATGs, as well as complex transcripts whose most upstream gene stands in an antiparallel orientation, are likely non-coding. We calculated the proportion of overlapping asRNA/mRNA pairs (Supplemental Table 5B).
Our analysis revealed numerous transcripts encoded in the vicinity of viral Oris, termed replication origin-associated RNAs, most of which are non-coding (Figure 5). The replication origins of the CaGHV-1genome were identified by aligning it with the KSHV reference genome and mapping the corresponding KSHV replication regions onto the CaGHV-1 genome. Similar RNAs have also been previously described in other gammaherpesviruses, such as EBV and KSHV (36). We found that several transcripts, coterminating with K3 transcripts, overlap Orilyt-L with their 5’-UTRs. These transcripts are likely ncRNAs, given the large distance between their transcription and translation start sites. We also observed a very long RNA molecule in this region that fully encompasses the replication origin and co-terminates with PAN ncRNA. At the OriLyt-R, ORF72 produces a TES isoform that overlaps the replication origin with its 3’-UTR. Whether this transcript is involved in translation or serves solely to interfere with the replication process remains unknown. Additionally, multiple RNAs were identified with their promoter regions directly associated with OriLyt-R. A particularly intriguing discovery is that the transcripts originating from OriLyt-R utilize the TATTWAA promoter. Since these consensus sequences are recognized by LTF1 (encoded by ORF24), which facilitates the recruitment of RNA polymerase II, thereby regulating global transcription, we propose that this transcription factor may interfere with DNA replication by binding to the replication origin during the late stages of infection. Long complex RNAs were also detected at this region overlapping the entire lytic origin. We detected asRNAs overlapping ORF69, which is located near OriLyt-R but does not overlap it. Similar to CTO-S described in alphaherpesviruses (20, 37), which is also a non-overlapping ncRNA, these transcripts can be considered raRNAs. While we did not identify a latent replication origin in CaGHV-1, this does not rule out that it does not exist.
Additionally, we examined the potential of raRNAs to form RNA/RNA interactions using the IntaRNA program (38). Our analysis showed that three raRNAs (labeled 1, 2, and 3 in Figure 4) in the OriLyt-L regions slightly exceeded the required threshold (−30 kcal/mol) to be considered as potential interacting partners with the mRNAs of four viral genes: ORF9 (raRNA1, −3), ORF50 (raRNA1), ORF64 (raRNA1, −2, −3), and ORF73 (raRNA1, −3) (Supplemental Table 5C). Further studies are needed to determine whether these high values indicate real functionality.
Extensive genome-wide transcriptional overlaps among CaGHV-1 genes
In our work, we uncovered a remarkable level of transcriptional complexity, marked by transcriptional overlaps among convergent, divergent, and co-oriented genes in the CaGHV-1 genome (Figure 6). Our analysis showed that the entire viral genome is transcriptionally active on both DNA strands. Similar extensive transcriptional overlaps have been observed in other gammaherpesviruses, such as EBV (21 and 22) and KSHV (23). Notably, in the majority of convergent clusters (e.g., ORF18-ORF19, ORF22-ORF23, ORF27, ORF29b, ORF38-ORF39, ORF40-ORF42, ORF54-ORF55, ORF64-ORF65, G2-ORF72, ORF74-ORF75), we found ‘hard’ overlaps, where the 3’-ends of canonical transcripts overlap. In another group of convergent clusters (ORF10-K3, ORF44-ORF45, G4-ORF52, ORF57-ORF58), we observed ‘soft’ overlaps, where canonical transcripts do not terminate within each other; however, transcriptional overlaps are occasionally generated through transcriptional readthrough between convergent or parallel-oriented gene pairs, or through head-to-head overlaps of divergent gene pairs.
Transcriptional Activity of ORF50 and Comparison with Gammaherpesviral Homologs
The homologs of the ORF50 gene product in gammaherpesviruses encode the replication and transcription activator protein (RTA), which is essential for driving the viral lytic cycle by activating the promoters of lytic genes, including that of ORF50. In our study, we found that the most abundant mRNAs of the CaGHV-1 ORF50 gene are initiated from two adjacent TSSs and contain four exons (Figure 7A). The RTA protein consisting of 643 amino acid residues is encoded by the first two exons and is expressed as a 90-kDa protein (Figure 7B). To assess the transcriptional activity of CaGHV-1 RTA (gpRTA) on a viral promoter, we cloned a 3-kb DNA region upstream of the ORF50 translational start site into a luciferase reporter vector. The luciferase reporter plasmid was co-transfected with increasing amounts of 3xFLAG-gpRTA into HEK293T and the guinea pig fibroblast cell line 104C1. Luciferase assays demonstrated that gpRTA greatly induced the ORF50 promoter in a dose-dependent manner in both cell lines (Figure 7C). We also tested the effect of gpRTA on shorter 2 kb and 1 kb regions of the ORF50 promoter. We found that while gpRTA similarly activated the 3, 2, and 1 kb promoters of ORF50 in HEK293T cells, activation of the 2 kb promoter in the guinea pig cell line 104C1 was reduced by 4-fold compared to the 3 kb promoter, suggesting cell type and/or species-specific differences in the gpRTA-mediated promoter activation (Figure 7D). To compare the promoter inducing function of gpRTA with its homologs from other gammaherpesviruses, we performed luciferase assays using the 3 kb promoter of CaGHV-1 ORF50 alongside RTAs from MHV68, EBV, HVS, KSHV, CaGHV-1, and RRV (Figure 7E). The results revealed that while gpRTA efficiently induced the CaGHV-1 ORF50 promoter in both human and guinea pig cell lines, the other gammaherpesvirus RTAs showed varying levels of promoter activation in the two cell lines (Figure 7F). In conclusion, our findings indicate that gpRTA shares functional similarities with its gammaherpesvirus homologs regarding its ability to induce the promoter of its own gene.
DISCUSSION
In this study, we integrated ONT dcDNA-Seq and dRNA-Seq data and used two bioinformatics software tools (LoRTIA and NAGATA) to construct a comprehensive transcriptome map of CaGHV-1. Specifically, we mapped a number of canonical mono- and polycistronic mRNAs, along with non-coding transcripts, including intergenic (PAN), antisense, and replication origin-associated RNA molecules, as well as transcripts with truncated ORFs. Furthermore, we also described fusion and complex RNA molecules. We also identified cis-regulatory elements with single-nucleotide resolution, including TSS initiation motifs, promoter elements, poly(A) signals, 3’-cleavage sites, and splice junctions. For TSS identification, we employed the LoRTIA software package (24), which enabled us to filter out false 5’-ends by excluding reads with incorrect template switching adapters. To reduce false positives, only TSS present in all three samples were accepted, leading to the identification of 165 TSS and 93 TATA-boxes. Notably, the TATTWAA motif - previously described in KSHV, EBV, and CMV - was detected upstream of several TSSs, distinct from the TATA-box motif used by early genes (39). These sequences serve as binding sites for LTF1 and vTA, which are crucial for the transcription of late herpesviral genes, as they recruit RNA Polymerase II (29). Moreover, the InR pattern we observed aligns with previous findings across all herpesvirus families, indicating that the nucleotide composition around the TSS is highly conserved (13, 22, 26).
To identify mRNA 3’-ends, we employed the LoRTIA software package, selecting only those 3’-ends that were present in all three samples and contained three or more adenines at the 3’-end. This approach is currently the most accurate method for detecting mRNA 3’-ends, as it effectively filters out false 3’-ends caused by false priming or template switching (24). Additionally, we used dRNA-Seq for validation, which further eliminates false results because it processes native RNA molecules. Through this process, we identified 143 TESs, most of which contained the canonical AAUAAA motif characteristic of eukaryotic mRNAs. This motif has been detected across all three herpesvirus subfamilies, suggesting that efficient 3’-end cleavage is regulated by host factors (40, 41).
The presence of RNAs overlapping the lytic replication origin has been observed in several DNA viruses (36). In HCMV, the RNA 4.9 is the most abundant viral RNA and plays a regulatory role in replication, while in KSHV, expression of the T1.5 transcript is associated with various biological functions, such as promoting cell survival, immune modulation, and contributing to angiogenesis and pathogenesis in infected cells (42–44). In this study, we identified several raRNAs in both lytic Ori regions of CaGHV-1. Some of these RNAs originate directly from the Ori, while others, including relative short ncRNAs and long polycistronic transcripts, overlap with the Ori and extend into the cis-regulatory elements of neighboring genes. One notable overlap spans from ORF8 to PAN. It is intriguing to speculate why this long transcript uses the PAN PAS signal rather than terminating earlier. One possible explanation is that during the transcription of long RNAs, collisions between RNA and DNA polymerases may occur. Additionally, such overlaps could inhibit the transcription of genes, thereby helping to separate the processes of replication and transcription (45). Thus, these RNAs may play multiple roles in regulating the viral life cycle. Moreover, an interesting observation is that the OriLyt-R region contains the TATTWAA motif necessary for late gene transcription, hinting at an interaction between transcription regulation and DNA replication. Understanding this phenomenon could pave the way for new research directions, potentially leading to interventions targeting viral molecular mechanisms.
Using dRNA-Seq, validated by dcDNA-Seq and parallel sequencing of mpox transcripts (which lack introns), we identified numerous introns in both the coding and UTR regions of several mRNAs, as well as in non-coding transcripts of CaGHV-1. For spliced transcripts, we observed a pattern similar to that found in KSHV and MHV68. Notable examples include the ORF50 (RTA) mRNA, which plays a key role in initiating and regulating the lytic cycle of the virus, and the ORF64 mRNA (35, 46, 47). Additionally, we found a high degree of isoform diversity in ORF73 (LANA), a protein involved in maintaining latent infection and immune evasion (48).
We also detected a large number of antisense, complex, and polycistronic transcripts, contributing to an intricate network of gene overlaps throughout the genome. These overlaps occur between convergent, divergent, and parallel genes. Read-throughs between convergent genes and overlaps between divergent genes result in antisense segments on the generated transcript. In HSV-1, it has been demonstrated that these form dsRNAs, which are inhibited by the virion host-shutoff (VHS) gene product, suggesting that the actual frequency of read-throughs may be higher than observed (49). Genome-wide transcriptional read-throughs contribute to widespread antisense activity, consistent with findings in other gammaherpesviruses. We have proposed a “transcriptional interference network” hypothesis for this phenomenon, suggesting that competition and collision between transcriptional machineries during interactions between neighboring genes serve as a regulatory mechanism (50). Moreover, transcriptional overlaps were detected not only between genes but also at the genomic ends, where RNAs spanning the circular genomic junctions were observed. These RNA molecules have also been confirmed in KSHV and EBV (51); however, they are often overlooked in studies due to their low abundance (52).
Based on the similarity in gene organization between KSHV and CaGHV-1, we cloned the CaGHV-1 ORF50 gene, which is anticipated to encode RTA, a viral transcription factor conserved across all gammaherpesviruses and essential for inducing the lytic cycle (53). A key feature of gammaherpesviral RTAs is their ability to bind to and activate the ORF50 promoter, initiating a positive feedback loop that enhances the expression of RTA and other lytic genes. Our findings show that CaGHV-1 RTA strongly activates the CaGHV-1 ORF50 promoter. Interestingly, several homologs of CaGHV-1 RTA from other gammaherpesviruses also activate the CaGHV-1 ORF50 promoter, highlighting functional similarities between CaGHV-1 RTA and other gammaherpesvirus RTA proteins in promoting viral transcription. Further studies are needed to assess the extent to which CaGHV-1 RTA shares functional similarities with human gammaherpesvirus RTAs, particularly in regulating viral and host gene expression and influencing protein degradation. By mapping the genes and regulatory regions of CaGHV-1, and cloning its RTA, we pave the way for developing CaGHV-1 as a novel model to investigate the biology and pathogenesis of human gammaherpesvirus infections.
MATERIALS AND METHODS
Cells and Virus
CaGHV-1 (VR-543), the guinea pig (GP) fibroblast cell line 104C1 (CRL-1405), and HEK293T cells were purchased from ATCC. The cell lines 104C1 and HEK293T were grown in RPMI-1640 and DMEM media, respectively, supplemented with 10% FBS and penicillin/streptomycin. CaGHV-1 was amplified in the cell line 104C1 followed by the concentration of the virus supernatant by ultracentrifugation. 104 GP cells were infected with CaGHV-1, and the cells were collected at eight time points (4h, 8h, 16h, 24h, 48h, 72h, 96h, and 120h) post-infection. The samples from each time point were mixed in equal volumes for both dRNA-Seq and cDNA-Seq.
DNA cloning and luciferase reporter assay
The protein coding sequence of CaGHV-1 ORF50 was PCR amplified and cloned into the pCDH-CMV-MCS-EF1-puro expression vector using In-Fusion cloning (TaKaRa). The cloning of the other gammaherpesvirus RTAs has been published in our previous study (54). CaGHV-1 RTA was expressed as an N-terminally 3xFLAG-tagged protein in HEK293T cells by transfecting the cells with PEI (Polysciences). The CaGHV-1 ORF50 promoter fragments were PCR amplified and cloned into pGL4.15 luciferase reporter vector (Promega) using In-Fusion cloning. In the luciferase reporter assays, 100 ng of reporter plasmids were co-transfected with 400 ng of RTA expression plasmids. To assess the effect of varying RTA levels, increasing amounts of RTA expression plasmids (50, 100, 200, and 400 ng) were used for transfection. The luciferase assay was performed as described previously (55).
Isolation of RNA
Total RNA was isolated using TRIzol reagent (Invitrogen) according to the manufacturer’s protocol with some modifications. After adding chloroform to the cells lysed in TRIzol, the lysates were spun down. Afterwards the supernatants were mixed with 100% ethanol and added to RNeasy columns. The RNA purification was performed by the protocol of the RNeasy kit (Qiagen). The polyadenylated RNA enrichment was carried out using Lexogen’s Poly(A) RNA Selection Kit V1.5. The RNA samples were bound to beads, washed, and hybridized. After incubation and washing, the polyadenylated RNA was eluted in nuclease-free water and stored at −80°C for subsequent analysis.
Direct cDNA sequencing
Direct cDNA sequencing was performed on the Oxford Nanopore Technologies (ONT) Mk1B and Promethion P2 Solo devices. For the preparation of sequencing libraries, we used the Ligation Sequencing V14 – dcDNA-Seq (SQK-LSK114) kit. Currently, the manufacturer does not provide barcoding for direct cDNA libraries, so we combined it with the Ligation Sequencing gDNA - Native Barcoding Kit 24 V14 (SQK-NBD114.24) for sample barcoding. For each sample, the initial amount was 1 µg total RNA, which was mixed with a VN primer (User-Supplied VNP 2 µM, ordered from IDT) and a 10 mM dNTP mix, then incubated at 65°C for 5 minutes. This was followed by cooling on a pre-chilled freezer block for 1 minute, and then adding the 5x RT Buffer, RNaseOUT (Thermo Fisher Scientific), and the Strand-Switching Primer (User-Supplied SSP 10 µM, ordered from IDT), followed by heating at 42°C for 2 minutes.
Reverse transcription and first-strand cDNA synthesis were carried out using the Maxima H Minus Reverse Transcriptase enzyme (Thermo Fisher Scientific), with the reaction occurring at 42°C for 90 minutes and enzyme inactivation at 85°C for 5 minutes. The RNA molecules were digested from the RNA-cDNA hybrids using RNase Cocktail Enzyme Mix (Thermo Fisher Scientific) at 37°C for 10 minutes. For second-strand cDNA synthesis, we used LongAmp Taq Master Mix [New England Biolabs (NEB)] and a PR2 Primer (User-Supplied 10 µM, ordered from IDT), with the PCR reaction involving Denaturation at 94°C for 1 minute (1 cycle), Annealing at 50°C for 1 minute (1 cycle), and Extension at 65°C for 15 minutes (1 cycle).
The double-stranded cDNAs then underwent end-repair and dA-tailing using the NEBNext® Ultra II End Repair/dA-tailing Module, incubated at 20°C for 5 minutes and 65°C for 5 minutes. For subsequent steps, we used the ONT Ligation Sequencing gDNA - Native Barcoding Kit 24 V14 (SQK-NBD114.24) protocol for sample barcoding. End-prepped DNAs were barcoded, and NEB Blunt/TA Ligase Master Mix (NEB) was added, followed by a 20-minute incubation at room temperature (RT), then EDTA addition. This was followed by ligation of the Native Adapter (NA) included in the kit, using the NEBNext Quick Ligation Module (NEB) enzyme and buffer. AMPure XP Beads (AXP, from the ONT kit) were used for DNA purification after each enzymatic step. Samples were then eluted in nuclease-free water. For concentration measurement, we used the Qubit 4.0 fluorometer and the Qubit dsDNA HS Assay kit. From the prepared cDNA libraries, 50 fmol/flow cell was loaded into R10.4.1 flow cells. To prevent “barcode hopping,” early time points were sequenced separately using an R10.4.1 flow cell (FLO-MIN114) and an R10.4.1 flow cell (FLO-PRO114M), and later time points were sequenced using an R10.4.1 flow cell (FLO-PRO114M).
Native RNA sequencing
For native RNA sequencing, we used the ONT Direct RNA Sequencing (SQK-RNA004) kit. For library preparation, we pooled 1 µg of total RNA from the samples into 8.5 µl. First, we ligated an RT Adapter (RTA) to the samples using NEBNext® Quick Ligation Reaction Buffer (NEB), T4 DNA Ligase (2M U/ml, NEB), and RNaseOUT™ Recombinant Ribonuclease Inhibitor (Invitrogen), followed by a 10-minute incubation at room temperature. The next step involved adding a reverse transcription master mix to the adapter-ligated RNA, which contained 10 mM dNTPs, 5X First-Strand Buffer, and DTT. Synthesis of the cDNA strand was performed using SuperScript™ III Reverse Transcriptase (Thermo Fisher Scientific), with the reaction run at 50°C for 50 minutes, followed by inactivation at 70°C for 10 minutes. The RNA-cDNA hybrids then had the RNA Ligation Adapter (RLA) ligated using NEBNext Quick Ligation Reaction Buffer and T4 DNA Ligase. After each enzymatic reaction, we used Agencourt RNAClean XP beads for purification. For concentration measurement, we used the Qubit 4.0 fluorometer and Qubit dsDNA HS Assay Kit. The prepared library was sequenced on the Promethion P2 Solo device using an RNA flow cell (FLO-PRO004RA).
Bioinformatics
The raw current signals underlying the analyses were initially assigned to nucleotides using the Dorado-0.8.2 basecaller. Reads were aligned to the reference genome (accession number: OQ679822.1) using the minimap2 software with the following parameters: Y -C5 -ax splice –cs. The identifiers and their availability in the European Nucleotide Archive (ENA) database are listed in Supplemental Table 6. SeqTools (https://github.com/moldovannorbert/seqtools) was employed for promoter element identification and basic statistical calculations. The LoRTIA tool, developed by our research group, was used to detect TSS, TES, and introns (“features”) and to annotate transcripts (https://github.com/zsolt-balazs/LoRTIA, v0.9.9). The first phase of the process involved identifying sequencing adapters, homopolymer As, and removing erroneous reads generated by RNA degradation, template switching, or faulty priming. The parameters for this step were as follows: Samprocessor.py --five_adapter GCTGATATTGCTGGG --five_score 14 --check_in_soft 15 --three_adapter AAAAAAAAAAAAAAA --three_score 14 input output.
In the next step, potential TSS and TES positions were identified. The first nucleotide that did not align with the adapter was marked as a potential TSS, while the last nucleotide that did not align with the homopolymer A was designated as a potential TES. This analysis was conducted for each ‘sam’ file using the following commands: Stats.py -r genome -f r5 -b 10 and Stats.py -r genome -f l5 -b 10 for TSS detection, Stats.py -r genome -f r3 -b 10 for TES detection, and Stats.py -r genome -f in for intron identification. Adapter alignment was evaluated using the Smith-Waterman algorithm. False 3’-ends arising from false priming or template switching were removed if at least three adenines preceded the homopolymer A. To further validate the TES positions and exclude those resulting from internal priming or other errors, the poly(A) length estimation module in the Dorado software package was used to identify and estimate poly(A) sites in dRNA-Seq samples (default settings were applied). Reads in the dRNA samples were identified and assigned to transcripts using the NAGATA software (default settings) (30).
In the third phase, potential TSSs and TESs were evaluated using the Poisson distribution to filter out random start and end positions caused by RNA degradation. Significance was corrected using the Bonferroni method. Features observed in fewer than two reads or with coverage of less than 1% were excluded from further analysis. Additional criteria required TSSs to appear in at least three direct cDNA samples, and TESs to occur in direct RNA samples. The command Gff_creator.py -s poisson -o was used for this step. Subsequently, we ran the transcript annotation module, which assigns validated features (TSSs, TESs, and introns) to each read using the parameters: Transcript_Annotator_two_wobbles.py -z 20 -a 10.
Statistical charts in Figure 1, along with nucleotide distribution and Log₁₀ line plot diagrams in Figures 2 and 3, were visualized using the Matplotlib Python library. Nucleotide sequences were extracted using the Bedtools getfasta software package. The Integrative Genomics Viewer (IGV) was used for overall transcriptome visualization.
Supplementary Material
IMPORTANCE.
The molecular underpinnings of gammaherpesvirus pathogenesis remain poorly understood, partly due to limited animal models. This study provides the first comprehensive transcriptomic atlas of CaGHV-1, highlighting both coding and non-coding RNAs and revealing regulatory elements that drive viral gene expression. Functional studies of the CaGHV-1 RTA transcription factor demonstrated its ability to self-activate and cross-activate promoters from homologous gammaherpesviruses, reflecting conserved mechanisms of transcriptional control. These findings solidify CaGHV-1 as a unique and versatile small animal model, offering new opportunities to investigate gammaherpesvirus replication, transcriptional regulation, and tumorigenesis in a controlled experimental system.
Acknowledgements
The research was funded by the National Research, Development and Innovation Office (NRDIO), through the Researcher-initiated research projects (Grant number: K 142674) awarded to ZB. ZT was supported by the NIH grant R01DE028331. DT was supported by NRDIO FK 142676. The publication fee was covered by the University of Szeged, Open Access Fund: 7358.
Footnotes
Ethics declarations
Not applicable
Conflicts of interests
The authors do not declare any conflicts of interest.
Data availability
The sequencing datasets generated in this study are available at the European Nucleotide Archive under the accession: PRJEB80811 and link https://www.ebi.ac.uk/ena/browser/view/PRJEB80811.
References
- 1.Muralidhar S, Pumfery AM, Hassani M, Sadaie MR, Azumi N, Kishishita M, Brady JN, Doniger J, Medveczky P, Rosenthal LJ. 1998. Identification of Kaposin (Open Reading Frame K12) as a Human Herpesvirus 8 (Kaposi’s Sarcoma-Associated Herpesvirus) Transforming Gene. J Virol 72:4980–4988. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Young LS, Yap LF, Murray PG. 2016. Epstein-Barr virus: More than 50 years old and still providing surprises. Nat Rev Cancer. Nature Publishing Group; 10.1038/nrc.2016.92. [DOI] [PubMed] [Google Scholar]
- 3.Kieff E RA. 2007. Epstein-Barr Virus and its replication. Fields Virol 5th:2603–2654. [Google Scholar]
- 4.Dittmer DP, Krown SE. 2007. Targeted therapy for Kaposi’s sarcoma and Kaposi’s sarcoma-associated herpesvirus. Curr Opin Oncol 19:452–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Rajcáni J, Blaskovic D, Svobodová J, Ciampor F, Hucková D, Staneková D. 1985. Pathogenesis of acute and persistent murine herpesvirus infection in mice. Acta Virol 29:51–60. [PubMed] [Google Scholar]
- 6.Sunil-Chandra NP, Efstathiou S, Nash AA. 1992. Murine gammaherpesvirus 68 establishes a latent infection in mouse B lymphocytes in vivo. J Gen Virol 73 ( Pt 12):3275–9. [DOI] [PubMed] [Google Scholar]
- 7.Virgin HW, Latreille P, Wamsley P, Hallsworth K, Weck KE, Dal Canto AJ, Speck SH. 1997. Complete sequence and genomic analysis of murine gammaherpesvirus 68. J Virol 71:5894–904. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Chang Y, Cesarman E, Pessin MS, Lee F, Culpepper J, Knowles DM, Moore PS. 1994. Identification of Herpesvirus-Like DNA Sequences in AIDS-Sssociated Kaposi’s Sarcoma. Science (80- ) 266:1865–1869. [DOI] [PubMed] [Google Scholar]
- 9.Rose TM, Strand KB, Schultz ER, Schaefer G, Rankin GW, Thouless ME, Tsai CC, Bosch ML. 1997. Identification of two homologs of the Kaposi’s sarcoma-associated herpesvirus (human herpesvirus 8) in retroperitoneal fibromatosis of different macaque species. J Virol 71:4138–44. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Hsiung GD, Kaplow LS. 1969. Herpeslike virus isolated from spontaneously degenerated tissue culture derived from leukemia-susceptible guinea pigs. J Virol 3:355–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Stanfield BA, Ruiz E, Chouljenko VN, Kousoulas KG. 2024. Guinea pig herpes like virus is a gamma herpesvirus. Virus Genes 60:148–158. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Boldogkői Z, Moldován N, Balázs Z, Snyder M, Tombácz D. 2019. Long-Read Sequencing – A Powerful Tool in Viral Transcriptome Research. Trends Microbiol 27:578–592. [DOI] [PubMed] [Google Scholar]
- 13.Tombácz D, Prazsák I, Szűcs A, Dénes B, Snyder M, Boldogkői Z. 2018. Dynamic transcriptome profiling dataset of vaccinia virus obtained from long-read sequencing techniques. Gigascience 7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Moldován N, Tombácz D, Szucs A, Csabai Z, Balázs Z, Kis E, Molnár J, Boldogkoi Z, Szűcs A, Csabai Z, Balázs Z, Kis E, Molnár J, Boldogkői Z. 2018. Third-generation Sequencing Reveals Extensive Polycistronism and Transcriptional Overlapping in a Baculovirus. Sci Rep 8:8604. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Torma G, Tombácz D, Csabai Z, Moldován N, Mészáros I, Zádori Z, Boldogkői Z. 2021. Combined Short and Long-Read Sequencing Reveals a Complex Transcriptomic Architecture of African Swine Fever Virus. Viruses 13:579. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Kakuk B, Dörmő Á, Csabai Z, Kemenesi G, Holoubek J, Růžek D, Prazsák I, Dani VÉ, Dénes B, Torma G, Jakab F, Tóth GE, Földes F V., Zana B, Lanszki Z, Harangozó Á, Fülöp Á, Gulyás G, Mizik M, Kiss AA, Tombácz D, Boldogkői Z. 2023. In-depth Temporal Transcriptome Profiling of Monkeypox and Host Cells using Nanopore Sequencing. Sci Data 10:262. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Moldován N, Tombácz D, Szucs A, Csabai Z, Snyder M, Boldogkoi Z. 2018. Multi-platform sequencing approach reveals a novel transcriptome profile in pseudorabies virus. Front Microbiol 8:1–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Prazsák I, Moldován N, Balázs Z, Tombácz D, Megyeri K, Szűcs A, Csabai Z, Boldogkői Z, Szucs A, Csabai Z, Boldogkoi Z, Szűcs A, Csabai Z, Boldogkői Z. 2018. Long-read sequencing uncovers a complex transcriptome topology in varicella zoster virus. BMC Genomics 19:873. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Depledge DP, Srinivas KP, Sadaoka T, Bready D, Mori Y, Placantonakis DG, Mohr I, Wilson AC. 2019. Direct RNA sequencing on nanopore arrays redefines the transcriptional complexity of a viral pathogen. Nat Commun 10:754. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Tombácz D, Torma G, Gulyás G, Fülöp Á, Dörmő Á, Prazsák I, Csabai Z, Mizik M, Hornyák Á, Zádori Z, Kakuk B, Boldogkői Z. 2023. Hybrid sequencing discloses unique aspects of the transcriptomic architecture in equid alphaherpesvirus 1. Heliyon 9:e17716. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.O’Grady T, Wang X, Höner zu Bentrup K, Baddoo M, Concha M, Flemington EK, Höner zu Bentrup K, Baddoo M, Concha M, Flemington EK. 2016. Global transcript structure resolution of high gene density genomes through multi-platform data integration. Nucleic Acids Res 44:e145–e145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Fülöp Á, Torma G, Moldován N, Szenthe K, Bánáti F, Almsarrhad IAA, Csabai Z, Tombácz D, Minárovits J, Boldogkői Z. 2022. Integrative profiling of Epstein-Barr virus transcriptome using a multiplatform approach. Virol J 19:7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Prazsák I, Tombácz D, Fülöp Á, Torma G, Gulyás G, Dörmő Á, Kakuk B, McKenzie Spires L, Toth Z, Boldogkői Z. 2024. KSHV 3.0: a state-of-the-art annotation of the Kaposi’s sarcoma-associated herpesvirus transcriptome using cross-platform sequencing. mSystems 10.1128/msystems.01007-23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Balázs Z, Tombácz D, Csabai Z, Moldován N, Snyder M, Boldogkői Z, Boldogkoi Z, Boldogkői Z. 2019. Template-switching artifacts resemble alternative polyadenylation. BMC Genomics 20:824. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Javahery R, Khachi A, Lo K, Zenzie-Gregory B, Smale ST. 1994. DNA sequence requirements for transcriptional initiator activity in mammalian cells. Mol Cell Biol 14:116–27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Moldován N, Torma G, Gulyás G, Hornyák Á, Zádori Z, Jefferson VA, Csabai Z, Boldogkői M, Tombácz D, Meyer F, Boldogkői Z. 2020. Time-course profiling of bovine alphaherpesvirus 1.1 transcriptome using multiplatform sequencing. Sci Rep 10:20496. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Huang CJ, Petroski MD, Pande NT, Rice MK, Wagner EK. 1996. The herpes simplex virus type 1 VP5 promoter contains a cis-acting element near the cap site which interacts with a cellular protein. J Virol 70:1898–1904. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Tombácz D, Moldován N, Balázs Z, Gulyás G, Csabai Z, Boldogkői M, Snyder M, Boldogkői Z. 2019. Multiple Long-Read Sequencing Survey of Herpes Simplex Virus Dynamic Transcriptome. Front Genet 10:1–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Nandakumar D, Glaunsinger B. 2019. An integrative approach identifies direct targets of the late viral transcription complex and an expanded promoter recognition motif in Kaposi’s sarcoma-associated herpesvirus. PLoS Pathog 15:e1007774. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Abebe JS, Alwie Y, Fuhrmann E, Leins J, Mai J, Verstraten R, Schreiner S, Wilson AC, Depledge DP. 2024. Nanopore guided annotation of transcriptome architectures. mSystems 9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Chen J, Ueda K, Sakakibara S, Okuno T, Yamanishi K. 2000. Transcriptional Regulation of the Kaposi’s Sarcoma-Associated Herpesvirus Viral Interferon Regulatory Factor Gene. J Virol 74:8623–8634. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Renne R, Blackbourn D, Whitby D, Levy J, Ganem D. 1998. Limited Transmission of Kaposi’s Sarcoma-Associated Herpesvirus in Cultured Cells. J Virol 72:5182–5188. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Majerciak V, Yang W, Zheng J, Zhu J, Zheng Z-M. 2018. A Genome-Wide Epstein-Barr Virus Polyadenylation Map and Its Antisense RNA to EBNA. J Virol 93. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Kronstad LM, Brulois KF, Jung JU, Glaunsinger BA. 2013. Dual Short Upstream Open Reading Frames Control Translation of a Herpesviral Polycistronic mRNA. PLoS Pathog 9:e1003156. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.O’Grady T, Feswick A, Hoffman BA, Wang Y, Medina EM, Kara M, van Dyk LF, Flemington EK, Tibbetts SA. 2019. Genome-wide Transcript Structure Resolution Reveals Abundant Alternate Isoform Usage from Murine Gammaherpesvirus 68. Cell Rep 27:3988–4002.e5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Torma G, Tombácz D, Csabai Z, Almsarrhad IAA, Nagy GÁ, Kakuk B, Gulyás G, Spires LM, Gupta I, Fülöp Á, Dörmő Á, Prazsák I, Mizik M, Dani VÉ, Csányi V, Harangozó Á, Zádori Z, Toth Z, Boldogkői Z. 2023. Identification of herpesvirus transcripts from genomic regions around the replication origins. Sci Rep 13:16395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Tombacz D, Csabai Z, Olâh P, Havelda Z, Sharon D, Snyder M, Boldogkôi Z. 2015. Characterization of novel transcripts in pseudorabies virus. Viruses 7:2727–2744. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Mann M, Wright PR, Backofen R. 2017. IntaRNA 2.0: Enhanced and customizable prediction of RNA-RNA interactions. Nucleic Acids Res 45:W435–W439. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Dremel SE, Didychuk AL. 2023. Better late than never: A unique strategy for late gene transcription in the beta- and gammaherpesviruses. Semin Cell Dev Biol 146:57–69. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Proudfoot NJ, Brownlee GG. 1976. 3′ Non-coding region sequences in eukaryotic messenger RNA. Nature 263:211–214. [DOI] [PubMed] [Google Scholar]
- 41.Vijayakumar A, Park A, Steitz JA. 2022. Modulation of mRNA 3′-End Processing and Transcription Termination in Virus-Infected Cells. Front Immunol 13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Tai-Schmiedel J, Karniely S, Lau B, Ezra A, Eliyahu E, Nachshon A, Kerr K, Suárez N, Schwartz M, Davison AJ, Stern-Ginossar N. 2020. Human cytomegalovirus long noncoding RNA4.9 regulates viral DNA replication. PLOS Pathog 16:e1008390. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Tai-Schmiedel J, Karniely S, Ezra A, Eliyahu E, Nachshon A, Winkler R, Schwartz M, Stern-Ginossar N. 2018. The virally encoded long non-coding RNA4.9 is controlling viral DNA replication, p. 2.32. In International Herpesvirus Workshop 2018. University of British Columbia, Vancouver. [Google Scholar]
- 44.Wang Y, Tang Q, Maul GG, Yuan Y. 2006. Kaposi’s sarcoma-associated herpesvirus ori-Lyt-dependent DNA replication: dual role of replication and transcription activator. J Virol 80:12171–86. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Helmrich A, Ballarino M, Tora L. 2011. Collisions between Replication and Transcription Complexes Cause Common Fragile Site Instability at the Longest Human Genes. Mol Cell 44:966–977. [DOI] [PubMed] [Google Scholar]
- 46.Purushothaman P, Uppal T, Verma SC. 2015. Molecular biology of KSHV lytic reactivation. Viruses 7:116–53. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Shekhar R, O’Grady T, Keil N, Feswick A, Amador DAM, Tibbetts SA, Flemington EK, Renne R. 2024. High-density resolution of the Kaposi’s sarcoma associated herpesvirus transcriptome identifies novel transcript isoforms generated by long-range transcription and alternative splicing. Nucleic Acids Res 52:7720–7739. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Allen RD, Dickerson S, Speck SH. 2006. Identification of Spliced Gammaherpesvirus 68 LANA and v-Cyclin Transcripts and Analysis of Their Expression In Vivo during Latent Infection. J Virol 80:2055–2062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Dauber B, Saffran HA, Smiley JR. 2019. The herpes simplex virus host shutoff (vhs) RNase limits accumulation of double stranded RNA in infected cells: Evidence for accelerated decay of duplex RNA. PLOS Pathog 15:e1008111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Boldogköi Z. 2012. Transcriptional interference networks coordinate the expression of functionally related genes clustered in the same genomic loci. Front Genet 3:1–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Sberna G, Maggi F, Amendola A. 2023. Virus-Encoded Circular RNAs: Role and Significance in Viral Infections. Int J Mol Sci 24:16547. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Zhang X, Liang Z, Wang C, Shen Z, Sun S, Gong C, Hu X. 2022. Viral Circular RNAs and Their Possible Roles in Virus-Host Interaction. Front Immunol 13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Staudt MR, Dittmer DP. 2007. The Rta/Orf50 transactivator proteins of the gamma-herpesviridae. Curr Top Microbiol Immunol 312:71–100. [DOI] [PubMed] [Google Scholar]
- 54.Combs LR, Spires LM, Alonso JD, Papp B, Toth Z. 2022. KSHV RTA Induces Degradation of the Host Transcription Repressor ID2 To Promote the Viral Lytic Cycle. J Virol 96:e0010122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Spires LM, Wind E, Papp B, Toth Z. 2023. KSHV RTA utilizes the host E3 ubiquitin ligase complex RNF20/40 to drive lytic reactivation. J Virol 97. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The sequencing datasets generated in this study are available at the European Nucleotide Archive under the accession: PRJEB80811 and link https://www.ebi.ac.uk/ena/browser/view/PRJEB80811.