SUMMARY
Mapping host-pathogen interactions has proven instrumental for understanding how viruses manipulate host machinery and how numerous cellular processes are regulated. DNA viruses such as herpesviruses have relatively large coding capacity and thus can target an extensive network of cellular proteins. To identify the host proteins hijacked by this pathogen, we systematically affinity tagged and purified all 89 proteins of Kaposi’s sarcoma-associated herpesvirus (KSHV) from human cells. Mass spectrometry of this material identified over 500 virus-host interactions. KSHV causes AIDS-associated cancers and its interaction network is enriched for proteins linked to cancer and overlaps with proteins that are also targeted by HIV-1. We found that the conserved KSHV protein ORF24 binds to RNA polymerase II and brings it to viral late promoters by mimicking and replacing cellular TATA-box-binding protein (TBP). This is required for herpesviral late gene expression, a complex and poorly understood phase of the viral lifecycle.
INTRODUCTION
Viruses reshape the intracellular environment during infection, both to co-opt processes necessary for viral amplification and to subvert antiviral defenses. Studies of virus-host interactions have thus provided a wealth of insight into host biology, including how the manipulation of specific pathways can contribute to disease. Due to genome size constraints, viral proteins are generally multifunctional and have evolved to target diverse cellular machinery. The number of interactions coordinated by individual viral proteomes is therefore anticipated to be substantial, as indicated by recent high throughput proteomics analyses of virus-host protein-protein interactions (PPIs) in mammalian cells (Pichlmair et al., 2012; Rozenblatt-Rosen et al., 2012). Systems-level analyses can also reveal infection-linked patterns within cells, as well as pathways or machinery that serve as hubs for viral perturbation (Hirsch, 2010; Navratil et al., 2011).
The first comprehensive analyses of protein complexes hijacked by viruses in mammalian cells were recently documented for the RNA viruses human immunodeficiency virus 1 (HIV-1) and hepatitis C virus (HCV) using affinity tag/purification mass spectrometry (AP-MS) (Germain et al., 2013; Jager et al., 2011a). Similar systematic mass spectrometry-based approaches have yet to be applied to DNA viruses, although a number of binary interaction screens using yeast-2-hybrid assays have been reported (Calderwood et al., 2007; Lee et al., 2011; Rajagopala et al., 2011; Uetz et al., 2006). DNA viruses can have significantly greater coding capacity relative to their RNA virus counterparts and generally exhibit genome amplification and gene expression strategies that more closely mimic those of the host.
Herpesviruses are among the largest mammalian DNA viruses identified to date, encoding 70 to over 230 proteins. Divided into three subfamilies (α, β, and γ), herpesvirus infections have diverse pathogenic outcomes that are frequently serious in immunocompromised individuals. For example, the majority of lethal AIDS-associated cancers are caused by human γ-herpesviruses, including Kaposi’s sarcoma (KS). The etiologic agent of KS is a γ-herpesvirus termed Kaposi’s sarcoma-associated herpesvirus (KSHV), which is also associated with the B cell lymphoproliferative disorders multicentric Castleman’s disease and primary effusion lymphoma.
The KSHV life cycle is divided into lytic and latent transcriptional programs. Latency is the stage primarily linked to neoplastic disease, as the restricted subset of viral genes expressed during this phase generally manipulate growth regulatory pathways. All viral proteins are expressed during lytic replication, which is when progeny virion production occurs. Both lytic and latent KSHV infection result in broad changes in cellular metabolism and gene expression. KSHV encodes an estimated 89 proteins, including immune modulators and signaling proteins that have been ‘pirated’ from the host, as well as proteins broadly conserved within the herpesvirus family involved in viral replication. However, the majority of KSHV-encoded proteins remain uncharacterized with a relatively small number of PPIs identified.
Here, we sought to gain a global perspective on how a large DNA virus interfaces with its host by assembling a PPI network for KSHV proteins in human cells. This network is the largest host-pathogen interactome constructed to date, as well as the first comprehensive PPI map for a DNA virus in mammalian cells. We use it to study a virus-human hybrid transcription pre-initiation complex (PIC) with an essential role in directing viral late gene expression. This PIC incorporates functional mimicry of the human TATA-box-binding protein (TBP) with direct recruitment of cellular RNA polymerase II, suggesting a system that merges principles underlying both eukaryotic and prokaryotic transcriptional regulation.
RESULTS
Assembly of the KSHV-human interactome
To systematically construct a comprehensive interaction network map for KSHV, we cloned each of the 89 KSHV open reading frames (ORFs) from infected cells and fused them to a strep epitope affinity tag. Protein expression was confirmed by Western blotting with strep antibodies (data not shown). Where antibodies were available, we also compared the expression level of transfected viral ORFs to the endogenous version expressed during lytic KSHV infection (Figure S1A). The levels were generally comparable, although the level of some ORFs (e.g. K8.1) was significantly higher in the 293T cells, which could impact the number of interactions detected. To assemble the network, constructs were individually transfected in a minimum of three biological replicates into 293T cells and viral ORFs together with their associated protein complexes were affinity purified (AP) over Strep-Tactin beads and analyzed by mass spectrometry (MS) (Figure 1A). The resulting data set, assembled from over 500 AP-MS runs, contained a total of 50,835 interactions (Table S1).
Figure 1. Assembly and broad characterization of the KSHV-human interactome.
(A) Summary of the workflow used for assembly of the PPI network.
(B) The MiST feature weights and score threshold were subjected to an exhaustive parameter grid search and the optimal values were empirically determined by maximizing the area under the curve defined by the True Positive versus False Positive rate. The depicted receiver operating characteristic (ROC) curve illustrates MiST prediction accuracy with optimal feature weights for specificity (0.5) and reproducibility (0.5) with the curve inflection point marked with an x. The ROC curve was used to help define an appropriate MiST threshold of 0.7. This corresponded to 564 high confidence interactions (see also Tables S1 and S2).
(C) Venn diagram showing the overlap between previously reported KSHV-host interactions and the high confidence PPI network. The bar graph shows the number of host proteins associating with each viral protein, ranked in descending order, with the previously reported interactions noted in red (see also Table S2).
(D–E) Host genes that interact with viral proteins unique to KSHV show significantly elevated signs of recent selection. Shown are the distributions of mean differences from background across 10,000 bootstrap samples. Signatures of selection are measured using a mammalian based dN/dS statistic for ancient selection (D), and the iHS statistic based on the 1000 Genomes Project for recent selection (E).
(F–G) Bar graphs showing the percentage of all human proteins, versus the subset that interact with KSHV lytic or latent proteins, that are associated with cancer (F) or non-cancer diseases (G) in the DisGeNET database. * Indicates a p-value < 0.05 and ** indicates a p-value < 0.005 as determined by the hyper-geometric test (see also Figure S1 and Table S3).
(H) Venn diagram depicting the overlap between the high confidence HEK293T interaction partners of KSHV and HIV-1. The interaction partners are broken out into the number of cellular proteins in each category. The p-value was determined by the hyper-geometric test (see also Table S4).
A set of high confidence PPIs was established by analyzing all interactions using the mass spectrometry interaction statistics (MiST) scoring algorithm, which we designed for scoring AP-MS derived host-pathogen PPIs (Jager et al., 2011a; Verschueren et al., 2014). MiST scores are a weighted combination of the specificity, reproducibility, and abundance features of each detected interaction. We trained the feature weights and validated our prediction accuracy on a set of published interactions to establish a MiST score of ≥ 0.7 for our high confidence interaction set (Figure 1B, S1B–C). A total of 21 of the 28 published interactions detected (75%, 14 trained and 7 in the validation group, p = 7.32 × 10−19) had MiST scores ≥ 0.7, demonstrating the ability of MiST to enrich for biologically relevant interactions. When applied to all detected PPIs, MiST scoring prioritized 564 KSHV-human interactions, which we refer to as the ‘high confidence’ data set (Table S1). Of the 89 KSHV proteins tested, 67 displayed high confidence interactions with human proteins (Figure 1C). Our stringent cutoff likely excluded some biologically relevant interactions, however, the significant majority of known interactions in the data set were maintained and further revealed a multitude of novel interactions with interconnected human complexes. Indeed, comparison of this set with published KSHV-human interactions curated from the literature revealed 44% overlap (28 out of 63 published PPIs) (Table S2). We also independently validated 16 out of an attempted 28 high confidence interactions through IP and Western blotting (Table S1). While some of the interactions that failed to validate may represent false positives, it is possible that others are non-binary interactions that require a more limiting endogenous bridging factor.
We next looked for patterns in the KSHV interaction partners by grouping the viral proteins according to their conservation within the α-, β-, and γ-herpesviral subfamilies and analyzing the evolutionary selection profile of the interacting cellular proteins. We made the assumption that the degree of conservation of viral proteins between subfamilies reflects in part the evolutionary age of the protein, with proteins conserved amongst the herpesviruses likely being “older”, while those unique to γ-herpesviruses or KSHV likely being “younger”. We found that interactors of the more conserved viral proteins showed strong signals of ancient selection as measured by dN/dS ratios across mammals (Figure 1D). In contrast, interactors of KSHV-specific viral proteins had strong signals of recent human-specific selection as measured by iHS (Voight et al., 2006) using data from the 1000 Genomes Project (Genomes Project et al., 2010), exhibiting an average increase in positive selection of 33% above background across bootstrap samples. (Figure 1E). Thus, there is a dichotomy of natural selection acting broadly across mammals for interactors of ancient herpesvirus proteins compared to the human-specific signatures of selection for interactors of viral proteins exclusive to KSHV. This is suggestive of a longstanding evolutionary interplay between herpesviruses and their hosts that continues to shape mammalian genomes.
A biologically relevant interaction network should be enriched for proteins linked to KSHV-induced diseases. The viral latency factors are the primary drivers of KSHV-induced cancers, although lytic cycle proteins provide paracrine signals to enhance the tumor microenvironment (Mesri et al., 2014). We therefore looked for overlap between the PPI networks of the lytic or latent proteins and disease associated cellular proteins annotated in the DisGeNET (Bauer-Mehren et al., 2011) (Table S3) or cBioPortal (TCGA) databases. Indeed, the interaction partners of latent proteins were significantly enriched for host factors associated with cancer (Figures 1D and S1D–E), whereas partners of lytic proteins were instead enriched in non-cancer disease associations (Figure 1E and S2). This correlation suggests that a number of KSHV proteins target host factors whose disruption may contribute to disease.
The KSHV and HIV-1 proteins have significant overlap in their interaction partners
Epidemic KS is closely associated with HIV/AIDS patients, and we recently assembled an HIV-1-human interactome using the same pipeline described herein (Jager et al., 2011a). Notably there was a significant degree of overlap in high confidence interactors identified in both sets, with nearly all HIV-1 proteins and 22 KSHV proteins interacting with a common group of 52 cellular proteins (Figure 1F, Table S4). These overlapping proteins may be participants in cellular processes broadly required for viral amplification, or proteins selectively contributing to the KSHV-HIV-1 interaction.
The KSHV interactome as a tool for functional prediction
The high confidence KSHV-human PPI network comprised 564 interactions between 67 viral proteins and 556 human proteins (Figure 2). When annotated to include known human-human interactions either within the network of individual viral proteins (Figure 2) or the network as a whole (Figure S2 and Table S5), enrichment for components of many distinct host complexes emerged. A primary goal was to use these data to predict new protein functions, as the majority of KSHV ORFs remain uncharacterized. We therefore performed enrichment tests for gene ontology (GO) terms associated with the set of PPIs for each of the KSHV proteins (Figure S3A and Table S6). We also analyzed the individual viral PPI networks for enrichment of specific protein domains to identify functions not well annotated in the GO database (Figure S3B). The enrichment analyses accurately assigned functions to many of the characterized KSHV proteins, suggested new additional functional groupings for several of these proteins, as well as enabled us to link many of the uncharacterized KSHV proteins to specific pathways or processes (Figure 3 and Table S7).
Figure 2. Network representation of the KSHV-host interactome.
The full, high confidence interaction network contains 67 KSHV proteins (gold nodes) and 556 cellular proteins (blue nodes). Black edges indicate known interactions between cellular proteins associated with each individual viral protein. A subset of the identified complexes is labeled. NADH UOR: ubiquinone oxidoreductase; TFIIIC: transcription factor for polymerase III C; SSB repair: single-stranded break repair; 5FMC: 5 friends of methylated Chtop; CDK: cyclin dependent kinase; RSKs: ribosomal s6 kinases; IKK: IκB kinase complex; RPA: replication protein A complex; RNA Pol II: RNA polymerase II; CPSF: cleavage and polyadenylation specificity factor complex; APC: anaphase-promoting complex; CTLH: C-terminal to LisH motif complex (see also Figure S2 and Table S5).
Figure 3. Using the KSHV interactome to predict viral protein functions.
Summary of predicted functions of KSHV proteins, derived from GO term and protein domain enrichments within the set of high confidence PPIs associated with each viral protein (see also Figure S3 and Tables S6 and S7).
ORF24 interaction with RNA Pol II is essential for KSHV late gene expression
One of the most striking functional predictions was the link between the viral early lytic cycle protein ORF24 and transcription. Though unstudied in KSHV, the ORF24 orthologs in γ- and β-herpesviruses are required for the expression of viral late genes, which are transcribed only after the onset of viral genome replication (Gruffat et al., 2012; Isomura et al., 2011; Wong et al., 2007). In the high confidence data set, exogenously expressed ORF24 co-purified 10 of the 13 human RNA polymerase II (Pol II) subunits (an additional 2 subunits had MiST scores above 0.6), and also with multiple Pol II subunits in lytically reactivated KSHV-positive iSLK.219 cells (Figure 4A). We engineered a FLAG tag at the N-terminus of ORF24 within the KSHV genome (KSHV.FLAG.24), and found that Pol II also interacted with this endogenous ORF24 expressed from its native promoter (Figure 4B). Pol II subunits were not found in any of the other KSHV protein high confidence networks, including well-characterized KSHV transcriptional factors (ORF50 and K8α) (Figure S4A and Table S1). ORF24 specifically bound the lower mobility, hypophosphorylated form of the RPB1 (POLR2A) subunit of Pol II and not the form phosphorylated at serine 2, 5, or 7 of the RPB1 C-terminal domain (CTD), consistent with a potential role in PIC assembly (Figure 4C). The interaction was DNA-independent, suggesting that it may occur prior to Pol II recruitment to promoters (Figure S4B). Finally, we observed that Pol II binding, as measured both by AP-MS and IP-Western blotting, was a feature conserved among ORF24 orthologs in human cytomegalovirus (HCMV; UL87), Epstein-Barr virus (EBV; BcRF1), and murine γ-herpesvirus 68 (MHV68; mORF24) (Figure 4D, S4C–F).
Figure 4. KSHV ORF24 binds human RNA polymerase II to drive viral late gene expression.
(A) Network showing the high confidence interaction partners of ORF24 identified in 293T cells (all blue nodes), and those also identified in KSHV infected iSLK.219 cells (blue nodes with heavy black borders). Pol II subunits are depicted to the left of the ORF24 node while Pol II accessory factors are shown to the right. Grey lines indicate known interactions between human proteins.
(B) Lysates of iSLK cells either latently (− doxycycline) or lytically (+ doxycycline) infected with KSHV.FLAG.24 were subjected to α-FLAG IP and Western blotted for the RPB1 subunit of Pol II. GAPDH was used as a loading and IP specificity control. In this and all subsequent IPs, input represents 5% of lysate used for IP.
(C) Lytically induced iSLK cells infected with KSHV.WT were transduced with FLAG-tagged ORF24, and lysates were subjected to α-FLAG IP and Western blotted using antibodies against total RPB1 (Pol II) or the hyperphosphorylated forms of Pol II.
(D) Heat map representing the MiST scores for Pol II subunits detected in association with the ORF24 orthologs from KSHV (ORF24), MHV68 (mORF24), EBV (BcRF1), and CMV (UL87) via AP-MS from 293T cells.
(E) 293T cells were transfected with plasmids expressing FLAG-tagged full length (FL) ORF24 or fragments of ORF24 encompassing the N-terminus (aa 1-401), central domain (aa 402-603), or C-terminus (aa 604-752). Cell lysates were subjected to α-FLAG IP and Western blotted for the RPB1 subunit of Pol II.
(F) A Clustal Omega multiple sequence alignment for the ORF24 orthologs showing the stretch of 5 conserved residues within their N-termini.
(G) 293T cells were transfected with plasmids expressing FLAG-tagged WT ORF24 or an ORF24 mutant in which the conserved ‘RLLLG’ motif was mutated to ‘RAAAG’ or ‘AAAAA’. Lysates were subjected to α-FLAG IP and Western blotted with the indicated antibodies.
(H) Heat map representing the MiST scores for Pol II subunits detected in association with WT ORF24 (ORF24) versus the ORF24RAAAG mutant via AP-MS from 293T cells.
(I–J) iSLK cells infected with WT KSHV or KSHV lacking ORF24 (KSHV.24.Stop) were transduced with a retroviral vector expressing either WT ORF24 or the indicated ORF24 Pol II binding mutants. Forty-eight hr following lytic reactivation, accumulation of the ORF52 late gene (I) or the ORF38 early gene (J) was measured by RT-qPCR and normalized to levels of 18S, with the viral mRNA levels present during WT infection set to 1. * Indicates a p-value < 0.05 and ** indicates a p-value < 0.005 as determined by Student’s t-test (see also Figure S4).
KSHV ORF24 is a 752 aa protein with a central 209 aa domain that is highly conserved amongst its orthologs and is flanked by more divergent N- and C-terminal regions. Truncation mutants of KSHV ORF24 encompassing each of these domains demonstrated that the Pol II interaction was mediated exclusively through the N-terminus (aa 1-401) (Figure 4E). Although the amino acid sequence of the N-terminal domain is poorly conserved, it does contain an invariant 5 aa stretch (Figure 4F). Mutation of these 5 aa in KSHV ORF24 to alanines (ORF24AAAAA), or just the 3 leucines to alanines (ORF24RAAAG) completely abrogated the interaction with Pol II (Figure 4G–H).
Similar to previous observations in EBV, CMV, and MHV68 (Gruffat et al., 2012; Isomura et al., 2011; Wong et al., 2007), introduction of a stop codon in the KSHV ORF24 locus (KSHV.24.Stop) impaired late gene expression but did not impact early gene expression or viral DNA replication (Figure S4G–H, S5A). As expected, repair of the stop mutation in the virus or complementation with wild type (WT) ORF24 restored late gene expression (Figure S4G, S5A). When WT or Pol II binding mutants of ORF24 were exogenously expressed in cells infected with KSHV.24.Stop (Figure S4I), expression of WT ORF24, but not the Pol II binding mutants, restored expression of both the ORF52 (Figure 4I) and other late genes (Figure S5A) but did not impact expression of the ORF38 early gene (Figure 4J). Furthermore, cells complemented with the ORF24 Pol II binding mutants produced no detectable infectious virions in supernatant transfer assays (Figure S5B). Thus, the ORF24-Pol II interaction is essential for the selective activation of late genes and completion of the viral lifecycle.
ORF24 functionally resembles cellular TBP
Mapped late gene promoters are strikingly minimal, as they encompass only 12–15 bp in total and are comprised of little more than a core TATA-like element (TATT) (Homa et al., 1988; Tang et al., 2004; Wong-Ho et al., 2014). The central conserved domain in the ORF24 orthologs was predicted to adopt a TBP-like secondary structure, although the proteins lack significant sequence homology (Wyrwicz and Rychlewski, 2007). Similar to the EBV and MHV68 ORF24 orthologs (Gruffat et al., 2012; Wong-Ho et al., 2014), purified KSHV ORF24 bound to a TATT-containing KSHV K8.1 late gene promoter sequence in DNA gel shift assays, but failed to bind a size-matched segment from the KSHV ORF57 early gene promoter containing a canonical TATA box (Figure 5A). K8.1 promoter binding was dependent on its core TATT sequence, as mutating these bases to CCCC abrogated the interaction (Figure 5A, S5C).
Figure 5. ORF24 binds late gene promoter DNA in a manner essential for late gene expression.
(A) Purified WT ORF24 or ORF24 putative DNA-binding mutants (ORF24N425A, N427A and ORF24N518A, N520A) (0–280 nM) were subjected to EMSA with 23 nt radiolabeled dsDNA probes encompassing the K8.1 late promoter, which contains a central ‘TATT’ sequence, or the ORF57 early promoter, which contains a central ‘TATA’ sequence. Where indicated, the ‘TATT’ motif in the K8.1 promoter probe was mutated to ‘CCCC’. The specificity of the interaction was confirmed by competition with unlabeled (cold) probe.
(B) 293T cells were transfected with plasmids expressing strep-tagged WT ORF24 or the indicated ORF24 DNA-binding mutants. Lysates were subjected to α-strep IP and Western blotted for the RPB1 subunit of Pol II. Tubulin was used as a loading and IP specificity control.
(C–D) iSLK cells infected with WT KSHV or KSHV.24.Stop were transduced with a retroviral vector expressing WT ORF24 or the indicated ORF24 DNA-binding mutants. Forty-eight hr following lytic reactivation, accumulation of the ORF52 late gene (C) or the ORF38 early gene (D) was measured by RT-qPCR and normalized to levels of 18S, with the viral mRNA levels present during WT infection set to 1. * Indicates a p-value < 0.05 as determined by Student’s t-test (see also Figure S5).
TBP has asparagine and valine residues within its inner ‘saddle’ that contact DNA and contribute to its specificity for the TATA sequence (Kim et al., 1993). Mutation of these positionally conserved residues in KSHV ORF24 (ORF24N425A, N427A and ORF24N518A, N520A) did not impair Pol II binding, but abrogated the interaction with the K8.1 late gene promoter as well as the ability to rescue late gene expression and virion production during infection with the KSHV.24.Stop virus (Figure 5A–C, S5A–B). Thus, KSHV ORF24 is a modular protein containing at least two essential but genetically separable domains required to drive selective transcription from minimalistic viral late promoters.
ORF24 replaces TBP in the viral late gene pre-initiation complex
TBP and TBP associated factors (TAFs) collectively comprise TFIID, which binds the core promoter and serves as a platform for the recruitment of other PIC components. Our data suggested, however, that Pol II might be recruited independently of TFIID to viral late promoters by ORF24.
In agreement with our in vitro binding results, chromatin immunoprecipitation (ChIP) assays showed FLAG-ORF24 bound to the K8.1 late gene promoter but not the ORF57 early gene promoter in lytically reactivated iSLK cells infected with KSHV.FLAG.24 virus (Figure 6A). In contrast, TBP was present at the ORF57 early promoter but not detectable at the K8.1 late promoter (Figure 6A). Unlike Pol II, both TBP and ORF24 remained at their associated promoter (Figure 6A). The apparent mutually exclusive binding of ORF24 and TBP suggests that ORF24 acts as a replacement for TBP on late gene promoters.
Figure 6. ORF24 functionally replaces TBP on the late promoter.
(A–B) Chromatin from iSLK cells lytically infected with KSHV.FLAG.24 was isolated and subjected to ChIP using antibodies against FLAG (to precipitate FLAG ORF24), TBP, and Pol II (A), or TFIIB and TFIIH (ERCC3) (B). Co-precipitating DNA was detected by PCR using radiolabeled nucleotides and primers specific for the indicated promoter or gene body. Experiments were repeated at least three times and a representative image is presented with the ChIPs quantified against input samples (see also Figure S6).
Cellular TBP does not bind Pol II directly; its interaction with Pol II at the PIC is instead bridged by TFIIB and other general transcription factors (GTFs) (Bushnell et al., 2004). However, these additional PIC components were absent from the KSHV ORF24 MS data, a finding we independently confirmed by Western blotting for TBP, TFIIB, TFIIE, TFIIF, and TFIIH in KSHV.FLAG.24 immunoprecipitates (Figure S6A). Thus, either the recruitment of Pol II to late promoters by ORF24 bypasses the need for additional GTFs or, alternatively, GTF assembly into the late promoter PIC occurs in a TBP-independent manner. We addressed these possibilities by measuring the in vivo promoter occupancy of the GTFs TFIIB and TFIIH (ERCC3), two canonical components of PICs found at RNA Pol II transcribed promoters (Figure 6B). Despite the absence of TBP, ChIP assays revealed assembly of both TFIIH and TFIIB on the K8.1 late gene promoter (Figure 6B). TFIIB occupancy at the K8.1 promoter was lost in the KSHV.24.Stop virus, confirming that recruitment of at least a subset of GTFs to this promoter is ORF24 dependent (Figure S6B). Thus, the KSHV ORF24 protein nucleates assembly of a PIC that bypasses the need for TBP but is nonetheless structured to enable subsequent recruitment of other core mammalian transcription factors.
DISCUSSION
Here, we present the largest whole virus-host interactome defined to date. Host genes that interact with more highly conserved viral proteins show significantly elevated rates of positive selection across mammals. This effect diminishes for the more recently evolved KSHV-specific viral proteins, consistent with restricted contact across the expanses of the host-pathogen phylogenetic trees. However, the interactors of these KSHV-specific proteins show a dramatic increase in signatures of recent natural selection within human populations. We therefore speculate that KSHV may have contributed to shaping patterns of human genetic variation over the last 10,000–30,000 years.
There is significant overlap between the virus-host interactomes of KSHV and HIV-1, with high confidence targets of nearly every HIV-1 protein present in the overlapping PPI set. This overlap is notable, because although these two viruses are pathologically linked (particularly in the case of KSHV-induced diseases), they have distinct life cycles and replication strategies. As more systematic interaction data sets emerge, it will be of interest to examine whether any of these overlapping partners are present within additional virus-host interactomes. In this regard, core host pathways that are reiteratively targeted by invading pathogens could generate signals akin to ‘patterns of pathogenesis’ that are recognized by the innate immune system as markers of viral infection (Vance et al., 2009).
There are several strengths in our systematic approach that lead to a robust PPI network. First is our ability to identify reproducible and specific interactions, which should serve as the basis for future hypothesis-driven studies to probe their role in the KSHV life cycle. We demonstrated that the PPI network could accurately predict known protein functions and, in the case of viral latency factors, was enriched for proteins linked to cancer. Our standardized pipeline also eliminates the inherent difficulties in cross-comparisons of individual PPI experiments. That said, the network likely underestimates the number of interactions coordinated by KSHV. It does not capture PPIs driven by complexes comprised of multiple viral proteins and excludes some biologically relevant interactions that were detected but de-prioritized by MiST due to low specificity.
Directed by the interactome, we gained insight into a poorly understood stage of γ-herpesvirus gene expression (Figure 7). Late genes of all DNA viruses become transcriptionally active only after the onset of viral DNA replication and produce proteins necessary for progeny virion assembly. Foundational work in related herpesviruses has revealed a complex of six viral proteins to be essential for late gene expression and recruitment of Pol II (Arumugaswami et al., 2006; Aubry et al., 2014; Jia et al., 2004; Perng et al., 2014; Wong et al., 2007; Wu et al., 2009). Although the precise roles of each of these factors remain unknown, the ORF24 orthologs in related herpesviruses have been shown to exhibit sequence-specific DNA binding activity and display predicted structural homology to TBP (Gruffat et al., 2012; Wyrwicz and Rychlewski, 2007). We find that ORF24 but no other KSHV proteins binds Pol II, indicating that it is the late gene transcription factor that mediates polymerase recruitment to KSHV late promoters. Furthermore, the observation that ORF24 but not TBP occupies the KSHV K8.1 late promoter in vivo suggests that ORF24 functionally replaces TBP specifically at late gene PICs. This interaction occurs through the atypical TATA box (TATT) characteristic of late promoters and requires ORF24 residues that are spatially and functionally similar to the DNA binding residues of TBP. These data support the hypothesis that KSHV ORF24 and its orthologs function as the first described viral TBP mimics (Wyrwicz and Rychlewski, 2007), orchestrating the assembly of a novel virus-host hybrid transcription complex.
Figure 7. Model for ORF24 activation of late gene promoters.
ORF24 interacts with Pol II through conserved leucines in its N-terminal domain. The polymerase is then brought to the late promoter TATT box via the ORF24 TBP-like domain. TBP is excluded from late promoters, and instead ORF24 binding nucleates assembly of the PIC by enabling recruitment of other cellular (GTFs).
TBP serves as a critical nucleation factor for PIC assembly, including at promoters lacking a TATA box (Thomas and Chiang, 2006). However, unlike ORF24, TBP does not directly bind Pol II; the polymerase is instead brought to the TBP-bound promoter through interactions with TFIIB and additional transcription associated factors (Murakami et al., 2013). The low degree of sequence homology between ORF24 and TBP makes it unlikely that ORF24 is able to associate with a similar cohort of TAFs and GTFs, a notion supported by the selectivity of the ORF24-Pol II interaction. Thus, cellular GTF recruitment to late promoters may instead occur through other viral or cellular proteins that perhaps join the ORF24-bound DNA. ORF24-directed recruitment of Pol II could also enable late genes to bypass select regulatory factors required for canonical eukaryotic promoter activation.
While proteins from other viruses have been reported to interact with Pol II, ORF24 is unique in its ability to coordinately bind Pol II and promoter DNA (Dorjsuren et al., 1998; Engelhardt et al., 2005; Mavankal et al., 1996; Takramah et al., 2003; Zhou and Knipe, 2002). This is reminiscent of prokaryotic sigma factors, which compete to bind the RNA polymerase holoenzyme and are required for selective promoter recognition and initiation (Osterberg et al., 2011). It is possible that herpesviruses have evolved an analogous strategy to secure sufficient levels of Pol II for strong late gene expression late in infection, a time when the cell is stressed and perhaps constrained for resources. DNA viruses likely display significant variety in PIC composition, including in the core GTFs as has been shown for some cell cycle-regulated promoters (Guglielmi et al., 2013). Future studies of the regulation of late genes are therefore anticipated to uncover other novel modes of transcriptional regulation with parallels in mammalian cells.
EXPERIMENTAL PROCEDURES
Plasmid construction
A library of strep-tagged KSHV ORFs was assembled from RNA from TREx BCBL-1-RTA cells reactivated for 36 hr with TPA (20 ng/mL), ionomycin (500 ng/mL) and doxycycline (1 μg/mL). ORF primers were designed based on sequences from the European Nucleotide Archive and GenBank. Inserts were ligated into pcDNA 4/TO (Invitrogen) modified to include a C-terminal 2x strep tag. Additional cloning and primers including restriction sites for all ORFs cloned in this study are listed in Extended Experimental Procedures.
Cell culture and Transductions
HEK293T (ATCC), Phoenix 293 retroviral packaging cells (Swift et al., 2001), the renal carcinoma cell line iSLK puro (Sturzl et al., 2013) bearing a dox-inducible version of the major KSHV lytic transactivator RTA, KSHV positive iSLK.219 cells (Myoung and Ganem, 2011), and other iSLK puro-derived cell lines were maintained in Dulbecco’s modified Eagle medium (DMEM; Invitrogen) with 10% fetal bovine serum (FBS; Invitrogen, HyClone) and penicillin-streptomycin (Invitrogen). The BJAB B cell line was maintained in RPMI (Invitrogen) medium with 10% FBS and penicillin-streptomycin. Stable cell lines expressing tagged viral ORF24 for affinity purification or ORF24 complementation assays were generated by retroviral transduction.
Affinity purification and Western blotting
Affinity purification of proteins to be analyzed by MS was performed as previously described (Jager et al., 2011a; Jager et al., 2011b). Briefly, clarified cell lysate was incubated with pre-washed Strep-Tactin beads (IBA) or FLAG beads (Sigma) and allowed to bind for 1–3 hr. Following purification, complexes bound to beads were washed and then eluted with desthiobiotin (IBA) or 3X FLAG peptide (Elim BioPharm) for strep or FLAG tags, respectively.
Mass spectrometry
Eluates were processed, trypsin digested, and concentrated for LC-MS/MS. Digested peptide mixtures were analyzed by LC-MS/MS on a Thermo Scientific Velos Pro ion trap mass spectrometry system equipped with a Proxeon Easy nLC II high pressure liquid chromatography and autosampler system.
Network scoring and benchmarking
Weights of the three MiST features were set by supervised training using two thirds of the known KSHV-host interactions detected in our screen as a positive set (19/28), 100 times larger random subsets of the interactome as negative sets, and the remaining third of known interactions as a validation set (9/28). An exhaustive grid search was performed on the 4-dimensional vector described by the 3 weight variables and a threshold variable. The domain of these variables was limited to between 0 and 1, with 0.01 increments. The sum of the three variables was constrained to 1. For every complete assignment of the vector, the True Positive Rate (TPR) and False Positive Rate (FPR) in the set of MiST scores higher than the threshold were computed against the high confidence training and validation set or the negative set, respectively, together with the AUC for these rates. The feature weight values that gave rise to a maximal AUC, low FPR, and maximal TPR on the validation set were selected as the final MiST weights for abundance, reproducibility, and specificity. Network representations were plotted using Cytoscape, v. 2.8.3 (Smoot et al., 2011).
Evolutionary analysis
Human proteins interacting with only one KSHV protein were binned according to the classification of their viral binding partner within the herpesviral subfamilies. Within these bins of human proteins, differences in evolutionary pressures compared to a random background of MS-observable human proteins were assessed. Cross species selection (dN/dS) was assessed based on multiple sequence alignments of human, chimp, gorilla, orangutan, and rhesus macaque sequences constructed using MOSAIC (Maher and Hernandez, 2014). iHS scores were calculated by selscan (Szpiech and Hernandez, 2014), then summarized at the gene level by taking a linear combination of the log of the number of nominally significant z-scores and the maximum z-score within a given coding region. Weighting was chosen using principle components analysis.
Disease association analysis
All proteins in the reference proteome were labeled as ‘cancer-associated’, ‘other disease associated’ or ‘none’ based on their DisGeNET disease annotations. The reference proteome was divided in three sets: all proteins, latent KSHV ORF (K1, K2, K12A, K12B, K12C, ORF71, ORF72 and ORF73) interacting proteins or lytic KSHV ORFS (remaining). Using this partitioning, relative fractions of proteins with cancer-associations or other disease associations were computed and the significance of the observed counts was computed with the hyper-geometric test.
Ontology enrichment analysis of prey proteins
For each analysis, the list of identified preys was joined with the respective ontology mapping using Uniprot accession codes. Then, for each ontology term in this joined list, the average MiST score of all associated prey proteins was compared against a sampled distribution of averages for equally sized random subsets of the data. The probability that the observed term score was higher than the distributed sampled average was computed by the one-tailed cumulative normal distribution test. The set of enrichment p-values for all terms in the ontology mapping was adjusted for multiple hypotheses testing by applying the Benjamini-Hochbach correction (also known as FDR control or q-value). Terms that were significant at an FDR of 0.05 were reported as output.
KSHV Mutagenesis
KSHV mutants were engineered using KSHV BAC16 mutagenesis as previously described using a two-step scarless Red recombination system (Brulois et al., 2012). iSLK derived cells used in this study include a BAC with a premature stop codon in the ORF24 coding region (KSHV.24.Stop) and an N-terminally FLAG tagged ORF24 (KSHV.FLAG.24).
RT-qPCR
RNA isolated from the relevant lytically reactivated iSLK puro derived cells was treated with Turbo DNase (Ambion) and reverse transcribed with AMV RT (Promega) using random 9-mers or gene specific primers in cases of overlapping viral transcripts. cDNA levels were quantified using DyNAmo color flash SYBR green master mix, ROX passive reference dye (Thermo Scientific) and transcript specific primers. Transcript levels were normalized to 18s. Error bars represent the standard deviation of three independent experiments.
Chromatin immunoprecipitation
iSLK cells containing WT or mutant KSHV were induced with 1 μg/mL doxycycline for 36 hr before harvesting chromatin. ChIP was performed as described (Listerman et al., 2006) but using a Covaris focused sonicator to shear chromatin. After reversing crosslinks, DNA was purified and eluted in 150 μL of dH2O. Eluted DNA (0.5–1 μL) was subjected to 20–25 cycles of PCR in reactions spiked with α-32P dCTP. PCR products were resolved on urea PAGE gels, quantified using Image Lab (BioRad) software and normalized to 1% input.
Electromobility shift assays
Wild type or DNA binding mutants of 3X FLAG ORF24 were isolated from transduced BJABs in lysis buffer (100 mM HEPES, pH 7.9, 500 mM NaCl, 1% NP-40), affinity purified over FLAG beads, and eluted with 3X FLAG peptide. Protein purity was assessed by silver stain. Samples were dialyzed against 20 mM HEPES, pH 7.9, 40% glycerol, 100 mM KCl, 0.5 mM DDT and 0.5 mM PMSF. Shift buffer (1 μg/μL salmon sperm DNA, 0.5 mg/mL BSA, 0.05% triton-X 100, 10% glycerol, 10 mM HEPES pH 7.9, 40 mM KCl, 5 mM MgCl2 and 10 μM BME) was added to binding reactions containing 70–280 nM protein and 4.4 nM radiolabeled probe. Oligonucleotides were end-labeled with γ-32P ATP using T4 PNK (NEB), purified, and reverse complemented sequences were annealed together to make a double-stranded probe. Reactions were incubated at 37°C for 30 min and resolved by native PAGE.
Supplementary Material
Figure S1 (associated with Figure 1): Protein expression levels and KSHV latency protein interactors linked to cancer.
(A) Plasmids expressing strep-tagged KSHV ORFs were transfected into 293T cells and their expression level compared to that in unreactivated (no doxycycline) and reactivated (with doxycycline) KSHV.WT iSLK cells using the indicated antibodies against each viral protein. Viral proteins in 293T cells are slightly larger than in infected cells due to the strep tag.
(B) ROC curves comparing the optimal MiST feature weights to CompPASS (Sowa et al., 2009) and SAINT (Choi et al., 2011) scoring algorithms. The high-confidence data set (MiST ≥ 0.7) is indicated with an x.
(C) Viral and cellular protein pairs were mapped in three-dimensional space showing the reproducibility, abundance, and specificity features of the MiST score. Interaction pairs with MiST scores ≥ 0.7 are indicated in red.
(D) Bar graph showing the percentage of all human proteins, versus the subset that interact with KSHV lytic or latent proteins, that are associated with cancer using the cBioPortal (TCGA) cancer mutation dataset (Cerami et al., 2012). ** Indicates a p-value of < 0.005 as determined by the hyper-geometric test.
(E) Network of KSHV latency protein interactions showing enrichment for cellular proteins linked to cancer. The KSHV latency protein-human interactome is shown with viral proteins as gold nodes and associated cellular proteins as red (associated with cancer) or grey (not associated with cancer) nodes. Disease terms from the DisGeNET (Bauer-Mehren et al., 2011) database were manually collapsed to create curated terms for the network. Interactions between cellular proteins present in the interaction network for each individual viral protein are displayed using black edges. Host-host edges were manually curated from CORUM (Ruepp et al., 2010) and the literature.
Figure S2 (associated with Figure 2): Full high-confidence KSHV interactome network overlaid with all host-host connections and cellular disease-linked proteins.
The KSHV-human interactome is shown with viral proteins as gold nodes and associated cellular proteins as pie chart nodes, with the coloring indicating the association of that cellular protein with a particular disease (specified by the colors in the legend). Host proteins without disease association are shown in grey. Disease terms from the DisGeNET (Bauer-Mehren et al., 2011) database were manually collapsed to create curated terms for the network. All known interactions between cellular proteins present in the interactome are displayed using black and grey edges. Host-host edges were manually curated from CORUM (Ruepp et al., 2010) and the literature.
Figure S3 (associated with Figure 3): Gene ontology and domain enrichment for the KSHV interactome.
(A) Heat map depicting the enrichment of biological processes of cellular proteins interacting with KSHV baits via a GO term analysis. The color scale corresponds to the significance of the bait-term association q-values (p-values adjusted for FDR).
(B) Heat map representing the enrichment q-values for the domains found in the interacting proteins of KSHV.
Figure S4 (associated with Figure 4): ORF24 interacts with Pol II in a DNA-independent manner and the interaction is conserved across ORF24 orthologs.
(A) Pol II interaction is not a conserved feature of viral transcription factors. Lysates of 293T cells transfected with either empty vector or plasmids expressing the strep-tagged KSHV transcription factors K8α or ORF50 (RTA) were subjected to α-strep IP and Western blotted with antibodies against strep and Pol II. Input lanes represent 5% of the lysate used for IP.
(B) The interaction between ORF24 and Pol II is DNA independent. Lysates were prepared as described above, but were treated with DNase prior to α-FLAG IP. Input lanes represent 5% of the lysate used for IP. GAPDH was used as a loading and IP specificity control.
(C) Percent identity matrix for ORF24 orthologs.
(D–F) ORF24 orthologs from other herpesviruses all interact with Pol II. 293T cells were transfected with plasmids expressing FLAG mORF24 from MHV68 (E), FLAG BcRF1 from EBV (F), or strep UL87 from HCMV (G). The viral proteins were then subjected to IP with the indicated antibody and their interaction with Pol II was monitored by Western blotting with an antibody against RPB1. Input lanes represent 5% of the lysate used for IP. Where indicated, RPS6, GAPDH, and tubulin served as loading and IP specificity controls.
(G) Deletion of ORF24 from KSHV prevents expression of late but not early genes. Western blots of lysates from iSLK cells lytically infected with WT KSHV, KSHV bearing a stop codon at the 5′ end of ORF24 (KSHV.24.Stop), or a mutant rescue KSHV in which the stop mutation was repaired to WT (KSHV.MR). SOX (ORF37) is a KSHV protein expressed with early kinetics, whereas K8.1 is a KSHV late protein. Pol II serves as a loading control.
(H) ORF24 is not required for viral DNA amplification. qRT-PCR measurements of the DNA levels of the KSHV viral genome in iSLK cells containing WT KSHV, KSHV.24.Stop, or KSHV.MR 24 and 48 hr after induction into the lytic cycle.
(I) Lysates of the KSHV.24.Stop iSLK cell lines transduced with the indicated WT or mutant ORF24 for the complementation assay were Western blotted with anti-FLAG antibodies to confirm equivalent expression of the ORF24 proteins. Pol II serves as a loading control.
Figure S5 (associated with Figure 5): ORF24 mutants unable to bind either DNA or Pol II are defective for late gene expression and virion production.
(A) RNA harvested from lytically reactivated KSHV.WT and KSHV.24.Stop iSLK cells complemented with the indicated WT or mutant ORF24 was Northern blotted using strand specific primer probes against the late genes ORF8, ORF65, ORF25, K8.1, and ORF52, the ORF38 early gene, or 7SL (as a loading control).
(B) Both the DNA binding and Pol II binding activities of ORF24 are required for production of infectious progeny virions. KSHV.WT and KSHV.24.Stop iSLK cells complemented with WT ORF24, ORF24-Pol II binding mutants, or DNA-binding mutants were lytically reactivated for 48 hr and supernatant from these cells was collected and transferred to recipient 293T cells. After 24 hr, the recipient cells were monitored for GFP expression, as infectious virus constitutively expresses GFP. DNA is stained with DAPI (blue). A minimum of 30 fields were monitored for each experiment, with a single representative field shown. No GFP positive cells were detected in any field of recipient cells containing supernatant from infected cells lacking WT ORF24. Scale bar = 50 μm.
(C) Coomassie stained gel showing the purity of the ORF24 protein used in EMSA assays.
Figure S6 (associated with Figure 6): ORF24 interacts selectively with Pol II and is needed for late gene PIC assembly.
(A) Lysates of iSLK cells either latently (− doxycycline) or lytically (+ doxycycline) infected with KSHV.FLAG.ORF24 were subjected to α-FLAG IP and Western blotted with antibodies to the indicated general transcription factors or GAPDH as a control. Input lanes represent 5% of the lysate used for IP.
(B) Chromatin from iSLK cells lytically infected with KSHV.WT or KSHV.24.Stop was isolated and subjected to ChIP using antibodies against TFIIB. Co-precipitating DNA was detected by PCR using radiolabeled nucleotides and primers specific for the indicated promoter. Experiments were repeated at least three times and a representative image is presented with the ChIPs quantified against input samples.
Table S1: MS data for all proteins detected in the KSHV interactome and ROC data, and attempted interactome validations, related to Figures 1 and 2.
MiST-abundance, MiST-reproducibility, and MiST-specificity refer to the individual component scores for each feature. The MiST composite score is the weighted combination of the specificity and reproducibility scores. The unique peptide average is the sum of unique prey peptides across all pulled down baits, divided by the number of replicates. The “high confidence interactions” tab contains all interactors with MiST scores ≥0.7 while the “All detected interactions” tab contains all collected data regardless of score. In the “ROC data” tab are all of the data points for the ROC curves. RAS refers to the order of the Reproducibility, Abundance, and Specificity weights for each curve. Scores above threshold refers to the number of interactions with MiST scores at or above the listed MiST threshold value. The “validated interactions” tab refers to the attempted validations by co-IP done for high confidence interaction partners identified by the interactome. Positive interactions are indicated by +; the absence of a detected interaction by co-IP is indicated by −. Interaction partners were epitope-tagged expression plasmids unless otherwise indicated.
Table S2: Previously described interactions for KSHV viral proteins, related to Figure 1.
Prey proteins found in the high (H) confidence data set correspond to a MiST score of ≥0.7. Proteins found below the cutoff are labeled low (L) while proteins absent from the MS, regardless of score are labeled not detected (ND).
Table S3: Disease network dictionary, related to Figures 1, S1, and S2.
Disease terms from the DisGeNET database were manually collapsed to create curated terms for the Figures S1 and S2 networks. The first tab (Disease term dictionary) contains the dictionary and the second tab (Disease term summary) defines which disease terms were collapsed together to generate the new term.
Table S4: KSHV-HIV-1 high confidence overlapping proteins, related to Figure 1.
Cellular proteins interacting with at least one HIV-1 and KSHV protein are divided by functional category. The viral proteins interacting with the cellular protein are written after in parentheses with the KSHV protein(s) first in black and the HIV-1 protein(s) second in orange.
Table S5: Host-host edges in interactome network depictions, related to Figures 2 and S2.
Host-host edges depicted in Figure 2 were restricted to host protein pairs that interact with the same KSHV bait (listed in the “Hu-Hu interactions in Fig 2” tab). In Figure S2, all host-host edges, including host protein pairs that interact with different KSHV baits, were shown (listed in the “Hu-Hu interactions in Fig S2” tab).
Table S6: Gene ontology term curation, related to Figures 3 and S3.
Gene ontology terms from the GO biological processes database were manually collapsed to create curated terms for the heat map. The first tab (Dictionary for GO terms) contains the dictionary and the second tab (GO terms in dictionary) defines which processes were collapsed together to generate the term.
Table S7: Domain and GO term summary, related to Figures 3 and S3.
Cellular location refers to the location on the Figure 3B cell diagram. The ORFs to annotate in the various locations were determined by significant enrichment in the listed GO terms or domains.
Acknowledgments
We would like to thank the members of the Glaunsinger and Krogan labs and Laurent Coscoy for helpful discussions and critical reading of the manuscript. We would also like to thank Christopher Hann-Soden, Alexandre Mercier, Trever Greene, Andrew Birnberg, Matthew Gardner, Marta Gaglia and Edward Tang for help with cloning, cell culture, and mass spectrometry; Jeffrey Queisser for computational help; Sumit Chanda for ORFeome constructs.
This research was supported by NIH R01 grants CA160556 and CA136367 and CHRP IDEA award ID13-B-529 to BAG, NIH P50 GM082250, P50 GM081879, PO1 AI090935, PO1 AI091575, PO1 CA177322, U54 AI081680 and DARPA-10-93-Prophecy-PA-008 grants to NJK, R01HG007644 and a Sloan Foundation Research Fellowship to RDH, an Irving H. Wiesenfeld Fellowship to ZHD, a UCSF M. Lloyd Kozloff fellowship and NIH F31 CA180609-01 to MCM. NJK and BG are WM Keck Foundation Young Investigators.
Footnotes
AUTHOR CONTRIBUTIONS
ZD: Cloned and expression tested the KSHV ORF library, assisted with analysis of interactome dataset, performed all experiments related to KSHV ORF24. EV and JVD: Bioinformatics and statistical analysis of the interactome data. GMJ and SJ: Affinity purified KSHV ORFs from human cells for MS. KK: Assisted with generation of the KSHV ORF library. JRJ, TJ, WN and JH: Involved in MS analysis. JP: Assisted with ORF24 cloning and construction of KSHV mutants. MCM, RDH: Performed evolutionary analysis. TJ and WN: Assisted with affinity purification of the KSHV ORFs. MS: Assisted with figure design and graphics. NJK and BAG: Oversaw all experiments and assisted with data interpretation. ZD, EV, NJK and BAG wrote the paper.
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- Arumugaswami V, Wu TT, Martinez-Guzman D, Jia Q, Deng H, Reyes N, Sun R. ORF18 is a transfactor that is essential for late gene transcription of a gammaherpesvirus. Journal of virology. 2006;80:9730–9740. doi: 10.1128/JVI.00246-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Aubry V, Mure F, Mariame B, Deschamps T, Wyrwicz LS, Manet E, Gruffat H. Epstein-barr virus late gene transcription depends on the assembly of a virus-specific preinitiation complex. Journal of virology. 2014;88:12825–12838. doi: 10.1128/JVI.02139-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bauer-Mehren A, Bundschus M, Rautschka M, Mayer MA, Sanz F, Furlong LI. Gene-disease network analysis reveals functional modules in mendelian, complex and environmental diseases. PloS one. 2011;6:e20284. doi: 10.1371/journal.pone.0020284. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brulois KF, Chang H, Lee AS, Ensser A, Wong LY, Toth Z, Lee SH, Lee HR, Myoung J, Ganem D, et al. Construction and manipulation of a new Kaposi’s sarcoma-associated herpesvirus bacterial artificial chromosome clone. Journal of virology. 2012;86:9708–9720. doi: 10.1128/JVI.01019-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bushnell DA, Westover KD, Davis RE, Kornberg RD. Structural basis of transcription: an RNA polymerase II-TFIIB cocrystal at 4.5 Angstroms. Science. 2004;303:983–988. doi: 10.1126/science.1090838. [DOI] [PubMed] [Google Scholar]
- Calderwood MA, Venkatesan K, Xing L, Chase MR, Vazquez A, Holthaus AM, Ewence AE, Li N, Hirozane-Kishikawa T, Hill DE, et al. Epstein-Barr virus and virus human protein interaction maps. Proceedings of the National Academy of Sciences of the United States of America. 2007;104:7606–7611. doi: 10.1073/pnas.0702332104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dorjsuren D, Lin Y, Wei W, Yamashita T, Nomura T, Hayashi N, Murakami S. RMP, a novel RNA polymerase II subunit 5-interacting protein, counteracts transactivation by hepatitis B virus X protein. Molecular and cellular biology. 1998;18:7546–7555. doi: 10.1128/mcb.18.12.7546. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Engelhardt OG, Smith M, Fodor E. Association of the influenza A virus RNA-dependent RNA polymerase with cellular RNA polymerase II. Journal of virology. 2005;79:5812–5818. doi: 10.1128/JVI.79.9.5812-5818.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Genomes Project C, Abecasis GR, Altshuler D, Auton A, Brooks LD, Durbin RM, Gibbs RA, Hurles ME, McVean GA. A map of human genome variation from population-scale sequencing. Nature. 2010;467:1061–1073. doi: 10.1038/nature09534. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Germain MA, Chatel-Chaix L, Gagne B, Bonneil E, Thibault P, Pradezynski F, de Chassey B, Meyniel-Shicklin L, Lotteau V, Baril M, et al. Elucidating Novel Hepatitis C Virus/Host Interactions Using Combined Mass Spectrometry and Functional Genomics Approaches. Molecular & cellular proteomics : MCP. 2013 doi: 10.1074/mcp.M113.030155. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gruffat H, Kadjouf F, Mariame B, Manet E. The Epstein-Barr virus BcRF1 gene product is a TBP-like protein with an essential role in late gene expression. Journal of virology. 2012;86:6023–6032. doi: 10.1128/JVI.00159-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Guglielmi B, La Rochelle N, Tjian R. Gene-specific transcriptional mechanisms at the histone gene cluster revealed by single-cell imaging. Molecular cell. 2013;51:480–492. doi: 10.1016/j.molcel.2013.08.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hirsch AJ. The use of RNAi-based screens to identify host proteins involved in viral replication. Future microbiology. 2010;5:303–311. doi: 10.2217/fmb.09.121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Homa FL, Glorioso JC, Levine M. A specific 15-bp TATA box promoter element is required for expression of a herpes simplex virus type 1 late gene. Genes & development. 1988;2:40–53. doi: 10.1101/gad.2.1.40. [DOI] [PubMed] [Google Scholar]
- Isomura H, Stinski MF, Murata T, Yamashita Y, Kanda T, Toyokuni S, Tsurumi T. The human cytomegalovirus gene products essential for late viral gene expression assemble into prereplication complexes before viral DNA replication. Journal of virology. 2011;85:6629–6644. doi: 10.1128/JVI.00384-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jager S, Cimermancic P, Gulbahce N, Johnson JR, McGovern KE, Clarke SC, Shales M, Mercenne G, Pache L, Li K, et al. Global landscape of HIV-human protein complexes. Nature. 2011a;481:365–370. doi: 10.1038/nature10719. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jager S, Gulbahce N, Cimermancic P, Kane J, He N, Chou S, D’Orso I, Fernandes J, Jang G, Frankel AD, et al. Purification and characterization of HIV-human protein complexes. Methods. 2011b;53:13–19. doi: 10.1016/j.ymeth.2010.08.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jia Q, Wu TT, Liao HI, Chernishof V, Sun R. Murine gammaherpesvirus 68 open reading frame 31 is required for viral replication. Journal of virology. 2004;78:6610–6620. doi: 10.1128/JVI.78.12.6610-6620.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kim Y, Geiger JH, Hahn S, Sigler PB. Crystal structure of a yeast TBP/TATA-box complex. Nature. 1993;365:512–520. doi: 10.1038/365512a0. [DOI] [PubMed] [Google Scholar]
- Lee S, Salwinski L, Zhang C, Chu D, Sampankanpanich C, Reyes NA, Vangeloff A, Xing F, Li X, Wu TT, et al. An integrated approach to elucidate the intra-viral and viral-cellular protein interaction networks of a gamma-herpesvirus. PLoS pathogens. 2011;7:e1002297. doi: 10.1371/journal.ppat.1002297. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Listerman I, Sapra AK, Neugebauer KM. Cotranscriptional coupling of splicing factor recruitment and precursor messenger RNA splicing in mammalian cells. Nature structural & molecular biology. 2006;13:815–822. doi: 10.1038/nsmb1135. [DOI] [PubMed] [Google Scholar]
- Maher MC, Hernandez RD. A MOSAIC of methods: Improving ortholog detection through integration of algorithmic diversity. J arXiv preprint. 2014 arXiv: 1309.2319. [Google Scholar]
- Mavankal G, Ignatius Ou SH, Oliver H, Sigman D, Gaynor RB. Human immunodeficiency virus type 1 and 2 Tat proteins specifically interact with RNA polymerase II. Proceedings of the National Academy of Sciences of the United States of America. 1996;93:2089–2094. doi: 10.1073/pnas.93.5.2089. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mesri EA, Feitelson MA, Munger K. Human Viral Oncogenesis: A Cancer Hallmarks Analysis. Cell host & microbe. 2014;15:266–282. doi: 10.1016/j.chom.2014.02.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Murakami K, Elmlund H, Kalisman N, Bushnell DA, Adams CM, Azubel M, Elmlund D, Levi-Kalisman Y, Liu X, Gibbons BJ, et al. Architecture of an RNA polymerase II transcription pre-initiation complex. Science. 2013;342:1238724. doi: 10.1126/science.1238724. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Myoung J, Ganem D. Generation of a doxycycline-inducible KSHV producer cell line of endothelial origin: maintenance of tight latency with efficient reactivation upon induction. Journal of virological methods. 2011;174:12–21. doi: 10.1016/j.jviromet.2011.03.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Navratil V, de Chassey B, Combe CR, Lotteau V. When the human viral infectome and diseasome networks collide: towards a systems biology platform for the aetiology of human diseases. BMC systems biology. 2011;5:13. doi: 10.1186/1752-0509-5-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Osterberg S, del Peso-Santos T, Shingler V. Regulation of alternative sigma factor use. Annual review of microbiology. 2011;65:37–55. doi: 10.1146/annurev.micro.112408.134219. [DOI] [PubMed] [Google Scholar]
- Perng YC, Campbell JA, Lenschow DJ, Yu D. Human cytomegalovirus pUL79 is an elongation factor of RNA polymerase II for viral gene transcription. PLoS pathogens. 2014;10:e1004350. doi: 10.1371/journal.ppat.1004350. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pichlmair A, Kandasamy K, Alvisi G, Mulhern O, Sacco R, Habjan M, Binder M, Stefanovic A, Eberle CA, Goncalves A, et al. Viral immune modulators perturb the human molecular network by common and unique strategies. Nature. 2012;487:486–490. doi: 10.1038/nature11289. [DOI] [PubMed] [Google Scholar]
- Rajagopala SV, Casjens S, Uetz P. The protein interaction map of bacteriophage lambda. BMC microbiology. 2011;11:213. doi: 10.1186/1471-2180-11-213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rozenblatt-Rosen O, Deo RC, Padi M, Adelmant G, Calderwood MA, Rolland T, Grace M, Dricot A, Askenazi M, Tavares M, et al. Interpreting cancer genomes using systematic host network perturbations by tumour virus proteins. Nature. 2012;487:491–495. doi: 10.1038/nature11288. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smoot ME, Ono K, Ruscheinski J, Wang PL, Ideker T. Cytoscape 2.8: new features for data integration and network visualization. Bioinformatics. 2011;27:431–432. doi: 10.1093/bioinformatics/btq675. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sturzl M, Gaus D, Dirks WG, Ganem D, Jochmann R. Kaposi’s sarcoma-derived cell line SLK is not of endothelial origin, but is a contaminant from a known renal carcinoma cell line. International journal of cancer Journal international du cancer. 2013;132:1954–1958. doi: 10.1002/ijc.27849. [DOI] [PubMed] [Google Scholar]
- Swift S, Lorens J, Achacoso P, Nolan GP. Rapid production of retroviruses for efficient gene delivery to mammalian cells using 293T cell-based systems. In: Coligan John E, et al., editors. Current protocols in immunology. Unit 10. Chapter 10. 2001. p. 17C. [DOI] [PubMed] [Google Scholar]
- Szpiech ZA, Hernandez RD. selscan: An Efficient Multithreaded Program to Perform EHH-Based Scans for Positive Selection. Molecular biology and evolution. 2014 doi: 10.1093/molbev/msu211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Takramah D, Seiffert BM, Schaller S, Vigneron M, Hacker G. Baculovirus P35 interacts with a subunit of human RNA polymerase II and can enhance promoter activity in human cells. The Journal of general virology. 2003;84:3011–3019. doi: 10.1099/vir.0.19395-0. [DOI] [PubMed] [Google Scholar]
- Tang S, Yamanegi K, Zheng ZM. Requirement of a 12-base-pair TATT-containing sequence and viral lytic DNA replication in activation of the Kaposi’s sarcoma-associated herpesvirus K8.1 late promoter. Journal of virology. 2004;78:2609–2614. doi: 10.1128/JVI.78.5.2609-2614.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thomas MC, Chiang CM. The general transcription machinery and general cofactors. Critical reviews in biochemistry and molecular biology. 2006;41:105–178. doi: 10.1080/10409230600648736. [DOI] [PubMed] [Google Scholar]
- Uetz P, Dong YA, Zeretzke C, Atzler C, Baiker A, Berger B, Rajagopala SV, Roupelieva M, Rose D, Fossum E, et al. Herpesviral protein networks and their interaction with the human proteome. Science. 2006;311:239–242. doi: 10.1126/science.1116804. [DOI] [PubMed] [Google Scholar]
- Vance RE, Isberg RR, Portnoy DA. Patterns of pathogenesis: discrimination of pathogenic and nonpathogenic microbes by the innate immune system. Cell host & microbe. 2009;6:10–21. doi: 10.1016/j.chom.2009.06.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Verschueren E, Von Dollen J, Cimermancic P, Gulbahce N, Sali A, Krogan NJ. Analyzing AP-MS data with Mass Spectrometry Interaction Statistics (MIST) Current Protocols. 2014 doi: 10.1002/0471250953.bi0819s49. In press. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Voight BF, Kudaravalli S, Wen X, Pritchard JK. A map of recent positive selection in the human genome. PLoS biology. 2006;4:e72. doi: 10.1371/journal.pbio.0040072. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wong E, Wu TT, Reyes N, Deng H, Sun R. Murine gammaherpesvirus 68 open reading frame 24 is required for late gene expression after DNA replication. Journal of virology. 2007;81:6761–6764. doi: 10.1128/JVI.02726-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wong-Ho E, Wu TT, Davis ZH, Zhang B, Huang J, Gong H, Deng H, Liu F, Glaunsinger B, Sun R. Unconventional Sequence Requirement for Viral Late Gene Core Promoters of Murine Gammaherpesvirus 68. Journal of virology. 2014 doi: 10.1128/JVI.01374-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu TT, Park T, Kim H, Tran T, Tong L, Martinez-Guzman D, Reyes N, Deng H, Sun R. ORF30 and ORF34 are essential for expression of late genes in murine gammaherpesvirus 68. Journal of virology. 2009;83:2265–2273. doi: 10.1128/JVI.01785-08. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wyrwicz LS, Rychlewski L. Identification of Herpes TATT-binding protein. Antiviral research. 2007;75:167–172. doi: 10.1016/j.antiviral.2007.03.002. [DOI] [PubMed] [Google Scholar]
- Zhou C, Knipe DM. Association of herpes simplex virus type 1 ICP8 and ICP27 proteins with cellular RNA polymerase II holoenzyme. Journal of virology. 2002;76:5893–5904. doi: 10.1128/JVI.76.12.5893-5904.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Figure S1 (associated with Figure 1): Protein expression levels and KSHV latency protein interactors linked to cancer.
(A) Plasmids expressing strep-tagged KSHV ORFs were transfected into 293T cells and their expression level compared to that in unreactivated (no doxycycline) and reactivated (with doxycycline) KSHV.WT iSLK cells using the indicated antibodies against each viral protein. Viral proteins in 293T cells are slightly larger than in infected cells due to the strep tag.
(B) ROC curves comparing the optimal MiST feature weights to CompPASS (Sowa et al., 2009) and SAINT (Choi et al., 2011) scoring algorithms. The high-confidence data set (MiST ≥ 0.7) is indicated with an x.
(C) Viral and cellular protein pairs were mapped in three-dimensional space showing the reproducibility, abundance, and specificity features of the MiST score. Interaction pairs with MiST scores ≥ 0.7 are indicated in red.
(D) Bar graph showing the percentage of all human proteins, versus the subset that interact with KSHV lytic or latent proteins, that are associated with cancer using the cBioPortal (TCGA) cancer mutation dataset (Cerami et al., 2012). ** Indicates a p-value of < 0.005 as determined by the hyper-geometric test.
(E) Network of KSHV latency protein interactions showing enrichment for cellular proteins linked to cancer. The KSHV latency protein-human interactome is shown with viral proteins as gold nodes and associated cellular proteins as red (associated with cancer) or grey (not associated with cancer) nodes. Disease terms from the DisGeNET (Bauer-Mehren et al., 2011) database were manually collapsed to create curated terms for the network. Interactions between cellular proteins present in the interaction network for each individual viral protein are displayed using black edges. Host-host edges were manually curated from CORUM (Ruepp et al., 2010) and the literature.
Figure S2 (associated with Figure 2): Full high-confidence KSHV interactome network overlaid with all host-host connections and cellular disease-linked proteins.
The KSHV-human interactome is shown with viral proteins as gold nodes and associated cellular proteins as pie chart nodes, with the coloring indicating the association of that cellular protein with a particular disease (specified by the colors in the legend). Host proteins without disease association are shown in grey. Disease terms from the DisGeNET (Bauer-Mehren et al., 2011) database were manually collapsed to create curated terms for the network. All known interactions between cellular proteins present in the interactome are displayed using black and grey edges. Host-host edges were manually curated from CORUM (Ruepp et al., 2010) and the literature.
Figure S3 (associated with Figure 3): Gene ontology and domain enrichment for the KSHV interactome.
(A) Heat map depicting the enrichment of biological processes of cellular proteins interacting with KSHV baits via a GO term analysis. The color scale corresponds to the significance of the bait-term association q-values (p-values adjusted for FDR).
(B) Heat map representing the enrichment q-values for the domains found in the interacting proteins of KSHV.
Figure S4 (associated with Figure 4): ORF24 interacts with Pol II in a DNA-independent manner and the interaction is conserved across ORF24 orthologs.
(A) Pol II interaction is not a conserved feature of viral transcription factors. Lysates of 293T cells transfected with either empty vector or plasmids expressing the strep-tagged KSHV transcription factors K8α or ORF50 (RTA) were subjected to α-strep IP and Western blotted with antibodies against strep and Pol II. Input lanes represent 5% of the lysate used for IP.
(B) The interaction between ORF24 and Pol II is DNA independent. Lysates were prepared as described above, but were treated with DNase prior to α-FLAG IP. Input lanes represent 5% of the lysate used for IP. GAPDH was used as a loading and IP specificity control.
(C) Percent identity matrix for ORF24 orthologs.
(D–F) ORF24 orthologs from other herpesviruses all interact with Pol II. 293T cells were transfected with plasmids expressing FLAG mORF24 from MHV68 (E), FLAG BcRF1 from EBV (F), or strep UL87 from HCMV (G). The viral proteins were then subjected to IP with the indicated antibody and their interaction with Pol II was monitored by Western blotting with an antibody against RPB1. Input lanes represent 5% of the lysate used for IP. Where indicated, RPS6, GAPDH, and tubulin served as loading and IP specificity controls.
(G) Deletion of ORF24 from KSHV prevents expression of late but not early genes. Western blots of lysates from iSLK cells lytically infected with WT KSHV, KSHV bearing a stop codon at the 5′ end of ORF24 (KSHV.24.Stop), or a mutant rescue KSHV in which the stop mutation was repaired to WT (KSHV.MR). SOX (ORF37) is a KSHV protein expressed with early kinetics, whereas K8.1 is a KSHV late protein. Pol II serves as a loading control.
(H) ORF24 is not required for viral DNA amplification. qRT-PCR measurements of the DNA levels of the KSHV viral genome in iSLK cells containing WT KSHV, KSHV.24.Stop, or KSHV.MR 24 and 48 hr after induction into the lytic cycle.
(I) Lysates of the KSHV.24.Stop iSLK cell lines transduced with the indicated WT or mutant ORF24 for the complementation assay were Western blotted with anti-FLAG antibodies to confirm equivalent expression of the ORF24 proteins. Pol II serves as a loading control.
Figure S5 (associated with Figure 5): ORF24 mutants unable to bind either DNA or Pol II are defective for late gene expression and virion production.
(A) RNA harvested from lytically reactivated KSHV.WT and KSHV.24.Stop iSLK cells complemented with the indicated WT or mutant ORF24 was Northern blotted using strand specific primer probes against the late genes ORF8, ORF65, ORF25, K8.1, and ORF52, the ORF38 early gene, or 7SL (as a loading control).
(B) Both the DNA binding and Pol II binding activities of ORF24 are required for production of infectious progeny virions. KSHV.WT and KSHV.24.Stop iSLK cells complemented with WT ORF24, ORF24-Pol II binding mutants, or DNA-binding mutants were lytically reactivated for 48 hr and supernatant from these cells was collected and transferred to recipient 293T cells. After 24 hr, the recipient cells were monitored for GFP expression, as infectious virus constitutively expresses GFP. DNA is stained with DAPI (blue). A minimum of 30 fields were monitored for each experiment, with a single representative field shown. No GFP positive cells were detected in any field of recipient cells containing supernatant from infected cells lacking WT ORF24. Scale bar = 50 μm.
(C) Coomassie stained gel showing the purity of the ORF24 protein used in EMSA assays.
Figure S6 (associated with Figure 6): ORF24 interacts selectively with Pol II and is needed for late gene PIC assembly.
(A) Lysates of iSLK cells either latently (− doxycycline) or lytically (+ doxycycline) infected with KSHV.FLAG.ORF24 were subjected to α-FLAG IP and Western blotted with antibodies to the indicated general transcription factors or GAPDH as a control. Input lanes represent 5% of the lysate used for IP.
(B) Chromatin from iSLK cells lytically infected with KSHV.WT or KSHV.24.Stop was isolated and subjected to ChIP using antibodies against TFIIB. Co-precipitating DNA was detected by PCR using radiolabeled nucleotides and primers specific for the indicated promoter. Experiments were repeated at least three times and a representative image is presented with the ChIPs quantified against input samples.
Table S1: MS data for all proteins detected in the KSHV interactome and ROC data, and attempted interactome validations, related to Figures 1 and 2.
MiST-abundance, MiST-reproducibility, and MiST-specificity refer to the individual component scores for each feature. The MiST composite score is the weighted combination of the specificity and reproducibility scores. The unique peptide average is the sum of unique prey peptides across all pulled down baits, divided by the number of replicates. The “high confidence interactions” tab contains all interactors with MiST scores ≥0.7 while the “All detected interactions” tab contains all collected data regardless of score. In the “ROC data” tab are all of the data points for the ROC curves. RAS refers to the order of the Reproducibility, Abundance, and Specificity weights for each curve. Scores above threshold refers to the number of interactions with MiST scores at or above the listed MiST threshold value. The “validated interactions” tab refers to the attempted validations by co-IP done for high confidence interaction partners identified by the interactome. Positive interactions are indicated by +; the absence of a detected interaction by co-IP is indicated by −. Interaction partners were epitope-tagged expression plasmids unless otherwise indicated.
Table S2: Previously described interactions for KSHV viral proteins, related to Figure 1.
Prey proteins found in the high (H) confidence data set correspond to a MiST score of ≥0.7. Proteins found below the cutoff are labeled low (L) while proteins absent from the MS, regardless of score are labeled not detected (ND).
Table S3: Disease network dictionary, related to Figures 1, S1, and S2.
Disease terms from the DisGeNET database were manually collapsed to create curated terms for the Figures S1 and S2 networks. The first tab (Disease term dictionary) contains the dictionary and the second tab (Disease term summary) defines which disease terms were collapsed together to generate the new term.
Table S4: KSHV-HIV-1 high confidence overlapping proteins, related to Figure 1.
Cellular proteins interacting with at least one HIV-1 and KSHV protein are divided by functional category. The viral proteins interacting with the cellular protein are written after in parentheses with the KSHV protein(s) first in black and the HIV-1 protein(s) second in orange.
Table S5: Host-host edges in interactome network depictions, related to Figures 2 and S2.
Host-host edges depicted in Figure 2 were restricted to host protein pairs that interact with the same KSHV bait (listed in the “Hu-Hu interactions in Fig 2” tab). In Figure S2, all host-host edges, including host protein pairs that interact with different KSHV baits, were shown (listed in the “Hu-Hu interactions in Fig S2” tab).
Table S6: Gene ontology term curation, related to Figures 3 and S3.
Gene ontology terms from the GO biological processes database were manually collapsed to create curated terms for the heat map. The first tab (Dictionary for GO terms) contains the dictionary and the second tab (GO terms in dictionary) defines which processes were collapsed together to generate the term.
Table S7: Domain and GO term summary, related to Figures 3 and S3.
Cellular location refers to the location on the Figure 3B cell diagram. The ORFs to annotate in the various locations were determined by significant enrichment in the listed GO terms or domains.







