Skip to main content
Elsevier Sponsored Documents logoLink to Elsevier Sponsored Documents
. 2021 Apr 29;184(9):2394–2411.e16. doi: 10.1016/j.cell.2021.03.012

Discovery and functional interrogation of SARS-CoV-2 RNA-host protein interactions

Ryan A Flynn 1,12,13,14,15,, Julia A Belk 2,3,12, Yanyan Qi 3, Yuki Yasumoto 4, Jin Wei 5,6, Mia Madel Alfajaro 5,6, Quanming Shi 7, Maxwell R Mumbach 7, Aditi Limaye 3, Peter C DeWeirdt 8, Cameron O Schmitz 5,6, Kevin R Parker 7, Elizabeth Woo 9, Howard Y Chang 7,10, Tamas L Horvath 4, Jan E Carette 11, Carolyn R Bertozzi 1,10, Craig B Wilen 5,6,∗∗, Ansuman T Satpathy 3,∗∗∗
PMCID: PMC7951565  PMID: 33743211

Summary

SARS-CoV-2 is the cause of a pandemic with growing global mortality. Using comprehensive identification of RNA-binding proteins by mass spectrometry (ChIRP-MS), we identified 309 host proteins that bind the SARS-CoV-2 RNA during active infection. Integration of this data with ChIRP-MS data from three other RNA viruses defined viral specificity of RNA-host protein interactions. Targeted CRISPR screens revealed that the majority of functional RNA-binding proteins protect the host from virus-induced cell death, and comparative CRISPR screens across seven RNA viruses revealed shared and SARS-specific antiviral factors. Finally, by combining the RNA-centric approach and functional CRISPR screens, we demonstrated a physical and functional connection between SARS-CoV-2 and mitochondria, highlighting this organelle as a general platform for antiviral activity. Altogether, these data provide a comprehensive catalog of functional SARS-CoV-2 RNA-host protein interactions, which may inform studies to understand the host-virus interface and nominate host pathways that could be targeted for therapeutic benefit.

Keywords: ChIRP-MS, SARS-CoV-2, host-pathogen interactions, RNA virus, CRISPR, RNA-binding proteins, mitochondria

Graphical abstract

graphic file with name fx1.jpg

Highlights

  • ChIRP-MS of SARS-CoV-2 RNA identifies viral RNA-host protein interaction networks

  • Comparative analysis identifies SARS-specific and multi-viral RNA-protein complexes

  • SARS-CoV-2 interactome-focused CRISPR screens reveal a broad antiviral response

  • Host mitochondria serve as a general organelle platform for anti-SARS-CoV-2 immunity


Interrogation of SARS-CoV-2 RNA-host protein interaction networks by ChIRP-MS and CRISPR screens, in comparison with other human viruses such as flaviviruses, picornavirus, and rhinovirus, identifies complexes specific to SARS-CoV-2 infection and highlights the role of mitochondria in mediating antiviral immunity.

Introduction

Despite similarities in replication strategies of their compact genomes, positive single-stranded RNA (ssRNA) viruses cause a remarkable variety of human diseases. Mosquito-borne flaviviruses such as dengue virus and Zika virus cause systemic disease, while human coronaviruses generally cause respiratory symptoms (Ahlquist, 2006; Carrasco-Hernandez et al., 2017). The recent pandemic emergence of the novel coronavirus severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), which can cause potentially fatal coronavirus disease 2019 (COVID-19), illustrates the threat to public health posed by RNA viruses. Less than 1 year into the outbreak, more than 103 million people have been infected by SARS-CoV-2, and 2.3 million people have died. The severity of the virus has caused global economic disruption, and treatment options remain limited, in part due to an incomplete understanding of the molecular determinants of viral pathogenesis.

The process of infecting a host cell is complex, multistep, and often highly virus-specific. Viruses must bind and enter host cells, and once inside the cell, their genetic material leverages and remodels cellular pathways to express, replicate, and produce new infectious virions. RNA viruses deposit large autonomous RNA transcripts into the dense intracellular milieu of the host cells, which eventually generate virally encoded protein products. Together, these RNA and protein species remodel the cell to facilitate the viral life cycle. We and others have demonstrated the utility of functionally exploring how different virally derived molecules hijack the host, in particular in the context of flaviviruses (Li et al., 2020). For example, mapping physical associations between the host and virus at the level of protein-protein interactions (PPIs) have defined key pathways relevant to infection (Eckhardt et al., 2020). In parallel to efforts that focus on viral proteins, a number of groups have taken an RNA-centric view of the host-viral interface to understand how host cells recognize and respond to the RNA genome (Kim et al., 2020a; Lenarcic et al., 2013; Ooi et al., 2019; Phillips et al., 2016; Viktorovskaya et al., 2016). Finally, genetic screening efforts provide another strategy to discover cellular proteins and pathways that are essential for viral replication or that are part of the host innate immune responses (Puschnik et al., 2017; Schoggins and Rice, 2011).

While there has been significant past work to understand coronaviruses (Cockrell et al., 2018; Gralinski and Baric, 2015), the emergence of novel strains that are highly transmissible and cause severe disease in humans has underscored the need for further study (Menachery et al., 2015). Recent studies have described SARS-CoV-2-encoded proteins (Kim et al., 2020b) and how these proteins associate with host protein factors (Gordon et al., 2020) or host RNA transcripts (Banerjee et al., 2020); however, there is a gap in understanding the precise host interactions of the SARS-CoV-2 viral RNA (vRNA). To address this gap, we used comprehensive identification of RNA-binding proteins by mass spectrometry (ChIRP-MS), which provides a comprehensive view of the host interactions of vRNAs (Chu et al., 2015). This strategy provided an opportunity to define the shared and SARS-CoV-2-specific host pathways that associate with vRNAs. We combined the RNA-centric approach with genome-wide and focused mini-pool genetic perturbations, which demonstrated that the majority of functional SARS-CoV-2 RNA-binding factors protect the host from virus-induced cell death. Finally, we discovered a physical and functional interaction between SARS-CoV-2 and host mitochondria, particularly as a subcellular platform for antiviral host proteins.

Results

ChIRP-MS of SARS-CoV-2 viral RNA in infected mammalian cells

To define the host protein interactome of the ∼30kb SARS-CoV-2 RNA, we performed ChIRP-MS (Figure 1A). ChIRP-MS is advantageous as a discovery tool because it uses formaldehyde as a crosslinking agent to recover entire protein complexes associated with cellular RNAs (Chu and Chang, 2018; Chu et al., 2015). We selected two cell lines: Huh7.5, a human hepatocyte cell line that is naturally susceptible to productive infection by SARS-CoV-2, and Vero E6, a monkey kidney cell line that dominates the research space for preparation and propagation of SARS-CoV-2 and other viruses (Harcourt et al., 2020; Zhou et al., 2020). We tiled 108 biotinylated oligonucleotide probes (Table S1) to capture the full-length positive-strand vRNA, which includes subgenomic RNA species that accumulate to higher copy numbers during infection (Kim et al., 2020b). We performed ChIRP-MS experiments at two different time points, 24 and 48 h post infection (h.p.i.), to (1) comprehensively identify all vRNA-binding factors and (2) understand the temporal association of host factors with the vRNA (Figure 1A). From each condition, input and ChIRP-enriched RNA and protein samples were collected for analysis (Figure 1A). Analysis of enriched ChIRP protein samples showed that mock samples had little protein staining, while we observed an infection- and time-dependent increase in total protein recovered after infection of either cell line with SARS-CoV-2 (Figure 1B). The band present in all infected samples at ∼50 kDa is consistent with the viral nucleocapsid (N) protein (Figure 1B; Chang et al., 2014). We assessed the technical quality of the ChIRP by analyzing the viral and host RNAs recovered. RNA sequencing from mock samples resulted in negligible mapping to the SARS-CoV-2 genome before or after pull-down, as expected (Figure S1A). In contrast, in SARS-CoV-2 infected cells, we observed 2.7% (Huh7.5, 48 h.p.i.) and 14.4% (Vero E6, 48 h.p.i.) of all reads in total RNA mapping to the viral genomic RNA, which increased to 60% (Huh7.57, 48 h.p.i.) and 68% (Vero E6 48 h.p.i.) after pull-down, demonstrating robust enrichment of vRNA after ChIRP (Figure S1A). Since coronaviruses produce full-length as well as subgenomic RNAs, we next assessed whether ChIRP-MS was biased for the higher molar copy subgenomic RNAs. ChIRP enrichment showed robust coverage of the ORF1a/b region as well as of the subgenomic RNA regions, which was visually and quantitatively similar to the input coverage across Huh7.5 and Vero E6 (Figures 1C–1E). Together these protein- and RNA-level quality controls demonstrate the robust sampling of the entire SARS-CoV-2 positive-strand RNA by the designed ChIRP-MS probes.

Figure 1.

Figure 1

ChIRP-MS identifies host and viral proteins associated with the SARS-CoV-2 RNA genome in infected cells

(A) Schematic of the ChIRP-MS protocol.

(B) SDS-PAGE analysis of total protein samples enriched using SARS-CoV-2 targeting biotinylated oligonucleotides from mock (uninfected) cells or cells infected for 24 or 48 h with SARS-CoV-2.

(C) Quantification of the percentage of reads mapping to SARS-CoV-2 gRNA (ORF1a/b) versus the subgenomic RNA before and after pull-down.

(D) RNA-seq coverage of the SARS-CoV-2 genome before and after pull-down.

(E) Structure of the SARS-CoV-2 genome.

(F) ChIRP-MS enrichment of each viral protein in Huh7.5 and Vero E6 cells at the indicated time points.

Figure S1.

Figure S1

SARS-CoV-2 ChIRP-RNA-seq in Huh7.5 and Vero E6 cells, related to Figure 1

(A) Host and viral RNA-seq alignment statistics for all samples across Huh7.5 (left) and VeroE6 (right) cell lines.

(B) Enriched host RNAs after viral RNA pulldown in VeroE6 cell line 48 h.p.i. (left) and conservation across time points (right).

(C) Enriched host RNAs after viral RNA pulldown in Huh7.5 cell line 48 h.p.i. (left) and comparison across time points (right).

SARS-CoV-2 encodes 16 nonstructural proteins, 4 structural proteins, and 6 accessory proteins (Finkel et al., 2020) (Figure 1E). We observed that 13 of 26 viral proteins were reproducibly enriched, including RNA-binding viral proteins. In the subgenomic RNA region, the major viral proteins conserved across cell types were N, M, and S, while ORF3a and 7a were selectively enriched from infected Huh7.5 cells (Figure 1F). Within the larger ORF1a/b, nsp3 and nsp4 were enriched in both cell lines; however, we saw stronger association of the known RNA-binding proteins (RBPs, names in red in Figure 1E) in Vero E6 cells (Figure 1F). The robust enrichment of specific ORF1a/b-encoded proteins provides strong evidence that the ChIRP-MS approach samples interactions across the entire length of the genomic RNA. However, species, cell type, and sex of organism differences between Vero E6 and Huh7.5 may underlie differences in overall interactomes. For example, Vero E6 cells support higher SARS-CoV-2 replication and viral egress, while replication in Huh7.5 cells reaches lower peak levels with delayed kinetics (Harcourt et al., 2020). Nonetheless, viral protein enrichments were specific and reproducible, and the common features of these cell lines enabled us to define a core SARS-CoV-2 RNA-associated proteome.

A comprehensive atlas of host-factors that interact with the SARS-CoV-2 genomic RNA

To define the host-derived interacting proteins of the SARS-CoV-2 RNA, we searched the ChIRP-MS data against a database of known monkey or human proteins. Comparing SARS-CoV-2-infected to mock (uninfected) cells, we defined high-confidence interactomes in each condition (FDR ≤ 0.05, LFC > 0; Figures 2A and 2B). A total of 163 (Vero E6) and 229 (Huh7.5) host factors were bound to the SARS-CoV-2 RNA (Table S2). Analysis of the factors enriched at 24 versus 48 h.p.i revealed that most factors enriched in Vero E6 cells were invariant between the two time points (Figure 2C, left), while the Huh7.5 interactome evolved more dramatically over this period, with 48 h.p.i. showing an expanded set of interacting proteins (Figure 2C, middle). We repeated the same analysis on the ChIRP RNA sequencing (ChIRP-RNA-seq; Figures S1B and S1C), which is discussed in more detail below. We next compared the associated host factors across cell lines and found a core set of 83 factors co-bound in both cell lines, totaling 309 host factors aggregated across the two cell lines (Figure 2C, right). Given the more complete proteome reference, we focused our subsequent analysis on the human dataset.

Figure 2.

Figure 2

Changes in the SARS-CoV-2-associated proteome across time points and cell lines

(A and B) ChIRP-MS results in Vero E6 (A) or Huh7.5 (B) cells after viral RNA pull-down at 24 and 48 h.p.i. Significantly enriched proteins indicated in black (host proteins) or red (viral proteins).

(C) Conservation of enriched proteins between time points (left, middle) and cell lines (right).

(D) Cytoscape network representation of the SARS-CoV-2-associated human proteome. Colors indicate ChIRP enrichment in Huh7.5 cells 48 h.p.i.

We visualized the high-confidence human interactome using Cytoscape (Shannon et al., 2003), where each node represents a protein significantly enriched in the Huh7.5 ChIRP-MS dataset, and nodes are connected if there is a previously described PPI (Figure 2D). Structuring this network by broad functional categories demonstrated the diversity of host proteins associated with the vRNA, spanning generic RNA adaptor proteins, RNA helicases, RNA processing enzymes, and RNA modification enzymes (Figure 2D). We also noted a set of relatively unexpected pathways including metabolic enzymes, intracellular vesicle proteins, cytosolic signaling, cytoskeleton, and intracellular trafficking proteins (Figure 2D). We next compared these results between 24 and 48 h.p.i. and found that a set of RBPs were strongly bound early in infection, suggesting that these RBPs may be important for the earliest steps of detection or replication of the vRNA (Figure S2A). Comparing the Vero E6 and Huh7.5 interactomes revealed that the binding of core RBPs was highly conserved across cell lines, as were other categories such as nuclear complexes, poly-A binding proteins, and serine/arginine-rich splicing factors (Figure S2B). We next compared the ChIRP-MS results to a set of host factors identified by vRNA pull-down after UV-C crosslinking (RNA antisense purification MS [RAP-MS]; Schmidt et al., 2021) and found that the majority of RAP-MS factors (30/47, 64%) were also enriched in the ChIRP-MS dataset (Figures S3A and S3B). However, ChIRP-MS enriched an additional 199 proteins that were not identified as significant in the UV-C dataset. The increased scope, but high specificity, of ChIRP-enriched factors is consistent with prior reports (Chu et al., 2015; Ooi et al., 2019) and is due to crosslinking differences between formaldehyde and UV-C. We confirmed this finding by comparing enrichments of each method within the combined high-confidence interactomes (FDR ≤ 0.05, average LFC ≥ 0; Figure S3D, left) and expanded interactomes (average LFC ≥ 1; Figure S3D, right). Finally, we compared the ChIRP-MS data to the host-viral protein-protein interactome (PPI; Gordon et al., 2020). We found that only 11/332 host factors (3.3%) from the PPI study overlapped with the ChIRP-MS network (Figures S3A and S3C), demonstrating that SARS-CoV-2 RNA and proteins largely interact with distinct protein complexes inside of the cell. However, of the 11 host factors that bind both vRNA and viral proteins, RAB2A, RAB7A, and RAB10 have been validated as functional in SARS-CoV-2 infection (Hoffmann et al., 2021). Together, these comparisons highlight the orthogonality of an RNA-centric approach to PPI-based studies and the power of ChIRP-MS to discover large complexes associated with vRNAs during infection.

Figure S2.

Figure S2

SARS-CoV-2 ChIRP-MS across infection times and between cell types, related to Figure 2

(A) High-confidence SARS-CoV-2 human interactome network colored by time point (24 h.p.i., 48 h.p.i., or both).

(B) High-confidence SARS-CoV-2 human interactome network colored by cell line conservation.

Figure S3.

Figure S3

Comparison of SARS-CoV-2 ChIRP-MS to other RNA- and protein-centric views of the viral interactome in human cells, related to Figure 3

(A) Comparison of the high-confidence SARS-CoV-2 RNA associated human proteome obtained by RAP-MS (UV crosslinking; (Schmidt et al., 2021)) to that by formaldehyde crosslinking (ChIRP-MS; this study) and comparison of the SARS-CoV-2 RNA associated proteome to the SARS-CoV-2 protein associated proteome (PPI; (Gordon et al., 2020)).

(B) Overlap of human high-confidence interactomes obtained by RAP-MS or ChIRP-MS.

(C) Overlap of PPI and ChIRP-MS interactomes.

(D) Left: enrichment correlation of the human high-confidence interactomes obtained by RAP-MS or ChIRP-MS (FDR ≤ 0.05). Right: enrichment correlation of the human expanded interactomes obtained by RAP-MS or ChIRP-MS (average enrichment > = 1).

ChIRP-MS identifies factors expressed in human lung tissue

Although Huh7.5 and Vero E6 cells are common models for SARS-CoV-2 infection, neither is derived from the lung, which is the primary tissue targeted by SARS-CoV-2 infection. To understand whether host factors identified by ChIRP-MS in these cell lines may be relevant to human disease, we analyzed the expression of each host factor in single-cell RNA-seq (scRNA-seq) profiles from primary human lung cells (Travaglini et al., 2020). After excluding immune cells and putative doublets, we identified 30,700 cells that clustered into 17 distinct epithelial, endothelial, and stromal cell types (Figures S4A–S4C). Prior studies have demonstrated that multiple lung cell types express the SARS-CoV-2 entry receptor, ACE2, and the serine protease, TMPRSS2, including epithelial basal, club, and ciliated cells, and alveolar type 1 (AT-1) and 2 (AT-2) cells (Hou et al., 2020; Salahudeen et al., 2020; Sungnak et al., 2020). Human bronchial epithelial cell (HBEC) cultures have shown that ciliated cells may be the initial target of infection, which can later spread to other cell types (Ravindra et al., 2020). Therefore, we conservatively considered any cell type with moderate RNA expression levels of ACE2 and TMPRSS2 in the scRNA-seq data as relevant targets of SARS-CoV-2 (Figure 3A). The vast majority of core human ChIRP-MS factors (219/229; 95.6%) were detectably expressed in SARS-CoV-2 target cell types, as well as other cell types, and 215/219 detected factors were expressed at a level equal to or greater than ACE2 (Figure 3B). These results suggest that the vast majority of SARS-CoV-2 RBPs identified in Huh7.5 cells are robustly expressed and relevant to infection in primary human target cells of SARS-CoV-2 infection, although confirmation studies in primary or model human lung cells are warranted (Figure 3C). More broadly, this analysis suggests that vRNA-binding factors may be broadly expressed across cell types and thus play a functional role across multiple viruses and target cells.

Figure S4.

Figure S4

Re-analysis of single-cell RNA-seq analysis of human lung tissue, related to Figure 4

(A) Louvain clustering of all cells in the human lung scRNA-seq dataset are shown, alongside the expression of PTPRC (CD45) and TMPRSS2.

(B) For the final filtered dataset, putative doublets (doublet score > 0.15) were removed, and the subset of CD45-negative cells was identified. The resulting data was re-clustered and the cluster labels are shown.

(C) Representative marker genes for each cluster.

(D) Multi-viral comparison of associations with translation initiation (EIF) factors.

Figure 3.

Figure 3

Expression of the SARS-CoV-2 RNA-associated proteome across lung cell types and comparison to other RNA virus-associated proteomes

(A) Clustering and dimensionality reduction and gene expression of non-immune single-cell RNA-seq profiles from primary human lung tissue.

(B) Expression in single cells of the SARS-CoV-2 human core interactome. SARS-CoV-2 target clusters indicated with a box. Each dot represents the mean expression of a given gene in the core ChIRP-MS interactome across all cells in the indicated cluster.

(C) Histogram of expression of each gene in the core ChIRP-MS interactome (orange) compared with all other genes (gray) in the lung epithelial ciliated cell cluster.

(D) Principal component analysis of ChIRP enrichments in human cells across time points and viruses.

(E) Upset plot comparing expanded interactomes of each virus.

(F) GO term analysis of the expanded interactome of each virus.

(G) Comparison of proteasome subunits and proteasome accessory factor associations across viruses.

Inter-virus analysis of host factors reveals specificity of interacting cellular pathways

Interactions between vRNAs and host proteins play key roles in multiple aspects of viral infection (Fritzlar et al., 2019; Garcia-Blanco et al., 2016; Hosmillo et al., 2019). To understand how positive-stranded RNA viruses have evolved to interact with their host, we sought to compare the SARS-CoV-2 dataset to our previously generated ChIRP-MS data from the flaviviruses Zika (ZIKV, ZIKV-PRVABC59) and Dengue-2 (DENV, DENV-16681), as well as a human picornavirus and rhinovirus (RV, RV-B14; Ooi et al., 2019). We note that all datasets were collected from Huh7.5 cells except the rhinovirus data, which was collected from HeLa cells. Principal component analysis (PCA) of host factors enriched across viruses showed that PC1 separated all four viral types and PC2 further distinguished RV and demonstrated the time-dependent host factor changes for SARS-CoV-2 (Figure 3D). To facilitate a quantitative comparison across viruses, we defined an “expanded interactome,” consisting of proteins reproducibly enriched for each ChIRP-MS dataset: SARS-CoV-2-D1 (Huh7.5 24 h.p.i.), SARS-CoV-2-D2 (Huh7.5 48 h.p.i.), ZIKV-D2, DENV-D2, and RV-D2 (Table S3), resulting in about 1,000 proteins (Figure 3E). We found that the largest group of 425 proteins was shared across all ChIRP-MS datasets, suggesting a common host strategy for interacting with positive polarity, ssRNA viruses (Figure 3E).

We next performed gene ontology (GO) term analysis on the expanded interactomes of each virus. All viruses robustly enriched the intracellular RNP complex term; however, we found patterns of specificity when examining other terms (Figure 3F). For example, the SARS-CoV-2 interactome displayed a reduced enrichment of the ER and ribosome GO terms but an increased enrichment of mitochondria and proteasome GO terms (Figure 3F). Examining functional terms again corroborated a decreased enrichment of translation and splicing factor terms in the SARS-CoV-2 interactome, compared to that of the flaviviruses, but a specific increased enrichment of multiple immune pathways, such as antigen presentation, NF-κb signaling, and TNF signaling (Figure 3F). To understand the specific proteins driving these enrichments, we visualized all the individual subunits of the proteasome present in the ChIRP-MS as an example (Figure 3G). Previous work has reported a functional connection between proper proteasome function and coronavirus life cycles (Raaben et al., 2010), which together with our ChIRP-MS data may suggest that the vRNA directly leverages the proteasome during infection, potentially to modulate antigen presentation and/or evade host adaptive immunity. The specificity of association between the proteasome and the SARS-CoV-2 RNA, and clear validation of this interaction in the literature, motivated us to explore the set of RNA-centric viral interactomes across a number of other important cellular pathways.

Translational apparatus

After entry into the cytosol, one of the first steps of the viral life cycle is to express the protein products encoded in its genome, which requires interactions with the host translational apparatus. Work examining the translational capacity of RNA viruses has shown that, in contrast to flaviviruses, coronaviruses do not translate their mRNAs at higher efficiency than cellular mRNAs during infection (Finkel et al., 2020). A comparison of enriched translation initiation factors (eIFs) demonstrated quantitative differences across the viruses: flaviviruses strongly enriched EIF3A, 4G1, 3C, and 3D, while SARS-CoV-2 was relatively depleted for these factors but preferred EIF3B, 4H, 4B, 3F, and A3 (Figure S4D). Beyond translational initiation, we visualized enrichment for the core components of the 80S ribosome (Figure 4A). Here, we note that while there was specificity in the enrichment of specific ribosomal proteins (RPs), more striking was the generalized lack of association of the vast majority of the RPs with the SARS-CoV-2 vRNA compared to either DENV or ZIKV (Figure 4A). This is consistent with recent reports demonstrating global translation inhibition by SARS-CoV-2 encoded nsp1 (Edgil et al., 2006; Roth et al., 2017; Schubert et al., 2020; Thoms et al., 2020).

Figure 4.

Figure 4

Cellular context of expanded interactomes across viruses

Selected groups of proteins; their enrichment in SARS-CoV-2, ZIKV, DENV, and RV ChIRP-MS; and their approximate subcellular localization or categorization in the ribosome (A), classical RBPs and RNA helicases (B), ER and ER-targeting factors (C), RNA post-transcriptional modification factors including m6A family proteins (D), and cytoskeleton and cellular vesicle factors (E). Heatmap colors indicate the log2 of ChIRP-MS enrichment values. Each heatmap has a separate scale bar.

RNA-binding proteins

RBPs have a wide array of cellular functions and are often recovered with host or pathogenic RNAs (Geuens et al., 2016; Meier-Stephenson et al., 2018; Taschuk and Cherry, 2020). Heterogeneous nuclear ribonucleoproteins (hnRNPs), a large set of adaptor proteins (Geuens et al., 2016), showed robust interaction with all four viruses and similar enrichments for the majority of the 20 proteins we identified (Figure 4B). Dead-box RNA helicases (DDX), which remodel RNA structural elements (Jankowsky, 2011), showed a more virus-specific binding profile wherein family members such as DDX3X, 5, 6, and 38B were similar across viruses, while DDX21, 23, 42, and 46 were more specifically associated with the DENV and RV RNAs (Figure 4B).

Sec translocon, and ER-Golgi transport

We and others previously showed that RV weakly enriches factors related to membrane biology, in contrast to the functional use of membrane organelles like the ER by flaviviruses (Fernandez-Garcia et al., 2009; Mukhopadhyay et al., 2005; Ooi et al., 2019). Given the strong dependence of flaviviruses on the translocon, the channel for nascent peptide entry into the ER, we examined these factors (Figure 4C). We found that while SARS-CoV-2 does enrich ER-tethered (RRBP1) or associated (HDLBP/vigilin) RBPs, it is less strongly associated with the ER-targeting complex (SRP) or the Sec translocon itself. However, SARS-CoV-2’s vRNA associates with the COPI vesicle complexes in a more similar manner to the flaviviruses. COPI proteins are canonically responsible for retrograde transport of vesicles from the Golgi to ER (Szul and Sztul, 2011). The association with COPI complex members is consistent with the reported cycling of SARS-CoV in the ER-Golgi network for eventual budding into the lumen ER-Golgi intermediate compartment (ERGIC; McBride et al., 2007).

N6-methyladenosine

Post-transcriptional modification of RNA is a broadly used regulatory mechanism, and among many, methylation of the N-6 position on adenine (m6A) has received renewed interest (Yue et al., 2015; Zaccara et al., 2019). Recently, it has been reported that m6A is deposited on the ZIKV vRNA contributing to an anti-viral response via binding of YTH family proteins (which recognize m6A) and degradation of the ZIKV vRNA (Lichinchi et al., 2016). We therefore examined the association of the writers (METTL family), readers (YTH family), and erasers (ALKBH family) of m6A with vRNA. We saw a robust association of the YTHDF family with ZIKV and DENV vRNAs (Figure 4D). RV also captured these proteins, while SARS-CoV-2 lacked robust enrichment of these factors. Conversely, we found relatively stronger enrichment of the m6A-demethylases associated with the SARS-CoV-2 vRNA, while ZIKV, DENV, and RV all poorly bound these proteins (Figure 4D).

Intracellular vesicles and trafficking

Lastly, we explored intracellular vesicle and trafficking complexes, given evidence of intracellular double-membrane vesicles produced during the SARS-CoV-2 life cycle (Wolff et al., 2020). We found many host factors involved in cytokinesis, actin filaments, cytoskeleton, and microtubules were most strongly associated with the SARS-CoV-2 vRNA (Figure 4E). Recent reports highlighted the physical association of Rab GTPase family members with viral proteins and their functional importance in the temperature-dependent life cycle of coronaviruses (Gordon et al., 2020; Hoffmann et al., 2021). The ChIRP-MS data support these observations; four Rab proteins, RAB1B, RAB2A, RAB7A, and RAB10, were present in the SARS-CoV-2 high-confidence interactome (Figure 2D), with multiple others strongly associated with the SARS-CoV-2 vRNA (Figure 4E).

Genome-wide and targeted CRISPR screens of the SARS-CoV-2 interactome reveal functions of RNA-protein interactions

To understand the functional role of SARS-CoV-2 RBPs in host infection, we used CRISPR-knockout (KO) perturbation screens. First, we intersected the ChIRP-MS interactome with genome-wide CRISPR perturbation data from our previous study (Wei et al., 2021). Second, we designed a custom pool of sgRNAs targeting the SARS-CoV-2 expanded interactome compatible with both human and monkey cells by intersecting the genome-wide sgRNA designs for Homo sapiens and Chlorocebus sabaeus. We included control sgRNAs targeting known proviral factors, ACE2 and CTSL, and control sgRNAs targeting known antiviral factors, HIRA and CABIN1, as well as 100 non-targeting negative controls and 100 single-targeting negative controls. In total, our final custom sgRNA pool consisted of 8,264 sgRNAs targeting 1,331 of the 1,470 (90.5%) SARS-CoV-2 expanded interactome proteins, which we used to perform a screen for factors that impact virus-induced cell death (Table S4). The genome-wide screen and targeted interactome screens were both performed in Vero E6 cells using our previously developed screening protocol to identify putative pro- and antiviral host factors (Figure 5A). In this assay, KO of proviral factors causes resistance to virus-induced cell death and enrichment of their associated targeting sgRNAs, while KO of antiviral factors causes sensitization to virus-induced cell death and depletion of their associated targeting sgRNAs.

Figure 5.

Figure 5

Integration of ChIRP-MS and genome-wide and targeted interactome CRISPR screens identify pro- and antiviral host factors

(A) CRISPR screen schematic for genome-wide and targeted interactome screens.

(B) Expanded SARS-CoV-2 interactome overlaid on genome-wide CRISPR screen data.

(C) Comparison of sgRNA residuals for significant hits (FDR ≤ 0.05) of all sgRNAs (left, black, n = 3,189), sgRNAs targeting genes present in the high-confidence SARS-CoV-2 RNA interactome (purple, middle, n = 132), or sgRNAs targeting genes present in the expanded SARS-CoV-2 RNA interactome (right, blue, n = 400). p values computed from Mann-Whitney test.

(D) Focused interactome screening results for the high-confidence interactome (left) and the rest of the expanded interactome (right).

(E) Expanded interactome mini-pool results for hits identified in the genome-wide screen, showing proviral hits (red), antiviral hits (blue), or positive controls (green).

(F) Cytoscape network colored by enrichment or depletion in CRISPR screen.

(G) sgRNA Z scores for top mini-pool CRISPR hits. Individual CRISPR guides are represented by black lines. The average of these is shown in red.

(H) Inter-virus ChIRP-MS comparison of human ChIRP-MS/CRISPR hits identified in (E).

We first examined the genome-wide screening data and calculated CRISPR Z scores for the core (309) and expanded (1,430) host-protein interactomes identified by ChIRP-MS. We identified 131 factors (33 core factors and 98 expanded factors) that had a functional impact on host cell survival after SARS-CoV-2 infection (FDR ≤ 0.05; Figure 5B). Strikingly, we observed a significant bias for overlapping factors to have antiviral function (29/33 core factors and 87/98 expanded factors), compared to the distribution of all hits in the genome-wide screen. These results suggest that a large fraction of intracellular vRNA-host protein interactions may represent the host cell’s attempt to prevent or combat viral pathogenicity, rather than proviral host pathways co-opted by the virus (Figure 5C).

We previously demonstrated that a CRISPR mini-pool approach can provide further validation of genome-wide hits and increased sensitivity for the discovery of functional SARS-CoV-2 factors, due to a smaller sgRNA pool with more sgRNAs per gene (Wei et al., 2021). Therefore, we performed a SARS-CoV-2 survival screen using the expanded interactome CRISPR mini-pool (Figure 5A). We observed a high correlation of gene Z scores between biological replicates (Figure S5A, left), which were then merged. Using a conservative significance threshold (FDR ≤ 0.001), we identified 179 proviral factors (13.4% of the mini-pool) and 343 antiviral factors (25.8% of the mini-pool), and 108 of these functional factors were present in the core interactome (Figure 5D). We compared the mini-pool and genome-wide screens and identified 8 proviral factors and 53 antiviral factors that were hits in both screens (Figure 5E). We were particularly interested in the validated proviral hits since they may have direct relevance as therapeutic targets. We recently validated PFI-3, an inhibitor of SMARCA4 (the top proviral CRISPR hit in our interactome screen), as an inhibitor of viral replication in vitro (Wei et al., 2021), and drug targets nominated by PPI studies have also yielded promising candidates for SARS-CoV-2 (Gordon et al., 2020; White et al., 2021). To expand this analysis to additional compounds, particularly clinically approved compounds that may be amenable to drug repurposing, we compared the expanded ChIRP-MS interactome with known drug compound-target protein interactions and identified a list of 113 interactome proteins targeted by 275 compounds (Table S7). Focusing on CRISPR-validated proviral factors identified clofarabine as among the top drug candidates, and this compound has indeed been shown to have activity against SARS-CoV-2 in candidate drug screens (Table S7; Janes et al., 2018).

Figure S5.

Figure S5

Correlation analysis of expanded interactome CRISPR mini-pool screens, related to Figure 5

(A) Replicate correlations for SARS-CoV-2 (left) and MERS (right) expanded interactome CRISPR mini-pool screens.

(B) Principal component analysis of gene-level z-scores for all expanded interactome CRISPR mini-pool screen conditions and replicates.

(C) Pairwise correlations of selected pairs of conditions.

Analysis of antiviral hits revealed known factors that regulate the innate immune response, including NONO, TARDBP, DDX5, DDX6, and HNRNPA2B1. NONO is a member of the Drosophila behavior/human splicing (DBHS) protein family, which contains conserved N-terminal RNA recognition motifs, and has been demonstrated to directly bind vRNA and alter viral pathogenicity by impacting vRNA processing or by impacting innate immune gene expression (Knott et al., 2016; Lahaye et al., 2018; Lenarcic et al., 2013). For example, in human immunodeficiency virus (HIV), NONO acts as an activator of the DNA sensor, cGAS, to trigger innate immunity and the interferon response (Lahaye et al., 2018). In SARS-CoV-2 infection, NONO may function in a similar manner, albeit with RNA-sensing proteins or pathways. TARDBP has also been shown to display antiviral activity in the context of HIV infection by directly binding to a particular regulatory motif within the HIV-1 RNA genome and thereby repressing viral gene expression (Ou et al., 1995). Interestingly, subsequent work has demonstrated that TARDBP preferentially binds UGUGUG RNA motifs, and a search of the SARS-CoV-2 genomic RNA found 11 UGUGUG motifs in the sense strand. Extending this concept, we found that the majority of validated hits were physically associated with multiple RNA viruses (Figure 5F), while a small subset showed SARS-CoV-2 specificity (Figure 5F). Finally, we analyzed the CRISPR hits in the context of the Cytoscape network and observed that many of the functional hits were RBPs, helicases, and hnRNPs, which bind the vRNA early during infection, suggesting that the host’s initial response to viral infection is to mount a diverse vRNA recognition program to restrict the viral life cycle (Figure 5G). Examining all hits, we identified COASY, HSPA8, UGP2, PSMB3, and UBR4 as top antiviral factors (Figure 5H, left) and SMARCA4, AKAP8, DRG1, and TRMU as top proviral factors (Figure 5H, right). In summary, the genome-wide and mini-pool strategies provide independent functional validation of ChIRP-MS data and nominate pro- and antiviral factors in SARS-CoV-2 pathogenesis.

An expanded view of vRNA-associated factors across multiple RNA viruses

Since SARS-CoV-2 RBPs are broadly expressed in tissues and many are bound by other RNA viruses, we hypothesized that they may have functional roles in other viral infections. We performed the CRISPR mini-pool screen in six additional RNA viruses: (1) HKU5: a bat betacoronavirus using the SARS-CoV-1 spike protein for entry (a model of SARS-CoV-1), (2) rcVSV-SARS-CoV-2-S: a vesicular stomatitis virus (VSV) with an envelope engineered to use SARS-CoV-2 spike protein for entry, (3) Middle East respiratory syndrome coronavirus, MERS-CoV: another related betacoronavirus, (4) T1015N: a tissue culture-adapted form of MERS-CoV (Agnihothram et al., 2014; Scobey et al., 2013), (5) encephalomyocarditis virus (EMCV): a non-enveloped picornavirus with a positive-polarity ssRNA genome, and (6) influenza A virus (IAV): an enveloped orthomyxovirus with a negative-polarity ssRNA genome (Figure 6A; Wei et al., 2021). These viruses contain diverse viral envelopes, genome polarities, and varying degrees of sequence similarity to the SARS-CoV-2 genome and enabled the analysis of shared and SARS-specific pro- and antiviral RNA-binding host factors.

Figure 6.

Figure 6

SARS-CoV-2 ChIRP-MS interactome CRISPR screen in a panel of seven RNA viruses

(A) CRISPR screen schematic.

(B) Correlation of gene Z scores for each condition.

(C) Number of proviral and antiviral hits (FDR ≤ 0.001) overlapping with the SARS-CoV-2 hits (FDR ≤ 0.001) for all conditions.

(D) Volcano plot for each condition.

(E) CRISPR Z scores for top hits for each virus. Top: proviral hits. Bottom: antiviral hits. Positive controls indicated in green.

We first analyzed the technical quality of these screens and observed a high correlation of Z scores between biological replicates, which were then merged (Figures S5A and S5B). Next, we performed PCA analysis to understand the global similarity of gene Z scores across viruses, and we observed that the conditions clustered according to entry pathway (Figure 6B). Namely, SARS-CoV-2, HKU5, and rcVSV all require ACE2 for cell entry, while MERS and T1015N require DPP4 (Figures 6B. 6C, and S5B). In line with these observations, we compared functional conservation of SARS-CoV-2 pro- and antiviral genes in each additional RNA virus screen, which demonstrated that proviral hits were largely unique within virus families (and related to viral entry). In contrast, many SARS-CoV-2 antiviral factors were shared across viruses (25 antiviral factors shared across all viruses, 88 shared across SARS and MERS viruses; Figures 6C, 6D, and S5C; Table S6). We visualized the results of each screen in individual volcano plots, which highlighted top concordant and discordant hits between SARS-CoV-2 and other viruses (Figure 6D). Although each factor was present in the SARS-CoV-2 interactome, some showed specific function in other viruses, such as the RNA helicase DHX30 (HKU5), and PA2G4 (EMCV; Baggen et al., 2019; Bazzone et al., 2019; Figure 6D). To more closely examine the conserved or divergent functions of highly scoring factors across viruses, we analyzed the top pro- and antiviral factors for each virus (Figure 6E). Among proviral factors, we first confirmed the expected specificity of the entry receptors, ACE2 and DPP4, for SARS and MERS viruses, respectively. Next, unbiased clustering revealed several proviral factors that were highly specific to SARS-related viruses (e.g., DRG1, TNPO1, MARS2, and AKAP8), MERS-related viruses (e.g., NF2 and SLC30A1), IAV (e.g., GANAB and CCDC47), and EMCV (DNM2 and PA2G4). In contrast to proviral factors, we observed a much greater degree of overlap of antiviral factors across viruses. We could still observe factors with antiviral specificity for viral families, for example KPNA2, ZC3H4, NONO, UGP2, and COASY in SARS-related viruses, and SARNP, USP7, RPL13, and MATR3 in MERS-related viruses. However, we also observed a class of factors with antiviral activity conserved across virus families, and even in all viruses, including ARIH2, CCT2, and PSMB3. In summary, our focused mini-pool approach (1) validated selected functional hits identified from the genome-wide screen, (2) expanded the functional set of SARS-CoV-2 RNA-binding host proteins, particularly those with antiviral activity, and (3) established the virus-specific logic for each factor.

An RNA-centric view of SARS-CoV-2 reveals a specific perturbation of mitochondria during infection

Reexamining the list of vRNA-binding proteins, we noticed that MRM2 was the most strongly enriched host factor in Huh7.5 cells at 48 h.p.i. (Figure 2B). MRM2 is a mitochondrial-localized (nuclear-encoded) RNA 2′-O-methyltransferase (2′-O-MTase) and is of particular interest due to the previous characterization of FTSJ3/SPB1 (another 2′-O-MTase) as a factor that methylates the HIV RNA genome, which leads to proviral shielding of the HIV RNA from MDA5 recognition (Ringeard et al., 2019). We asked whether this binding was specific to SARS-CoV-2 and if there were other 2′-O-MTases enriched in the ChIRP-MS data. MRM2 was highly selective for binding the SARS-CoV-2 RNA, while the nucleolar FBL was more enriched on DENV, ZIKV, and RV, and MRM3 was selective for RV (Figure S6A). To understand if the mitochondrial association of SARS-CoV-2 was supported by other aspects of the ChIRP data, we revisited the ChIRP-RNA-seq data that we initially used for quality control. We found a robust and consistent enrichment for the RNA components of the mitochondrial ribosome (mito-ribosome and 12S and 16S RNAs) in both Vero E6 and Huh7.5 cells (Figures S1B and S1C). This is consistent with a recent report that SARS-CoV-2 genomic RNAs, particularly the 5′ untranslated region, contain sequence elements that strongly direct residency in mitochondria (Wu et al., 2020). The ChIRP RNA-seq also demonstrated recovery of a number of snoRNAs with the vRNAs in both Huh7.5 and Vero E6 cells (Figures S1B and S1C). SnoRNA-vRNA interactions and the importance of 2′-O-methylation has recently been independently validated by others (Yang et al., 2021). Together, our RNA-RNA and RNA-protein view of the SARS-CoV-2 vRNA highlight an association with the mitochondria and nominate specific RNA post-transcriptional modification enzymes within the organelle that may be important during infection.

Figure S6.

Figure S6

ChIRP-MS and electron microscopy analysis of mitochondria during SARS-CoV-2 infection, related to Figure 6

(A) ChIRP-MS enrichment of rRNA 2’-O-ribose methyltransferases across viruses.

(B) Identification of SARS-CoV-2 virions (highlighted with orange arrow heads) in the 12 h.p.i. EM images, taken from the same samples as in Figure 6 at different imaging depths.

(C) EM imaging of SARS-CoV-2 infected Huh7.5 cells at 24 and 48 h.p.i. Mitochondria are highlighted with red arrow heads.

(D) EM analysis of mitochondria in human bronchial epithelial cells (HBECs). Mitochondria are highlighted with red arrow heads. (D) Quantification of (C), n = 190 mitochondria in five infected or mock ciliated cells. P ≤ 0.001 by two-tailed Student’s t test.

We next assessed if there were morphological changes in the mitochondria over the course of SARS-CoV-2 infection. We performed electron microscopy of Huh7.5 cells infected with SARS-CoV-2 at different time points and quantified mitochondrial size. After 12 h.p.i., we found an increase in the average area of the mitochondria in infected cells (Figures 7A and S7B). At 24 h.p.i., mitochondria continued to increase in size, eventually leading to gross damage at 48 h.p.i. (Figure S7C). To confirm this observation in human lung cells, we examined previously published electron microscopy data (Ravindra et al., 2020) of SARS-CoV-2-infected HBECs (48 h.p.i.) and again found a significant increase in average mitochondrial size in cells infected with SARS-CoV-2 (Figure S7D). Altogether, these results suggest altered mitochondrial homeostasis during SARS-CoV-2 infection.

Figure 7.

Figure 7

SARS-CoV-2-associated proteins and a targeted mitochondrial CRISPR screen identify functional interactions between SARS-CoV-2 and host mitochondria.

(A) Electron microscopy (EM) of Huh7.5 cells uninfected (left, mock) or infected by SARS-CoV-2 (right).

(B) Quantification of mitochondria size by EM in infected cells. n = 348 and n = 361 mitochondria from 15 (mock) and 12 (12 h.p.i.) Huh 7.5 cells were analyzed. p ≤ 0.0001 by two-tailed Student’s t test.

(C) Mini-pool CRISPR screen design.

(D) ChIRP-MS enrichments of mitochondrial proteins present in the expanded interactome of at least one virus. The larger segment of the circle corresponds to proteins encoded by the mitochondrial genome or proteins encoded by the nuclear genome which are localized or associated with the mitochondria. Components of the mitochondrial ribosome are shown on the smaller segment. Proteins that are significant hits in the CRISPR screen data in Figure 5 are indicated with red labels (proviral hits) or blue labels (antiviral hits).

(E) Correlation of gene Z scores for each condition.

(F) Number of proviral and antiviral hits (FDR ≤ 0.001) overlapping with the SARS-CoV-2 hits (FDR ≤ 0.001) for all conditions.

(G) Center: volcano plot for SARS-CoV-2 condition. Significant hits (FDR ≤ 0.001) indicated in black. Top: CRISPR Z scores for top proviral hits. Bottom: CRISPR Z scores for top antiviral hits.

Figure S7.

Figure S7

Correlation analysis of mitochondria CRISPR mini-pool screens, related to Figure 7

(A) Replicate correlations for SARS-CoV-2 mitochondria mini-pool CRISPR screens.

(B) Principal component analysis of gene-level z-scores for all mitochondria mini-pool screen conditions.

(C) Mitochondria mini-pool CRISPR screen results for hits identified in the genome-wide screen. Red dots indicate proviral hits in both screens, blue dots indicate antiviral hits in both screens, and green dots indicate positive controls.

(D) Pairwise correlations of selected pairs of conditions.

Next, we assessed the interaction of SARS-CoV-2 vRNA with mitochondria-localized proteins by expanding our analysis of the ChIRP-MS data. We curated a set of 810 proteins that are annotated in Uniprot as physically localized to the mitochondria (UniProtKB subcellular location of SL-0173) and found that 170 of these proteins were present in the expanded interactome of at least one virus (Figures 6C and 6D). DENV and ZIKV had relatively poor recovery of these proteins, while RV and SARS-CoV-2 robustly bound to many mitochondrial factors, although specific protein associations were generally non-overlapping (Figure 6D). Given the specificity of vRNA binding to mitochondrially localized proteins, we systematically evaluated the functional impact of each of the 810 factors in the context of the seven RNA virus infections (described above) by designing a second custom CRISPR mini-pool of 5,558 CRISPR sgRNAs targeting these 810 genes, as well as positive and negative controls (Figure 6C; Table S5). We performed survival screens with the mitochondrial mini-pool with each virus. To confirm the technical quality of each screen, we compared data from biological replicates (Figure S7A) and performed PCA to visualize every replicate and condition (Figure S7B). We then merged the replicates and compared the gene-level Z scores (Figure 7E). Next, we analyzed the distribution of pro- and antiviral function among mitochondrial factors, and their conservation across viruses. In the SARS-CoV-2 screen, we identified 57 proviral factors and 175 antiviral factors (FDR ≤ 0.001; Figure 7G, middle), which validated 1 proviral factor (TRAF3) and 24 antiviral factors identified in the genome-wide screen (Figure S7C). Expanding our analysis to other viruses, we first computed the number of hits overlapping with SARS-CoV-2 in each condition and observed that each virus had a substantial overlap of antiviral hits with SARS-CoV-2 (Figure 7F). However, unlike the interactome screening, the proviral hits were less well conserved, consistent with the concept that proviral hits are driven by the viral entry pathway and mitochondria are minimally (if at all) involved in viral entry (Figure 7F). Comparing the top pro- and antiviral factors for each condition across all viruses, we found proviral factors that were highly specific to SARS-related viruses (e.g., TRAF3 and MRPS35), SARS- and MERS-related viruses (e.g., SLC44A1 and SPNS1), or IAV or EMCV (e.g., BOLA1), while many antiviral factors displayed multi-viral activity (22 antiviral hits shared across all viruses, 46 shared across SARS and MERS viruses; Figures 7G and S7D). Altogether these data provide insights into the specific mitochondrial factors that associate with SARS-CoV-2 RNA, likely contributing to a central role of mitochondria as intracellular hubs for antiviral activity.

Discussion

In summary, our results provide an RNA-centric view of the landscape of the host proteins interacting with SARS-CoV-2 RNA during infection. By integrating our analysis across time points, cell lines, and other viruses, we identify shared and SARS-CoV-2-specific patterns of RNA-host protein interactions. In the context of the rapidly evolving literature on subcellular mechanisms of SARS-CoV-2 pathogenicity, the ChIRP-MS data provide an orthogonal but complementary resource to existing PPI, RNA-protein interaction, and phenotypic CRISPR screening studies (Banerjee et al., 2020; Gordon et al., 2020; Hoffmann et al., 2021; Schmidt et al., 2021; Wang et al., 2020; Wei et al., 2021). In particular, we find that the vRNA:host protein interface is largely distinct from that of viral proteins and nominates roles for previously unappreciated biological processes and host proteins in SARS-CoV-2 infection.

Integration of the SARS-CoV-2 ChIRP-MS data with ChIRP-MS of three other positive-sense RNA viruses provided several new insights into the “molecular arms race” that takes place between the virus and host. First, this analysis identified shared and unique strategies employed by viruses to hijack the host for trafficking and replication. For example, SARS-CoV-2 and flavivirus RNAs both associate with the Rab GTPase proteins RAB10 and RAB2A, which are involved in subcellular trafficking, and CRISPR perturbation revealed that these proteins are required for viral replication and virus-induced cell death (Gordon et al., 2020; Hoffmann et al., 2021). In contrast, despite the fact that both viral families depend on glycoproteins to produce infectious virions, there was a limited association of SARS-CoV-2 RNA with the Sec/Translocon/OST complexes, compared to flaviviruses (Ooi et al., 2019). There are known differences between flavivirus and coronavirus replication strategies: flavivirus may physically leverage the translocon complex, which eventually forms invaginated vesicles or spherules (Fernandez-Garcia et al., 2009; Mukhopadhyay et al., 2005), whereas coronaviruses leverage the ERGIC and eventually form double-membrane vesicles (McBride et al., 2007). Therefore, the differences in the ChIRP-MS data likely reflect established differences in these viral life cycles, but our data provide specific host factors within each pathway that are closely associated with the vRNA genomes and thus may play physical roles in these processes.

Integration of the ChIRP-MS data with genome-wide CRISPR screen data, as well as targeted screening of a custom pool of sgRNAs against the SARS-CoV-2 interactome, provides extensive functional characterization of the RNA interactome proteins. In the context of SARS-CoV-2 infection, an unexpected finding from the intersection of ChIRP-MS and CRISPR screen datasets was that many vRNA-binding proteins were antiviral factors. Additional targeted screens in the context of six other viruses enabled us to decode their specificity and revealed SARS-specific and multi-viral host factors. A striking difference between pro- and antiviral factors was their conservation across viruses, which perhaps suggests distinct RNA sequence specificity and logic for each class of factors. Many of these factors were broadly expressed in human lung tissue, bound the vRNA early during infection, were commonly bound to multiple RNA virus families, and demonstrated antiviral function across RNA viruses, including related betacoronaviruses, and also more distant viral families. These results suggest that host cells deploy a diverse array of proteins to physically recognize and counteract viral infection and that these proteins are not limited to those with well-characterized viral recognition function, such as Toll-like receptors (TLRs) and retinoic acid inducible gene-I-like receptors (RLRs), but also extend to many other protein families with RNA-binding capacity.

Finally, we identified a functional connection between SARS-CoV-2 RNA and the mitochondria. Both RNA and protein components of the mitochondria were robustly captured with the SARS-CoV-2 RNA in Vero E6 and Huh7.5 cells, suggesting a close physical interaction, and electron microscopy demonstrated changes in mitochondrial shape and size after infection. Interestingly, other viruses, including HIV, have been reported to physically enter the mitochondria, providing evidence that vRNA can gain access to the mitochondria during infection (Somasundaran et al., 1994). Mitochondria are central to the underlying health of a cell, play an active role in sensing and signaling during cellular stress, and act as a hub for innate immune signaling. Based on the ChIRP-MS results, we propose that RNA viruses may follow a distinct logic when causing mitochondrial stress; that is, many viruses may interact with and perturb this organelle, but the precise manner in which stress is caused, and thus signaling occurs, is virus specific. Indeed, our custom mitochondria-focused CRISPR mini-pool screens revealed many pro- and antiviral factors associated with the mitochondria. These results further support the concept that mitochondria may serve as an organelle platform in the antiviral innate immune response to RNA viruses, perhaps exemplified best by the RLR family of RNA helicases (which signal on the outer mitochondrial membrane; Loo and Gale, 2011), and possibly also extending to a broader set of proteins identified here. Altogether, this study provides an unbiased and comprehensive catalog of functional SARS-CoV-2 RNA-host protein interactions, revealed a functional link between SARS-CoV-2 and the mitochondria, and may inform future studies to understand the mechanisms of viral pathogenesis and nominate strategies to combat the virus for therapeutic benefit.

Limitations of the study

There are several limitations to our study. First, ChIRP-MS experiments were performed in cell lines that were not derived from the lung. Therefore, investigation of these vRNA-binding factors in additional models is warranted. Second, our functional studies utilized survival CRISPR screens in Vero E6 cells. Future screens focused on other aspects of the viral life cycle, as well as screening primary human cells and other cell types, particularly type I interferon-sufficient cells, may identify additional functional aspects of these factors. In this regard, the pro- and antiviral terminology is used here for clarity and does not signify specific functional archetypes; these factors may function at any stage of the viral life cycle, including but not limited to, vRNA processing or replication pathways, viral trafficking within the cell, innate immune pathways, and stress responses to maintain cellular metabolism or fitness during infection.

STAR★Methods

Key resources table

REAGENT or RESOURCE SOURCE IDENTIFIER
Bacterial and Virus Strains

SARS-CoV-2 isolate USA-WA1/2020 BEI Resources Cat#NR-48814

Chemicals, Peptides, and Recombinant Proteins

1M Tris-HCl, pH 7 Thermo Fisher Scientific Cat#AM9850G
UltraPure 0.5M EDTA Thermo Fisher Scientific Cat#15575020
UltraPure 10% SDS Thermo Fisher Scientific Cat#15553027
UltraPure Formamide Thermo Fisher Scientific Cat#15515026
UltraPure 5M NaCl Thermo Fisher Scientific Cat#24740011
20x SSC Thermo Fisher Scientific Cat#15557044
50mM D-Biotin Thermo Fisher Scientific Cat#B20656
20% N-Lauroylsarcosine sodium salt solution Sigma Cat#L7414
Sodium deoxycholate Sigma Cat#30970
1M HEPES Thermo Fisher Scientific Cat#15630106
Trichloroacetic acid Sigma Cat#T6399
Acetone Sigma Cat#179124
4x NuPAGE LDS Sample Buffer Thermo Fisher Scientific Cat#NP0007
UltraPure™ Dithiothreitol Thermo Fisher Scientific Cat#15508013
Pierce Acetonitrile (ACN), LC-MS Grade Thermo Fisher Scientific Cat#51101
Ammonium bicarbonate Sigma Cat#A6141
Formic Acid, 99.0%, Optima LC/MS Grade Fisher Scientific Cat#A117
Iodoacetamide Sigma Cat#I1149
Proteinase K Thermo Fisher Scientific Cat#AM2546
Sequencing Grade Modified Trypsin Promega Cat# V5111
DNase I (RNase-free) New England Biolabs Cat#M0303S

Critical Commercial Assays

TAKARA Bio SMART-Seq Stranded Kit Takara Bio Cat#634442
Colloidal Blue Staining Kit Thermo Fisher Scientific Cat#LC6025

Deposited Data

CRISPR-KO Screen in VeroE6 after SARS-CoV-2 Infection Wei et al., 2021 N/A
ChIRP-RNA-seq in VeroE6 and Huh7.5 cell after SARS-CoV-2 Infection This Study GSE167341
CRISPR-KO mini-pool screen sequencing data from VeroE6 cells; ChIRP-MS and mitochondrial pools This Study GSE167341

Experimental Models: Cell Lines

Huh7.5 ATCC CVCL-7927
VeroE6 ATCC CRL-1586

Oligonucleotides

See Table S1 for ChIRP-MS Oligos N/A N/A

Software and Algorithms

R https://www.r-project.org/ R 3.6
Cytoscape https://cytoscape.org/ Cytoscape 3.8.1
Differential Enrichment analysis of Proteomics Data (DEP) https://rdrr.io/bioc/DEP/man/DEP.html DEP 1.10.0
DESeq2 https://bioconductor.org/packages/release/bioc/html/DESeq2.html DESeq2 1.28.1
DAVID Bioinformatics Resources https://david.ncifcrf.gov/ DAVID 6.8

Resource availability

Lead contact

Further information and requests for resources and reagents should be directed to and will be fulfilled by the Lead Contact, Ryan Flynn (ryan.flynn@childrens.harvard.edu).

Materials availability

All reagents generated in this study are available from the Lead Contact upon request.

Data and code availability

Source code is available on GitHub (https://github.com/juliabelk/sarscov2_chirp_ms) along with original spreadsheets for the MS and CRISPR analyses. Sequencing data has been deposited on NCBI GEO as series GSE167341 which includes the ChIRP RNA-seq and CRISPR screening experiments.

Experimental model and subject details

Cell lines

Huh7.5 (male), Vero-E6 (female), and Vero-E6-Cas9v2 cell lines were grown in Dulbecco’s Modified Eagle Medium (DMEM) supplemented with 10% heat-inactivated fetal bovine serum (FBS), and 1% Penicillin/Streptomycin. All cell lines tested negative for mycoplasma contamination prior to use in experiments and were authenticated by morphological evaluation by microscopy. None of the cell lines used in this study are listed in the commonly misidentified cell lines database (ICLAC). All procedures with infectious virus were done at a Biosafety Level 3 (BSL3) laboratory and approved by the Yale University Biosafety Committee.

Human samples

Human scRNA-seq data was previously published. No other human samples were used.

Animal models

No animal experiments were performed in this study.

Method details

Cell lines, SARS-CoV-2 infection, and cell processing

Vero-E6 (female) and Huh7.5 (male) cells were seeded at 1x106 cells per T150 flask and were grown in Dulbecco’s Modified Eagle Medium (DMEM) supplemented with 10% heat-inactivated fetal bovine serum (FBS), and 1% Penicillin/Streptomycin. Three T150 flasks were assigned per condition: 0, 1, and 2 days post-infection (dpi). The next day, the media was removed, and cells were inoculated with SARS-CoV-2 isolate USA-WA1/2020 (BEI Resources #NR-48814) at MOI of 0.01. Flasks were incubated at 37°C for 1 h with gentle rocking every 15 min. At 0, 1, and 2 dpi, supernatant from the flasks were discarded, and cells were washed with 1X PBS twice. 4 mL of 4% of paraformaldehyde was added on each of the flasks and incubated for 30 min at room temperature. Afterward, cells were quenched with 250 μL of 2 M glycine (final concentration of 125 mM) for each flask. Cells were scraped, harvested in pre-weighed microcentrifuge tubes, and span at 1000 x g for 5 min at 4°C. All supernatants aspirated, and the final pellet were weighed. Cells were frozen at −80°C until used.

Comprehensive identification of RNA binding proteins by mass spectrometry (ChIRP-MS)

SARS-CoV-2 targeting probes were designed online (https://www.biosearchtech.com/stellaris), with repeat masking setting of 3 and even coverage of the whole transcript. Full probe sequences available in Table S1. Oligos were synthesized with a 3′ biotin-TEG modification at Stanford Protein and Nucleic Acid Facility (panoligo@stanford.edu).

ChIRP-MS was performed largely as described in (Chu et al., 2015). Cells were cultured, infected, and crosslinked as described above in the BSL3 facility. Lysate was generated by resuspending cell pellets in 1 mL lysis buffer (50 mM Tris-HCl pH 7.0, 10 mM EDTA, 1% SDS) per 100 mg of cell pellet weight (∼100μL pellet volume). Lysates were sonicated using a focused-ultrasonicator (Covaris, E220) until the average RNA length was ∼500 nucleotides as determined by agarose gel analysis and stored at −80°C. Stored lysates were thawed on ice and prepared for pre-clearing. Precleared was achieved by adding 30 μL washed MyOne C1 beads per mL of lysate at 37°C for 30 min on rotation. Preclearing beads were removed twice from lysate using a magnetic stand; for this and all subsequent magnetic stand steps allow for > 1 min of separation before removing any supernatant. Next, for every 1 mL of sonicated lysate 2 mL of ChIRP hybridization buffer (750 mM NaCl, 1% SDS, 50 mM Tris-HCl pH 7.0, 1 mM EDTA, 15% formamide; made fresh) and 2.5 μL of 100 μM ChIRP Probe Pools were added per mL of lysate. ChIRP Probe Pools (Table S1) were composed of an equimolar mix of 108 antisense oligos. For each biological triplicate, a total of 7 mL of sonicated cell lysate was used. Hybridization took place on rotation for 16 h at 37°C. Subsequently, 250 μL of washed MyOne C1 beads per mL of lysate were added to each sample and incubated on rotation for 45 min at 37°C. Enriched material was collected on the beads with a magnetic stand, and beads were washed 5x 2 min in 1 mL of ChIRP Wash Buffer (2x NaCl-Sodium Citrate (SSC, ThermoFisher Scientific), 0.5% SDS) at 37°C. After washing, 1% of each sample was saved for RNA extraction and RNA-seq library preparation (below). To elute enriched proteins, beads were collected on magnetic stand, resuspended in ChIRP biotin elution buffer (12.5 mM biotin, 7.5 mM HEPES, pH 7.9, 75 mM NaCl, 1.5 mM EDTA, 0.15% SDS, 0.075% sarkosyl, and 0.02% Na-Deoxycholate), mixed at 25°C for 20 min on rotation and at 65°C for 15 min shaking. Eluent was transferred to a fresh tube, and beads were eluted again. The two eluents were pooled (∼1200 μL), and residual beads were removed again using the magnetic stand. 25% total volume (300 μL) trichloroacetic acid was added to the clean eluent, vortexed, and then samples were placed at 4°C overnight for precipitation. The next day, proteins were pelleted at 21,000 rcf at 4°C for 60 min. Supernatant was carefully removed, and protein pellets were washed once with ice-cold acetone. Samples were spun at 21,000 rcf at 4°C for 5 min. Acetone supernatant was removed, tubes briefly centrifuged again and, after removal of residual acetone, were left to air-dry on the bench-top. Proteins were then solubilized in 1x LDS Buffer in NT2 with 20 mM DTT and boiled at 95°C for 30 min with occasional mixing for reverse-crosslinking.

Protein samples were size-separated on bis-tris SDS-PAGE gels (Bio-Rad), and the gel was fixed and stained with the Colloidal Blue Staining Kit (ThermoFisher Scientific) as per the manufacturer’s instructions. Each ChIRP-MS experiment (1 lane in the gel) was cut into 2 slices from the SDS-PAGE and prepared independently. Gel slices were prepared for mass spectrometry by rinsing sequentially in 200 μL HPLC-grade water, 100% Acetonitrile (ACN, ThermoFisher Scientific), 50 mM Ammonium Bicarbonate (AmBic). Samples were reduced by adding 200 μL of 5 mM DTT in 50 mM AmBic and incubating at 65°C for 35 min. The reduction buffer was discarded, and samples were cooled to room temperature. Alkylation was achieved by adding 200 μL of 25 mM iodoacetamide in 50 mM AmBic for 20 min at 25°C in the dark. The alkylation buffer was discarded, samples were rinsed once in 200 μL 50 mM AmBic, and then they were washed twice for 10 min each in 200 μL of freshly prepared 50% ACN in 50 mM AmBic. After each wash, the supernatant was discarded, and after all washes, samples were dried for 3 h using a SpeedVac. Once dry, proteins were digested by adding 100 ng of trypsin in 200 μL of 50 mM AmBic for 16 h at 37°C. Samples were subsequently acidified by adding formic acid to a final concentration of 1% and incubating at 37°C for 45 min. Finally, samples were desalted using HyperSep Filter Plates with a 5-7 μL bed volume (ThermoFisher Scientific) following the manufacturer’s instructions. Samples were eluted twice in 100 μL 80% ACN in 0.2% formic acid, dried on a SpeedVac, and resuspended in 10 μL 0.1% formic acid for mass spectrometry analysis.

All samples were resuspended in 10 μL 0.2% formic acid in water and 4 μL were injected on column for each sample. Peptides were separated over a 50 cm EasySpray reversed phase LC column (75 μm inner diameter packed with 2 μm, 100 Å, PepMap C18 particles, Thermo Fisher Scientific). The mobile phases (A: water with 0.2% formic acid and B: acetonitrile with 0.2% formic acid) were driven and controlled by a Dionex Ultimate 3000 RPLCnano system (Thermo Fisher Scientific). An integrated loading pump was used to load peptides onto a trap column (Acclaim PepMap 100 C18, 5 um particles, 20 mm length, ThermoFisher) at 5 μL/min, which was put in line with the analytical column 6 min into the gradient for the total protein samples. Gradient elution was performed at 300 nL/min. The gradient increased from 0% to 5% B over the first 6 min of the analysis, followed by an increase from 5% to 25% B from 6 to 86 min, an increase from 25% to 90% B from 86 to 94 min, isocratic flow at 90% B from 94 to 102 min, and a re-equilibration at 0% for 18 min for a total analysis time of 120 min. Precursors were ionized using an EASY-Spray ionization source (Thermo Fisher Scientific) source held at +2.2 kV compared to ground, and the column was held at 45°C. The inlet capillary temperature was held at 275°C, and the RF lens was held at 60%. Survey scans of peptide precursors were collected in the Orbitrap from 350-1350 Th with an AGC target of 1,000,000, a maximum injection time of 50 ms, and a resolution of 120,000 at 200 m/z. Monoisotopic precursor selection was enabled for peptide isotopic distributions, precursors of z = 2-5 were selected for data-dependent MS/MS scans for 2 s of cycle time, and dynamic exclusion was set to 45 s with a ± 10 ppm window set around the precursor monoisotope. An isolation window of 1 Th was used to select precursor ions with the quadrupole. MS/MS scans were collected using HCD at 30 normalized collision energy (nce) with an AGC target of 50,000 and a maximum injection time of 54 ms. Mass analysis was performed in the Orbitrap with a resolution of 30,000 at 200 m/z and an automatically determined mass range.

FASTA sequences of the human proteome (Uniprot: UP000005640) were used and FASTA sequences of the viral proteins from SARS-CoV-2 (Uniprot: P0DTC1, P0DTD1, P0DTC2, P0DTC3, P0DTC4, P0DTC5, P0DTC6, P0DTC7, P0DTD8, P0DTC8, P0DTC9, P0DTD2, P0DTD3, A0A663DJA2), DENV (Uniprot: A0A173DS53), ZIKV (Uniprot: A0A140D2T1), RV (Uniprot: P03303) were appended to the end of the human proteome reference file. For the VeroE6 reference: GreenMonkey (Chlorocebus sabaeus, Uniprot: UP000029965). This concatenated file was used to search the ChIRP-MS data with MaxQuant with the following parameters: semi-specific cleavage specificity at the C-terminal site of R and K allowing for 2 missed cleavages. Mass tolerance was set at 12 ppm for MS1s, 0.4 for MS2s. Methionine oxidation, asparagine deamidation, and N-term acetylation were set as variable modifications. Cysteine carbamidomethylation was set as a fixed modification.

The above procedure yielded two spreadsheets: one containing the data obtained from human cells (SARS-CoV-2-D1, SARS-CoV-2-D2, ZIKV-D2, DENV-D2, RV-D2) and one containing the data obtained from monkey cells (SARS-VeroE6-D1, SARS-VeroE6-D2). Label-free quantitation (LFQ) values from each MaxQuant output spreadsheet were imported into R for downstream analysis. First, log2-normalized ChIRP-MS enrichment values were obtained for each condition by subtracting the appropriate log-normalized Mock condition. Before computing enrichments, high correlations of the Mock conditions within each group (e.g., Mock SARS 1,2,3, Mock Flavivirus 1,2,3, etc) were confirmed and then the Mock replicates were averaged to ensure any observed variability would be attributable to variation in the infected conditions rather than variability in the mock samples. At this step, we also computed average enrichments across replicates to create a succinct representation of the data for each virus. Next, we matched protein IDs to gene names by querying the uniprot server using `https://www.uniprot.org/uniprot/?query=`. When multiple gene names matched a given protein, we used the first one as the `name` of the hit which was used for most downstream gene lookups. However, we also retained alternate names in the `gene.x` and `gene.y` columns of Table S3. Throughout the manuscript we have visualized enrichments for specific genes using heatmaps–rectangular heatmaps (e.g., in Figure 4) were visualized using R package `pheatmap` while circular heatmaps (e.g., in Figure 7) were visualized using R package `RCircos`. Examples of code for this analysis as well as analysis below can be found: https://github.com/juliabelk/sarscov2_chirp_ms

We defined the “high-confidence” interactome of each SARS-CoV-2 infection condition (D1 / D2 and VeroE6 / Huh7.5 using R package `Differential Enrichment analysis of Proteomics data` (DEP). DEP has its own procedure for data preprocessing, so for this analysis, filtering, normalization, and imputation were performed directly on MaxQuant outputs using the DEP default workflow (i.e., instead of the enrichment computation procedure described above). Enriched protein sets were defined using cutoffs log2 fold change > 0 and adjusted p value ≤ 0.05, comparing infected cells after SARS RNA pulldown to identically treated uninfected (mock) cells. After defining these high-confidence protein sets, the processed enrichments described in the preceding paragraph were used for all downstream analyses.

Principal component analysis was performed to visualize the differences between replicates and viruses (Figure 3D). To compute principal components, we used the standard R package `stats` and function `prcomp(t(x),scale = T)` where x represents the matrix of proteins by ChIRP-MS enrichments in each condition. We additionally defined an “expanded interactome” for each condition as the set of all proteins with mean enrichment > = 1, to aid comparisons across viruses. For GO term analysis, expanded interactomes of each virus were annotated with the DAVID Bioinformatics Resource (Huang et al., 2009a, 2009b). Annotations for Cellular Components, Binding Proteins, and Protein Domains were used to compute enrichments for each expanded interactome. Finally, to perform integrative analysis with the genome-wide CRISPR screen and CRISPR mini-pools we merged the above, initially separate human and monkey tables based on gene name to create one large table encompassing all attributes of the dataset. This table is provided as Table S3.

ChIRP-RNA-seq and analysis

Input lysate samples and enriched RNA samples (1% of the ChIRP sample) were first digested of their cellular proteins which also acts to effectively reverse the formaldehyde crosslinking. RNA samples were brought to 50 μL with 1x PBS and 5 μL Proteinase K (Thermo Fisher Scientific) and incubated at 55°C for 30 min. RNA was cleaned using the Zymo Clean and Concentrate 5 column (Zymo Research) and eluted in 2x 20 μL (final 40 μL). DNA was removed by adding 2 μL DNaseI and 5 μL 10x DNase buffer (NEB) to the purified RNA and incubated at 37°C for 30 min. The RNA was cleaned up as above with the Zymo Clean and Concentrate 5 column but eluted 2x 10 μL (final 20 μL). To construct RNA seq libraries, TAKARA Bio SMART-Seq Stranded Kit User Manual (TAKARA Bio) was used with the following modifications. Up to 5 ng RNA was reverse-transcribed and amplified by PCR following the SMART-seq protocol. To increase cDNA yield and detection efficiency, we started from first-strand cDNA synthesis without fragmentation. The number of PCR1 cycles was 5. We purified the cDNA product with 50 μL AMPure beads (1:1 ratio) and eluted into 20 μL water. Then the 20 μL purified cDNA was used as input for the final RNA-Seq library amplification. To reduce the amount of primer dimer artifacts, we purified the RNA-Seq library with 90 μL AMPure beads (x0.9 selection) and eluted into 20 μL water. Sequencing was performed using the Nextseq 500/550 Sequencing system (Illumina) with 2 × 75 bp paired-end reads and 2 × 8 bp index reads.

Adapters were automatically detected and trimmed using fastp (Chen et al., 2018). Host genomes (for Homo sapiens and chlorocebus sabaeus) were obtained from Ensembl along with annotation (gtf) files for use with feature counts. The SARS-CoV-2 genome was obtained from NCBI. Hisat2 was used to index all genomes and align reads (Kim et al., 2019). Fastq files were initially aligned to a file of known “repeat” sequences–specific sequences which are present in multiple locations in the genome and which can cause a high percentage of multi-mapped reads. Remaining reads were then aligned to the SARS-CoV-2 genome. SARS-CoV-2 genome coverage was visualized in the Integrative Genomics Viewer to assess pulldown efficiency. Remaining reads were then aligned to the host genome and reads overlapping genomic features (genes) were quantified using the featureCounts command line utility (Liao et al., 2014). Aggregated counts matrices were loaded into DESeq2 for normalization and differential gene expression analysis (Love et al., 2014). Mirroring our ChIRP-MS protein analysis, differential gene expression analysis was performed by comparing SARS-CoV-2 infected samples to mock samples.

Electron Microscopy

Huh 7.5 cells infected with SARS-CoV-2 were analyzed at different time points (12, 24 and 48 h.p.i.), and HBECs samples were from Wei et al., 2021. The samples were prepared in the following way: HBECs were fixed using 2.5% glutaraldehyde in 100 mM phosphate buffer, osmicated in 1% osmium tetroxide, and dehydrated in ethanol. During dehydration, 1% uranyl acetate was added to the 70% ethanol to enhance ultrastructural membrane contrast. After dehydration the cells were embedded in Durcupan and ultrathin sections were cut on a Leica Ultra-Microtome, collected on Formvar-coated single-slot grids, and analyzed with a Tecnai 12 Biotwin electron microscope (FEI). ImageJ software was used to measure mitochondrial area.

sgRNA design and cloning

Two sgRNA pools were designed: one targeting the SARS-CoV-2 expanded interactome, and one targeting the set of mitochondrially annotated proteins. Six sgRNAs per gene were used, and sgRNA sequences were selected from the previously described genome-wide African Green Monkey and Human sgRNA libraries (Wei et al., 2021). To create a pool compatible with both human and monkey cell lines, we first intersected the genome-wide guide designs for the two species to obtain cross-species compatible guides. The majority of guides were selected based on the highest ranked cross-species guides. Genes not present in both libraries were selected based on the highest ranked single-species guides. Positive control genes ACE2, CTSL, DPP4, CABIN1, and HIRA were included in both pools, as well as 100 non-targeting controls and 100 single-targeting controls.

Oligos were designed as (fw_primer)(Esp3I site)G(sgRNA)(Esp3I site)(rev_primer) for golden gate cloning into a lentiGuidePuro vector modified to express a 2A-EGFP fusion in frame with the puromycin resistance gene. Vector was pre-digested overnight with restriction enzyme Esp3I and purified on an agarose gel. Oligos were PCR amplified for 20 cycles and purified with a commercial PCR cleanup kit. One step digestion / ligation was performed by combining 1 μL T4 DNA ligase, 1 μL Esp3I, 2 μL T4 DNA ligase buffer, 200 ng digested vector, and 40 ng purified PCR product in a 20 uL reaction. Reaction was incubated at 37°C for 1 h and then heat inactivated at 65°C for 15 min. 1 μL of the reaction was electroporated into 25 μL electrocompetent cells, grown overnight in liquid culture, and purified by maxiprep. Guide representation was confirmed by sequencing.

CRISPR screens

Lentivirus for each pool was transduced into our previously described VeroE6-Cas9 cell line at an MOI of 0.1 (Wei et al., 2021). Three days later, puromycin was added to the media to select transduced cells. Puromycin selection was performed for ten days prior to commencing screens. Seven viruses were used for the CRISPR screen, including HKU5-SARS-CoV-1-S, SARS-CoV-2, rcVSV-SARS-CoV-2-S, MERS-CoV, MERS-CoV T1015N, IAV and EMCV. All the viruses were screened in duplicate. 3x106 transduced VeroE6 cells were plated in 5% FBS in T150 flasks. Mock infected cells were harvested 48 h after seeding and served as a reference for sgRNA enrichment analysis. At 4 d.p.i., 80% of the media was exchanged for fresh media. At 7 d.p.i., cell lysates were harvested in DNA/RNA shield buffer and gDNA of surviving cells was isolated for sequencing. Briefly, we used a standard three round amplification procedure. In the first round, all gDNA was split into 125 μL reactions of up to 5 μg gDNA each and amplified for 23 cycles. In the second round, adapters compatible with Illumina indexing were added, and 0-8 nucleotide offsets were appended to the beginning of the PCR product to increase library complexity for sequencing. Finally, sample indices were added. The resulting libraries were sequenced in dual indexed 1x75 format on an Illumina NextSeq.

CRISPR screen analysis

Reads were trimmed by fastp to remove flanking sequences with `fastp -f 10 -t 15`. Trimmed reads were aligned to the library designs using hisat2. Resulting bam files were converted to counts tables for each sample using package `Rsamtools`. Counts tables were processed using our previously described procedure (Wei et al., 2021). Briefly, counts for each guide were depth normalized to counts per million and then log transformed. Log fold changes for each condition were computed by subtracting the mock condition. Negative control guides were used to obtain z-scores for each guide’s log fold change, from which gene-level z-scores and p values were computed. FDR values were obtained using `p.adjust` in R. PCA analysis was performed as described above for the ChIRP-MS data except on the table of gene-level z-scores for each condition. For genome-wide data, a cutoff of FDR ≤ 0.05 was used to define “hits.” For mini-pool results, we used a conservative threshold of FDR ≤ 0.001 to define hits.

Drug target analysis

Drug-target interaction data was downloaded from DrugCentral (https://drugcentral.org/download). The full database was filtered to include only compounds targeting proteins which were present in the SARS-CoV-2 expanded interactome and compounds with data on human proteins. We added the SARS-CoV-2 ChIRP-MS enrichment (maximum average value across the VeroE6 and Huh7.5 datasets) and the SARS-CoV-2 expanded interactome screening data and have provided the data as Table S8.

Analysis of lung single-cell RNA-sequencing data

Human lung single-cell RNA-sequencing data from Travaglini et al., 2020 was downloaded from Synapse (accession #syn21041850) as a processed cell-by-gene counts table. The data processed using Scanpy version 1.6.0 and Scrublet version 0.2.1. First, low-quality cells were removed (cells with fewer than 250 detected genes, fewer than 500 total UMI counts, or greater than 0.25% mitochondrial reads). Then, the data was depth-normalized to a total of 10,000 reads per cell, and log-transformed. The top 2000 variable genes were identified, any batch effects due to the number of counts detected per cell were regressed out, and the data was scaled with a maximum value of 10. PCA, nearest neighbors, and UMAP calculations were performed using default settings. Leiden clustering was performed with resolution 0.2 to allow identification of non-immune clusters, i.e., CD45 (PTPRC) negative clusters. These non-immune cells were then re-processed, and a low-quality cluster containing a high number of doublets, as well as any other cells with a doublet score greater than 0.15, were filtered out, resulting in the final filtered set of high-quality non-immune cells.

These cells were then selected from the original, un-processed data; and re-processed with the same workflow as described above, with the exception of removing patient-patient batch effects by integrating the data with BBKNN with the patient information as the batch key. Clusters were labeled using the cluster IDs reported in Travaglini et al. as a guide. Cluster-identifying genes, as identified using the rank_genes_groups function with method = ’logreg’ in Scanpy, are shown in Figure S4C.

Quantification and statistical analysis

Proteomics statistics (e.g., Figures 2A and 2B) were determined by R package `Differential Enrichment analysis of Proteomics data` (DEP). RNA-seq statistics were determined by R package `DESeq2.` CRISPR screen statistics were determined with the R code available at https://github.com/juliabelk/sarscov2_chirp_ms. Statistics for mitochondria size quantification were determined using GraphPad Prism. Quantification details are available in the figure legends.

Acknowledgments

We thank Nicholas Riley for designing the mass spec methods and operating the instrument, Chris Richards and Chris Lapointe for helpful discussions, John Doench and Ruth Hanna for assistance analyzing CRISPR data, and Sigrid Knemeyer and Christine Shan at SciStories LLC for illustrations. This work was supported by the National Institutes of Health grants K08CA230188 (A.T.S.), U54CA260517 (A.T.S. and H.Y.C.), K08AI128043 (C.B.W.), R01AI140186S (J.E.C), R01AI141970 (J.E.C.), and RM1HG007735 (H.Y.C.); the Burroughs Wellcome Fund Career Award for Medical Scientists (R.A.F., A.T.S., and C.B.W.); the Bill and Melinda Gates Foundation (A.T.S.), the Stanford Innovative Medicine Accelerator and Stanford ChEM-H (A.T.S.); the Ludwig Family Foundation (C.B.W.); Patterson Trust (C.B.W.); Mathers Foundation (C.B.W.), and Fast Grants from Emergent Ventures (A.T.S. and C.B.W.). R.A.F. was supported by the Damon Runyon Cancer Research Foundation DRG-2286-17. J.A.B and K.R.P. were supported by Stanford Graduate Fellowships. J.A.B. was supported by the National Science Foundation Graduate Research Fellowship under grant no. DGE-1656518. A.T.S. was supported by a Parker Bridge Scholar Award from the Parker Institute for Cancer Immunotherapy and a Cancer Research Institute Technology Impact Award. J.E.C. was supported by the Burroughs Wellcome Investigators in the Pathogenesis of Infectious Disease Award. H.Y.C was supported by the Pershing Square Foundation. C.R.B. and H.Y.C. are Investigators of the Howard Hughes Medical Institute. The sequencing data were generated at the Stanford Functional Genomics Facility with instrumentation purchased with NIH grants S10OD018220 and 1S10OD021763.

Author contributions

R.A.F., M.R.M., C.B.W., and A.T.S. conceived the study. R.A.F., Y.Q., C.O.S., and C.B.W. performed SARS-CoV-2 and ChIRP experiments. R.A.F., J.A.B., A.L., and K.R.P. performed analysis of ChIRP data. J.A.B., J.W., M.M.A., Q.S., P.C.D., and C.B.W. performed and analyzed CRISPR screens. Y.Y., E.W., and T.L.H. performed mitochondrial imaging experiments. R.A.F., J.A.B., H.Y.C., T.H., J.E.C., C.R.B., C.B.W., and A.T.S. oversaw and guided experiments and analysis. R.A.F., J.A.B., C.B.W., and A.T.S. drafted the manuscript, and all authors reviewed and provided comments on the manuscript.

Declaration of interests

A.T.S. is a scientific co-founder of Immunai and receives research funding from Arsenal Biosciences and 10x Genomics. K.R.P., H.Y.C., and A.T.S. are co-founders of Cartography Biosciences. H.Y.C. is a co-founder of Accent Therapeutics, Boundless Bio, and an advisor for 10x Genomics, Arsenal Biosciences, and Spring Discovery. Yale University (C.B.W.) has a patent pending related to this work entitled: “Compounds and Compositions for Treating, Ameliorating, and/or Preventing SARS-CoV-2 Infection and/or Complications Thereof.” Yale University has committed to rapidly executable non-exclusive royalty-free licenses to intellectual property rights for the purpose of making and distributing products to prevent, diagnose, and treat COVID-19 infection during the pandemic and for a short period thereafter.

Published: March 11, 2021

Footnotes

Supplemental information can be found online at https://doi.org/10.1016/j.cell.2021.03.012.

Contributor Information

Ryan A. Flynn, Email: ryan.flynn@childrens.harvard.edu.

Craig B. Wilen, Email: craig.wilen@yale.edu.

Ansuman T. Satpathy, Email: satpathy@stanford.edu.

Supplemental information

Table S1. Sequences of 108 biotinylated ChIRP probes, related to Figure 1
mmc1.xls (37KB, xls)
Table S2. Mass spectrometry data for viral proteins and host proteins with mean enrichment ≥ 1 in SARS-CoV-2 ChIRP-MS datasets, related to Figure 2
mmc2.xlsx (527.5KB, xlsx)
Table S3. Complete matrix of data for all proteins in all datasets, related to Figure 3

Full data for all proteins in all datasets including multi-virus data, high-confidence interactomes, expanded interactomes, genome-wide CRISPR screen results, and presence in other datasets (such as UV-crosslinking data, PPI data).

mmc3.xls (3.5MB, xls)
Table S4. CRISPR mini-pool designs, related to Figures 5, 6, and 7

SARS-CoV-2 interactome CRISPR mini-pool design and mitochondrial CRISPR mini-pool design. Includes target genes, controls, and cross-species compatibility of each sgRNA.

mmc4.csv (819.3KB, csv)
Table S5. Expanded ChIRP-MS interactome CRISPR mini-pool screen, related to Figures 5 and 6

Results for SARS-CoV-2 and six other viruses. Z-scores and FDRs are included for each gene and condition.

mmc5.csv (290.1KB, csv)
Table S6. Mitochondria CRISPR mini-pool screen, related to Figure 7

Results for SARS-CoV-2 and six other viruses. Z-scores and FDRs are included for each gene and condition.

mmc6.csv (186.4KB, csv)
Table S7. ChIRP-MS overlap with DrugCentral database, related to Figure 5

Intersection of SARS-CoV-2 expanded interactome and a database of drug targets from DrugCentral (https://drugcentral.org/).

mmc7.csv (126.5KB, csv)

References

  1. Agnihothram S., Yount B.L., Jr., Donaldson E.F., Huynh J., Menachery V.D., Gralinski L.E., Graham R.L., Becker M.M., Tomar S., Scobey T.D. A Mouse Model for Betacoronavirus Subgroup 2c Using a Bat Coronavirus Strain HKU5 Variant. mBio. 2014;5 doi: 10.1128/mBio.00047-14. e00047–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Ahlquist P. Parallels among positive-strand RNA viruses, reverse-transcribing viruses and double-stranded RNA viruses. Nat. Rev. Microbiol. 2006;4:371–382. doi: 10.1038/nrmicro1389. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Baggen J., Thibaut H.J., Hurdiss D.L., Wahedi M., Marceau C.D., van Vliet A.L.W., Carette J.E., van Kuppeveld F.J.M. Identification of the Cell-Surface Protease ADAM9 as an Entry Factor for Encephalomyocarditis Virus. MBio. 2019;10:e01780-19. doi: 10.1128/mBio.01780-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Banerjee A.K., Blanco M.R., Bruce E.A., Honson D.D., Chen L.M., Chow A., Bhat P., Ollikainen N., Quinodoz S.A., Loney C. SARS-CoV-2 Disrupts Splicing, Translation, and Protein Trafficking to Suppress Host Defenses. Cell. 2020;183:1325–1339.e21. doi: 10.1016/j.cell.2020.10.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bazzone L.E., King M., MacKay C.R., Kyawe P.P., Meraner P., Lindstrom D., Rojas-Quintero J., Owen C.A., Wang J.P., Brass A.L. A Disintegrin and Metalloproteinase 9 Domain (ADAM9) Is a Major Susceptibility Factor in the Early Stages of Encephalomyocarditis Virus Infection. MBio. 2019;10:e02734-18. doi: 10.1128/mBio.02734-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Carrasco-Hernandez R., Jácome R., López Vidal Y., Ponce de León S. Are RNA Viruses Candidate Agents for the Next Global Pandemic? A Review. ILAR J. 2017;58:343–358. doi: 10.1093/ilar/ilx026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Chang C.K., Hou M.-H., Chang C.-F., Hsiao C.-D., Huang T.H. The SARS coronavirus nucleocapsid protein--forms and functions. Antiviral Res. 2014;103:39–50. doi: 10.1016/j.antiviral.2013.12.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Chen S., Zhou Y., Chen Y., Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34:i884–i890. doi: 10.1093/bioinformatics/bty560. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Chu C., Chang H.Y. ChIRP-MS: RNA-Directed Proteomic Discovery. Methods Mol. Biol. 2018;1861:37–45. doi: 10.1007/978-1-4939-8766-5_3. [DOI] [PubMed] [Google Scholar]
  10. Chu C., Zhang Q.C., da Rocha S.T., Flynn R.A., Bharadwaj M., Calabrese J.M., Magnuson T., Heard E., Chang H.Y. Systematic discovery of Xist RNA binding proteins. Cell. 2015;161:404–416. doi: 10.1016/j.cell.2015.03.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Cockrell A.S., Leist S.R., Douglas M.G., Baric R.S. Modeling pathogenesis of emergent and pre-emergent human coronaviruses in mice. Mamm. Genome. 2018;29:367–383. doi: 10.1007/s00335-018-9760-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Eckhardt M., Hultquist J.F., Kaake R.M., Hüttenhain R., Krogan N.J. A systems approach to infectious disease. Nat. Rev. Genet. 2020;21:339–354. doi: 10.1038/s41576-020-0212-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Edgil D., Polacek C., Harris E. Dengue virus utilizes a novel strategy for translation initiation when cap-dependent translation is inhibited. J. Virol. 2006;80:2976–2986. doi: 10.1128/JVI.80.6.2976-2986.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Fernandez-Garcia M.-D., Mazzon M., Jacobs M., Amara A. Pathogenesis of flavivirus infections: using and abusing the host cell. Cell Host Microbe. 2009;5:318–328. doi: 10.1016/j.chom.2009.04.001. [DOI] [PubMed] [Google Scholar]
  15. Finkel Y., Mizrahi O., Nachshon A., Weingarten-Gabbay S., Morgenstern D., Yahalom-Ronen Y., Tamir H., Achdout H., Stein D., Israeli O. The coding capacity of SARS-CoV-2. Nature. 2020;589:125–130. doi: 10.1038/s41586-020-2739-1. [DOI] [PubMed] [Google Scholar]
  16. Fritzlar S., Aktepe T.E., Chao Y.-W., Kenney N.D., McAllaster M.R., Wilen C.B., White P.A., Mackenzie J.M. Mouse Norovirus Infection Arrests Host Cell Translation Uncoupled from the Stress Granule-PKR-eIF2α Axis. MBio. 2019;10:e00960-19. doi: 10.1128/mBio.00960-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Garcia-Blanco M.A., Vasudevan S.G., Bradrick S.S., Nicchitta C. Flavivirus RNA transactions from viral entry to genome replication. Antiviral Res. 2016;134:244–249. doi: 10.1016/j.antiviral.2016.09.010. [DOI] [PubMed] [Google Scholar]
  18. Geuens T., Bouhy D., Timmerman V. The hnRNP family: insights into their role in health and disease. Hum. Genet. 2016;135:851–867. doi: 10.1007/s00439-016-1683-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Gordon D.E., Jang G.M., Bouhaddou M., Xu J., Obernier K., White K.M., O’Meara M.J., Rezelj V.V., Guo J.Z., Swaney D.L. A SARS-CoV-2 protein interaction map reveals targets for drug repurposing. Nature. 2020;583:459–468. doi: 10.1038/s41586-020-2286-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Gralinski L.E., Baric R.S. Molecular pathology of emerging coronavirus infections. J. Pathol. 2015;235:185–195. doi: 10.1002/path.4454. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Harcourt J., Tamin A., Lu X., Kamili S., Sakthivel S.K., Murray J., Queen K., Tao Y., Paden C.R., Zhang J. Severe Acute Respiratory Syndrome Coronavirus 2 from Patient with Coronavirus Disease, United States. Emerg. Infect. Dis. 2020;26:1266–1273. doi: 10.3201/eid2606.200516. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Hoffmann H.-H., Sánchez-Rivera F.J., Schneider W.M., Luna J.M., Soto-Feliciano Y.M., Ashbrook A.W., Le Pen J., Leal A.A., Ricardo-Lax I., Michailidis E. Functional interrogation of a SARS-CoV-2 host protein interactome identifies unique and shared coronavirus host factors. Cell Host Microbe. 2021;29:267–280.e5. doi: 10.1016/j.chom.2020.12.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Hosmillo M., Lu J., McAllaster M.R., Eaglesham J.B., Wang X., Emmott E., Domingues P., Chaudhry Y., Fitzmaurice T.J., Tung M.K. Noroviruses subvert the core stress granule component G3BP1 to promote viral VPg-dependent translation. eLife. 2019;8:e46681. doi: 10.7554/eLife.46681. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Hou Y.J., Okuda K., Edwards C.E., Martinez D.R., Asakura T., Dinnon K.H., 3rd, Kato T., Lee R.E., Yount B.L., Mascenik T.M. SARS-CoV-2 Reverse Genetics Reveals a Variable Infection Gradient in the Respiratory Tract. Cell. 2020;182:429–446.e14. doi: 10.1016/j.cell.2020.05.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Huang W., Sherman B.T., Lempicki R.A. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 2009;37:1–13. doi: 10.1093/nar/gkn923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Huang W., Sherman B.T., Lempicki R.A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 2009;4:44–57. doi: 10.1038/nprot.2008.211. [DOI] [PubMed] [Google Scholar]
  27. Janes J., Young M.E., Chen E., Rogers N.H., Burgstaller-Muehlbacher S., Hughes L.D., Love M.S., Hull M.V., Kuhen K.L., Woods A.K. The ReFRAME library as a comprehensive drug repurposing library and its application to the treatment of cryptosporidiosis. Proc. Natl. Acad. Sci. U S A. 2018;115:10750–10755. doi: 10.1073/pnas.1810137115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Jankowsky E. RNA helicases at work: binding and rearranging. Trends Biochem. Sci. 2011;36:19–29. doi: 10.1016/j.tibs.2010.07.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Kim D., Paggi J.M., Park C., Bennett C., Salzberg S.L. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol. 2019;37:907–915. doi: 10.1038/s41587-019-0201-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Kim B., Arcos S., Rothamel K., Ascano M. Viral crosslinking and solid-phase purification enables discovery of ribonucleoprotein complexes on incoming RNA virus genomes. Nat. Protoc. 2020;16:516–531. doi: 10.1038/s41596-020-00429-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Kim D., Lee J.-Y., Yang J.-S., Kim J.W., Kim V.N., Chang H. The Architecture of SARS-CoV-2 Transcriptome. Cell. 2020;181:914–921.e10. doi: 10.1016/j.cell.2020.04.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Knott G.J., Bond C.S., Fox A.H. The DBHS proteins SFPQ, NONO and PSPC1: a multipurpose molecular scaffold. Nucleic Acids Res. 2016;44:3989–4004. doi: 10.1093/nar/gkw271. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Lahaye X., Gentili M., Silvin A., Conrad C., Picard L., Jouve M., Zueva E., Maurin M., Nadalin F., Knott G.J. NONO Detects the Nuclear HIV Capsid to Promote cGAS-Mediated Innate Immune Activation. Cell. 2018;175:488–501.e22. doi: 10.1016/j.cell.2018.08.062. [DOI] [PubMed] [Google Scholar]
  34. Lenarcic E.M., Landry D.M., Greco T.M., Cristea I.M., Thompson S.R. Thiouracil cross-linking mass spectrometry: a cell-based method to identify host factors involved in viral amplification. J. Virol. 2013;87:8697–8712. doi: 10.1128/JVI.00950-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Li M., Ramage H., Cherry S. Deciphering flavivirus-host interactions using quantitative proteomics. Curr. Opin. Immunol. 2020;66:90–97. doi: 10.1016/j.coi.2020.06.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Liao Y., Smyth G.K., Shi W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. 2014;30:923–930. doi: 10.1093/bioinformatics/btt656. [DOI] [PubMed] [Google Scholar]
  37. Lichinchi G., Zhao B.S., Wu Y., Lu Z., Qin Y., He C., Rana T.M. Dynamics of Human and Viral RNA Methylation during Zika Virus Infection. Cell Host Microbe. 2016;20:666–673. doi: 10.1016/j.chom.2016.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Loo Y.-M., Gale M., Jr. Immune signaling by RIG-I-like receptors. Immunity. 2011;34:680–692. doi: 10.1016/j.immuni.2011.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Love M.I., Huber W., Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. McBride C.E., Li J., Machamer C.E. The cytoplasmic tail of the severe acute respiratory syndrome coronavirus spike protein contains a novel endoplasmic reticulum retrieval signal that binds COPI and promotes interaction with membrane protein. J. Virol. 2007;81:2418–2428. doi: 10.1128/JVI.02146-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Meier-Stephenson V., Mrozowich T., Pham M., Patel T.R. DEAD-box helicases: the Yin and Yang roles in viral infections. Biotechnol. Genet. Eng. Rev. 2018;34:3–32. doi: 10.1080/02648725.2018.1467146. [DOI] [PubMed] [Google Scholar]
  42. Menachery V.D., Yount B.L., Jr., Debbink K., Agnihothram S., Gralinski L.E., Plante J.A., Graham R.L., Scobey T., Ge X.-Y., Donaldson E.F. A SARS-like cluster of circulating bat coronaviruses shows potential for human emergence. Nat. Med. 2015;21:1508–1513. doi: 10.1038/nm.3985. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Mukhopadhyay S., Kuhn R.J., Rossmann M.G. A structural perspective of the flavivirus life cycle. Nat. Rev. Microbiol. 2005;3:13–22. doi: 10.1038/nrmicro1067. [DOI] [PubMed] [Google Scholar]
  44. Ooi Y.S., Majzoub K., Flynn R.A., Mata M.A., Diep J., Li J.K., van Buuren N., Rumachik N., Johnson A.G., Puschnik A.S. An RNA-centric dissection of host complexes controlling flavivirus infection. Nat. Microbiol. 2019;4:2369–2382. doi: 10.1038/s41564-019-0518-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Ou S.H., Wu F., Harrich D., García-Martínez L.F., Gaynor R.B. Cloning and characterization of a novel cellular protein, TDP-43, that binds to human immunodeficiency virus type 1 TAR DNA sequence motifs. J. Virol. 1995;69:3584–3596. doi: 10.1128/jvi.69.6.3584-3596.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Phillips S.L., Soderblom E.J., Bradrick S.S., Garcia-Blanco M.A. Identification of Proteins Bound to Dengue Viral RNA In Vivo Reveals New Host Proteins Important for Virus Replication. MBio. 2016;7:e01865-e15. doi: 10.1128/mBio.01865-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Puschnik A.S., Majzoub K., Ooi Y.S., Carette J.E. A CRISPR toolbox to study virus-host interactions. Nat. Rev. Microbiol. 2017;15:351–364. doi: 10.1038/nrmicro.2017.29. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Raaben M., Posthuma C.C., Verheije M.H., te Lintelo E.G., Kikkert M., Drijfhout J.W., Snijder E.J., Rottier P.J.M., de Haan C.A.M. The ubiquitin-proteasome system plays an important role during various stages of the coronavirus infection cycle. J. Virol. 2010;84:7869–7879. doi: 10.1128/JVI.00485-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Ravindra N.G., Alfajaro M.M., Gasque V., Habet V., Wei J., Filler R.B., Huston N.C., Wan H., Szigeti-Buck K., Wang B. Single-cell longitudinal analysis of SARS-CoV-2 infection in human airway epithelium. bioRxiv. 2020 doi: 10.1101/2020.05.06.081695. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Ringeard M., Marchand V., Decroly E., Motorin Y., Bennasser Y. FTSJ3 is an RNA 2¢-O-methyltransferase recruited by HIV to avoid innate immune sensing. Nature. 2019;565:500–504. doi: 10.1038/s41586-018-0841-4. [DOI] [PubMed] [Google Scholar]
  51. Roth H., Magg V., Uch F., Mutz P., Klein P., Haneke K., Lohmann V., Bartenschlager R., Fackler O.T., Locker N. Flavivirus Infection Uncouples Translation Suppression from Cellular Stress Responses. MBio. 2017;8:e02150-16. doi: 10.1128/mBio.02150-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Salahudeen A.A., Choi S.S., Rustagi A., Zhu J., van Unen V., de la O S.M., Flynn R.A., Margalef-Català M., Santos A.J.M., Ju J. Progenitor identification and SARS-CoV-2 infection in human distal lung organoids. Nature. 2020;588:670–675. doi: 10.1038/s41586-020-3014-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Schmidt N., Lareau C.A., Keshishian H., Ganskih S., Schneider C., Hennig T., Melanson R., Werner S., Wei Y., Zimmer M. The SARS-CoV-2 RNA–protein interactome in infected human cells. Nat. Microbiol. 2021;6:339–353. doi: 10.1038/s41564-020-00846-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Schoggins J.W., Rice C.M. Interferon-stimulated genes and their antiviral effector functions. Curr. Opin. Virol. 2011;1:519–525. doi: 10.1016/j.coviro.2011.10.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Schubert K., Karousis E.D., Jomaa A., Scaiola A., Echeverria B., Gurzeler L.-A., Leibundgut M., Thiel V., Mühlemann O., Ban N. SARS-CoV-2 Nsp1 binds the ribosomal mRNA channel to inhibit translation. Nat. Struct. Mol. Biol. 2020;27:959–966. doi: 10.1038/s41594-020-0511-8. [DOI] [PubMed] [Google Scholar]
  56. Scobey T., Yount B.L., Sims A.C., Donaldson E.F., Agnihothram S.S., Menachery V.D., Graham R.L., Swanstrom J., Bove P.F., Kim J.D. Reverse genetics with a full-length infectious cDNA of the Middle East respiratory syndrome coronavirus. Proc. Natl. Acad. Sci. U S A. 2013;110:16157–16162. doi: 10.1073/pnas.1311542110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Shannon P., Markiel A., Ozier O., Baliga N.S., Wang J.T., Ramage D., Amin N., Schwikowski B., Ideker T. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13:2498–2504. doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Somasundaran M., Zapp M.L., Beattie L.K., Pang L., Byron K.S., Bassell G.J., Sullivan J.L., Singer R.H. Localization of HIV RNA in mitochondria of infected cells: potential role in cytopathogenicity. J. Cell Biol. 1994;126:1353–1360. doi: 10.1083/jcb.126.6.1353. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Sungnak W., Huang N., Bécavin C., Berg M., Queen R., Litvinukova M., Talavera-López C., Maatz H., Reichart D., Sampaziotis F., HCA Lung Biological Network SARS-CoV-2 entry factors are highly expressed in nasal epithelial cells together with innate immune genes. Nat. Med. 2020;26:681–687. doi: 10.1038/s41591-020-0868-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Szul T., Sztul E. COPII and COPI traffic at the ER-Golgi interface. Physiology (Bethesda) 2011;26:348–364. doi: 10.1152/physiol.00017.2011. [DOI] [PubMed] [Google Scholar]
  61. Taschuk F., Cherry S. DEAD-Box Helicases: Sensors, Regulators, and Effectors for Antiviral Defense. Viruses. 2020;12:181. doi: 10.3390/v12020181. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Thoms M., Buschauer R., Ameismeier M., Koepke L., Denk T., Hirschenberger M., Kratzat H., Hayn M., Mackens-Kiani T., Cheng J. Structural basis for translational shutdown and immune evasion by the Nsp1 protein of SARS-CoV-2. Science. 2020;369:1249–1255. doi: 10.1126/science.abc8665. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Travaglini K.J., Nabhan A.N., Penland L., Sinha R., Gillich A., Sit R.V., Chang S., Conley S.D., Mori Y., Seita J. A molecular cell atlas of the human lung from single-cell RNA sequencing. Nature. 2020;587:619–625. doi: 10.1038/s41586-020-2922-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Viktorovskaya O.V., Greco T.M., Cristea I.M., Thompson S.R. Identification of RNA Binding Proteins Associated with Dengue Virus RNA in Infected Cells Reveals Temporally Distinct Host Factor Requirements. PLoS Negl. Trop. Dis. 2016;10:e0004921. doi: 10.1371/journal.pntd.0004921. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Wang R., Simoneau C.R., Kulsuptrakul J., Bouhaddou M., Travisano K., Hayashi J.M., Carlson-Stevermer J., Oki J., Holden K., Krogan N.J. Functional genomic screens identify human host factors for SARS-CoV-2 and common cold coronaviruses. bioRxiv. 2020 doi: 10.1101/2020.09.24.312298. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Wei J., Alfajaro M.M., DeWeirdt P.C., Hanna R.E., Lu-Culligan W.J., Cai W.L., Strine M.S., Zhang S.-M., Graziano V.R., Schmitz C.O. Genome-wide CRISPR Screens Reveal Host Factors Critical for SARS-CoV-2 Infection. Cell. 2021;184:76–91.e13. doi: 10.1016/j.cell.2020.10.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. White K.M., Rosales R., Yildiz S., Kehrer T., Miorin L., Moreno E., Jangra S., Uccellini M.B., Rathnasinghe R., Coughlan L. Plitidepsin has potent preclinical efficacy against SARS-CoV-2 by targeting the host protein eEF1A. Science. 2021;371:926–931. doi: 10.1126/science.abf4058. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Wolff G., Limpens R.W.A.L., Zevenhoven-Dobbe J.C., Laugks U., Zheng S., de Jong A.W.M., Koning R.I., Agard D.A., Grünewald K., Koster A.J. A molecular pore spans the double membrane of the coronavirus replication organelle. Science. 2020;369:1395–1398. doi: 10.1126/science.abd3629. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Wu K.E., Fazal F.M., Parker K.R., Zou J., Chang H.Y. RNA-GPS Predicts SARS-CoV-2 RNA Residency to Host Mitochondria and Nucleolus. Cell Syst. 2020;11:102–108.e3. doi: 10.1016/j.cels.2020.06.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Yang S.L., DeFalco L., Anderson D.E., Zhang Y., Aw A.J., Lim S.Y., Ni L.X., Tan K.Y., Zhang T., Chawla T. Comprehensive mapping of SARS-CoV-2 interactions in vivo reveals functional virus-host interactions. bioRxiv. 2021 doi: 10.1101/2021.01.17.427000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Yue Y., Liu J., He C. RNA N6-methyladenosine methylation in post-transcriptional gene expression regulation. Genes Dev. 2015;29:1343–1355. doi: 10.1101/gad.262766.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Zaccara S., Ries R.J., Jaffrey S.R. Reading, writing and erasing mRNA methylation. Nat. Rev. Mol. Cell Biol. 2019;20:608–624. doi: 10.1038/s41580-019-0168-5. [DOI] [PubMed] [Google Scholar]
  73. Zhou P., Yang X.-L., Wang X.-G., Hu B., Zhang L., Zhang W., Si H.-R., Zhu Y., Li B., Huang C.-L. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. 2020;579:270–273. doi: 10.1038/s41586-020-2012-7. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Table S1. Sequences of 108 biotinylated ChIRP probes, related to Figure 1
mmc1.xls (37KB, xls)
Table S2. Mass spectrometry data for viral proteins and host proteins with mean enrichment ≥ 1 in SARS-CoV-2 ChIRP-MS datasets, related to Figure 2
mmc2.xlsx (527.5KB, xlsx)
Table S3. Complete matrix of data for all proteins in all datasets, related to Figure 3

Full data for all proteins in all datasets including multi-virus data, high-confidence interactomes, expanded interactomes, genome-wide CRISPR screen results, and presence in other datasets (such as UV-crosslinking data, PPI data).

mmc3.xls (3.5MB, xls)
Table S4. CRISPR mini-pool designs, related to Figures 5, 6, and 7

SARS-CoV-2 interactome CRISPR mini-pool design and mitochondrial CRISPR mini-pool design. Includes target genes, controls, and cross-species compatibility of each sgRNA.

mmc4.csv (819.3KB, csv)
Table S5. Expanded ChIRP-MS interactome CRISPR mini-pool screen, related to Figures 5 and 6

Results for SARS-CoV-2 and six other viruses. Z-scores and FDRs are included for each gene and condition.

mmc5.csv (290.1KB, csv)
Table S6. Mitochondria CRISPR mini-pool screen, related to Figure 7

Results for SARS-CoV-2 and six other viruses. Z-scores and FDRs are included for each gene and condition.

mmc6.csv (186.4KB, csv)
Table S7. ChIRP-MS overlap with DrugCentral database, related to Figure 5

Intersection of SARS-CoV-2 expanded interactome and a database of drug targets from DrugCentral (https://drugcentral.org/).

mmc7.csv (126.5KB, csv)

Data Availability Statement

Source code is available on GitHub (https://github.com/juliabelk/sarscov2_chirp_ms) along with original spreadsheets for the MS and CRISPR analyses. Sequencing data has been deposited on NCBI GEO as series GSE167341 which includes the ChIRP RNA-seq and CRISPR screening experiments.

RESOURCES