Abstract
Physical interactions between viral and host proteins are responsible for almost all aspects of the viral life cycle and the host’s immune response. Studying viral-host protein-protein interactions is thus crucial for identifying strategies for treatment and prevention of viral infection. Here, we use high-throughput yeast two-hybrid and affinity purification followed by mass spectrometry to generate a comprehensive SARS-CoV-2-human protein-protein interactome network consisting of both binary and co-complex interactions. We report a total of 739 high-confidence interactions, showing the highest overlap of interaction partners among published datasets as well as the highest overlap with genes differentially expressed in samples (such as upper airway and bronchial epithelial cells) from patients with SARS-CoV-2 infection. Showcasing the utility of our network, we describe a novel interaction between the viral accessory protein ORF3a and the host zinc finger transcription factor ZNF579 to illustrate a SARS-CoV-2 factor mediating a direct impact on host transcription. Leveraging our interactome, we performed network-based drug screens for over 2,900 FDA-approved/investigational drugs and obtained a curated list of 23 drugs that had significant network proximities to SARS-CoV-2 host factors, one of which, carvedilol, showed promising antiviral properties. We performed electronic health record-based validation using two independent large-scale, longitudinal COVID-19 patient databases and found that carvedilol usage was associated with a significantly lowered probability (17%−20%, P < 0.001) of obtaining a SARS-CoV-2 positive test after adjusting various confounding factors. Carvedilol additionally showed anti-viral activity against SARS-CoV-2 in a human lung epithelial cell line [half maximal effective concentration (EC50) value of 4.1 μM], suggesting a mechanism for its beneficial effect in COVID-19. Our study demonstrates the value of large-scale network systems biology approaches for extracting biological insight from complex biological processes.
Keywords: Interactomics, SARS-CoV-2
The global coronavirus disease 2019 (COVID-19) pandemic caused by the highly transmissible and pathogenic severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) remains a persistent part of everyday life for much of the world. Epidemiological models predict, in line with virological theory, that the pandemic is unlikely to end with the complete eradication of the virus1,2. Indeed, SARS-CoV-2 becoming endemic in pockets of the world’s population is hypothesized to be a natural consequence of the virus’s widespread propagation2, and the emergence of numerous viral variants raise concern of a perennial selection for more infectious or virulent mutants to again sweep through the globe. This highlights the need to fill in the gaps in our understanding of the interplay between this virus and its host upon natural infection and immunization, and thus, much work remains to be done to elucidate the pathobiology of SARS-CoV-2, especially as the maintenance of immunity against this pathogen remains at utmost interest to global public health.
Viruses interface with host cell surfaces to gain entry, wherein they interact with intracellular proteins to hijack host mechanisms that facilitate viral replication and evasion of an immune response3. Studying viral-host protein-protein interactions (PPIs) is therefore pivotal for understanding the mechanisms by which viral infection progresses, how the host responds to said infection, and thus, for identifying strategies for treatment and prevention4–7. These networks of interactions are especially important as proteins rarely act in isolation and their roles should be evaluated in conjunction with their neighborhood of interacting partners. Such interactomes can thus reveal biological pathways and processes impacted by the viral proteome, allowing for the discovery of novel drug targets that directly or indirectly affect the viral-host point of contact.
To that effect, here we leverage high-throughput yeast two-hybrid (Y2H) and tandem mass tag affinity purification followed by mass spectrometry (TMT-AP/MS) to generate the first binary and co-complex SARS-CoV-2-human protein-protein interactome network, which we propose to be a more complete resource for exploration of the viral-host interactome (Fig. 1a). We adopted this approach for several reasons. To date, Y2H and AP/MS are the only two methodologies available for mapping protein-protein interactome networks on a proteomic scale8,9. Pioneering studies publishing the earliest SARS-CoV-2-human interactomes utilized label-free AP/MS as their sole method for interaction mapping4–7. While both Y2H and AP/MS alone produce high-quality interactome datasets, they fundamentally capture different yet complementary aspects of the full network; specifically, Y2H interactions represent key connections between different protein complexes and pathways10. Thus, Y2H and AP/MS together can provide a more comprehensive view of the topological and biological properties of the interactome10. Moreover, labelled (e.g., TMT-based) AP/MS has been shown to provide more precise, accurate, and reproducible quantification of proteins compared to label-free AP/MS-based approaches, which is an important criterion when trying to identify true protein interactions and generate high-quality interactome networks11–21.
Here, we used both Y2H and quantitative TMT-AP/MS to generate a total of 739 high-confidence interactions among 579 human proteins and 28 SARS-CoV-2 proteins. Our interactome had an unprecedented scale and coverage compared with existing ones. Using our interactome, we identified important pathways such as protein translation, mRNA splicing, Golgi transportation, neutrophil mediated immunity, and glucose metabolism. Moreover, we prioritized host-targeting therapies by searching U.S. FDA-approved/investigational drugs for their potential anti-SARS-CoV-2 effect using state-of-the-art network proximity methods. Using two large independent COVID-19 patient databases, we found that usage of one of the top candidates, carvedilol, was associated with a lowered risk (17%−20%) of a positive COVID-19 test. Experimental validation shows that carvedilol inhibits SARS-CoV-2 infection with a half maximal effective concentration (EC50) of 4.1 μM. Altogether, these results suggest that our comprehensive SARS-CoV-2-human protein interactome offers great opportunities for understanding the pathobiological process of SARS-CoV-2 in human and identifying host-targeting therapies for COVID-19.
Results
A comprehensive SARS-CoV-2-human protein-protein interactome network
To generate a binary SARS-CoV-2-human protein-protein interactome, we systematically tested all pairwise combinations of 28 SARS-CoV-2 proteins (GenBank entry - MN908947) against ~16,000 human proteins (hORFeome V8.1)22 using high-throughput Y2H screens10,23–26 (Fig. 1a). We treated each protein as both a bait and a prey, yielding over 896,000 (28 × ~16,000 × 2) total tested pair combinations. Prior to screening, all autoactivating DNA-binding domain (DB) ORF clones were removed from further tests (see Methods). To increase experimental throughput, viral ORF activating domain (AD) and DB clones were mated against pools of 24 human ORF DB or AD clones, respectively. Following auxotrophic selection, AD:DB pairs were identified via PLATE-seq26 to generate a list of candidate interactions (see Methods). Interaction candidates were then subsequently re-tested using Y2H to ascertain high reproducibility. In all, we report a total of 299 high-quality binary SARS-CoV-2-human PPIs via our high-throughput Y2H screen, 267 of which were unique to this assay in this study (Table S1).
To complement our binary SARS-CoV-2-human protein-protein interactome, we independently expressed each of the 28 SARS-CoV-2 proteins in the human intestinal epithelial cell line Caco-2 (HTB-37; ATCC) to identify viral-host co-complex interactions using TMT-AP/MS proteomics (Fig. 1a). We used Caco-2 as our cell line model due to its endogenous expression of angiotensin-converting enzyme 2 (ACE2) and transmembrane serine protease 2 (TMPRSS2) required for SARS-CoV-2 cell entry and S protein priming, respectively27, the line’s extensive use in SARS-CoV and SARS-CoV-2 infection studies28,29, supported by known in vivo replication of SARS-CoV-2 in gastrointestinal cells30,31, and desirable cell culture characteristics including robust transfectability and rapid propagation. All Strep-, Myc-, or FLAG-tagged SARS-CoV-2 baits and their corresponding empty vector controls were transfected in biological duplicates, followed by subsequent affinity purification, TMT10 labeling, and SPS-MS3-based quantification. We filtered for interactions that met stringent fold change and p-value based cutoffs (see Methods). In all, we report a total of 472 high-confidence co-complex SARS-CoV-2-human PPIs via AP/MS, 440 of which were unique to this assay in this study (Table S1). Altogether, our orthogonal approaches generated a network composed of 739 interactions among 28 viral and 579 host proteins (Table S1).
We visualized the SARS-CoV-2-human protein-protein interactome through a network shown in Fig. 1b. The colors of the edges between the viral proteins (represented as diamond nodes) and the host proteins (represented as circle nodes) indicate the methods that detected the interaction. Host proteins that interact with a single viral protein are shown in boxes connected to their interacting partner. Several human proteins interact with multiple SARS-CoV-2 proteins, such as ACTN4, ITGB1BP2, TRIM27, and ACTN1, while the majority of human proteins (469, 81%) interact with only one SARS-CoV-2 protein (Fig. S1a). Among the viral proteins, N, ORF7b, and ORF9b achieved the highest network degrees, whereas E, NSP7, and NSP1 have the lowest network degrees (Fig. S1b). In terms of the shared interacting partners, overall, the viral proteins showed low overlap (Fig. S1c), consistent with a previously published SARS-CoV-2 interactome network4,5. We examined the overlap of host factors for Y2H and AP/MS separately and found overall low overlap of host factors as well (Fig. S1d–e).
For the entire interactome, functional enrichment analysis revealed significantly overrepresented biological processes (Fig. S2a and Table S2), including protein translation, transcription, and neutrophil-mediated immunity (highlighted with yellow background in Fig. 1b). Sematic analysis shows major biological process categories such as “ribosome biogenesis,” “rRNA metabolic process,” and “viral gene expression” (Fig. S2b). Pathway enrichment analysis show top enriched pathways such as “protein processing in the endoplasmic reticulum,” “tight junction,” “glycolysis,” “ribosome,” and “protein export” (Fig. S2c and Table S2). For individual SARS-CoV-2 proteins, many pathways and biological processes are shared in these viral proteins (Fig. S3). For example, NSP12, NSP13, and NSP16 share biological processes such as “regulation of cellular component movement,” “negative regulation of cell morphogenesis involved in differentiation,” and “negative regulation of substrate adhesion-dependent cell spreading” (Fig. S3a), and ORF7a, ORF7b, ORF8, and NSP4 share the pathway “protein processing in endoplasmic reticulum” (Fig. S3b).
Overall, our interactome is comprised of abundant information that can be utilized for the identification of COVID-19-relevant pathobiology and host-targeting therapies. We also developed an interactive visualization tool for our interactome which can be accessed from https://github.com/ChengF-Lab/COVID-19_PPI.
This SARS-CoV-2-human interactome is of high coverage and quality
To ensure the authenticity when applying our interactome for downstream studies, we first evaluated the quality through several means. We examined three previously published SARS-CoV-2-human protein-protein interactome networks4,6,7. Importantly, all three of these interactomes were generated using AP/MS-based methods alone. Overall, we found that the host factors of these interactomes significantly overlap (Fisher’s exact test, FDR < 0.05) (Fig. 2a, Extended Fig. 1a, and Table S3), although each interactome still identified a large number of unique factors. This could be explained by differences in the cell line models used (Gordon et al. and Li et al. used HEK-293T/17; Stukalov et al. used A549) as well as distinct computational and/or experimental methodologies implemented in their respective study. Still, we found that our interactome had the highest overlap of interaction partners among published SARS-CoV-2-human protein-protein interactome networks (Fig. 2a, Extended Fig. 1a, and Table S3), suggesting that our interactome had a high level of coverage. We also found that Y2H resulted in slightly more overlapping host factors (Fig. 2a), confirming the complementary nature of Y2H interactions, which are independent of human cells. Overall, AP/MS achieved significantly more overlapping viral-human protein-protein interactions across studies when known human pathways and complexes are taken into consideration (Extended Fig. 1b).
Compared to the published SARS-CoV-2-human protein-protein interactome networks4,6,7, our interactome validated 218 (38%) human host factors previously reported. Proteins including ALG5, G3BP1, CLCC1, VPS39, SIGMAR1, G3BP2, and RAP1GDS1 are identified in all four interactomes (Fig. 2a). Importantly, our interactome offers 361 (62%) newly discovered human host factors which in total interact with SARS-CoV-2 proteins in 493 novel interactions. For S protein, which plays a key role in the entry of SARS-CoV-2 into host cells32, we identified 24 novel interacting partners. Among these interacting partners of S protein, we found that CORO1C33 and STON234 express on the cell membrane, suggesting potential cell entry of SARS-CoV-2 through these human proteins in addition to known mechanisms.
We next performed several comparisons using these interactomes and our own. First, we found that our interactome was enriched in several disease-relevant pathways and biological processes that were not enriched in previous interactomes, for example, “Coronavirus disease, ” “ribosome biogenesis,” and “rRNA metabolic process”35–38 (Table S2). Next, we examined whether these datasets contained interaction partners that coincided with genes that had expression changes in response to SARS-CoV-2 infection. To this end, we performed differential expression analysis for several bulk and single-cell RNA-seq datasets from COVID-19 models or patients (see Methods). For the single-cell dataset39 which we compared the gene expression in SARS-CoV-2+ and SARS-CoV-2− cells, we found that our interactome showed significant overlap (Fisher’s exact test, FDR < 0.05) with the differentially expressed genes (DEGs) in more cell types than that of other interactomes (Extended Fig. 1c and Table S3). Using four bulk RNA-seq/proteomics datasets that contained samples such as upper airway and bronchial epithelial cells40–43, we found that our interactome had a comparable number of significant overlaps to other datasets and showed the highest overall Jaccard index and overlap coefficient with the bulk RNA-seq datasets (Extended Fig. 1d and Table S3). These results suggest that our interactome is highly enriched in genes differentially expressed in response to SARS-CoV-2 infection.
We next inspected the evolutionary features of the SARS-CoV-2 human host factors (Extended Fig. 1e–f and Table S4). Previous studies have shown that virus host factors have more conserved dN/dS rates compared to non-virus host factors44,45. Our SARS-CoV-2-human interactome showed more purifying selection (quantified by lower non-synonymous versus synonymous substitution rate ratio [dN/dS ratio]), as well as a lower evolutionary rate ratio, compared to the random background from the human protein interactome. These bioinformatics observations further suggested high evolutionary conservation of host factors of SARS-CoV-2 identified by our Y2H and TMT-AP/MS proteomics platforms, consistent with previous studies46,47.
Gene expression patterns in disease-related tissues capture important information for revealing the pathogenesis of the disease and identifying potential treatments48–50. We therefore examined the expression of the human host factors in different tissues (Fig. S4a–b and Table S5) using the GTEx data51. By normalizing the expression of each gene across different tissues (tissue specificity, see Methods), we found that lung ranked the 7th out of 33 tissues in terms of the number of host factors with positive tissue specificity (Fig. S4a), suggesting that lung is one of the tissues where these host factors have high expression52.
Altogether, these results show the high quality of the SARS-CoV-2-human interactome identified in this study and strongly encouraged us to further look into the pathobiology of COVID-19 and potential treatment using our interactome.
This SARS-CoV-2-human interactome has the potential to identify pathobiology of COVID-19
ORF3a is a SARS-CoV-2 accessory protein that has been reported to induce apoptosis in 293T cells53 and to suppress the innate immune response54–56 via unclear molecular mechanisms. Our interactome revealed that ORF3a physically interacts with ZNF579 (Fig. 1b, indicated by an arrow in the box of targets of ORF3a), a previously uncharacterized human protein likely to be a transcription factor. We were able to validate this interaction using co-immunoprecipitation (co-IP) western blotting (Fig. 2b). Furthermore, we found that the level of ZNF579 protein is decreased after overexpression of ORF3a in 293T cells (Fig. 2c). As a result, we hypothesized that the presence of ORF3a in cells might trigger changes in the transcriptional state of human genes that are normally regulated by ZNF579. Using ENCODE ChIP-seq data57,58, we found that there is a significant enrichment of genes known to be dysregulated in SARS-CoV-2 infection among targets bound by ZNF579 (Extended Fig. 2a–b). These overlapped genes participate in several disease-relevant pathways such as “ribosome,”35–38 “coronavirus disease,” and various infection-related pathways in several COVID-19 relevant cell types (Extended Fig. 2c). Specifically, ZNF579 is bound strongly to the promoter of HSPA6 (Fig. 2d). Using qPCR, we found that overexpression of ORF3a in 293T cells causes massive induction of HSPA6 (Fig. 2e). These results indicate that the multifunctional SARS-CoV-2 accessory protein ORF3a can induce expression of HSPA6, presumably by disrupting ZNF579, which is likely to normally exert repressive activity at the HSPA6 promoter. This represents an additional previously unknown activity of this multifunctional viral accessory protein.
The oligosaccharyltransferase (OST) complex catalyzes the N-glycosylation of nascent polypeptides in the endoplasmic reticulum59. Glycoproteins are critical for normal cell-cell interactions, RNA replication and pathogenesis60–62. Interestingly, OST inhibitor has been shown to have activity against Dengue virus, Zika virus, West Nile virus, yellow fever viruses, and HSV163–65 by affecting the viral replication. The OST complex was also found to be crucial for innate immune responses triggered by lipopolysaccharide66. Notably, the OST complex subunits STT3A/B, RPN1/2, and DDOST67 were all shown to be present in our Y2H and AP/MS interactome datasets, which we further validated using co-IP (Fig. 2f). Additionally, we also found Sec61 (Fig. 2g), which is a major component of the ER translocon that facilitates the entry of nascent polypeptides into the ER lumen for protein processing68. Evidence suggests that Sec61 may participate in the replication and transcription of several viruses like Ebola virus, Influenza virus, HIV, and Dengue virus68–70. Thus, we hypothesize that OST and Sec61 may also participate in SARS-COV-2 replication and/or the host immune response, offering potential targets for host-targeting therapy development.
The SARS-CoV-2 nucleocapsid (N) protein binds to the viral RNA genome and is multifunctional in viral RNA transcription, replication, and genome condensation71–73. N protein is conserved and stable with ~90% amino acid homology to the SARS-COV N protein74. From our dataset, we confirmed known interactions, including the stress granule core protein G3BP1/2 also found in three other interactome datasets. In addition to these known interactions, we identified a novel interaction between histone H1.4 and N protein. To validate this histone H1.4 and N protein interaction, we overexpressed both N protein and histone H1.4 to perform co-IP, confirming their interaction (Fig. 2h). Histone H1, also known as linker histone, mainly functions in chromatin condensation and transcriptional repression75,76. Accumulating evidence suggests that linker histone is essential in the pathogenesis of several diseases, particularly for viral infection76. There is also evidence that Histone H1 could regulate IFN and inhibit influenza replication77, in addition to playing a role in the regulation of viral gene expression78. Thus, we hypothesize that this novel viral-host interaction could also be involved in mediation viral replication and/or gene expression.
Discovery of interactome-based host-targeting therapies for COVID-19
Using our newly discovered SARS-CoV-2-human protein-protein interactome network, we performed network-based drug screening for more than 2,900 FDA-approved/investigational drugs (Fig. 3a and Table S6)79. We obtained a list of 189 FDA-approved drugs with significantly closer network proximities to the SARS-CoV-2 host factors, among which 44 had clinical trials for SARS-CoV-2 (Table S7). To refine this list, we obtained the antiviral profiles of the top 189 drugs from NCATS (https://opendata.ncats.nih.gov/covid19/assays, National Center for Advancing Translational Sciences) and evaluated each drug for their desired antiviral properties (see Methods). From this, we obtained a curated list of 23 drugs with significant network proximities to the SARS-CoV-2 host factors as well as desired anti-SARS-CoV-2 activities in at least two NCATS assays (Fig. 3b, Fig. S5, and Table S8).
Overall, these top drugs fall into several major categories, including anti-infective (amodiaquine, azithromycin, tetracycline, adefovir dipivoxil, tipranavir), anti-inflammatory (apremilast, mefenamic acid, balsalazide, fenoprofen), antihypertensive (carvedilol, hydrochlorothiazide, nilvadipine), and antineoplastic (toremifene, decitabine, venetoclax). Among these drugs, apremilast, toremifene, decitabine, amodiaquine, and azithromycin are currently being or have been tested in clinical trials for SARS-CoV-2. These top 23 drugs offer candidate treatments for SARS-CoV-2 infections across diverse mechanism-of-actions identified from our human-SARS-CoV-2 interactome. For example, balsalazide, toremifene, tetracycline, venetoclax, tipranavir, and brimonidine may inhibit viral replication by inhibiting papain-like protease 3CL (Fig. S5 and Table S8). Other drugs, such as carvedilol and hydrochlorothiazide, may directly inhibit viral entry by disrupting the Spike-ACE2 PPI (Fig. S5 and Table S8). We also found some literature evidence that may provide mechanistic insights for these drugs against SARS-CoV-2 (highlighted in Fig. 3b). For example, apremilast is a phosphodiesterase 4D (PDE4D) inhibitor80, which interacts with PDE4DIP81, a direct target of NSP13. Amitriptyline activates SIGMAR182, while NSP6 interact with SIGMAR1 to inhibit host autophagosome formation to facility coronavirus replication83. SIGMAR1 also interact with MOV10 (an RNA helicase, also a host factor targeted by the N protein) which exhibits antiviral activity against RNA viruses84.
To test whether our interactome identified drugs that could not be predicted by previously published datasets, we compared the screening results using different interactomes and their combinations. We found that 16 drugs identified by our interactome could not be predicted by any of the other three interactomes or their combinations (Extended Fig. 3a). Of the top 23 drugs with desired anti-SARS-CoV-2 profiles, six were among the 16 drugs that can only be identified by our interactome (Extended Fig. 3a). We also found that among the seven drugs identified by combining all four interactomes that could not be identified by any interactome individually, three drugs (Extended Fig. 3b, highlighted with a star) were found to have desired anti-SARS-CoV-2 profiles (Fig. S6).
Among the top 23 drug candidates, toremifene achieved significantly closer network proximity (Z = −2.19, FDR = 0.037) and has a desired anti-SARS-CoV-2 profile. Previous studies show that toremifene blocks various viral infections efficiently, including SARS-CoV-285 [half maximal inhibitory concentration (IC50) = 3.58 μM], SARS-CoV-186 (EC50 = 11.97 μM), MERS-CoV87 (EC50 = 12.9 μM), and Ebola virus88 (IC50 = ~1 μM). Indeed, NCATS data show that toremifene is active across four assays: Spike-ACE2 protein-protein interaction [half maximal activity concentration (AC50) = 11.92 μM], SARS-CoV pseudotyped particle entry (AC50 = 15.85 μM), MERS-CoV pseudotyped particle entry (AC50 = 31.62 μM), and 3CL enzymatic activity (AC50 = 5.01 μM) (Fig. S5). Mechanistically, a previous study that toremifene may inhibit SARS-CoV-2 cell entry by blocking the S and NSP14 proteins89. These comprehensive validations show potential implications of SARS-CoV-2 interactome-predicted drugs (e.g., toremifene) offer candidates to be tested further in COVID-19 patients.
Population-based and experimental validation of interactome-predicted drugs
Further, we used subject matter expertise to select candidate drugs for patient-level data validation and experimental validation based on a combination of factors: (1) strength of the interactome network-based prediction associations (a stronger network proximity score in Table S7); (2) novelty of predicted drugs; (3) availability of sufficient patient data for meaningful evaluation (exclusion of infrequently used medications); and (4) ideal pharmacokinetics properties in lung of interactome-predicted drugs. Applying these criteria resulted in 2 top candidate drugs, carvedilol (Z = −2.195, FDR = 0.03) and hydrochlorothiazide (Z = −2.428, FDR = 0.005), which are originally approved for treatment of hypertension.
To identify the drug-outcome relationships of these drugs, we used a state-of-the-art active user-design approach49,90 based on large-scale electronic health record (EHR) data. Using the Northwestern Medicine Enterprise Data Warehouse (NMEDW) COVID-19 dataset (481,526 total patients, 66,541 COVID-19 positive cases, Table 1), we found that both carvedilol (odds ratio [OR] = 0.8, 95% confidence interval [CI] 0.68–0.94, P = 0.008) and hydrochlorothiazide (OR = 0.62, 95% CI 0.47–0.82, P < 0.001) were associated with a significantly lowered risk of positive COVID-19 test after confounding adjustment (age, sex, race, and comorbidities) using a propensity score (PS) matching approach49,90 (Fig. 4a–b and Table S9). The effect of carvedilol was consistent for different race and sex subgroups (Fig. 4a and Table S9). To validate these observations, we used a second EHR database from the Cleveland Clinic COVID-19 registry as an external validation set (168,712 total individuals, 83,340 SARS-CoV-2 positive cases, Table S10). We found that carvedilol had a sufficient number of usage cases for the drug-outcome evaluation. By comparing individuals with and without carvedilol usages (PS-matched by age, sex, race, and/or comorbidities), we found that carvedilol usage was associated with a 17% (OR = 0.83, 95% CI 0.78–0.88, P < 0.001) significantly lowered risk of COVID-19 positive test (Fig. 4c). This protective effect was also consistent when we examined subgroups from the registry in terms of race and sex (Fig. 4c).
Table 1.
All patients | SARS-CoV-2 positive patients | |||
---|---|---|---|---|
Carvedilol− | Carvedilol+ | Carvedilol− | Carvedilol+ | |
Total | 478,536 | 2,990 | 66,289 | 252 |
Age | 44.67 ± 21.74 | 67.37 ± 15.08 | 41.86 ± 21.09 | 63.52 ± 17.20 |
Sex, male | 202,994 (42.4) | 1,674 (56.0) | 30,550 (46.1) | 146 (57.9) |
Race | ||||
Black | 41,858 (8.7) | 725 (24.2) | 6,136 (9.3) | 73 (29.0) |
White | 343,549 (71.8) | 1,927 (64.4) | 46,493 (70.1) | 143 (56.7) |
Other | 60,273 (12.6) | 274 (9.2) | 9,407 (14.2) | 27 (10.7) |
Comorbidity | ||||
AIDS HIV | 1,843 (0.4) | 36 (1.2) | 248 (0.4) | 5 (2.0) |
CD | 32,555 (6.8) | 1,336 (44.7) | 3,498 (5.3) | 103 (40.9) |
CPD | 87,868 (18.4) | 1,249 (41.8) | 12,078 (18.2) | 123 (48.8) |
CHF | 26,973 (5.6) | 2,042 (68.3) | 3,331 (5.0) | 172 (68.3) |
Dementia | 8,567 (1.8) | 358 (12.0) | 1,297 (2.0) | 42 (16.7) |
Diabetes w cc | 20,670 (4.3) | 1,359 (45.5) | 3,031 (4.6) | 139 (55.2) |
Diabetes wo cc | 54,322 (11.4) | 1,629 (54.5) | 8,301 (12.5) | 150 (59.5) |
HP | 5,380 (1.1) | 298 (10.0) | 568 (0.9) | 27 (10.7) |
Malignancy | 47,660 (10.0) | 805 (26.9) | 4,606 (6.9) | 67 (26.6) |
MST | 23,690 (5.0) | 413 (13.8) | 2,385 (3.6) | 36 (14.3) |
MLD | 29,730 (6.2) | 583 (19.5) | 3,711 (5.6) | 53 (21.0) |
MSLD | 2,906 (0.6) | 119 (4.0) | 307 (0.5) | 11 (4.4) |
MI | 7,913 (1.7) | 630 (21.1) | 1,017 (1.5) | 54 (21.4) |
PUD | 11,785 (2.5) | 309 (10.3) | 1,289 (1.9) | 32 (12.7) |
PVD | 23,925 (5.0) | 1,158 (38.7) | 2,626 (4.0) | 99 (39.3) |
RD | 26,068 (5.4) | 1,820 (60.9) | 3,479 (5.2) | 181 (71.8) |
RhD | 13,887 (2.9) | 232 (7.8) | 1,607 (2.4) | 18 (7.1) |
Age is shown as mean ± standard deviation. All other characteristics are shown as number of cases (percentage). P values were calculated by two-sided t test for age and Fisher’s exact test for other variables. AIDS HIV, acquired immunodeficiency syndrome and human immunodeficiency virus; CD, cerebrovascular disease; CPD, chronic pulmonary disease; CHF, congestive heart failure; Diabetes w/wo cc, diabetes with/without chronic complications; HP, hemiplegia or paraplegia; MST, metastatic solid tumor; MLD, mild liver disease; MSLD, moderate or severe liver disease; MI, myocardial infarction; PUD, peptic ulcer disease; PVD, peripheral vascular disease; RD, renal disease; RhD, rheumatic disease.
We found that carvedilol not only showed favorable results in the EHR-based validation, but also has a promising antiviral profile from NCATS, showing high potencies for multiple desired activities (Fig. S5). The NCATS profile of carvedilol is comparable to that of remdesivir, whose profile was deemed highly desirable91. We then investigated carvedilol’s anti-SARS-CoV-2 activity experimentally. We treated A549-ACE2 cells with 0.3–20 μM of carvedilol for 2 hours followed by infection with SARS-CoV-2 at a multiplicity of infection (MOI) of 0.5 and incubation for 2 days. Carvedilol showed a low cell toxicity (Fig. 4d). Cells were subsequently fixed and immunostained to detect for S protein, which was used as a marker for infection. We found that carvedilol inhibited SARS-CoV-2 infection with an EC50 value of 4.1 μM (Fig. 4d), mechanistically supporting our SARS-CoV-2-human interactome-based prediction and EHR-based findings. Lastly, we conducted drug-target network analysis of carvedilol’s targets and SARS-CoV-2 host factors (Extended Fig. 4 and Table S11). We found that carvedilol could potentially affect the SARS-CoV-2 host factors (i.e., VCAM1 and KCNH2) through PPIs with its targets (Extended Fig. 6).
Discussion
In this study, we leveraged high-throughput Y2H and quantitative TMT-AP/MS to generate the first binary and co-complex SARS-CoV-2-human protein-protein interactome network, expanding the known map produced solely by label-free AP/MS. This interactome validated 218 previously published SARS-CoV-2 host factors, and more importantly, revealed 361 novel ones. By comparing with previous interactomes, this interactome has higher overlaps among the interactomes and differentially expressed genes captured by bulk and single-cell RNA-seq of SARS-CoV-2 infection. The host factors identified in this interactome, particularly those altered in response to SARS-CoV-2 infection, present an invaluable opportunity for understanding the disease pathobiology of COVID-19 and prioritizing potential drug targets for treatment development.
Among the novel interacting partners for S protein, we identified several human proteins which may play important roles in SARS-CoV-2 infection. CORO1C33 and STON234 are expressed on the cell membrane. CORO1C is highly expressed in lung (Table S5). STON2 is ubiquitously expressed and involved in endocytic machinery34. It is possible that SARS-CoV-2 can enter host cells through binding of S protein not only to ACE2, NRP192,93 and BSG94, but also other (unknown) factors such as CORO1C and STON2. We also noticed two proteins, EPPK195 and SPECC1L96, that both express on the cell junctions. It has been suggested that SARS-CoV-2 could spread through cell-to-cell transmission97. These cell junction proteins that can be targeted by SARS-CoV-2 S protein may facilitate its cell-to-cell transmission.
We identified a previously uncharacterized human transcription factor, ZNF579, that interacts with SARS-CoV-2 accessory protein ORF3a, and report that this interaction leads to the de-repression of HSPA6. Notably, HSPA6 is significantly upregulated after SARS-CoV-2 infection in cell culture models40, indicating that the disruption of ZNF579 by ORF3a may be relevant in the context of infection. HSPA6 is a HSP70 family molecular chaperone, which are known to be involved in the entry, replication, assembly, and release of various viral pathogens98. We speculate that SARS-CoV-2 has evolved this activity to ensure sufficient levels of molecular chaperones are available to assist with the production of viral proteins in cells.
Next, using this newly discovered SARS-CoV-2-human protein-protein interactome, we performed drug repurposing and identified a list of top 23 candidate drugs with known clinical trial evidence. In this study, we used undirected human protein interactome network and degree preserved node shuffling technique. We also tested different variations of the proximity analysis and found that using directed human protein interactome network and using degree preserved link shuffling resulted in overall highly consistent Z-scores compared to the original results in this study (Extended Fig. 5). We found that although some of these drugs can directly target the host factors, most of them indirectly affect the host factors through PPIs with their targets (Fig. 3b). For example, our predicted drug candidates are validated by well-established NCATS assays (Fig. S5). In addition, among the drugs which did not have NCATS assay results, alprazolam (Iranian Registry of Clinical Trials: IRCT20211015052773N1), L-Citrulline (ClinicalTrials.gov: NCT04404426 and NCT04570384), nadroparin (European Union Clinical Trials Register: EUCTR2020-001709-21-FR, EUCTR2020-001739-28-BE, EUCTR2020-005884-29-IT), vortioxetine (NCT05047952), and myrrh (Clinical Trials Registry - India: CTRI/2020/07/026669, CTRI/2020/12/029575, CTRI/2021/01/030825, Australian New Zealand Clinical Trials Registry: ACTRN12622000215729) are in clinical trials for COVID-19. Nevertheless, future experimental and clinical studies for novel predicted drug candidates are highly warranted.
Further, we have identified carvedilol and hydrochlorothiazide as potential host-targeting treatments for COVID-19 supported by multiple lines of evidence (strong network proximities to SARS-CoV-2 host factors, significantly reduced SARS-CoV-2 positive test risks in patients using these drugs based on large-scale EHR data, and experimental validation of anti-SARS-CoV-2 activity). As drug repurposing focuses on drugs that are already in existing patient databases, we are able to test hypotheses using EHR data as we90,99 and other teams100,101 demonstrated. The unique strengths of EHRs include their provision of large patient populations useful for detecting small differences and the availability of a large number of patient factors recorded without risk of recall bias, allowing for high-dimensional covariate adjustment to minimize confounding49,100,102. Our findings are consistent with previous reports that hydrochlorothiazide100 and carvedilol101 have potential beneficial effects for COVID-19 patients. Another beta-blocker metoprolol (Z = −2.327, FDR = 0.003) was also among the top 189 drugs (Table S7), which has been tested in a small clinical trial with positive effects103. These results confirm that the unique integration of SARS-CoV-2-human interactome findings and patient analysis approaches using two large-scale EHR databases from two independent health care systems, along with in vitro anti-viral observations, offer a powerful strategy for COVID-19 therapeutic discovery. This kind of systems biology strategy can be applied to future pandemics as well.
To understand the potential mechanisms of carvedilol’s anti-SARS-CoV-2 activity, we examined the carvedilol’s mechanism-of-action impacted by SARS-CoV-2 host factors using network analysis (Extended Fig. 4 and Table S11). Among the 579 unique host factors, 237 (41%) have PPIs with carvedilol targets. A large portion of the human proteins in the enriched pathways (protein translation [26/37, 70%], mRNA splicing [14/21, 67%], glucose metabolism [9/15, 60%], and neutrophil mediated immunity [14/27, 52%]) have PPIs with carvedilol targets, suggesting a potential mechanism-of-action in which carvedilol inhibits SARS-CoV-2 replication through multiple important pathways such as protein translation and mRNA splicing. We found several carvedilol targets that exhibited closer network distance to the virus host factors and COVID-19 pathways, such as GJA1, KCNH2, NDUFC2, VCAM1, and VEGFA (Extended Fig. 4). For example, VCAM1 plays important roles and has elevated levels in COVID-19104,105, and carvedilol can inhibit expression of VCAM1106. These results offer hypotheses that can be tested for the anti-SARS-CoV-2 effect of carvedilol. Yet, future experimental validation to decipher the anti-SARS-CoV-2 mechanism-of-action of carvedilol is highly warranted as well106.
We acknowledge several limitations. The network-based SARS-CoV-2 treatment discovery may be affected by the incompleteness of the human protein-protein interactome and drug-target network. Therefore, we relied not only on the network discoveries, but also incorporated other types of evidence, such as EHR-based validation and experimental validation. Our EHR-based validation is retrospective and can only be applied to commonly used drugs due to data availability. Although we adjusted for several confounding factors, other unknown factors may still have effect on the results of EHR-based validation. Therefore, the drugs identified in this study must be validated using randomized clinical trials before they can be used in patients with COVID-19. Lastly, we focused on the SARS-CoV-2-human protein interactome in this study. Combining multiple data resources (such as CRISPR107, genome-wide association studies108, rare variants109, synthetic lethality-based genetics interactions110, and metabolomics and proteomics111) may help identify comprehensive knowledge of COVID-19 using various advanced computational (e.g., genome-scale metabolic modeling112) and multi-omics data integration approaches.
Materials and Methods
SARS-CoV-2 ORF clones
ORF3b (plasmid no. 141384; Addgene), NSP4 (plasmid no. 141369; Addgene), NSP12 (plasmid no. 141378; Addgene), NSP13 (plasmid no. 141379; Addgene), and NSP14 (plasmid no. 141380; Addgene) were a gift from Nevan Krogan, University of California, San Francisco. NSP6 (plasmid no. 149309; Addgene) and NSP16 (plasmid no. 141269; Addgene) were a gift from Fritz Roth, University of Toronto, which we cloned into our pHAGE-CMV-GAW-3xMyc-IRES-PURO construct using Gateway. E, M, N, NSP1, NSP2, NSP3, NSP5, NSP7, NSP8, NSP9, NSP10, NSP15, ORF3a, ORF6, ORF7a, ORF7b, ORF8, ORF9b, ORF9c, ORF10, and S, cloned into pCAG-FLAG and pcDNA6B-FLAG constructs, were a gift from Pei-Hui Wang, Shandong University. All SARS-CoV-2 ORFs were codon-optimized and expressed in either pLVX-EF1alpha-eGFP-2xStrep-IRES-Puro (plasmid no. 141395; Addgene), pHAGE-CMV-GAW-3xMyc-IRES-PURO, pCAG-FLAG, or pcDNA6B-FLAG mammalian expression vectors.
Y2H
Y2H screens were carried out as previously described10,23–26. In brief, viral ORFs were cloned into pDEST-AD and pDEST-DB vectors using Gateway LR to generate N-terminal ORF fusions. Similarly, human ORFeome 8.122 was cloned into pDEST-AD and pDEST-DB vectors. All AD and DB expression clones were transformed into Y2H Saccharomyces cerevisiae strains MATa Y8800 and MATα Y8930 (genotype: leu2-3, 112 trp1-901 his3Δ200 ura3-52 gal4Δ gal80Δ GAL2::ADE2 GAL1::HIS3@LYS2 GAL7::lacZ@MET2 cyh2R), respectively. To screen out autoactivating DB-ORFs, all DB-ORF MATα Y8930 transformants were mated pairwise against empty pDEST-AD MATa Y8800 transformants and scored for growth on SC-Leu-Trp+3AT and SC-Leu-Trp-Ade plates, where DB-ORFs that triggered reporter activity were removed from further experiments. To increase screening throughput, 24 human ORF AD or DB clones were pooled into single human ORF AD or DB wells, respectively. Viral ORF AD and DB clones were then mated pairwise against pools of human ORF DB and AD clones, respectively. Mated transformants were incubated overnight at 30 °C before being plated onto SC-Leu-Trp to select for mated diploid yeast. After another overnight incubation at 30 °C, diploid yeast was plated onto SC-Leu-Trp-His+3AT and SC-Leu-Trp-Ade selection plates. After another overnight incubation at 30 °C, plates were replica-cleaned and incubated again for three days at 30 °C for final interaction calling.
PLATE-seq
Each colony was picked 6 times into 96-well plates containing 15 μL of 2.5 mg/mL Zymolyase (catalog no. E1004; Zymo Research) and incubated for 45 min at 37 °C followed by 10 min at 95 °C to prepare yeast cell lysate used as PLATE-seq DNA template. PLATE-seq was carried out as previously described26. In brief, plasmid(s) from individual wells of 96-well plates were PCR amplified using a plasmid-specific forward primer and a reverse primer consisting of a well-position-specific barcode and TruSeq 3′ sequencing adapter. Amplicons derived from the same 96-well plate were pooled and purified using QIAquick PCR Purification Kit (catalog no. 28104, Qiagen). Each amplicon pool was subject to Tn5 tagmentation to fragment the amplicons and append adapters consisting of a plate-specific barcode and TruSeq 5′ sequencing adapter. Tagmented DNA was purified using QIAquick PCR Purification Kit (catalog no. 28104, Qiagen) and pooled across all 96-well plates. These pools were then subjected to low-cycle PCR both to extend the TruSeq end adapters with sequences compatible for binding to the Illumina flow cell and to enrich for only DNA fragments consisting of TruSeq adapter sequences on both ends of the plate specific and well-position-specific barcodes. PLATE-seq libraries were paired-end sequenced on an Illumina MiSeq.
Affinity purification
Caco-2 (HTB-37; ATCC) cells were cultured in EMEM (catalog no. 30–2003; ATCC) with 15% FBS (catalog no. 30–2020; ATCC) at 37 °C with 5% CO2. All 28 SARS-CoV-2 ORFs were codon-optimized and cloned into mammalian expression vectors that contained Strep, Myc, or FLAG affinity tags. SARS-CoV-2 ORF plasmids and corresponding empty vectors were individually transfected in biological duplicates into Caco-2 cells using Lipofectamine 3000 Transfection Reagent (catalog no. L3000001; Invitrogen) following manufacturer’s instructions. Cells were harvested 72 hr post-transfection and lysed using RIPA lysis buffer (50 mM Tris-HCl [pH 7.5], 150 mM NaCl, 1% (v/v) Nonidet P 40 Substitute, 5 mM EDTA, phosphatase inhibitor (catalog no. 4906845001; Roche), and protease inhibitor cocktail (catalog no. 11873580001; Roche)). Samples were incubated for 30 min at 4 °C and then centrifuged at 13,000 ×g for 15 min at 4 °C. Supernatants were collected and incubate with either MagStrep “type3” XT beads (catalog no. 24090–002; IBA Lifesciences), Myc-Trap Agarose (catalog no. yta-10; ChromoTek) or Anti-FLAG M2 Affinity Gel (catalog no. A2220; Millipore) overnight at 4 °C. Strep-tagged samples were washed with 10x Buffer W (catalog no. 2-1003-100; IBA Lifesciences) three times at 4°C. Myc- and FLAG-tagged samples were washed with RIPA buffer. Strep-tagged samples were eluted using 10x Buffer BXT (catalog no. 2-1042-025; IBA Lifesciences). Myc- and FLAG-tagged samples were eluted using IP elution buffer (100 mM Tris-HCl [pH 7.5], 1% (v/v) SDS) and incubated for 15 min at 65 °C. Other primary antibodies used in this study include c-Myc Monoclonal Antibody (catalog no. 13–2500; Invitrogen) and Monoclonal Anti-FLAG M2 Antibody (catalog no. F3165; Sigma-Aldrich).
Proteomic sample preparation
IP eluates were reduced using 200 mM TCEP for 1 hr at 55 °C. Samples were then alkylated using 375 mM iodoacetamide for 30 min at room temperature in the absence of light. Samples were digested using Trypsin Gold, Mass Spectrometry Grade (catalog no. V5280; Promega) at an enzyme-to-substrate ratio of 1:100 and incubated overnight with nutation at 37 °C. Peptide concentrations were measured using Pierce Quantitative Colorimetric Peptide Assay (catalog no. 23275; Thermo Scientific). Samples were normalized and resuspended using 1M Triethylammonium bicarbonate (TEAB) for TMT experiments (catalog no. 90114; Thermo Scientific). Samples were labeled using TMT10plex Isobaric Mass Tagging Kit (catalog no. 90113; Thermo Scientific) at a (w/w) label-to-peptide ratio of 10:1 for 1 hr at room temperature. Labeling reactions were quenched by the addition of 5% hydroxylamine and immediately pooled and dried using a SpeedVac. Labeled peptides were enriched and fractionated using Pierce High pH Reversed-Phase Peptide Fractionation Kit according to the manufacturer’s protocol (catalog no. 84868; Thermo Scientific).
LC-MS/MS
Fractions were analyzed using an EASY-nLC 1200 System (catalog no. LC140; Thermo Scientific) equipped with an in-house 3 μm C18 resin- (Michrom BioResources) packed capillary column (75 μm × 25 cm) coupled to an Orbitrap Fusion Lumos Tribrid Mass Spectrometer (catalog no. IQLAAEGAAPFADBMBHQ; Thermo Scientific). The mobile phase and elution gradient used for peptide separation were as follows: 0.1% formic acid in water as buffer A and 0.1% formic acid in 80% acetonitrile as buffer B; 0–5 min, 5%−10% B; 5–65 min, 10–55% B; 66–67 min, 55%−95% B; 67–68 min, 2% B; 68–72 min, 95% B; 72–80 min, 5% B; with a flow rate set to 200 nL/min. MS1 precursors were detected at m/z = 375–1500 and resolution = 120,000. A CID-MS2-HCD-MS3 method was used for MSn data acquisition. Precursor ions with charge of 2+ to 7+ were selected for MS2 analysis at resolution = 30,000, isolation width = 0.4 m/z, maximum injection time = 50 ms, and CID collision energy at 35%. 6 SPS precursors were selected for MS3 analysis and ions were fragmented using HCD collision energy at 65%. Spectra were recorded using Thermo Xcalibur Software Version 4.1 (catalog no. OPTON-30965; Thermo Scientific) and Tune application version 3.0 (Thermo Scientific). Raw data were searched using Proteome Discoverer Software 2.3 (Thermo Scientific) against an UniProtKB human database containing all SARS-COV-2 proteins. Search parameters specified precursor mass and fragment mass tolerance of 15 ppm. Peptide-spectrum matches (PSMs) were searched with SEQUEST HT and Percolator and filtered at FDR < 1%.
Downstream proteomic analysis
We developed a novel pipeline using a customized linear model (inspired by MSstatsTMT113) to identify high-confidence viral-host interactions from TMT-AP/MS datasets. Briefly, PSMs filtered at 1% FDR were selected for quantification by (1) the number of reporter intensity values per fraction, (2) percent isolation interference, and (3) precursor intensity values to select for one instance of a peptide peak. If more than one PSM passed these criteria, then the average of the reporter ion intensities per channel of these PSMs were taken to represent the quantification of the peptide peak. The reporter intensity values of selected PSMs were log transformed, weighed with their respective precursor intensities, and averaged to obtain protein level quantification values. Our pipeline’s novelty lies in its ability to retain useful information separated across fractions at the PSM level while ensuring no violation of the assumption of independence, such that our linear fixed-effects model with conditions (e.g., sample vs. control), as a fixed effect, can be utilized. An improved p-value calculation was used through Empirical Bayes estimation of prior variance as implemented in R limma package114.
The fold change (FC) and p-values obtained from this linear model-based approach are used to generate volcano plots for each viral bait protein compared to control. A baseline cutoff was set at a FC of greater than 2 and FDR of less than 10%, on top of which a hyperbolic curve is optimized using the distribution of the log-transformed FCs of all identified proteins to identify high-confidence interactors. As a result, the actual cutoffs used for each AP/MS experiment are often significantly more stringent than the baseline values. A PSM cutoff, along with a peptide-coverage percent cutoff (i.e., the percentage of all possible trypsin digested peptides, accounting for up to two missed cleavages that can be found), based on the number of the viral protein’s PSMs and peptide-coverage percentage, is also implemented prior to the optimization of this hyperbolic curve.
Co-immunoprecipitation
HEK 293T (CRL-3216; ATCC) cells were cultured in DMEM (catalog no. 30–2002; ATCC) supplemented with 10% FBS (catalog no. 30–2020; ATCC) and incubated at 37 °C with 5% CO2. Cells were seeded onto six-well plates and grown until reaching 70–80% confluency. SARS-CoV-2 N, ORF3a, ORF7b, histone H1.4 or N+histone H1.4, Sec61 or ORF7b+Sec61, STT3A or ORF7b+STT3A, and empty vector controls were transfected into cells by combining 2 μg of DNA with 10 μL of 1 mg/mL PEI (catalog no. 23966; Polysciences Inc.) and 150 μL Opti-MEM (catalog no. 31985062; Gibco). After 24 hr incubation, cells were gently washed three times with DPBS (1X) (catalog no. 14040117; Gibco), resuspended with 200 μL cell lysis buffer (10 mM Tris-HCl [pH 8.0], 137 mM NaCl, 1% (v/v) Triton X-100, 10% (v/v) glycerol, 2 mM EDTA, and protease inhibitor cocktail (catalog no. 11873580001; Roche)) and incubated on ice for 30 min. Extracts were then cleared by centrifugation at 16,000 ×g for 10 min at 4 °C. To perform co-immunoprecipitation (co-IP), 100 μL cell lysate was incubated with 5 μL Red Anti-FLAG M2 Affinity Gel (catalog no. F2426; Millipore) overnight at 4 °C under gentle rotation. Bound proteins were then washed three times with cell lysis buffer, eluted with 50 μl elution buffer (10 mM Tris-HCl [pH 8.0], 1% (v/v) SDS) and incubated for 10 min at 65 °C. Cell lysates and co-IP samples were then treated with 6X SDS protein loading buffer (1 M Tris-HCl [pH 6.8], 10% (v/v) SDS, 50% (v/v) glycerol, 0.03% (v/v) bromophenol blue, and 10% (v/v) β-mercaptoethanol), subjected to SDS-PAGE, and transferred onto PVDF membranes (catalog no. GE10600023; Amersham). For immunoblotting analysis, V5 Tag Monoclonal Antibody (catalog no. R960–25; Invitrogen), c-Myc Monoclonal Antibody (catalog no. 13–2500; Invitrogen), Monoclonal Anti-FLAG M2 Antibody (catalog no. F1804; Sigma-Aldrich), or ZNF579 Polyclonal Antibody (catalog no. A303–275A; Bethyl Laboratories) were used at 1:1,000 dilutions.
qPCR
293T cells were cultured as above, and ORFa-FLAG or empty vector were introduced with Lipofectamine 2000 Transfection Reagent (catalog no. 11668030; Invitrogen) using manufacturer instructions. Transfection experiments were performed in duplicate. Media was replaced 6 hours after transfection, and RNA was harvested using TRIzol Reagent (catalog no. 15596018; Invitrogen). Reverse transcription was performed with the Maxima First Strand cDNA Synthesis Kit for RT-qPCR, with dsDNase (catalog no. K1671; Thermo Scientific). qPCR was performed on a LightCycler 480 System using LightCycler FastStart DNA Master SYBR Green I (catalog no. 03003230001; Roche Diagnostics). We used two primer sets for HSPA6 (FWD1: CAAGGTGCGCGTATGCTAC, REV1: GCTCATTGATGATCCGCAACAC, FWD2: CATCGCCTATGGGCTGGAC, REV2: GGAGAGAACCGACACATCGAA), and performed 3 technical replicates of 3 concentrations of cDNA (1:10, 1:100, 1:1000) for each replicate, and then compared expression levels normalized to GAPDH using the double-delta Cp method.
Interactome comparative analysis
We compared our SARS-CoV-2-human interactome to a collection of three previously reported interactomes5–7, and compared with ours in terms of the overlap (Fisher’s exact test) with the differentially expressed genes in SARS-CoV-2 from several SARS-CoV-2 RNA-seq/proteomics datasets. These datasets include: (1) a single-cell dataset that contains CD8, Epithelial (Epi) Ciliated, Epi-Secretory, Epi-Squamous, Macro, Mono, and NK cells from BALF39. We performed comparisons of virus+ vs. virus− cells for each cell type; (2) bulk RNA-seq of human bronchial epithelial cells infected with SARS-CoV-240 (GSE147507), denoted as SARS2-DEG; (3) proteomic dataset of human Caco-2 cells infected with SARS-CoV-241, denoted as SARS2-DEP. (4) bulk RNA-seq of upper airway from COVID-19 patients vs. non-COVID-19 patients (GSE156063)42, denoted as DE-NS; (5) bulk RNA-seq of peripheral blood mononuclear cell (PBMC) isolated from COVID-19 patients vs. non-COVID-19 patients (GSE157103)43, denoted as DE-PBMC. For differential expression analysis, a cutoff of |log2FC| > 0.5 and FDR < 0.05 was considered significant. We calculated the Jaccard index (J) and overlap coefficient (C)115 for two gene sets A and B as below:
(1) |
(2) |
Functional enrichment analysis
Functional enrichment of our SARS-CoV-2 host factors were analyzed using Enrichr116 against the Kyoto Encyclopedia of Genes and Genomes (KEGG) and Gene Ontology (GO) biological process data sets. Pathways and GO terms with FDR < 0.05 were considered significantly enriched. GO terms were summarized using Revigo117.
Selective pressure and evolutionary rates
The nonsynonymous and synonymous substitution rate ratio (dN/dS ratio)118 and the evolutionary rate ratio119 of our SARS-CoV-2 host factors were evaluated as described in a previous study120. For dN/dS ratio, dN/dS<1 was considered purifying selection; dN/dS=1 was considered neutral evolution; and dN/dS>1 was considered positive Darwinian selection. The evolutionary rate ratio >1 was regarded as a fast rate and <1 as a slow rate119.
Tissue gene expression specificity
We evaluated the gene expression specificity of the SARS-CoV-2 host factors in 33 tissues using the RNA-Seq data from GTEx V8 (https://www.gtexportal.org/home/)51. The expression specificity of gene i in tissue t was defined as
(3) |
where Ei was the mean and σi was the standard deviation of gene i’s expression across all considered tissues, and Eit was the mean expression of gene i in tissue t.
Construction of human protein-protein interactome and drug-target network
The human protein-protein interactome and the drug-target network were used to screen for drugs against the SARS-CoV-2 host factors. The human protein-protein interactome, composed of 17,706 protein nodes and 351,444 unique PPI edges was constructed in our previous studies48,49,121,122. Briefly, several types of high-quality PPI evidence gathered from public databases and datasets were considered, including: binary PPIs identified by high-throughput yeast two-hybrid in three datasets49,123,124; low- or high-throughput experimentally discovered kinase-substrate interactions from KinomeNetworkX125, PhosphoNetworks126, Human Protein Resource Database (HPRD)127, DbPTM 3.0128, Phospho.ELM129, and PhosphositePlus130; signaling networks identified using low-throughput experiments in SignaLink2.0131; protein complexes revealed by robust affinity purification-mass spectrometry in BioPlex V2.016132; and curated PPIs from Instruct133, IntAct134, BioGRID135, MINT136, PINA137, and InnateDB138 that were identified by yeast two-hybrid studies, affinity purification-mass spectrometry, protein three-dimensional structures, or low-throughput experiments. For comparison, we also built a directed version of the human protein interactome using the PPI direction information (including kinase-substrate and signaling networks) from PhosphositePlus130 and SignaLink2.0131.
The drug-target network was constructed using several data sources as described in our recent studies48,49,121: DrugBank database (v4.3)139, BindingDB140, ChEMBL (v20)141, Therapeutic Target Database (TTD)142, PharmGKB database143, and IUPHAR/BPS Guide to PHARMACOLOGY144. Binding affinities Ki, Kd, IC50 or EC50 ≤ 10 μM were used as cutoff for the drug-target interactions. All networks were visualized using Cytoscape 3.8.0145. Clinical trial information was retrieved from the International Clinical Trials Registry Platform (ICTRP, assessed in May 2022).
Network proximity-based drug and drug combination screening
The “closest” network proximity measure was used to screen for 2,938 FDA approved or investigational drugs. The “closest” distance dAB for two gene/protein sets A (e.g., drug targets) and B (e.g., SARS-CoV-2 host factors) was calculated as:
(4) |
where d(a,b) is the shortest path length of a and b in the human protein-protein interactome. Network proximity dAB was further normalized to obtain a Z score using a permutation test with randomly selected proteins from the interactome with similar degree distributions to A and B (degree preserved node shuffling). Permutation tests were repeated 1,000 times. We prioritized drugs by Z < −2 and FDR < 0.05. For comparison, we also conducted degree preserved link shuffling using the double edge swap method to swap the links 10 × the size of the human protein interactome times. For individual drug target level network proximity to the disease modules, we used the “shortest” measure that measures the average shortest distances of a target to the disease proteins (host factors):
(5) |
The antiviral profiles of the prioritized drugs were retrieved from NCATS (https://opendata.ncats.nih.gov/covid19/assays). NCATS contains experimental high-throughput screening results for drugs from a series of screenings (some accompanied by counterscreens) to evaluate their anti-SARS-CoV-2 potential. We included the following screening results: SARS-CoV-2 cytopathic effect (CPE) and its counterscreen SARS-CoV-2 cytopathic effect (host tox Counter) / Cytotoxicity; human fibroblast toxicity (hCYTOX); spike-ACE2 protein-protein interaction (AlphaLISA) and its counterscreen spike-ACE2 protein-protein interaction (TruHit Counter); ACE2 enzymatic activity; SARS-CoV pseudotyped particle entry (CoV-PPE) and its counterscreen SARS-CoV pseudotyped particle entry counter screen (CoV-PPE_cs); MERS-CoV pseudotyped particle entry (MERS-PPE) and its counterscreen MERS-CoV pseudotyped particle entry counter screen (MERS-PPE_cs); and 3CL enzymatic activity. Based on the NCATS SARS-CoV-2 data, we further selected a list of top drugs from the network proximity-based prioritization that show ideal activities in at least two of these screenings.
COVID-19 patient data observations
Two independent datasets revealed corroborating evidence for the drug carvedilol which was identified by our interactome prioritization framework. The first dataset (discovery dataset) was from the Northwestern Medicine Enterprise Data Warehouse (NMEDW). We first identified 512,198 patients who had SARS-CoV-2 reverse transcription-polymerase chain reaction (RT-PCR) test results recorded in NMEDW. Patients with a positive RT-PCR test were considered COVID-19 positive, where the earliest time of the test was recorded as the effective time. Patients that did not have any positive or presumptive positive RT-PCR tests and the latest PCR test was negative (excluding pending and undetermined results) were considered COVID-19 negative, where the latest time of the test was recorded as the effective time. By these metrics, 29,224 patients with pending or undetermined results were removed, yielding 482,974 patients of interest. We excluded patients without age or sex information yielding a cohort of 481,526 patients, 66,541 of which were COVID-19 positive. We then extracted the carvedilol (and other drugs) administration information for all patients in the final cohort. If a patient had a carvedilol administration record with an administration date in the 6-month time window leading to the effective RT-PCR result date and an administered dose > 0, the patient was considered carvedilol+. We also extracted comorbidity information of the cohort for propensity score (PS) matching, for which we used the Charlson Comorbidity Index (CCI). All comorbidities and corresponding patient numbers are listed in Table 1.
The second dataset (external validation dataset) was an institutional review board-approved COVID-19 registry dataset that included 168,712 individuals tested for SARS-CoV-2 infection (83,340 of which were positive cases) from March 8th to May 26th, 2021, at the Cleveland Clinic in Ohio and Florida, United States (Table S10). Pooled oropharyngeal and nasopharyngeal swab specimens were used to test for SARS-CoV-2 by RT-PCR assay in the Cleveland Clinic Pathology and Laboratory Medicine Institute. All SARS-CoV-2 testing followed the guidelines established by the Centers for Disease Control and Prevention of United States. The dataset included baseline demographic information, medications, and COVID-19 test results. We used REDCap146 electronic data capture tools to extract the patient data from the electronic health records (EPIC Systems), and the data were manually checked by a professional team trained on uniform sources for the study variables. A carvedilol exposure group (carvedilol+) included patients that were actively taking carvedilol at the time of SARS-CoV-2 testing. Positive laboratory test results for COVID-19 were used as the primary outcome. PS was used to match age, sex, and race to reduce various confounding factors. Odds ratio was used to evaluate the carvedilol benefit to primary outcome. All analyses were conducted by matchit package in the R v4.1.0 platform.
Anti-SARS-CoV-2 activity assay for carvedilol
A549 (CCL-185; ATCC) cells exogenously expressing angiotensin-converting enzyme 2 (ACE2) (A549-ACE2) were a gift from Benjamin R. Tenoever, Icahn School of Medicine at Mount Sinai. A549-ACE2 cells were cultured in DMEM (catalog no. 11965092; ThermoFisher) with 10% FBS (catalog no. 100–106; GeminiBio) and used for SARS-CoV-2 infection. SARS-CoV-2 virus (nCoV/Washington/1/2020) was provided by the Biocontainment Laboratory–University of Texas Medical Branch Galveston National Laboratory, Texas, United States. Vero E6 (CRL-1586; ATCC) cells were used to propagate and titer SARS-CoV-2. SARS-CoV-2 infections were performed under biosafety level 3 conditions at the Biocontainment Laboratory–University of Chicago Howard T. Ricketts Laboratory, Illinois, United States. A549-ACE2 cells cultured in DMEM with 2% FBS were treated with carvedilol for 2 hours at the indicated concentrations. Cells were infected with an MOI of 0.5 in media containing the appropriate concentration of drug. 48 hr post-infection, cells were fixed with 10% formalin (catalog no. 305–510; Fisherbrand), blocked, and probed with mouse anti-SARS-CoV-2-spike antibody (catalog no. GTX632604; GeneTex) diluted 1:1,000 for 4 hr, rinsed, and probed with anti-mouse-HRP (catalog no. MP7401; Vector Labratories) for 1 hr, washed, and then developed with DAB substrate (catalog no. 34065; ThermoScientifc) for 10 min. Spike positive cells (n>40) were quantified by light microscopy as blinded samples. A sigmoid fit was used to extract EC50 values using MATLAB.
To measure the effect of carvedilol on cell viability, cells were treated with various concentrations of carvedilol diluted in 2% DMEM for 48 hours. The drug solution was then removed, and cells were fixed with 10% formalin solution. The cells were stained with Crystal Violet 0.25% for 30 minutes. The plate was spun dried in a tabletop centrifuge and absorbance of each well was measured using a TECAN Infinite 200 Pro at 595 nm. The % survival was calculated relative to DMSO treated cells.
Extended Data
Supplementary Material
Acknowledgements
This work was primarily supported by the National Institute of Aging (NIA) under Award Number U01AG073323 and R01AG066707 to F.C. and the National Institute of General Medical Sciences R01GM124559 to H. Y. This work was supported in part by NIA grants 3R01AG066707-01S1, 3R01AG066707-02S1, and R56AG074001 to F.C., the National Institute of General Medical Sciences (R01GM125639, R01GM130885, RM1GM139738), The National Institute of Diabetes and Digestive and Kidney Diseases (R01DK115398) to HY.
Footnotes
Code availability
The network proximity framework can be found in https://github.com/ChengF-Lab/COVID-19_Map.
Competing interests
The authors declare that there are no competing interests.
Supplementary Files
This is a list of supplementary files associated with this preprint. Click to download.
Data availability
We downloaded the GTEx v8 dataset from https://gtexportal.org/home/. The human protein-protein interactome and drug-target network are found in https://github.com/ChengF-Lab/COVID-19_Map. An interactive version of Fig. 1b can be found in https://github.com/ChengF-Lab/COVID-19_PPI. All other data can be found in the supplementary tables. We accessed ZNF579 ChIP-seq data from the ENCODE portal under accession ENCSR018MQH.
References
- 1.Phillips N. The coronavirus is here to stay - here’s what that means. Nature 590, 382–384 (2021). [DOI] [PubMed] [Google Scholar]
- 2.Veldhoen M. & Simas J.P. Endemic SARS-CoV-2 will maintain post-pandemic immunity. Nat Rev Immunol 21, 131–132 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Durmus Tekir S.D. & Ulgen K.O. Systems biology of pathogen-host interaction: networks of protein-protein interaction within pathogens and pathogen-human interactions in the post-genomic era. Biotechnol J 8, 85–96 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Gordon D.E. et al. A SARS-CoV-2 protein interaction map reveals targets for drug repurposing. Nature 583, 459–468 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Gordon D.E. et al. Comparative host-coronavirus protein interaction networks reveal pan-viral disease mechanisms. Science 370(2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Li J. et al. Virus-Host Interactome and Proteomic Survey Reveal Potential Virulence Factors Influencing SARS-CoV-2 Pathogenesis. Med (N Y) 2, 99–112 e7 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Stukalov A. et al. Multilevel proteomics reveals host perturbations by SARS-CoV-2 and SARS-CoV. Nature 594, 246–252 (2021). [DOI] [PubMed] [Google Scholar]
- 8.Rajagopala S.V. Mapping the Protein-Protein Interactome Networks Using Yeast Two-Hybrid Screens. Adv Exp Med Biol 883, 187–214 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Causier B. Studying the interactome with the yeast two-hybrid system and mass spectrometry. Mass Spectrom Rev 23, 350–67 (2004). [DOI] [PubMed] [Google Scholar]
- 10.Yu H. et al. High-quality binary protein interaction map of the yeast interactome network. Science 322, 104–10 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Murphy J.P. et al. Multiplexed Relative Quantitation with Isobaric Tagging Mass Spectrometry Reveals Class I Major Histocompatibility Complex Ligand Dynamics in Response to Doxorubicin. Anal Chem 91, 5106–5115 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Santin Y.G. Uncovering the In Vivo Proxisome Using Proximity-Tagging Methods. Bioessays 41, e1900131 (2019). [DOI] [PubMed] [Google Scholar]
- 13.Berggard T., Linse S. & James P. Methods for the detection and analysis of protein-protein interactions. Proteomics 7, 2833–42 (2007). [DOI] [PubMed] [Google Scholar]
- 14.ten Have S., Boulon S., Ahmad Y. & Lamond A.I. Mass spectrometry-based immuno-precipitation proteomics - the user’s guide. Proteomics 11, 1153–9 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Zhang G., Annan R.S., Carr S.A. & Neubert T.A. Overview of peptide and protein analysis by mass spectrometry. Curr Protoc Protein Sci Chapter 16, Unit 16 1 (2010). [DOI] [PubMed] [Google Scholar]
- 16.Gingras A.C. & Raught B. Beyond hairballs: The use of quantitative mass spectrometry data to understand protein-protein interactions. FEBS Lett 586, 2723–31 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Asara J.M., Christofk H.R., Freimark L.M. & Cantley L.C. A label-free quantification method by MS/MS TIC compared to SILAC and spectral counting in a proteomics screen. Proteomics 8, 994–9 (2008). [DOI] [PubMed] [Google Scholar]
- 18.Collier T.S. et al. Direct comparison of stable isotope labeling by amino acids in cell culture and spectral counting for quantitative proteomics. Anal Chem 82, 8696–702 (2010). [DOI] [PubMed] [Google Scholar]
- 19.Megger D.A. et al. Comparison of label-free and label-based strategies for proteome analysis of hepatoma cell lines. Biochim Biophys Acta 1844, 967–76 (2014). [DOI] [PubMed] [Google Scholar]
- 20.Li Z. et al. Systematic comparison of label-free, metabolic labeling, and isobaric chemical labeling for quantitative proteomics on LTQ Orbitrap Velos. J Proteome Res 11, 1582–90 (2012). [DOI] [PubMed] [Google Scholar]
- 21.Stepath M. et al. Systematic Comparison of Label-Free, SILAC, and TMT Techniques to Study Early Adaption toward Inhibition of EGFR Signaling in the Colorectal Cancer Cell Line DiFi. J Proteome Res 19, 926–937 (2020). [DOI] [PubMed] [Google Scholar]
- 22.Team M.G.C.P. et al. The completion of the Mammalian Gene Collection (MGC). Genome Res 19, 2324–33 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Das J. et al. Cross-species protein interactome mapping reveals species-specific wiring of stress response pathways. Sci Signal 6, ra38 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Vo T.V. et al. A Proteome-wide Fission Yeast Interactome Reveals Network Evolution Principles from Yeasts to Human. Cell 164, 310–323 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Fragoza R. et al. Extensive disruption of protein interactions by genetic variants across the allele frequency spectrum in human populations. Nat Commun 10, 4141 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Wierbowski S.D. et al. A massively parallel barcoded sequencing pipeline enables generation of the first ORFeome and interactome map for rice. Proc Natl Acad Sci U S A 117, 11836–11842 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Hoffmann M. et al. SARS-CoV-2 Cell Entry Depends on ACE2 and TMPRSS2 and Is Blocked by a Clinically Proven Protease Inhibitor. Cell 181, 271–280 e8 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Cinatl J. et al. Treatment of SARS with human interferons. Lancet 362, 293–4 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Hoehl S. et al. Evidence of SARS-CoV-2 Infection in Returning Travelers from Wuhan, China. N Engl J Med 382, 1278–1280 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Lamers M.M. et al. SARS-CoV-2 productively infects human gut enterocytes. Science 369, 50–54 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Xiao F. et al. Evidence for Gastrointestinal Infection of SARS-CoV-2. Gastroenterology 158, 1831–1833 e3 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Huang Y., Yang C., Xu X.F., Xu W. & Liu S.W. Structural and functional properties of SARS-CoV-2 spike protein: potential antivirus drug development for COVID-19. Acta Pharmacol Sin 41, 1141–1149 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Xavier C.P. et al. Structural and functional diversity of novel coronin 1C (CRN2) isoforms in muscle. J Mol Biol 393, 287–99 (2009). [DOI] [PubMed] [Google Scholar]
- 34.Martina J.A., Bonangelino C.J., Aguilar R.C. & Bonifacino J.S. Stonin 2: an adaptor-like protein that interacts with components of the endocytic machinery. J Cell Biol 153, 1111–20 (2001). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Wei J. & Hui A. Review of Ribosome Interactions with SARS-CoV-2 and COVID-19 mRNA Vaccine. Life (Basel) 12(2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Banerjee A.K. et al. SARS-CoV-2 Disrupts Splicing, Translation, and Protein Trafficking to Suppress Host Defenses. Cell 183, 1325–1339 e21 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Zou M. et al. The Molecular Mechanism of Multiple Organ Dysfunction and Targeted Intervention of COVID-19 Based on Time-Order Transcriptomic Analysis. Front Immunol 12, 729776 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Schmidt N. et al. The SARS-CoV-2 RNA-protein interactome in infected human cells. Nat Microbiol 6, 339–353 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Ren X. et al. COVID-19 immune features revealed by a large-scale single-cell transcriptome atlas. Cell 184, 1895–1913 e19 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Blanco-Melo D. et al. Imbalanced Host Response to SARS-CoV-2 Drives Development of COVID-19. Cell 181, 1036–1045 e9 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Bojkova D. et al. Proteomics of SARS-CoV-2-infected host cells reveals therapy targets. Nature 583, 469–472 (2020). [DOI] [PubMed] [Google Scholar]
- 42.Mick E. et al. Upper airway gene expression reveals suppressed immune responses to SARS-CoV-2 compared with other respiratory viruses. Nat Commun 11, 5854 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Overmyer K.A. et al. Large-Scale Multi-omic Analysis of COVID-19 Severity. Cell Syst 12, 23–40 e7 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Sironi M., Cagliani R., Forni D. & Clerici M. Evolutionary insights into host-pathogen interactions from mammalian sequence data. Nat Rev Genet 16, 224–36 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Cheng F. et al. Systems Biology-Based Investigation of Cellular Antiviral Drug Targets Identified by Gene-Trap Insertional Mutagenesis. PLoS Comput Biol 12, e1005074 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Rozenblatt-Rosen O. et al. Interpreting cancer genomes using systematic host network perturbations by tumour virus proteins. Nature 487, 491–5 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Zhong Q. et al. An inter-species protein-protein interaction network across vast evolutionary distance. Mol Syst Biol 12, 865 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Cheng F. et al. A genome-wide positioning systems network algorithm for in silico drug repurposing. Nat Commun 10, 3476 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Cheng F. et al. Network-based approach to prediction and population-based validation of in silico drug repurposing. Nat Commun 9, 2691 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Luo Y. et al. A multidimensional precision medicine approach identifies an autism subtype characterized by dyslipidemia. Nat Med 26, 1375–1379 (2020). [DOI] [PubMed] [Google Scholar]
- 51.GTEx Consortium. The Genotype-Tissue Expression (GTEx) project. Nat Genet 45, 5805 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Jing Y. et al. Potential influence of COVID-19/ACE2 on the female reproductive system. Mol Hum Reprod 26, 367–373 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Ren Y. et al. The ORF3a protein of SARS-CoV-2 induces apoptosis in cells. Cell Mol Immunol 17, 881–883 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Xia H. et al. Evasion of Type I Interferon by SARS-CoV-2. Cell Rep 33, 108234 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Rui Y. et al. Unique and complementary suppression of cGAS-STING and RNA sensing- triggered innate immune responses by SARS-CoV-2 proteins. Signal Transduct Target Ther 6, 123 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Wang R. et al. ORF3a Protein of Severe Acute Respiratory Syndrome Coronavirus 2 Inhibits Interferon-Activated Janus Kinase/Signal Transducer and Activator of Transcription Signaling via Elevating Suppressor of Cytokine Signaling 1. Front Microbiol 12, 752597 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Consortium E.P. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Davis C.A. et al. The Encyclopedia of DNA elements (ENCODE): data portal update. Nucleic Acids Res 46, D794–D801 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Kelleher D.J. & Gilmore R. An evolving view of the eukaryotic oligosaccharyltransferase. Glycobiology 16, 47R–62R (2006). [DOI] [PubMed] [Google Scholar]
- 60.Ruiz-Canada C., Kelleher D.J. & Gilmore R. Cotranslational and posttranslational N-glycosylation of polypeptides by distinct mammalian OST isoforms. Cell 136, 272–83 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Lindenbach B.D. & Rice C.M. trans-Complementation of yellow fever virus NS1 reveals a role in early RNA replication. J Virol 71, 9608–17 (1997). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Beatty P.R. et al. Dengue virus NS1 triggers endothelial permeability and vascular leak that is prevented by NS1 vaccination. Sci Transl Med 7, 304ra141 (2015). [DOI] [PubMed] [Google Scholar]
- 63.Lu H., Cherepanova N.A., Gilmore R., Contessa J.N. & Lehrman M.A. Targeting STT3A-oligosaccharyltransferase with NGI-1 causes herpes simplex virus 1 dysfunction. FASEB J 33, 6801–6812 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Marceau C.D. et al. Genetic dissection of Flaviviridae host factors through genome-scale CRISPR screens. Nature 535, 159–63 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Puschnik A.S. et al. A Small-Molecule Oligosaccharyltransferase Inhibitor with Panflaviviral Activity. Cell Rep 21, 3032–3039 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Parnas O. et al. A Genome-wide CRISPR Screen in Primary Immune Cells to Dissect Regulatory Networks. Cell 162, 675–86 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Mohd Ropidi M.I., Khazali A.S., Nor Rashid N. & Yusof R. Endoplasmic reticulum: a focal point of Zika virus infection. J Biomed Sci 27, 27 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Linxweiler M., Schick B. & Zimmermann R. Let’s talk about Secs: Sec61, Sec62 and Sec63 in signal transduction, oncology and personalized medicine. Signal Transduct Target Ther 2, 17002 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Heaton N.S. et al. Targeting Viral Proteostasis Limits Influenza Virus, HIV, and Dengue Virus Infection. Immunity 44, 46–58 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Iwasa A. et al. Contribution of Sec61alpha to the life cycle of Ebola virus. J Infect Dis 204 Suppl 3, S919–26 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Cascarina S.M. & Ross E.D. A proposed role for the SARS-CoV-2 nucleocapsid protein in the formation and regulation of biomolecular condensates. FASEB J 34, 9832–9842 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Dutta N.K., Mazumdar K. & Gordy J.T. The Nucleocapsid Protein of SARS-CoV-2: a Target for Vaccine Development. J Virol 94(2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Zeng W. et al. Biochemical characterization of SARS-CoV-2 nucleocapsid protein. Biochem Biophys Res Commun 527, 618–623 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Grifoni A. et al. A Sequence Homology and Bioinformatic Approach Can Predict Candidate Targets for Immune Responses to SARS-CoV-2. Cell Host Microbe 27, 671–680 e2 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Buttinelli M., Panetta G., Rhodes D. & Travers A. The role of histone H1 in chromatin condensation and transcriptional repression. Genetica 106, 117–24 (1999). [DOI] [PubMed] [Google Scholar]
- 76.Ye X. et al. Linker Histone in Diseases. Int J Biol Sci 13, 1008–1018 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Liu X. et al. HIST1H1C Regulates Interferon-beta and Inhibits Influenza Virus Replication by Interacting with IRF3. Front Immunol 8, 350 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Conn K.L., Hendzel M.J. & Schang L.M. Linker histones are mobilized during infection with herpes simplex virus type 1. J Virol 82, 8629–46 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Zhou Y. et al. Network-based drug repurposing for novel coronavirus 2019-nCoV/SARS-CoV-2. Cell Discov 6, 14 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Schafer P.H. et al. Apremilast is a selective PDE4 inhibitor with regulatory effects on innate immunity. Cell Signal 26, 2016–29 (2014). [DOI] [PubMed] [Google Scholar]
- 81.Perino A., Ghigo A., Scott J.D. & Hirsch E. Anchoring proteins as regulators of signaling pathways. Circ Res 111, 482–92 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Werling L.L., Keller A., Frank J.G. & Nuwayhid S.J. A comparison of the binding profiles of dextromethorphan, memantine, fluoxetine and amitriptyline: treatment of involuntary emotional expression disorder. Exp Neurol 207, 248–57 (2007). [DOI] [PubMed] [Google Scholar]
- 83.Kumar S. et al. Mammalian hybrid pre-autophagosomal structure HyPAS generates autophagosomes. Cell 184, 5950–5969 e22 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Cuevas R.A. et al. MOV10 Provides Antiviral Activity against RNA Viruses by Enhancing RIG-I-MAVS-Independent IFN Induction. J Immunol 196, 3877–86 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Jeon S. et al. Identification of Antiviral Drug Candidates against SARS-CoV-2 from FDA-Approved Drugs. Antimicrob Agents Chemother 64, e00819–20 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Dyall J. et al. Repurposing of clinically developed drugs for treatment of Middle East respiratory syndrome coronavirus infection. Antimicrob Agents Chemother 58, 4885–93 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Cong Y. et al. MERS-CoV pathogenesis and antiviral efficacy of licensed drugs in human monocyte-derived antigen-presenting cells. PLoS One 13, e0194868 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Johansen L.M. et al. FDA-approved selective estrogen receptor modulators inhibit Ebola virus infection. Sci Transl Med 5, 190ra79 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Martin W.R. & Cheng F. Repurposing of FDA-Approved Toremifene to Treat COVID-19 by Blocking the Spike Glycoprotein and NSP14 of SARS-CoV-2. J Proteome Res 19, 4670–4677 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Zhou Y. et al. A network medicine approach to investigation and population-based validation of disease manifestations and drug repurposing for COVID-19. PLoS Biol 18, e3000970 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Kc G.B. et al. A machine learning platform to estimate anti-SARS-CoV-2 activities. Nature Machine Intelligence 3, 527–535 (2021). [Google Scholar]
- 92.Daly J.L. et al. Neuropilin-1 is a host factor for SARS-CoV-2 infection. Science 370, 861865 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Cantuti-Castelvetri L. et al. Neuropilin-1 facilitates SARS-CoV-2 cell entry and infectivity. Science 370, 856–860 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Wang K. et al. CD147-spike protein is a novel route for SARS-CoV-2 infection to host cells. Signal Transduct Target Ther 5, 283 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Shimada H. et al. Epiplakin modifies the motility of the HeLa cells and accumulates at the outer surfaces of 3-D cell clusters. J Dermatol 40, 249–58 (2013). [DOI] [PubMed] [Google Scholar]
- 96.Saadi I. et al. Deficiency of the cytoskeletal protein SPECC1L leads to oblique facial clefting. Am J Hum Genet 89, 44–55 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Zeng C. et al. SARS-CoV-2 Spreads through Cell-to-Cell Transmission. bioRxiv, 2021.06.01.446579 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Wan Q., Song D., Li H. & He M.L. Stress proteins: the biological functions in virus infection, present and challenges for target-based antiviral drug development. Signal Transduct Target Ther 5, 125 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Fang J. et al. Endophenotype-based in silico network medicine discovery combined with insurance record data mining identifies sildenafil as a candidate drug for Alzheimer’s disease. Nat Aging 1, 1175–1188 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Israel A. et al. Identification of drugs associated with reduced severity of COVID-19 - a case-control study in a large population. Elife 10(2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Skayem C. & Ayoub N. Carvedilol and COVID-19: A Potential Role in Reducing Infectivity and Infection Severity of SARS-CoV-2. Am J Med Sci 360, 300 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102.Zhou Y., Wang F., Tang J., Nussinov R. & Cheng F. Artificial intelligence in COVID-19 drug repurposing. Lancet Digit Health 2, e667–e676 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Clemente-Moragon A. et al. Metoprolol in Critically Ill Patients With COVID-19. J Am Coll Cardiol 78, 1001–1011 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Shen J. et al. The Epidemiological and Mechanistic Understanding of the Neurological Manifestations of COVID-19: A Comprehensive Meta-Analysis and a Network Medicine Observation. Front Neurosci 15, 606926 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Kumar N. et al. SARS-CoV-2 Spike Protein S1-Mediated Endothelial Injury and Pro-Inflammatory State Is Amplified by Dihydrotestosterone and Prevented by Mineralocorticoid Antagonism. Viruses 13(2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Chen J.W. et al. Carvedilol inhibits tumor necrosis factor-alpha-induced endothelial transcription factor activation, adhesion molecule expression, and adhesiveness to human mononuclear cells. Arterioscler Thromb Vasc Biol 24, 2075–81 (2004). [DOI] [PubMed] [Google Scholar]
- 107.Samelson A.J. et al. BRD2 inhibition blocks SARS-CoV-2 infection by reducing transcription of the host cell receptor ACE2. Nat Cell Biol 24, 24–34 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108.Roberts G.H.L. et al. Expanded COVID-19 phenotype definitions reveal distinct patterns of genetic association and protective effects. Nat Genet 54, 374–381 (2022). [DOI] [PubMed] [Google Scholar]
- 109.Mantovani S. et al. Rare variants in Toll-like receptor 7 results in functional impairment and downregulation of cytokine-mediated signaling in COVID-19 patients. Genes Immun 23, 51–56 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110.Pal L.R. et al. Synthetic lethality-based prediction of anti-SARS-CoV-2 targets. iScience 25, 104311 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111.Shen B. et al. Proteomic and Metabolomic Characterization of COVID-19 Patient Sera. Cell 182, 59–72 e15 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 112.Cheng K. et al. Genome-scale metabolic modeling reveals SARS-CoV-2-induced metabolic changes and antiviral targets. Mol Syst Biol 17, e10260 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 113.Huang T. et al. MSstatsTMT: Statistical Detection of Differentially Abundant Proteins in Experiments with Isobaric Labeling and Multiple Mixtures. Mol Cell Proteomics 19, 17061723 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 114.Ritchie M.E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res 43, e47 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 115.Menche J. et al. Disease networks. Uncovering disease-disease relationships through the incomplete interactome. Science 347, 1257601 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 116.Kuleshov M.V. et al. Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res 44, W90–7 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 117.Supek F., Bosnjak M., Skunca N. & Smuc T. REVIGO summarizes and visualizes long lists of gene ontology terms. PLoS One 6, e21800 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 118.Hirsh A.E., Fraser H.B. & Wall D.P. Adjusting for selection on synonymous sites in estimates of evolutionary distance. Mol Biol Evol 22, 174–7 (2005). [DOI] [PubMed] [Google Scholar]
- 119.Bezginov A., Clark G.W., Charlebois R.L., Dar V.U. & Tillier E.R. Coevolution reveals a network of human proteins originating with multicellularity. Mol Biol Evol 30, 332–46 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 120.Cheng F. et al. Studying tumorigenesis through network evolution and somatic mutational perturbations in the cancer interactome. Mol Biol Evol 31, 2156–69 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 121.Cheng F., Kovacs I.A. & Barabasi A.L. Network-based prediction of drug combinations. Nat Commun 10, 1197 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 122.Smith I.N., Thacker S., Seyfi M., Cheng F. & Eng C. Conformational Dynamics and Allosteric Regulation Landscapes of Germline PTEN Mutations Associated with Autism Compared to Those Associated with Cancer. Am J Hum Genet 104, 861–878 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 123.Rolland T. et al. A proteome-scale map of the human interactome network. Cell 159, 1212–1226 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 124.Rual J.F. et al. Towards a proteome-scale map of the human protein-protein interaction network. Nature 437, 1173–8 (2005). [DOI] [PubMed] [Google Scholar]
- 125.Cheng F., Jia P., Wang Q. & Zhao Z. Quantitative network mapping of the human kinome interactome reveals new clues for rational kinase inhibitor discovery and individualized cancer therapy. Oncotarget 5, 3697–710 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 126.Hu J. et al. PhosphoNetworks: a database for human phosphorylation networks. Bioinformatics 30, 141–2 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 127.Keshava Prasad T.S. et al. Human Protein Reference Database--2009 update. Nucleic Acids Res 37, D767–72 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 128.Lu C.T. et al. DbPTM 3.0: an informative resource for investigating substrate site specificity and functional association of protein post-translational modifications. Nucleic Acids Res 41, D295–305 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 129.Dinkel H. et al. Phospho.ELM: a database of phosphorylation sites--update 2011. Nucleic Acids Res 39, D261–7 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 130.Hornbeck P.V. et al. PhosphoSitePlus, 2014: mutations, PTMs and recalibrations. Nucleic Acids Res 43, D512–20 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 131.Csabai L., Olbei M., Budd A., Korcsmaros T. & Fazekas D. SignaLink: Multilayered Regulatory Networks. Methods Mol Biol 1819, 53–73 (2018). [DOI] [PubMed] [Google Scholar]
- 132.Huttlin E.L. et al. The BioPlex Network: A Systematic Exploration of the Human Interactome. Cell 162, 425–440 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 133.Meyer M.J., Das J., Wang X. & Yu H. INstruct: a database of high-quality 3D structurally resolved protein interactome networks. Bioinformatics 29, 1577–9 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 134.Orchard S. et al. The MIntAct project--IntAct as a common curation platform for 11 molecular interaction databases. Nucleic Acids Res 42, D358–63 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 135.Oughtred R. et al. The BioGRID interaction database: 2019 update. Nucleic Acids Res 47, D529–D541 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 136.Licata L. et al. MINT, the molecular interaction database: 2012 update. Nucleic Acids Res 40, D857–61 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 137.Cowley M.J. et al. PINA v2.0: mining interactome modules. Nucleic Acids Res 40, D8625 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 138.Breuer K. et al. InnateDB: systems biology of innate immunity and beyond--recent updates and continuing curation. Nucleic Acids Res 41, D1228–33 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 139.Law V. et al. DrugBank 4.0: shedding new light on drug metabolism. Nucleic Acids Res 42, D1091–7 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 140.Liu T., Lin Y., Wen X., Jorissen R.N. & Gilson M.K. BindingDB: a web-accessible database of experimentally determined protein-ligand binding affinities. Nucleic Acids Res 35, D198–201 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 141.Gaulton A. et al. ChEMBL: a large-scale bioactivity database for drug discovery. Nucleic Acids Res 40, D1100–7 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 142.Yang H. et al. Therapeutic target database update 2016: enriched resource for bench to clinical drug target and targeted pathway information. Nucleic Acids Res 44, D1069–74 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 143.Whirl-Carrillo M. et al. Pharmacogenomics knowledge for personalized medicine. Clin Pharmacol Ther 92, 414–7 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 144.Pawson A.J. et al. The IUPHAR/BPS Guide to PHARMACOLOGY: an expert-driven knowledgebase of drug targets and their ligands. Nucleic Acids Res 42, D1098–106 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 145.Shannon P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res 13, 2498–504 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 146.Harris P.A. et al. Research electronic data capture (REDCap)--a metadata-driven methodology and workflow process for providing translational research informatics support. J Biomed Inform 42, 377–81 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
We downloaded the GTEx v8 dataset from https://gtexportal.org/home/. The human protein-protein interactome and drug-target network are found in https://github.com/ChengF-Lab/COVID-19_Map. An interactive version of Fig. 1b can be found in https://github.com/ChengF-Lab/COVID-19_PPI. All other data can be found in the supplementary tables. We accessed ZNF579 ChIP-seq data from the ENCODE portal under accession ENCSR018MQH.