Skip to main content
Signal Transduction and Targeted Therapy logoLink to Signal Transduction and Targeted Therapy
letter
. 2020 Dec 11;5:290. doi: 10.1038/s41392-020-00417-y

Drug repurposing for COVID-19 using machine learning and mechanistic models of signal transduction circuits related to SARS-CoV-2 infection

Carlos Loucera 1,2,#, Marina Esteban-Medina 1,2,#, Kinza Rian 1, Matías M Falco 1,2,3, Joaquín Dopazo 1,2,3,4,, María Peña-Chilet 1,2,3,
PMCID: PMC7729141  PMID: 33311438

Dear Editor,

Drug repurposing is a convenient alternative when the need for new drugs in an unexpected medical scenario is urgent, as is the case of emerging pathogens. In recent years, approaches based on network biology have demonstrated to be superior to gene-centric ones.1 Here, we use an innovative methodology that combines mechanistic modeling of the signal transduction circuits related to SARS-CoV-2 infection (the COVID-19 disease map) with a machine-learning algorithm that learns potential causal interactions between proteins, already targets of drugs, and specific signaling circuits in the COVID-19 disease map, to suggest potentially repurposable drugs.

Mechanistic models of pathways provide a natural bridge from variations at the scale of gene activity (transcription) to variations in phenotype (at the level of cells, tissues, or organisms). Actually, mechanistic models of human signaling pathways have been successfully used to uncover specific molecular mechanisms behind different diseases, to reveal modes of action of drugs, and to suggest personalized treatments. However, the most interesting property of mechanistic models is that they can be used to predict the consequences of interventions as, for example, the effect of targeted drugs.2 The availability of a COVID-19 disease map3 can be used to build a realistic mechanistic model of the SARS-CoV-2 infection and all the downstream functional consequences that occur in the host cells. This disease map is a set of signaling transduction circuits that contain human proteins that interact with viral proteins and their upstream and downstream connections (see Supplementary Results and Supplementary Table S1, with the 277 resulting circuits from a total of 49 KEGG pathways). These affected circuits ultimately trigger cell functionalities, whose perturbation by the virus causes the COVID-19 symptoms or disease hallmarks. The main hallmarks used here to properly fit in the Uniprot annotations that define the circuit functionalities were: (1) host–virus interaction, (2) inflammatory response, (3) immune activity, (4) antiviral defense, (5) endocytosis, (6) replication, and (7) energetics. This disease map will be dynamically updated as new biological knowledge is generated by the Disease Maps community.3

Interestingly, the notion of causality provided by the mechanistic model of the COVID-19 disease map can be exploited beyond the own pathways modeled. Actually, machine-learning methods can be used to extrapolate the effect that other proteins, even if these are not part of the disease map modeled, can have over the signaling circuits of the map. We have recently demonstrated that machine learning can be used over a large dataset of gene expression data to learn how the disease hallmarks of Fanconi Anemia, a rare condition caused by defective DNA repair in the cells, could be predicted even from proteins apparently unconnected to its disease map. Such an approach, used for drug repurposing, produced a list of potentially repurposable drugs, some of which (e.g., Gefitinib and Afatinib) were further validated.4

Here, we assume that other proteins which, according to the signaling circuit activity estimations of our mechanistic model, have an influence on the status of these hallmarks might be playing some type of upstream regulator role. Consequently, this potential modulator capacity could make them suitable candidates to become therapeutic targets. Since we are interested in drug repurposing, we will only consider as candidates therapeutic targets from drugs that are already approved for other indications. To achieve so, a total of 2045 human proteins, that are known drug targets (KDTs) of a total of 1735 drugs were extracted from DrugBank. Then, a machine-learning procedure is used to “learn”, using a Multi-task Learning model (specifically a Multi-Output Random Forest regressor combined with SHapley Additive exPlanations to determine the influences of KDTs on specific signaling circuits, see Supplementary Methods for details), the relationships between the KDTs and the activities of the COVID-19 circuits that conform the disease map, as estimated by the mechanistic model, as sketched in Fig. 1a. Over 11,000 gene expression experiments representing different organs and conditions taken from the GTEx repository were used for the learning procedure (see Supplementary Methods). The drugs that target proteins with a highly relevant influence over the COVID-19 modeled hallmarks will be the candidates for repurposing.

Fig. 1.

Fig. 1

Summary of the drug repurposing strategy and the results obtained. a Schema of the procedure followed for finding drug targets that affect the COVID-19 disease map. b Two different patterns of circuits affected by six drugs (see text). c Radar plot representing the impact of the most relevant drugs listed in Supplementary Table S3 over the different COVID-19 hallmarks, quantified as the number of circuits of the corresponding hallmark are potentially affected

The results showed that 380 out of 2045 original KDTs, targeted by 679 different drugs, out of 1735 tested, have a direct relevant influence on at least one signaling circuit of the COVID-19 disease map (Supplementary Table S2 and Supplementary Fig. S1). Enrichment analysis renders significant Gene Ontology (GO) biological processes related to immune activity, especially to T cell, but also to inflammatory response, and other infectious processes, such as hepatitis, HIV, and papillomavirus infections. Moreover, GO functionalities corresponding to all COVID-19 hallmarks are represented as well. Interestingly, the enrichment in virus-caused perturbations in GEO expression showed enrichment in datasets infected with SARS-CoV and other respiratory-related infections (see Supplementary Fig. S2).

Among the drugs predicted to have a relevant effect, some of them are currently under clinical trials. Interestingly, these drugs define different functional profile templates, which may be useful to speculate similar consequences for other drugs with similar patterns of influence over the signaling circuits that define the disease hallmarks. Thus, some drugs, as Sirolimus, have a strong impact over most of the circuits, while others as Ciclosporin affect only a small number of them (seven circuits) as depicted in Fig. 1b. Indeed, Ciclosporin shares with the well-known hydroxychloroquine circuits within Toll-like and adipocytokine signaling pathways. However, while Ciclosporin is relevant for Insulin signaling, hydroxychloroquine influences Inflammatory mediator regulation of TRP channels and HIF-1 signaling pathways. Chloroquine and ciclosporin are representative of two different modes of action by either affecting massively to almost all COVID-19 disease map circuits or only affecting a few specific ones, respectively. In fact, a recently published compilation of drugs currently in clinical trials allowed to validate many predictions. Actually, most of the cited drugs targeting human proteins were predicted by the model (Supplementary Table S3). A list of the drugs targeting some of the most relevant KDTs can be found in Supplementary Table S4. A detailed functional analysis of the circuits affected by the KDTs corresponding to these drugs, selected by the hallmarks they affect and the SHAP relevance, illustrates the different drug mechanisms of action in terms of how different COVID-19 hallmarks are potentially affected (see Fig. 1c) See Supplementary Fig. S3 and Supplementary Results for a detailed discussion of the findings.

The results presented here, although promising, can be considered only a subset of the potential drug candidates for repurposing, given that the detailed definition of the COVID-19 disease map is still an ongoing effort.3 As the map is updated, new reanalysis can render more interesting drug candidates for repurposing. It is worth mentioning that this approach can be used to search for drugs that tailor highly specific interventions over particular disease hallmarks or even particular signaling circuits, which would eventually help in the reduction of undesirable side-effects shown by some drugs.

Finally, the use of mechanistic models for drug repurposing has an extra advantage: it is well-known that the rate of success in drug discovery is of ~10% because many drugs fail in the last phases of clinical trials due to a lack of knowledge of the disease mechanism as well as problems of toxicity and bioavailability.5 Repurposing solves the last two problems and mechanistic modeling solves the first one by providing the biological knowledge, in terms of the mechanistic link between the drug and the effect on the disease, that other repurposing methodologies lack.

Supplementary information

Supplementary Materials (2.4MB, docx)

Acknowledgements

This work is supported by grants SAF2017-88908-R from the Spanish Ministry of Economy and Competitiveness, PT17/0009/0006, ACCI2018/29 from CIBER-ISCIII and COV20/00788 from the ISCIII, co-funded with European Regional Development Funds (ERDF), the grant “Large-scale drug repurposing in rare diseases by genomic Big Data analysis with machine learning methods” from the Fundación BBVA (G999088Q), as well as H2020 Programme of the European Union grants Marie Curie Innovative Training Network “Machine Learning Frontiers in Precision Medicine” (MLFPM) (GA 813533).

Author contributions

C.L. developed and applied all the machine-learning algorithms; M.E.M. carried out the mechanistic modeling analysis; K.R. developed the mechanistic modeling software; M.M.F. helped with the analysis; J.D. and M.P.C. conceived the work and wrote the paper. All authors read and approved the final paper.

Data availability

The datasets used during the current study are available in the GTEx repository, https://www.gtexportal.org/home/, the DrugBank database https://www.drugbankplus.com/, and the KEGG database https://www.genome.jp/kegg/. The COVID-19 Disease Map used here is available at 10.5281/zenodo.3935733, and new versions will be made publicly available at 10.17881/covid19-disease-map. A R/Bioconductor package for the Hipathia method is available at https://www.bioconductor.org/packages/release/bioc/html/hipathia.html.

Competing interests

The authors declare no competing interests.

Footnotes

These authors contributed equally: Carlos Loucera, Marina Esteban-Medina

Contributor Information

Joaquín Dopazo, Email: joaquin.dopazo@juntadeandalucia.es.

María Peña-Chilet, Email: maria.pena.chilet.ext@juntadeandalucia.es.

Supplementary information

The online version of this article (10.1038/s41392-020-00417-y) contains supplementary material, which is available to authorized users.

References

  • 1.Cheng F, et al. Network-based approach to prediction and population-based validation of in silico drug repurposing. Nat. Commun. 2018;9:1–12. doi: 10.1038/s41467-017-02088-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Amadoz A, et al. A comparison of mechanistic signaling pathway activity analysis methods. Brief. Bioinforma. 2019;20:1655–1668. doi: 10.1093/bib/bby040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Ostaszewski M, et al. COVID-19 Disease Map, building a computational repository of SARS-CoV-2 virus-host interaction mechanisms. Sci. Data. 2020;7:136. doi: 10.1038/s41597-020-0477-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Esteban-Medina M, Peña-Chilet M, Loucera C, Dopazo J. Exploring the druggable space around the Fanconi anemia pathway using machine learning and mechanistic models. BMC Bioinforma. 2019;20:370. doi: 10.1186/s12859-019-2969-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Hay M, et al. Clinical development success rates for investigational drugs. Nat. Biotechnol. 2014;32:40. doi: 10.1038/nbt.2786. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Materials (2.4MB, docx)

Data Availability Statement

The datasets used during the current study are available in the GTEx repository, https://www.gtexportal.org/home/, the DrugBank database https://www.drugbankplus.com/, and the KEGG database https://www.genome.jp/kegg/. The COVID-19 Disease Map used here is available at 10.5281/zenodo.3935733, and new versions will be made publicly available at 10.17881/covid19-disease-map. A R/Bioconductor package for the Hipathia method is available at https://www.bioconductor.org/packages/release/bioc/html/hipathia.html.


Articles from Signal Transduction and Targeted Therapy are provided here courtesy of Nature Publishing Group

RESOURCES