Prioritizing antiviral drugs against SARS-CoV-2 by integrating viral complete genome sequences and drug chemical structures

Lihong Peng; Ling Shen; Junlin Xu; Xiongfei Tian; Fuxing Liu; Juanjuan Wang; Geng Tian; Jialiang Yang; Liqian Zhou

doi:10.1038/s41598-021-83737-5

. 2021 Mar 18;11:6248. doi: 10.1038/s41598-021-83737-5

Prioritizing antiviral drugs against SARS-CoV-2 by integrating viral complete genome sequences and drug chemical structures

Lihong Peng ^1,^#, Ling Shen ^1,^#, Junlin Xu ², Xiongfei Tian ¹, Fuxing Liu ¹, Juanjuan Wang ¹, Geng Tian ³, Jialiang Yang ^3,^✉, Liqian Zhou ^1,^✉

PMCID: PMC7973547 PMID: 33737523

Abstract

The outbreak of a novel febrile respiratory disease called COVID-19, caused by a newfound coronavirus SARS-CoV-2, has brought a worldwide attention. Prioritizing approved drugs is critical for quick clinical trials against COVID-19. In this study, we first manually curated three Virus-Drug Association (VDA) datasets. By incorporating VDAs with the similarity between drugs and that between viruses, we constructed a heterogeneous Virus-Drug network. A novel Random Walk with Restart method (VDA-RWR) was then developed to identify possible VDAs related to SARS-CoV-2. We compared VDA-RWR with three state-of-the-art association prediction models based on fivefold cross-validations (CVs) on viruses, drugs and virus-drug associations on three datasets. VDA-RWR obtained the best AUCs for the three fivefold CVs, significantly outperforming other methods. We found two small molecules coming together on the three datasets, that is, remdesivir and ribavirin. These two chemical agents have higher molecular binding energies of − 7.0 kcal/mol and − 6.59 kcal/mol with the domain bound structure of the human receptor angiotensin converting enzyme 2 (ACE2) and the SARS-CoV-2 spike protein, respectively. Interestingly, for the first time, experimental results suggested that navitoclax could be potentially applied to stop SARS-CoV-2 and remains to further validation.

Subject terms: Computational biology and bioinformatics, Drug discovery, Computer science, Information technology

Introduction

In late December, 2019, there was an outbreak of a novel febrile respiratory illness (COVID-19) in Wuhan, Hubei in China^1,2. The illness was caused by a novel coronavirus named SARS-CoV-2 by the World Health Organization (WHO) and can transmit from human to human². As of 10 a.m. Cest time on October, 18, 2020, 40,118,333 cases of SARS-CoV-2 infection and 1,114,749 cases of SARS-CoV-2-caused death have been confirmed around the world³. From February, 2020, WHO is seeking U.S. $675 million for COVID-19 preparedness to prevent human to human transmission⁴.

SARS-CoV-2 is a new human-infecting single-stranded RNA virus². It is very similar to two coronaviruses: severe acute respiratory syndrome (SARS) coronavirus (SARS-CoV) and Middle East respiratory syndrome (MERS) coronavirus (MERS-CoV). In November, 2002, SARS first emerged in Guangdong, China, and resulted in 8,098 infection cases with a fatality rate of 9.6%¹. In September, 2012, MERS was first found in humans in the Middle East and resulted in 2,465 laboratory-confirmed cases of infection with a fatality rate 34.4%⁵.

As SARS-CoV-2 is an emerging virus, no specific antiviral treatment has been developed⁶. Therefore, finding effective drug treatment options is urgently needed for combating SARS-CoV-2⁷. However, it seems unrealistic to test new drugs targeting SARS-CoV-2 within such limited time⁸. An efficient method is to screen possible drugs from available public data repositories containing FDA-approved compounds^7,9. Under such situation, computational methods could be chosen to identify special antiviral drug candidates^10–12.

Although little is known about SARS-CoV-2, its complete genome sequence suggests strong homology with SARS-CoV¹³. To identify possible antiviral drugs, in this study, we investigated the relationship between the complete genome sequences of viruses similar to SARS-CoV-2, the chemical structures of drugs, and Virus-Drug Association (VDA) network topology. We then developed a novel Random Walk with Restart method (VDA-RWR) to find possible VDAs related to SARS-CoV-2 by integrating the genome sequences and the chemical structures into a unified framework. We compared VDA-RWR with NGRHMDA¹⁴, SMiR-NBI¹⁵ and LRLSHMDA¹⁶. These three methods were applied to biological association prediction in other application areas and obtained better prediction performance. We found that remdesivir and ribavirin come together on three datasets.

Molecular docking is a key bioinformatics modeling tool for drug discovery and used to predict the “best-fit” intermolecular binding between a small molecule and a target or two proteins at the atomic level. It characterizes the behavior of ligands in the binding sites of target proteins as well as elucidates fundamental biochemical processes¹⁷. The docking process comprises two basic steps: predicting conformation, position, and orientation of ligands within the binding sites and ranking these conformations based on the binding affinity¹⁸. We used AutoDock¹⁹, a molecular docking software, to measure the molecular activities of the predicted two compounds (remdesivir and ribavirin) binding to the SARS-CoV-2 spike protein/human receptor angiotensin converting enzyme 2 (ACE2). The docking showed that remdesivir and ribavirin have higher binding energies of − 7.0 kcal/mol and − 6.59 kcal/mol with the structure of the spike protein receptor-binding domain bound to the ACE2 receptor, respectively.

Results

Experimental settings

In this section, we conducted extensive experiments to investigate the performance of our proposed VDA-RWR method. For the VDA matrix $Y_{n \times m}$ from $n$ viruses and $m$ drugs, fivefold Cross-Validations (CVs) were performed under the following three different experimental settings.

Fivefold Cross Validation 1 (CV1): CV on viruses, that is, random rows in $Y$ (i.e., viruses) were selected for testing.
Fivefold Cross Validation 2 (CV2): CV on drugs, that is, random columns in $Y$ (i.e., drugs) were selected for testing.
Fivefold Cross Validation 3 (CV3): CV on virus-drug pairs, that is, random entries in $Y$ (i.e., virus-drug pairs) were selected for testing.

Under CV1, in each round, 80% of rows in $Y$ were used as training set and the remaining 20% of rows were used as test set. Under CV2, in each round, 80% of columns in $Y$ were used as training set and the remaining 20% of columns were used as test set. Under CV3, in each round, 80% of entries in $Y$ were used as training set and the remaining 20% of entries were test set. These three settings CV1, CV2, and CV3 specially refer to potential VDA identification for (1) new viruses (especially for SARS-CoV-2), (2) new drugs, and (3) new virus-drug pairs, respectively.

Parameters $r$ , $μ$ , and $α$ denote the global restart rate, the transition probability, and the weight between the virus network and the drug network, respectively. For these three parameters, we performed cross validations on the training set to find the optimal values. In addition, the iteration stopped when ${∥p_{t + 1} - p_{t}∥}_{2} \leq 1 e - 11$ . SMiR-NBI need not set the parameters. For the parameters in NGRHMDA and LRLSHMDA, we conducted grid search to find the optimal values. The detailed settings are shown on Table 1.

Table 1.

The optimal values of parameters in VDA-RWR, NGRHMDA and LRLSHMDA.

Method	Dataset 1	Dataset 2	Dataset 3
NGRHMDA	α = 0.4, β = 0.8	α = 0.6, β = 0.9	α = 0.9, β = 0.9
LRLSHMDA	ηM = 0.9, ηD = 0.3	ηM = 0.8, ηD = 0.1	ηM = 0.6, ηD = 0.1
VDA-RWR	r = 0.7, μ = 0.9, α = 0.5	r = 0.5, μ = 0.9, α = 0.9	r = 0.7, μ = 0.9, α = 0.9

Open in a new tab

Evaluation metrics

Sensitivity, specificity, F1 score, accuracy and AUC were widely applied to evaluate the proposed methods. Sensitivity denotes the ratio of correctly predicted positive VDAs to all positive VDAs. Specificity is the ratio of correctly predicted negative VDAs to all negative VDAs (all the unknown associations were labeled as negative). F1 score denotes the harmonic mean of recall and precision. Accuracy represents the ratio of correctly predicted positive and negative VDAs to all positive and negative VDAs. We used these five metrics to evaluate the performance of VDA-RWR. They were defined as follows:

S e n s i t i v i t y = \frac{TP}{T P + F N}

S p e c i f i c i t y = \frac{TN}{T N + F P}

A c c u r a c y = \frac{T P + T N}{T P + T N + F P + F N}

F 1 s c o r e = \frac{2 T P}{2 T P + F P + F N}

where $TP$ , $FP$ , $TN$ and $FN$ represent true positive, false positive, true negative and false negative, respectively.

AUC is the average area under the receiver operating characteristics (ROC) curve. The curve can be plotted by the ratio of True Positive Rate (TPR) to False Positive Rate (FPR) according to different thresholds. TPR and FPR are defined via Eqs. (4–5).

T P R = \frac{TP}{T P + F N} = \frac{TP}{T}

F P R = \frac{FP}{F P + T N} = \frac{FP}{F}

For these five evaluation metrics, higher values represent better performance.

Performance evaluation under three fivefold CVs

We compared VDA-RWR with NGRHMDA¹⁴, SMiR-NBI¹⁵ and LRLSHMDA¹⁶. NGRHMDA was presented to find potential microbe-disease associations by integrating neighbor-based collaborative filtering and graph-based scoring¹⁴. SMiR-NBI can comprehensively identify new pharmacogenomic biomarkers by constructing a heterogeneous network connecting genes, drugs, and miRNAs¹⁵. LRLSHMDA was applied to predict human microbe-disease associations based on Laplacian regularized least squares¹⁶. These three state-of-the-art approaches obtained good performance in their corresponding applications. We performed these four methods for 100 times on three different fivefold CV settings on three datasets. The final performance was averaged over the five rounds for 100 times. The results are shown in Tables 2, 3, and4. The best results were shown in bold in each column.

Table 2.

The performance comparison of four methods on three datasets under CV1.

Datasets	Methods	Sensitivity	Specificity	F1 score	Accuracy	AUC
Dataset 1	NGRHMDA	0.7278	0.3991	0.0643	0.4092	0.7026
	SMiR-NBI	0.8086	0.2164	0.0366	0.2296	0.5806
	LRLSHMDA	0.1299	0.6171	0.0084	0.6057	0.1844
	VDA-RWR	0.4977	0.7959	0.1055	0.7905	0.8157
Dataset 2	NGRHMDA	0.3987	0.5521	0.0329	0.5495	0.4301
	SMiR-NBI	0.8238	0.0949	0.0332	0.1087	0.4003
	LRLSHMDA	0.3507	0.4667	0.0179	0.4643	0.3173
	VDA-RWR	0.5106	0.6832	0.0844	0.6801	0.6932
Dataset 3	NGRHMDA	0.4435	0.4560	0.0232	0.4563	0.4058
	SMiR-NBI	0.9124	0.0459	0.0227	0.0567	0.4092
	LRLSHMDA	0.1801	0.5817	0.0074	0.5766	0.2920
	VDA-RWR	0.5270	0.7025	0.0812	0.7006	0.7276

Open in a new tab

Bold values indicates the best values for the different methods under the same evaluation.

Table 3.

The performance comparison of four methods on three datasets under CV2.

Datasets	Methods	Sensitivity	Specificity	F1 score	Accuracy	AUC
Dataset 1	NGRHMDA	0.6435	0.6719	0.0850	0.6713	0.8329
	SMiR-NBI	0.8510	0.1917	0.0393	0.2064	0.6021
	LRLSHMDA	0.7938	0.5773	0.1122	0.5820	0.8249
	VDA-RWR	0.5070	0.8932	0.1294	0.8846	0.9182
Dataset 2	NGRHMDA	0.4867	0.8027	0.0719	0.7967	0.8017
	SMiR-NBI	0.9971	0.0929	0.0404	0.1098	0.7205
	LRLSHMDA	0.7720	0.4166	0.0639	0.4232	0.7334
	VDA-RWR	0.5045	0.7981	0.0814	0.7926	0.8025
Dataset 3	NGRHMDA	0.4579	0.6785	0.0279	0.6758	0.6772
	SMiR-NBI	0.9751	0.0434	0.0243	0.0549	0.5665
	LRLSHMDA	0.7420	0.5264	0.0493	0.5290	0.7468
	VDA-RWR	0.5054	0.8098	0.0628	0.8061	0.8168

Open in a new tab

Bold values indicates the best values for the different methods under the same evaluation.

Table 4.

The performance comparison of four methods on three datasets under CV3.

Datasets	Methods	Sensitivity	Specificity	F1 score	Accuracy	AUC
Dataset 1	NGRHMDA	0.5783	0.5567	0.0615	0.5572	0.6459
	SMiR-NBI	0.8331	0.1936	0.0385	0.2079	0.5723
	LRLSHMDA	0.8034	0.5813	0.1119	0.5863	0.8403
	VDA-RWR	0.4824	0.7831	0.1153	0.8278	0.8582
Dataset 2	NGRHMDA	0.4544	0.3562	0.0218	0.3581	0.3011
	SMiR-NBI	0.8349	0.0942	0.0336	0.1080	0.4156
	LRLSHMDA	0.7838	0.4840	0.0733	0.4896	0.8248
	VDA-RWR	0.5022	0.6643	0.0574	0.6613	0.6675
Dataset 3	NGRHMDA	0.3582	0.4081	0.0119	0.4074	0.2554
	SMiR-NBI	0.9230	0.0427	0.0230	0.0536	0.4365
	LRLSHMDA	0.8124	0.5239	0.0552	0.5275	0.8169
	VDA-RWR	0.5053	0.7057	0.0556	0.7032	0.7123

Open in a new tab

Bold values indicates the best values for the different methods under the same evaluation.

On dataset 1 and dataset 3, VDA-RWR outperformed other three methods in terms of specificity, accuracy, F1 score and AUC under three CVs. On dataset 2, although the sensitivity of VDA-RWR was slightly lower, VDA-RWR computed better specificity, accuracy, F1 score and AUC under majority of conditions. The slight difference can be produced by different data structures. AUC is one more important evaluation metric compared to other four measurements. AUC = 0.5 represents random performance and AUC = 1 shows perfect performance. VDA-RWR obtained the best AUCs under majority of conditions. In general, VDA-RWR is proper to discover potential VDAs.

In addition, under CV1, VDA-RWR computed better specificity, accuracy, F1 score and AUC on the three datasets. This result showed that VDA-RWR can effectively find possible antiviral drugs for new viruses (for example, SARS-CoV-2). Under CV2, VDA-RWR outperformed other three methods in terms of specificity, accuracy, F1 score and AUC on dataset 1 and dataset 3. Although the sensitivity, specificity and accuracy values of VDA-RWR were slightly lower than other individual methods on dataset 2, it obtained the best F1 score and AUC. Thus AUC can identify potential viruses associated with new drugs. Under CV3, VDA-RWR calculated the best specificity, F1 score and accuracy. Figure 1 showed the AUC values of four methods under CV1, CV2, and CV3. The results demonstrated that VDA-RWR obtained relatively higher AUCs under three different CVs. It suggested that VDA-RWR could be used to infer potential VDAs.

The AUC values of VDA-RWR under different CVs on three datasets.

Case study

In this section, we want to find possible drugs for SARS-CoV-2 after verifying the performance of our proposed VDA-RWR method. We predicted the top 10 drugs with the highest association scores with SARS-CoV-2 on three datasets. The results were shown in Tables 5, 6, and 7, respectively. Among the predicted top 10 small molecules associated with SARS-CoV-2 on dataset 1, all drugs were supported by recent works. Among the predicted top 10 chemical agents related to SARS-CoV-2 on dataset 2, there were 9 VDAs validated by current literatures, that is, 90% chemical agents were reported. Among the predicted top 10 antiviral drugs against SARS-CoV-2 on dataset 3, all compounds were validated by recent publications.

Table 5.

The predicted top 10 drugs associated with SARS-CoV-2.

Rank	Drug	Evidence
1	Remdesivir	PMID: 32020029, 31996494, 32022370, 31971553, 32035018, 32035533, 32036774, 32194944, 32275812, 32145386, 32838064
		https://doi.org/10.1101/2020.01.28.922922
2	Oseltamivir	PMID: 32034637, 32127666
3	Ribavirin	PMID: 32034637, 32127666, 32227493, 26492219, 32771797
4	Zanamivir	PMID: 32511320
5	Presatovir	PMID: 32147628
6	Elvitegravir	PMID: 32147628
7	Zidovudine	PMID: 32568013
8	Emtricitabine	PMID: 32488835
9	Mycophenolic acid	PMID: 32579258
10	Chloroquine	PMID: 32020029, 32145363, 32074550, 32236562

Open in a new tab

Table 6.

The predicted top 10 drugs associated with SARS-CoV-2 on dataset 2.

Rank	Drug	Evidence
1	Favipiravir	PMID: 32346491, 32967849, 32972430
2	Remdesivir	PMID: 32020029, 31996494, 32022370, 31971553, 32035018, 32035533, 32036774, 32194944, 32275812, 32145386, 32838064
		https://doi.org/10.1101/2020.01.28.922922
3	Cidofovir	PMID: 32546018
		https://doi.org/10.1007/s10067-020-05133-0
4	Galidesivir	PMID: 32711596
5	Niclosamide	PMID: 32125140, 32221153
6	Mycophenolic acid	PMID: 3257258
7	Itraconazole	https://doi.org/10.22541/au.159467021.16927198
8	Brequinar	PMID: 32426387
9	Navitoclax	Unconfirmed
10	Ribavirin	PMID: 32034637, 32127666, 32227493, 26492219, 32771797

Open in a new tab

Table 7.

The predicted top 10 drugs associated with SARS-CoV-2 on dataset 3.

Rank	Drug	Evidence
1	Nitazoxanide	PMID: 32127666, 32568620, 32448490
2	Ribavirin	PMID: 3203637, 32127666, 32227493, 26492219, 32771797
3	Chloroquine	PMID: 32020029, 32145363, 32074550, 32236562
4	Hexachlorophene	PMID: 15950190
5	Camostat	PMID: 32347443
6	Favipiravir	PMID: 32246834
7	Umifenovir	PMID: 32941741
8	Remdesivir	PMID: 32020029, 31996494, 32022370, 31971553, 32035018, 32035533, 32036774, 32194944, 32275812, 32145386, 32838064
		https://doi.org/10.1101/2020.01.28.922922
9	Amantadine	PMID: 32361028
10	Niclosamide	PMID: 32125140, 32221153

Open in a new tab

The results on Tables 5, 6, and 7 showed that there were two FDA-approved drugs coming together on three datasets, that is, remdesivir and ribavirin. Remdesivir is a small molecular compound undergoing a clinical trial and shows superior antiviral activity against many RNA viruses including orthocoronavirinae, filoviridae, paramyxoviridae, and pneumoviridae^20–22. Sheahan et al.¹⁷ presented that it can improve pulmonary function and reduce severe lung pathology in mice. Similar to SARS-CoV-2, both Ebola virus (EBOV) and MERS-CoV may result in severe acute respiratory diseases. And remdesivir has been used as inhibitors of EBOV and MERS-CoV^20,21. More importantly, an array of works have reported that remdesivir is highly effective in controlling SARS-CoV-2 infection and has been directly applied to the treatment of COVID-19^{6,7,9,23–28}. Specially, on October 22, 2020, FDA approved remdesivir for use in adults, pediatric patients with age of 12 years, and older and weighing at least 40 kg²⁹. All these results showed that remdesivir may be the best anti-SARS-CoV-2 drug.

Ribavirin is identified as another anti-SARS-CoV-2 drug with a higher association score. Huang et al.⁵ found that 28 of 38 patients treated by ribavirin have been discharged. Zhang et al.³⁰ reported that a patient has been treated with antiviral drugs including ribavirin. Therefore, ribavirin may be applied to treat COVID-19 caused by SARS-CoV-2. Interestingly, for the first time, experimental results suggested that navitoclax could be potentially applied to stop SARS-COV-2. Navitoclax has been applied to boost the treatment and basic science of chronic lymphoid leukemia, hematological malignancies, non-Hodgkin's lymphoma, solid tumors, and EGFR activating mutation.

Molecular docking

The molecular docking between the above two antiviral drugs (remdesivir and ribavirin) and the spike protein and ACE2 are described in Table 8. The results showed that remdesivir and ribavirin have higher binding energies of − 7.0 kcal/mol and − 6.59 kcal/mol with the structure of the spike protein receptor-binding domain bound to the ACE2 receptor, respectively. The subfigure in each circle denotes the residues at the binding site of the spike protein/ACE2 and their corresponding orientations. For example, the amino acids K68 and Q493 were predicted to be the key residues for remdesivir binding to the SARS-CoV-2 spike protein/ACE2 while K353, R403, Q493 and G496 were predicted as the key residues for ribavirin binding to these two target proteins.

Table 8.

Molecular docking between remdesivir and ribavirin and the SARS-CoV-2 spike (S) protein/ACE2.

Ligand	Molecular formula	Binding energy (kcal/mol)	Binding sites	Distance(Å)
Remdesivir	C₂₇H₃5N₆O₈P	− 7.0	K68	2.0
Remdesivir	C₂₇H₃5N₆O₈P	− 7.0	Q493	2.3
Ribavirin	C₁₈H₂₆CIN₃	− 6.59	K353	2.2
			R403	2.1
			Q493	2.0
			G496	1.9

Open in a new tab

In Table 8, green denotes the structure of ACE2 and cyan denotes the SARS-CoV-2 spike protein in the figures of molecular docking.

Discussion

Finding possible antiviral drugs against SARS-CoV-2 is extremely urgent with the rapid spread of COVID-19. However, it seems very difficult to design a novel drug for COVID-19 within a very short time. One of efficient ways is to identify new clues of the treatment from FDA-approved drugs.

In our proposed VDA-RWR method, we computed the association scores for each virus-drug pair to predict potential antiviral drugs against SARS-CoV-2 based on random walk with restart and biological information of viruses and drugs. The originality of our proposed VDA-RWR method remains, constructing three small datasets and inferring possible antiviral chemical agents against SARS-CoV-2 from FDA-approved drugs. The comparative experiments showed better performance of the VDA-RWR method. Higher AUC values under three fivefold CVs on three datasets and molecular binding energies indicated that the selected small molecules are likely to be used to stop the transmission of COVID-19.

VDA-RWR can obtain superior performance under the three fivefold CVs on three datasets. This observation may be attributed to random walk with restart, a state-of-the-art model that can randomly walk on the heterogeneous virus-drug network and effectively compute association score for each virus-drug pair. More importantly, VDA-RWR integrated various biological information including the complete genome sequences of viruses and chemical structures of chemical agents.

The proposed VDA-RWR method is also helpful in design and interpretation of pharmacological experiment related to COVID-19. More importantly, VDA-RWR can be further applied to predict antiviral drugs against novel viruses without any associated chemical agents.

Methods

Virus-drug association data

Dataset 1

Virus data

We considered 11 viruses similar to SARS-CoV-2. These viruses include influenza A viruses including A-H1N1³², A-H5N1³³, and A-H7N9³⁴, chronic hepatitis C virus (HCV)³⁵, human immunodeficiency virus type 1 (HIV-1)³⁶, human immunodeficiency virus type 2 (HIV-2)³⁷, hendra virus³⁸, human cytomegalovirus³⁹, MERS-CoV⁴⁰, respiratory syncytial virus⁴¹ and SARS-CoV⁴². The complete genome sequences of these viruses are downloaded from the NCBI database⁴³. We used MAFFT⁴⁴ (https://mafft.cbrc.jp/alignment/software/, version 7, open source license: GPL or BSD), a multiple sequence alignment tool, to compute virus-virus sequence similarity matrix $S_{v}$ . All parameters were set as the default values provided by MAFFT.

Drug data

We manually curated drugs associated with these 11 viruses from the DrugBank⁴⁵ and NCBI⁴³ databases and published literatures reported by the PubMed database⁴⁶ and collected 78 small molecules after removing macromolecules. Based on the assumption that two drugs are more similar if they share more chemical substructures, drug-drug similarity can be computed. Extended connectivity fingerprints (ECFPs)⁴⁷ are circular fingerprints and developed for structure–activity modeling and molecular feature description. We used RDKit⁴⁸ (https://github.com/rdkit/rdkit, releases 131, open source license: BSD), an open-source cheminformatics software, to compute ECFPs of drugs with a radius of 2. Drug-drug chemical structure similarity matrix $S_{d}$ can be computed by the ECFPs of drugs.

VDAs

We searched the publicly available repositories including the DrugBank⁴⁵ and NCBI⁴³ databases and publications reported by the PubMed database⁴⁶. At the time of writing, we obtained 96 virus-drug associations (VDAs) between 11 viruses and 78 drugs. We described A-H1N1³², A-H5N1³³, and A-H7N9³⁴ as three viruses although they belong to influenza A for the sake of description.

Dataset 2

The DrugVirus.info database⁴⁹ (https://drugvirus.info/) provided various VDA-related resources. We obtained 770 VDAs from 69 viruses and 128 drugs after removing the viruses whose complete genome sequences are unknown from the database. The chemical structure of drugs and the complete genome sequences of viruses were downloaded from the DrugBank database and the NCBI database, respectively. Similar to dataset 1, we used RDKit and MAFFT to calculate drug similarity and virus similarity.

Dataset 3

We retrieved 407 VDAs from 34 viruses and 203 drugs by searching documents related to viruses and drugs based on text mining techniques. Similar to dataset 1, we computed drug similarity matrix and virus similarity matrix. The details of three datasets are shown in Table 9.

Table 9.

Statistics for the virus-drug association networks.

Datasets	Viruses	Drugs	VDAs
Dataset 1	12	78	96
Dataset 2	69	128	770
Dataset 3	34	203	407

Open in a new tab

In this study, the set of known VDAs was considered as the ‘gold standard’ dataset and was applied to evaluate the performance of our proposed VDA-RWR method. We described the known VDAs as a matrix $Y$ :

Y_{ij} = \{\begin{matrix} 1 & i f v_{i} a s s o c i a t e s w i t h d_{j} \\ 0 & otherwise \end{matrix})

where $v_{i}$ and $d_{j}$ represent the $i$ th virus and $j$ th drug, respectively.

The VDA-RWR method

Inspired by the method provided by Valdeolivas et al.⁵⁰, we developed a VDA prediction method based on Random Walk with Restart on the heterogeneous network (VDA-RWR). The proposed VDA-RWR method comprised two steps. First, a random walk-based model integrating various biological data was learned to explain the constructed ‘gold standard’ dataset. Second, this model was used to find potential VDAs for viruses and drugs absent from the ‘gold standard’ dataset. The details are shown Fig. 2.

We first considered virus-virus similarity graph $G_{v}$ , drug-drug similarity graph $G_{d}$ , and VDA graph $G_{a}$ , which formed a heterogeneous network. We defined $S_{v (n \times n)}$ , $S_{d (m \times m)}$ , and $Y_{(n \times m)}$ as their corresponding adjacency matrices. The adjacency matrix of the heterogeneous network can be denoted as: $W = [\begin{matrix} S_{v} & Y \\ Y^{T} & S_{d} \end{matrix}]$ , where $Y^{T}$ denoted the transpose of the VDA matrix $Y$ .

We then calculated the transition probabilities of random walk with restart on the heterogeneous network. Suppose $W = [\begin{matrix} W_{vv} & W_{vd} \\ W_{dv} & W_{dd} \end{matrix}]$ represented the matrix of transitions on the heterogeneous network, where $W_{vv}$ / $W_{dd}$ denoted the walk within the virus/drug network, $W_{vd}$ / $W_{dv}$ described the jump from the virus/drug network to the drug/virus network. Given the probability $μ$ of jumping from the virus/drug network to the drug/virus network, the transition probability from virus $v_{i}$ to virus $v_{j}$ was defined as

W_{vv} (i, j) = \{\begin{matrix} \frac{S_{v} (i, j)}{\sum_{k = 1}^{n} S_{v} (i, k)} & i f \sum_{k = 1}^{m} Y (i, k) = 0 \\ \frac{(1 - μ) S_{v} (i, j)}{\sum_{k = 1}^{n} S_{v} (i, k)} & otherwise \end{matrix})

The transition probability from virus $v_{i}$ to drug $d_{j}$ was defined as

W_{vd} (i, j) = \{\begin{matrix} \frac{μ Y (i, j)}{\sum_{k = 1}^{m} Y (i, k)} & i f \sum_{k = 1}^{m} Y (i, k) \neq 0 \\ 0 & otherwise \end{matrix})

The transition probability from drug $d_{i}$ to drug $d_{j}$ was defined as

W_{dd} (i, j) = \{\begin{matrix} \frac{S_{d} (i, j)}{\sum_{k = 1}^{m} S_{d} (i, k)} & i f \sum_{k = 1}^{n} Y (k, i) = 0 \\ \frac{(1 - μ) S_{d} (i, j)}{\sum_{k = 1}^{m} S_{d} (i, k)} & otherwise \end{matrix})

The transition probability from drug $d_{i}$ to virus $v_{j}$ was defined as

W_{dv} (i, j) = \{\begin{matrix} \frac{μ Y (j, i)}{\sum_{k = 1}^{n} Y (k, i)} & i f \sum_{k = 1}^{n} Y (k, i) \neq 0 \\ 0 & otherwise \end{matrix})

For a given virus/drug, the particle can either jump between graphs or stay in the current graph with a defined probability $r \in (0, 1)$ . Therefore, we finally defined the random walk with a restart probability $r$ as:

p_{t + 1} = r W p_{t} + (1 - r) p_{0}

where $p_{t}$ represented the computed association probability at the $t$ -th step random walk. We defined the initial probability as: $p_{0} = [\begin{matrix} α u_{0} \\ (1 - α) t_{0} \end{matrix}]$ , where $u_{0}$ and $t_{0}$ denoted the initial probability on the drug network and the virus network, respectively. If we tend to identify possible drugs for a given virus $v_{i}$ , it is considered as a seed node in the virus network. Here, $v_{i}$ was assigned as 1 and other nodes as 0, constructing the initial probability of the virus network $t_{0}$ . All nodes in the drug network $u_{0}$ were assigned as equal probabilities with the sum of 1. For example, to find potential antiviral drugs against SARS-CoV-2, we set SARS-CoV-2 as a seed node, and all drugs in the drug network were assigned as the same probabilities with the values of $1 / m$ . The parameter $α$ was used to control the weight of the virus network and the drug network. In addition, a virus is new if it does not associate with any drugs, and a drug is new if it is not applied to any viruses.

Molecular docking

Molecular docking technique was applied to compute the intermolecular binding abilities between the predicted anti-SARS-CoV-2 drugs and the SARS-CoV-2 spike protein/human ACE2. The chemical structures of drugs were downloaded in the form of the PDB format files from the DrugBank database. We used AutoDockTools to convert these PDB files into pdbqt files needed by AutoDock4. The structures of SARS-CoV-2 spike receptor-binding domain bound with ACE2 (PDB ID: 6M0J) were downloaded from the RCSB Protein Data Bank⁵¹. The spike protein and ACE2 were used as receptors, and the predicted anti-SARS-CoV-2 drugs were used as ligands for the molecular docking.

We first removed solvent and organic compounds and preprocessed the receptor proteins based on PyMOL³¹ (https://github.com/schrodinger/pymol-open-source, release 2.4.0, open source license: BSD-like). The receptors’ atoms were assigned the AD4 type and Gasteiger charges were considered before docking. Molecular docking software, AutoDock¹⁹ (http://autodock.scripps.edu/, AutoDock 4.2.6, open source license: GPL), was then used to conduct molecular docking. The binding pocket was defined by AutoGrid4, the grid size was set to 82 × 154 × 84 with a spacing of 0.375 Å, and the grid center was placed at the area of SARS-CoV-2 spike receptor-binding domain bounding with ACE2 (x = − 36.884, y = 29.245, z = − 0.005). The LGA (Lamarckian genetic algorithm) with default parameter provided by AutoDock4 was used as the search method. The docking contained two main processes: computation of conformation, position, and orientation of ligands within the binding sites and ranking of these conformations based on the binding affinities¹⁸.

Conclusion

To find potential antiviral drugs, in this study, we integrated the complete genome sequences of viruses, the chemical structures of drugs, and the VDA network. We then developed a VDA prediction method based on random walk with restart on the heterogeneous network. The results suggested that remdesivir and ribavirin may be applied to the treatment of COVID-19. In the emergency situation, this study focused more on finding antiviral drugs. In the future, we will further integrate more biological data and design more powerful models to improve the accuracy of VDA identification. We hope that our proposed VDA-RWR method could help the screening of drugs for preventing COVID-19.

Supplementary Information

Supplementary Information 1.^{(1.7MB, zip)}

Supplementary Information 2.^{(1.4MB, zip)}

Supplementary Information 3.^{(1.6MB, zip)}

Acknowledgements

We are thankful for help from Guangyi Liu, Ming Kuang, and Longjie Liao from Hunan University of Technology, and Ruyi Dong, Lebin Liang, Qinqin Lu, and Jidong Lang from Geneis (Beijing) Co. Ltd. We would like to thank all authors of the cited references.

Author contributions

L.-H.P. and L.S. contributed equally to this work. L.-H.P., J.-L.X., J.-L.Y., and L.-Q.Z. conceived the study, designed the schedule, and analyzed the data. F.-X.L. screened the viruses similar to SARS-CoV-2, X.-F.T. downloaded the genome sequences of viruses and computed virus similarity matrix, L.S. computed drug similarity matrix, L.S., X.-F.T., F.-X.L., and J.-J.W. constructed VDA network, L.S. run random walk algorithm, L.-H.P., G.T., and L.-Q.Z. wrote the paper, J.-L.Y. revised the original draft. All authors read and approved the final manuscript.

Funding

This research was funded by National Natural Science Foundation of China (Grant 61803151) and Natural Science Foundation of Hunan province (Grant 2018JJ3570, 2018JJ2461).

Data availability

Source codes and datasets are freely available for download at https://github.com/plhhnu/VDA-RWR/.

Competing interests

Authors Geng Tian and Jialiang Yang were employed by the company Geneis (Beijing) Co. Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be constructed as a potential conflict of interest.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: Lihong Peng and Ling Shen.

Contributor Information

Jialiang Yang, Email: yangjl@geneis.cn.

Liqian Zhou, Email: zhoulq11@163.com.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-021-83737-5.

References

1.Hui DS, et al. The continuing 2019-nCoV epidemic threat of novel coronaviruses to global health: The latest 2019 novel coronavirus outbreak in Wuhan, China. Int. J. Infect. Dis. 2020;91:264–266. doi: 10.1016/j.ijid.2020.01.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Lu R, et al. Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding. The Lancet. 2020;395:565–574. doi: 10.1016/S0140-6736(20)30251-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.WHO. Coronavirus disease 2019 (COVID-19) Situation Report- COVID-19 Weekly Epidemiological Update. (2020). Organ. WHO (2020). (https://www.who.int/docs/default-source/coronaviruse/situation-reports/20201020weekly-epi-update-10.pdf. Accessed 20 Oct 2020.
4.Organization, W. H. US $675 Million Needed for New Coronavirus Preparedness and Response Global Plan [Internet]. Geneva: World Health Organization; 2020 [cited 2020 Feb 5]. (2020).
5.Huang C, et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. The Lancet. 2020;395:497–506. doi: 10.1016/S0140-6736(20)30183-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Wang M, et al. Remdesivir and chloroquine effectively inhibit the recently emerged novel coronavirus (2019-nCoV) in vitro. Cell Res. 2020;30:269–271. doi: 10.1038/s41422-020-0282-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Lu H. Drug treatment options for the 2019-new coronavirus (2019-nCoV) Biosci. Trends. 2020;14:69–71. doi: 10.5582/bst.2020.01020. [DOI] [PubMed] [Google Scholar]
8.Chen H, Du Q. Potential natural compounds for preventing SARS-CoV-2 (2019-nCoV) infection. Preprints. 2020 doi: 10.20944/preprints202001.0358.v3. [DOI] [Google Scholar]
9.Li Y, et al. Therapeutic drugs targeting 2019-nCoV main protease by high-throughput screening. bioRxiv. 2020 doi: 10.1101/2020.01.28.922922. [DOI] [Google Scholar]
10.Kumar S. Drug and vaccine design against novel coronavirus (2019-nCoV) spike protein through computational approach. Preprints. 2020 doi: 10.20944/preprints202002.0071.v1. [DOI] [Google Scholar]
11.Zhang H, et al. Deep learning based drug screening for novel coronavirus 2019-nCov. Interdiscip. Sci. Comput. Life Sci. 2020 doi: 10.20944/preprints202002.0061.v1. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Beck BR, Shin B, Choi Y, Park S, Kang K. Predicting commercially available antiviral drugs that may act on the novel coronavirus (SARS-CoV-2) through a drug-target interaction deep learning model. Comput. Struct. Biotechnol. J. 2020;18:784–790. doi: 10.1016/j.csbj.2020.03.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Zhou P, et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. 2020;579:270–273. doi: 10.1038/s41586-020-2012-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Huang Y-A, et al. Prediction of microbe–disease association from the integration of neighbor and graph with collaborative recommendation model. J. Transl. Med. 2017;15:1–11. doi: 10.1186/s12967-016-1111-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Li J, et al. Network-based identification of microRNAs as potential pharmacogenomic biomarkers for anticancer drugs. Oncotarget. 2016;7:45584–45596. doi: 10.18632/oncotarget.10052. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Wang F, et al. LRLSHMDA: Laplacian regularized least squares for human microbe–disease association prediction. Sci. Rep. 2017;7:1–11. doi: 10.1038/s41598-016-0028-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.McConkey BJ, Sobolev V, Edelman M. The performance of current methods in ligand–protein docking. Curr. Sci. 2002;1:845–856. [Google Scholar]
18.Meng X-Y, Zhang H-X, Mezei M, Cui M. Molecular docking: A powerful approach for structure-based drug discovery. Curr. Comput. Aided Drug Des. 2011;7:146–157. doi: 10.2174/157340911795677602. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Morris GM, et al. AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility. J. Comput. Chem. 2009;30:2785–2791. doi: 10.1002/jcc.21256. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Tchesnokov EP, Feng JY, Porter DP, Götte M. Mechanism of inhibition of ebola virus RNA-dependent RNA polymerase by remdesivir. Viruses. 2019;11:326. doi: 10.3390/v11040326. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Sheahan TP, et al. Comparative therapeutic efficacy of remdesivir and combination lopinavir, ritonavir, and interferon beta against MERS-CoV. Nat. Commun. 2020;11:1–14. doi: 10.1038/s41467-019-13940-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Brown AJ, et al. Broad spectrum antiviral remdesivir inhibits human endemic and zoonotic deltacoronaviruses with a highly divergent RNA dependent RNA polymerase. Antiviral Res. 2019;169:104541. doi: 10.1016/j.antiviral.2019.104541. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Morse JS, Lalonde T, Xu S, Liu WR. Learning from the past: Possible urgent prevention and treatment options for severe acute respiratory infections caused by 2019-nCoV. ChemBioChem. 2020;21:730–738. doi: 10.1002/cbic.202000047. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Paules CI, Marston HD, Fauci AS. Coronavirus infections: more than just the common cold. JAMA. 2020;323:707–708. doi: 10.1001/jama.2020.0757. [DOI] [PubMed] [Google Scholar]
25.Chen YW, Yiu C-P, Wong K-Y. Prediction of the 2019-nCoV 3C-like protease (3CLpro) structure: Virtual screening reveals velpatasvir, ledipasvir, and other drug repurposing candidates. ChemRxiv. 2020 doi: 10.26434/chemrxiv.11831103. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Zumla A, Hui DS, Azhar EI, Memish ZA, Maeurer M. Reducing mortality from 2019-nCoV: Host-directed therapies should be an option. The Lancet. 2020;395:35–36. doi: 10.1016/S0140-6736(20)30305-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Wang F-S, Zhang C. What to do next to control the 2019-nCoV epidemic? The Lancet. 2020;395:391–393. doi: 10.1016/S0140-6736(20)30300-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Malik YS, et al. Emerging novel coronavirus (2019-nCoV)—current scenario, evolutionary perspective based on genome analysis and recent developments. Vet. Q. 2020;40:68–76. doi: 10.1080/01652176.2020.1727993. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.FDA. FDA’s approval of Veklury (remdesivir) for the treatment of COVID-19—The Science of Safety and Effectiveness. https://www.fda.gov/drugs/drug-safety-and-availability/fdas-approval-veklury-remdesivir-treatment-covid-19-science-safety-and-effectiveness, October 22, 2020. Accessed 24 Oct 2020.
30.Zhang Z, et al. Clinical features and treatment of 2019-nCov pneumonia patients in Wuhan: Report of a couple cases. Virol. Sin. 2020;35:330–336. doi: 10.1007/s12250-020-00203-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Schrodinger, L. The PyMOL molecular graphics system. Version1, (2010).
32.Fraser C, et al. Pandemic potential of a strain of influenza A (H1N1): EARLY FINDINGS. Science. 2009;324:1557–1561. doi: 10.1126/science.1176062. [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Claas EC, et al. Human influenza A H5N1 virus related to a highly pathogenic avian influenza virus. The Lancet. 1998;351:472–477. doi: 10.1016/S0140-6736(97)11212-0. [DOI] [PubMed] [Google Scholar]
34.Gao R, et al. Human infection with a novel avian-origin influenza A (H7N9) virus. N. Engl. J. Med. 2013;368:1888–1897. doi: 10.1056/NEJMoa1304459. [DOI] [PubMed] [Google Scholar]
35.Fried MW, et al. Peginterferon Alfa-2a plus ribavirin for chronic hepatitis C virus infection. N. Engl. J. Med. 2002;347:975–982. doi: 10.1056/NEJMoa020047. [DOI] [PubMed] [Google Scholar]
36.Navia MA, et al. Three-dimensional structure of aspartyl protease from human immunodeficiency virus HIV-1. Nature. 1989;337:615–620. doi: 10.1038/337615a0. [DOI] [PubMed] [Google Scholar]
37.Mörner A, et al. Primary human immunodeficiency virus type 2 (HIV-2) isolates, like HIV-1 isolates, frequently use CCR5 but show promiscuity in coreceptor usage. J. Virol. 1999;73:2343–2349. doi: 10.1128/JVI.73.3.2343-2349.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
38.Young PL, Halpin K, Mackenzie JS, Field HE. Isolation of hendra virus from pteropid bats: A natural reservoir of Hendra virus. J. Gen. Virol. 2000;81:1927. doi: 10.1099/0022-1317-81-8-1927. [DOI] [PubMed] [Google Scholar]
39.Chee MS, et al. Analysis of the protein-coding content of the sequence of human cytomegalovirus strain AD169. In: McDougall JK, et al., editors. Cytomegaloviruses. New York: Springer; 1990. pp. 125–169. [DOI] [PubMed] [Google Scholar]
40.de Groot RJ, et al. Commentary: Middle east respiratory syndrome coronavirus (MERS-CoV): Announcement of the coronavirus study group. J. Virol. 2013;87:7790–7792. doi: 10.1128/JVI.01244-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
41.Mazur NI, et al. The respiratory syncytial virus vaccine landscape: Lessons from the graveyard and promising candidates. Lancet Infect. Dis. 2018;18:295–311. doi: 10.1016/S1473-3099(18)30292-5. [DOI] [PubMed] [Google Scholar]
42.Bosch BJ, et al. Severe acute respiratory syndrome coronavirus (SARS-CoV) infection inhibition using spike protein heptad repeat-derived peptides. Proc. Natl. Acad. Sci. 2004;101:8455–8460. doi: 10.1073/pnas.0400576101. [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Sayers EW, et al. Database resources of the national center for biotechnology information. Nucleic Acids Res. 2020;48:D9. doi: 10.1093/nar/gkz899. [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Katoh K, Rozewicki J, Yamada KD. MAFFT online service: Multiple sequence alignment, interactive sequence choice and visualization. Brief. Bioinform. 2019;20:1160–1166. doi: 10.1093/bib/bbx108. [DOI] [PMC free article] [PubMed] [Google Scholar]
45.Wishart DS, et al. DrugBank 5.0: a major update to the DrugBank database for 2018. Nucleic Acids Res. 2018;46:D1074–D1082. doi: 10.1093/nar/gkx1037. [DOI] [PMC free article] [PubMed] [Google Scholar]
46.Canese, K. & Weis, S. PubMed: the bibliographic database. in The NCBI Handbook [Internet]. 2nd edition (National Center for Biotechnology Information (US), 2013).
47.Rogers D, Hahn M. Extended-connectivity fingerprints. J. Chem. Inf. Model. 2010;50:742–754. doi: 10.1021/ci100050t. [DOI] [PubMed] [Google Scholar]
48.Landrum, G., et al. RDKit: Open-Source Cheminformatics Software. (2016).
49.Andersen PI, et al. Discovery and development of safe-in-man broad-spectrum antiviral agents. Int. J. Infect. Dis. 2020;93:268–276. doi: 10.1016/j.ijid.2020.02.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
50.Valdeolivas A, et al. Random walk with restart on multiplex and heterogeneous biological networks. Bioinformatics. 2019;35:497–505. doi: 10.1093/bioinformatics/bty637. [DOI] [PubMed] [Google Scholar]
51.Berman HM, et al. The protein data bank. Nucleic Acids Res. 2000;28:235–242. doi: 10.1093/nar/28.1.235. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Information 1.^{(1.7MB, zip)}

Supplementary Information 2.^{(1.4MB, zip)}

Supplementary Information 3.^{(1.6MB, zip)}

Data Availability Statement

Source codes and datasets are freely available for download at https://github.com/plhhnu/VDA-RWR/.

[CR1] 1.Hui DS, et al. The continuing 2019-nCoV epidemic threat of novel coronaviruses to global health: The latest 2019 novel coronavirus outbreak in Wuhan, China. Int. J. Infect. Dis. 2020;91:264–266. doi: 10.1016/j.ijid.2020.01.009. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR2] 2.Lu R, et al. Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding. The Lancet. 2020;395:565–574. doi: 10.1016/S0140-6736(20)30251-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR3] 3.WHO. Coronavirus disease 2019 (COVID-19) Situation Report- COVID-19 Weekly Epidemiological Update. (2020). Organ. WHO (2020). (https://www.who.int/docs/default-source/coronaviruse/situation-reports/20201020weekly-epi-update-10.pdf. Accessed 20 Oct 2020.

[CR4] 4.Organization, W. H. US $675 Million Needed for New Coronavirus Preparedness and Response Global Plan [Internet]. Geneva: World Health Organization; 2020 [cited 2020 Feb 5]. (2020).

[CR5] 5.Huang C, et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. The Lancet. 2020;395:497–506. doi: 10.1016/S0140-6736(20)30183-5. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR6] 6.Wang M, et al. Remdesivir and chloroquine effectively inhibit the recently emerged novel coronavirus (2019-nCoV) in vitro. Cell Res. 2020;30:269–271. doi: 10.1038/s41422-020-0282-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR7] 7.Lu H. Drug treatment options for the 2019-new coronavirus (2019-nCoV) Biosci. Trends. 2020;14:69–71. doi: 10.5582/bst.2020.01020. [DOI] [PubMed] [Google Scholar]

[CR8] 8.Chen H, Du Q. Potential natural compounds for preventing SARS-CoV-2 (2019-nCoV) infection. Preprints. 2020 doi: 10.20944/preprints202001.0358.v3. [DOI] [Google Scholar]

[CR9] 9.Li Y, et al. Therapeutic drugs targeting 2019-nCoV main protease by high-throughput screening. bioRxiv. 2020 doi: 10.1101/2020.01.28.922922. [DOI] [Google Scholar]

[CR10] 10.Kumar S. Drug and vaccine design against novel coronavirus (2019-nCoV) spike protein through computational approach. Preprints. 2020 doi: 10.20944/preprints202002.0071.v1. [DOI] [Google Scholar]

[CR11] 11.Zhang H, et al. Deep learning based drug screening for novel coronavirus 2019-nCov. Interdiscip. Sci. Comput. Life Sci. 2020 doi: 10.20944/preprints202002.0061.v1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR12] 12.Beck BR, Shin B, Choi Y, Park S, Kang K. Predicting commercially available antiviral drugs that may act on the novel coronavirus (SARS-CoV-2) through a drug-target interaction deep learning model. Comput. Struct. Biotechnol. J. 2020;18:784–790. doi: 10.1016/j.csbj.2020.03.025. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR13] 13.Zhou P, et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. 2020;579:270–273. doi: 10.1038/s41586-020-2012-7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR14] 14.Huang Y-A, et al. Prediction of microbe–disease association from the integration of neighbor and graph with collaborative recommendation model. J. Transl. Med. 2017;15:1–11. doi: 10.1186/s12967-016-1111-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR15] 15.Li J, et al. Network-based identification of microRNAs as potential pharmacogenomic biomarkers for anticancer drugs. Oncotarget. 2016;7:45584–45596. doi: 10.18632/oncotarget.10052. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR16] 16.Wang F, et al. LRLSHMDA: Laplacian regularized least squares for human microbe–disease association prediction. Sci. Rep. 2017;7:1–11. doi: 10.1038/s41598-016-0028-x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR17] 17.McConkey BJ, Sobolev V, Edelman M. The performance of current methods in ligand–protein docking. Curr. Sci. 2002;1:845–856. [Google Scholar]

[CR18] 18.Meng X-Y, Zhang H-X, Mezei M, Cui M. Molecular docking: A powerful approach for structure-based drug discovery. Curr. Comput. Aided Drug Des. 2011;7:146–157. doi: 10.2174/157340911795677602. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR19] 19.Morris GM, et al. AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility. J. Comput. Chem. 2009;30:2785–2791. doi: 10.1002/jcc.21256. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR20] 20.Tchesnokov EP, Feng JY, Porter DP, Götte M. Mechanism of inhibition of ebola virus RNA-dependent RNA polymerase by remdesivir. Viruses. 2019;11:326. doi: 10.3390/v11040326. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR21] 21.Sheahan TP, et al. Comparative therapeutic efficacy of remdesivir and combination lopinavir, ritonavir, and interferon beta against MERS-CoV. Nat. Commun. 2020;11:1–14. doi: 10.1038/s41467-019-13940-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR22] 22.Brown AJ, et al. Broad spectrum antiviral remdesivir inhibits human endemic and zoonotic deltacoronaviruses with a highly divergent RNA dependent RNA polymerase. Antiviral Res. 2019;169:104541. doi: 10.1016/j.antiviral.2019.104541. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR23] 23.Morse JS, Lalonde T, Xu S, Liu WR. Learning from the past: Possible urgent prevention and treatment options for severe acute respiratory infections caused by 2019-nCoV. ChemBioChem. 2020;21:730–738. doi: 10.1002/cbic.202000047. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR24] 24.Paules CI, Marston HD, Fauci AS. Coronavirus infections: more than just the common cold. JAMA. 2020;323:707–708. doi: 10.1001/jama.2020.0757. [DOI] [PubMed] [Google Scholar]

[CR25] 25.Chen YW, Yiu C-P, Wong K-Y. Prediction of the 2019-nCoV 3C-like protease (3CLpro) structure: Virtual screening reveals velpatasvir, ledipasvir, and other drug repurposing candidates. ChemRxiv. 2020 doi: 10.26434/chemrxiv.11831103. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR26] 26.Zumla A, Hui DS, Azhar EI, Memish ZA, Maeurer M. Reducing mortality from 2019-nCoV: Host-directed therapies should be an option. The Lancet. 2020;395:35–36. doi: 10.1016/S0140-6736(20)30305-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR27] 27.Wang F-S, Zhang C. What to do next to control the 2019-nCoV epidemic? The Lancet. 2020;395:391–393. doi: 10.1016/S0140-6736(20)30300-7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR28] 28.Malik YS, et al. Emerging novel coronavirus (2019-nCoV)—current scenario, evolutionary perspective based on genome analysis and recent developments. Vet. Q. 2020;40:68–76. doi: 10.1080/01652176.2020.1727993. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR29] 29.FDA. FDA’s approval of Veklury (remdesivir) for the treatment of COVID-19—The Science of Safety and Effectiveness. https://www.fda.gov/drugs/drug-safety-and-availability/fdas-approval-veklury-remdesivir-treatment-covid-19-science-safety-and-effectiveness, October 22, 2020. Accessed 24 Oct 2020.

[CR30] 30.Zhang Z, et al. Clinical features and treatment of 2019-nCov pneumonia patients in Wuhan: Report of a couple cases. Virol. Sin. 2020;35:330–336. doi: 10.1007/s12250-020-00203-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR31] 31.Schrodinger, L. The PyMOL molecular graphics system. Version1, (2010).

[CR32] 32.Fraser C, et al. Pandemic potential of a strain of influenza A (H1N1): EARLY FINDINGS. Science. 2009;324:1557–1561. doi: 10.1126/science.1176062. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR33] 33.Claas EC, et al. Human influenza A H5N1 virus related to a highly pathogenic avian influenza virus. The Lancet. 1998;351:472–477. doi: 10.1016/S0140-6736(97)11212-0. [DOI] [PubMed] [Google Scholar]

[CR34] 34.Gao R, et al. Human infection with a novel avian-origin influenza A (H7N9) virus. N. Engl. J. Med. 2013;368:1888–1897. doi: 10.1056/NEJMoa1304459. [DOI] [PubMed] [Google Scholar]

[CR35] 35.Fried MW, et al. Peginterferon Alfa-2a plus ribavirin for chronic hepatitis C virus infection. N. Engl. J. Med. 2002;347:975–982. doi: 10.1056/NEJMoa020047. [DOI] [PubMed] [Google Scholar]

[CR36] 36.Navia MA, et al. Three-dimensional structure of aspartyl protease from human immunodeficiency virus HIV-1. Nature. 1989;337:615–620. doi: 10.1038/337615a0. [DOI] [PubMed] [Google Scholar]

[CR37] 37.Mörner A, et al. Primary human immunodeficiency virus type 2 (HIV-2) isolates, like HIV-1 isolates, frequently use CCR5 but show promiscuity in coreceptor usage. J. Virol. 1999;73:2343–2349. doi: 10.1128/JVI.73.3.2343-2349.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR38] 38.Young PL, Halpin K, Mackenzie JS, Field HE. Isolation of hendra virus from pteropid bats: A natural reservoir of Hendra virus. J. Gen. Virol. 2000;81:1927. doi: 10.1099/0022-1317-81-8-1927. [DOI] [PubMed] [Google Scholar]

[CR39] 39.Chee MS, et al. Analysis of the protein-coding content of the sequence of human cytomegalovirus strain AD169. In: McDougall JK, et al., editors. Cytomegaloviruses. New York: Springer; 1990. pp. 125–169. [DOI] [PubMed] [Google Scholar]

[CR40] 40.de Groot RJ, et al. Commentary: Middle east respiratory syndrome coronavirus (MERS-CoV): Announcement of the coronavirus study group. J. Virol. 2013;87:7790–7792. doi: 10.1128/JVI.01244-13. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR41] 41.Mazur NI, et al. The respiratory syncytial virus vaccine landscape: Lessons from the graveyard and promising candidates. Lancet Infect. Dis. 2018;18:295–311. doi: 10.1016/S1473-3099(18)30292-5. [DOI] [PubMed] [Google Scholar]

[CR42] 42.Bosch BJ, et al. Severe acute respiratory syndrome coronavirus (SARS-CoV) infection inhibition using spike protein heptad repeat-derived peptides. Proc. Natl. Acad. Sci. 2004;101:8455–8460. doi: 10.1073/pnas.0400576101. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR43] 43.Sayers EW, et al. Database resources of the national center for biotechnology information. Nucleic Acids Res. 2020;48:D9. doi: 10.1093/nar/gkz899. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR44] 44.Katoh K, Rozewicki J, Yamada KD. MAFFT online service: Multiple sequence alignment, interactive sequence choice and visualization. Brief. Bioinform. 2019;20:1160–1166. doi: 10.1093/bib/bbx108. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR45] 45.Wishart DS, et al. DrugBank 5.0: a major update to the DrugBank database for 2018. Nucleic Acids Res. 2018;46:D1074–D1082. doi: 10.1093/nar/gkx1037. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR46] 46.Canese, K. & Weis, S. PubMed: the bibliographic database. in The NCBI Handbook [Internet]. 2nd edition (National Center for Biotechnology Information (US), 2013).

[CR47] 47.Rogers D, Hahn M. Extended-connectivity fingerprints. J. Chem. Inf. Model. 2010;50:742–754. doi: 10.1021/ci100050t. [DOI] [PubMed] [Google Scholar]

[CR48] 48.Landrum, G., et al. RDKit: Open-Source Cheminformatics Software. (2016).

[CR49] 49.Andersen PI, et al. Discovery and development of safe-in-man broad-spectrum antiviral agents. Int. J. Infect. Dis. 2020;93:268–276. doi: 10.1016/j.ijid.2020.02.018. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR50] 50.Valdeolivas A, et al. Random walk with restart on multiplex and heterogeneous biological networks. Bioinformatics. 2019;35:497–505. doi: 10.1093/bioinformatics/bty637. [DOI] [PubMed] [Google Scholar]

[CR51] 51.Berman HM, et al. The protein data bank. Nucleic Acids Res. 2000;28:235–242. doi: 10.1093/nar/28.1.235. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Prioritizing antiviral drugs against SARS-CoV-2 by integrating viral complete genome sequences and drug chemical structures

Lihong Peng

Ling Shen

Junlin Xu

Xiongfei Tian

Fuxing Liu

Juanjuan Wang

Geng Tian

Jialiang Yang

Liqian Zhou

Abstract

Introduction

Results

Experimental settings

Table 1.

Evaluation metrics

Performance evaluation under three fivefold CVs

Table 2.

Table 3.

Table 4.

Figure 1.

Case study

Table 5.

Table 6.

Table 7.

Molecular docking

Table 8.

Discussion

Methods

Virus-drug association data

Dataset 1

Virus data

Drug data

VDAs

Dataset 2

Dataset 3

Table 9.

The VDA-RWR method

Figure 2.

Molecular docking

Conclusion

Supplementary Information

Acknowledgements

Author contributions

Funding

Data availability

Competing interests

Footnotes

Contributor Information

Supplementary Information

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases