Multiscale fusion network drives the repurposing of anticancer drugs

Zhaoman Wan; Nan Jiang; Mingming Su; Xinlei Zhang; Yang Cao; Aiping Wu; Peng Zhang; Taijiao Jiang

doi:10.1002/ctm2.1745

letter

. 2024 Jun 25;14(7):e1745. doi: 10.1002/ctm2.1745

Multiscale fusion network drives the repurposing of anticancer drugs

Zhaoman Wan ¹, Nan Jiang ², Mingming Su ³, Xinlei Zhang ³, Yang Cao ⁴, Aiping Wu ^1,^✉, Peng Zhang ^5,^✉, Taijiao Jiang ^6,^✉

PMCID: PMC11199060 PMID: 38924682

Dear Editor,

Drug repurposing is at the forefront of a transformative shift in computational methods driving new applications of approved or investigational drugs. ¹ , ² With the development of network pharmacology, repositioning algorithms for drug effects or drug targets are constantly expanding, but integrating multidimensional data to achieve precise repurposing is still a challenge. ³ , ⁴ , ⁵ , ⁶ , ⁷ , ⁸ , ⁹ We focus on drug attribute characteristics and propose a scalable systematic paradigm. Using the Genomics of Drug Sensitivity in Cancer (GDSC) database for anti‐tumor drugs, a integrated drug similarity network (iDSN) derived from different drug similarity networks (DSNs) based on chemical structure and drug target sequence data is constructed to infer potential drug pathways from drug properties and realise drug repurposing.

Initially, we processed drug profile data by vectorizing it (Figure 1A). Based on chemical and pharmacological properties, we constructed two separate DSNs: chem‐DSN and pharm‐DSN. These were then merged into an iDSN using a nonlinear fusion algorithm called Similarity Network Fusion (SNF) (Figure 1B). To validate the iDSN's potential in therapeutic similarity, we utilized a spectral clustering model with seven gold‐standard annotations from PubChem (Figure 1C). Downstream analysis was delineated across three dimensions for drug repurposing (Figure 1D): (1) identifying similar components within classes, amalgamating pharmacological mechanisms with pathway annotation; (2) establishing associations between drug network clusters and distinct biological pathways; (3) prioritizing higher‐ranked drug pairs for drug repositioning.

**The overall flowchart of the proposed framework**. (A) In the first part, we preprocess structural data and target protein sequences based on 276 anti‐tumor drugs containing target protein, PubChem CID and pathway annotations in GDSC to construct eigenvectors or eigen decomposition matrices that can be used for similarity network analysis. (B) In the second part, we constructed two structure‐based single‐property DSNs by vector space model and multiple sequence alignment denoted as chem‐DSN and pharm‐DSN, respectively, and fused them into multi‐scale network, named iDSN. (C) In the third part, we used the drug classification notes on DrugBank and PubChem as the gold standard to test the classification effect of DSN according to the adjusted Rand index. (D) In the fourth part, 276 drugs included in the iDSN were grouped into 16 clusters (Cluster 1–Cluster 16) and performed a systematic statistic analysis for annotating iDSN clusters from various perspectives to guide drug repositioning. Possible drug repositioning applications were obtained based on high‐similarity drug pairs or the occurrence of unexpected drugs within iDSN clusters.

Employing spectral clustering, iDSN exhibited a more distinct clustering structure compared to the chem‐DSN and a more evenly distributed structure than the pharm‐DSN (Figures 2A and S1). With the advantage of framework transparency, pharmacological properties contribute more than chemical properties through the quantitative assessment in clustering (Figure 2B). In 11 clusters, pharmacological features accounted for over 70% of edge similarity, and four clusters were entirely determined by pharm‐DSN. In comparison to single‐property DSNs, iDSN demonstrated superior performance across all three metrics (Figure 2C).

**Clustering validation and feature contribution of DSNs**. (A) Heatmaps of DSNs. chem‐DSN and pharm‐DSN were integrated into the iDSN. The sidebar to the left of the networks corresponds to the cluster label. (B) Structural features used alone were defined in the stem‐and‐leaf display with drug clusters to evaluate their respective contributions. (C) Categorical internal evaluation by Davies–Bouldin index, Calinski–Harabaz score and Silhouette coefficient for chem‐DSN, pharm‐DSN and iDSN. (D) Categorical external evaluation by adjusted RAND coefficient for chem‐DSN, pharm‐DSN and iDSN based on multiscale drug classification annotations. (E) Normalised mutual information bar chart of chem‐DSN, pharm‐DSN and iDSN with IC50‐based benchmark drug classification separately. (F) Data types contribution defined in the pie chart include chemical structure, drug targets and fusion value.

Evaluation using six diverse benchmark datasets of drug categories confirmed iDSN's stronger correlations with all benchmark annotations compared to single‐property DSNs (Table S1). Comparing clustering performance among single‐property DSNs, pharm‐DSN displayed better interaction with cell line (ARI = .470) and pathway (ARI = .585) annotations (Figure 2D). Importantly, iDSN based on the cross‐fusion network algorithm achieves higher performance on IC50 (ARI = .502, NMI = .53), indicating improved generalisation and accuracy through multi‐feature fusion (Figures 2E and S2). Data contribution analysis within the IC50‐based cluster of the three DSNs highlighted iDSN's predominant contribution (77.23%), while chem‐DSN and pharm‐DSN contributed less (9.83% and 12.94%, respectively), underscoring iDSN's dominance in the IC50‐based network (Figure 2F). To validate the superior performance, our method was compared with state‐of‐the‐art approaches, encompassing traditional machine learning, network propagation and matrix factorisation. The framework demonstrated a significantly higher value (SC = .58) compared to other methods. Similar results were observed with the NMI index using the IC50 dataset, where our framework outperformed in interactivity score (ARI = .512) (Table S2).

To facilitate drug precision repositioning, we uncovered the drug preferences within each cluster for various molecular functions using the KEGG pathway and Gene Ontology (GO) annotations ¹⁰ (Figure 3). Notably, certain highly similar drug pairs within clusters exhibited consistent downstream pathway annotations, indicating the potential for drug repositioning by leveraging common targets or similar cellular signalling pathways to achieve therapeutic effects. The results revealed that several drug clusters exhibited significant enrichment annotations on GO analysis, such as Cluster 4, Cluster 2, and Cluster 5 included in the histone deacetylation, ADP ribosylation and phosphorylation respectively based on biological process. Furthermore, we observed that some individual drug clusters met different KEGG enrichment pathways under secondary classification, but specific on GO enrichment analysis in biological processes, cell components or molecular functions. For instance, in Cluster 9, we observed enrichment in KEGG pathways related to cancer and cell growth and death, with emphasis on apoptosis and molecular functions associated with dimerisation in biological processes, which suggests that Cluster 9 may exhibit a pharmacodynamic pattern, potentially influencing protein dimerisation and participating in pathways related to cancer or cell growth and death through apoptosis regulation.

**Target‐based enrichment analyses**. (A) Enrichment landscape of GO biological processes, molecular function, and cellular components based on taxonomic clusters. (B) Enrichment landscape of pathway identifiers from the KEGG database. Deeper colour signifies greater significance in all panels, respectively. Common biological themes shared by multiple clusters are boxed with names provided in the plot.

Exploring drug pairs with high similarity in the iDSN reveals potential drug repositioning opportunities. For the top 100 similar drug pairs, 86% had consistent pathway annotations, confirming the reliability of our drug similarity calculations. However, some pairs with high similarity scores had different annotations, mainly linked to six pathways in four classification clusters (Figure 4A). For instance, a closely related set of drug pairs, including BMS‐536924, BMS‐754807, GSK1904529A, Linsitinib and NVP‐ADW742, exhibited connections to annotations in the IGF1R signalling and RTK signalling pathways, suggesting potential shared targets or interactions with similar cellular signalling pathways. In a major cluster, drugs associated with the kinases pathway (KIN001‐244) showed high similarity to drugs linked to the Metabolism pathway (BX‐912 and OSU‐03012) and the Mitosis pathway (MPS‐1‐IN‐1), unveiling potential crosstalk for therapeutic strategies (Table S3). Among them, the drug pair with the highest similarity is CMK‐LJI308 (ranked sixth), annotated in the kinase pathway and the PI3K/MTOR signalling pathway, respectively. Drug pairs with different annotation pathways in specific spectral clustering clusters indicate distinct subgroups with unique downstream pathway preferences. Some clusters may exhibit pathway‐specific therapeutic effects, while others show divergent pathway orientations (Figure 4B). To explore the global repositioning associations, we aligned pathway annotations with clustering results and observed a well‐balanced distribution of downstream pathways across drug clusters (Figure 4C). Within one cluster containing nine drugs, four were associated with the IGF1R signalling pathway, and the remaining drugs were linked to the RTK signalling pathway. In another cluster with 37 drugs, 14 were mapped to the RTK pathway, while the remaining drugs were connected to the kinase pathway. Furthermore, we independently analysed highly similar drug pairs within these clusters (Figure 4D).

**Repositioning association with the different pathway**. (A) The consistent distribution of the top 100 similarity drug pairs on the pathway and the proportion of different pathway pairs. (B) The cumulative frequency histogram calculated the corresponding consistent pathway and nonconsistent pathway in the cluster under the distribution of the top 100 highly similar drug pairs in the drug cluster. The area plot represents whether the set of consistent and non‐consistent paths under each cluster evaluates the p‐value after taking the negative logarithm of the significance difference. The corresponding frequencies are represented by the left ordinate while the statistical test index is represented by the right. (C) The global anti‐tumor drugs in the iDSN are grouped into 16 clusters. Node colour signifies the pathway assignment of the drug. (D) Pathway association network for global drug mapping. The peripheral ring represents the GDSC drug annotation pathway, and the arc length is determined by the central angle corresponding to the proportion of the drug counts included in the pathway. The lines represent inconsistent pathways mapped under 276 drug global associations. The inner ring represents the potential pathway of the drug on the outer circumference of the same radius. The colour‐indicated pathways correspond to inconsistent pathway‐source drug pairs with a similarity greater than .7, and the grey‐indicated pathways correspond to highly similar drugs that are all mapped on the same pathway.

In conclusion, this scalable structure‐derived framework offers fresh insights into deducing characteristic downstream pathways and repurposing drugs via common drug structural properties. With the accumulation of drug informatics data and the development of future drugs, we will continue to expand our data, to deepen our understanding of feature integration, and to further improve the algorithm's performance for new drug development.

AUTHOR CONTRIBUTIONS

Zhaoman Wan performed the analysis and prepared the manuscript with the help of Yang Cao, Mingming Su, Xinlei Zhang, L.Y. and H.X. Aiping Wu, Peng Zhang and Taijiao Jiang supervised the studies, designed the analysis and revised the manuscript. All authors reviewed and approved the manuscript.

CONFLICT OF INTERESTS STATEMENT

M.S. and X.Z. are the co‐founders of Beijing Cloudna Technology Co., Ltd., and the other authors declare no competing interests.

Supporting information

Supporting Information

CTM2-14-e1745-s002.pptx^{(1.6MB, pptx)}

Supporting Information

CTM2-14-e1745-s001.docx^{(26.3KB, docx)}

Supporting Information

CTM2-14-e1745-s003.xlsx^{(11.2KB, xlsx)}

Supporting Information

CTM2-14-e1745-s005.xlsx^{(11.4KB, xlsx)}

Supporting Information

CTM2-14-e1745-s004.xlsx^{(10.7KB, xlsx)}

Contributor Information

Aiping Wu, Email: wap@ism.cams.cn.

Peng Zhang, Email: zhangpengdyx@163.com.

Taijiao Jiang, Email: jiang_taijiao@gzlab.ac.cn.

DATA AVAILABILITY STATEMENT

The materials of datasets used for the current study were accessible in the specific public database (as we described in the Methods section). The analysis codes are available at https://github.com/zmwwan/DSNcode/.

REFERENCES

1.1. Kumar A, Zhang KYJ. Advances in the development of shape similarity methods and their application in drug discovery. Front Chem. 2018;6:315. [DOI] [PMC free article] [PubMed] [Google Scholar]
2. Gandini E, Marcou G, Bonachera F, Varnek A, Pieraccini S, Sironi M. Molecular similarity perception based on machine‐learning models. Int J Mol Sci. 2022;23(11):6114. [DOI] [PMC free article] [PubMed] [Google Scholar]
3. Kang H, Hou L, Gu Y, Lu X, Li J, Li Q. Drug‐disease association prediction with literature based multi‐feature fusion. Front Pharmacol. 2023;14:1205144. [DOI] [PMC free article] [PubMed] [Google Scholar]
4. He S, Wen Y, Yang X, et al. PIMD: an integrative approach for drug repositioning using multiple characterization fusion. Genomics Proteom Bioinform. 2020;18(5):565‐581. [DOI] [PMC free article] [PubMed] [Google Scholar]
5. Law V, Knox C, Djoumbou Y, et al. DrugBank 4.0: shedding new light on drug metabolism. Nucleic Acids Res. 2014;42(Database issue):D1091‐D1097. [DOI] [PMC free article] [PubMed] [Google Scholar]
6. Wang B, Mezlini AM, Demir F, et al. Similarity network fusion for aggregating data types on a genomic scale. Nat Methods. 2014;11(3):333‐337. [DOI] [PubMed] [Google Scholar]
7. Öztürk H, Ozkirimli E, Özgür A. A comparative study of SMILES‐based compound similarity functions for drug‐target interaction prediction. BMC Bioinform. 2016;17:128. [DOI] [PMC free article] [PubMed] [Google Scholar]
8. Lee I, Keum J, Nam H. DeepConv‐DTI: prediction of drug‐target interactions via deep learning with convolution on protein sequences. PLoS Comput Biol. 2019;15(6):e1007129. [DOI] [PMC free article] [PubMed] [Google Scholar]
9. Wang S, Li J, Wang D, Xu D, Jin J, Wang Y. Predicting drug‐disease associations through similarity network fusion and multi‐view feature projection representation. IEEE J Biomed Health Inform. 2023;27(10):5165‐5176. [DOI] [PubMed] [Google Scholar]
10. Kanehisa M, Goto S. KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 2000;28(1):27‐30. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Information

CTM2-14-e1745-s002.pptx^{(1.6MB, pptx)}

Supporting Information

CTM2-14-e1745-s001.docx^{(26.3KB, docx)}

Supporting Information

CTM2-14-e1745-s003.xlsx^{(11.2KB, xlsx)}

Supporting Information

CTM2-14-e1745-s005.xlsx^{(11.4KB, xlsx)}

Supporting Information

CTM2-14-e1745-s004.xlsx^{(10.7KB, xlsx)}

Data Availability Statement

The materials of datasets used for the current study were accessible in the specific public database (as we described in the Methods section). The analysis codes are available at https://github.com/zmwwan/DSNcode/.

[ctm21745-bib-0001] 1.1. Kumar A, Zhang KYJ. Advances in the development of shape similarity methods and their application in drug discovery. Front Chem. 2018;6:315. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ctm21745-bib-0002] 2. Gandini E, Marcou G, Bonachera F, Varnek A, Pieraccini S, Sironi M. Molecular similarity perception based on machine‐learning models. Int J Mol Sci. 2022;23(11):6114. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ctm21745-bib-0003] 3. Kang H, Hou L, Gu Y, Lu X, Li J, Li Q. Drug‐disease association prediction with literature based multi‐feature fusion. Front Pharmacol. 2023;14:1205144. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ctm21745-bib-0004] 4. He S, Wen Y, Yang X, et al. PIMD: an integrative approach for drug repositioning using multiple characterization fusion. Genomics Proteom Bioinform. 2020;18(5):565‐581. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ctm21745-bib-0005] 5. Law V, Knox C, Djoumbou Y, et al. DrugBank 4.0: shedding new light on drug metabolism. Nucleic Acids Res. 2014;42(Database issue):D1091‐D1097. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ctm21745-bib-0006] 6. Wang B, Mezlini AM, Demir F, et al. Similarity network fusion for aggregating data types on a genomic scale. Nat Methods. 2014;11(3):333‐337. [DOI] [PubMed] [Google Scholar]

[ctm21745-bib-0007] 7. Öztürk H, Ozkirimli E, Özgür A. A comparative study of SMILES‐based compound similarity functions for drug‐target interaction prediction. BMC Bioinform. 2016;17:128. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ctm21745-bib-0008] 8. Lee I, Keum J, Nam H. DeepConv‐DTI: prediction of drug‐target interactions via deep learning with convolution on protein sequences. PLoS Comput Biol. 2019;15(6):e1007129. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ctm21745-bib-0009] 9. Wang S, Li J, Wang D, Xu D, Jin J, Wang Y. Predicting drug‐disease associations through similarity network fusion and multi‐view feature projection representation. IEEE J Biomed Health Inform. 2023;27(10):5165‐5176. [DOI] [PubMed] [Google Scholar]

[ctm21745-bib-0010] 10. Kanehisa M, Goto S. KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 2000;28(1):27‐30. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Multiscale fusion network drives the repurposing of anticancer drugs

Zhaoman Wan

Nan Jiang

Mingming Su

Xinlei Zhang

Yang Cao

Aiping Wu

Peng Zhang

Taijiao Jiang

FIGURE 1.

FIGURE 2.

FIGURE 3.

FIGURE 4.

AUTHOR CONTRIBUTIONS

CONFLICT OF INTERESTS STATEMENT

Supporting information

Contributor Information

DATA AVAILABILITY STATEMENT

REFERENCES

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Multiscale fusion network drives the repurposing of anticancer drugs

Zhaoman Wan

Nan Jiang

Mingming Su

Xinlei Zhang

Yang Cao

Aiping Wu

Peng Zhang

Taijiao Jiang

FIGURE 1.

FIGURE 2.

FIGURE 3.

FIGURE 4.

AUTHOR CONTRIBUTIONS

CONFLICT OF INTERESTS STATEMENT

Supporting information

Contributor Information

DATA AVAILABILITY STATEMENT

REFERENCES

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases