Abstract
Drug-Target Interaction (DTI) prediction has attracted growing attention since the COVID-19 outbreak. DTI prediction is one of the stages of finding a new cure for a recent disease. It determines whether a chemical compound would affect a particular protein, a relationship quantified by binding affinity. Recently, significant efforts have been devoted to artificial intelligence (AI) powered DTI. However, the use of transfer learning in DTI has not been explored extensively. This paper aims to build a more general DTI model by investigating DTI prediction methods that use transfer learning. Three popular models are tested and compared: CNN, RNN, and Transformer. These models are combined in several scenarios involving two large public DTI datasets (BindingDB and DAVIS) to find the best-performing architecture. Our findings indicate that the CNN model pre-trained on BindingDB as the source data is the most recommended pre-trained model for real DTI cases. This conclusion is supported by a 6% increase in AUPRC when the BindingDB pre-trained model is fine-tuned on the DAVIS dataset, compared with training on DAVIS without pre-training.
Keywords: drug-target interaction, transfer learning, drug discovery, deep learning, SMILES
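The reported gain is measured in AUPRC (area under the precision-recall curve). As a minimal sketch of how such a comparison could be computed, the function below implements the step-wise average-precision estimate of AUPRC in plain Python. The labels and scores are toy values invented for illustration, not results from this study, and the variable names (`scratch_scores`, `finetuned_scores`) are hypothetical. The sketch also assumes interactions have already been binarized into interacting (1) and non-interacting (0) pairs.

```python
from typing import Sequence


def average_precision(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Step-wise estimate of the area under the precision-recall curve.

    labels: 1 for an interacting drug-target pair, 0 otherwise.
    scores: predicted interaction scores (higher means more likely).
    Tied scores are not treated specially in this simplified version.
    """
    if len(labels) != len(scores):
        raise ValueError("labels and scores must have the same length")
    n_pos = sum(labels)
    if n_pos == 0:
        raise ValueError("at least one positive label is required")

    # Rank pairs from the highest to the lowest predicted score.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)

    true_pos = 0
    total = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            true_pos += 1
            # Precision at the rank where recall increases.
            total += true_pos / rank
    return total / n_pos


# Toy data (assumption): six drug-target pairs with binary labels.
labels = [1, 0, 1, 0, 0, 1]
# Hypothetical scores from a model trained on the target data only.
scratch_scores = [0.9, 0.8, 0.4, 0.7, 0.2, 0.3]
# Hypothetical scores from a model pre-trained, then fine-tuned.
finetuned_scores = [0.9, 0.3, 0.8, 0.6, 0.2, 0.5]

auprc_scratch = average_precision(labels, scratch_scores)
auprc_finetuned = average_precision(labels, finetuned_scores)
print(f"from scratch: {auprc_scratch:.3f}, fine-tuned: {auprc_finetuned:.3f}")
```

In practice, a library routine for average precision would likely be used instead; the hand-written version only makes explicit what the metric rewards, namely ranking true interactions ahead of non-interactions.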
