Abstract
Drug discovery is a lengthy and resource-intensive process that requires innovative computational techniques to expedite the transition from laboratory research to life-saving medications. Here, we introduce Qsarna, a comprehensive online platform that combines machine learning for activity prediction with traditional molecular docking to streamline virtual screening workflows. Our platform employs a fragment-based generative model, enabling the exploration of novel chemical spaces with the desired pharmacophoric features. Users can share results with others, and docking poses can be examined directly within the platform. In our case study, we successfully identified three new hits for monoamine oxidase B with nanomolar potency, which were later confirmed by experimental assays. The user-friendly web interface requires minimal computational expertise, making advanced virtual screening accessible to scientists regardless of their main field of study. Qsarna represents a significant advancement in computational drug discovery by seamlessly integrating complementary in silico approaches and democratizing access to advanced virtual screening technologies.


Introduction
The discovery and development of new therapeutic agents remains one of the most challenging endeavors in biomedical sciences, with estimated costs exceeding $2.5 billion per approved drug and timelines extending beyond 10–15 years. The significant investment of time and resources, combined with high attrition rates in clinical trials, has prompted pharmaceutical companies and research institutions to seek more efficient drug discovery methods. , In this context, computational methods have become essential tools, enabling the rapid evaluation of millions of compounds and the prioritization of the most promising candidates for experimental testing. Virtual screening, in particular, has revolutionized early-stage drug discovery by enabling researchers to systematically assess large chemical spaces and identify compounds with desired properties before initiating costly experimental work. , The strategic implementation of computational screening methods early in the drug discovery process has been shown to reduce the number of compounds requiring experimental evaluation by a substantial margin, leading to significant cost savings and accelerated development timelines while maintaining or improving the quality of identified lead compounds. ,
Drug discovery paradigms have evolved significantly over the past few decades, with virtual screening approaches transitioning from simple physicochemical filters to sophisticated machine learning models capable of capturing complex structure–activity relationships. Quantitative structure–activity relationship (QSAR) models, first developed in the 1960s, laid the foundation for modern computational drug discovery by establishing mathematical relationships between molecular structures and biological activity. Today, advanced machine learning algorithms have surpassed these traditional methods, enabling the autonomous identification of relevant features in molecular structures and uncovering complex patterns from extensive chemical data sets. Recent breakthroughs in deep learning architectures, such as graph neural networks and graph transformer models, , have demonstrated unprecedented accuracy in predicting drug–target interactions, ADMET properties, and potential toxicity risks. , Furthermore, the emergence of generative models has opened new avenues in drug discovery by enabling the de novo design of molecules with desired properties. These AI-powered approaches can explore vast regions of chemical space that are inaccessible to traditional screening methods, generating novel molecular structures that maintain synthetic accessibility while optimizing multiple biological and physicochemical properties simultaneously. − The integration of multiple screening strategies, combining structure-based methods like molecular docking with ligand-based approaches such as similarity searching and both predictive and generative machine learning models, has become the gold standard in modern virtual screening campaigns. , This synergistic approach leverages the strengths of each method while compensating for their individual limitations, resulting in more robust and reliable predictions for candidate compounds and the generation of promising novel chemical entities.
Despite significant advances in virtual screening methodologies, several critical challenges continue to impede their widespread adoption in drug discovery programs. Several AI-driven platforms have been developed in recent years to streamline early-stage drug discovery. Chemistry42 is an AI-driven platform combining over 40 generative models with ligand- and structure-based design workflows, offering fine control over molecular properties and synthesis feasibility. DrugFlow provides an integrated, user-friendly environment for docking, QSAR, ADMET prediction, and virtual screening, tailored for ease of use by nonexperts. MolProphet is a web-based platform focused on accessible AI-powered drug discovery, featuring pocket prediction, molecule generation from purchasable building blocks, and synthesis planning. Unfortunately, most of the tools described here are commercial and available only through licensing, which can limit accessibility for academic users.
While commercial software solutions exist, there is a notable scarcity of freely available, comprehensive virtual screening platforms accessible to the broader scientific community. Most existing tools focus on single aspects of virtual screening, such as molecular docking or QSAR predictions, but fail to integrate multiple complementary approaches in a unified workflow. This siloed approach often forces researchers to manually combine results from different tools, potentially missing valuable insights that could emerge from a more integrated analysis. Although several online platforms for molecular docking, − QSAR modeling, , and ADMET prediction − exist, and mobile applications like MedChem Game have demonstrated the potential for accessible drug design tools, none of these solutions provides a comprehensive, guided workflow that combines classical structure-based methods with modern machine learning approaches. This limitation is particularly problematic for medicinal chemists, who, despite their expertise in synthetic chemistry and drug design, may lack the computational background required to effectively utilize these tools. The existing software solutions often demand significant expertise in computational chemistry, including understanding complex parameter settings, file format conversions, and result interpretation. Furthermore, the absence of clear guidance through the virtual screening process can lead to the suboptimal use of these tools, potentially missing promising drug candidates or generating unreliable predictions. The need for a user-friendly integrated platform that combines the power of AI models with traditional virtual screening approaches while guiding users through the process remains largely unmet in the field.
To address critical gaps in virtual screening accessibility and integration, we present Qsarna (QSAR navigator), a web-based platform combining machine learning with structure-based approaches. Our platform provides end-to-end support for virtual screening campaigns, from initial compound library management to candidate selection while maintaining a user-friendly interface for researchers across the drug discovery pipeline. By providing these tools freely to academic researchers through a web interface, Qsarna eliminates the need for local computing resources and democratizes access to advanced methodologies. Our case study on monoamine oxidase inhibitors demonstrates improved success in identifying promising drug candidates compared to single-method approaches. We believe that Qsarna will accelerate drug discovery by enabling the efficient identification of lead compounds.
Software
Qsarna is an online platform for the automated virtual screening of compounds, seamlessly combining traditional molecular docking software, QSAR machine learning models, and fragment-based generative design. The sections below describe the user-friendly interface design that allows nonexpert users to perform virtual screening experiments effortlessly.
Graphical User Interface
The graphical user interface was designed to be intuitive. We prepared interactive onboarding tutorials to help new users familiarize themselves with all of the functionalities. We also provide a PDF manual describing all of the modules in detail. Upon registration, an example project is created so that users explore the possibilities of the application without the need to upload their own compounds and prepare screening pipelines. The work on virtual screening campaigns starts with the creation of a new project. Multiple users can be invited to collaborate on adding compound libraries, analyzing screening results, and running additional experiments. All of these functions are accessible through the main project dashboard, presented as a grid of tiles (Figure ). Screenshots of the most important modules in Qsarna are presented in Supporting Information F.
1.
An overview of the Qsarna functionalities. Qsarna allows users to create collaborative projects. The project dashboard displays a grid of tiles similar to those shown here. Data modules enable the uploading of compounds’ structures and activity measurement results. Virtual screening identifies potent molecules using QSAR models and molecular docking simulation with uploaded compounds or automatically linked fragments. Results are visualized to show ligand–target interactions.
Molecular Visualization
The work with protein structures is facilitated in Qsarna. When new protein structures are uploaded and configured for docking, the chosen bounding box for the running pose search is shown in real time. The results of the screening campaigns can be visualized directly in the application by using the viewer module. Multiple poses of docked ligands can be viewed, and the closest amino acids and ligand–protein interactions are automatically displayed. After the visual inspection, the user can mark the selected poses and assign them a grade on a 5-point scale. Additionally, we provide a molecular dynamics visualization module in which users can upload their molecular dynamics simulation results and inspect the ligand interaction as a function of time.
Data Management
In Qsarna, all data are managed in a relational database, which is regularly backed up. To ensure data integrity, we employ ACID compliance (atomicity, consistency, isolation, and durability), which means that either the whole transaction is committed to the database or the changes are rolled back. The compounds uploaded by users are automatically preprocessed and deduplicated. If compounds already exist in the user’s project, the records are merged to assign all activity information to one molecular structure. Additionally, chemical libraries can be shared with other users, who can leave comments on the compounds.
Software Architecture
Qsarna is a Django application implemented in Python 3. The user data including all uploaded compound libraries and analysis results are stored in a PostgreSQL database that is backed up regularly. Unstructured molecular data such as docking poses and protein structures are saved as SDF and PDB files. Similarly, trained machine learning models are stored in a binary file format. All computationally exhaustive tasks are handled by a Celery queue, and the status of each task can be viewed on the task queue page.
Our application supports flexible deployment options ranging from local installations to enterprise-scale cloud infrastructure. The application can be deployed locally using a preconfigured Docker container that ensures consistent environments across different systems. Our cloud implementation leverages Amazon Web Services (AWS). Data integrity and availability are ensured through Amazon RDS, which provides automated backup capabilities and read replicas for uninterrupted database access during high-demand periods. Sensitive research data benefit from server-side encryption via Amazon S3 storage. While the main application runs on Amazon EC2 instances, computationally intensive tasks are automatically distributed across dedicated processing nodes using Amazon SQS for job queuing and Amazon Batch for managed compute environments. This design enables the system to dynamically scale resources based on workload demands. Detailed architectural diagrams for both deployment configurations are provided in Supporting Information E.
Methods
Qsarna provides three primary tools for hit identification. The first is molecular docking, a traditional virtual screening method. The second and third are machine learning techniques: QSAR models that enhance the quality of nominated hits based on activity data and fragment-based generative modeling, which allows users to expand beyond known molecules. The combination of these tools improves screening reliability by applying orthogonal filters. By training QSAR models on docking results or experimental activity data, the prioritization of docked molecules can be improved, reducing false positives and uncovering nonobvious structure–activity relationships. The generative fragment-linking module explores chemical space creatively and targets parts of the receptor that may be missed by traditional ligand-based screening.
Molecular Docking
Molecular docking is a structure-based virtual screening method. Qsarna utilizes Smina, a docking software based on AutoDock Vina. While defining a new docking protocol, users can adjust docking parameters, such as the exhaustiveness of the pose search, the number of generated poses, and the number of tautomers to be docked. Additionally, users can specify the binding pocket within the application using a protein visualizer that displays the docking bounding box in real time. All ligands are automatically prepared prior to molecular docking, which includes calculating their protonation states (OpenBabel), tautomers, stereoisomers, and low-energy conformations (RDKit). The virtual screening module in Qsarna was tested against three diverse molecular targets using decoy compounds from the DUD-E database. The results showed that Qsarna performed comparably to Glide (commercial docking software) on two of the three targets when identifying active molecules (detailed results in Supporting Information C).
Ligand-Based Activity Prediction
Libraries can also be searched for new active compounds using machine learning QSAR models, following the methodology from the study of Cieślak et al. The AutoML tool integrated into Qsarna builds three types of machine learning models: random forests, support vector machines, and artificial neural networks. These models utilize either Morgan fingerprints, Avalon fingerprints, MACCS keys, or RDKit molecular descriptors. After selecting a set of compounds (minimum 100 for reliable model training) with experimental or computed labels, such as IC50 values or docking scores, the data set is automatically constructed and divided into training, validation, and testing sets. The models are automatically trained on one or more data splits while searching for the optimal set of hyperparameters. Models can be trained to predict continuous activity measurements (e.g., IC50) or binary activity classes defined by a threshold. Finally, the trained models can be displayed along with their performance evaluations. These models become accessible in the virtual screening module, enabling ligand-based virtual screening of large, unannotated compound libraries. For extremely large libraries, training QSAR models to approximate docking scores provides a rapid alternative to traditional docking-based screening, where only a subset of the original data needs to be docked and used for ML model training.
For all projects, we also provide ADMET predictors for a variety of molecular properties, including the blood–brain barrier permeability, hERG binding, CACO-2 permeability, bioavailability, and logD. All of these models were trained on the public data sourced from the Therapeutics Data Commons initiative using the same machine learning models and descriptors available in Qsarna. These properties are automatically predicted for all newly added compounds, facilitating lead identification and multiparametric optimization. Our ADMET models provide comparable results with other publicly available web-based software, such as ADMETlab and admetSAR (the evaluation details can be found in Supporting Information D).
Fragment-Based Design
The drug-like chemical space is extensive, making it often insufficient to depend solely on existing virtual libraries to identify the best binders for a particular target. This is particularly important in identifying novel chemotypes and in avoiding already exploited or patented structures. Consequently, Qsarna offers a generative module that facilitates the creation of entirely new molecules. This module follows the principles of fragment-based drug discovery, allowing users to connect to or expand upon fragments, which are small structures known for their binding capabilities. Our model, CRET, links these fragments using only known linkers sourced from public databases, enhancing the chances that the resulting molecule can be synthesized. By utilizing experimentally determined or docked fragments, our tool explores the surrounding chemical space to develop focused libraries with potential new candidates. What is important is that the fragment linking exploits the position of the fragments within the binding pocket, ensuring that crucial interactions are preserved. Next, the generated molecules can be redocked using our docking module, prioritizing poses where the fragments are positioned near their initial location. CRET enables the creation of unique and diverse libraries of synthesizable compounds, as confirmed by a benchmark shown in Supporting Information G. This way, users can expand the chemical space around weakly active fragments and create focused libraries of novel compounds that are not found in public compound databases.
Case Study
We validated our virtual screening software’s capabilities by following the protocol outlined by Cieślak et al., aiming to discover new monoamine oxidase inhibitors solely with the tools available in Qsarna.
We began by uploading all compounds from ChEMBL that had assigned K i or IC50 values for either MAO-A or MAO-B assays. Qsarna then preprocessed these data automatically, removing duplicates and salt counterions, where the ligand is neutralized. We made one modification to the protocol by converting the regression problem into a binary classification problem, designating the positive class to all compounds with K i or IC50 below 100 nM. This allowed us to integrate the results from both functional and binding assays. Additionally, we combine activity measurements from multiple organisms for which the amino acid composition of the binding pocket is conserved. Machine learning models were trained separately for the MAO-A and MAO-B targets, achieving 0.92 ROC AUC for MAO-A and 0.88 ROC AUC for MAO-B. The ROC curves are presented in Supporting Information A.
Subsequently, the top three machine learning models screened the MolPort catalog of available compounds. Only 556 compounds were retained by applying a classification threshold greater than 0.65. Each of these compounds was preprocessed in Qsarna and docked to the MAO-A and MAO-B structures sourced from PDB (PDB IDs: 2BXR and 2V5Z, respectively), and the final selection of molecules relied on visual inspection of the binding poses and consideration of the economic factor.
Finally, the selected group of 19 compounds was acquired and evaluated for MAO-A and MAO-B activities with the Merck inhibitor screening kit (MAK295 and MAK296), which led to the discovery of more potent binders than those reported in the original study by Cieślak et al. The most effective compound reached 1.37 nM IC50 for MAO-B. The multistep filtering pipeline and most potent MAO-B inhibitors are displayed in Figure . The experimental details for determining IC50 are presented in Supporting Information B.
2.
Results of the MAO inhibitor case study. (A) A large MolPort library was screened using the QSAR models trained on ChEMBL data and then further filtered by molecular docking and visual inspection. (B) The three most potent MAO-B inhibitors identified in biochemical assays and their dose–response curves are shown.
Conclusions
We present Qsarna, an online platform designed to automate the search for novel bioactive compounds in a virtual screening campaign by effectively navigating the chemical space. This tool allows users to manage their data and share experimental findings with team members. The platform features automated machine learning and molecular docking screening tools. Moreover, the fragment-based module provides generative models for developing combinatorial libraries. The platform’s hybrid approach is particularly valuable for targets with limited experimental data, hit expansion campaigns, and reducing chemical bias through the integration of both ligand- and structure-based methodologies. We believe that Qsarna will expedite drug discovery efforts by automating computational screening procedures. Additionally, the user-friendly interface can support experiments and guide scientists who are not strongly familiar with in silico drug design procedures.
Supplementary Material
Acknowledgments
This research was supported by the Ministry of Science and Higher Education (Poland) Grant No. DWD/5/0543/2021.
Qsarna is available online to all users at https://qsarna.com. Every user has a monthly limit on the number of compounds analyzed and calculations performed, as the service is provided free of charge with limited resources. For academic projects, we are open to increasing these limits upon request.
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jcim.5c00720.
The raw data for all compounds tested in biochemical assays containing SMILES strings, percentage inhibition at three concentrations, and IC50 for the secondary screen (XLSX)
Section A includes the ROC curves of the top three models trained to predict MAO inhibition; Section B includes the experimental details for IC50 determination; Section C presents the results of the decoy-based virtual screening evaluation; Section D includes a comparison of available web-based ADMET tools; Section E depicts the software architecture; Section F shows the graphical user interface; and Section G includes the evaluation of the generative fragment-linking module (PDF)
M.C. designed and implemented the application and conducted screening experiments to validate the platform. J.Ł. designed the cloud architecture and advised on the application deployment. O.K.-K. carried out biochemical assays and analyzed the results. J.K.-T. consulted on the project and reviewed the manuscript. T.D. supervised the project and prepared the manuscript.
The authors declare no competing financial interest.
Published as part of Journal of Chemical Information and Modeling special issue “Chemical Compound Space Exploration by Multiscale High-Throughput Screening and Machine Learning”.
References
- DiMasi J. A., Grabowski H. G., Hansen R. W.. Innovation in the pharmaceutical industry: new estimates of R&D costs. J. Health Econ. 2016;47:20–33. doi: 10.1016/j.jhealeco.2016.01.012. [DOI] [PubMed] [Google Scholar]
- Kiriiri G. K., Njogu P. M., Mwangi A. N.. Exploring different approaches to improve the success of drug discovery and development projects: a review. Future J. Pharm. Sci. 2020;6:27. doi: 10.1186/s43094-020-00047-9. [DOI] [Google Scholar]
- Sabe V. T., Ntombela T., Jhamba L. A., Maguire G. E., Govender T., Naicker T., Kruger H. G.. Current trends in computer aided drug design and a highlight of drugs discovered via computational techniques: A review. Eur. J. Med. Chem. 2021;224:113705. doi: 10.1016/j.ejmech.2021.113705. [DOI] [PubMed] [Google Scholar]
- Gentile F., Yaacoub J. C., Gleave J., Fernandez M., Ton A.-T., Ban F., Stern A., Cherkasov A.. Artificial intelligence-enabled virtual screening of ultra-large chemical libraries with deep docking. Nat. Protoc. 2022;17:672–697. doi: 10.1038/s41596-021-00659-2. [DOI] [PubMed] [Google Scholar]
- Liu F., Mailhot O., Glenn I. S., Vigneron S. F., Bassim V., Xu X., Fonseca-Valencia K., Smith M. S., Radchenko D. S., Fraser J. S.. et al. The impact of library size and scale of testing on virtual screening. Nat. Chem. Biol. 2025;21:1039–1045. doi: 10.1038/s41589-024-01797-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sadybekov A. V., Katritch V.. Computational approaches streamlining drug discovery. Nature. 2023;616:673–685. doi: 10.1038/s41586-023-05905-z. [DOI] [PubMed] [Google Scholar]
- Cherkasov A., Muratov E. N., Fourches D., Varnek A., Baskin I. I., Cronin M., Dearden J., Gramatica P., Martin Y. C., Todeschini R.. et al. QSAR modeling: where have you been? Where are you going to? J. Med. Chem. 2014;57:4977–5010. doi: 10.1021/jm4004285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu Z., Pan S., Chen F., Long G., Zhang C., Yu P. S.. A comprehensive survey on graph neural networks. IEEE Trans. Neural Networks Learn. Syst. 2021;32:4–24. doi: 10.1109/TNNLS.2020.2978386. [DOI] [PubMed] [Google Scholar]
- Rong Y., Bian Y., Xu T., Xie W., Wei Y., Huang W., Huang J.. Self-supervised graph transformer on large-scale molecular data. Adv. Neural Inf. Process. Syst. 2020;33:12559–12571. [Google Scholar]
- Maziarka Ł., Majchrowski D., Danel T., Gaiński P., Tabor J., Podolak I., Morkisz P., Jastrzeębski S.. Relative molecule self-attention transformer. J. Cheminf. 2024;16:3. doi: 10.1186/s13321-023-00789-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang Z., Chen L., Zhong F., Wang D., Jiang J., Zhang S., Jiang H., Zheng M., Li X.. Graph neural network approaches for drug-target interactions. Curr. Opin. Struct. Biol. 2022;73:102327. doi: 10.1016/j.sbi.2021.102327. [DOI] [PubMed] [Google Scholar]
- Fallani A., Nugmanov R., Arjona-Medina J., Wegner J. K., Tkatchenko A., Chernichenko K.. Pretraining graph transformers with atom-in-a-molecule quantum properties for improved ADMET modeling. J. Cheminf. 2025;17:25. doi: 10.1186/s13321-025-00970-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen J., Si Y.-W., Un C.-W., Siu S. W.. Chemical toxicity prediction based on semi-supervised learning and graph convolutional neural network. J. Cheminf. 2021;13:93. doi: 10.1186/s13321-021-00570-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cremer J., Medrano Sandonas L., Tkatchenko A., Clevert D.-A., De Fabritiis G.. Equivariant graph neural networks for toxicity prediction. Chem. Res. Toxicol. 2023;36:1561–1573. doi: 10.1021/acs.chemrestox.3c00032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tong X., Liu X., Tan X., Li X., Jiang J., Xiong Z., Xu T., Jiang H., Qiao N., Zheng M.. Generative models for de novo drug design. J. Med. Chem. 2021;64:14011–14027. doi: 10.1021/acs.jmedchem.1c00927. [DOI] [PubMed] [Google Scholar]
- Fromer J. C., Coley C. W.. Computer-aided multi-objective optimization in small molecule discovery. Patterns. 2023;4:100678. doi: 10.1016/j.patter.2023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li Y., Zhang L., Liu Z.. Multi-objective de novo drug design with conditional graph generative model. J. Cheminf. 2018;10:33. doi: 10.1186/s13321-018-0287-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Polishchuk P.. CReM: chemically reasonable mutations framework for structure generation. J. Cheminf. 2020;12:28. doi: 10.1186/s13321-020-00431-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gupta R., Srivastava D., Sahu M., Tiwari S., Ambasta R. K., Kumar P.. Artificial intelligence to deep learning: machine intelligence approach for drug discovery. Mol. Diversity. 2021;25:1315–1360. doi: 10.1007/s11030-021-10217-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Danel T., Łeęski J., Podlewska S., Podolak I. T.. Docking-based generative approaches in the search for new drug candidates. Drug Discovery Today. 2023;28:103439. doi: 10.1016/j.drudis.2022.103439. [DOI] [PubMed] [Google Scholar]
- Ivanenkov Y. A., Polykovskiy D., Bezrukov D., Zagribelnyy B., Aladinskiy V., Kamya P., Aliper A., Ren F., Zhavoronkov A.. Chemistry42: an AI-driven platform for molecular design and optimization. J. Chem. Inf. Model. 2023;63:695–701. doi: 10.1021/acs.jcim.2c01191. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shen C., Song J., Hsieh C.-Y., Cao D., Kang Y., Ye W., Wu Z., Wang J., Zhang O., Zhang X.. et al. DrugFlow: an AI-driven one-stop platform for innovative drug discovery. J. Chem. Inf. Model. 2024;64:5381–5391. doi: 10.1021/acs.jcim.4c00621. [DOI] [PubMed] [Google Scholar]
- Yang K., Xie Z., Li Z., Qian X., Sun N., He T., Xu Z., Jiang J., Mei Q., Wang J.. et al. MolProphet: a One-Stop, General purpose, and AI-Based platform for the early stages of Drug Discovery. J. Chem. Inf. Model. 2024;64:2941–2947. doi: 10.1021/acs.jcim.3c01979. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mashiach E., Schneidman-Duhovny D., Andrusier N., Nussinov R., Wolfson H. J.. FireDock: a web server for fast interaction refinement in molecular docking. Nucleic Acids Res. 2008;36:W229–W232. doi: 10.1093/nar/gkn186. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mashiach E., Nussinov R., Wolfson H. J.. FiberDock: a web server for flexible induced-fit backbone refinement in molecular docking. Nucleic Acids Res. 2010;38:W457–W461. doi: 10.1093/nar/gkq373. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Murail S., De Vries S. J., Rey J., Moroy G., Tufféry P.. SeamDock: an interactive and collaborative online docking resource to assist small compound molecular docking. Front. Mol. Biosci. 2021;8:716466. doi: 10.3389/fmolb.2021.716466. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sandal M., Duy T. P., Cona M., Zung H., Carloni P., Musiani F., Giorgetti A.. GOMoDo: a GPCRs online modeling and docking webserver. PLoS One. 2013;8:e74092. doi: 10.1371/journal.pone.0074092. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kochnev Y., Ahmed M., Maldonado A. M., Durrant J. D.. MolModa: accessible and secure molecular docking in a web browser. Nucleic Acids Res. 2024;52:W498–W506. doi: 10.1093/nar/gkae406. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang Y.-L., Wang F., Shi X.-X., Jia C.-Y., Wu F.-X., Hao G.-F., Yang G.-F.. Cloud 3D-QSAR: a web tool for the development of quantitative structure-activity relationship models in drug discovery. Brief. Bioinf. 2021;22:bbaa276. doi: 10.1093/bib/bbaa276. [DOI] [PubMed] [Google Scholar]
- Kumar S., Bhowmik R., Oh J. M., Abdelgawad M. A., Ghoneim M. M., Al-Serwi R. H., Kim H., Mathew B.. Machine learning driven web-based app platform for the discovery of monoamine oxidase B inhibitors. Sci. Rep. 2024;14:4868. doi: 10.1038/s41598-024-55628-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fu L., Shi S., Yi J., Wang N., He Y., Wu Z., Peng J., Deng Y., Wang W., Wu C.. et al. ADMETlab 3.0: an updated comprehensive online ADMET prediction platform enhanced with broader coverage, improved performance, API functionality and decision support. Nucleic Acids Res. 2024;52:W422–W431. doi: 10.1093/nar/gkae236. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schyman P., Liu R., Desai V., Wallqvist A.. vNN web server for ADMET predictions. Front. Pharmacol. 2017;8:889. doi: 10.3389/fphar.2017.00889. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tian H., Ketkar R., Tao P.. ADMETboost: a web server for accurate ADMET prediction. J. Mol. Model. 2022;28:408. doi: 10.1007/s00894-022-05373-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wei Y., Li S., Li Z., Wan Z., Lin J.. Interpretable-ADMET: a web service for ADMET prediction and optimization based on deep neural representation. Bioinformatics. 2022;38:2863–2871. doi: 10.1093/bioinformatics/btac192. [DOI] [PubMed] [Google Scholar]
- Danel T., Łski J., PodlewskaIgor S., Podolak T.. MedChem Game: Gamification of Drug Design. J. Chem. Educ. 2024;101:4454–4461. doi: 10.1021/acs.jchemed.4c00253. [DOI] [Google Scholar]
- Koes D. R., Baumgartner M. P., Camacho C. J.. Lessons learned in empirical scoring with smina from the CSAR 2011 benchmarking exercise. J. Chem. Inf. Model. 2013;53:1893–1904. doi: 10.1021/ci300604z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Trott O., Olson A. J.. AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J. Comput. Chem. 2010;31:455–461. doi: 10.1002/jcc.21334. [DOI] [PMC free article] [PubMed] [Google Scholar]
- O’Boyle N. M., Banck M., James C. A., Morley C., Vandermeersch T., Hutchison G. R.. Open Babel: An open chemical toolbox. J. Cheminf. 2011;3:33. doi: 10.1186/1758-2946-3-33. [DOI] [PMC free article] [PubMed] [Google Scholar]
- RDKit: Open-source cheminformatics. http://www.rdkit.org.
- Mysinger M. M., Carchia M., Irwin J. J., Shoichet B. K.. Directory of useful decoys, enhanced (DUD-E): better ligands and decoys for better benchmarking. J. Med. Chem. 2012;55:6582–6594. doi: 10.1021/jm300687e. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Friesner R. A., Banks J. L., Murphy R. B., Halgren T. A., Klicic J. J., Mainz D. T., Repasky M. P., Knoll E. H., Shelley M., Perry J. K.. et al. Glide: a new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy. J. Med. Chem. 2004;47:1739–1749. doi: 10.1021/jm0306430. [DOI] [PubMed] [Google Scholar]
- Cieślak M., Danel T., Krzysztyńska-Kuleta O., Kalinowska-Tłuścik J.. Machine learning accelerates pharmacophore-based virtual screening of MAO inhibitors. Sci. Rep. 2024;14:8228. doi: 10.1038/s41598-024-58122-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karmaker (“Santu”) S. K., Hassan M. M., Smith M. J., Xu L., Zhai C., Veeramachaneni K.. Automl to date and beyond: Challenges and opportunities. ACM Comput. Surv. 2022;54:1–36. doi: 10.1145/3470918. [DOI] [Google Scholar]
- Rogers D., Hahn M.. Extended-connectivity fingerprints. J. Chem. Inf. Model. 2010;50:742–754. doi: 10.1021/ci100050t. [DOI] [PubMed] [Google Scholar]
- Gedeck P., Rohde B., Bartels C.. QSAR- how good is it in practice? Comparison of descriptor sets on an unbiased cross section of corporate data sets. J. Chem. Inf. Model. 2006;46:1924–1936. doi: 10.1021/ci050413p. [DOI] [PubMed] [Google Scholar]
- Durant J. L., Leland B. A., Henry D. R., Nourse J. G.. Reoptimization of MDL keys for use in drug discovery. J. Chem. Inf. Comput. Sci. 2002;42:1273–1280. doi: 10.1021/ci010132r. [DOI] [PubMed] [Google Scholar]
- Martins I. F., Teixeira A. L., Pinheiro L., Falcao A. O.. A Bayesian approach to in silico blood-brain barrier penetration modeling. J. Chem. Inf. Model. 2012;52:1686–1697. doi: 10.1021/ci300124c. [DOI] [PubMed] [Google Scholar]
- Karim A., Lee M., Balle T., Sattar A.. CardioTox net: a robust predictor for hERG channel blockade based on deep learning meta-feature ensembles. J. Cheminf. 2021;13:60. doi: 10.1186/s13321-021-00541-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang N.-N., Dong J., Deng Y.-H., Zhu M.-F., Wen M., Yao Z.-J., Lu A.-P., Wang J.-B., Cao D.-S.. ADME properties evaluation in drug discovery: prediction of Caco-2 cell permeability using a combination of NSGA-II and boosting. J. Chem. Inf. Model. 2016;56:763–773. doi: 10.1021/acs.jcim.5b00642. [DOI] [PubMed] [Google Scholar]
- Ma C.-Y., Yang S.-Y., Zhang H., Xiang M.-L., Huang Q., Wei Y.-Q.. Prediction models of human plasma protein binding rate and oral bioavailability derived by using GA-CG-SVM method. J. Pharm. Biomed. Anal. 2008;47:677–682. doi: 10.1016/j.jpba.2008.03.023. [DOI] [PubMed] [Google Scholar]
- Huang, K. ; Fu, T. ; Gao, W. ; Zhao, Y. ; Roohani, Y. H. ; Leskovec, J. ; Coley, C. W. ; Xiao, C. ; Sun, J. ; Zitnik, M. . Therapeutics Data Commons: Machine Learning Datasets and Tasks for Drug Discovery and Development. In NeurIPS Datasets and Benchmarks, 2021.
- Xiong G., Wu Z., Yi J., Fu L., Yang Z., Hsieh C., Yin M., Zeng X., Wu C., Lu A.. et al. ADMETlab 2.0: an integrated online platform for accurate and comprehensive predictions of ADMET properties. Nucleic Acids Res. 2021;49:W5–W14. doi: 10.1093/nar/gkab255. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cheng, F. ; Li, W. ; Zhou, Y. ; Shen, J. ; Wu, Z. ; Liu, G. ; Lee, P. W. ; Tang, Y. . admetSAR: a comprehensive source and free tool for assessment of chemical ADMET properties, 2012. [DOI] [PubMed]
- Cieślak, M. ; Danel, T. ; Kalinowska-Tłuścik, J. . Structure-guided fragment linking algorithm enables chemically feasible ligand design with predicted binding modes. ChemRxiv. 2025, 10.26434/chemrxiv-2025-jz8d3. [DOI]
- Gaulton A., Bellis L. J., Bento A. P., Chambers J., Davies M., Hersey A., Light Y., McGlinchey S., Michalovich D., Al-Lazikani B., Overington J. P.. ChEMBL: a large-scale bioactivity database for drug discovery. Nucleic Acids Res. 2012;40:D1100–D1107. doi: 10.1093/nar/gkr777. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Berman H. M., Westbrook J., Feng Z., Gilliland G., Bhat T. N., Weissig H., Shindyalov I. N., Bourne P. E.. The protein data bank. Nucleic Acids Res. 2000;28:235–242. doi: 10.1093/nar/28.1.235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- De Colibus L., Li M., Binda C., Lustig A., Edmondson D. E., Mattevi A.. Three-dimensional structure of human monoamine oxidase A (MAO A): relation to the structures of rat MAO A and human MAO B. Proc. Natl. Acad. Sci. U.S.A. 2005;102:12684–12689. doi: 10.1073/pnas.0505975102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Binda C., Wang J., Pisani L., Caccia C., Carotti A., Salvati P., Edmondson D. E., Mattevi A.. Structures of human monoamine oxidase B complexes with selective noncovalent inhibitors: safinamide and coumarin analogs. J. Med. Chem. 2007;50:5848–5852. doi: 10.1021/jm070677y. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Qsarna is available online to all users at https://qsarna.com. Every user has a monthly limit on the number of compounds analyzed and calculations performed, as the service is provided free of charge with limited resources. For academic projects, we are open to increasing these limits upon request.


