Abstract
Motivation
The ligand/protein binding interaction is typically investigated by docking and molecular dynamics (MD) simulations. In particular, docking-based virtual screening (VS) is used to select the best ligands from database of thousands of compounds, while MD calculations assess the energy stability of the ligand/protein binding complexes. Considering the broad use of these techniques, it is of great demand to have one single software that allows a combined and fast analysis of VS and MD results. With this in mind, we have developed the Drug Discovery Tool (DDT) that is an intuitive graphics user interface able to provide structural data and physico-chemical information on the ligand/protein interaction.
Results
DDT is designed as a plugin for the Visual Molecular Dynamics (VMD) software and is able to manage a large number of ligand/protein complexes obtained from AutoDock4 (AD4) docking calculations and MD simulations. DDT delivers four main outcomes: i) ligands ranking based on an energy score; ii) ligand ranking based on a ligands’ conformation cluster analysis; iii) identification of the aminoacids forming the most occurrent interactions with the ligands; iv) plot of the ligands’ center-of-mass coordinates in the Cartesian space. The flexibility of the software allows saving the best ligand/protein complexes using a number of user-defined options.
Availability and implementation
DDT_site_1 (alternative DDT_site_2); the DDT tutorial movie is available here.
Supplementary information
Supplementary data are available at Bioinformatics online.
1 Introduction
Computer-aided drug design plays today a pivotal role in the identification of new drug candidates. Among the most successful approaches is the docking-based virtual screening (VS), which is used to predict the interaction between a ligand, typically small molecules, and its protein or nucleic acid target (hereafter protein for the sake of simplicity) (Morris et al., 2009). The resulting ligand/protein complexes are evaluated and ranked according to a scoring function, which estimates the strength of the binding interaction. However, the approximation of the docking sampling produces many false positives, thus molecular dynamics calculations (MD) are usually performed to validate the docking results (D’Amore et al., 2014; Di Leva et al., 2019). Despite the several software available for the analysis of the VS and MD results, none of them allows handling both the docking complexes and the corresponding MD trajectories. This operation requires indeed the use of different programs, resulting not user-friendly and causing delay in the drug design pipeline. The Drug Discovery Tool (DDT) presented here addresses this need allowing the even inexpert user to retrieve structural and energetics data on the ligand/protein complexes obtained from VS and MD calculations. DDT has a user-friendly graphics user interface (GUI) that is implemented in the worldwide used VMD software (Humphrey et al., 1996).
2 Materials and methods
The DDT pipeline is shown in Figure 1 and is composed of two modules: the VS package (red) and the MD package (green).
VS package—DDT processes the AutoDock4 (AD4) docking results retrieving from the AD4 .dlg file the most occurred ligand binding mode, that is the largest cluster family of the ligand bound conformations (LC), and its binding energy score (ES). In addition, DDT easily allows the identification of the residues closer to the docked ligands (NA), showing as histograms their occurrence frequency in the docked poses. DDT also plots in the Cartesian space the coordinates of the ligands’ centre-of-mass for each docked ligand (MC), which can be displayed as spheres in the protein’s binding site (SHOW MC). All these features make DDT a most useful tool in drug design, especially as explorative instrument in case in which the precise location of the binding site is unknown.
MD package—This module of DDT is designed for the post-processing analysis of trajectories of ligand/protein complexes obtained from MD simulations or related techniques (e.g. Monte Carlo and Funnel-Metadynamics). Similarly to the VS package, the residues closer to the ligand are identified from histograms accounting for their frequency of occurrence in the trajectory (NA); while the ligand’s centre-of-mass coordinates along the trajectory are plotted in the Cartesian space (MC) and displayed as spheres in the protein binding site (SHOW MC). DDT can also perform the clustering in families of the conformations assumed by the ligand along the trajectory (CLUSTER) and evaluate each cluster using an AD4-like ligand/protein binding ES. The latter is computed using the atoms’ parameters reported in the GROMACS topology file (Abraham et al., 2015). Therefore, topology files from different software (e.g. NAMD and AMBER) should be first converted in the GROMACS format before using ES. All the analysis tools can be applied to the whole trajectory or to a user-defined selection of the trajectory.
More details and a list of all the features of DDT can be found in the Supplementary Information as tutorial and manual document.
3 Use-case
We present in this section a compelling use-case of DDT in the identification of the first ligand-candidates of the shelterin protein TPP1. In this protein, seven amino acids (E168, E169, E171, R180, L183, L212 and E215), named TEL-patch (Fig. 1), have been identified as responsible for the binding of TPP1 with telomerase and thus for the activation of the telomere maintenance mechanism (Nandakumar et al., 2012). For this reason, TPP1 represents a novel and promising anticancer molecular target. On this target we carried out a VS campaign followed by MD calculations to screen over 7 million of compounds. We used DDT to analyse the results and select the best ligands for the pharmacological evaluation. The fact of using one single software (i.e. VMD) and the versatility of DDT with its several features allow performing the analysis of the VS and MD results in a straightforward way, especially considering the large amount of structures and data analysed. Specifically, we used the VS package to analyse ∼100k potential TPP1 ligands obtained from VS calculations, selecting those binding to the TEL-patch and those interacting with the seven TEL-patch residues using the MC and NA functions, respectively. The ∼20k selected compounds were further filtered based on the frequency of occurrence of their binding conformation and their binding ES, using the LC and ES values, respectively. Each of the resulting 100 complexes underwent 0.5 µs MD simulations in order to assess their energy stability. The obtained trajectories were analysed using the DDT MD package. In detail, the NA function allowed identifying the ligands interacting for longer time with the TEL-patch residues. Then, the LC command was applied to the 60 selected trajectories to perform a ligand conformation cluster analysis, and finally each cluster family was scored based on the DDT ligand/protein binding ES. 23 compounds with the lowest ES scores were finally proposed for the currently ongoing pharmacological tests.
4 Conclusions
DDT is an intuitive and user-friendly GUI to analyse the structural and energetics properties of ligand/protein complexes. The several user-defined options make DDT a versatile tool for expert and novel investigators in computer-aided drug design studies.
Funding
This work was supported by the Swiss National Science Foundation [Project No. 200021_163281], the Italian MIUR-PRIN 2017 (2017FJZZRC), the Swiss National Supercomputing Centre (CSCS) [Project ID u8] and the COST action CA15135 (MuTaLig).
Conflict of Interest: none declared.
Supplementary Material
Contributor Information
Simone Aureli, Institute of Computational Science - Center for Computational Medicine in Cardiology, Faculty of Biomedical Sciences, Università della Svizzera italiana (USI), Lugano CH-6900, Switzerland.
Daniele Di Marino, Institute of Computational Science - Center for Computational Medicine in Cardiology, Faculty of Biomedical Sciences, Università della Svizzera italiana (USI), Lugano CH-6900, Switzerland; Department of Life and Environmental Sciences, Polytechnic University of Marche, Ancona, Italy.
Stefano Raniolo, Institute of Computational Science - Center for Computational Medicine in Cardiology, Faculty of Biomedical Sciences, Università della Svizzera italiana (USI), Lugano CH-6900, Switzerland.
Vittorio Limongelli, Institute of Computational Science - Center for Computational Medicine in Cardiology, Faculty of Biomedical Sciences, Università della Svizzera italiana (USI), Lugano CH-6900, Switzerland; Department of Pharmacy, University of Naples “Federico II”, Naples I-80131, Italy.
References
- Abraham M.J. et al. (2015) GROMACS: high performance molecular simulations through multi-level parallelism from laptops to supercomputers. Software-X, 1, 19–15. [Google Scholar]
- D’Amore C. et al. (2014) Design, synthesis, and biological evaluation of potent dual agonists of nuclear and membrane bile acid receptors. J. Med. Chem., 57, 937–954. [DOI] [PubMed] [Google Scholar]
- Di Leva F.S. et al. (2019) Discovery of ((1, 2, 4-oxadiazol-5-yl) pyrrolidin-3-yl)ureidyl derivatives as selective non-steroidal agonists of the G-protein coupled bile acid receptor-1. Sci. Rep., 9, 2504.. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Morris G.M. et al. (2009) AutoDock4 and AutoDockTools4: automated docking with selective receptor flexibility. J. Comput. Chem., 30, 2785–2791. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Humphrey W. et al. (1996) VMD: visual molecular dynamics. J. Mol. Graphics, 14, 33–38. [DOI] [PubMed] [Google Scholar]
- Nandakumar J. et al. (2012) The TEL patch of telomere protein TPP1 mediates telomerase recruitment and processivity. Nature, 492, 285–289. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.