Skip to main content
Data in Brief logoLink to Data in Brief
. 2022 Apr 10;42:108146. doi: 10.1016/j.dib.2022.108146

Datasets comprising the quality validations of simulated protein-ligand complexes and SYBYL docking scores of bioactive natural compounds as inhibitors of Mycobacterium tuberculosis protein-targets

Sravan Kumar Miryala a, Soumya Basu a, Aniket Naha a, Reetika Debroy a, Sudha Ramaiah a, Anand Anbarasu a,⁎⁎, Saravanan Natarajan b,
PMCID: PMC9035630  PMID: 35479419

Abstract

Docking scores and simulation parameters to study the potency of natural compounds against protein targets in Mycobacterium tuberculosis (Mtb) were retrieved through molecular docking and in-silico structural investigation. The molecular docking datasets comprised 15 natural compounds, seven conventional anti-tuberculosis (anti-TB) drugs and their seven corresponding Mtb target proteins. Mtb protein targets were actively involved in translation mechanism, nucleic acid metabolism and membrane integrity. Standard structural screening and stereochemical optimizations were adopted to generate the 3D protein structures and their corresponding ligands prior to molecular docking. Force-field integration and energy minimization were further employed to obtain the proteins in their ideal geometry. Surflex-dock algorithm using Hammerhead scoring functions were used to finally produce the docking scores between each protein and the corresponding ligand(s). The best-docked complexes selected for simulation studies were subjected to topology adjustments, charge neutralizations, solvation and equilibrations (temperature, volume and pressure). The protein-ligand complexes and molecular dynamics parameter files have been provided. The trajectories of the simulated parameters such as density, pressure and temperature were generated with integrated tools of the simulation suite. The datasets can be useful to computational and molecular medicine researchers to find therapeutic leads relevant to the chemical behaviours of a specific class of compounds against biological systems. Structural parameters and energy functions provided a set of standard values that can be utilised to design simulation experiments regarding similar macromolecular interactions.

Keywords: Docking, Simulation, Natural compounds, Tuberculosis, Therapeutics

Specifications Table

Subject Subject area: Biological Sciences
Sub-section: Structural biology
Specific subject area In silico structural analyses of protein-ligand complexes with molecular dynamics based on chemical interactions
Type of data Tables and Figures
How the data were acquired We selected seven conventional drug targets in Mycobacterium tuberculosis whose 3D structures were downloaded from the Protein Data Bank (PDB) database (https://www.rcsb.org/). The structural and functional domains of the proteins were screened from UniProt (https://www.uniprot.org/), InterPro (https://www.ebi.ac.uk/interpro/) and Pfam (http://pfam.xfam.org/) databases. The ligand structures were acquired from the National centre for Biotechnology Information (NCBI) PubChem compound database (https://pubchem.ncbi.nlm.nih.gov/) and Drug bank database (https://www.drugbank.com/). The absence of a ligand structure in drug repositories was compensated by drawing the same with the ChemSketch tool followed by 3D structures generation using OpenBabel online server (http://www.cheminfo.org/Chemistry/Cheminformatics/FormatConverter/index.html). The bond integrities were validated using the Avogadro tool. The datasets comprising molecular docking scores of natural compounds (ligands) and the classical anti-Tuberculosis drugs with their respective targets were generated using the SYBYL-Surflex-docking tool kit. The best-docked complexes were subjected to topology adjustments for individual proteins and ligands using CHARMM36-Mar2019 force-field-TIP3P water-model and CGenFF (https://cgenff.umaryland.edu/) online server with default parameters respectively. The integrated simulation suite GROMACS 2018.1 was utilized. The optimized macromolecular complexes were solvated by centering in an aqueous dodecahedron box of uniform edge distance of 1.0 nm. Subsequently, requisite counter ions (Na+ or Cl) were added to balance the charges of the solvated system. Energy was minimized using integrated steepest descent algorithm for 50,000 steps and convergence-tolerance of 1000 kJ/mol nm−1. System equilibration with standard NVT (constant Number of particles, Volume and Temperature) and NPT (constant Number of particles, Pressure and Temperature) ensembles were performed for
100 ps. A constant pressure of 0 (zero) bar and temperature of 300 K with uniform density of ∼1040 kg/m3 was set for parameterization. Final molecular dynamics simulation (MDS) was carried out for 75 ns. Grace software was employed to visualize the trajectories of simulation parameters. A chronological list of commands and other associated parameter files to run simulation along with the entire MD-simulation files have been provided in the associated Mendeley dataset folder as mentioned in subsequent sections.
Data format Data is in raw and analysed form.
Description of data collection The structural chemistry data was acquired from authorised databases and repositories, followed by necessary optimisations using licensed (academic and professional) software. The reported docking scores and simulation parameters are based on universally accepted terms/standards.
Data source location Institution: Vellore Institute of Technology, Vellore
City/Town/Region: Vellore, Tamil Nadu
Country: India
Data accessibility Data is available within this article and the raw data files in excel format and other standard formats for simulation has been uploaded on public repository and datasets with active link below is provided as the supplementary data.
Link: (https://data.mendeley.com/datasets/94rh86jfpk/3),
DOI: 10.17632/94rh86jfpk.3
Related research article The presented dataset is associated with our recent publication mentioned below[1]:
S. K. Miryala, S. Basu, A. Naha, R. Debroy, S. Ramaiah, A. Anbarasu & S. Natarajan (2021). Identification of bioactive natural compounds as efficient inhibitors against Mycobacterium tuberculosis protein-targets: A molecular docking and molecular dynamics simulation study. Journal of Molecular Liquids341, 117,340. https://doi.org/10.1016/j.molliq.2021.117340

Value of the Data

  • 1)
    There are four distinct types of datasets presented in this manuscript:
    • a)
      Raw docking scores like Crash score, G-score, PMF score, d-score, Chem scoresand C-scores can help understand different chemical factors affecting ligand-protein binding.
    • b)
      Optimized protein-ligand complexes used for simulation will provide comprehensive idea about the fundamental format of input biomolecular structural complexes to run molecular dynamics simulations.
    • c)
      The optimized molecular dynamics parameter files, output files and list of commands will definitely facilitate in further analysis and performing essential dynamics studies besides guiding researchers to replicate similar experimental approaches and objectives.
    • d)
      Trajectories of optimised conditions for simulation regarding individual protein-ligand complexes can give a fair idea of the set of conditions required to simulate a specific type of biomolecule (protein) interacting with a certain class of compounds.
  • 2)

    The datasets can be of interest to bioinformaticians, computational biologists, phytochemists and molecular medicine researchers, who can figure out leads relevant to the chemical behaviours of a certain class of compounds against biological systems.

  • 3)

    The docking scores can further be exploited either based on individual compounds or collective understanding of a specific class of compounds or analysis of specific chemical parameters based on individual scoring algorithms.

  • 4)

    The compounds that were not considered as per criteria presented in the main publication can further be explored similarly against other potent targets [1].

  • 5)

    The optimised simulation parameters can readily guide researchers by providing a set of standard values that can be utilised to design simulation experiments regarding the same/similar macromolecules

  • 6)

    The simulation profiles may encourage designing of efficient therapeutic agents by providing crucial interaction dynamics values.

1. Data Description

The presented datasets depict the feasibility of certain classes of natural compounds as therapeutic candidates against Mtb protein targets. Supplementary Files-1–7 (Docking_scores) (https://data.mendeley.com/datasets/94rh86jfpk/3) portrayed the docking scores comprising crash score, G-score, PMF score, d-score, chem scores, polar, total score, consensus (C) score, number of Hydrogen-bonds of the natural compounds against Mtb targets [Arabinosyl transferase (PDB ID: 3PTY); DNA Gyrase subunit A (PDB ID: 4G3N); Ribosomal protein S1 (PDB ID: 4NNI); 2′-O-Methyltransferase (PDB ID: 5KYG); Enoyl (acyl-carrier protein) reductase (PDB ID: 5VRL); F-ATP synthase epsilon chain (PDB ID: 5YIO) and RNA polymerase subunit C (PDB ID: 5ZX3)] as compared to respective classical drugs (Ethambutol, Levofloxacin, Pyrazinamide, Capreomycin, Isoniazid, Bedaquiline and Rifampicin). The optimized protein-ligand complexes used as input files for parameretization and simulation has been provided as “MDS_input_files” (https://data.mendeley.com/datasets/94rh86jfpk/3). The molecular dynamics parameter files along with the set of commands to run MDS are available as “MDS_parameter_files” (https://data.mendeley.com/datasets/94rh86jfpk/3). Fig. 1-3 represented the quality-check parameters after equilibrating certain protein-ligand complexes comprising classical and natural compounds prior to MD run depicting the electron Density, Pressure and Temperature levels [1]. The differences in molecular weight and number of atoms were reflected upon the electron density function of the individual protein-ligand complexes. The datasets for generating the figures has been provided explicitly in the Supplementary Files-8–14 (Figure_datasets) (https://data.mendeley.com/datasets/94rh86jfpk/3). The entire simulation dataset has been segmented appropriately based on the PDB IDs of the studied protein-targets. The different input and output files generated are made available under ``MD_simulation_files'' (https://data.mendeley.com/datasets/94rh86jfpk/3).

Fig. 2.

Fig. 2:

Quality check parameters after equilibrating the respective protein-ligand complexes prior to MD run depicting the Density gradients, Pressure and Temperature levels. (A) 4G3N with Levofloxacin and Laccaic Acid, (B) 5YIO with Bedaquiline and Laccaic Acid.

Fig. 1.

Fig 1:

Quality check parameters after equilibrating the respective protein-ligand complexes prior to MD run depicting the Density gradients, Pressure and Temperature levels. (A) 3PTY with Ethambutol and Glycyrrhizin, (B) 5KYG with Capreomycin and Glycyrrhizin, (C) 5VRL with Isoniazid and Glycyrrhizin.

Fig. 3.

Fig. 3:

Quality check parameters after equilibrating the respective protein-ligand complexes prior to MD run depicting the Density gradients, Pressure and Temperature levels. (A) 4NNI with Pyrazinamide and Swertiamarin, (B) 5ZX3 with Rifampicin and Swertiamarin.

2. Experimental Design, Materials and Methods

Seven Mtb proteins were selected, which are already targets of conventional anti-TB drugs [2]. Their 3D structures were obtained from RCSB-PDB (https://www.rcsb.org/), while the functional domains/motifs were obtained from InterPro (https://www.ebi.ac.uk/interpro/), Pfam (http://pfam.xfam.org/), and UniProt (https://www.uniprot.org/) databases. The classical drugs and the natural compounds were retrieved from the DrugBank (https://www.drugbank.com/) and PubChem Compound (https://pubchem.ncbi.nlm.nih.gov/) databases. ChemSketch tool [3] was employed in the absence of ligand structures for 2D structure construction followed by generation of 3D coordinates using the OpenBabel Chemical File Format Converter (http://www.cheminfo.org/Chemistry/Cheminformatics/FormatConverter/index.html). Further, the ligands were optimised with the Avogadro tool [4]. Molecular docking between the conventional anti-TB drugs and natural compounds with their respective targets was performed usingthe SYBYL-Surflex-docking tool kit (Tripos International, USA). The protein structures were refined to remove bound ligands and water molecules, fixing side chains, adding hydrogen atoms, followed by atomic-level charge designation using AMBER7 F99 force field. Thereafter, the proteins were energy minimised by Powell's method with Tripos force field followed by Protomol generation. Hammerhead functional scorings determined the polar, crash, entropic, hydrophobic and repulsive properties to yield the docked score datasets [5,6]. The MDS analyses for 75 nanoseconds (ns) was performed for each of the best-docked complexes with GROMACS 2018.1 suite [7]. Protein topologies were generated using CHARMM36-Mar2019 force-field mechanics and TIP3P model (for water cluster), while ligand topologies were built using CGenFF (https://cgenff.umaryland.edu/) online server with default parameters. The protein structures were placed within the center of the dodecahedron box of uniform edge distance of 1.0 nm, followed by solvation and addition of requisite counter ions (Na+ or Cl) to the system. Steepest descent algorithm for 50,000 steps and convergence-tolerance of 1000 kJ/mol nm−1 were utilised for energy minimisation following system equilibration under standard NVT (constant Number of particles, Volume and Temperature) and NPT (constant Number of particles, Pressure and Temperature) ensembles for 100 ps [7], [8], [9], [10], [11], [12], [13], [14]. The trajectories of simulation parameters were visualised using Grace software (https://plasma-gate.weizmann.ac.il/Grace/).

Ethics Statements

The work did not involve any human subjects, animal experiments and data from social media platforms.

CRediT authorship contribution statement

Sravan Kumar Miryala: Data curation, Formal analysis, Visualization, Writing – original draft. Soumya Basu: Formal analysis, Visualization, Writing – original draft. Aniket Naha: Formal analysis, Visualization, Writing – original draft. Reetika Debroy: Formal analysis, Visualization, Writing – original draft. Sudha Ramaiah: Conceptualization, Methodology, Validation, Writing – review & editing. Anand Anbarasu: Funding acquisition, Conceptualization, Project administration, Supervision. Saravanan Natarajan: Funding acquisition, Conceptualization, Project administration, Supervision.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

AA, SR, MSK, SB, AN, and RD would like to thank the management of VIT for providing the necessary facilities to carry out this research work. SN would like to thank the Director, ICMR-NIRT for providing the necessary supports to carry out this research work.

The authors gratefully acknowledge the Indian Council of Medical Research (ICMR), New Delhi, Government of India for the research grant [IRIS ID: 2020–0690] and ICMR-NIRT, Chennai, for the support in meeting the article publication charges. SB and AN thank ICMR for their research fellowships.

Contributor Information

Anand Anbarasu, Email: aanand@vit.ac.in.

Saravanan Natarajan, Email: saravanan.n@nirt.res.in.

Data Availability

References

  • 1.Miryala S.K., Basu S., Naha A., Debroy R., Ramaiah S., Anbarasu A., Natarajan S. Identification of bioactive natural compounds as efficient inhibitors against mycobacterium tuberculosis protein-targets: a molecular docking and molecular dynamics simulation study. J. Mol. Liq. 2021;341 doi: 10.1016/j.molliq.2021.117340. [DOI] [Google Scholar]
  • 2.Ibeji C.U., Salleh N.A.M., Sum J.S., Ch'ng A.C.W., Lim T.S., Choong Y.S. Demystifying the catalytic pathway of mycobacterium tuberculosis isocitrate lyase. Sci. Rep. 2020;10:18925. doi: 10.1038/s41598-020-75799-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Li Z., Wan H., Shi Y., Ouyang P. Personal experience with four kinds of chemical structure drawing software: review on ChemDraw, ChemWindow, ISIS/Draw, and ChemSketch. J. Chem. Inf. Comput. Sci. 2004;44:1886–1890. doi: 10.1021/ci049794h. [DOI] [PubMed] [Google Scholar]
  • 4.Hanwell M.D., Curtis D.E., Lonie D.C., Vandermeersch T., Zurek E., Hutchison G.R. Avogadro: an advanced semantic chemical editor, visualization, and analysis platform. J. Cheminform. 2012;4:17. doi: 10.1186/1758-2946-4-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Malathi K., Ramaiah S. Molecular docking and molecular dynamics studies to identify potential OXA-10 extended spectrum β-Lactamase non-hydrolysing inhibitors for pseudomonas aeruginosa. Cell Biochem. Biophys. 2016;74:141–155. doi: 10.1007/s12013-016-0735-8. [DOI] [PubMed] [Google Scholar]
  • 6.Naha A., Vijayakumar S., Lal B., Shankar B.A. Genome sequencing and molecular characterisation of XDR acinetobacter baumannii reveal complexities in resistance : novel combination of Sulbactam-Durlobactam holds promise for therapeutic intervention. J. Cell. Biochem. 2021:1–25. doi: 10.1002/jcb.30156. [DOI] [PubMed] [Google Scholar]
  • 7.Lemkul J. From proteins to perturbed hamiltonians: a suite of tutorials for the GROMACS-2018 molecular simulation package [article v1.0] Living J. Comput. Mol. Sci. 2019;1:1–53. doi: 10.33011/livecoms.1.1.5068. [DOI] [Google Scholar]
  • 8.Basu S., Naha A., Veeraraghavan B., Ramaiah S., Anbarasu A. In silico structure evaluation of BAG3 and elucidating its association with bacterial infections through protein-protein and host-pathogen interaction analysis. J. Cell. Biochem. 2021 doi: 10.1002/jcb.29953. [DOI] [PubMed] [Google Scholar]
  • 9.Jayaraman M., Rajendra S.K., Ramadas K. Structural insight into conformational dynamics of non-active site mutations in KasA: a mycobacterium tuberculosis target protein. Gene. 2019;720 doi: 10.1016/j.gene.2019.144082. [DOI] [PubMed] [Google Scholar]
  • 10.Basu S., Veeraraghavan B., Ramaiah S., Anbarasu A. Novel cyclohexanone compound as a potential ligand against SARS-CoV-2 main-protease. Microb. Pathog. 2020;149 doi: 10.1016/j.micpath.2020.104546. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Vasudevan K., Basu S., Arumugam A., Naha A., Ramaiah S., Anbarasu A., Veeraraghavan B. Identification of potential carboxylic acid-containing drug candidate to design novel competitive NDM inhibitors: an in-silico approach comprising combined virtual screening and molecular dynamics simulation. Res. Prepr. 2021 doi: 10.21203/rs.3.rs-784343/v1. [DOI] [Google Scholar]
  • 12.Thillainayagam M., Malathi K., Ramaiah S. In-Silico molecular docking and simulation studies on novel chalcone and flavone hybrid derivatives with 1, 2, 3-triazole linkage as vital inhibitors of plasmodium falciparum dihydroorotate dehydrogenase. J. Biomol. Struct. Dyn. 2018;36:3993–4009. doi: 10.1080/07391102.2017.1404935. [DOI] [PubMed] [Google Scholar]
  • 13.Thillainayagam M., Ramaiah S., Anbarasu A. Molecular docking and dynamics studies on novel benzene sulfonamide substituted pyrazole-pyrazoline analogues as potent inhibitors of plasmodium falciparum histo aspartic protease. J. Biomol. Struct. Dyn. 2020;38:3235–3245. doi: 10.1080/07391102.2019.1654923. [DOI] [PubMed] [Google Scholar]
  • 14.Elmore D.E., Dougherty D.A. Molecular dynamics simulations of wild-type and mutant forms of the mycobacterium tuberculosis MscL channel. Biophys. J. 2001;81:1345–1359. doi: 10.1016/S0006-3495(01)75791-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement


Articles from Data in Brief are provided here courtesy of Elsevier

RESOURCES