Skip to main content
Scientific Data logoLink to Scientific Data
. 2022 Apr 1;9:148. doi: 10.1038/s41597-022-01261-1

AB-DB: Force-Field parameters, MD trajectories, QM-based data, and Descriptors of Antimicrobials

Silvia Gervasoni 1, Giuliano Malloci 1,, Andrea Bosin 1, Attilio V Vargiu 1, Helen I Zgurskaya 2, Paolo Ruggerone 1
PMCID: PMC8976083  PMID: 35365662

Abstract

Antibiotic resistance is a major threat to public health. The development of chemo-informatic tools to guide medicinal chemistry campaigns in the efficint design of antibacterial libraries is urgently needed. We present AB-DB, an open database of all-atom force-field parameters, molecular dynamics trajectories, quantum-mechanical properties, and curated physico-chemical descriptors of antimicrobial compounds. We considered more than 300 molecules belonging to 25 families that include the most relevant antibiotic classes in clinical use, such as β-lactams and (fluoro)quinolones, as well as inhibitors of key bacterial proteins. We provide traditional descriptors together with properties obtained with Density Functional Theory calculations. Noteworthy, AB-DB contains less conventional descriptors extracted from μs-long molecular dynamics simulations in explicit solvent. In addition, for each compound we make available force-field parameters for the major micro-species at physiological pH. With the rise of multi-drug-resistant pathogens and the consequent need for novel antibiotics, inhibitors, and drug re-purposing strategies, curated databases containing reliable and not straightforward properties facilitate the integration of data mining and statistics into the discovery of new antimicrobials.

Subject terms: Virtual screening, Pharmaceutics, Antimicrobial resistance, Cheminformatics, Electronic structure of atoms and molecules


Measurement(s) molecular physical property analysis objective
Technology Type(s) Computer Modeling

Background & Summary

The increasing spread of antibiotic resistance in clinics is causing a global health crisis. Mobile genetic element encoding for resistance genes can be transferred among bacterial populations, leading to the need to make more efficient the discovery of new antibiotics and molecules able to improve their efficacy1,2. Gram-negative bacteria, such as Escherichia coli, Pseudomonas aeruginosa and Acinetobacter baumannii, are particularly challenging due to the presence of an outer membrane which reduces the permeability of antimicrobials and therefore their efficacy35. A major obstacle is represented by efflux pumps that act in synergy with the outer membrane ejecting a plethora of compounds with various chemical-physical properties, among which are different classes of antibiotics68. In addition, inactivating enzymes such as β-lactamases9 contribute to exacerbate the problem. To date, both academia and industry struggle to identify new antibiotic classes and optimize available compounds1015. Although several strategies have been adopted (e.g., drug repurposing16 or systematic exploitation of natural compounds17), holistic approaches able to take into account multiple factors contributing to antimicrobial resistance are lacking18. Previous works focused primarily on the role of chemical-physical properties of antimicrobial molecules in their accumulation profile, searching for general “rules”19,20. For example, O’Shea and Moser19 found that antibiotics effective towards Gram-negative bacteria are generally characterized by high molecular weight (MW, around 600 Da) and high polarity (as expressed by cLogD7.4 below 0). More recently, Richter et al.21 identified the presence of a primary amine, flexible bond number (5 or less) and globularity (describing molecular shape), as key features to predict the accumulation of antibiotics in Gram-negative bacteria22. Predictive rules of efflux inhibition and avoidance were also identified23. This latter study combined standard molecular descriptors to properties derived from structure-based analyses (e.g., interaction descriptors extracted from molecular docking), allowing for a more complete and multi-factorial view. Definition of general rules able to predict both the permeability and the activity of antimicrobial compounds can greatly benefit from the application of machine learning approaches able to speed up the drug discovery process24,25. The application of these methods requires collection of data for the learning phase26,27, that highlights the need for curated molecular databases providing ready-to-use features.

In this regards, over the years, several molecular databases containing standard descriptors (e.g., PubChem28, DrugBank29, ChEMBL30, ZINC1531) or quantum-mechanical (QM) properties3236 have been reported. These data have been extensively used in quantitative-structure-activity-relationships (QSAR) studies37,38, and recent works exploited additional information coming from molecular dynamics (MD) simulations3943, that represent an effective tool to address key structural and kinetic features of biological systems4447. However, although freely available servers for the automatic generation of force-field (FF) parameters are available4852, small molecule parameterization remains often a non-trivial task53.

Following a previous work54, we present a homogeneous database of accurate all-atom FF parameters of more than 300 antimicrobial compounds, together with μs-long MD trajectories and QM-related data (e.g., ground-state optimized geometries). We additionally provide molecular descriptors of different nature: i) classical parameters usually considered in QSAR studies (e.g., MW, atom/ring counts, LogP, …); ii) MD-derived properties (e.g., root-mean-square fluctuations, statistics of intra- and inter-molecular H-bonds, hydration-shells structure and dynamics, …); iii) QM-based parameters (e.g., energies of frontier molecular orbitals, electronic gap, electric dipole moment, …). The computational protocol adopted is schematically depicted in Fig. 1. The molecules considered, ranging in size from cycloserine (13 atoms, MW = 102.09 Da) to rifalazil (132 atoms, MW = 941.09 Da), cover 24 classes of antimicrobial compounds with different mechanisms of action, plus miscellaneous compounds (e.g., fluorescent dyes such as rhodamine 6G and HT33342). In particular, about 30% of compounds in the whole dataset are β-lactams, and 10% are inhibitors of key bacterial proteins. High MW compounds (>~1000 Da) such as polymyxin, glycol- and lipo-peptides were omitted from the selection, due to the high computational costs/convergence issues associated to the QM calculation. A schematic depiction of representative compounds showing the overall chemical variability of the sample is given in Fig. 2. The complete set of antimicrobial families and compounds is reported in Table 1.

Fig. 1.

Fig. 1

Schematic view of the computational protocol adopted to generate AB-DB. The different steps reporting some representative molecular descriptors are highlighted: molecular characterization, QM calculations, FF parameters generation, MD simulations. For further technical details see Methods section.

Fig. 2.

Fig. 2

Schematic representation of four representative antimicrobials belonging to different classes, namely (a) penicillins, (b) (fluoro)quinolones, (c) aminoglycosides, and (d) aminocoumarins. Graphics rendered with DiscoveryStudio101.

Table 1.

List of families and antimicrobial compounds included in AB-DB.

Family # Compounds
Aminocoumarins 8 chlorobiocin, novobiocin, declovanillobiocin, isovanillobiocin, novclobiocin 101, plazomicin, ribostamycin, vanillobiocin
Aminoglicosides 15 amikacin, apramycin, arbekacin, dibekacin, gentamicin C1, hygrovetine, isepamicin, kanamycin, neomycin, netilmicin, paromomycin, sisomicin, spectinomycin, streptomycin, tobramycin
Anthracenediones 2 mitoxantrone, pixantrone
Anthracyclines 4 daunorubicin, doxorubicin, epirubicin, idarubicin
β-lactamase inhibitors 11 avibactam, bal0029880, clavulanic acid, durlobactam, enmetazobactam, nacubactam, relebactam, sulbactam, tazobactam, thienamycin, zidebactam
Carbapenems 14 biapenem, doripenem, ertapenem, faropenem1, imipenem, LK-157, meropenem, olivanic acid, panipenem, razupenem, ritipenem, sanfetrinem, tebipenem, tomopenem
Cephalosporins 39 cefaclor, cefadroxil, cefalonium, cefamandole nafate, cefamandole sodium, cefazolin, cefdinir, cefditoren, cefepime, cefetamet, cefiderocol, cefixime, cefmenoxime, cefmetazole, cefonicid, cefoperazone, ceforanide, cefotaxime, cefotetan, cefoxitin, cefpiramide, cefpirome, cefpodoxime, cefprozil, cefsulodin, ceftaroline, ceftazidime, ceftibuten, ceftizoxime, ceftobiprole, ceftriaxone, cefuroxime, cephalexin, cephaloridine, cephalotin, cephapirin, cephradine, loracarbef, nitrocefin
DHFR inhibitors 7 brodimoprim, epiroprim, iclaprim (R), iclaprim (S), tetroxoprim, trimethoprim, triclosan
Efflux pumps inhibitors 9 D13-9001, MBX2319, MBX2931, MBX3132, MBX3135, NMP, PAβN, amitriptyline, chlorpromazine
Quinolones 36 cinoxacin, ciprofloxacin, clinafloxacin, danofloxacin, delafloxacin, difloxacin, DX-619, enoxacin, enrofloxacin, fleroxacin, flumequine, garenoxacin, gatifloxacin, gemifloxacin, grepafloxacin, levofloxacin, lomefloxacin, marbofloxacin, moxifloxacin, nadifloxacin, nalidixic acid, norfloxacin, ofloxacin, orbifloxacin, oxolinic acid, pazufloxacin, pefloxacin, pipemidic acid, prulifloxacin, rosoxacin, rufloxacin, sarafloxacin, sitafloxacin, sparfloxacin, temafloxacin, trovafloxacin
Fusidanes 2 fusidica acid, helvolic acid
Lincosamides 4 clindamycin, desalicetin, lincomycin, pirlimycin
Macrolides 9 azithromycin, cethromycin, clarithromycin, dirithromycin, erythromycin, modithromycin, roxithromycin, spiramycin, telithromycin
Monobactams 10 aztreonam, BAL19764, BAL30072, carumonam, gloximonam, nacubactam, nocardicin, oximonam, pirazmonam, tigemonam
Nitrofurans 12 furazolidone, nifurfoline, nifurquinazol, nifurtoinol, nitrofurantoin, nitrovin, nifuratel, nifuroxazide, nifurtimox, nifurzide, nitrofurazone, tinidazole
Nucleosides 4 A-500359A, A-503083E, capuramycin, puromycin
Oxacephem 2 latamoxef, flomoxef
Oxazolidinones 8 contezolid, eperezolid, linezolid, posizolid, radezolid, ranbezolid, sutezolid, tedizolid
Penicillins 20 6-APA, amoxicillin, ampicillin, azlocillin, carbenicillin, cloxacillin, dicloxacillin, epicillin, flucloxacillin, hetacillin, methicillin, mezlocillin, nafcillin, oxacillin, penicillin G, penicillin V, piperacillin, sulbenicillin, temocillin, ticarcillin
Phenicols 5 azidamfenicol, chloramphenicol, florfenicol, tevenel, thiamphenicol
Rifamycins 2 rifalazil, rifampicin
Streptogramins 1 dalfopristin
Sulphonamides 20 sulfabenzamide, sulfacetamide, sulfachlorpyridazine, sulfadiazine, sulfadimethoxine, sulfaguanidine, sulfamerazine, sulfameter, sulfamethazine, sulfamethizole, sulfamethoxazole, sulfamethoxypyridazine, sulfamonomethoxine, sulfamylon, sulfanitran, sulfaphenazole, sulfapyridine, sulfaquinoxaline, sulfathiazole, sulfisoxazole
Tetracyclines 10 chlortetracycline, demeclocycline, doxycycline, meclocycline, methacycline, minocycline, omadacycline, oxytetracycline, tetracycline, tigecycline
Others 20 acriflavine, cycloserine, dapsone, deoxycholate, enterobactin, ethambutol, ethidium, ethionamide, fosfomycin, halicin, HT33342, isoniazid, metronidazole, propidium, pseudomonic acid A, rhodamine 6G, taurocholate, tetraphenylphosphonium, WCK-4234

Boldface labels identify molecules for which two protonation states were considered. Molecules for which the PubChem 3D structure is not available are highlighted in italic. Column “#“ reports the total number of compounds for each family. 1Faropenem belongs to penems family, in AB-DB it is included in the carbapenems family.

To the best of our knowledge AB-DB is unique in supplying homogeneously-derived properties of antimicrobial compounds. The accurate FF parameters can be reused for further MD simulations of compounds either alone or interacting with their macromolecular target(s). The MD trajectories can be exploited for ligand- or structure-based studies, in particular for molecular docking. The successful application of this technique requires the knowledge of the bio-active conformation of ligands5558, that is not always found by classical searching algorithms59,60. Our curated, homogeneous and not straightforward properties can feed machine learning models towards the discovery of new antimicrobials61,62. Input/output files are also supplied to ensure data reproducibility. In the near future we plan to update AB-DB including more compounds, covering additional antimicrobial classes.

Methods

For each antimicrobial compound we obtained the 3D structure data file (.sdf format) from the PubChem database, except for 13 compounds for which the 3D conformation is not available. In those cases, marked in italic in Table 1, the starting structure was taken from the ChEMBL database, or from available X-ray structures. We then used the ChemAxon’s Marvin suite of programs63 to calculate the dominant protonation states at physiological pH. For known uncertain cases (e.g., tetracyclines), for which several micro-species with similar population were predicted, the choice has been driven by available experimental data on pKa values. The comparison between the experimental and calculated pKa values of the ionizable groups of a representative set of challenging molecules is reported in Table 2. The large relative errors (even exceeding 100%) were expected for these classes of antimicrobials since pKa determination in Marvin ChemAxon is based on molecular charge distribution, and these molecules are characterized by a complex electronic structure (e.g., with several possible resonance states). For each molecule we then proceeded with QM calculations and FF generation followed by all-atom MD simulations. Properties generated in each step will be referred to as QSAR, MD, and QM descriptors, respectively. The three types of analysis performed in this study are graphically exemplified in Fig. 3 for a test-case molecule and described in details below.

Table 2.

Average experimental and predicted (boldface) pKa values. Relative percentage error of computed pKa value is reported in parentheses.

Quinolones
pKa 1 = carboxylic acid pKa 2 = piperazinyl amine
ciprofloxacin103 6.29, 5.6 (11.7%) 8.26, 8.8 (6.2%)
difloxacin103 5.98, 5.5 (7.7%) 7.75, 7.0 (9.8%)
enoxacin103 6.57, 5.3 (19.2%) 7.17, 8.7 (21.1%)
enrofloxacin103 6.01, 5.6 (7.7%) 7.93, 7.2 (8.7%)
fleroxacin103 5.78, 5.5 (5.7%) 7.86, 6.5 (17.5%)
lomefloxacin103 5.78, 5.5 (5.7%) 8.74, 8.8 (0.5%)
norfloxacin103 5.97, 5.6 (6.6%) 8.42, 8.8 (4.2%)
ofloxacin103 6.04, 5.3 (11.5%) 8.09, 6.7 (5.0%)
pefloxacin103 6.45, 5.6 (13.9%) 7.84, 7.0 (10.5%)
sarafloxacin [104] 5.99, 5.6 (7.4%) 7.84, 8.8 (2.7%)
levofloxacin104 5.6, 5.4 (3.95%) 7.9, 6.7 (15.2%)
Sulfonamides
pKa 1 = amine pKa 2 = amide
sulfamerazine103 2.15, 2.0 (7.0%) 6.82, 7.0 (2.4%)
sulfamethazine103 2.24, 2.0 (10.7%) 7.51, 7.0 (7.0%)
sulfathiazole103 2.05, 2.0 (0.2%) 7.14, 6.9 (2.9%)
Tetracyclines
pKa1 = OH(C3) pKa2 = OH(C12) pKa3 = OH(C12)
chlortetracycline103 3.64, 7.0 (92.6%) 6.57, 8.8 (33.6%) 8.64, 6.18 (28.4%)
demeclocycline103 3.37, 2.6 (22.6%) 7.36, 8.1 (9.5%) 9.44, 6.3 (32.9%)
doxycycline103 3.02, 7.3 (142.4%) 7.97, 8.2 (2.3%) 9.15, 5.8 (36.6%)
meclocycline103 4.05, 8.0 (97.3%) 6.87, 5.8 (15.7%) 9.59, 7.0 (26.8%)
oxytetracycline103 3.32, 7.3 (118.4%) 7.02, 8.1 (14.9%) 8.74, 7.0 (19.7%)
tetracycline103 3.17, 7.2 (127.5%) 6.79, 7.8 (29.9%) 9.07, 6.2 (31.4%)
Aminoglycosides
pKa1 = N1 pKa2 = N3 pKa3 = N2’ pKa4 = N6’ pKa5 = N3”
amikacin105 9.89, 9.6 (2.8%) 7.64, 9.0 (17.5%) 8.81, 8.2 (6.9%) 8.05, 8.4 (4.6%)
gentamicin C1106 7.67, 9.5 (23.9%) 6.19, 9.0 (45.4%) 7.4, 10.1 (36.5%) 9.86, 8.5 (13.8%) 8.78, 7.41 (15.6%)
netilmicin105 8.15, 9.2 (12.9%) 6.52, 8.3 (27.3%) 8.15, 7.2 (11.7%) 9.32, 8.8 (5.6%) 8.48, 9.7 (14.4%)
sisomicin105 7.43, 9.2 (23.8%) 6.21, 8.3 (33.7%) 8.01, 7.4 (7.6%) 9.31, 8.8 (5.5%) 8.50, 9.6 (11.5%)
tobramycin105 7.56, 9.5 (25.7%) 6.70, 7.4 (9.9%) 7.75, 8.1 (5.0%) 9.11, 9.0 (1.0%) 7.71, 8.6 (11.7%)
Cephalosporins
pKa = Amine
loracarbef107 6.84, 7.2 (5.4%)
DHFR inhibitors
pKa = diaminopyrimidine
trimethoprim108 7.50, 7.2 (4.5%)
Rifamycins
pKa1= 4-hydroxy pKa2= 3-piperazine nitrogen
rifampicin109 1.7, 10.0 (485.9%) 7.9, 7.1 (9.8%)

Fig. 3.

Fig. 3

Graphical exemplification of the three types of analysis performed to generate AB-DB. Puromycin molecule has been selected as a test-case. QSAR: number of rotatable bonds (left), QM and FF: atomic partial charges (center), from negative (red) to positive (blue) values, MD: conformations extracted from MD trajectories (right). Graphics rendered with PyMOL102.

Quantum-mechanical calculations and force-field generation

The 3D structures of the molecules downloaded from the PubChem or ChEMBL databases (see above) underwent quantum-chemical calculations at the Density Functional Theory (DFT) level64, using the Gaussian16 package65. We employed the hybrid B3LYP functional66, in conjunction with the split-valence 6–31G** Gaussian basis-set67. The combination B3LYP/6–31G** represents a good compromise between accuracy and computational cost and is widely used for small molecules6870. In all cases we disabled molecular symmetry (Symmetry = None), adopted restrictive convergence criteria for self-consistent-field iterations (10−8 Ha, SCF(Conver = 8)), and used a pruned (99,590) grid (Int = UltraFine) for numerical integration. For the few cases for which convergence criteria were not reached, geometry optimization was first performed with a smaller basis-set (6–31G) and converged geometry and molecular orbitals were used as a starting point for the subsequent B3LYP/6–31G** step. For each compound we optimized the ground-state structure employing the Polarizable Continuum Model71 to mimic the effect of water solvent (SCRF = (PCM,Solvent = Water)), particularly to avoid formation of strong intra-molecular H-bonds. We then performed full vibrational analyses obtaining real frequencies in all cases, thus confirming the geometries obtained to be global minima. We processed the output of Gaussian16 with GaussSum72 to extract molecular orbital data. On the optimized geometry we then performed B3LYP/6–31G** single-point energy calculations in vacuum to generate the atomic partial charges fitting the molecular electrostatic potential. We used the Merz-Kollman scheme73 to construct a grid of points around the molecule under the constraint of reproducing the overall electric dipole moment of the molecule (Pop = (ESP,Dipole,Regular)). The two-step restrained electrostatic potential (RESP) method74 implemented in the Antechamber package75 was used to generate atomic partial charges at the DFT level, instead of the automatic AM1-BCC charges76. This step enabled the generation of the FF files using the General Amber Force Field 2 (GAFF2)77. In a single case, namely the siderophore enterobactin loaded with Fe(III)78, FF files were obtained using the metal center parameter builder module79 of the Amber18 package80, slightly modified accordingly to the QM settings described above.

Molecular dynamics simulations

All-atom MD simulations were performed in explicit water solution using Amber18. Systems were solvated within a box of TIP3P water model81 and K+/Cl counter ions82, to reach an ionic concentration of 0.1 M, using the program tleap of Amber1880. GAFF2 parameters obtained as described above were adopted for antimicrobial compounds. All systems underwent an energy minimization, a heating followed by a cooling phase, and a short productive dynamics to relax the simulation box. Finally the production 1 μs-long MD simulation was performed, under the NPT ensemble (1 Atm and 310 K) using the isotropic Berendsen barostat83 and the Langevin thermostat84. Further details on MD settings can be found in ref. 54.

Descriptors generation

From the output of QM and MD simulations we extracted all molecular descriptors (~80 in total for each compound, see list in Table S1). Most QSAR descriptors were computed on the QM optimzed geometries using the calculator plugin of the Marvin ChemAxon program63. Given the importance of octanol/water partition coefficient in drug design85, we provide an additional estimate of this parameter by means of the XLOGP3 program86. Furthermore, for each compound we derived the molecular properties associated with the “entry rules”, a series of guidelines that have been recently proposed to increase small-molecule accumulation in Gram-negative bacteria21. QM-based properties were obtained from the Gaussian16 output files of the implicit-solvent geometry optimization. Isotropic and anisotropic polarizabilities were derived from the polarizability tensor according to ref. 87. We additionally provided the molecular dipole moment in vacuum consistent with the atomic partial charges of the FF files, computed as described above. From the all-atom MD simulations we obtained structural and dynamical features by means of the CPPTRAJ program88. First and second water shells were extracted using a lower (upper) cutoff of 3.4 (5.0) Å. For the analysis of intra- and inter-molecular H-bonds we adopted angle and distance cutoffs of 135° (donor-hydrogen-acceptor angle) and 3.5 Å (donor-acceptor), respectively89. The number and population of structural clusters were determined using a hierarchical agglomerative algorithm90 and the molecule root-mean-squaredeviation (RMSD) value as a metric. To evaluate atomic root-mean-square fluctuations (RMSF) we used the utility g_rmsf of the GROMACS package91. During the MD runs we also monitored three morphology descriptors related to the gyration tensor, i.e., asphericity, acylindricity, and kappa2, as implemented in the PLUMED plugin92. Asphericity and acylindricity give a measure of the deviation of the mass distribution from spherical and cylindrical symmetry, respectively; the relative shape anisotropy kappa2 is limited between 0 and 1 and reflects both symmetry and dimensionality93. The minimal projection area (MPA) is the minimum of the circular areas projected perpendicularly to the principal axes of inertia of the molecule, calculated based on the atomic van der Waals radii (Å). The dynamical evolution of the MPA have been determined with the combined use of Open Babel94 and ChemAxon’s calculator plugin63.

Data Records

AB-DB is available on figshare95. The computed molecular descriptors are given in the comma separated file all-descriptors.csv. A compressed TAR archive for each family is provided (e.g., carbapenems.tgz). In turn, every archive contains sub-folders named after the compound and the net charge considered in the calculations (e.g., carbapenems/ertapenem_-1/). For 34 molecules the two protonation states most populated at pH = 7.4 were considered (see Methods section). In these cases two folders per compound are reported, with different values of the net charge (e.g., quinolones/ciprofloxacin_0/ where the compound is considered as zwitterionic, and quinolones/ciprofloxacin_-1/ where the nitrogen atom of the piperazine ring is considered in its neutral form). Each compound folder contains a 2D sketch of the molecule (2d.png), and a total of 20 files distributed into three sub-directories reporting QM (QM/), FF (FF/), and MD (MD/) data. Figure 4 shows a schematic representation of the database structure describing the path of all files provided.

Fig. 4.

Fig. 4

Schematic representation of the AB-DB structure reporting filenames of each sub-directory.

Quantum-mechanical data

QM/ folders contain files derived from QM calculations (see Quantum-mechanical calculations and force-field generation section). In details, the opt-freq.com and opt-freq.log are the input and output files of the Gaussian16 geometry optimization and frequency analysis in implicit solvent. The minimization steps are collected in the optimization.xyz file and the final optimized structure is given in structure data file format as optimized.sdf. This file, generated with Open Babel94 from the corresponding .xyz file and carefully checked manually, is also provided for reproducibility purposes since it has been used to compute QSAR descriptors. We also collected the electronic structure and the harmonic vibrational frequencies into electronic.dat and vibrational.dat files, respectively. The elec-pot.com and elec-pot.log are respectively the input and output files of the Gaussian16 single-point energy calculation in vacuum, performed to derive atomic partial charges. The resulting electrostatic potential file is elec-pot.dat.

Force-field data

For each compound we supply in the corresponding FF/ folder the mol.mol2 and Amber mol.prep files, containing the optimized structure of the molecule with RESP partial charges. The Amber force-field modification file mol.frcmod with all parameters not included in the GAFF2 is also provided. For reproducibility purposes we make available the Amber parameter/topology mol_solv.parm7, and coordinate/restart mol_solv.rst7 files used to perform the MD simulation in explicit solvent. The corresponding mol_solv.pdb file generated using the ambpdb program80 is also provided.

Molecular dynamics data

MD/ folders store the μs-long MD trajectories performed in explicit water solution (see Molecular dynamics simulations section) in the file trajectory.pdb (100000 frames). The representatives of the ten most populated clusters extracted from the trajectory are given in clusters.pdb, and their corresponding fraction in clusters.dat. The statistics of intra- and inter-molecular H-bonds are collected in hbonds-intra.dat and hbonds-inter.dat, respectively.

Technical Validation

AB-DB is built making use of the different computational steps detailed above: molecular characterization, QM calculations, FF generation, MD simulations, and extraction of physico-chemical descriptors. Concerning the starting configurations used for the subsequent steps, we carefully checked the protonation state of all compounds at physiological pH, paying particular attention to uncertain cases with two major populated species (see Table 2 and Methods section for details). In details, for these ambiguous cases, we searched the literature for experimental values of pKa, that were consistently used as reference throughout the class. As for the QM calculations, the DFT level of theory adopted is routinely used and has proven to be reliable for small organic molecules, providing FF parameters compatible with available FFs for macromolecules50. B3LYP/6–31G** calculations represent a good compromise between accuracy and computational cost96. Therefore, no further validation is here provided for QM calculations. In the following we present a thorough justification of the reliability of our data through the comparison with experiments.

Descriptors validation

Calculation of classical parameters reporting the topological properties of molecules, such as the number of atoms or the count of aliphatic bonds, is quite straightforward. Most popular databases (e.g. PubChem, DrugBank) indeed employ ChemAxon’s tools to automatically compute these properties. In AB-DB, we likewise used the same programs to obtain the QSAR descriptors. However, for LogP, which is known to be a key feature for antimicrobial penetration kinetics61,97,98, we exploited another widely used method (XLOGP3, see Methods section). Note that accurate prediction of LogP is a well-known challenge in computational chemistry, and is also common to find severe disagreement through experimental results obtained for the same compound99,100. In order to assess the quality of our predictions, we collected available experimental LogP values for a subset of molecules. Table 3 compares the experimental data, falling in the range [−1.69, 5.15], with the computed ones, highlighting the differences between the two methods. In most cases the two predicted values are similar and agree with the experimental LogP. However, as expected, ambiguous situations were also found. Methicillin, for instance, was well predicted by XLOGP3 (computed 1.96 vs. experimental ~1.90) while cxcalc yielded a poor estimation (computed 0.79). On the contrary, the latter program agrees with experiments for lomefloxacin (computed −0.43 vs. experimental −0.47), whereas the former failed (computed 0.27).

Table 3.

Experimental logP (LogP exp) and LogP values computed by XLOGP3 and cxcalc.

Family Compound LogP exp XLOGP3 (%ERR) LogP cxcalc (%ERR)
aminocoumarins chlorobiocin 5.15110 5.98 (16) 4.94 (4)
novobiocin 3.1111 3.96 (28) 3.26 (5)
streptogramins dalfopristin 2.5799 2.23 (13) 1.58 (39)
macrolides telithromycin 2.1112 4.16 (98) 5.05 (140)
penicillins dicloxacill 2.91113 3.78 (30) 2.91 (0)
cloxacillin 2.43113 3.15 (30) 2.30 (5)
oxacillin 2.31113 2.53 (10) 1.70 (26)
penicillin G 1.70113 1.95 (15) 1.08 (36)
methicillin 1.896114 1.96 (3) 0.79 (58)
temocillin 1.3999 2.4 (73) 1.2 (14)
quinolones delafloxacin 2.6399 2.76 (5) 2.56 (3)
difloxacin 0.84100 1.49 (77) 1.75 (108)
grepafloxacin 0.66115 0.72 (9) 0.07 (89)
sitafloxacin −0.16116 0.76 (575) −0.17 (6)
lomefloxacin −0.47117 0.27 (157) −0.43 (9)
cephalosporins cefpiramide 0.9599 0.84 (12) −0.97 (202)
cefotetan 0.3199 0.64 (106) −0.38 (223)
ceforanide −1.3599 −1.87 (39) −3.18 (136)
lincosamides clindamycin 0.78118 1.76 (126) 1.04 (33)
β-lactamase inhibitors tazobactam −1.6999 −1.33 (21) −1.4 (17)

Relative error (%) is reported in parentheses.

Validation of force-field parameters and molecular dynamics trajectories

To assess the reliability of the FF generated for all compounds we computed the RMSD between the QM B3LYP/6–31G** optimized geometry and the molecular mechanics minimum-energy structure obtained with the GAFF2 parameters of the database. Table 4 shows the good agreement between the two sets of structures, differing on average by less than 1 Å, with an overall mean value of 0.5 ± 0.1 Å. The registered low RMSDs prove the accuracy of the FF parameters presented in AB-DB and used for the MD simulations.

Table 4.

Average RMSDs and standard deviations (Å) between the DFT and molecular mechanics optimized geometries for the classes of antimicrobial compounds included in AB-DB.

Family Mean ± STD
aminocoumarins 0.4 ± 0.1
aminoglycosides 0.3 ± 0.1
anthracyclines 0.4 ± 0.1
beta-lactamase-inhibitors 0.3 ± 0.2
carbapenems 0.5 ± 0.2
cephalosporins 0.6 ± 0.2
dhfr-inhibitors 0.4 ± 0.2
efflux-pumps-inhibitors 0.4 ± 0.2
fusidanes 0.4 ± 0.1
lincosamides 0.4 ± 0.2
macrolides 0.4 ± 0.1
monobactams 0.6 ± 0.1
nucleosides 0.6 ± 0.2
oxacephem 0.5 ± 0.1
oxazolidinones 0.4 ± 0.1
penicillins 0.5 ± 0.2
phenicols 0.5 ± 0.1
quinolones 0.3 ± 0.1
rifamycins 0.5 ± 0.1
streptogramins 0.6 ± 0.0
sulphonamides 0.6 ± 0.3
tetracyclines 0.4 ± 0.3
others 0.3 ± 0.2

To give a measure of the quality of MD simulations, we compared the representative conformations extracted from MD trajectories (cluster representatives) of selected compounds with their 3D experimental structure available on the Protein Data Bank, in complex with biological targets. When multiple experimental structures were available for the same compound, we considered the one with the highest resolution. The total number of 85 experimental structures collected are listed in Table S2, reporting the corresponding PDB code and the weighted average RMSDs (<RMDS>w), obtained using cluster populations as weights. The average values associated to selected families are also given. The mean value of <RMDS>w considering all families is 1.8 ± 0.8 Å, with the highest and smallest value reached by aminocoumarins (3.6 ± 0.4 Å) and tetracyclines (0.9 ± 0.2 Å), respectively. As expected, bigger and more flexible molecules give rise to higher <RMDS>w, whereas smaller and more rigid compounds show lower values. Overall, the performed MD simulations based on GAFF2 parameters appear to be able to sample molecular conformations found in available experimental structures.

As emphasized in previous works54,60, MD simulations enable to go beyond a static picture of molecules, providing ranges of properties accounting for their dynamical nature and their impact on biological activity. Prominent examples are represented by MD simulations performed to differentiate the most active inhibitors of ERK2 kinase39 and Ptch1 multidrug efflux transporter58.

Supplementary information

Table S1 (57.7KB, pdf)
Table S2 (66.2KB, pdf)

Acknowledgements

This work was supported by National Institute of Allergy and Infectious Diseases/NIH grant no. R01AI136799 and “Progetti biennali d’Ateneo Finanziati dalla Fondazione di Sardegna - annualità 2020.” We thank Dr. Giovanni Serra (University of Cagliari) for technical assistance with computational resources.

Author contributions

G.M., A.B., A.V.V., H.I.Z. and P.R. conceived the project, S.G. and G.M. performed the calculations, S.G., G.M. and A.B. analysed the results, curated the data and implemented the database. All authors reviewed the manuscript.

Code availability

QSAR calculations were performed using the ChemAxon’s Marvin suite of programs, version 21.1463. For QM calculations we used the Gaussian16 package, revision A.0365. The Amber18 package80 was used for MD simulations and FF generation. We used simple bash scripts to iteratively extract descriptors from outputs and generate AB-DB data-files.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

The online version contains supplementary material available at 10.1038/s41597-022-01261-1.

References

  • 1.Partridge SR, Kwong SM, Firth N, Jensen SO. Mobile genetic elements associated with antimicrobial resistance. Clin. Microbiol. Rev. 2018;31:e00088–17. doi: 10.1128/CMR.00088-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Eichenberger, E. M. & Thaden, J. T. Epidemiology and mechanisms of resistance of extensively drug resistant Gram-negative bacteria. Antibiotics8, 10.3390/antibiotics8020037 (2019). [DOI] [PMC free article] [PubMed]
  • 3.Breijyeh Z, Jubeh B, Karaman R. Resistance of Gram-negative bacteria to current antibacterial agents and approaches to resolve it. Molecules. 2020;25:1340. doi: 10.3390/molecules25061340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Im W, Khalid S. Molecular simulations of Gram-negative bacterial membranes come of age. Annu. Rev. Phys. Chem. 2020;71:171–188. doi: 10.1146/annurev-physchem-103019-033434. [DOI] [PubMed] [Google Scholar]
  • 5.Klobucar K, Brown ED. New potentiators of ineffective antibiotics: Targeting the Gram-negative outer membrane to overcome intrinsic resistance. Curr. Opin. Chem. Biol. 2022;66:102099. doi: 10.1016/j.cbpa.2021.102099. [DOI] [PubMed] [Google Scholar]
  • 6.Zgurskaya HI, Walker JK, Parks JM, Rybenkov VV. Multidrug efflux pumps and the two-faced Janus of substrates and inhibitors. Acc. Chem. Res. 2021;54:930–939. doi: 10.1021/acs.accounts.0c00843. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Zgurskaya HI, Malloci G, Chandar B, Vargiu AV, Ruggerone P. Bacterial efflux transporters’ polyspecificity - a gift and a curse? Curr. Opin. Microbiol. 2021;61:115–123. doi: 10.1016/j.mib.2021.03.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Alav I, et al. Structure, assembly, and function of tripartite efflux and type 1 secretion systems in Gram-negative bacteria. Chem. Rev. 2021;121:5479–5596. doi: 10.1021/acs.chemrev.1c00055. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Tooke CL, et al. β-lactamases and β-lactamase inhibitors in the 21st century. J. Mol. Biol. 2019;431:3472–3500. doi: 10.1016/j.jmb.2019.04.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Blaskovich MAT, Zuegg J, Elliott AG, Cooper MA. Helping chemists discover new antibiotics. ACS Infect. Dis. 2015;1:285–287. doi: 10.1021/acsinfecdis.5b00044. [DOI] [PubMed] [Google Scholar]
  • 11.Zuegg J, Hansford KA, Elliott AG, Cooper MA, Blaskovich MAT. How to stimulate and facilitate early stage antibiotic discovery. ACS Infect. Dis. 2020;6:1302–1304. doi: 10.1021/acsinfecdis.0c00163. [DOI] [PubMed] [Google Scholar]
  • 12.Lewis K. The science of antibiotic discovery. Cell. 2020;181:29–45. doi: 10.1016/j.cell.2020.02.056. [DOI] [PubMed] [Google Scholar]
  • 13.Miethke M, et al. Towards the sustainable discovery and development of new antibiotics. Nat. Rev. Chem. 2021;5:726–749. doi: 10.1038/s41570-021-00313-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Klug, D. M. et al. There is no market for new antibiotics: this allows an open approach to research and development. Wellcome Open Res. 6, 10.12688/wellcomeopenres.16847.1 (2021). [DOI] [PMC free article] [PubMed]
  • 15.Durand-Reville TF, et al. Rational design of a new antibiotic class for drug-resistant infections. Nature. 2021;597:698–702. doi: 10.1038/s41586-021-03899-0. [DOI] [PubMed] [Google Scholar]
  • 16.Farha MA, Brown ED. Drug repurposing for antimicrobial discovery. Nat. Microbiol. 2019;4:565–577. doi: 10.1038/s41564-019-0357-1. [DOI] [PubMed] [Google Scholar]
  • 17.Oselusi SO, Christoffels A, Egieyeh SA. Cheminformatic characterization of natural antimicrobial products for the development of new lead compounds. Molecules. 2021;26:3970. doi: 10.3390/molecules26133970. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Ropponen H-K, Richter R, Hirsch AK, Lehr C-M. Mastering the Gram-negative bacterial barrier - chemical approaches to increase bacterial bioavailability of antibiotics. Adv. Drug Deliv. Rev. 2021;172:339–360. doi: 10.1016/j.addr.2021.02.014. [DOI] [PubMed] [Google Scholar]
  • 19.O’Shea R, Moser HE. Physicochemical properties of antibacterial compounds: Implications for drug discovery. J. Med. Chem. 2008;51:2871–2878. doi: 10.1021/jm700967e. [DOI] [PubMed] [Google Scholar]
  • 20.Brown DG, May-Dracka TL, Gagnon MM, Tommasi R. Trends and exceptions of physical properties on antibacterial activity for Gram-positive and Gram-negative pathogens. J. Med. Chem. 2014;57:10144–10161. doi: 10.1021/jm501552x. [DOI] [PubMed] [Google Scholar]
  • 21.Richter MF, et al. Predictive compound accumulation rules yield a broad-spectrum antibiotic. Nature. 2017;545:299–304. doi: 10.1038/nature22308. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Haloi N, et al. Rationalizing the generation of broad spectrum antibiotics with the addition of a positive charge. Chem. Sci. 2021;12:15028–15044. doi: 10.1039/D1SC04445A. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Mehla J, et al. Predictive rules of efflux inhibition and avoidance in Pseudomonas aeruginosa. mBio. 2021;12:e02785–20. doi: 10.1128/mBio.02785-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Carracedo-Reboredo P, et al. A review on machine learning approaches and trends in drug discovery. Comput. Struct. Biotechnol. J. 2021;19:4538–4558. doi: 10.1016/j.csbj.2021.08.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Stokes JM, et al. A deep learning approach to antibiotic discovery. Cell. 2020;180:688–702.e13. doi: 10.1016/j.cell.2020.01.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Tkatchenko A. Machine learning for chemical discovery. Nat. Commun. 2020;11:4125. doi: 10.1038/s41467-020-17844-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Schneider P, et al. Rethinking drug design in the artificial intelligence era. Nat. Rev. Drug Discov. 2020;19:353–364. doi: 10.1038/s41573-019-0050-3. [DOI] [PubMed] [Google Scholar]
  • 28.Kim S, et al. Pubchem in 2021: new data content and improved web interfaces. Nucleic Acids Res. 2021;49:D1388–D1395. doi: 10.1093/nar/gkaa971. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Wishart DS, et al. Drugbank: a comprehensive resource for in silico drug discovery and exploration. Nucleic Acids Res. 2006;34:D668–72. doi: 10.1093/nar/gkj067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Gaulton A, et al. The ChEMBL database in 2017. Nucleic Acids Res. 2017;45:D945–D954. doi: 10.1093/nar/gkw1074. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Sterling T, Irwin JJ. Zinc 15 - ligand discovery for everyone. J. Chem. Inf. Model. 2015;55:2324–2337. doi: 10.1021/acs.jcim.5b00559. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Montavon, G. et al. Machine learning of molecular electronic properties in chemical compound space. New J. Phys. 15, 10.1088/1367-2630/15/9/095003 (2013).
  • 33.Ramakrishnan, R., Dral, P. O., Rupp, M. & von Lilienfeld, O. A. Quantum chemistry structures and properties of 134 kilo molecules. Sci. Data1, 10.1038/sdata.2014.221 (2014). [DOI] [PMC free article] [PubMed]
  • 34.Isert, C., Atz, K., Jiménez-Luna, J. & Schneider, G. Qmugs: Quantum mechanical properties of drug-like molecules. arxivhttps://arxiv.org/abs/2107.00367 (2021). [DOI] [PMC free article] [PubMed]
  • 35.NIST computational chemistry comparison and benchmark database. NIST Standard Reference Database Number 101 http://cccbdb.nist.gov/, 10.18434/T47C7Z (2020).
  • 36.Ropo, M., Schneider, M., Baldauf, C. & Blum, V. First-principles data set of 45,892 isolated and cation-coordinated conformers of 20 proteinogenic amino acids. Sci. Data3, 10.1038/sdata.2016.9 (2016). [DOI] [PMC free article] [PubMed]
  • 37.Ahamad S, Islam A, Ahmad F, Dwivedi N, Hassan MI. 2/3d-QSAR, molecular docking and md simulation studies of FtsZ protein targeting benzimidazoles derivatives. Comput. Biol. Chem. 2019;78:398–413. doi: 10.1016/j.compbiolchem.2018.12.017. [DOI] [PubMed] [Google Scholar]
  • 38.Zin PPK, Borrel A, Fourches D. Benchmarking 2d/3d/MD-QSAR models for imatinib derivatives: How far can we predict? J. Chem. Inf. Model. 2020;60:3342–3360. doi: 10.1021/acs.jcim.0c00200. [DOI] [PubMed] [Google Scholar]
  • 39.Ash J, Fourches D. Characterizing the chemical space of ERK2 kinase inhibitors using descriptors computed from molecular dynamics trajectories. J. Chem. Inf. Model. 2017;57:1286–1299. doi: 10.1021/acs.jcim.7b00048. [DOI] [PubMed] [Google Scholar]
  • 40.Fourches D, Ash J. 4d-Quantitative structure-activity relationship modeling: making a comeback. Expert Opin. Drug Discov. 2019;14:1227–1235. doi: 10.1080/17460441.2019.1664467. [DOI] [PubMed] [Google Scholar]
  • 41.Jamal S, Grover A, Grover S. Machine learning from molecular dynamics trajectories to predict caspase-8 inhibitors against Alzheimer’s disease. Front. Pharmacol. 2019;10:780. doi: 10.3389/fphar.2019.00780. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Esposito C, Wang S, Lange UEW, Oellien F, Riniker S. Combining machine learning and molecular dynamics to predict p-glycoprotein substrates. J. Chem. Inf. Model. 2020;60:4730–4749. doi: 10.1021/acs.jcim.0c00525. [DOI] [PubMed] [Google Scholar]
  • 43.Bak A. Two decades of 4d-QSAR: A dying art or staging a comeback? Int. J. Mol. Sci. 2021;22:5212. doi: 10.3390/ijms22105212. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Boehr DD, Nussinov R, Wright PE. The role of dynamic conformational ensembles in biomolecular recognition. Nat. Chem. Biol. 2009;5:789–796. doi: 10.1038/nchembio.232. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Baron R, McCammon AJ. Molecular recognition and ligand association. Annu. Rev. Phys. Chem. 2013;64:151–175. doi: 10.1146/annurev-physchem-040412-110047. [DOI] [PubMed] [Google Scholar]
  • 46.Zivanovic S, et al. Exploring the conformational landscape of bioactive small molecules. J. Chem. Theory Comput. 2020;16:6575–6585. doi: 10.1021/acs.jctc.0c00304. [DOI] [PubMed] [Google Scholar]
  • 47.Salo-Ahen, O. M. H. et al. Molecular dynamics simulations in drug discovery and pharmaceutical development. Processes9, 10.3390/pr9010071 (2021).
  • 48.Jo S, Kim T, Iyer V, Im W. CHARMM-GUI: A web-based graphical user interface for CHARMM. J. Comput. Chem. 2008;29:1859–1865. doi: 10.1002/jcc.20945. [DOI] [PubMed] [Google Scholar]
  • 49.Vanommeslaeghe K, Raman EP, MacKerell AD. J. Automation of the CHARMM general force field (CGenFF) ii: Assignment of bonded parameters and partial atomic charges. J. Chem. Inf. Model. 2012;52:3155–3168. doi: 10.1021/ci3003649. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Dupradeau F-Y, et al. R.E.DD.B.: A database for RESP and ESP atomic charges, and force field libraries. Nucleic Acids Res. 2008;36:D360–D367. doi: 10.1093/nar/gkm887. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Bryce, R. Amber parameter database. http://amber.manchester.ac.uk/.
  • 52.Morado J, et al. Paramol: A package for automatic parameterization of molecular mechanics force fields. J. Chem. Inf. Model. 2021;61:2026–2047. doi: 10.1021/acs.jcim.0c01444. [DOI] [PubMed] [Google Scholar]
  • 53.Lin, F.-Y. & MacKerell, A. D. J. Force Fields for Small Molecules (Humana, New York, 2019). [DOI] [PMC free article] [PubMed]
  • 54.Malloci G, et al. A database of force-field parameters, dynamics, and properties of antimicrobial compounds. Molecules. 2015;20:13997–14021. doi: 10.3390/molecules200813997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Auer J, Bajorath J. Distinguishing between bioactive and modeled compound conformations through mining of emerging chemical patterns. J. Chem. Inf. Model. 2008;48:1747–1753. doi: 10.1021/ci8001793. [DOI] [PubMed] [Google Scholar]
  • 56.Bai F, et al. Bioactive conformational generation of small molecules: A comparative analysis between force-field and multiple empirical criteria based methods. BMC Bioinform. 2010;11:545. doi: 10.1186/1471-2105-11-545. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Anighoro A, de la Vega de León A, Bajorath J. Predicting bioactive conformations and binding modes of macrocycles. J. Comput. Aided Mol. Des. 2016;30:841–849. doi: 10.1007/s10822-016-9973-5. [DOI] [PubMed] [Google Scholar]
  • 58.Kovachka S, et al. Molecular insights into the Patched1 drug efflux inhibitory activity of Panicein A Hydroquinone: a computational study. Phys. Chem. Chem. Phys. 2021;23:8013–8022. doi: 10.1039/D0CP05719C. [DOI] [PubMed] [Google Scholar]
  • 59.Ebejer J-P, Morris GM, Deane CM. Freely available conformer generation methods: How good are they? J. Chem. Inf. Model. 2012;52:1146–1158. doi: 10.1021/ci2004658. [DOI] [PubMed] [Google Scholar]
  • 60.Malloci, G., Serra, G., Bosin, A. & Vargiu, A. V. Extracting conformational ensembles of small molecules from molecular dynamics simulations: Ampicillin as a test case. Computation4, 10.3390/computation4010005 (2016).
  • 61.Cooper SJ, et al. Molecular properties that define the activities of antibiotics in Escherichia coli and Pseudomonas aeruginosa. ACS Infect. Dis. 2018;4:1223–1234. doi: 10.1021/acsinfecdis.8b00036. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Zhao S, et al. Defining new chemical space for drug penetration into gram-negative bacteria. Nat. Chem. Biol. 2020;16:1293–1302. doi: 10.1038/s41589-020-00674-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.ChemAxon. Marvin version 21.14. http://www.chemaxon.com (2021).
  • 64.Kohn W. Nobel lecture: Electronic structure of matter - wave functions and density functionals. Rev. Mod. Phys. 1999;71:1253–1266. doi: 10.1103/RevModPhys.71.1253. [DOI] [Google Scholar]
  • 65.Frisch, M. J. et al. Gaussian 16, revision a.03. Gaussian Inc. Wallingford CT (2016).
  • 66.Becke AD. Density-functional thermochemistry. iii. The role of exact exchange. J. Chem. Phys. 1993;98:5648–5652. doi: 10.1063/1.464913. [DOI] [Google Scholar]
  • 67.Pople, J. Quantum chemical models (Nobel Lecture). Angew. Chem. Int. Ed. 38, 1894–1902, 10.1002/(SICI)1521-3773(19990712)38:13/14<1894::AID-ANIE1894>3.0.CO;2-H (1999). [DOI] [PubMed]
  • 68.Tirado-Rives J, Jorgensen WL. Performance of B3LYP density functional methods for a large set of organic molecules. J. Chem. Theory Comput. 2008;4:297–306. doi: 10.1021/ct700248k. [DOI] [PubMed] [Google Scholar]
  • 69.Cleydson B. R. d. S. et al. Pinheiro Evaluation of Quantum Chemical Methods and Basis Sets Applied in the Molecular Modeling of Artemisinin. Comput. Mol. Biol. 3, 66–79, 10.4236/cmb.2013.33009 (2013).
  • 70.Narbe Mardirossian and Martin Head-Gordon Thirty years of density functional theory in computational chemistry: an overview and extensive assessment of 200 density functionals. Mol. Phys. 115, 2315-2372, 10.1080/00268976.2017.1333644 (2017).
  • 71.Tomasi J, Mennucci B, Cammi R. Quantum mechanical continuum solvation models. Chem. Rev. 2005;105:2999–3094. doi: 10.1021/cr9904009. [DOI] [PubMed] [Google Scholar]
  • 72.O’Boyle N, Tenderholt A, Langner K. cclib: a library for package-independent computational chemistry algorithms. J. Comput. Chem. 2008;29:839–845. doi: 10.1002/jcc.20823. [DOI] [PubMed] [Google Scholar]
  • 73.Singh UC, Kollman PA. An approach to computing electrostatic charges for molecules. J. Comput. Chem. 1984;5:129–145. doi: 10.1002/jcc.540050204. [DOI] [Google Scholar]
  • 74.Bayly CI, Cieplak P, Cornell W, Kollman PA. A well-behaved electrostatic potential based method using charge restraints for deriving atomic charges: the RESP model. J. Phys. Chem. 1993;97:10269–10280. doi: 10.1021/j100142a004. [DOI] [Google Scholar]
  • 75.Wang J, Wang W, Kollman PA, Case DA. Automatic atom type and bond type perception in molecular mechanical calculations. J. Mol. Graphics Modell. 2006;25:247–260. doi: 10.1016/j.jmgm.2005.12.005. [DOI] [PubMed] [Google Scholar]
  • 76.Jakalian A, Jack DB, Christopher I. Bayly Fast, efficient generation of high-quality atomic charges. AM1-BCC model: II. Parameterization and validation. J. Comput. Chem. 2009;23:1623–1641. doi: 10.1002/jcc.10128. [DOI] [PubMed] [Google Scholar]
  • 77.Wang J, Wolf RM, Caldwell JW, Kollman PA, Case DA. Development and testing of a general amber force field. J. Comput. Chem. 2004;25:1157–1174. doi: 10.1002/jcc.20035. [DOI] [PubMed] [Google Scholar]
  • 78.Moynié, L. et al. The complex of ferric-enterobactin with its transporter from Pseudomonas aeruginosa suggests a two-site model. Nat. Commun. 10, 10.1038/s41467-019-11508-y (2019). [DOI] [PMC free article] [PubMed]
  • 79.Li P, Merz KM., Jr. MCPB.py: A Python based metal center parameter builder. J. Chem. Inf. Model. 2016;56:599–604. doi: 10.1021/acs.jcim.5b00674. [DOI] [PubMed] [Google Scholar]
  • 80.Case, D. et al. AMBER. University of California and San Francisco 2018.
  • 81.Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 1983;79:926–935. doi: 10.1063/1.445869. [DOI] [Google Scholar]
  • 82.Joung IS, Cheatham TE. Determination of alkali and halide monovalent ion parameters for use in explicitly solvated biomolecular simulations. J. Phys. Chem. B. 2008;112:9020–9041. doi: 10.1021/jp8001614. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Berendsen HJC, Postma JPM, van Gunsteren WF, DiNola A, Haak JR. Molecular dynamics with coupling to an external bath. J. Chem. Phys. 1984;81:3684–3690. doi: 10.1063/1.448118. [DOI] [Google Scholar]
  • 84.Loncharich RJ, Brooks BR, Pastor RW. Langevin dynamics of peptides: The frictional dependence of isomerization rates of n-acetylalanyl-n′-methylamide. Biopolymers. 1992;32:523–535. doi: 10.1002/bip.360320508. [DOI] [PubMed] [Google Scholar]
  • 85.Isik M, et al. Assessing the accuracy of octanol-water partition coefficient predictions in the SAMPL6 part II logP Challenge. J. Comput. Aided Mol. Des. 2020;34:335–370. doi: 10.1007/s10822-020-00295-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Cheng T, et al. Computation of octanol-water partition coefficients by guiding an additive model with knowledge. J. Chem. Inf. Model. 2007;47:2140–2148. doi: 10.1021/ci700257y. [DOI] [PubMed] [Google Scholar]
  • 87.Sauer, S. P. A. Molecular Electromagnetism: A Computational Chemistry Approach (OUP Oxford, 2011).
  • 88.Roe DR, Cheatham TE. PTRAJ and CPPTRAJ: Software for processing and analysis of molecular dynamics trajectory data. J. Chem. Theory Comput. 2013;9:3084–3095. doi: 10.1021/ct400341p. [DOI] [PubMed] [Google Scholar]
  • 89.Nag A, Chakraborty D, Chandra A. Effects of ion concentration on the hydrogen bonded structure of water in the vicinity of ions in aqueous NaCl solutions. J. Chem. Sci. 2008;120:71–77. doi: 10.1007/s12039-008-0009-0. [DOI] [Google Scholar]
  • 90.Shao J, Tanner SW, Thompson N, Cheatham TE. Clustering molecular dynamics trajectories: 1. Characterizing the performance of different clustering algorithms. J. Chem. Theory Comput. 2007;3:2312–2334. doi: 10.1021/ct700119m. [DOI] [PubMed] [Google Scholar]
  • 91.van der Spoel, D. et al. Gromacs User Manual version 3.0. Nijenborgh 4, 9747 AG Groningen, The Netherlands (2001).
  • 92.Bonomi M, et al. Plumed: A portable plugin for free-energy calculations with molecular dynamics. Comput. Phys. Comm. 2009;180:1961–1972. doi: 10.1016/j.cpc.2009.05.011. [DOI] [Google Scholar]
  • 93.Theodorou DN, Suter UW. Shape of unperturbed linear polymers: polypropylene. Macromolecules. 1985;18:1206–1214. doi: 10.1021/ma00148a028. [DOI] [Google Scholar]
  • 94.O’Boyle N, et al. Open Babel: An open chemical toolbox. J. Cheminf. 2011;3:33. doi: 10.1186/1758-2946-3-33. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Gervasoni S. 2022. AB-DB: Force-Field parameters, MD trajectories, QM-based data, and Descriptors of Antimicrobials. figshare. [DOI] [PMC free article] [PubMed]
  • 96.Rodrigues-Oliveira AF, Ribeiro FWM, Cervi G, Correra TC. Evaluation of common theoretical methods for predicting infrared multiphotonic dissociation vibrational spectra of intramolecular hydrogen-bonded ions. ACS Omega. 2018;3:9075–9085. doi: 10.1021/acsomega.8b00815. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Davis TD, Gerry CJ, Tan DS. General platform for systematic quantitative evaluation of small-molecule permeability in bacteria. ACS Chem. Biol. 2014;9:2535–2544. doi: 10.1021/cb5003015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Blokhina SV, Sharapova AV, Ol’khovich MV, Volkova TV, Perlovich GL. Solubility, lipophilicity and membrane permeability of some fluoroquinolone antimicrobials. Eur. J. Pharm. Sci. 2016;93:29–37. doi: 10.1016/j.ejps.2016.07.016. [DOI] [PubMed] [Google Scholar]
  • 99.Stratton CF, Newman DJ, Tan DS. Cheminformatic comparison of approved drugs from natural product versus synthetic origins. Bioorg. Med. Chem. Lett. 2015;25:4802–4807. doi: 10.1016/j.bmcl.2015.07.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Dorwald, F. Z. Lead Optimization for Medicinal Chemists (Wiley-VCH Verlag & Co. KGaA, Boschstr. 12, 69469 Weinheim, Germany, 2012).
  • 101.Systémes, B. D. Biovia discovery studio 2021. San Diego: Dassault Systémes (2021).
  • 102.The PyMOL molecular graphics system, version 1.3r1. The PyMOL Molecular Graphics System, Version 1.3, Schrödinger, LLC (2010).
  • 103.Babic S, Horvat AJ, Mutavdžc-Pavlovic D, Kaštelan-Macan M. Determination of pKa values of active pharmaceutical ingredients. Trends Analyt. Chem. 2007;26:1043–1061. doi: 10.1016/j.trac.2007.09.004. [DOI] [Google Scholar]
  • 104.Hirano T, et al. Mechanism of the inhibitory effect of zwitterionic drugs (levofloxacin and grepafloxacin) on carnitine transporter (OCTN2) in Caco-2 cells. Biochim. Biophys. Acta Biomembr. 2006;1758:1743–1750. doi: 10.1016/j.bbamem.2006.07.002. [DOI] [PubMed] [Google Scholar]
  • 105.Alkhzem AH, Woodman TJ, Blagbrough IS. Individual pKa values of tobramycin, kanamycin b, amikacin, sisomicin, and netilmicin determined by multinuclear NMR spectroscopy. ACS Omega. 2020;5:21094–21103. doi: 10.1021/acsomega.0c02744. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Lesniakac W, Larena JM, Harrisd WR, Pecoraroc VL, Schachtab J. An isocratic separation of underivatized gentamicin components, 1H NMR assignment and protonation pattern. Carbohydr. Res. 2003;338:2853–2862. doi: 10.1016/j.carres.2003.08.005. [DOI] [PubMed] [Google Scholar]
  • 107.Topal BD, Golcu A, Ozkan SA. Electrochemical investigation and determination of the antibacterial loracarbef by voltammetric methods. Anal. Lett. 2009;42:689–705. doi: 10.1080/00032710802678637. [DOI] [Google Scholar]
  • 108.Cocco L, et al. Protonated state of methotrexate, trimethoprim, and pyrimethamine bound to dihydrofolate reductase. Arch. Biochem. Biophys. 1983;226:567–577. doi: 10.1016/0003-9861(83)90326-0. [DOI] [PubMed] [Google Scholar]
  • 109.Rifampin. Tuberculosis88, 151–154, 10.1016/S1472-9792(08)70024-6 (2008).
  • 110.Song J, et al. Discovery of prenyltransferase inhibitors with in vitro and in vivo antibacterial activity. ACS Infect. Dis. 2020;6:2979–2993. doi: 10.1021/acsinfecdis.0c00472. [DOI] [PubMed] [Google Scholar]
  • 111.Ibrahim, T. S. et al. Novel 1,2,4-oxadiazole-chalcone/oxime hybrids as potential antibacterial DNA gyrase inhibitors: Design, synthesis, ADMET prediction and molecular docking study. Bioorg. Chem. 111, 10.1016/j.bioorg.2021.104885 (2021). [DOI] [PubMed]
  • 112.Sebastiano MR, et al. Impact of dynamically exposed polarity on permeability and solubility of chameleonic drugs beyond the rule of 5. J. Med. Chem. 2018;61:4189–4202. doi: 10.1021/acs.jmedchem.8b00347. [DOI] [PubMed] [Google Scholar]
  • 113.Tsuji A, Kubo O, Miyamoto E, Yamana T. Physicochemical properties of β-lactam antibiotics: Oil-water distribution. J. Pharm. Sci. 1977;66:1675–79. doi: 10.1002/jps.2600661205. [DOI] [PubMed] [Google Scholar]
  • 114.Rashid U, et al. Structure based medicinal chemistry-driven strategy to design substituted dihydropyrimidines as potential antileishmanial agents. Eur. J. Med. Chem. 2016;115:230–244. doi: 10.1016/j.ejmech.2016.03.022. [DOI] [PubMed] [Google Scholar]
  • 115.Ito T, Yano I, Masuda S, Hashimoto Y. & ichi Inu, K. Distribution characteristics of levofloxacin and grepafloxacin in rat kidney. Pharm. Res. 1999;16:534–539. doi: 10.1023/A:1018871029244. [DOI] [PubMed] [Google Scholar]
  • 116.Rankovic Z. CNS physicochemical property space shaped by a diverse set of molecules with experimentally determined exposure in the mouse brain. J. Med. Chem. 2017;60:5943–5954. doi: 10.1021/acs.jmedchem.6b01469. [DOI] [PubMed] [Google Scholar]
  • 117.Klosinska-Szmurlo E, et al. Physicochemical properties of lomefloxacin, levofloxacin, and moxifloxacin relevant to the biopharmaceutics classification system. Acta Chim. Slov. 2014;61:827–34. [PubMed] [Google Scholar]
  • 118.Gunaydin H. Probabilistic approach to generating MPOs and its application as a scoring function for CNS drugs. ACS Med. Chem. Lett. 2016;7:89–93. doi: 10.1021/acsmedchemlett.5b00390. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Citations

  1. Gervasoni S. 2022. AB-DB: Force-Field parameters, MD trajectories, QM-based data, and Descriptors of Antimicrobials. figshare. [DOI] [PMC free article] [PubMed]

Supplementary Materials

Table S1 (57.7KB, pdf)
Table S2 (66.2KB, pdf)

Data Availability Statement

QSAR calculations were performed using the ChemAxon’s Marvin suite of programs, version 21.1463. For QM calculations we used the Gaussian16 package, revision A.0365. The Amber18 package80 was used for MD simulations and FF generation. We used simple bash scripts to iteratively extract descriptors from outputs and generate AB-DB data-files.


Articles from Scientific Data are provided here courtesy of Nature Publishing Group

RESOURCES