Skip to main content
PLOS One logoLink to PLOS One
. 2024 Nov 25;19(11):e0308969. doi: 10.1371/journal.pone.0308969

Auto QSAR-based active learning docking for hit identification of potential inhibitors of Plasmodium falciparum Hsp90 as antimalarial agents

Thato Matlhodi 1, Lisema Patrick Makatsela 1, Tendamudzimu Harmfree Dongola 2, Mthokozisi Blessing Cedric Simelane 3, Addmore Shonhai 2, Njabulo Joyfull Gumede 4, Fortunate Mokoena 1,*
Editor: Yash Gupta5
PMCID: PMC11588265  PMID: 39585817

Abstract

Malaria which is mainly caused by Plasmodium falciparum parasite remains a devastating public health concern, necessitating the need to develop new antimalarial agents. P. falciparum heat shock protein 90 (Hsp90), is indispensable for parasite survival and a promising drug target. Inhibitors targeting the ATP-binding pocket of the N-terminal domain have anti-Plasmodium effects. We proposed a de novo active learning (AL) driven method in tandem with docking to predict inhibitors with unique scaffolds and preferential selectivity towards PfHsp90. Reference compounds, predicted to bind PfHsp90 at the ATP-binding pocket and possessing anti-Plasmodium activities, were used to generate 10,000 unique derivatives and to build the Auto-quantitative structures activity relationships (QSAR) models. Glide docking was performed to predict the docking scores of the derivatives and > 15,000 compounds obtained from the ChEMBL database. Re-iterative training and testing of the models was performed until the optimum Kennel-based Partial Least Square (KPLS) regression model with a regression coefficient R2 = 0.75 for the training set and squared correlation prediction Q2 = 0.62 for the test set reached convergence. Rescoring using induced fit docking and molecular dynamics simulations enabled us to prioritize 15 ATP/ADP-like design ideas for purchase. The compounds exerted moderate activity towards P. falciparum NF54 strain with IC50 values of ≤ 6μM and displayed moderate to weak affinity towards PfHsp90 (KD range: 13.5–19.9μM) comparable to the reported affinity of ADP. The most potent compound was FTN-T5 (PfN54 IC50:1.44μM; HepG2/CHO cells SI≥ 29) which bound to PfHsp90 with moderate affinity (KD:7.7μM), providing a starting point for optimization efforts. Our work demonstrates the great utility of AL for the rapid identification of novel molecules for drug discovery (i.e., hit identification). The potency of FTN-T5 will be critical for designing species-selective inhibitors towards developing more efficient agents against malaria.

Introduction

Sub-Saharan Africa, especially marginalized populations, has recorded over 90% of the 608, 000 deaths caused by malaria in 2022 [1]. Despite the presence of a malaria vaccine, the emergence of resistant strains indicates a threat to the gains made from decades of implementing malaria control strategies [2, 3]. It has also been suggested that changes in climate conditions such as increased temperatures and heavy rainfall may result in an increased mosquito population, putting more people at risk of contracting malaria [1]. Countries such as Rwanda [4] and East Asia [5] have begun to report the spread and dissemination of first-line treatment options artemisinin-tolerant P. falciparum strains emphasizing the urgent need to develop potent and reliable anti-parasitic drugs. Future antimalarials should inhibit Plasmodium infection and growth, potentially counteracting the likelihood of rapid development of drug resistance. Innovative approaches could explore validated drug target proteins implicated in drug resistance, such as PfHsp90 [6].

In P. falciparum, Hsp90 plays a crucial role during parasite adaptation and development, from the vector and host environment, which are often accompanied by abrupt increases in temperature amongst other stresses [7, 8]. PfHsp90 is expressed and essential for the parasite’s survival at all erythrocytic [912] and hepatic stages of development [13]. Distinct expression profiles of PfHsp90 have been correlated to poor disease prognosis in P. falciparum-infected individuals [14], making it a prime drug target. PfHsp90 is a dimeric protein composing of the N-terminal domain (NTD), middle domain and c-terminal domains respectively serving as binding sites of ATP, client proteins and co-chaperones [15, 16]. Most Hsp90 inhibitors are small molecules, which compete with ATP for binding the NTD. In the literature these small molecules, including geldanamycin (GDA), 17-AAG, 17-DMAG and PUH-71, were shown to be effective antimalarial agents by inhibiting the activities of PfHsp90 [1719]. Treatment of parasite cultures with GDA prevented their growth from the late ring to trophozoite stages of development [17]. In vivo studies demonstrated that in P. berghei-infected mice models, parasite load was reduced by treatment with GDA [17] and harmine [20]. PfHsp90 was identified as one of the artemisinin-based combination therapies’ (ACT’s) resistance-conferring alleles [6] and has been suggested to interact directly with chloroquine resistance transporter (CRT) protein [18]. Thus, targeting PfHsp90 for malaria treatment could be highly profitable as conserved proteins are less prone to variation under selection pressure, possibly overcoming the hurdle of drug resistance [21].

The high sequence conservation between the druggable ATP binding pocket of PfHsp90 versus human Hsp90 poses a risk with regards to human Hsp90 off-target activity. However, harmine was demonstrated to interact selectively with PfHsp90 using Arg98, which is substituted with Lys112 in human Hsp90 [20]. Following this discovery, a combination of rational design and microwave-assisted synthesis were used to derive several analogues of harmine [22, 23], leading the observation thattetrahydro-β-carboline possess moderate activity against P. falciparum [22, 24]. Comparative structural analyses conducted by Wang and colleagues (2014) revealed that a glycine hinge loop lining, found in the NTD of both chaperones, adopt different conformations. In PfHsp90, these residues include Gly118, Gly121, and Gly123 [25] adopting a straight conformation which enables better accommodation of some compounds. Therefore, selectivity towards PfHsp90 can be obtained by targeting the Arg98 interaction and cushioning especially hydrophobic segments of compounds into the glycine rich region [25]. These observations led Wang and colleagues (2016) to use structure guided design to identify amino alcohol-carbazoles as selective inhibitors of PfHsp90 [26]. Since then, other studies have also suggested new compound scaffolds displaying anti-Plasmodium activity through targeting PfHsp90 [27, 28]. We have recently used pharmacophore models to suggest four chemically diverse inhibitors targeting PfHsp90 [29], suggesting the benefit of rational design in drug discovery efforts aimed at selective inhibition of the molecular chaperone.

Artificial intelligence and machine learning/active learning (ML/AL)-based virtual screening methods have proven effective in designing candidate compounds that have advanced to clinical trials.ls [30]. A selective A2A receptor antagonist, for instance, is currently undergoing phase 1b/2 studies to be used in patients with solid tumors with elevated adenosine levels [29]. To identify hits and lead compounds, quantitative structure-activity relationship models, or Auto-QSAR, have been employed [30]. Auto-QSAR is essentially an application of artificial intelligence and machine learning. Previous studies have implemented auto-QSAR to identify candidate hits for the etiology of Alzheimer’s disease (AD) [33] and and Chagas disease [31].

Furthermore, the utility of AL models that incorporate docking scores in streamlining the drug discovery process have been demonstrated. In a recent study, regression-based AL models were used to rapidly prioritize compounds in large-scale docking, enhancing efficiency and reducing costs [31]. AL models can effectively select promising candidates for experimental validation and contribute to accelerating the development of therapeutic agents. These instances underline the potential of docking-based AL models in drug discovery. By leveraging the predictive capabilities of docking scores within AL frameworks, researchers can enhance the drug discovery pipeline’s speed and efficacy.

With this in mind, we aimed to use AL models to generate new chemical entities targeting PfHsp90. As such, compound 10 (S1B Fig in S1 File) from the literature [28] was used as a reference compound; its choice was motivated by its demonstrated potency towards P. falciparum and low cytotoxicity towards human cells. Optimization of compound 10 was undertaken by reaction-based enumeration to generate AL models enriched by molecular docking. The models were trained using analogues of compound 10 and a subset of de novo compounds were docked against PfHsp90 and re-iteratively tested for activity. Candidate hit compounds generated were evaluated for whole cell potency towards P. falciparum NF54 drug-sensitive strain, and the safety profiles of the compounds were established by cytotoxicity assays using mammalian Chinese hamster cells (CHO) and human hepatocellular carcinoma (HepG2). Select compounds which exhibited anti-plasmodial activity at the asexual blood stages were then evaluated for their binding affinity towards PfHsp90.

Methods

In silico analyses

Schrödinger Release 2022–1 was used for all molecular modelling calculations on Maestro (v12.9) [32], as a graphical user interface (GUI). Several modules in Maestro for ligand preparation, protein preparation, docking, QSAR modelling and docking post-processing were used. The Fig 1 shows the in silico process flow-chart of the steps undertaken to design and generate potential novel inhibitors of PfHsp90.

Fig 1. In silico flow diagram showing methodology used on how the reference compounds 10 and 7 were subjected to induced fit docking (IFD) using human Hsp90 (PDB code: 1YET) and PfHsp90 (PDB code: 3K60).

Fig 1

Followed by the selection of compound 10 which was subsequently subjected to pathfinder reaction-based enumeration to yield 10, 000 unique design ideas. These design ideas and compounds from the ChEMBL database were further used to train 13 Auto-QSAR models through reiterative training and testing and scoring of the compounds using standard precision (SP) Glide docking. The adsorption, distribution, metabolism, excretion, and toxicity (ADME/T) were used to filter the compounds, followed by rescoring by IFD and molecular dynamics simulations (MDS) to understand the stability of the protein-ligand complex and conformational changes induced upon ligand binding. The relative binding free energies were estimated by molecular mechanics with generalized born surface area (MM-GB/SA).

Protein preparation and receptor grid generation

The three-dimensional (3D) structures of the NTDs of PfHsp90 bound to ADP (PDB code: 3K60-2.3 Å; Chain A; [33], human Hsp90 in complex with geldanamycin (GDA) (PDB code: 1YET-1.9 Å; [34]) and human Hsp90 bound to ADP (PDB code: 1YBQ-1.5 Å; [35]) were obtained from the protein data bank (http://www.rcsb.org/; [36]). The protein preparation wizard module of Schrödinger Maestro [37] was used to add hydrogen atoms, correct bond orders, disulphide bonds, filling in of missing/incorrect side chains and loops of each protein. The system was minimized with Root-Mean-Square Deviation (RMSD) convergence of 0.30Å using an optimized potential for liquid simulations 4 (OPLS4) force field [32]. Grid files at the centroid of ADP/GDA were generated using Receptor Grid Generation module for subsequent Glide docking [38].

Ligand preparation

In a previous study conducted by Everson and colleagues (2021), compounds 7 (P. falciparum 3D7a IC50 = 0.98 ± 0.654 μM) and compound 10 (P. falciparum D7a I C50 = 1.11 ± 0.969 μM) were shown to be near sub-molar potency towards chloroquine-sensitive P. falciparum strain. Due to their proven potency, these compounds were then used as reference compounds for this study. Therefore, the 2D structure of each compound was drawn using 2D sketcher on Maestro and converted to low-energy 3D structures with tautomeric states using the Ligprep module [39]. The input structures were optimized using OPLS4 force field, generating possible ionization states at a target pH of 7.4 +/- 2.0 using Epik. Stereoisomers were computed to retain specified chiralities at most, generating 32 per ligand [39]. This procedure was repeated for the >2 million compounds from ChEMBL database (https://www.ebi.ac.uk/chembl/g/#browse/compounds) and 10 000 enumerated products.

Induced fit docking (IFD)

To predict the binding modes and nature of interaction in PfHsp90 upon binding different ligand, Induced Fit Docking (IFD) was implemented as described by [29]. This study implemented IFD for two main purposes: the first was to dock compound 7 and 10 to the ATP binding pocket of PfHsp90, to understand their affinity and mode of binding; and secondly, to re-score newly generated ligands following Auto QSAR prediction. Therefore, a grid box of the binding site for the prepared structure of PfHsp90-NTD was generated considering the co-crystallized ligand ADP as a centroid. This was followed by the removal of ADP from ATP binding region to provide more room for ligand docking and specifying Asn37, Arg98 and Phe124 as binding residues [33]. To refine the side chains of residues located within a 5Å distance from the ligand, the Prime refinement step was employed. After the initial docking of each ligand, up to 20 poses were selected for further refinement using the XP mode. The best pose of each complex was selected based on docking scores, and a visual inspection of the binding orientation. The visual inspection involved assessing the residues of the protein involved in binding and orientation of the ligand. Most importantly, assessing the presence of a hydrogen bond between PfHsp90 and the residue, Arg98, which confers selectivity.

Pathfinder reaction-based enumeration

Compound 10 exhibited a better fit into the ATP bin ding pocket of PfHsp90 following induced fit docking analysis. The possible routes to synthesize compound 10 were predicted by retrosynthesis analysis using Pathfinder [40]. Briefly, the 2D structure of compound 10 was minimized and the lowest energy conformer was determined using the macro model module [40]. The regiochemistry of bonds that can be disconnected were displayed with a maximum depth set to 1. Possible retrosynthesis pathways were estimated by employing Pathfinder, revealing nine pathways for the coupling reactions that are possible for the synthesis of compound 10. Pathways 1 and 2 were Amide_coupling-1 & 2, pathway 3 was amination-1, pathway 4- was Hayima-1, pathway 5 was Negishi, pathway 6 was oxadiazole-1, pathway 7- was Stille, and finally, pathways 8 and 9 were Suzuki-1 and-2 cross-coupling reactions. The enumeration reaction was employed by using pathway 6. The oxadiazole-1 pathway was chosen due to its synthetic accessibility and favourable reaction conditions, which could result in a higher success rate in generating diverse compound structures [41]. We surmised thatdesign idea compounds from pathway 6 woulddemonstrate chemical features that would have more affinity and more conducive to interaction with the PfHsp90 receptor. To do this, the reactants were defined, with reactant 1 containing the benzonitrile moiety was varied by enumerating nitrile fragments from the e-molecules database. The core/original reactant for reactant 2 was retained (Fig 2). Since, this core contains important functional groups for recognition of PfHsp90. Default physiochemical parameters such as a molecular weight (MW) between 150 and 575g/mol, LogP between −1.50 and 5.0, a topological polar surface area (TPSA) between 30 and 150, HBA between 0 and 12, and HBD between 0 and 5, and a maximum number of rotatable bonds less than 10 were retained for the design ideas [42]. Reactive functional groups using smiles arbitrary target specification (SMARTS) and Pan Assay Interfering Structures (PAINS) offenders were removed [43]. This enumeration round resulted in 10 000 design ideas that needed to be further tested for their binding affinities to PfHsp90.

Fig 2. Pathway 6 which is oxadiazole-1 coupling was chosen as our pathway of interest.

Fig 2

Reactant 1 showing the benzonitrile moiety, whilst reactant 2 depicting the core structure containing the pyridine-3-carboxylic acid moiety.

ChEMBL database and enumerated compound preparation

The chemical space was enriched by adding approximately 15 000 randomly selected compounds from the >2 million compounds from ChEMBL database (https://www.ebi.ac.uk/chembl/g/#browse/compounds). A total of 10 cycles of GlideSP docking was conducted in each round compounds from ChEMBL and the enumerated ideas were randomly selected and prepared for docking.

Classical glide SP ligand docking

Glide-based workflow have previously been used to screen many compounds against a target receptor quickly and accurately [38]. Glide energy terms offer SP (standard precision) for reliably docking ligands with high accuracy, or XP (extra precision) mode, which further eliminates false positives by extensive sampling and advanced scoring, resulting in high enrichment [38]. In this study, the two sets of ligands (1000 enumerated design ideas and 1000 ChEMBL database ligands for the initial GlideSP docking) were subjected to ligand docking studies to select ligands exhibiting favourable binding affinity (≥ -5 kcal/mol) towards PfHsp90. The ligands exhibiting favourable binding energies were then used to build the first active learning (AL) model.

Receptor grid generation

A receptor grid was generated based on ligand binding residues to specify the position and size of the receptors active site for glide SP ligand docking, utilizing the receptor grid generation tool in Maestro v12.9. ADP was decoupled from the receptor and positional constraints were defined as Asn37, Arg98 and Ile173 in PfHsp90 [25, 33].

Auto QSAR-AL models enriched by GlideSP docking

Auto quantitative structure activity relationship (QSAR) is a best practice protocol for generating models with limited user input and understanding. It also builds categorical or numerical models based on physicochemical, topological descriptors and binary fingerprints (i.e., radial, linear, dendritic, and 2-D molecular prints) [42], where a given model is trained against a particular random subset of input structures [44]. This study used the Auto QSAR to construct the AL models. The initial and second models were built by utilizing ligands exhibiting docking scores ≥ -5 kcal/mol. Subsequently, models 3 to 9 were constructed employing ligands characterized by docking scores of ≥-6.0 kcal/mol. Models 10 and 11 were developed specifically with ligands demonstrating docking scores ≥-6.5 kcal/mol. Lastly, models 12 and 13 were built using ligands possessing docking scores of ≥-7 kcal/mol. In each iterative cycle, the ligands that were chosen in prior rounds were added with newly selected ligands. These combined ligand sets were then partitioned into a training subset encompassing 75% of the data and a test subset comprising the remaining 25%. Internal validation of the model was assessed using prediction parameters such as predictive precision of the root mean square error (RMSE), standard deviation (SD), the accuracy of the training set (R2) and lastly the accuracy of the test set (Q2) in order to rank of all models [44]. The models were trained, tested, and re-trained using ligands from enumerated designs from pathway 6 and the ChEMBL dataset until the model reached convergence. To assess the performance of model 13 Mean Absolute Error (MAE) analysis was performed by measuring errors between paired observations (predicted activity and observed activity).

Molecular dynamics simulations

Select docking poses of PfHsp90-inhibitor complexes were subjected to Molecular Dynamics Simulations using the Desmond package on Maestro [45]. The complexes were solvated in transferable intermolecular potential with 3 points (TIP3P) water model and enclosed into orthorhombic boxes with minimized volumes. Ions were added to neutralize the charges using force field OPLS4. Then, the simulation was conducted as described by [46] with minor adjustments. In general, the simulation was allowed to proceed for 50–150 ns, 100 ps trajectories and 1000 frames. The NPT ensemble class was selected at a temperature of 300 K and a pressure of 1.01 bar.

Free binding energy calculations

The last trajectory from molecular dynamics simulation was subjected to molecular mechanics with generalized born surface area (MM-GBSA) to establish the free binding energies contributing to the protein-ligand interactions. In this study, employing the prime module in Maestro (v13.2) from the Schrödinger Suite 2022–1, the molecular mechanics with generalized born surface area (MM-GBSA) was performed. This was conducted to calculate free binding energies to determine the stability of the protein-ligand complexes from the docking conformations. MM-GBSA estimates the binding free energy of a ligand-receptor complex by combining molecular mechanics force fields, which describe the intermolecular interactions, with a solvation model based on the Generalized Born (GB) theory [47].

The calculations for the docked complexes were subjected to an OPLS4 force field, a VSGB solvation model and the sampling was minimized. The free binding energies were therefore calculated using the following equations:

ΔGbind=ΔE+ΔGsolv+ΔGSA (1)

ΔE denotes the difference in minimized complex energy and Σ energies of unbound receptor and ligand. ΔGsolv is the difference in GBSA solvation energy of receptor-ligand complex and Σ solvation energies of unbound receptor and ligand. ΔGSA represents the difference in surface area energy of IFD complex and Σ surface area energies of unbound receptor and ligand.

ΔE=Ecomplex-Eprotein-Eligand (2)

Ecomplex, Eprotein, and Eligand represent the minimized energies for the protein-inhibitor complex, the protein, and inhibitor, respectively.

ΔGsolv=ΔGsolv(complex)-ΔGsolv(protein)-ΔGsolv(ligand) (3)
ΔGSA=ΔGSA(complex)-ΔGSAprotein-ΔGSAligand (4)

ΔGSA is the nonpolar contribution to the solvent energy of the surface zone. GSA(complex), GSA(protein), and GSA(ligand) denote the surface energies of the protein-inhibitor complex, the protein, and the ligand, respectively.

In vitro methods

Reagents

Unless otherwise stated, all reagents were purchased from Thermo Fischer Scientific (USA), Sigma-Aldrich (USA) and Promega (Madison, USA).

In vitro anti-Plasmodium assay

In vitro anti-Plasmodial assays were conducted at the H3D testing centre at the University of Cape Town. All 15 commercially available compounds were evaluated for anti-Plasmodium activity using a parasite lactate dehydrogenase assay as a marker for parasite survival, as previously described by [48]. Briefly, the parasites were synchronized at the ring stage using d-sorbitol in water. Approximately 2 mg/mL stock solutions of reference drugs, chloroquine and artesunate, were prepared in water and DMSO respectively then stored in -20°C. Test compounds and reference drugs were serially diluted to give 10 concentrations with a final volume of 100 μL in each well. Parasites were incubated in the presence of the compounds at 37°C under hypoxic conditions (4% CO2 and 3% O2 in N2) for 72 h. The absorbance was measured at 620 nm on a microplate reader. Survival was plotted against concentration and the IC50-values were obtained using a non-linear dose-response curve fitting analysis via the Dotmatics software platform.

In vitro cytotoxicity

Selectivity of the compounds for the parasites was determined using two cytotoxicity assays both of which were conducted at the H3D Centre. Compounds were screened against the mammalian cells, Chinese Hamster Ovarian (CHO) and hepatocellular carcinoma (HepG2), using the 3-(4,5-dimethylthiazol-2-yl)-2,5-diphenyltetrazoliumbromide (MTT) assay [49]. Cells were plated to a density of 105 cells/well in 96-well plates and allowed to attach for 24h. After that compounds were added at various concentrations from 50μM to 16nM and the cells incubated for a further 48h. Emetine was used as the control. After that MTT was added and plates were read 4h later. ‘‘Survival” was plotted against concentration and the IC50-values were obtained using a non-linear dose-response curve fitting analysis via the Dotmatics software platform.

Expression and purification of PfHsp90-NTD and surface plasmon resonance

The NTD of PfHsp90 was expressed and purified as described in [29] and 20μg/ml of the protein was used to study the binding affinities of selected ligands using BioNavis Navi 420A ILVES multi-parametric surface plasmon resonance (MP-SPR) system (BioNavis, Finland) as previously described [50]. Briefly, PfHsp90-NTD was immobilized to a Carboxymethyl dextran coated sensor slide (BioNavis SPR102- CMD-3D). The sensor slide was activated with 0.1 M EDC/0.05 M NHS. At pH 5, PfHsp90-NTD suspended in 5 mM of sodium acetate was immobilized on one channel of the 3 CMD chip. 1 M ethanolamine HCl was used to deactivate the chip by removing excess NHS = EDC, and the sensor cleaned with NaCl/NaOH unspecific binding molecules. Varying concentrations (0-2000/5000 nM) of compounds FTN-T2 and FTN-T5 were injected and introduced to the flow cell, allowing binding to the surface. Data Viewer (BioNavis, Finland) and Trace Drawer software version 1.8 (Ridgeview instruments, Uppsala, Sweden) were used to process and analyze the steady-state equilibrium constant data and estimate the binding affinities.

Results

Induced fit docking (IFD) of reference compounds

Previous studies have demonstrated the benefit of rational strategies in designing selective inhibitors of PfHsp90 [26, 28, 29]. We used compounds 7 and 10 from Everson et al., 2021 which were shown to have potency towards P. falciparum drug-sensitive strain while exhibiting a safety profile towards mammalian cells. This study sought to generate novel inhibitors with unique scaffolds and preferential selectivity towards binding PfHsp90, a validated malaria drug target. The main challenge with selective targeting of PfHsp90 is the correspondingly high sequence and structural similarity of the ATP/druggable pocket of the chaperone versus that of human Hsp90. Compounds concomitantly inhibiting PfHsp90 and human Hsp90 would likely result in unintended toxicity.

The IFD results showed compound 7 had a slightly lower docking score of -8.031 kcal/mol (data not shown), making fewer contacts with the PfHsp90 binding pocket. Only two interactions were observed, a hydrogen bond with Phe124 and a water-mediated hydrogen bond with Lys44 (Fig 3). Compound 10, conversely, had a better fitting into the ATP binding pocket of PfHsp90, evidenced by a docking score of -10.485 kcal/mol (data not shown). Compound 10 made several hydrogen water-mediated bond interactions with amino acid residues such as Lys44 and Arg98, respectively and pi-pi interactions with Phe124 and Trp148 (Fig 3). It was surmised that the interaction with Arg98 could be the basis for the selectivity of the compound as has been previously described for other selective inhibitors of PfHsp90 [18, 20].

Fig 3. Details the ligand interaction diagrams for compound 10 and 7 in the PfHsp90 N-terminal domain ATP binding site.

Fig 3

A. Compound 10 making pi stacking (shown by green lines) interactions with Phe124 and Trp148, a water mediated hydrogen bond with Arg98 and lastly a direct hydrogen bond with Lys44. B. Compound 7 making a water mediated hydrogen bond with Lys44 and a direct hydrogen bond with Phe124. Positively charged amino acids indicated in blue, negatively charged amino acids in orange, polar in light blue, non-polar in green, and hydrogen bond interactions in purple.

The PfHsp90-compound 10 complex was examined for functional groups forming important interactions. It was seen that the phenyl ring of the 2-phenyl-1,3,4-oxadiazole moiety did not participate in hydrogen bonding or π-πinteraction. Therefore, the phenyl ring was disconnected from the 2-phenyl-1,3,4-oxadiazole moiety (Fig 2), giving the synthetic precursors such as benzonitrile (reactant 1) and the core-containing pyridine-3-carboxylic acid (reactant 2). Here, the intention was to generate compounds with better affinity for PfHsp90 than compound 10 without altering the core structure.

Pathfinder analysis and reaction-based enumeration of compound 10

To generate analogues of compound 10, incorporating unique fragment and with probably more affinity for PfHsp90, we used Pathfinder reaction-based enumeration to create an extensive library of synthetically tractable compounds in silico as was conducted by Konze and colleagues (2019). Approximately 10, 000 design ideas of enumerated products incorporating unique drug-like fragments from commercial databases in place of reactant 2 of compound 10 were generated.

AutoQSAR models

Table 1 details the 13 models built using the AL approach including their statistical parameters. In each round of model building, parameters such as the values of the ranking score of the model (score) being the standard deviation of the model (SD), the training set accuracy (R2), root-mean square error of the test set predictions (RMSE), and test set accuracy (Q2) were used to select the best performing model. A value of 1 in R2 and Q2 indicate a perfect prediction and a value of 0 for SD and RMSE indicate accuracy. Table 1 indicates the parameters for evaluating each model‘s performance (overall score, R2, Q2, RMSE and SD) and the total number of compounds in the training/test used to build the model. For example, the first model was built by randomly selecting 2000 compounds (1000 enumerated design ideas and 1000 from ChEMBL) aswell as compound 7 and 10. Ligands with docking scores ranging from -5.0 kcal/mol to -7.6 kcal/mol were retrieved yielding 123 ligands that satisfy the criteria (≥ -5.0 kcal/mol), including the two reference compounds. These high-affinity compounds were then employed to train Auto QSAR models, utilizing an AL approach. AutoQSAR divided the selected ligands into the training and test sets, with a random training set at 75% (92 compounds) and test set at 25% (31 compounds), the protocol was repeated for 13 models. MAE results of our model ranges from 1 to -1, giving an indication that the predictions are quite accurate as the closer MAE is to 0, the more accurate the model is.

Table 1. Statistical data for all 13 AutoQSAR prediction models.

Model Model code Score S.D. R2 RMSE Q2 Training set Test set
1 Kpls_linear_36 0.75 0.37 0.77 0.36 0.67 92 31
2 Kpls_molprint2D_31 0.64 0.45 0.63 0.42 0.54 141 47
3 Kpls_radial_34 0.65 0.46 0.68 0.43 0.59 161 54
4 Kpls_molprint2D_42 0.58 0.44 0.68 0.47 0.53 170 57
5 Kpls_radial_37 0.67 0.46 0.67 0.44 0.60 190 64
6 Kpls_desc_37 0.60 0.48 0.59 0.45 0.59 210 71
7 Kpls_desc_44 0.54 0.47 0.59 0.48 0.53 231 78
8 Kpls_molprint2D_27 0.59 0.52 0.58 0.49 0.60 262 88
9 Kpls_desc_33 0.59 0.45 0.71 0.50 0.60 290 97
10 Kpls_radial_33 0.63 0.46 0.71 0.49 0.62 303 101
11 Kpls_radial_36 0.58 0.52 0.63 0.53 0.57 317 106
12 Kpls_radial_42 0.60 0.55 0.64 0.56 0.58 323 108
13 Kpls_desc_23 0.56 0.48 0.75 0.56 0.62 342 114

It can be noted on Table 1 that all the models were generated using kernel-based partial least square regression (KPLS) differing in terms of binary fingerprints. Model 1 used linear, models 2,4 and 8 used molprint2D, models 3, 5, 10–12 used radial and the remaining models used KPLS descriptions binary fingerprints (Table 1). It is possible that the difference in the binary fingerprints of the top scoring models was caused by the inclusion criteria implemented as compounds were randomly used to train the model. The compounds were chemically and structurally diverse. Models 2 to model 8 lost some correlations (represented by overall score). This can be explained by the random variation or statistical fluctuations [51]. However, the observed decrease in correlation does not necessarily indicate a significant decline in model performance but rather reflect the inherent variability in the data.

Fig 4A represents that scatter plot of model 1 built with KPLS_linear_36 (Table 1), which is a QSAR model generated by KPLS with linear fingerprints, had an overall score of 0,75, SD of 0,37, R2 of 0,77, RMSE of 0,36 and Q2 of 0,67. In general, model 1 had good predictive activity for compounds with -5 to -7 kcal/mol suggesting that for the trend line to be more linear towards the more active compounds (activity score -8 to -10 kcal/mol), more compounds need to be included in the high activity range. It is can also be noted that model1 was trained with a set of compounds with similar chemical characteristics as they are clustered around the same region.

Fig 4. Auto QSAR-AL scatter plots.

Fig 4

A: model 1 showing the performance of the QSAR KPLS model’s predicting activity for experimental binding affinity for the test set. Model trained 123 ligands with docking scores ranging from -5.0 kcal/mol to -7.6 kcal/mol. B: Model 13, where the model has reached convergance. AutoQSAR randomly divided the selected ligands into the training at 75% (Blue dots) and test sets at 25% (Red dots).

The protocol used for building model 1 was repeated to train the second model, with inclusion of top scoring compounds, 1000 enumerated design ideas and 1000 ChEMBL database compounds for model training. The same procedure was repeated until the model reached convergence, thus improving the quality of the data by coupling high affinities PfHsp90 with activities of the compounds. As the procedure was repeated, a growing number of compounds were added to the high activity region (between -8 kcal/mol and -10 kcal/mol in model 13) (Fig 4B). Some noise displayed the Auto-QSAR plot in model 2, therefore the activity was adjusted to -6 kcal/mol for training models 3 to 9 (S4-S6 Figs in S1 File).To further avoid noise in the plots, models 10 and 11 (S6 Fig in S1 File) were developed specifically with ligands demonstrating docking scores ≥6.5 kcal/mol, lastly, models 12 and 13 were trained using ligands possessing docking scores of ≥-7 kcal/mol. Notably, the trend line was linear towards the observed high activity region. Model 13 (Fig 4B), which is where the model attained its convergence, denotes that the model was eventually able to recognize compounds it has not initially selected (S2-S6 Figs in S1 File).

Induced fit docking

The AL model enhanced by classical GlideSP ligand docking yielded 236 best docked compounds, with the ability to bind to the receptor PfHsp90-NTD at the ATP binding site. The ligands exhibited docking scores of -10 kcal/mol to -6 kcal/mol, from a total of 43097 docked poses. The 236 compounds were then subjected to IFD against receptor PfHsp90. Rescoring by extra precision (XP), IFD resulted in 113 top-scoring compounds which were selected basedon the docking score, e-model score, IFD score and a visual inspection of the binding mode (data not shown). It was interesting to note that, even though AutoQSAR models were only trained on compounds with docking scores between -5 and -10 kcal/mol, IFD, a more robust scoring function, was able to calculate compounds with docking scores between > -5 kcal/mol and > -13 kcal/mol. This is because IFD tends to use a more extensive sampling of ligand conformations and protein-ligand interactions compared to GlideSP resulting in increased conformational sampling and improved accuracy of predicting ligand binding modes and binding affinities. Lipinski‘s rule of 5 was used to filter that compounds further to 62 compounds (docking score of -10 to -15 kcal/mol, see S2 Table in S1 File). However, most of the compounds were not commercially available and a total of 15 compounds were purchased (see S2 Table in S1 File). The IFD and MM/GBSA results of the top-scoring compounds are presented in Table 2 and the data for the reminder of the compounds can found in the S2 Table in S1 File.

Table 2. Induced fit docking and MMGBSA results of the top-scoring compounds.

Compound
ID
Docking Glide emodel IFDScore ΔGBind ΔGBind
Coulomb
ΔGBind
Covalent
ΔGBind
Hbond
GBind
Lipo
ΔGBind
Packing
ΔGBind
Solv GB
ΔGBind vdW
FTN-T1 -13.49 -83.95 -516.39 -67.31 -40.70 2.48 -6.28 -6.80 -1.67 25.98 -40.31
FTN-T2 -13.10 -66.56 -517.56 -58.66 -40.74 2.88 -6.32 -5.05 -1.39 27.27 -35.31
FTN-T3 -10.457 -101.9 -523.50 -47.95 -37.40 -5.47 -6.80 -5.59 -1.54 33.26 -35.31
FTN-T4 -11.81 -89.54 -521.42 -41.60 -35.74 5.60 -5.37 -5.99 -1.40 25.52 -24.21
FTN-T6 -12.05 -63.46 -519.19 -35.61 -25.42 3.75 -3.73 -6.63 -1.63 29.80 -31.76
FTN-T5 -11.75 -70.28 -514.41 -34.72 -12.96 1.94 -3.33 -8.04 -1.44 19.60 -30.48
FTN-T9 -10.86 -60.37 -518.39 -25.08 -31.65 6.60 -3.25 -8.47 -3.02 38.98 -24.27
Comp 10 -10.49 -76.16 -532.21 -28.47 -16.36 4.24 -3.67 -7.57 -4.04 33.01 -34.08
Harmine -8.29 -41.86 -517.44 -27.52 -3.23 1.65 -1.65 -11.2 -2.70 18.39 -28.76

Most top scoring study compounds had a higher affinity (docking score range: -10.86 to -13.49kcal/mol and ΔGBind range: -25.08 to -67.31kcal/mol) for PfHsp90 compared to the reference compound 10 (Comp 10) exhibiting docking and ΔGBind scores of -10.49kcal/mol and -28.47kcal/mol, respectively. Interestingly, the study compounds also exhibited higher affinities compared to harmine (Table 2), an established selective inhibitor of PfHsp90. It was of interest to this study to obtain compounds which binds preferentially or selectively towards PfHsp90 over human Hsp90. Therefore, human Hsp90 was also as a receptor for IFD analysis using all 62 study compounds. While high docking scores were obtained for human Hsp90 (S3 Table in S1 File range: -8.06 to -12.1kcal/mol), the binding affinities of the compounds were higher for PfHsp90 (Table 2; S2 Table in S1 File). As previously mentioned, interactions with Arg98 contribute to selectivity. When comparing the 2D interaction diagrams of PfHsp90 (S7 Fig in S1 File) with HsHsp90 (S8 Fig in S1 File), it is interesting to note that despite the disparities in the docking scores, the study compounds do not display any interaction with the selectivity conferring residue Arg98, except for FTN-T3. This may suggest that other interactions play a role in the higher docking scores observed in PfHsp90.

Visual inspection of the 2D interaction diagrams of the top-scoring compounds revealed that these are well accommodated within the ATP binding site of the PfHsp90 (Fig 5; S7 Fig in S1 File). All the compounds represented in Fig 5 display water mediated-/hydrogen bond interactions with ATP binding residues such as Asn37, Asp79, Gly83, Asn92, Lys44, Phe124, and Ala38 [33]. The binding of the compounds with these residues suggest that they will likely compete with ATP for binding PfHsp90. Interestingly, only compound FTN-T3 seems to be interacting with the selectivity conferring Arg98 via water-mediated hydrogen bond (Fig 5). Harmine, a proven inhibitor of PfHsp90, can be observed to interact with Asn37 through a water mediated hydrogen bond, as well as a salt bridge and hydrogen bond with Asp79 (Fig 5). We noted that most of the top-scoring compounds make contacts with previously described ATP binding residues, they do not interact with Arg98, which has been described to contribute to selectivity [20].

Fig 5. 3D representations of the compounds FTN-T1, FTN-T2, FTN-T3, FTN-T4, FTN-T5, FTN-T6, FTN-T9 and Harmine in the PfHsp90-NTD binding pocket.

Fig 5

The 3D structure of PfHsp90 is rendered in green ribbons, with residues found at the ATP binding pocket shown in red sticks. Residues Ala38, Arg98 and Ile173 which are unique to PfHsp90 represented by blue sticks. Water molecules and hydrogen bonds represented in red and yellow, respectively.

The Auto QSAR model generated compounds which were analogous to each other, compounds FTN-T1, FTN-T2, FTN-T3, FTN-T4, FTN-T6 and FTN-T9 (Table 2; Fig 5) contain a 7H-Purine scaffold, like ADP/ATP. Purine containing compounds, such as ATP and its analogs, are known to have a high affinity to the ATP-binding pocket Hsp90. This is because the purine ring system allows for the formation of multiple hydrogen bonds and hydrophobic interactions with the binding pocket residues, resulting in a higher affinity towards the protein. Compound FTN-T5 contains a 1-methylpyrimidin-2(1H)-one moiety, which likely played a role in its overall lower binding affinity towards PfHsp90. Compound FTN-T5 seems to be interacting to PfHsp90 through mainly hydrophobic interactions. It is possible that its overall lower docking can be attributed to the bad water contacts (Fig 5; S7 Fig in S1 File). It should be noted that the main drawback of docking is that it does not account for the effects of waters in binding affinity estimations and ligand strain energy caused by the binding event.

Molecular dynamic simulations

The dynamic behaviour and conformational changes induced on PfHsp90 by top-scoring compounds was evaluated by molecular dynamics simulations. Fig 6(A)–6(C) show the Root Mean Square Deviation (RMSD) and Root mean fluctuations (RMSF) of the PfHsp90 and five top scoring compounds and a known inhibitor harmine. RMSDs are used to assess the average displacement between a group of atoms in a specific frame relative to a reference frame. The protein RMSD plots (Fig 6A) offers insights into the structural conformation of the PfHsp90 protein throughout the simulations. By aligning all frames on the reference backbone and calculating the RMSD based on atom selection, the stability of the protein’s and the ligand structures can be effectively monitored [52]. A well-equilibrated simulation is characterized by RMSD fluctuations around 1–3 Å range for small, globular proteins. In our study, PfHsp90 remains stable in its interactions with all tested ligands, exhibiting an average RMSD of 2 Å for all complexes (Fig 6A). More considerable RMSD changes would indicate significant conformational fluctuations, with RMSD values fixed around 1.7 Å, it is deduced that PfHsp90 was greatly stable.

Fig 6. Molecular dynamics simulation results for compounds FTN-T1, FTN-T2, FTN-T3, FTN-T4, FTN-T5, FTN-T9 and Harmine.

Fig 6

A: Ligand RMSD. B: Protein RMSD plots. C: RMSF plots. D-I: 2D representation of the interaction diagrams for contacts made by the compounds FTN-T1, FTN-T2, FTN-T3, FTN-T4, FTN-T5 and Harmine, in the binding pocket of PfHsp90-NTD. Reference compound represented as COMP 10.

The ligand RMSD plots for compounds FTN-T1, FTN-T3, and FTN-T4 exhibited remarkable stability throughout the 100ns simulation, converging at average RMSD values of 1.8Å (Fig 6B). The comp10, FTN-T2, FTN-T5, FTN-T9 and harmine complexes, on the other hand, displayed signs of binding instability as the ligand RMSD values fluctuated throughout the simulation trajectory (Fig 6B), with compounds FTN-T2 and FTN-T5 generally equilibrating at average RMSD of 4.65 Å and 6.6 Å, respectively. The most significant deviations were seen for comp10, harmine and FTN-T9, however, displaying average RMSDs of 7 Å.

The RMSF plots (Fig 6C) for all simulated protein-ligand complexes provided valuable insights into the dynamic behaviour of the protein residues throughout the simulations. We observed prominent RMSF fluctuations in the residue index window spanning positions 20 to 30, 50 to 70 and 100 to 125. These regions mainly correspond to loops in PfHsp90 indicating that the loops are less structured and more flexible, leading to pronounced fluctuations during the simulation.

The 2D ligand-protein contacts diagram revealed that most of the top scoring compounds bound in the ATP binding pocket of PfHsp90. FTN-T4 displays the most stability, remaining bound throughout the 100ns simulation time. The exceptional stability of compound FTN-T1 and FTN-T4 is most likely explained by the direct strong hydrogen bond with the NH2 group with Asp79 remaining stable for 99% and 92% of the simulation window, respectively. Phe124 maintained hydrogen bonds with the OH groups of FTN-T1 and FTN-T4, which remained stable for 84% and 60% of the simulation time, respectively. It should also be noted that the simulation interaction diagram (SID) from MDS (Fig 7) managed to capture some important hydrogen bond interaction network involving water. Compound FTN-T1 makes a direct contact with Arg98. Meanwhile compounds FTN-T3, FTN-T4 and FTN-T5 form water-mediated hydrogen bonds. Similar to harmine which interacts with Arg98 via three Pi-cation interaction, compound FTN-T2 similarly interacts with Arg98. The data from the 2D interaction diagrams of the study compounds and harmine inspired the prediction that these compounds would likely be selective. Even though MDS does’nt measure the effects of favourable and unfavourable waters as a docking post-processing approach. Methods such as MM-GB/SA [47] and more recently Water Map [53, 54], IFD-MD [55] prior to FEP+ were introduced to account or this challenge. In future studies we will further explore the great utility of the later approaches in our lead optimization efforts.

Fig 7. SPR sensorgrams for compounds FTN-T2, FTN-T5, harmine and GDA and Kinetics constants measured by SPR for the interaction between tested compounds and immobilized PfHsp90 (E).

Fig 7

The association rate constant is represented by Ka (1/Ms), dissociation rate constant Kd (1/s), and the equilibrium constant denoting affinity KD (μM).

Anti-Plasmodium and cytotoxicity of promising compounds

The 15 compounds were tested in vitro for anti-plasmodial activity and profiled for cytotoxicity against the CHO and HepG2 cell lines. Most of the compounds were inactive (PfNF54-IC50 ≥ 6 μM) (data not shown) and at least four compounds showed moderate activity and reasonable selectivity indeces of ≥9, (Table 3), suggesting a good safety profile except for compound FTN_T2. Compound FTN-T5 showed promising anti-plasmodial activity (PfNF54-IC50 < 1.5 μM) and low cytotoxicity was observed in CHO and HepG2 cell lines as displayed by good selectivity margins (average SI ≥ 30) (Table 3). The anti-Plasmodium obtained for all the study compounds were lower than harmine (Table 3) and the known inhibitor geldanamycin (IC50 = 0.02μM; (11)). However, compound FTN-T5 displayed activity comparable to the reference compound 10 (Table 3).

Table 3. Table displaying IC50 values in μM of promising compound activity against P. falciparum cells, as well as cytotoxicity towards human cells and respective selectivity indices.

In vitro IC50 (μM)
COMPOUND ID PfNF54/ Pf3D7 Pf3D7Pf3D7 Mammalian cells (SI) HepG2(SI) Study
FTN-T2 4.49 ± 1.51 >50 (>8) 1.50 (0) This study
FTN-T3 5.42 ± 0.58 >50 (>9) >50 (>9) This study
FTN-T4 5.01 ± 0.99 >50 (>10) 27.71(6) This study
FTN-T5 1.44 ± 0.50 >50 (>35) 42.29(29) This study
FTN-T9 4.86 ± 1.14 >50 (>10) >50 (>10) This study
Harmine 0.05 ± 0.00 ND ND Shahinas et al., 2010
Compound 10 1.11 ± 0.97 >24 (>21) >24 (>21) Everson et al., 2021

*Mammalian cells refer to either Chinese hamster ovarian cells (CHO) or human fibroblast cell line.

Binding affinities of promising compounds

Surface plasmon resonance is a technique that reveals information regarding binding affinities, and the kinetics parameters representing the interaction of a protein and its ligands. Fig 7 displays the SPR sensorgram for the interactions between compounds FTN-T2 and FTN-T5 and PfHsp90-NTD (Fig 7A and 7B) aswell as harmine (Fig 7C) and Geldanamycin (Fig 7D). The sensorgrams exhibit a dose-dependent response of these compounds when interacting with the immobilized PfHsp90-NTD. FTN-T2 displays a modest binding affinity (KD = 7 μM; Fig 7E) comparable to GDA but weaker than harmine. FTN-T5, on the other hand, displays a KD of 19 μM, which falls within the same order of magnitude as the reported affinity of ADP [19] for PfHsp90-NTD. Altogether, FTN-T2 shows promising affinity for PfHsp90-NTD while the affinity of FTN-T5 show somewhat weak affinity.

Structure-activity relationship analysis

As mentioned previously, compound 10 was utilized to generate analogues using Pathfinder reaction-based enumeration and AutoQSAR. The identified top scoring compounds were subjected to various in vitro and in silico experiments to determine their respective activities. According to in silico data, the generated compounds display improved binding energies, compared to compound 10, when bound to PfHsp90. The compounds FTN-T1, FTN-T2, FTN-T3, FTN-T4, FTN-T8 and FTN-T9 contain purine moieties, while FTN-T5 consists of a pyrimidine. Purine-based compounds possess the ability to form multiple hydrogen bonds as well as hydrophobic interactions that result in higher bonding affinity. Purine groups containing halogens exhibit the highest docking scores, as observed from the docking scores of FTN-T1 (-13.49 kcal/mol) and FTN-T2 (-13.10 kcal/mol). However, FTN-T1 (consisting of a chloride bonded to the carbon at the 2-position of the purine) seems to be stable in the binding pocket of PfHsp90, as evidence by ligand RMSD values that equilibrate around 1.8Å (Fig 6B), as opposed to FTN-T2 (consisting of a bromide bonded to the carbon at the 8-position of the purine) at around 4.65Å after 100ns. Compared to FTN-T1, FTN-T2 and FTN-T3, which consist of a purine moiety, FTN-T5 consists of a pyrimidine.

It is worth noting that despite the lower docking score of FTN-T5, it displayed the highest antiplasmodial activity among the purchased compounds (Table 3; IC50 = 1.44 μM), followed by FTN-T2 (Table 3; IC50 = 4.49 μM). On the other hand, corroborating the docking results, the purine moiety containing FTN-T2 (Fig 7; KD = 7.11 μM) displays a higher binding affinity for PfHsp90, compared to the pyrimidine containing FTN-T5 (Fig 7; KD = 19 μM), as observed from SPR results.

Discussions

Malaria remains a global public health concern, necessitating the discovery of novel therapeutics to combat drug-resistant strains [56]. The PfHsp90 protein, a molecular chaperone, is critical in the parasite’s survival and virulence [7, 8]. Therefore, by targeting PfHsp90, we set out to generate starting points for effective antimalarial drugs, likely to circumvent drug resistance. In this study, we employed an innovative approach, Auto-QSAR-based Active Learning Docking, to generate potential inhibitors of PfHsp90 as promising anti-Plasmodium agents. Reaction-based enumeration was implemented to convert compound 10 to generate 10 000 design ideas. While ultra-large screening of large libraries of compounds has previously been achieved [57], the number of ligands produced would have been computationally expensive for our resources. We, therefore, devised an AL enhance by docking protocol to overcome the resource limitation while transversing a large chemical space in a relatively short time. The protocol was employed to enrich the data set before subjecting them to IFD. A is a classification under supervised machine learning techniques that develops highly accurate models, effective for exploring chemical space with docking and deep learning as a substitute for complex or expensive implement scoring functions [58]. The previous studies of [43, 57] have applied active learning in the drug discovery field and were able to demonstrate that ligand-based QSAR models are capable of “learning” a docking score over a particular domain, in applications including molecular docking and free energy calculations, while significantly lowering the computational expenses of screening an extensive library.

The capabilities of the AL approach coupled with docking to screen an extensive database were demonstrated by the 13 rounds of screening the design ideas with Glide SP docking and predicting their docking scores using various AutoQSAR models prior to docking compounds. The active learning model was able to “learn” how to predict the activity of highly active compounds against PfHsp90, from the initial point where the model could only predict with the highest level of accuracy compounds in the ranges of -5 kcal/mol to -7 kcal/mol as the model was only trained in this set of compounds and having similar chemical characteristics. With the addition of more compounds with different chemical properties to the model, more compounds can be seen in the high activity region ranging between -8 kcal/mol and -10 kcal/mol (S2 to S6 Figs in S1 File), suggesting that the model was able to learn how to predict the activity of highly active compounds. The active learning model yielded 236 ligands promoted to the following filtering point: IFD and Molecular dynamics simulations. The strong complementarity affinities of these compounds for PfHsp90 could be explained by due to their resemblance of its natural ligand. Thus, it is most likely it is proposed that these ligands might outcompete ADP/ATP given their high docking scores.

The 15 purchased compounds were evaluated for anti-Plasmodium activity. The findings highlight the promise of FTN-T5 as a hit compound for further optimization, given its promising activity against the PfNF54 strain of P falciparum. FTN-T5 manifested low cytotoxicity towards CHO and HepG2 cell lines. Biophysical investigations revealed that FTN-T5 binding to PfHsp90-NTD with weak affinity. The binding affinity data was not surprising as MDS predicted high fluctuations in the PfHsp90-FTN-T5 complex, suggesting a degree of non-specific binding. While it is possible that the weak affinity of FTN-T5 could be explained by pan-inhibition of other Hsp90 isoform due to the similarity of the ATP binding pocket architecture, the observed discrepancy between the anti-Plasmodium activity and binding affinity of FTN-T5 warrants further investigations. We suggest methods such as pull down assays as FTN-T5 displayed low cytotoxity, raising the prospect that it does not bind to human Hsp90.

FTN-T5, a pyrimidine-based compound, displaying fewer interactions with the ATP binding pocket residues. It appears that pyrimidine-based compounds may have reduced affinity for Hsp90 compared to purine-based compounds. Compound FTN-T2 seems to bind to PfHsp90 with modest affinity, comparable to GDA. FTN-T2 contain a purine moiety, exhibit strong interactions with the ATP-binding pocket of PfHsp90. The purine ring system facilitates the formation of multiple hydrogen bonds and hydrophobic interactions with key binding pocket residues, leading to higher affinity for the protein.

Overall, this study identity compound FTN-T5 and FTN-T2 as promising starting point for future multiparameter optimization to improve their binding affinity and potency. Given that PfHsp90 is a promising drug target in malaria, we believe that the study has contributed more compounds to be explored for optimization efforts. As the search for effective anti-malarial agents continues, these findings advocate for the iterative refinement of FTN-T2 and FTN-T5, using its favorable attributes while addressing some of their weakness.

Limitations of the study

The MMGBSA and and induced fit docking which consider target site to be flexible add too much variability for the used machine learning model.

Supporting information

S1 File

(DOCX)

pone.0308969.s001.docx (4.5MB, docx)

Acknowledgments

We thank the Centre for high performance (CHPC), CSIR, South Africa for providing computing facilities. We are grateful to the holistic drug discovery and development (H3D) testing centre based that the University of Cape Town for screening our compounds against P. falciparum and perfoming cytotoxicity assays.

Data Availability

All relevant data are within the manuscript and its Supporting information files.

Funding Statement

FM was awarded the Grand challenges Africa drug discovery seed grant (GCA/Round10/DD-065), funded by the Bill and Melinda Gates foundation is hereby acknowledged. "The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript"".

References

  • 1.Organization WH. World malaria World malaria report report. 2023.
  • 2.Blasco B, Leroy D, Fidock DA. Antimalarial drug resistance: linking Plasmodium falciparum parasite biology to the clinic. Nat Med [Internet]. 2017;23(8):917–28. Available from: doi: 10.1038/nm.4381 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Cowell AN, Winzeler EA. The genomic architecture of antimalarial drug resistance. Brief Funct Genomics. 2019. Sep;18(5):314–28. doi: 10.1093/bfgp/elz008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Uwimana A, Legrand E, Stokes BH, Ndikumana J-LM, Warsame M, Umulisa N, et al. Emergence and clonal expansion of in vitro artemisinin-resistant Plasmodium falciparum kelch13 R561H mutant parasites in Rwanda. Nat Med [Internet]. 2020;26(10):1602–8. Available from: doi: 10.1038/s41591-020-1005-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Ashley EA, Dhorda M, Fairhurst RM, Amaratunga C, Lim P, Suon S, et al. Spread of artemisinin resistance in Plasmodium falciparum malaria. N Engl J Med. 2014. Jul;371(5):411–23. doi: 10.1056/NEJMoa1314981 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Corey VC, Lukens AK, Istvan ES, Lee MCS, Franco V, Magistrado P, et al. A broad analysis of resistance development in the malaria parasite. Nat Commun [Internet]. 2016;7(1):11901. Available from: doi: 10.1038/ncomms11901 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Shonhai A. Plasmodial heat shock proteins: targets for chemotherapy. FEMS Immunol Med Microbiol. 2010. Feb;58(1):61–74. doi: 10.1111/j.1574-695X.2009.00639.x [DOI] [PubMed] [Google Scholar]
  • 8.Acharya P, Kumar R, Tatu U. Chaperoning a cellular upheaval in malaria: heat shock proteins in Plasmodium falciparum. Mol Biochem Parasitol. 2007. Jun;153(2):85–94. doi: 10.1016/j.molbiopara.2007.01.009 [DOI] [PubMed] [Google Scholar]
  • 9.Zhang M, Wang C, Otto TD, Oberstaller J, Liao X, Adapa SR, et al. Uncovering the essential genes of the human malaria parasite Plasmodium falciparum by saturation mutagenesis. Science. 2018. May;360(6388). doi: 10.1126/science.aap7847 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Su X, Wellems TE. Sequence, transcript characterization and polymorphisms of a Plasmodium falciparum gene belonging to the heat-shock protein (HSP) 90 family. Gene [Internet]. 1994;151(1):225–30. Available from: https://www.sciencedirect.com/science/article/pii/0378111994906610 [DOI] [PubMed] [Google Scholar]
  • 11.Kumar R, Musiyenko A, Barik S. The heat shock protein 90 of Plasmodium falciparum and antimalarial activity of its inhibitor, geldanamycin. Malar J. 2003. Sep;2:30. doi: 10.1186/1475-2875-2-30 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Banumathy G, Singh V, Pavithra SR, Tatu U. Heat Shock Protein 90 Function Is Essential for Plasmodium falciparum Growth in Human Erythrocytes*. J Biol Chem [Internet]. 2003;278(20):18336–45. Available from: https://www.sciencedirect.com/science/article/pii/S0021925819549353 doi: 10.1074/jbc.M211309200 [DOI] [PubMed] [Google Scholar]
  • 13.Posfai D, Eubanks AL, Keim AI, Lu K-Y, Wang GZ, Hughes PF, et al. Identification of Hsp90 Inhibitors with Anti-Plasmodium Activity. Antimicrob Agents Chemother. 2018. Apr;62(4). doi: 10.1128/AAC.01799-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Pallavi R, Acharya P, Chandran S, Daily JP, Tatu U. Chaperone expression profiles correlate with distinct physiological states of Plasmodium falciparum in malaria patients. Malar J [Internet]. 2010;9(1):236. Available from: doi: 10.1186/1475-2875-9-236 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Jackson SE. Hsp90: structure and function. Top Curr Chem. 2013;328:155–240. doi: 10.1007/128_2012_356 [DOI] [PubMed] [Google Scholar]
  • 16.Sreedhar AS, Kalmár E, Csermely P, Shen Y-F. Hsp90 isoforms: functions, expression and clinical importance. FEBS Lett. 2004. Mar;562(1–3):11–5. doi: 10.1016/s0014-5793(04)00229-7 [DOI] [PubMed] [Google Scholar]
  • 17.Pallavi R, Roy R, Reddy R, Venketesh S, Kumar R, Gupta AK, et al. Heat Shock Protein 90 as a Drug Target against Protozoan Infections BIOCHEMICAL CHARACTERIZATION OF HSP90 FROM PLASMODIUM FALCIPARUM AND TRYPANOSOMA EVANSI AND EVALUATION OF ITS INHIBITOR AS A CANDIDATE DRUG *. J Biol Chem. 2010;285(49):37964–75. doi: 10.1074/jbc.M110.155317 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Shahinas D, Folefoc A, Taldone T, Chiosis G, Crandall I, Pillai DR. A purine analog synergizes with chloroquine (CQ) by targeting Plasmodium falciparum Hsp90 (PfHsp90). PLoS One. 2013;8(9):e75446. doi: 10.1371/journal.pone.0075446 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Murillo-Solano C, Dong C, Sanchez CG, Pizarro JC. Identification and characterization of the antiplasmodial activity of Hsp90 inhibitors. Malar J [Internet]. 2017;16(1):292. Available from: doi: 10.1186/s12936-017-1940-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Shahinas D, Macmullin G, Benedict C, Crandall I, Pillai DR. Harmine is a potent antimalarial targeting Hsp90 and synergizes with chloroquine and artemisinin. Antimicrob Agents Chemother. 2012. Aug;56(8):4207–13. doi: 10.1128/AAC.00328-12 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Shonhai A, Maier AG, Przyborski JM, Blatch GL. Intracellular Protozoan Parasites of Humans: The Role of Molecular Chaperones in Development and Pathogenesis. Science (80-). 2011;143–57. doi: 10.2174/092986611794475002 [DOI] [PubMed] [Google Scholar]
  • 22.Eagon S, Hammill JT, Bach J, Everson N, Sisley TA, Walls MJ, et al. Antimalarial activity of tetrahydro-β-carbolines targeting the ATP binding pocket of the Plasmodium falciparum heat shock 90 protein. Bioorganic & Med Chem Lett [Internet]. 2020;30(21):127502. Available from: http://europepmc.org/abstract/MED/32822760 [DOI] [PubMed] [Google Scholar]
  • 23.Eagon S, Anderson MO. Microwave-Assisted Synthesis of Tetrahydro-β-carbolines and β-Carbolines. European J Org Chem [Internet]. 2014. Mar 1;2014(8):1653–65. Available from: doi: 10.1002/ejoc.201301580 [DOI] [Google Scholar]
  • 24.Bayih AG, Folefoc A, Mohon AN, Eagon S, Anderson M, Pillai DR. In vitro and in vivo anti-malarial activity of novel harmine-analog heat shock protein 90 inhibitors: a possible partner for artemisinin. Malar J. 2016. Dec;15(1):579. doi: 10.1186/s12936-016-1625-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Wang T, Bisson WH, Mäser P, Scapozza L, Picard D. Differences in conformational dynamics between Plasmodium falciparum and human Hsp90 orthologues enable the structure-based discovery of pathogen-selective inhibitors. J Med Chem. 2014. Mar;57(6):2524–35. doi: 10.1021/jm401801t [DOI] [PubMed] [Google Scholar]
  • 26.Wang T, Mäser P, Picard D. Inhibition of Plasmodium falciparum Hsp90 Contributes to the Antimalarial Activities of Aminoalcohol-carbazoles. J Med Chem [Internet]. 2016. Jul 14;59(13):6344–52. Available from: doi: 10.1021/acs.jmedchem.6b00591 [DOI] [PubMed] [Google Scholar]
  • 27.Daniyan MO, Ojo OT. In silico identification and evaluation of potential interaction of Azadirachta indica phytochemicals with Plasmodium falciparum heat shock protein 90. J Mol Graph Model. 2019. Mar;87:144–64. doi: 10.1016/j.jmgm.2018.11.017 [DOI] [PubMed] [Google Scholar]
  • 28.Everson N, Bach J, Hammill JT, Falade MO, Rice AL, Guy RK, et al. Identification of Plasmodium falciparum heat shock 90 inhibitors via molecular docking. Bioorg Med Chem Lett [Internet]. 2021;35:127818. Available from: https://www.sciencedirect.com/science/article/pii/S0960894X21000445 doi: 10.1016/j.bmcl.2021.127818 [DOI] [PubMed] [Google Scholar]
  • 29.Mafethe O, Ntseane T, Dongola TH, Shonhai A, Gumede NJ, Mokoena F. Pharmacophore Model-Based Virtual Screening Workflow for Discovery of Inhibitors Targeting Plasmodium falciparum Hsp90. ACS Omega [Internet]. 2023. Sep 26; Available from: doi: 10.1021/acsomega.3c04494 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Jiménez-Luna J, Grisoni F, Weskamp N, Schneider G. Artificial intelligence in drug discovery: recent advances and future perspectives. Expert Opin Drug Discov. 2021. Sep;16(9):949–59. doi: 10.1080/17460441.2021.1909567 [DOI] [PubMed] [Google Scholar]
  • 31.Marin E, Kovaleva M, Kadukova M, Mustafin K, Khorn P, Rogachev A, et al. Regression-Based Active Learning for Accessible Acceleration of Ultra-Large Library Docking. J Chem Inf Model [Internet]. 2024. Apr 8;64(7):2612–23. Available from: doi: 10.1021/acs.jcim.3c01661 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Schrödinger Release 2021–2. Maestro. Maestro. 2021;Schrödinge(LLC):New York, NY.
  • 33.Corbett KD, Berger JM. Structure of the ATP-binding domain of Plasmodium falciparum Hsp90. Proteins. 2010. Oct;78(13):2738–44. doi: 10.1002/prot.22799 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Stebbins CE, Russo AA, Schneider C, Rosen N, Hartl FU, Pavletich NP. Crystal Structure of an Hsp90 –Geldanamycin Complex: Targeting of a Protein Chaperone by an Antitumor Agent. Cell. 1997;89:239–50. doi: 10.1016/s0092-8674(00)80203-2 [DOI] [PubMed] [Google Scholar]
  • 35.Obermann WM, Sondermann H, Russo AA, Pavletich NP, Hartl FU. In vivo function of Hsp90 is dependent on ATP binding and ATP hydrolysis. J Cell Biol. 1998. Nov;143(4):901–10. doi: 10.1083/jcb.143.4.901 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, et al. The Protein Data Bank. Nucleic Acids Res. 2000;28:235–42. doi: 10.1093/nar/28.1.235 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Schrödinger Release 2021–3. Protein Preparation Wizard; Epik, Schrödinger, LLC, New York, NY, 2021. Impact, Schrödinger. 2021;Prime, Sch(New York, NY):LLC.
  • 38.Schrödinger Release 2021–2. Glide. Schrödinger. 2021;LLC(New York, NY).
  • 39.Schrödinger Release 2021–3. LigPrep. LigPrep. 2021;Schrödinge(LLC):New York, NY.
  • 40.Türtscher PL, Reiher M. Pathfinder─Navigating and Analyzing Chemical Reaction Networks with an Efficient Graph-Based Approach. J Chem Inf Model [Internet]. 2023. Jan 9;63(1):147–60. Available from: doi: 10.1021/acs.jcim.2c01136 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Biernacki K, Daśko M, Ciupak O, Kubiński K, Rachon J, Demkowicz S. Novel 1,2,4-Oxadiazole Derivatives in Drug Discovery. Pharmaceuticals (Basel). 2020. May;13(6). doi: 10.3390/ph13060111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Gumede NJ. Pathfinder-Driven Chemical Space Exploration and Multiparameter Optimization in Tandem with Glide/IFD and QSAR-Based Active Learning Approach to Prioritize Design Ideas for FEP+ Calculations of SARS-CoV-2 PL(pro) Inhibitors. Molecules. 2022. Dec;27(23). doi: 10.3390/molecules27238569 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Konze KD, Bos PH, Dahlgren MK, Leswing K, Tubert-Brohman I, Bortolato A, et al. Reaction-Based Enumeration, Active Learning, and Free Energy Calculations To Rapidly Explore Synthetically Tractable Chemical Space and Optimize Potency of Cyclin-Dependent Kinase 2 Inhibitors. J Chem Inf Model [Internet]. 2019. Sep 23;59(9):3782–93. Available from: doi: 10.1021/acs.jcim.9b00367 [DOI] [PubMed] [Google Scholar]
  • 44.Dixon SL, Duan J, Smith E, Von Bargen CD, Sherman W, Repasky MP. AutoQSAR: an automated machine learning tool for best-practice quantitative structure-activity relationship modeling. Future Med Chem. 2016. Oct;8(15):1825–39. doi: 10.4155/fmc-2016-0093 [DOI] [PubMed] [Google Scholar]
  • 45.Bowers KJ, Chow DE, Xu H, Dror RO, Eastwood MP, Gregersen BA, et al. Scalable Algorithms for Molecular Dynamics Simulations on Commodity Clusters. In: SC ‘06: Proceedings of the 2006 ACM/IEEE Conference on Supercomputing. 2006. p. 43.
  • 46.Patel HM, Ahmad I, Pawara R, Shaikh M, Surana S. In silico search of triple mutant T790M/C797S allosteric inhibitors to conquer acquired resistance problem in non-small cell lung cancer (NSCLC): a combined approach of structure-based virtual screening and molecular dynamics simulation. J Biomol Struct Dyn. 2021. Mar;39(4):1491–505. doi: 10.1080/07391102.2020.1734092 [DOI] [PubMed] [Google Scholar]
  • 47.Genheden S, Ryde U. The MM/PBSA and MM/GBSA methods to estimate ligand-binding affinities. Expert Opin Drug Discov. 2015. May;10(5):449–61. doi: 10.1517/17460441.2015.1032936 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Makler MT, Ries JM, Williams JA, Bancroft JE, Piper RC, Gibbins BL, et al. Parasite lactate dehydrogenase as an assay for Plasmodium falciparum drug sensitivity. Am J Trop Med Hyg. 1993. Jun;48(6):739–41. doi: 10.4269/ajtmh.1993.48.739 [DOI] [PubMed] [Google Scholar]
  • 49.Mosmann T. Rapid colorimetric assay for cellular growth and survival: application to proliferation and cytotoxicity assays. J Immunol Methods. 1983. Dec;65(1–2):55–63. doi: 10.1016/0022-1759(83)90303-4 [DOI] [PubMed] [Google Scholar]
  • 50.Muthelo T, Mulaudzi V, Netshishivhe M, Dongola TH, Kok M, Makumire S, et al. Inhibition of Plasmodium falciparum Hsp70-Hop partnership by 2-phenylthynesulfonamide. Front Mol Biosci. 2022;9:947203. doi: 10.3389/fmolb.2022.947203 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Aggarwal R, Ranganathan P. Common pitfalls in statistical analysis: The use of correlation techniques. Perspect Clin Res. 2016;7(4):187–90. doi: 10.4103/2229-3485.192046 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Martínez L. Automatic identification of mobile and rigid substructures in molecular dynamics simulations and fractional structural fluctuation analysis. PLoS One. 2015;10(3):e0119264. doi: 10.1371/journal.pone.0119264 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Peccati F, Jiménez-Osés G. Enthalpy—Entropy Compensation in Biomolecular Recognition: A Computational Perspective. ACS Omega [Internet]. 2021. May 4;6(17):11122–30. Available from: doi: 10.1021/acsomega.1c00485 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Biswal J, Jayaprakash P, Rayala SK, Venkatraman G, Rangasamy R, Poopandi S, et al. Water Mapping and Scoring Approaches to Predict the Role of Hydration Sites in the Binding Affinity of PAK1 Inhibitors. Comb Chem High Throughput Screen. 2022;25(4):660–76. doi: 10.2174/1386207324666210308110646 [DOI] [PubMed] [Google Scholar]
  • 55.Miller EB, Murphy RB, Sindhikara D, Borrelli KW, Grisewood MJ, Ranalli F, et al. Reliable and Accurate Solution to the Induced Fit Docking Problem for Protein—Ligand Binding. J Chem Theory Comput [Internet]. 2021. Apr 13;17(4):2630–9. Available from: doi: 10.1021/acs.jctc.1c00136 [DOI] [PubMed] [Google Scholar]
  • 56.World Health Organization. World Malaria Report. Vol. WHO/HTM/GM, World Health Organization. 2021. 238 p.
  • 57.Gentile F, Yaacoub JC, Gleave J, Fernandez M, Ton A-T, Ban F, et al. Artificial intelligence—enabled virtual screening of ultra-large chemical libraries with deep docking. Nat Protoc [Internet]. 2022;17(3):672–97. Available from: doi: 10.1038/s41596-021-00659-2 [DOI] [PubMed] [Google Scholar]
  • 58.Yang Z, Nasrallah IM, Shou H, Wen J, Doshi J, Habes M, et al. A deep learning framework identifies dimensional representations of Alzheimer’s Disease from brain structure. Nat Commun [Internet]. 2021;12(1):7065. Available from: doi: 10.1038/s41467-021-26703-z [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Peter Mbugua Njogu

24 Mar 2024

PONE-D-24-01107Auto QSAR-based Active learning docking for hit identification of potential inhibitors of Plasmodium falciparum Hsp90 as antimalarial agentsPLOS ONE

Dear Dr. Mokoena,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

==============================

Academic Editor:

There is a need to rationalize all the assumptions made in your study. The reviewers have made general and specific recommendations that should be implemented and/or responded to for clarity and better understanding by our readers.

Specifically, address the following concerns: 

1. Resolution of all images. 

2. Homology modelling between the ATP binding pockets of the Plasmodium falciparum heat shock protein 90 (PfHsp90) and the human heat shock protein 90 (hHsp90). Are there any differences in amino acids sequence that could drive selective toxicity? 

3. Pharmacophore modelling based on the study conducted. From this study, what are the probable binding modes and chemical interactions between the tested compounds and the molecular target (ATP binding pocket of the PfHsp90)? What are the chemical groups essential for the suggested chemical interactions?

4. It would be prudent to depict the interactions between the most active novel compound and the molecular target (PfHsp90). 

5. Could the Authors speculate to what extent the identified compounds could be applicable to the other plasmodial species that infect humans. 

​Please submit your revised manuscript by May 08 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Peter Mbugua Njogu, Ph.D.

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please note that PLOS ONE has specific guidelines on code sharing for submissions in which author-generated code underpins the findings in the manuscript. In these cases, all author-generated code must be made available without restrictions upon publication of the work. Please review our guidelines at https://journals.plos.org/plosone/s/materials-and-software-sharing#loc-sharing-code and ensure that your code is shared in a way that follows best practice and facilitates reproducibility and reuse.

3. Thank you for stating the following financial disclosure: "FM was awarded the Grand challenges Africa drug discovery seed grant (GCA/Round10/DD-065), funded by the Bill and Melinda Gates foundation is hereby acknowledged. "

Please state what role the funders took in the study.  If the funders had no role, please state: ""The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript."" 

If this statement is not correct you must amend it as needed. 

Please include this amended Role of Funder statement in your cover letter; we will change the online submission form on your behalf.

4. Please note that in order to use the direct billing option the corresponding author must be affiliated with the chosen institute. Please either amend your manuscript to change the affiliation or corresponding author, or email us at plosone@plos.org with a request to remove this option.

5. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Comments to the Author

The manuscript entitled: “Auto QSAR-based Active learning docking for hit identification of potential inhibitors of Plasmodium falciparum Hsp90 as antimalarial agents” investigates the application of molecular docking and active learning (AL) models to discover novel inhibitors of PfHsp90, a validated antiplasmodial target. The authors were able to purchase 15 new compounds for validation against PfHsp90 using AL models, demonstrating affinities ranging from 13.5 – 19.9 µM and IC50 values lower than 6 µM in in vitro assays against the parasite. However, several critical issues require attention, necessitating a major revision before the acceptance of the paper. Primarily, the quality of the images provided is inadequate, impairing the analysis of results. Enhanced image quality must be done to facilitate comprehensive data interpretation. Additionally, the inclusion of the 2D structures of the purchased compounds should be added (this is very important) within the main manuscript. Also elucidating their binding modes in the active site environment of PfHsp90 would greatly benefit readers' understanding. In addition, a SAR analysis must be done for these new inhibitors, based on their affinity to the protein target (PfHsp90) and according to their potency against the parasite, highlighting the significance of specific scaffolds or substituent groups in driving observed activity. Finally, some points in the manuscript are too extensive and should be more concise to facilitate readability, in many points is difficult to understand what the authors want to express. Overall, while the findings of this study hold promise for publication in PLOS ONE, substantial revisions are required to address the aforementioned issues as well as the following points.

- All figures must be recreated with significantly higher resolution, as the current images lack clarity and hinder comprehensive analysis.

- There are instances within the manuscript, such as in line 472, where the authors have employed a comma (',') as the decimal separator in numerical values. It is recommended that they review and replace these commas with periods ('.') for consistency and clarity.

- Throughout the paper. Double-check when Plasmodium, Pf, Plasmodium falciparum, etc¸ are cited since these should be written in italics (PfHsp90 -> PfHsp90). IC50 -> IC50; chEMBL -> ChEMBL;

- The chemical structures of the novel and purchased compounds must be represented in the manuscript. As well as a detailed SAR analysis of these compounds against the protein and the parasite.

- Line 60. In the introduction, explain more about how climate conditions complicate the treatment of malaria. I don’t think the right word should be ‘complicates’ but something more related to increasing the risk of more people being infected with malaria.

- An image comparing PfHsp90 with human Hsp90 active sites and highlighting the differences and the amino acids commented on by the authors would be beneficial for the readers.

- From lines 108 to 115 I don’t think this is necessary to be said. Also, the example of using ML of A2A receptor antagonist should be replaced by some examples of using ML to design new antiplasmodial/antimalarial inhibitors. This will be better for the audience reading it and make a direct correlation between the use of ML to design new inhibitors targeting malaria.

- In line 130, the authors write ‘AL models’ but no definition about what is the abbreviation AL (Active learning) was given before.

- Line 272, why did the author select the division 75/25 for the training set?

- Line 275, what about the MAE analysis of the created model?

- In docking methodology, the authors should say that they also have made a visual inspection in addition to the docking score to analyze the obtained poses (see ref https://pubs.acs.org/doi/abs/10.1021/acs.jmedchem.0c02227). Moreover, what were the interactions analyzed to perform the visual inspection, this could be helpful to other analyzing inhibitors against PfHsp90.

- Line 410 ‘pi-pi’ -> π-π

- In the "Induced fit docking of reference compounds" section of the results, it is necessary for the authors to include an image illustrating the ligand(s) positioned within the active site, emphasizing the primary interactions discussed. While it may serve as a complement to Figure 3, it's important to note that the resolution of Figure 3 is insufficient, rendering it impossible to discern details. Therefore, a new figure with higher resolution should be provided to adequately visualize the ligand interactions in the active site.

- Line 424, “10, 000” -> 10000

- It is not usual the construction of AL models using docking scores, the authors should provide a discussion about that with recent literature in which the use of AL models created by docking results resulted in outstanding outcomes. Usually, more robust calculations are performed for AL models (e.g. FEP) – check references: https://www.sciencedirect.com/science/article/pii/S2667318522000204 and https://pubs.acs.org/doi/10.1021/acs.jcim.3c00681.

- Line 473, the correct is ‘predictive affinity’ since this is a regression model using Gibbs free energy provided by docking or experimental Ki/Kd. Activity is for a classification model (not for regression model), in which a compound is active or inactive. Check this in other parts of the manuscript as well, please.

- In Figure 4, the X and Y axis labels should be changed to Affinity instead of Activity.

- Line 478, what about the applicability domain? I’m not sure if the authors said something about this in the manuscript.

- Results sections – ‘induced fit docking’ - An image of the best novel and predicted compounds in the active site with the intermolecular interactions with the target should be provided.

- Table 2: What is the energy unit in Table 2? kcal/mol? In addition, Table 2 must be adapted to fit on the page.

- In line 531, it is imperative to elucidate the disparities in interactions that lead to variations in docking scores between compounds targeting PfHsp90 and hHsp90. Furthermore, the text should explicitly outline the key interactions necessary to confer selectivity for PfHsp90 over hHsp90. Clarification on these points is essential for a comprehensive understanding of the molecular basis underlying the selectivity of compounds toward PfHsp90.

- Figure 5 must be done again to improve the resolution and coloring of the atoms of the ligand according to the heteroatoms to facilitate the visualization. All atoms with the same color for the ligand make it hard to analyze the pose.

- Line 538, ‘these residues suggest that they will likely compete with ATP for binding PfHsp90’, experimental biochemical assays were done using PfHsp90, right? I suppose the authors have checked the compounds' inhibition mechanism against PfHsp90. In this way, the ‘likely’ word here can be replaced.

- ‘Molecular dynamic simulations’ -> Molecular dynamics simulations

- In the "Molecular dynamics simulations" section, the analysis of results was hindered by the low resolution of Figure 6, rendering it impossible to interpret. Additionally, the authors must conduct supplementary analyses on the MD simulation trajectories. For example, they should examine and plot the distances of the hydrogen bonds formed between the compounds and the protein target throughout the simulation. Furthermore, it is essential to verify the persistence of these hydrogen bonds after a 100 ns simulation to provide a more comprehensive assessment of compound stability in conjunction with RMSD analysis. Relying solely on RMSD/RMSF may not be sufficient to ensure thorough evaluation.

Reviewer #2: Matlhod et al used compound 10 as a reference PfHsp90 to generate 10000 models using combinations of computational techniques. These models, together with compounds from the CheMBL data base were subjected to several training and testing rounds to identify compounds with high binding affinity to PfHsp90. A few of the top rank compounds were purchased and evaluated in vitro against the drug susceptible strain of Plasmodium falciparum. I will recommend publication of the work subject to the following revision:

(1) The authors should screen at least the best in vitro hit compound from this study against drug resistant strain of Plasmodium falciparum.

(2) There are several repetitions in the manuscript, especially between the methods and results sections, consider cutting back on these.

(3) The manuscript should be sent for thorough language editing as there many incomplete sentences or inappropriate punctuations.

(4) In both experimental parts, authors have written “Survival was plotted”. This is not very correct, it must be rephrased.

(5) Since computational methods suggested high binding to humanHsp90, others might consider testing for toxicity against another human cell line like HEK293

Others

Page 11, line 62-65, rephrase the sentence ‘’ Furthermore, areas such as Rwanda (4) and East Asia (5) have begun to report the spread and dissemination of first-line treatment options to address artemisinin-tolerant P. falciparum strains to emphasize the urgent need to develop potent and reliable anti-parasitic drugs’’. It is ambiguous.

Page 11, line 67, reference needed at the end.

Page 12, line 75 to 77, references needed at the end of each sentence.

Page 14, line 131, a figure containing the structure of compound 10 should be included nearby and mentioned in the text.

Page 20, line 269, “≥6.5” double check that a minus sign(-) is not missing

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2024 Nov 25;19(11):e0308969. doi: 10.1371/journal.pone.0308969.r002

Author response to Decision Letter 0


17 May 2024

Response to reviewer 1

General comments

However, several critical issues require attention, necessitating a major revision before the acceptance of the paper. Primarily, the quality of the images provided is inadequate, impairing the analysis of results. Enhanced image quality must be done to facilitate comprehensive data interpretation.

Response: We have done our best to improve the quality of the images that we generated. We hope that this improvement reflects to the satisfaction of the editor and the reviewers.

Additionally, the inclusion of the 2D structures of the purchased compounds should be added (this is very important) within the main manuscript.

Response: The 2D structure of the purchased compounds have been provided in the supplementary data.

Also elucidating their binding modes in the active site environment of PfHsp90 would greatly benefit readers' understanding.

Response: The study aimed to generate novel inhibitors of PfHsp90 targeting the N-terminal domain of the protein. These would compete with ATP/ADP for binding based on the nature of the compounds' interactions with PfHsp90. In the case of inhibitors interacting with Arg98, we speculate that these would interact selectively. We have revised the results sections to make this clear. Please refer to the track change version to see the extent of the changes reflected.

In addition, a SAR analysis must be done for these new inhibitors, based on their affinity to the protein target (PfHsp90) and according to their potency against the parasite, highlighting the significance of specific scaffolds or substituent groups in driving observed activity.

Response: Thank you. Pleaser refer to page 28 of the manuscript line 611-633 .

Finally, some points in the manuscript are too extensive and should be more concise to facilitate readability, in many points is difficult to understand what the authors want to express.

Overall, while the findings of this study hold promise for publication in PLOS ONE, substantial revisions are required to address the aforementioned issues as well as the following points.

Response: We have done our utmost best to revise and remove redundancies. Please refer to the track changed version to assess the extend of our effort. We hope this has made things less extensive.

Specific comments

All figures must be recreated with significantly higher resolution, as the current images lack clarity and hinder comprehensive analysis.

Response: Thank you for your constructive feedback regarding the figures in our manuscript. We acknowledge the importance of clear and high-resolution images for comprehensive analysis and have recreated all figures at a significantly higher resolution.

- There are instances within the manuscript, such as in line 472, where the authors have employed a comma (',') as the decimal separator in numerical values. It is recommended that they review and replace these commas with periods ('.') for consistency and clarity.

Response: Table 1 has been revised to have period to separate the decimals. Thank you for this observation. Please see revision on

- Throughout the paper. Double-check when Plasmodium, Pf, Plasmodium falciparum, etc¸ are cited since these should be written in italics (PfHsp90 -> PfHsp90). IC50 -> IC50; chEMBL -> ChEMBL;

Response: The editorial changes have been effected through the manuscript.

- The chemical structures of the novel and purchased compounds must be represented in the manuscript. As well as a detailed SAR analysis of these compounds against the protein and the parasite.

Response: Thank you for the suggestion. The reference compound 10, harmine, as well as the top scoring compounds (purchased) were compiled in one image. Additionally, a SAR section of these compounds has been included starting from line 661-633.

- Line 60. In the introduction, explain more about how climate conditions complicate the treatment of malaria. I don’t think the right word should be ‘complicates’ but something more related to increasing the risk of more people being infected with malaria.

Response: Thank you. This has been revised to “It has also been suggested that changes in climate conditions such as increased temperatures and heavy rainfall may result in an increased mosquito population, putting more people at risk of contracting malaria (1).” Please see line 70-73.

.

- An image comparing PfHsp90 with human Hsp90 active sites and highlighting the differences and the amino acids commented on by the authors would be beneficial for the readers.

Response: Thank you for your feedback. The image displaying both ATP binding pockets of PfHsp90 and HsHsp90 has been provided. This image also displays the residues involved in inhibitor binding, as well as the glycine rich loop (GHL).

- From lines 108 to 115 I don’t think this is necessary to be said. Also, the example of using ML of A2A receptor antagonist should be replaced by some examples of using ML to design new antiplasmodial/antimalarial inhibitors. This will be better for the audience reading it and make a direct correlation between the use of ML to design new inhibitors targeting malaria.

Response: Thank you for your insightful feedback. We have removed the mentioned lines (108-115) as per your suggestion. Moreover, we appreciate your recommendation to replace the example with instances of using ML to design antiplasmodial/antimalarial inhibitors. We want to emphasize that this study marks the pioneering use of ML in designing such antimalarial inhibitors.

- In line 130, the authors write ‘AL models’ but no definition about what is the abbreviation AL (Active learning) was given before.

Response: We beg the pardon of the reviewer. The definition was provided. We have now put ML/AL as abbreviations of machine learning and active learning in brackets to clarify things better.

- Line 272, why did the author select the division 75/25 for the training set?

Response: The 75/25 training and testing data split, commonly used in machine learning, was chosen deliberately to ensure model training on the majority of data while evaluating its performance on unseen data. Initially, we aimed to ensure a diverse representation of compounds across the entire spectrum of docking scores, ranging from -6 kcal/mol to the highest docking scores. This approach is consistent with recommendations in the literature to encompass a wide range of chemical space to enhance model robustness and generalization (Gumede, 2022).

Furthermore, during the initial stages of model training, we observed that the majority of compounds were clustered within the lower docking score range. To address this imbalance and promote learning across the entire spectrum of docking scores, we opted for a 75/25 division. This decision was aimed at broadening the coverage of the model and enabling it to effectively predict the activity of compounds with higher docking scores.

- Line 275, what about the MAE analysis of the created model?

Response: Thank you for your query regarding the Mean Absolute Error (MAE) analysis of our created model. The MAE analysis was conducted and the results of our model indeed range from 1 to -1 (Table S5), indicating the accuracy of our predictions. As MAE measures the average absolute difference between predicted activities and observed activities, a value closer to 0 signifies greater accuracy in the model's predictions.

- In docking methodology, the authors should say that they also have made a visual inspection in addition to the docking score to analyze the obtained poses (see ref https://pubs.acs.org/doi/abs/10.1021/acs.jmedchem.0c02227). Moreover, what were the interactions analyzed to perform the visual inspection, this could be helpful to other analyzing inhibitors against PfHsp90.

Response: Thank you for the feedback. The implementation of visual inspection to assess the binding interactions between the ligands and protein, in combination with docking score, IFD score and e-model score, was stated. We do acknowledge the need to elaborate on the nature of the binding interactions that need to be observed to assert selectivity towards PfHsp90, and thank the reviewer for taking note of this. This addition was made in the induced fit docking methodology.

- Line 410 ‘pi-pi’ -> π-π

Response: Thank you for bringing to our attention the need for the correct representation of 'pi-pi' interactions as 'π-π' in line 410 of the manuscript. We have promptly made the necessary correction to ensure accuracy in our terminology.

- In the "Induced fit docking of reference compounds" section of the results, it is necessary for the authors to include an image illustrating the ligand(s) positioned within the active site, emphasizing the primary interactions discussed. While it may serve as a complement to Figure 3, it's important to note that the resolution of Figure 3 is insufficient, rendering it impossible to discern details. Therefore, a new figure with higher resolution should be provided to adequately visualize the ligand interactions in the active site.

Response: Thank you for your valuable feedback regarding the "Induced fit docking of reference compounds" section in our results. We recognize the importance of providing a clear visualization of ligand interactions within the active site. To address this concern, we have included a new figure with higher resolution, emphasizing the primary interactions discussed. This figure serves as a complement to Figure 3, which we acknowledge had insufficient resolution for discerning details. We trust that the inclusion of this new figure enhances the clarity and comprehensibility of our findings

- Line 424, “10, 000” -> 10000

Response: Thank you for pointing out the need to correct the formatting of "10,000" to "10000" in line 424 of the manuscript. We have made the necessary adjustment to ensure consistency in formatting throughout the document

- It is not usual the construction of AL models using docking scores, the authors should provide a discussion about that with recent literature in which the use of AL models created by docking results resulted in outstanding outcomes. Usually, more robust calculations are performed for AL models (e.g. FEP) – check references: https://www.sciencedirect.com/science/article/pii/S2667318522000204 and https://pubs.acs.org/doi/10.1021/acs.jcim.3c00681.

Response: Thank you for your insightful comments regarding the construction of AL models using docking scores. We acknowledge that this approach may seem unconventional, as docking scores are typically not the sole criterion for model training in traditional machine learning applications in drug discovery. However, our methodology aligns with recent innovations where docking scores are integrated into AL frameworks to refine and prioritize the selection process for subsequent rounds of simulation or experimental validation. In response to your comment, we have included a more detailed discussion on the use of docking scores in AL models (line 129 – 143).

“Recent studies demonstrate the efficacy of Active Learning (AL) models that incorporate docking scores in streamlining the drug discovery process. Notably, Marin et al.(2024) study on regression-based AL models highlights how these can rapidly prioritize compounds in large-scale docking, enhancing efficiency and reducing costs. Similarly, Aniceto et al. (2023) applied AL to optimize virtual screening processes for urease inhibitors, demonstrating the model's ability to increase hit rates effectively.

Another significant contribution comes from Gumede (2022) study, which combines QSAR-based AL with docking scores to prioritize SARS-CoV-2 PLpro inhibitors. This innovative approach has shown that AL models can effectively select promising candidates for further computational and experimental analysis, accelerating the development of therapeutic agents

These instances underline the potential of docking-based AL models in drug discovery. By leveraging the predictive capabilities of docking scores within AL frameworks, researchers can enhance the drug discovery pipeline's speed and efficacy. This method not only improves the prioritization of therapeutic compounds but also promises broader applications in developing novel drugs for various diseases.”

- Line 473, the correct is ‘predictive affinity’ since this is a regression model using Gibbs free energy provided by docking or experimental Ki/Kd. Activity is for a classification model (not for regression model), in which a compound is active or inactive. Check this in other parts of the manuscript as well, please.

Response: Thank you for your comments concerning on the use of "predictive affinity" versus "activity." We appreciate your perspective and the distinction you've highlighted between regression and classification models within the context of molecular modelling. However, we would like to respectfully disagree with the suggestion to modify the terminology from "activity" to "predictive affinity". In the literature, the term "activity" is frequently used in a broader sense to describe the efficacy of a compound, encompassing both its binding affinity and its functional outcome at a molecular or cellular level. While it's true that "activity" is commonly associated with classification models where compounds are categorized as active or inactive, in the context of regression models predicting binding affinities, "activity" can still be appropriate. The term "activity" in the context of regression models is widely used in the literature to refer to the predicted or observed binding affinities of compounds (Peter et al., 2018). It encompasses the quantitative measure of how strongly a compound interacts with its target, whether derived from experimental data or computational predictions.

We believe that our use of "activity" is consistent with established conventions in the field, where it often represents a continuum of molecular interactions rather than a binary classification. We have also reviewed other sections of our manuscript for consistency in terminology and found them to align with the broader scientific usage as supported by our references.

We hope this clarification addresses your concerns, and we thank you for prompting a thorough review of our terminology. We remain committed to accurate and clear scientific communication

-Peter, S. C., Dhanjal, J. K., Malik, V., Radhakrishnan, N., Jayakanthan, M., & Sundar, D. 2018. Quantitative Structure-Activity Relationship (QSAR): Modeling Approaches to Biological Applications. Encyclopedia of Bioinformatics and Computational Biology, 661-676. https://doi.org/10.1016/B978-0-12-809633-8.20197-0.)

.

- In Figure 4, the X and Y axis labels should be changed to Affinity instead of Activity.

Response: Thank you for your observation. We understand your suggestion to change the labels from "Activity" to "Affinity". However, in the context of our study, at this stage the glide docking process employed was aimed at discriminate between binders and non-binders rather than quantifying the exact affinity of the binding interactions. As such, "Activity" as used here broadly categorizes the ability of compounds to interact with the target, rather than their precise binding affinities. Additionally, the labels on the X and Y axes are auto-generated by the program. We opted to retain the original output labels to maintain consistency and transparency in our data presentation.

We appreciate your attention to the details of our figure presentation and hope this explanation clarifies the terminology used in our study.

- Results sections – ‘induced fit docking’ - An image of the best novel and predicted compounds in the active site with the intermolecular interactions with the target should be provided.

Response: Thank you for the recommendation. An image of these compounds docked within the binding pocket has been provided.

- Table 2: What is the energy unit in Table 2? kcal/mol? In addition, Table 2 must be adapted to fit on the page.

Response: Thank you for your inquiry regarding the energy unit in Table 2 and the formatting of the table. We confirm that the energy unit in Table 2 is indeed k

Attachment

Submitted filename: 17May2024 Reviewer 2 comments_Finalized.docx

pone.0308969.s002.docx (18.3KB, docx)

Decision Letter 1

Yash Gupta

5 Aug 2024

Auto QSAR-based Active learning docking for hit identification of potential inhibitors of Plasmodium falciparum Hsp90 as antimalarial agents

PONE-D-24-01107R1

Dear Dr. Mokoena,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager® and clicking the ‘Update My Information' link at the top of the page. If you have any questions relating to publication charges, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Yash Gupta, Ph.D.

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

I think authors should add a limitation to the study stating mmGBSA and induced fit docking which consider target site to be flexible add too much variability for the used machine learning model.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #2: All comments have been addressed

Reviewer #3: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #2: Yes

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #2: Yes

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #2: Yes

Reviewer #3: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #2: Authors made appropriate corrections/explanations to the requested changes in the previous round. I do not have any objection to the manuscript.

Reviewer #3: Summary: The manuscript by Mokoena et al. presents an innovative approach combining Auto-QSAR models with active learning (AL), a type of machine learning, and molecular docking methods to identify potential inhibitors of Plasmodium falciparum Hsp90 (PfHsp90), a promising target for antimalarial drug development. The methods are detailed and well-structured, and the study successfully identifies several compounds with moderate activity against PfHsp90 and provides a starting point for further research and discovery.

Comment 1: The integration of Auto-QSAR, active learning, and docking is a commendable approach that enhances the efficiency of hit identification. The use of various computational techniques, including induced fit docking, molecular dynamics simulations, and MM-GBSA calculations, strengthens the reliability of the findings.

Comment 2: Some sections of the manuscript could be more concise to improve readability. The authors should emphasize the novelty and potential impact of the identified compounds on malaria treatment would strengthen this section, as well as eliminate any redundant information

Comment 3: The authors may have already addressed this, but the manuscript would benefit from ensuring all images are of the highest resolution to facilitate better comprehension of the results.

Concluding Remarks: I mostly agree with the previous reviewers. The manuscript presents a thorough study with significant potential implications for antimalarial drug discovery via PfHsp90. With improvements in image quality, conciseness, and a more detailed SAR analysis, I believe this manuscript does have a scientific impact and does fit this journal. Once what I consider minor improvements are completed, I believe this should move forward for publication.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: Yes: Richard Beteck

Reviewer #3: No

**********

Acceptance letter

Yash Gupta

8 Aug 2024

PONE-D-24-01107R1

PLOS ONE

Dear Dr. Mokoena,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

If revisions are needed, the production department will contact you directly to resolve them. If no revisions are needed, you will receive an email when the publication date has been set. At this time, we do not offer pre-publication proofs to authors during production of the accepted work. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few weeks to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Yash Gupta

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 File

    (DOCX)

    pone.0308969.s001.docx (4.5MB, docx)
    Attachment

    Submitted filename: 17May2024 Reviewer 2 comments_Finalized.docx

    pone.0308969.s002.docx (18.3KB, docx)

    Data Availability Statement

    All relevant data are within the manuscript and its Supporting information files.


    Articles from PLOS ONE are provided here courtesy of PLOS

    RESOURCES