Skip to main content
Royal Society Open Science logoLink to Royal Society Open Science
. 2024 Oct 2;11(10):240546. doi: 10.1098/rsos.240546

Searching for potential acetylcholinesterase inhibitors: a combined approach of multi-step similarity search, machine learning and molecular dynamics simulations

Quynh Mai Thai 1,2, Minh Quan Pham 3,4, Phuong-Thao Tran 5, Trung Hai Nguyen 1,2,, Son Tung Ngo 1,2
PMCID: PMC11444763  PMID: 39359466

Abstract

Targeting acetylcholinesterase is one of the most important strategies for developing therapeutics against Alzheimer’s disease. In this work, we have employed a new approach that combines machine learning models, a multi-step similarity search of the PubChem library and molecular dynamics simulations to investigate potential inhibitors for acetylcholinesterase. Our search strategy has been shown to significantly enrich the set of compounds with strong predicted binding affinity to acetylcholinesterase. Both machine learning prediction and binding free energy calculation, based on linear interaction energy, suggest that the compound CID54414454 would bind strongly to acetylcholinesterase and hence is a promising inhibitor.

Keywords: AChE, binding free energy, machine learning, MD simulations

1. Introduction

Alzheimer’s disease (AD) is the most common type of dementia and causes the deterioration of memory and cognitive functions in millions of senior adults worldwide [1,2]. In addition, there has been a sharp rise in the number of patients in recent years [3]. Unfortunately, despite considerable efforts by the scientific community [48], effective treatment for AD remains elusive [1,9].

According to the cholinergic hypothesis [10,11], the mechanism of AD is associated with acetylcholinesterase (AChE), an enzyme that hydrolyses acetylcholine (ACh), a neurotransmitter. The enzyme AChE has been recognized as a primary target for the development of drugs to treat AD [12,13]. Inhibiting AChE in cholinergic neurons may hinder synaptic depression and prevent the breakdown of ACh. There are currently several US Food and Drug Administration approved commercial drugs such as donepezil [12], galantamine [14] and rivastigmine [15]. Nevertheless, these drugs induce numerous adverse side effects. Consequently, the investigation of new inhibitors targeting AChE continues to attract significant interest among researchers [1618].

Computational approaches have been increasingly employed to search for potential inhibitors that can bind with a strong affinity to a protein target and inhibit its biological activity [1925]. Molecular docking [26] has been commonly used for initial screening of large databases of compounds to reduce to a short list of potential compounds. More computationally demanding methods, such as free energy perturbation [27,28], fast pulling of ligands [29], and linear interaction energy (LIE) [30,31], are used for more accurate prediction of binding free energy for compounds in the shortlist. Recently, machine learning (ML) methods have shown great potential in fast and accurate screening of potential drugs [3234]. One promising application of ML methods is in initial screening of large numbers of ligands for potential inhibitors [3537]. The prediction from ML is then validated through molecular dynamics (MD) simulations and physics-based free energy calculations [24,25,37].

In this study, we apply the ML method in combination with a multi-step similarity search of the PubChem database to screen for potential AChE inhibitors. The best-predicted compound is then validated by MD simulations and binding free energy calculation based on LIE.

2. Material and methods

Our computational strategy to screen for potential AChE inhibitors is depicted in figure 1. First, a ML model was employed to predict binding free energy for 1584 National Cancer Institute (NCI) compounds, of which the top compounds having the strongest predicted binding free energy were selected. Next, multiple rounds of similarity search from the PubChem library were combined with ML prediction to enrich the set of top compounds. These top compounds were then docked into the AChE binding pocket to create initial structures for MD simulations. Finally, data from MD simulations were used to carry out binding free energy calculations based on the LIE method.

Figure 1.

Computational scheme for screening of potential AChE inhibitors.

Computational scheme for screening of potential acetylcholinesterase (AChE) inhibitors. MD, molecular docking.

2.1. Data collection

The SMILES strings of 1584 compounds were downloaded from the NCI Diversity Set VII [38]. Structures of human AChEs in complex with 10 inhibitors were downloaded from the Protein Data Bank with the identifiers 6CQV [39], 4M0E, 4M0F [40], 4EY5, 4EY6, 4EY7 [41], 6O50 [42], 7D9Q, 7D9O and 7D9P [43].

2.2. Training of machine learning models

Regression models were trained in our previous work (QM Thai, TH Nguyen, GB Lenon, HTT Phung Horng, PT Thai, Nguyen, Lenon, Phung JT, Tran, submitted). Here, we provide an overview of the model training process. SMILES and experimental binding constants (K i) for 1426 compounds were collected from the binding database [44,45]. K i was converted into binding free energy using the formula ΔG=RTlnKi , where R is gas constant and T = 298 K is the absolute temperature. A total of 1064 compounds were randomly selected for training and the rest (362) were used for testing the models.

We trained four ML models: linear regression (LR), random forest (RF), extreme gradient boosting (XGBoost) [46] and graph-convolutional networks (Graph-Conv) [47]. Features for LR, RF and XGBoost models were calculated using the RDKitDescriptors toolkit, implemented in DeepChem [48]. These features are descriptors describing physicochemical properties, such as molecular weight, number of valence electrons, numbers of hydrogen bond (HB) donors and acceptors, and polar surface area. GraphConv is a deep learning approach that can learn features on the fly and therefore, removes the need to manually extract features. The model performance on the test set of 361 compounds was measured by root mean square error (RMSE) and the two correlation coefficients, Pearson’s R and Spearman’s ρ . XGBoost gave the best test performance, with lowest RMSE = 1.357 ± 0.096 kcal mol−1 ( 1 kcal mol−1 = 4.184 kJ mol−1) and highest correlation coefficients, Pearson’s R=0.813±0.027 and Spearman’s ρ=0.808±0.026 . The trained model showed significant improvement compared with the one from our previous work [25], which was trained on fewer data. Electronic supplementary material, figure S1 shows a comparison between experimental and ML-predicted binding free energy for the test compounds.

2.3. Prediction of binding free energy using machine learning model

The best XGBoost model was employed to predict binding affinity for 1584 NCI compounds. The top 17 compounds with predicted binding free energy lower than −10 kcal mol−1 (electronic supplementary material, table S1) were selected for the subsequent similarity search of the PubChem library.

2.4. Similarity search of PubChem library

A two-round similarity search was performed. In the first round, compounds similar to at least one of those in the list of the top 17 compounds (electronic supplementary material, table S1) were searched from the PubChem library [49], resulting in 5393 unique compounds. Binding free energy was predicted for these newly identified compounds using the XGBoost model. In the second round, another list of the top nine compounds with the lowest predicted binding free energy were chosen (electronic supplementary material, table S2). A similarity search in PubChem for this list resulted in 1768 unique compounds for which ML prediction of binding free energy was also made.

2.5. Molecular docking

AutoDock Vina [50] with modified empirical parameters [51] was employed to dock the ligand with lowest binding free energy by XGBoost into the binding site of AChE, whose crystal structure was downloaded from the Protein Data Bank with identifier 4M0E [40]. The receptor and ligands were parameterized using a force field provided by AutoDockTools. The centre of the docking grid was chosen as the centre of mass of dihydrotanshinone I bound to AChE in the crystal structure. The docking grid size was chosen as 26  ×  26  ×  26 Å3. Structures with the lowest energy from docking were captured for further analysis through MD simulations.

2.6. Molecular dynamics simulations

Atomistic simulations in aqueous solution were carried out for 11 AChE–inhibitor complexes, including 10 crystal structures with PDB identifiers 6CQV, 4M0E, 4M0F, 4EY5, 4EY6, 4EY7, 6O50, 7D9Q, 7D9O and 7D9P, and the docked structure best candidate inhibitor predicted by ML to AChE is 4M0E. Simulations were also performed for 11 free ligands in water boxes. GROMACS 2019.6 [52] was employed to sample conformational space of the receptor–ligand complexes and ligands in aqueous solution. The receptor and ions were parameterized by the Amber99SB-iLDN force field [53] ,whereas the TIP3P water model [54] was used for water molecules. The bonded and Lennard Jones parameters of ligands were taken from the general Amber force field (GAFF) [55], whereas the point charges were fitted using the restrained electrostatic potential method [55] implemented in the AmberTools18 [56] and ACPYPE packages [57]. The electrostatic potential grids used for fitting point charges were obtained from density functional theory (DFT) quantum mechanical calculations using the double hybrid Mp2 (Møller–Plesset second-order perturbation theory) functional, basis set 6-31G(d,p), and implicit solvent ( ε=78.4 ). The AChE–ligand complexes were placed in a box with dodecahedral periodic boundary conditions, ensuring a minimum separation of 16.0 Å from the complex to the box edge. The volume of the box was approximately 937.53 nm³ and the total number of atoms is roughly 92 000 for AChE–ligand complexes. For the free ligands, the simulation box has a volume of 58.60 nm³ and contains 5700 atoms in total.

First, the complexes underwent energy minimization using the steepest descent technique. Then the minimized systems were subjected to a relaxation process through 100 ps of NVT and NPT simulations, during which a slight harmonic force was applied to restrain the Cα atoms. The final snapshot from the NPT simulations served as the initial configuration for the subsequent unbiased MD simulations. These MD simulations were carried out for a duration of 50 ns each. To guarantee adequate sampling of the complexes’ conformations, MD simulations were repeated four times using different random number seeds.

2.7. Linear interaction energy calculation

Binding free energy calculations based on MD simulations have been widely used to study protein–ligand binding [58,59]. In this work, the AChE–ligand binding free energy ΔGLIE was estimated using the LIE method [30,31]. According to this method, the binding free energy is expressed as a linear function of the differences in mean interaction energies between the bound and unbound states. In particular, ΔGLIE is written as

,ΔGLIE=α(<Vl-svdw>b<Vl-svdw>u) +β(<Vl-scou>b<Vl-scou>u)+γ

where vdw and cou indicate van der Waals and Coulomb terms, respectively. The subscript l-s stands for ligand-surrounding and indicates the interaction energy between ligand and the surrounding environment. The subscripts b and u stand for bound and unbound states, respectively. MD trajectories for the bound AChE–ligand complexes in water were used to calculate the interaction energy in the bound state whereas the MD trajectories for the free ligand in water were used to calculate the interaction energy in the unbound state. Moreover, the shallow binding cleft of AChE bears a resemblance to the active site of SARS-CoV-2 Mpro. This suggests potential similarities in the physical behaviour of these receptors. Consequently, the empirical parameters α=0.288 , β=-0.049 and γ=-5.88 from the SARS-SARS-CoV-2 Mpro system [60] are suggested for calculating the ligand-binding affinity of the AChE+inhibitor.

2.8. Analysis tools

The root mean square deviation (RMSD) was calculated using the GROMACS tool ‘gmx rms’ [52]. The program ‘gmx hbond’ was used for analysing the intermolecular hydrogen bonds (HBs). HBs between the residual AChE and ligand were calculated when the angle between acceptor–hydrogen–donor is ≥135° and the distance between acceptor and donor is ≤3.5 Å. The intermolecular side chain contact (SC) between the ligand and the residual AChE was calculated based on the distance between their non-hydrogen atoms, which must be ≤4.5 Å. In addition, pharmacokinetic parameters and toxicity of potential compounds were predicted via PreADMET Web app [61]. This method has been shown to provide predictions that are highly correlated with experimental data.

3. Results and discussion

3.1. Machine learning prediction of binding free energy

The best ML model, XGBoost, was used to predict the binding free energy for 1584 compounds in the NCI library, the 5393 PubChem compounds that resulted from the first similarity search round and 1768 PubChem compounds in the second similarity search round. Figure 2 shows the distributions of the predicted binding free energy, ΔGpred , where we can see clearly that our multi-step similarity search of the PubChem library helped enrich the ligand set with more and more ligands having strong predicted binding affinity to AChE. In particular, the predicted binding free energy for compounds in the NCI compound set has the interquartile range −8.05 to −6.77 kcal mol−1 and a median value of −7.42 kcal mol−1. After the first round of similarity search in the PubChem library, the compound set adopted a significantly lower interquartile range of −9.77 to −8.54 kcal mol−1 with a median value of −9.30 kcal mol−1. The second similarity search shifts the distribution of predicted binding free energy further down to an interquartile range of −10.30 to −8.85 kcal mol−1 with the median at −9.65 kcal mol−1. This apparently shows that our similarity search procedure from the PubChem library is very effective in finding compounds having strong predicted binding free energy to AChE. The predicted strongest-binding compound has the PubChem CID 54414454 and XGBoost binding free energy of −11.51 kcal mol−1. Subsequent MD simulation and binding free energy estimation of this compound would provide important insights into the structural and energetic aspects of the binding process with atomistic details.

Figure 2.

Distribution of the binding free energy predicted by XGBoost model for ligands in the NCI library and for ligands resulting from the first (1st PubChem Search) and second (2nd PubChem Search) rounds of similarity search of the PubChem library.

Distribution of the binding free energy predicted by the XGBoost model for ligands in the NCI library and for ligands that resulted from the first (1st PubChem Search) and second (2nd PubChem Search) rounds of similarity search of the PubChem library.

3.2. Molecular dynamics simulations

Two independent MD simulations, each having a 50 ns trajectory, were performed for the complex between compound 54414454 and AChE. The RMSD plot over simulation time (electronic supplementary material, table S3) shows that the system has reached equilibrium. Important insights into the binding process can be gained by investigating how the ligand interacts with the surrounding amino acids in the binding pocket of AChE. Figure 3 shows the probability (fraction of time) of compound 54414454 making side chain (hydrophobic) contacts as well as HBs with residues of AChE. While the compound makes fewer HBs with AChE, its binding to AChE is driven mostly by hydrophobic interaction. In particular, it is found that the four residues TYR124, GLU202, PHE295 and TYR337 might play important roles in the binding of compound 54414454 since they contribute to both HB contacts and significant side chain contacts (figure 4).

Figure 3.

Probability for the compound 54414454 to form side chain (SC) and hydrogen bond (HB) contacts with residues of AChE.

Probability for the compound 54414454 to form side chain (SC) and hydrogen bond (HB) contacts with residues of acetylcholinesterase. The probability was estimated as the fraction of time the contacts were formed over the total simulation time.

Figure 4.

2D presentation of binding pose of 54414454 with AchE using a snapshot of MD simulation.

2-Dimensional presentation of binding pose of 54414454 with acetylcholinesterase using a snapshot of molecular docking simulation. The diagram was prepared by using Maestro software.

3.3. Binding free energy calculations

To validate our binding free energy calculation approach based on the LIE method, we calculated binding free energy ΔGLIE for 10 experimentally available AChE–ligand complexes for which both crystal structures and IC50 data are available. Table 1 shows a comparison between our calculation results and experiment. The calculated ΔGLIE is highly correlated with the experimental values (figure 5), with the correlation coefficient Pearson’s R = 0.75. The RMSE of our calculated result (with respect to experimental ΔGEXP ) is not too high (RMSE = 2.37 kcal mol−1). Therefore, the linear interaction energy approach can estimate binding free energy for AChE–ligand in good agreement with experiment. We then applied this approach for the best candidate compound CID54414454 predicted by the XGBoost model and the result is shown in the last row of table 1. The LIE gives an estimate of binding free energy for CID54414454 that is a little bit weaker than the 10 experimentally available complexes. A plausible explanation is that the LIE calculations for the 10 experimentally available complexes were based on native poses of the ligands in which the ligand already adopted the most favourable binding structures. On the other hand, the LIE calculation for compound CID54414454 was based on the docked pose, which may not be the most energetically favourable pose and the subsequent MD simulations were not able to relax the complex to the right binding pose. Nevertheless, the newly found ligand is still a good binder to AChE and can serve as a basis for further AD drug development targeting AChE.

Table 1.

Binding free energy calculation results using linear interaction energy (LIE) method for 10 experimentally available complexes and the best candidate compound CID54414454 predicted by our machine learning model.

PDB ID resolution (Å) ligand IC50 (nm) ΔG EXP (kcal mol−1) a ΔGLIE (kcal mol−1)
6CQV [39] 2.60 HI6 636 000 −4.39 −10.33 ± 0.36
4M0E [40] 2.00 dihydrotanshinone I 1000 −8.24 −11.02 ± 0.15
4EY6 [41] 2.40 galantamine 100 −9.61 −10.63 ± 0.14
4EY5 [41] 2.30 huperzine A 21 −10.54 −12.18 ± 0.19
6O50 [42] 2.35 EBW 8 −11.11 −12.84 ± 0.40
4M0F [40] 2.30 territrem B 6 −11.29 −13.48 ± 0.54
7D9Q [43] 2.66 H1R 3 −11.70 −12.08 ± 0.82
7D9O [43] 2.45 H0L 3 −11.77 −12.76 ± 0.20
4EY7 [41] 2.35 donepezil 2 −11.94 −12.43 ± 0.72
7D9P [43] 2.85 H0R 0.86 −12.44 −11.87 ± 0.61
CID54414454 −10.45 ± 0.26
a

The experimental values of ΔGEXP are estimated from the inhibition constants IC50 via the formula ΔG = RTln(IC50). The computed error of ΔGLIE is the s.e. of the mean over two independent trajectories.

Figure 5.

Comparison of binding free energy between calculation based on LIE method and experiment for 10 AChE–ligand complexes.

Comparison of binding free energy between calculation based on linear interaction energy (LIE) method and experiment for 10 acetycholinesterase–ligand complexes.

4. Conclusion

We have applied ML and MD simulations to search for potential AChE inhibitors. Our combined approach of using an ML model and multi-step similarity search of the PubChem library has substantially enriched the set of compounds with strong predicted binding free energy to AChE. The best compound, CID54414454, that resulted from the search was further investigated using MD simulations to gain molecular insights into the binding process. Important residues such as TYR124, GLU202, PHE295 and TYR337 were suggested to play an important role in binding of CID54414454 to AChE. Our LIE binding free energy calculations for 10 experimentally available AChE–ligand complexes showed good correlation with experiment. The LIE calculation for CID54414454 suggested that the compound would be a promising inhibitor for AChE and could serve as the basis for further drug development targeting AChE.

Contributor Information

Quynh Mai Thai, Email: quynhmai150399@gmail.com.

Minh Quan Pham, Email: pham-minh.quan@inpc.vast.vn; pham-minh.quan@inpc.vast.vn.

Phuong-Thao Tran, Email: thaotp119@gmail.com.

Trung Hai Nguyen, Email: nguyentrunghai@tdtu.edu.vn.

Son Tung Ngo, Email: sontungngo84@gmail.com; ngosontung@tdtu.edu.vn.

Ethics

This work did not require ethical approval from a human subject or animal welfare committee.

Data accessibility

Relevant data necessary to reproduce all results in the paper are provided within the main text and in the online supplementary material that accompanies this article [62].

Data and relevant code for this research work are stored in GitHub at [63] and have been archived within the Zenodo repository [64].

Declaration of AI use

We have not used AI-assisted technologies in creating this article.

Authors’ contributions

Q.M.T.: data curation, formal analysis, investigation, methodology, visualization, writing—original draft; M.Q.P.: data curation, validation, visualization; P.-T.T.: data curation, methodology, visualization; T.H.N.: conceptualization, formal analysis, investigation, methodology, software, validation, writing—original draft; S.T.N.: conceptualization, formal analysis, inevstigation, project administration, writing—original draft.

All authors gave final approval for publication and agreed to be held accountable for the work performed herein.

Conflict of interest declaration

We declare we have no competing interests.

Funding

This research was funded by the Ho Chi Minh City Foundation for Science and Technology Development under grant number 115/QĐ-SKHCN.

References

  • 1. Querfurth HW, LaFerla FM. 2010. Alzheimer’s disease. N. Engl. J. Med. 362 , 329–344. ( 10.1056/NEJMra0909142) [DOI] [PubMed] [Google Scholar]
  • 2. Selkoe DJ, Hardy J. 2016. The amyloid hypothesis of Alzheimer’s disease at 25 years. EMBO Mol. Med. 8 , 595–608. ( 10.15252/emmm.201606210) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Alzheimer’s Association. 2018. Alzheimer’s disease facts and figures. Alzheimers Dement. 14 , 367–429. ( 10.1016/j.jalz.2018.02.001) [DOI] [Google Scholar]
  • 4. Ngo ST, Truong DT, Tam NM, Nguyen MT. 2017. EGCG inhibits the oligomerization of amyloid beta (16-22) hexamer: theoretical studies. J. Mol. Graph. Model. 76 , 1–10. ( 10.1016/j.jmgm.2017.06.018) [DOI] [PubMed] [Google Scholar]
  • 5. Dhouafli Z, Cuanalo-Contreras K, Hayouni EA, Mays CE, Soto C, Moreno-Gonzalez I. 2018. Inhibition of protein misfolding and aggregation by natural phenolic compounds. Cell. Mol. Life Sci. 75 , 3521–3538. ( 10.1007/s00018-018-2872-2) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Alghazwi M, Smid S, Musgrave I, Zhang W. 2019. In vitro studies of the neuroprotective activities of astaxanthin and fucoxanthin against amyloid beta (Aβ1-42) toxicity and aggregation. Neurochem. Int. 124 , 215–224. ( 10.1016/j.neuint.2019.01.010) [DOI] [PubMed] [Google Scholar]
  • 7. Tran PT, Hoang VH, Lee J, Hien TTT, Tung NT, Ngo ST. 2019. In vitro and in silico determination of glutaminyl cyclase inhibitors. RSC Adv. 9 , 29619–29627. ( 10.1039/C9RA05763C) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Armiento V, Spanopoulou A, Kapurniotu A. 2020. Peptide-based molecular strategies to interfere with protein misfolding, aggregation, and cell degeneration. Angew. Chem. Int. Edn Engl. 59 , 3372–3384. ( 10.1002/anie.201906908) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Nasica-Labouze J, et al. 2015. Amyloid β protein and Alzheimer’s disease: when computer simulations complement experimental studies. Chem. Rev. 115 , 3518–3563. ( 10.1021/cr500638n) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Quinn DM. 1987. Acetylcholinesterase: enzyme structure, reaction dynamics, and virtual transition states. Chem. Rev. 87 , 955–979. ( 10.1021/cr00081a005) [DOI] [Google Scholar]
  • 11. Drachman DA, Leavitt J. 1974. Human memory and the cholinergic system. A relationship to aging? Arch. Neurol. 30 , 113–121. ( 10.1001/archneur.1974.00490320001001) [DOI] [PubMed] [Google Scholar]
  • 12. Kasa P, Papp H, Kasa P, Torok I. 2000. Donepezil dose-dependently inhibits acetylcholinesterase activity in various areas and in the presynaptic cholinergic and the postsynaptic cholinoceptive enzyme-positive structures in the human and rat brain. Neuroscience 101 , 89–100. ( 10.1016/s0306-4522(00)00335-3) [DOI] [PubMed] [Google Scholar]
  • 13. Sudhapriya N, Manikandan A, Kumar MR, Perumal PT. 2019. Cu-mediated synthesis of differentially substituted diazepines as AChE inhibitors; validation through molecular docking and Lipinski’s filter to develop novel anti-neurodegenerative drugs. Bioorg. Med. Chem. Lett. 29 , 1308–1312. ( 10.1016/j.bmcl.2019.04.002) [DOI] [PubMed] [Google Scholar]
  • 14. Sippl W, Contreras JM, Parrot I, Rival YM, Wermuth CG. 2001. Structure-based 3D QSAR and design of novel acetylcholinesterase inhibitors. J. Comput. Aided Mol. Des. 15 , 395–410. ( 10.1023/a:1011150215288) [DOI] [PubMed] [Google Scholar]
  • 15. Ul-Haq Z, Khan W, Kalsoom S, Ansari FL. 2010. In silico modeling of the specific inhibitory potential of thiophene-2,3-dihydro-1,5-benzothiazepine against BChE in the formation of β-amyloid plaques associated with Alzheimer’s disease. Theor. Biol. Med. Model. 7 , 22. ( 10.1186/1742-4682-7-22) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Bourne Y, Kolb HC, Radić Z, Sharpless KB, Taylor P, Marchot P. 2004. Freeze-frame inhibitor captures acetylcholinesterase in a unique conformation. Proc. Natl Acad. Sci. USA 101 , 1449–1454. ( 10.1073/pnas.0308206100) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Jiang CS, Ge YX, Cheng ZQ, Song JL, Wang YY, Zhu K, Zhang H. 2019. Discovery of new multifunctional selective acetylcholinesterase inhibitors: structure-based virtual screening and biological evaluation. J. Comput. Aided Mol. Des. 33 , 521–530. ( 10.1007/s10822-019-00202-2) [DOI] [PubMed] [Google Scholar]
  • 18. Turkan F, Cetin A, Taslimi P, Karaman M, Gulçin İ. 2019. Synthesis, biological evaluation and molecular docking of novel pyrazole derivatives as potent carbonic anhydrase and acetylcholinesterase inhibitors. Bioorg. Chem. 86 , 420–427. ( 10.1016/j.bioorg.2019.02.013) [DOI] [PubMed] [Google Scholar]
  • 19. Patel D, Patel JS, Ytreberg FM. 2021. Implementing and assessing an alchemical method for calculating protein–protein binding free energy. J. Chem. Theory Comput. 17 , 2457–2464. ( 10.1021/acs.jctc.0c01045) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Bonatto V, Shamim A, Rocho FDR, Leitão A, Luque FJ, Lameira J, Montanari CA. 2021. Predicting the relative binding affinity for reversible covalent inhibitors by free energy perturbation calculations. J. Chem. Inform. Model. 61 , 4733–4744. ( 10.1021/acs.jcim.1c00515) [DOI] [PubMed] [Google Scholar]
  • 21. Suh D, Jo S, Jiang W, Chipot C, Roux B. 2019. String method for protein–protein binding free-energy calculations. J. Chem. Theory Comput. 15 , 5829–5844. ( 10.1021/acs.jctc.9b00499) [DOI] [PubMed] [Google Scholar]
  • 22. Ryde U, Söderhjelm P. 2016. Ligand-binding affinity estimates supported by quantum-mechanical methods. Chem. Rev. 116 , 5520–5566. ( 10.1021/acs.chemrev.5b00630) [DOI] [PubMed] [Google Scholar]
  • 23. Subramanian G, Ramsundar B, Pande V, Denny RA. 2016. Computational modeling of β-secretase 1 (BACE-1) inhibitors using ligand based approaches. J. Chem. Inform. Model. 56 , 1936–1949. ( 10.1021/acs.jcim.6b00290) [DOI] [PubMed] [Google Scholar]
  • 24. Nguyen TH, Tam NM, Tuan MV, Zhan P, Vu VV, Quang DT, Ngo ST. 2023. Searching for potential inhibitors of SARS-COV-2 main protease using supervised learning and perturbation calculations. Chem. Phys. 564 , 111709. ( 10.1016/j.chemphys.2022.111709) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Nguyen TH, Tran PT, Pham NQA, Hoang VH, Hiep DM, Ngo ST. 2022. Identifying possible AChE inhibitors from drug-like molecules via machine learning and experimental studies. ACS Omega 7 , 20673–20682. ( 10.1021/acsomega.2c00908) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Gehlhaar DK, Verkhivker G, Rejto PA, Fogel DB, Fogel LJ, Freer ST. 1995. Docking conformationally flexible small molecules into a protein binding site through evolutionary programming. In Evolutionary programming IV: proceedings of the fourth annual conference on evolutionary programming (eds McDonnell JR, Reynolds RG, Fogel DB), pp. 615–627. Cambridge, MA: MIT Press. ( 10.7551/mitpress/2887.003.0053) [DOI] [Google Scholar]
  • 27. Zwanzig RW. 1954. High-temperature equation of state by a perturbation method. I. Nonpolar gases. J. Chem. Phys. 22 , 1420–1426. ( 10.1063/1.1740409) [DOI] [Google Scholar]
  • 28. Jespers W, Åqvist J, Gutiérrez-de-Terán H. 2021. Free energy calculations for protein–ligand binding prediction. In Protein–ligand interactions and drug design (ed. Ballante F), pp. 203–226. New York, NY: Springer. ( 10.1007/978-1-0716-1209-5_12) [DOI] [PubMed] [Google Scholar]
  • 29. Ngo ST, Hung HM, Nguyen MT. 2016. Fast and accurate determination of the relative binding affinities of small compounds to HIV-1 protease using non-equilibrium work. J. Comput. Chem. 37 , 2734–2742. ( 10.1002/jcc.24502) [DOI] [PubMed] [Google Scholar]
  • 30. Åqvist J, Medina C, Samuelsson JE. 1994. A new method for predicting binding affinity in computer-aided drug design. Protein Eng. Design Select. 7 , 385–391. ( 10.1093/protein/7.3.385) [DOI] [PubMed] [Google Scholar]
  • 31. Jones-Hertzog DK, Jorgensen WL. 1997. Binding affinities for sulfonamide inhibitors with human thrombin using Monte Carlo simulations with a linear response method. J. Med. Chem. 40 , 1539–1549. ( 10.1021/jm960684e) [DOI] [PubMed] [Google Scholar]
  • 32. Lamberti MJ, Wilkinson M, Donzanti BA, Wohlhieter GE, Parikh S, Wilkins RG, Getz K. 2019. A study on the application and use of artificial intelligence to support drug development. Clin. Ther. 41 , 1414–1426. ( 10.1016/j.clinthera.2019.05.018) [DOI] [PubMed] [Google Scholar]
  • 33. Ghislat G, Rahman T, Ballester PJ. 2021. Recent progress on the prospective application of machine learning to structure-based virtual screening. Curr. Opin. Chem. Biol. 65 , 28–34. ( 10.1016/j.cbpa.2021.04.009) [DOI] [PubMed] [Google Scholar]
  • 34. Zhu H. 2020. Big data and artificial intelligence modeling for drug discovery. Annu. Rev. Pharmacol. Toxicol. 60 , 573–589. ( 10.1146/annurev-pharmtox-010919-023324) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Chen JQ, Chen HY, Dai WJ, Lv QJ, Chen CYC. 2019. Artificial intelligence approach to find lead compounds for treating tumors. J. Phys. Chem. Lett. 10 , 4382–4400. ( 10.1021/acs.jpclett.9b01426) [DOI] [PubMed] [Google Scholar]
  • 36. Gao K, Nguyen DD, Chen J, Wang R, Wei GW. 2020. Repositioning of 8565 existing drugs for COVID-19. J. Phys. Chem. Lett. 11 , 5373–5382. ( 10.1021/acs.jpclett.0c01579) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Thai QM, Pham TNH, Hiep DM, Pham MQ, Tran PT, Nguyen TH, Ngo ST. 2022. Searching for AChE inhibitors from natural compounds by using machine learning and atomistic simulations. J. Mol. Graph. Model. 115 , 108230. ( 10.1016/j.jmgm.2022.108230) [DOI] [PubMed] [Google Scholar]
  • 38. NCI Diversity Set VII. See https://dtp.cancer.gov/organization/dscb/obtaining/available_plates.htm (accessed 24 May 2023).
  • 39. Bester SM, Guelta MA, Cheung J, Winemiller MD, Bae SY, Myslinski J, Pegan SD, Height JJ. 2018. Structural insights of stereospecific inhibition of human acetylcholinesterase by VX and subsequent reactivation by HI-6. Chem. Res. Toxicol. 31 , 1405–1417. ( 10.1021/acs.chemrestox.8b00294) [DOI] [PubMed] [Google Scholar]
  • 40. Cheung J, Gary EN, Shiomi K, Rosenberry TL. 2013. Structures of human acetylcholinesterase bound to dihydrotanshinone I and territrem B show peripheral site flexibility. ACS Med. Chem. Lett. 4 , 1091–1096. ( 10.1021/ml400304w) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Cheung J, Rudolph MJ, Burshteyn F, Cassidy MS, Gary EN, Love J, Franklin MC, Height JJ. 2012. Structures of human acetylcholinesterase in complex with pharmacologically important ligands. J. Med. Chem. 55 , 10282–10286. ( 10.1021/jm300871x) [DOI] [PubMed] [Google Scholar]
  • 42. Gerlits O, Ho KY, Cheng X, Blumenthal D, Taylor P, Kovalevsky A, Radić Z. 2019. A new crystal form of human acetylcholinesterase for exploratory room-temperature crystallography studies. Chem. Biol. Interact. 309 , 108698. ( 10.1016/j.cbi.2019.06.011) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Zhou Y, et al. 2021. Kinetics-driven drug design strategy for next-generation acetylcholinesterase inhibitors to clinical candidate. J. Med. Chem. 64 , 1844–1855. ( 10.1021/acs.jmedchem.0c01863) [DOI] [PubMed] [Google Scholar]
  • 44. Gilson MK, Liu T, Baitaluk M, Nicola G, Hwang L, Chong J. 2016. BindingDB in 2015: a public database for medicinal chemistry, computational chemistry and systems pharmacology. Nucleic Acids Res. 44 , D1045–D1053. ( 10.1093/nar/gkv1072) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Liu T, Lin Y, Wen X, Jorissen RN, Gilson MK. 2007. BindingDB: a web-accessible database of experimentally determined protein-ligand binding affinities. Nucleic Acids Res. 35 , D198–D201. ( 10.1093/nar/gkl999) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Chen T, Guestrin C. XGBoost: a scalable tree boosting system. In KDD ’16: Proc. 22nd ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining, pp. 785–794. ( 10.1145/2939672.2939785) [DOI] [Google Scholar]
  • 47. Duvenaud DK, Maclaurin D, Iparraguirre J, Bombarell R, Hirzel T, Aspuru-Guzik Aet al. 2015. Convolutional networks on graphs for learning molecular fingerprints (eds Cortes C, Lawrence N, Lee D, Sugiyama M, Garnett R). In Proc. 28th Int. Conf. Neural Information Processing Systems, pp. 2224–2232. Cambridge, MA: MIT Press. [Google Scholar]
  • 48. Ramsundar B, Eastman P, Walters P, Pande V. 2019. Deep learning for the life sciences: applying deep learning to genomics, microscopy, drug discovery and more. Sebastopol, CA: O’Reilly Media. [Google Scholar]
  • 49. Kim S, et al. 2016. PubChem substance and compound databases. Nucleic Acids Res. 44 , D1202–D1213. ( 10.1093/nar/gkv951) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Trott O, Olson AJ. 2010. AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J. Comput. Chem. 31 , 455–461. ( 10.1002/jcc.21334) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Pham TNH, et al. 2022. Improving ligand-ranking of AutoDock Vina by changing the empirical parameters. J. Comput. Chem. 43 , 160–169. ( 10.1002/jcc.26779) [DOI] [PubMed] [Google Scholar]
  • 52. Abraham MJ, Murtola T, Schulz R, Páll S, Smith JC, Hess B, Lindahl E. 2015. GROMACS: high performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX 1–2 , 19–25. ( 10.1016/j.softx.2015.06.001) [DOI] [Google Scholar]
  • 53. Aliev AE, Kulke M, Khaneja HS, Chudasama V, Sheppard TD, Lanigan RM. 2014. Motional timescale predictions by molecular dynamics simulations: case study using proline and hydroxyproline sidechain dynamics. Proteins 82 , 195–215. ( 10.1002/prot.24350) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. 1983. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 79 , 926–935. ( 10.1063/1.445869) [DOI] [Google Scholar]
  • 55. Wang J, Wolf RM, Caldwell JW, Kollman PA, Case DA. 2004. Development and testing of a general Amber force field. J. Comput. Chem. 25 , 1157–1174. ( 10.1002/jcc.20035) [DOI] [PubMed] [Google Scholar]
  • 56. Case D, et al. 2017. Amber18. San Francisco, CA: University of California. [Google Scholar]
  • 57. Sousa da Silva AW, Vranken WF. 2012. ACPYPE - antechamber Python parser interface. BMC Res. Notes 5 , 367. ( 10.1186/1756-0500-5-367) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58. Chen J, Wang X, Pang L, Zhang JZH, Zhu T. 2019. Effect of mutations on binding of ligands to guanine riboswitch probed by free energy perturbation and molecular dynamics simulations. Nucleic Acids Res. 47 , 6618–6631. ( 10.1093/nar/gkz499) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. Chen J, Wang J, Zhu W. 2017. Mutation L1196M-induced conformational changes and the drug resistant mechanism of anaplastic lymphoma kinase studied by free energy perturbation and umbrella sampling. Phys. Chem. Chem. Phys. 19 , 30239–30248. ( 10.1039/C7CP05418A) [DOI] [PubMed] [Google Scholar]
  • 60. Ngo ST, Tam NM, Pham MQ, Nguyen TH. 2021. Benchmark of popular free energy approaches revealing the inhibitors binding to SARS-CoV-2 Mpro. J. Chem. Inform. Model. 61 , 2302–2312. ( 10.1021/acs.jcim.1c00159) [DOI] [PubMed] [Google Scholar]
  • 61. Lee S, Lee I, Kim H, Chang G, Chung J, No K. 2003. The preADME approach: Web-based program for rapid prediction of physico-chemical, drug absorption and drug-like properties. In EuroQSAR 2002 designing drugs and crop protectants: processes, problems and solutions (ed. Ford MG), pp. 418–420. Massachusetts, USA: Blackwell Publishing. [Google Scholar]
  • 62. Thai QM, Minh Quan P, Tran PT, Nguyen TH, Ngo ST. 2024. Data from: Searching for potential AChE inhibitors: a combined approach of multi-step similarity search, machine learning and MD simulations. Figshare. ( 10.6084/m9.figshare.c.7461924) [DOI] [PMC free article] [PubMed]
  • 63. Nguyen TH. 2024. AchE_ML_2306. GitHub. https://github.com/nguyentrunghai/AchE_ML_2306/releases/tag/V1.0
  • 64. Nguyen TH. 2024. nguyentrunghai/AchE_ML_2306: ML training code for AChE. Zenodo. ( 10.5281/zenodo.11522758) [DOI]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Relevant data necessary to reproduce all results in the paper are provided within the main text and in the online supplementary material that accompanies this article [62].

Data and relevant code for this research work are stored in GitHub at [63] and have been archived within the Zenodo repository [64].


Articles from Royal Society Open Science are provided here courtesy of The Royal Society

RESOURCES