Abstract
Pharmacophore modeling is an important step in computer-aided drug design for identifying interaction points between the receptor and ligand complex. Pharmacophore-based models can be used for de novo drug design, lead identification, and optimization in virtual screening as well as for multi-target drug design. There is a need to develop a user-friendly interface to filter the pharmacophore points resulting from multiple ligand conformations. Here, we present ELIXIR-A, a Python-based pharmacophore refinement tool, to help refine the pharmacophores between multiple ligands from multiple receptors. Furthermore, the output can be easily used in virtual pharmacophore-based screening platforms, thereby contributing to the development of drug discovery.
Introduction
Drug discovery and development is a complex, time-consuming, and expensive process. Computer-aided drug design approaches have the potential to accelerate this process cost-effectively when compared to the laborious traditional compound screening methods. Existing computational techniques include quantitative structure–activity relationship (QSAR), molecular docking-based high-throughput, and pharmacophore-based virtual compound screening. In the usual virtual drug discovery process, molecular docking, pharmacophore models, and 3D QSAR models are often used in combination.1−3 Existing QSAR or docking approaches do not have the capability of pharmacophore refinement, and thus, there is a need for a technique for refining pharmacophores to identify the best set of pharmacophores for the modern drug discovery process.
The pharmacophore concept was introduced by Ehrlich in early 1909.4 A pharmacophore is an ensemble of steric and electronic features that is necessary to ensure optimal supramolecular interactions with a specific biological target structure and to trigger (or to block) its biological response.5 Pharmacophores describe specific ligand–receptor interactions as a generalized pattern. The first 3D pharmacophore screening software was developed by Gund in 1977.6 Pharmacophore modeling approaches can be broadly classified as ligand-based or receptor-based.7 Software programs such as LigandScout,8 DISCO,9 GASP,10 GALAHAD,11 HipHop,12 HypoGen,13 MOE (Chemical Computing Group, https://www.chemcomp.com), PharmaGist,14 MolAlign,15 and PHASE16 have been developed to construct ligand-based pharmacophore models. Their performance differs based on the efficiency of the algorithm to handle ligand flexibility and alignment and requires a set of pharmacologically active ligands.7 On the contrary, receptor-based methods, such as GBPM,17 LigandScout,8 Pharmit,18−20 PyRod,21 and ZINCPharmer,22 analyze the receptor–ligand complex structures to isolate essential pharmacophoric features.23 A 2D pharmacophore fingerprint is a form of a binary code that contains pharmacophore properties.24−27 These pharmacophore fingerprints containing molecular fragments have been applied with multiple artificial intelligence-related models such as PTML,28,29 Pharmacoprint,27 and Pharm-IF.30
In situations where ligands are not known for the target receptor, methods such as CavityPlus,31 GRID,32 HS-Pharm,33 Pocket version 4.0,19,20 Shaper2,34 GRAIL,35 and SuperStar36 have been developed to identify hotspots (highly probable ligand binding sites) on the receptor. Druggability simulations (molecular dynamics simulations conducted in the presence of drug-like organic molecules) assess ligand hotspots while maintaining receptor flexibility. Tools like SILCS-Pharm from the Mackerell lab37,38 and Pharmmaker from the Bahar lab39 have been developed to extract pharmacophore features from druggability simulations. However, there is a need for a systematic tool that analyzes and compares multiple pharmacophore models irrespective of their method of construction.
Here, we present the Enhanced Ligand Exploration and Interaction Recognition Algorithm (ELIXIR-A), an open-source, user-friendly application that serves the purpose of both pharmacophore modeling and pharmacophore mapping. ELIXIR-A is a Python-based application that can import pharmacophore models created in Visual Molecular Dynamics (VMD),40 as well as manual coordinate input from any other platform. In addition, the output files from ELIXIR-A can be easily visualized in VMD41 and can be exported to pharmacophore-based virtual screening platforms like Pharmit.18
Methodology
ELIXIR-A was developed to unify and simplify the interaction data from multiple pharmacophore models. This tool can accept two sets of pharmacophore models created directly by a pharmacophore generating platform. For example, the platform can accept ligand-based pharmacophore models created directly by Pharmit as the input. ELIXIR-A is developed on the modern 3D data processing library Open3D42 to include the algorithms of fast global registration43 and colored point cloud registration (colored ICP)44,45 (Figure 1). ELIXIR-A initially prepares each pharmacophore point represented as a point cloud with its radius, and the pharmacophores are color coded into different types. Then, ELIXIR-A aligns two structures and calculates the initial transformation matrix based on their geometric properties only. Finally, the two pharmacophore clouds are superpositioned with their geometric and pharmacophore type information, and the overlaid pharmacophore point is refined with matched points.
Figure 1.
Flow diagram of the computational procedures of ELIXIR-A.
Pharmacophore Point Clouds
As a shape similarity approach, ELIXIR-A takes pharmacophore points and applies each of them to a three-dimensional volume with a point cloud. Each pharmacophore type consists of 1000 uniformly distributed points in a sphere. The radius of the pharmacophore cloud is defined in the occupancy category of the pdb file. The pharmacophore type is marked with three characters in the residue name category (Table S1). Throughout the complete analysis, the Biopandas framework is used to handle the point cloud data.46
Global Registration with RANSAC Iteration
The first alignment is the global alignment with features. Two pharmacophore point clouds are treated with the same process to calculate a vector of descriptors called the Fast Point Feature Histogram (FPFH).47 FPFH is a 33-dimensional vector that describes the geometric characteristics and principles for a point. The FPFH vector can search points with similar features.47 As ELIXIR-A attempts to align pharmacophore sites from multiple ligands or receptors, there may be distinct differences between some areas of the clouds considered as “noise”. It is known from experience that substantial noise levels can affect the alignment of point clouds with FPFH matching. To enhance the search of the FPFH matching algorithm, the random sample consensus (RANSAC) algorithm, proposed by Fischler and Bolles,48 is included in the registration process. RANSAC can estimate parameters of a mathematical model from a set of observed data with “noise”. The global registration process calculates a preliminary rigid rotation and transformation matrix with their geometric characteristics. A fitness score will be given to evaluate this initial transformation.
Colored ICP
The second alignment is the local ICP alignment with pharmacophore features. The pharmacophore alignment utilizing colored features is inspired by the alignment of the red, green, blue model (RGB) image and its corresponding depth image by Park et al.44 In general, the colored ICP algorithm extends the color vector as the extra dimension of the point coordinate data. The standard ICP algorithm will repeat the transformation of matrices to find the minimum square distance between two clouds. In comparison, the colored ICP algorithm will find the best fit of the extended paired matrices with the pharmacophore “color” information for each point. ELIXIR-A uses a slightly different number of iterations, fitness values, and root-mean-square deviation (RMSD) values from the default to optimize pharmacophore alignment. ELIXIR-A also supports the user adjustment of these critical parameters to obtain a more appropriate fit for the transformation.
Pharmacophore Refinement
After applying preliminary global transformation and local colored ICP transformation from the point cloud superposition to the first pharmacophore input, two pharmacophore points are partially overlapped. It is necessary to remove the nonoverlapped pharmacophore using the refinement algorithm. The algorithm first creates two 3′ N matrices A and B for two groups of pharmacophore datasets. A is defined as the transforming source group, and B is defined as the fixed target. The Euclidean distance for each point in group A to find the corresponding point in group B is calculated. Then, the Euclidean distance for each point in group B to find the corresponding point in group A is evaluated. In each group, if there is no corresponding point within the threshold distance, this point will be considered irrelevant and removed from the group. According to the definition of the pharmacophore, it is possible for different types of pharmacophores to exist in superposition; therefore, this refinement only calculates the geometric properties and does not consider the color parameters of the pharmacophore. The alignment of the two refined pharmacophore points will be visualized as the van der Waals surface (vdW) in VMD. These files will be saved in the same path as the two input pharmacophore files
Fitness Score Function
ELIXIR-A uses a fitness value to evaluate the effectiveness of the transformation for both alignment algorithms. Fitness is calculated using the formula below to find the volume ratio of overlap. The higher the fitness value, the better the alignment performance
Benchmark Compound Validation
To validate the efficiency of the pharmacophore refinement algorithm, a molecular dataset consisting of active inhibitors and inactive decoys targeting specific protein receptors was designed to reduce the testing bias. All the pharmacophore models were screened on the Pharmit platform, and the Directory of Useful Decoys (DUD-e) dataset was used as the small molecule library.49 The benchmark receptors were human immunodeficiency virus type 1 protease (HIVPR), acetylcholinesterase (ACES), and cyclin-dependent kinase 2 (CDK2). These proteins are in different protein families (protease, esterase, and kinase). The feature “shared pharmacophores” in the LigandScout 4.4 Demo version was used as the comparison pharmacophore refinement algorithm in this study. Schrödinger Phase (release 2020–1) was used as another comparison software. The E-Pharmacophore plugin prepared the pharmacophore model under the receptor–ligand complex mode. The hypothesis alignment plugin was used for pharmacophore alignment with 2.0 Å feature matching tolerance and three minimum number of matching features. The overlapped pharmacophore was selected by manual inspection.
The molecular weight (MW), octanol–water partition coefficient (logP), and the total polar surface area (TPSA) were used to compare physicochemical properties between the screened active and decoy molecules. These physicochemical properties were calculated by open babel v2.4.0.50 The properties were compared using the two-tailed Student’s t test.51 The equality of two variances was given by the F-test.52 The significance level was set to 0.05 (α = 0.05).
Enrichment Factor
Enrichment factors (EF) of the pharmacophore-based virtual screening represent the ability of the pharmacophore model to find true positive active inhibitors in the database in contrast to random selection. EF can be calculated using the ratio of the active inhibitors in the screened subgroup over the ratio of active inhibitors in the whole database. The higher the EF value, the better the pharmacology model performance
Results
Configuration
ELIXIR-A is designed as a GUI plugin based on VMD. The script itself is pre-configured. The first step is to install VMD and then edit the startup files in the VMD folder. The physical system requirements for ELXIR-A will not exceed the requirements of the VMD software itself. Once the setup files have been updated, the ELXIR-A folder must be placed in the VMD TCL plugin directory. The ELIXIR-A option can be found under the top menu. A Python environment is required to run the script for ELIXIR-A. The package was tested in cPython 3.8.8 with the following packages (Numpy==1.20.1; Open3D==0.13.0; Biopandas==0.2.9.dev0; and Matplotlib==3.3.4).42,46,53,54
Execution of Pharmacophore Alignment Jobs
ELIXIR-A provides two ways to introduce the pharmacophore information to the GUI (Figure 2). The user can choose to pre-edit the text file with the “.pdb” extension via “Import pdb file:”, or directly use the Pharmit saved session file “.json”, which is the same as the pre-edited pdb file. The guidelines for the text files are described in Table S2.
Figure 2.
ELIXIR-A interface. The target receptor can be loaded to visualize the pharmacophore–receptor interactions. According to different data sets, the default parameters can be optimized to improve the alignment performance.
Visualization of Results
The first pharmacophore cluster is recommended as the first primary site. The second cluster is interactively rotated and transformed to retrieve the conformation that best fits the primary site. Once all the data are prepared in the GUI, the “submit” button can be used to analyze the pharmacophore points’ alignment. This demonstration used ELIXIR-A to find refined structure-based pharmacophores from HIVPR (Figure 3). These models were prepared in Pharmit and visualized in Maestro.55 The receptor HIVPR is an essential enzyme of HIV replication. The active site of HIVPR was at the core of the dimerization interface (Figure 4). The pharmacophores of the ligand(s) were extracted from Pharmit Engine (Figure 3).56 In the VMD OpenGL Display window, two groups of vdW spheres are represented as pharmacophores after the ELXIR-A alignment. The transformed pharmacophores were presented with solid spheres. The fixed pharmacophores were presented as transparent spheres. The ligand regarding the fixed pharmacophores is also shown in Figure 4 left. The initial transformation has 15.39% of point cloud overlap, and the colored ICP improved the overlay by up to 65.66%. After refinement, two separate pharmacophore spheres in each group did not overlap and were therefore removed. A total of eight pharmacophores shared the superposition space, and this result can be used for further drug discovery. In addition to the HIVPR, the use of similarity refinement for the analysis of two other types of proteins is also applied in Table 1.
Figure 3.
Pharmacological description of the two HIVPR in the crystal structure in complex with inhibitors.
Figure 4.
Structure-based pharmacophore refinement between HIVPR-inhibitor complexes utilizing ELIXIR-A. The transformed pharmacophores are presented with solid spheres. The fixed pharmacophores are presented using transparent spheres.
Table 1. ELIXIR-A Aligned Pharmacophore Cluster Selection.
| best fitness
score |
||||||||
|---|---|---|---|---|---|---|---|---|
| PDB IDa | target | ligand IDb | initial points | refined points | RMSD (Å) | global registration | colored ICP | ref |
| 1YT9 | HIVPR | OIS | 11 | 8 | 0.267 | 15.39% | 65.66% | (57) |
| 2FDE | 385 | 9 | 8 | (58) | ||||
| 1Q84 | ACES | TZ4 | 14 | 5 | 0.256 | 14.65% | 42.15% | (59) |
| 2CEK | N8T | 7 | 3 | (60) | ||||
| 6INL | CDK2 | AJR | 7 | 4 | 0.287 | 30.612% | 31.82% | (61) |
| 4EOR | 4SP | 6 | 4 | (62) | ||||
The protein complexes were downloaded from the Protein Data Bank (https://www.rcsb.org).
These ligands are the local substrates corresponding to the protein complexes. Detailed information on the ligands is available at https://www.rcsb.org/ligand/(ligand ID).
Benchmark Compound Validation
A benchmark compound study was used to validate the effectiveness of the module in generating enriched screened molecules using refined pharmacophores. For comparison, the same analysis was carried out for the benchmark compounds using the two software packages that possess pharmacophore alignment modules, LigandScout and Phase, which were accessible to the researchers. We used three targets for benchmarking after considering 11 data sets that are freely available for validation in the Zinc15 database, which also would give positive results for all three screening platforms, i.e., ELIXIR-A, Phase, and LigandScout. The usage of three targets is in line with other works for testing screening algorithms.38,63−65
ELIXIR-A refined pharmacophores were extracted from the overlaid points (Figure 4, right). Five features with three different types were built from 20 initial features. The HIVPR pharmacophore model prepared directly by Pharmit could not generate any screened results because the initial model contained too many pharmacophore features, confirming the need for a tool to refine pharmacophores for effective virtual screening. To compare the performance of ELIXIR-A in comparison to other software capable of pharmacophore screening, we ran the screening algorithms using ∼25,000 starting molecules for three distinct receptors.
For HIVPR, the ELIXIR-A algorithm was able to output 19 molecules, with three being active (3/536) with an enrichment factor of 10.69 (Table 2). LigandScout was able to retrieve ∼47% active compounds (253/536), while the enrichment factor was only 4.91. Phase found three aromatic ring features based on the alignment of two structure-based complexes. The enrichment factor of 0.45 suggested that this model did not match the true binding modes. HIVPR has four key hydrophobic features located in the drug pocket, which are important for binding.38,66 ELIXIR-A and LigandScout both found three of the four hydrophobic features for virtual screening (Figure 5). Thus, high enrichment factors were obtained using ELIXIR-A and LigandScout for the HIVPR target.
Table 2. Enrichment Analysis for Refined Pharmacophore Models.
| active
and inactive compounds in the dataset |
virtual screening resultsb |
|||||
|---|---|---|---|---|---|---|
| approach | target | active | decoy | active | decoy | EF |
| ELIXIR-A | HIVPR | 536 | 35,750 | 3 | 16 | 10.69 |
| LigandScout | 253 | 3236 | 4.91 | |||
| Phase | 36 | 5409 | 0.45 | |||
| ELIXIR-A | ACES | 408 | 26,250 | 201 | 3349 | 3.70 |
| LigandScout | 190 | 2913 | 4.00 | |||
| Phase | 167 | 8308 | 1.29 | |||
| ELIXIR-A | CDK2 | 335 | 27,850 | 189 | 1597 | 8.90 |
| LigandScouta | ||||||
| Phase | 33 | 345 | 7.34 | |||
LigandScout lacked sufficient independent matching of pharmacophore features within tolerances to generate alignment.
Some molecules in the dataset have multiple tautomer confirmations. The output will only include a unique active/decoy molecule with minimum RMSD from pharmacophore matching.
Figure 5.
Refined pharmacophore models of HIVPR generated by ELIXIR-A (A), LigandScout (B), and Phase (C).
For ACES, ELIXIR-A was able to retrieve ∼50% (201/408) active compounds with an enrichment factor of 3.70 (Table 2). LigandScout was able to retrieve ∼47% active molecules with an enrichment factor of 4.00. Phase retrieved ∼40% active molecules with an enrichment factor of 1.29 due to the high number of screened decoys (8038/26250). The ACES active binding pocket was long, narrow, and hydrophobic.60 Conserved aromatic residues were found at the peripheral site.67 All three algorithms in this study modeled one hydrophobic feature at the active site and two hydrophobic or aromatic features at the peripheral site (Figure 6). Comparatively, ELIXIR-A was able to match the highest number of active compounds in the dataset.
Figure 6.
Refined pharmacophore models of ACES generated by ELIXIR-A (A), LigandScout (B), and Phase (C).
For CDK2, ELIXIR-A was able to retrieve ∼55% (189/335) active compounds with an enrichment factor of 8.90 (Table 2). Phase retrieved ∼10% (33/335) active molecules with an enrichment factor of 7.34. Multiple CDK2 pharmacophores were reported with different pharmacophore features.68 Meanwhile, most of them include one hydrogen bond acceptor, one hydrogen bond donor, and two hydrophobic features. The ELIXIR-A model included all these common features and therefore matched half of the active compounds in the dataset (Figure 7). The Phase model recognizes the nonpolar features as the aromatic rings, which matched 10% of the active molecules. Here, ELIXIR-A was able to retrieve a much higher number of active compounds as compared to Phase, while LigandScout failed to retrieve any.
Figure 7.
Refined pharmacophore models of CDK2 generated by ELIXIR-A (A) and Phase (B).
This benchmark compound validation indicated that ELIXIR-A could be a valuable tool to refine pharmacophores for better enrichment during virtual screening.
The physicochemical properties of ligands screened by various pharmacophore refinement models were compared (Table S3). It is noted that at least one of the physicochemical properties (MW, TPSA, or logP) differed significantly between the active and decoy molecules in the seven models. In the ACES-ELIXIR-A model, MW, TPSA, and logP showed significant differences. In the CDK2-ELIXIR-A model, MW and logP showed a significant difference. These results suggested that ELIXIR-A was able to satisfactorily differentiate between active and decoy molecules in terms of key physicochemical aspects. It is noted that further physicochemical studies such as QSAR could be utilized to further enhance the ELIXIR-A pharmacophore refinement platform.
Discussion
Pharmacophores are a set of steric and electronic features that recognize optimal supramolecular interactions. Structure-based modeling using protein–ligand interactions and ligand-based modeling using common chemical features from a set of active/inactive ligands are the two common approaches for building 3D pharmacophore models.69,70 ELIXIR-A is not typical pharmacophore modeling software; rather, it is a pharmacophore refinement algorithm that uses sets of pharmacophores as input and aligns these pharmacophores in 3D space to identify any overlap. ELIXIR-A uses a computer vision-inspired ICP variant algorithm to align multiple pharmacophore models with similar geometric and physicochemical properties as point clouds into a refined point cloud. Although researchers have applied this ICP algorithm for drug discovery to match three-dimensional protein structures71 as well as the alignment of protein binding cavities,72,73 it has not been used for pharmacophore refinement. ELIXIR-A fills this need.
ELIXIR-A uses a geometry-based approach to find pharmacophore similarity between two protein–ligand binding pockets before calculating the binding energy between the receptor and small ligand molecules using colored ICP. ICP requires a good initial transformation to ensure that the point cloud converges to a minimum acceptable value.74 Also, the presence of outliers (nonuniform points) can affect the alignment in ICP. ELIXIR-A utilizes FPFH matching and RANSAC iterations to solve the global fitting and outlier problems. Another algorithm that can perform similar calculations is the Kabsch algorithm. The Kabsch algorithm is widely used in bioinformatics and can calculate the RMSD between two 3D protein or pharmacophore structures via rotation and translation.75,76 When the point correspondences are known, the Kabsch algorithm can be applied. Since point correspondences are not known, the Kabsch algorithm cannot be applied here and thus ICP was used. When using colored ICP, the whole cloud is paired to figure out the corresponding point, and the points in one group can be mapped with multiple corresponding points in the other group. It is also possible to find some important unpaired pharmacophore points when comparing the output results. Another algorithm that is commonly used for ligand-based pharmacophore generation is linear assignment for molecular dataset alignment (LAMDA), which finds globally optimal pairing between objects.70 LAMDA was not considered for ELIXIR-A due to its limitation in ligand-based pharmacophore generation. Genetic algorithms (GA) were used to calculate the initial transformation position before ICP transformation.77−80 For example, GA is used for pharmacophore generation via molecular docking-based data sets such as AutoDock Vina.81 Meanwhile, ELIXIR-A used FPFH matching with RANSAC iteration for global alignment. This is mainly because of the good compatibility between Global Registration and color ICP supported by Open3D. Although ELIXIR-A currently cannot use GA, when the molecular structures of the compared proteins are very different, such a powerful algorithm would be helpful to optimize refinement results.
For the initial pharmacophore analysis, ELIXIR-A and LigandScout used the same method to find the rigid superposition of two pharmacophore clusters, while ELIXIR-A used the ICP variant algorithm and LigandScout used the Hungarian algorithm. In general, the time complexity of the ICP algorithm in ELIXIR-A was O(N2), and the time complexity of the Hungarian algorithm ranged from O(N4) to O(N3). Thus, ELIXIR-A provided better time complexity with the potential to efficiently solve large pharmacophore alignment problems.
Conclusions
ELIXIR-A is a python-based VMD plugin used to help refine pharmacophore models in situations where a large number of pharmacophores are present from multiple models due to various ligand–receptor interactions and many conformations of a ligand on a receptor binding site. The ELIXIR-A GUI can help refine the pharmacophores generated from multiple ligands from multiple receptors using the ICP variant algorithm. The output from ELIXIR-A can be used in virtual pharmacophore-based platforms for compound screening.
Requirements and Availability
The ELIXIR-A package was developed in python3 for the data operations and tcl/tk for the user interface. The package was tested on the Ubuntu 20.04, macOS Big Sur, and Windows 10 systems. The python versions were 3.7.4 or later. The latest version of ELXIR-A is available for download on GitHub (https://github.com/sfernando-BAEN/ELIXIR-A/releases). VMD was downloaded from their official websites. A 64-bit version of VMD is recommended for this package. ELIXIR-A follows the Apache-2.0 license and is open-source on GitHub.
Acknowledgments
The authors gratefully appreciate the support from the Texas A&M High Performance Research Computing (HPRC) group.
Supporting Information Available
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acsomega.1c07144.
Additional pharmacophore definitions, a record of the ELIXIR-A input and output pdb files, selected physicochemical properties of the pharmacophore models, and pharmacophore output figures (PDF)
Author Contributions
H.W. developed the ELIXIR-A tool, analyzed the data, and wrote the manuscript. N.M. contributed to the discussion and wrote the manuscript. L.M.P. and S.F. reviewed the manuscript and supervised the overall research.
The authors declare no competing financial interest.
Supplementary Material
References
- Jana S.; Ganeshpurkar A.; Singh S. K. Multiple 3D-QSAR modeling, e-pharmacophore, molecular docking, and in vitro study to explore novel AChE inhibitors. RSC Adv. 2018, 8, 39477–39495. 10.1039/C8RA08198K. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lauria A.; Ippolito M.; Fazzari M.; Tutone M.; Di Blasi F.; Mingoia F.; Almerico A. M. IKK-β inhibitors: An analysis of drug–receptor interaction by using Molecular Docking and Pharmacophore 3D-QSAR approaches. J. Mol. Graphics Modell. 2010, 29, 72–81. 10.1016/j.jmgm.2010.04.008. [DOI] [PubMed] [Google Scholar]
- Liu G.; Wang W.; Wan Y.; Ju X.; Gu S. Application of 3D-QSAR, pharmacophore, and molecular docking in the molecular design of diarylpyrimidine derivatives as HIV-1 nonnucleoside reverse transcriptase inhibitors. Int. J. Mol. Sci. 2018, 19, 1436. 10.3390/ijms19051436. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ehrlich P. Über den jetzigen Stand der Chemotherapie. Ber. Dtsch. Chem. Ges. 1909, 42, 17–47. 10.1002/cber.19090420105. [DOI] [Google Scholar]
- Wermuth C.-G.; Ganellin C. R.; Lindberg P.; Mitscher L. A. Glossary of terms used in medicinal chemistry (IUPAC Recommendations 1997). Annu. Rep. Med. Chem. 1998, 33, 385–395. 10.1016/S0065-7743(08)61101-X. [DOI] [Google Scholar]
- Gund P.Three-dimensional pharmacophoric pattern searching. In Progress in molecular and subcellular biology; Springer, 1977; pp. 117–143. [Google Scholar]
- Yang S.-Y. Pharmacophore modeling and applications in drug discovery: challenges and recent advances. Drug Discovery Today 2010, 15, 444–450. 10.1016/j.drudis.2010.03.013. [DOI] [PubMed] [Google Scholar]
- Wolber G.; Langer T. LigandScout: 3-D pharmacophores derived from protein-bound ligands and their use as virtual screening filters. J. Chem. Inf. Model. 2005, 45, 160–169. 10.1021/ci049885e. [DOI] [PubMed] [Google Scholar]
- Martin Y. C.DISCO: what we did right and what we missed. Pharmacophore perception, development, and use in drug design ;International University Line; 2000,2, 49. [Google Scholar]
- Jones G.; Willet P.; Glen R.. GASP: genetic algorithm superimposition program. Pharmacophore perception, development, and use in drug design ;International University Line; 2000, 85–106. [Google Scholar]
- GALAHADTM is distributed by Tripos Inc., St. Louis S. H. R., , MO 63144, USA (http://www.tripos.com).
- Barnum D.; Greene J.; Smellie A.; Sprague P. Identification of common functional configurations among molecules. J. Chem. Inf. Comput. Sci. 1996, 36, 563–571. 10.1021/ci950273r. [DOI] [PubMed] [Google Scholar]
- Li H.; Sutter J.; Hoffmann R.. HypoGen: an automated system for generating 3D predictive pharmacophore models. Pharmacophore perception, development, and use in drug design ;International University Line; 2000,2, 171. [Google Scholar]
- Schneidman-Duhovny D.; Dror O.; Inbar Y.; Nussinov R.; Wolfson H. J. PharmaGist: a webserver for ligand-based pharmacophore detection. Nucleic Acids Res. 2008, 36, W223–W228. 10.1093/nar/gkn187. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brown B. P.; Mendenhall J.; Meiler J. BCL:: MolAlign: Three-Dimensional Small Molecule Alignment for Pharmacophore Mapping. J. Chem. Inf. Model. 2019, 59, 689–701. 10.1021/acs.jcim.9b00020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dixon S. L.; Smondyrev A. M.; Knoll E. H.; Rao S. N.; Shaw D. E.; Friesner R. A. PHASE: a new engine for pharmacophore perception, 3D QSAR model development, and 3D database screening: 1. Methodology and preliminary results. J. Comput.-Aided Mol. Des. 2006, 20, 647–671. 10.1007/s10822-006-9087-6. [DOI] [PubMed] [Google Scholar]
- Ortuso F.; Langer T.; Alcaro S. GBPM: GRID-based pharmacophore model: concept and application studies to protein–protein recognition. Bioinformatics 2006, 22, 1449–1455. 10.1093/bioinformatics/btl115. [DOI] [PubMed] [Google Scholar]
- Sunseri J.; Koes D. R. Pharmit: interactive exploration of chemical space. Nucleic Acids Res. 2016, 44, W442–W448. 10.1093/nar/gkw287. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen J.; Lai L. Pocket v. 2: further developments on receptor-based pharmacophore modeling. J. Chem. Inf. Model. 2006, 46, 2684–2691. 10.1021/ci600246s. [DOI] [PubMed] [Google Scholar]
- Chen J.; Ma X.; Yuan Y.; Pei J.; Lai L. Protein-protein interface analysis and hot spots identification for chemical ligand design. Curr. Pharm. Des. 2014, 20, 1192–1200. 10.2174/13816128113199990065. [DOI] [PubMed] [Google Scholar]
- Schaller D.; Pach S.; Wolber G. PyRod: Tracing water molecules in molecular dynamics simulations. J. Chem. Inf. Model. 2019, 59, 2818–2829. 10.1021/acs.jcim.9b00281. [DOI] [PubMed] [Google Scholar]
- Koes D. R.; Camacho C. J. ZINCPharmer: pharmacophore search of the ZINC database. Nucleic Acids Res. 2012, 40, W409–W414. 10.1093/nar/gks378. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schaller D.; Šribar D.; Noonan T.; Deng L.; Nguyen T. N.; Pach S.; Machalz D.; Bermudez M.; Wolber G. Next generation 3D pharmacophore modeling. Wiley Interdiscip. Rev.: Comput. Mol. Sci. 2020, 10, e1468 10.1002/wcms.1468. [DOI] [Google Scholar]
- McGregor M. J.; Muskal S. M. Pharmacophore fingerprinting. 1. Application to QSAR and focused library design. J. Chem. Inf. Comput. Sci. 1999, 39, 569–574. 10.1021/ci980159j. [DOI] [PubMed] [Google Scholar]
- Bonachéra F.; Parent B.; Barbosa F.; Froloff N.; Horvath D. Fuzzy tricentric pharmacophore fingerprints. 1. Topological fuzzy pharmacophore triplets and adapted molecular similarity scoring schemes. J. Chem. Inf. Model. 2006, 46, 2457–2477. 10.1021/ci6002416. [DOI] [PubMed] [Google Scholar]
- Wood D. J.; Vlieg J. d.; Wagener M.; Ritschel T. Pharmacophore fingerprint-based approach to binding site subpocket similarity and its application to bioisostere replacement. J. Chem. Inf. Model. 2012, 52, 2031–2043. 10.1021/ci3000776. [DOI] [PubMed] [Google Scholar]
- Warszycki D.; Struski Ł.; Śmieja M.; Kafel R.; Kurczab R. Pharmacoprint: A Combination of a Pharmacophore Fingerprint and Artificial Intelligence as a Tool for Computer-Aided Drug Design. J. Chem. Inf. Model. 2021, 61, 5054–5065. 10.1021/acs.jcim.1c00589. [DOI] [PubMed] [Google Scholar]
- Vásquez-Dománguez E.; Armijos-Jaramillo V. D.; Tejera E.; González-Dáaz H. Multioutput perturbation-theory machine learning (PTML) model of ChEMBL data for antiretroviral compounds. Mol. Pharmaceutics 2019, 16, 4200–4212. 10.1021/acs.molpharmaceut.9b00538. [DOI] [PubMed] [Google Scholar]
- Kleandrova V. V.; Scotti M. T.; Scotti L.; Speck-Planche A. Multi-target drug discovery via ptml modeling: applications to the design of virtual dual inhibitors of cdk4 and her2. Curr. Top. Med. Chem. 2021, 21, 661–675. 10.2174/1568026621666210119112845. [DOI] [PubMed] [Google Scholar]
- Sato T.; Honma T.; Yokoyama S. Combining machine learning and pharmacophore-based interaction fingerprint for in silico screening. J. Chem. Inf. Model. 2010, 50, 170–185. 10.1021/ci900382e. [DOI] [PubMed] [Google Scholar]
- Xu Y.; Wang S.; Hu Q.; Gao S.; Ma X.; Zhang W.; Shen Y.; Chen F.; Lai L.; Pei J. CavityPlus: a web server for protein cavity detection with pharmacophore modelling, allosteric site identification and covalent ligand binding ability prediction. Nucleic Acids Res. 2018, 46, W374–W379. 10.1093/nar/gky380. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goodford P. J. A computational procedure for determining energetically favorable binding sites on biologically important macromolecules. J. Med. Chem. 1985, 28, 849–857. 10.1021/jm00145a002. [DOI] [PubMed] [Google Scholar]
- Barillari C.; Marcou G.; Rognan D. Hot-spots-guided receptor-based pharmacophores (HS-Pharm): a knowledge-based approach to identify ligand-anchoring atoms in protein cavities and prioritize structure-based pharmacophores. J. Chem. Inf. Model. 2008, 48, 1396–1410. 10.1021/ci800064z. [DOI] [PubMed] [Google Scholar]
- Tran-Nguyen V.-K.; Da Silva F.; Bret G.; Rognan D. All in one: Cavity detection, druggability estimate, cavity-based pharmacophore perception, and virtual screening. J. Chem. Inf. Model. 2019, 59, 573–585. 10.1021/acs.jcim.8b00684. [DOI] [PubMed] [Google Scholar]
- Schuetz D. A.; Seidel T.; Garon A.; Martini R.; Körbel M.; Ecker G. F.; Langer T. GRAIL: Grids of pharmacophore interaction fields. J. Chem. Theory Comput. 2018, 14, 4958–4970. 10.1021/acs.jctc.8b00495. [DOI] [PubMed] [Google Scholar]
- Verdonk M. L.; Cole J. C.; Taylor R. SuperStar: a knowledge-based approach for identifying interaction sites in proteins. J. Mol. Biol. 1999, 289, 1093–1108. 10.1006/jmbi.1999.2809. [DOI] [PubMed] [Google Scholar]
- Yu W.; Lakkaraju S. K.; Raman E. P.; Fang L.; MacKerell A. D. Jr. Pharmacophore modeling using site-identification by ligand competitive saturation (SILCS) with multiple probe molecules. J. Chem. Inf. Model. 2015, 55, 407–420. 10.1021/ci500691p. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yu W.; Lakkaraju S. K.; Raman E. P.; MacKerell A. D. Jr. Site-identification by ligand competitive saturation (SILCS) assisted pharmacophore modeling. J. Comput.-Aided Mol. Des. 2014, 28, 491–507. 10.1007/s10822-014-9728-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee J. Y.; Krieger J. M.; Li H.; Bahar I. Pharmmaker: Pharmacophore modeling and hit identification based on druggability simulations. Protein Sci. 2020, 29, 76–86. 10.1002/pro.3732. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bakan A.; Nevins N.; Lakdawala A. S.; Bahar I. Druggability assessment of allosteric proteins by dynamics simulations in the presence of probe molecules. J. Chem. Theory Comput. 2012, 8, 2435–2447. 10.1021/ct300117j. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Humphrey W.; Dalke A.; Schulten K. VMD: visual molecular dynamics. J. Mol. Graph. 1996, 14, 33–38. 10.1016/0263-7855(96)00018-5. [DOI] [PubMed] [Google Scholar]
- Zhou Q.-Y.; Park J.; Koltun V.. Open3D: A modern library for 3D data processing. arXiv preprint arXiv:1801.09847 2018.
- Zhou Q.-Y.; Park J.; Koltun V. In Fast global registration. European conference on computer vision; Springer, 2016; pp. 766–782. [Google Scholar]
- Park J.; Zhou Q.-Y.; Koltun V. In Colored point cloud registration revisited .Proceedings of the IEEE international conference on computer vision; ICCV; 2017; pp. 143–152. [Google Scholar]
- Besl P. J.; McKay N. D. In Method for registration of 3-D shapes .Sensor fusion IV: control paradigms and data structures; International Society for Optics and Photonics: 1992; pp. 586–606. [Google Scholar]
- Raschka S. BioPandas: Working with molecular structures in pandas DataFrames. J. Open Source Softw. 2017, 2, 279. 10.21105/joss.00279. [DOI] [Google Scholar]
- Rusu R. B.; Blodow N.; Beetz M. In Fast point feature histograms (FPFH) for 3D registration. 2009 IEEE international conference on robotics and automation; IEEE: 2009; pp. 3212–3217. [Google Scholar]
- Fischler M. A.; Bolles R. C. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 1981, 24, 381–395. 10.1145/358669.358692. [DOI] [Google Scholar]
- Mysinger M. M.; Carchia M.; Irwin J. J.; Shoichet B. K. Directory of useful decoys, enhanced (DUD-E): better ligands and decoys for better benchmarking. J. Med. Chem. 2012, 55, 6582–6594. 10.1021/jm300687e. [DOI] [PMC free article] [PubMed] [Google Scholar]
- O’Boyle N. M.; Banck M.; James C. A.; Morley C.; Vandermeersch T.; Hutchison G. R. Open Babel: An open chemical toolbox. Aust. J. Chem. 2011, 3, 1. 10.1186/1758-2946-3-33. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sealy Gossetm W. S. The probable error of a mean. Biometrika 1908, 1–25. 10.2307/2331554. [DOI] [Google Scholar]
- Snedecor G. W.Statistical methods: applied to experiments in agriculture and biology; The Iowa state college press: 1956. [Google Scholar]
- Hunter J. D. Matplotlib: A 2D graphics environment. Comput. Sci. Eng. 2007, 9, 90–95. 10.1109/MCSE.2007.55. [DOI] [Google Scholar]
- Van Der Walt S.; Colbert S. C.; Varoquaux G. The NumPy array: a structure for efficient numerical computation. Comput. Sci. Eng. 2011, 13, 22–30. 10.1109/MCSE.2011.37. [DOI] [Google Scholar]
- Schrödinger L.Schrödinger, Release 2020–4: Maestro; Schrödinger: LLC: 2020. [Google Scholar]
- Release S.4: Glide; Schrödinger: LLC, New York, NY: 2018. [Google Scholar]
- Yeung C. M.; Klein L. L.; Flentge C. A.; Randolph J. T.; Zhao C.; Sun M.; Dekhtyar T.; Stoll V. S.; Kempf D. J. Oximinoarylsulfonamides as potent HIV protease inhibitors. Bioorg. Med. Chem. Lett. 2005, 15, 2275–2278. 10.1016/j.bmcl.2005.03.008. [DOI] [PubMed] [Google Scholar]
- Miller J. F.; Andrews C. W.; Brieger M.; Furfine E. S.; Hale M. R.; Hanlon M. H.; Hazen R. J.; Kaldor I.; McLean E. W.; Reynolds D.; Sammond D. M.; Spaltenstein A.; Tung R.; Turner E. M.; Xu R. X.; Sherrill R. G. Ultra-potent P1 modified arylsulfonamide HIV protease inhibitors: the discovery of GW0385. Bioorg. Med. Chem. Lett. 2006, 16, 1788–1794. 10.1016/j.bmcl.2006.01.035. [DOI] [PubMed] [Google Scholar]
- Bourne Y.; Kolb H. C.; Radić Z.; Sharpless K. B.; Taylor P.; Marchot P. Freeze-frame inhibitor captures acetylcholinesterase in a unique conformation. Proc. Natl. Acad. Sci. 2004, 101, 1449–1454. 10.1073/pnas.0308206100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Colletier J. P.; Sanson B.; Nachon F.; Gabellieri E.; Fattorusso C.; Campiani G.; Weik M. Conformational Flexibility in the Peripheral Site of Torpedo californica Acetylcholinesterase Revealed by the Complex Structure with a Bifunctional Inhibitor. J. Am. Chem. Soc. 2006, 128, 4526–4527. 10.1021/ja058683b. [DOI] [PubMed] [Google Scholar]
- Talapati S. R.; Nataraj V.; Pothuganti M.; Gore S.; Ramachandra M.; Antony T.; More S. S.; Krishnamurthy N. R. Structure of cyclin-dependent kinase 2 (CDK2) in complex with the specific and potent inhibitor CVT-313. Acta Crystallogr., Sect. F: Struct. Biol. Commun. 2020, 76, 350–356. 10.1107/S2053230X20009243. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Echalier A.; Cot E.; Camasses A.; Hodimont E.; Hoh F.; Jay P.; Sheinerman F.; Krasinska L.; Fisher D. An integrated chemical biology approach provides insight into Cdk2 functional redundancy and inhibitor sensitivity. Chem. Biol. 2012, 19, 1028–1040. 10.1016/j.chembiol.2012.06.015. [DOI] [PubMed] [Google Scholar]
- Sanders M. P. A.; Barbosa A. J. M.; Zarzycka B.; Nicolaes G. A. F.; Klomp J. P. G.; de Vlieg J.; del Rio A. Comparative analysis of pharmacophore screening tools. J. Chem. Inf. Model. 2012, 52, 1607–1620. 10.1021/ci2005274. [DOI] [PubMed] [Google Scholar]
- Spitzer G. M.; Heiss M.; Mangold M.; Markt P.; Kirchmair J.; Wolber G.; Liedl K. R. One concept, three implementations of 3D pharmacophore-based virtual screening: distinct coverage of chemical search space. J. Chem. Inf. Model. 2010, 50, 1241–1247. 10.1021/ci100136b. [DOI] [PubMed] [Google Scholar]
- Schuster D. 3D pharmacophores as tools for activity profiling. Drug Discovery Today: Technol. 2010, 7, e205–e211. 10.1016/j.ddtec.2010.11.006. [DOI] [PubMed] [Google Scholar]
- Lam P.; Jadhav P. K.; Eyermann C. J.; Hodge C. N.; Ru Y.; Bacheler L. T.; Meek J. L.; Otto M. J.; Rayner M. M.; Wong Y. N.; Chang C. H.; Weber P. C.; Jackson D. A.; Sharpe T. R.; Erickson-Viitanen S. Rational design of potent, bioavailable, nonpeptide cyclic ureas as HIV protease inhibitors. Science 1994, 263, 380–384. 10.1126/science.8278812. [DOI] [PubMed] [Google Scholar]
- Lu S.-H.; Wu J. W.; Liu H.-L.; Zhao J.-H.; Liu K.-T.; Chuang C.-K.; Lin H.-Y.; Tsai W.-B.; Ho Y. The discovery of potential acetylcholinesterase inhibitors: a combination of pharmacophore modeling, virtual screening, and molecular docking studies. J. Biomed .Sci. 2011, 18, 1–13. 10.1186/1423-0127-18-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zou J.; Xie H.-Z.; Yang S.-Y.; Chen J.-J.; Ren J.-X.; Wei Y.-Q. Towards more accurate pharmacophore modeling: Multicomplex-based comprehensive pharmacophore map and most-frequent-feature pharmacophore model of CDK2. J. Mol. Graphics Modell. 2008, 27, 430–438. 10.1016/j.jmgm.2008.07.004. [DOI] [PubMed] [Google Scholar]
- Wolber G.; Seidel T.; Bendix F.; Langer T. Molecule-pharmacophore superpositioning and pattern matching in computational drug design. Drug Discovery Today 2008, 13, 23–29. 10.1016/j.drudis.2007.09.007. [DOI] [PubMed] [Google Scholar]
- Richmond N. J.; Abrams C. A.; Wolohan P. R. N.; Abrahamian E.; Willett P.; Clark R. D. GALAHAD: 1. Pharmacophore identification by hypermolecular alignment of ligands in 3D. J. Comput.-Aided Mol. Des. 2006, 20, 567–587. 10.1007/s10822-006-9082-y. [DOI] [PubMed] [Google Scholar]
- Douguet D.; Payan F.. SENSAAS (SENsitive Surface As A Shape): utilizing open-source algorithms for 3D point cloud alignment of molecules. arXiv preprint arXiv:1908.11267 2019.
- Eguida M.; Rognan D. A computer vision approach to align and compare protein cavities: application to fragment-based drug design. J. Med. Chem. 2020, 63, 7127–7142. 10.1021/acs.jmedchem.0c00422. [DOI] [PubMed] [Google Scholar]
- Bertolazzi P.; Liuzzi G.; Guerra C. In A global optimization algorithm for protein surface alignment .2009 IEEE International Conference on Bioinformatics and Biomedicine Workshop; IEEE: 2009; pp. 93–100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen J.; Belaton B. In An improved iterative closest point algorithm for rigid point registration .International Conference on Machine Learning and Cybernetics; Springer: 2014; pp. 255–263. [Google Scholar]
- Kabsch W. Automatic processing of rotation diffraction data from crystals of initially unknown symmetry and cell constants. J. Appl. Crystallogr. 1993, 26, 795–800. 10.1107/S0021889893005588. [DOI] [Google Scholar]
- Wolber G.; Dornhofer A. A.; Langer T. Efficient overlay of small organic molecules using 3D pharmacophores. J. Comput.-Aided Mol. Des. 2007, 20, 773–788. 10.1007/s10822-006-9078-7. [DOI] [PubMed] [Google Scholar]
- Agarwal S.; Sharma I.; Varma D. A.; Raj A. N. J. In A hybrid [ICP and GA] image registration algorithm for depth images .2014 International Conference on Smart Structures and Systems (ICSSS); IEEE: 2014; pp. 8–14. [Google Scholar]
- Torres D.; Cuevas F. In Three-dimensional Point-cloud Registration using a Genetic Algorithm and the Iterative Closest Point Algorithm ;IJCCI (ECTA-FCTA), 2011; pp. 547–552. [Google Scholar]
- Jones G.; Willett P.; Glen R. C. A genetic algorithm for flexible molecular overlay and pharmacophore elucidation. J. Comput.-Aided Mol. Des. 1995, 9, 532–549. 10.1007/BF00124324. [DOI] [PubMed] [Google Scholar]
- Holland J. H. Genetic algorithms. Sci. Am. 1992, 267, 66–72. 10.1038/scientificamerican0792-66. [DOI] [Google Scholar]
- Trott O.; Olson A. J. AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J. Comput. Chem. 2009, 31, 455–461. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.








