In Silico Mechanistic Profiling to Probe Small Molecule Binding to Sulfotransferases

Virginie Y Martiny; Pablo Carbonell; David Lagorce; Bruno O Villoutreix; Gautier Moroy; Maria A Miteva

doi:10.1371/journal.pone.0073587

. 2013 Sep 6;8(9):e73587. doi: 10.1371/journal.pone.0073587

In Silico Mechanistic Profiling to Probe Small Molecule Binding to Sulfotransferases

Virginie Y Martiny ^1,², Pablo Carbonell ^3,⁴, David Lagorce ^1,², Bruno O Villoutreix ^1,², Gautier Moroy ^1,², Maria A Miteva ^1,^2,^*

Editor: Daniel S Sem⁵

PMCID: PMC3765257 PMID: 24039991

Abstract

Drug metabolizing enzymes play a key role in the metabolism, elimination and detoxification of xenobiotics, drugs and endogenous molecules. While their principal role is to detoxify organisms by modifying compounds, such as pollutants or drugs, for a rapid excretion, in some cases they render their substrates more toxic thereby inducing severe side effects and adverse drug reactions, or their inhibition can lead to drug–drug interactions. We focus on sulfotransferases (SULTs), a family of phase II metabolizing enzymes, acting on a large number of drugs and hormones and showing important structural flexibility. Here we report a novel in silico structure-based approach to probe ligand binding to SULTs. We explored the flexibility of SULTs by molecular dynamics (MD) simulations in order to identify the most suitable multiple receptor conformations for ligand binding prediction. Then, we employed structure-based docking-scoring approach to predict ligand binding and finally we combined the predicted interaction energies by using a QSAR methodology. The results showed that our protocol successfully prioritizes potent binders for the studied here SULT1 isoforms, and give new insights on specific molecular mechanisms for diverse ligands’ binding related to their binding sites plasticity. Our best QSAR models, introducing predicted protein-ligand interaction energy by using docking, showed accuracy of 67.28%, 78.00% and 75.46%, for the isoforms SULT1A1, SULT1A3 and SULT1E1, respectively. To the best of our knowledge our protocol is the first in silico structure-based approach consisting of a protein-ligand interaction analysis at atomic level that considers both ligand and enzyme flexibility, along with a QSAR approach, to identify small molecules that can interact with II phase dug metabolizing enzymes.

Introduction

Drug metabolizing enzymes (DMEs) play a key role in the metabolism of endogenous molecules, xenobiotics and drugs introduced into the human body [1–3]. While their principal role is to detoxify organisms by modifying endogenous and exogenous compounds for a rapid excretion, such as pollutants or drugs, in some cases they render their substrates more toxic thereby inducing severe side effects and adverse drug reactions [4–8]. Phase I DMEs catalyze oxidative reactions leading to metabolites that may be either excreted or additionally modified by the phase II DMEs catalyzing conjugation reactions. In some cases, phase II DMEs can directly modify a compound without passing through the phase I DMEs. Overall, most previous investigations have been prioritizing the phase I DMEs, in particular cytochromes P450 (CYPs) [9–12]. Yet, phase II DMEs metabolize a broad range of compounds that can either be beneficial or lead to toxicity, poor drug bioavailability or adverse drug reactions [4–6,13]. Therefore, much efforts are needed to explore their impact on drug efficacy and safety.

Here we focus on sulfotransferases (SULTs), a family of enzymes that metabolize a large number of drugs [3]. SULTs [14] (Figure 1) catalyze the sulfoconjugation from the co-factor 3′-Phosphoadenosine 5′-Phosphosulfate (PAPS) to a hydroxyl or amino group of the substrate by executing a nucleophilic attack. At high concentrations some substrates inhibit the enzyme [15] and dead-end complexes with bound inactive cofactor PAP have been identified [15–17]. Sulfoconjugation usually facilitates excretion, but in some particular cases the pharmacological activity of some drugs increases (e.g., the hypotensive prodrug minoxidil becomes fully active after sulfate conjugation). Further, SULTs can convert some chemicals to carcinogens or to activators of promutagens by creating highly reactive sulfate esters that can bind covalently to DNA (e.g. 7,12-dimethylbenz(a) anthracene) [4,6–8]. SULTs that are responsible for the metabolism of small endogenous compounds and xenobiotics are localized in the cytosol [4,8,18]. Four families of human SULTs have been identified by now, SULT1, SULT2, SULT4 and SULT6, and more than 30 X-Ray structures, holo or apo, have been reported in the Protein Data Bank (PDB) [14,19]. Among them, SULT1, metabolizing a wide variety of compounds like phenols, thyroid hormones and drugs (e.g. minoxidil, paracetamol, 17α-ethinylestradiol), is the most expressed one (found in liver, lung, intestine, kidney, thyroid, blood or brain [18]).

A: SULT1A3 is represented in cartoon colored according to its secondary structure, red for helices, yellow for strands and green for loops; the co-factor PAP and the co-crystallized ligand L-dopamine are shown in sticks. B: SUTL1A3 is represented in grey surface, with the co-factor PAP and the co-crystallized ligand L-dopamine in sticks.

Notably, experimental and computational approaches have been proposed to predict Absorption, Distribution, Metabolism, Excretion and Toxicity (ADME-Tox) properties of drugs or the response to environmental toxins [1,20–22]. ADME-Tox predictions [12,22–26] are challenging but extremely important in prioritizing appropriate small molecules not only during the selection of potent candidates in drug discovery projects but also, to some extent, for chemical biology studies. Classical in silico ADME-Tox predictions are mostly based on statistical approaches using annotated databases, like Quantitative Structure-Activity Relationships and Quantitative Structure-Property Relationships (QSAR/QSPR) [26–28]. However, the complexity of ADME-Tox molecular mechanisms, for instance specific interactions with DMEs or with other ADME-Tox-related proteins, requires a deep mechanistic understanding [10–12,29] of the ligand-protein interactions at atomic level. Such knowledge should become more accessible for basically all proteins within the next 15 years as structural genomics projects gain full speed [30]. Indeed, in recent years in silico approaches exploiting the 3D structure of ADME-Tox related proteins, like docking/scoring or pharmacophore approaches, were successfully developed to complement QSAR models [10–12,29,31–34].

Here we report a novel in silico structure-based approach to probe ligand binding to SULTs. We developed a protocol combining docking-scoring methods with QSAR modeling in order to predict SULTs ligand binding. One of the key characteristics of DMEs (CYPs, SULTs) is that they are promiscuous, showing a remarkable plasticity of the active site to adapt its conformation to diverse ligands [12,14,17,35,36]. Therefore, we explored the flexibility of SULTs by molecular dynamics (MD) simulations in order to elucidate the molecular mechanisms involved in ligand binding and to identify the most suitable multiple receptor conformations for ligand binding prediction [29,37–39]. Then, we employed a structure-based docking-scoring approach [34,40,41] to probe ligand binding and finally we combined the predicted interaction energies with a QSAR methodology. To the best of our knowledge our protocol is the first in silico structure-based approach consisting of a protein-ligand interaction analysis at atomic level that considers flexibility, in both the ligands and the enzymes, along with a QSAR approach, to identify small molecules that can interact with II phase DMEs. Our results show that the automated protocol successfully prioritizes potent binders for the three studied here isoforms: SULT1A1, SULT1A3 and SULT1E1. Such approach could be very helpful for drug discovery or chemical biology endeavors, alerting for possible SULTs binding and thus risks of toxicity or poor bioavailability, or for possible strategies for the design of a prodrug.

Results

Collecting SULT1 Ligands

We decided to explore three isoforms, SULT1A1, SULT1A3 and SULT1E1, belonging to the largest SULT1 family and for which a number of substrates and inhibitors have been idebntified [14,18,42]. We collected 157, 117 and 80 known binders (substrates or inhibitors) for SULT1A1, SULT1A3 and SULT1E1, respectively (see Materials & Methods section). Several drugs (paracetamol, minoxidil), estrogens and toxic compounds (like bisphenol A used in plastic industry and recently discovered to be toxic) are present in our collection. Chemical structure clustering (see Materials & Methods section for details) resulted in 60, 50 and 33 diverse active compounds for SULT1A1, SULT1A3 and SULT1E1, respectively (examples for each isoform are shown in Table 1 and Figure S1). Putative decoys were taken from the diverse chemical compound libraries ChemBridge™ PremiumSet™ and the Maybridge® HitFinder™. The actives and decoys were merged and filtered using a soft drug-like filter in terms of physicochemical properties (see Materials & Methods section). For each isoform, we obtained two validation datasets, ChemBridge and Maybridge, containing: 46556 and 13148 compounds for SULT1A1; 49546 and 13138 compounds for SULT1A3; 49529 and 13121 compounds for SULT1E1, respectively.

Table 1. Examples of diverse active molecules for each SULT1 isoform assigned in different clusters.

Isoform	Compound	Known Function	Type
SULT1A1	Diflunisac	Non-Steroidal Anti-Inflammatory Drug (NSAID)	Inhibitor: IC₅₀=3.7.10^-6M
	Bisphenol A	Organic compound raising concerns about its presence in consumer products and foods	Substrate: Km=4.2.10^-6 M
	Triclosan	Antibacterial and antifungal agent	Inhibitor: IC₅₀=2.3.10^-6 M
SULT1A3	Resveratrol	Antioxidant	Substrate: Km=1.3.10^-6 M
	Curcumin	Curcumin is a curcuminoid of the Indian spice turmeric.	Inhibitor: IC₅₀=4.10^-6 M
	Mefenamic Acid	Non-Steroidal Anti-Inflammatory Drug (NSAID)	Inhibitor: IC₅₀=150.10^-9 M
SULT1E1	Daidzein	Daidzein is an isoflavone and acts as antioxidant	Substrate: Km=8.10^-9 M
	17αethinylestradiol	Ethinyl estradiol is a derivative of estrogen used in oral contraceptive pills	Substrate: Km=3.10^-6 M
	Trans-Piceatannol	Antioxidant	Inhibitor : Ki=400.10^-9 M

Open in a new tab

Molecular Dynamics Simulations and Multiple Receptor Conformations

We ran three MD simulations for each of the isoforms, SULT1A1, SULT1A3 and SULT1E1, with bound cofactor PAP and without bound ligands. Previously, it has been demonstrated that PAPS and PAP have equivalent stabilizing effects on these SULT isoforms [17,43]. Thus, we used PAP (substituting PAPS) in order to generate protein conformations capable to accommodate substrates and inhibitors. All trajectories showed stable potential energies from 0.5 to 2.0 ns of the production range (Figure S2, S3 and S4 given in the supporting information). Our analysis focuses mainly on the plasticity of the binding sites observed during the MD simulations. The list of residues of the binding sites is given in the supporting information (Text S1). The Solvent Accessible Surface Area (SASA) values of the binding sites along the MD production are shown in Figure 2. Among the three isoforms, SULT1A3 displays the highest values of SASA, i.e. the most open binding site, while SULT1E1 displays the lowest values, i.e. the most closed pocket. These results suggest a different dynamic behavior of the binding pockets for the three isoforms.

For each isoform, we extracted 4500 structures from the three MD productions, from 0.5 to 2.0 ns. In order to select multiple receptor conformations with diverse binding site conformations, we employed Hierarchical Ascendant Classification (HAC) based on the matrix of Root Mean Square Deviation (RMSD) for all atoms of the binding site and of the cofactor. We imposed a RMSD difference of at least 1.3 Å, resulting in 11 conformations for SULT1A1, 7 for SULT1A3 and 7 for SULT1E1 (Figure 3).

Binding pockets characterization

We performed structural analysis of the binding sites of the MD generated conformations and of the X-ray structures (Table 2). The volumes of the binding sites of different SULT1A1 conformations vary from 707.6 to 949.5 Å³. We found pockets with volumes quite similar to that of the X-ray structure (for instance clusters 2, 3, 4, 7) and pockets with volumes larger than that of the X-ray structure. Differences were also observed for SASA. For example, the SASA of the pocket of the cluster 4 conformation is similar to that of the X-ray structure (Figure 4A and 4B). In contrast, the cluster 10 conformation (Figure 4C) reveals a closed pocket in terms of SASA. These results indicate that loop 1 may open as a gate (Figure 4A, 4B, and 4C), as previously suggested in [14,44]. The different clusters centroids show various volumes and SASA, suggesting that the binding site of SULT1A1 may easily adapt its conformation to ligands of different sizes and shapes.

Table 2. Structural characteristics of the binding sites for different isoforms and multiple receptor conformations.

Isoform	Structure	Pocket Volume (Å³)	Pocket Solvent Accessible Surface Area (Å²)
SULT1A1	X-Ray	735.7	963
	Cluster 1	900.7	940
	Cluster 2	787.9	919
	Cluster 3	707.6	914
	Cluster 4	743.2	925
	Cluster 5	895.5	1009
	Cluster 6	882.3	900
	Cluster 7	781.1	837
	Cluster 8	804.5	894
	Cluster 9	949.5	914
	Cluster 10	894.0	870
	Cluster 11	912.5	859
SULT1A3	X-Ray	875.8	770
	Cluster 1	2470.7	1426
	Cluster 2	2749.1	1522
	Cluster 3	2043.4	1425
	Cluster 4	2949.1	1518
	Cluster 5	2588.7	1568
	Cluster 6	2440.0	1455
	Cluster 7	1958.8	1338
SULT1E1	X-Ray	1118.8	379
	Cluster 1	1385.7	356
	Cluster 2	873.8	274
	Cluster 3	952.7	290
	Cluster 4	1337.5	399
	Cluster 5	1774.8	516
	Cluster 6	2023.6	620
	Cluster 7	2123.8	637

Open in a new tab

A: SULT1A1 X-Ray structure. B: SULT1A1 cluster 4 MD structure. C: SULT1A1 cluster 10 MD structure. D: SULT1A3 X-Ray structure. E: SULT1A3 cluster 3 MD structure. F: SULT1A3 cluster 7 MD structure. G: SULT1E1 X-Ray structure. H: SULT1E1 cluster 3 MD structure. I: SULT1E1 cluster 4 MD structure.

Results for SULT1A3 are unexpected. As seen from Table 2, all obtained clusters centroids have huge binding sites with volume three times larger than that of the X-ray structure while SASA can be twice higher than in the X-Ray structure. The structural comparison (Figure 4D, 4E, 4F and Figure 2) shows that the binding site remains open with large volume and SASA during the entire MD trajectories. It is likely that the bound L-dopamine in the X-ray structure maintains the pocket conformation more closed than in the modeled MD apo structures, e.g. an induced fit is present in the complex, as similar phenomena have already been observed in other protein-ligand complex structures [45]. Indeed, the amino group of L-dopamine is located between the two carboxylic groups of Glu146 and Asp86, thus involved in two salt bridges. During the MD simulations without bound ligand, the repulsing interaction between the two carboxylic groups led to the opening of loop 1 (amino acid residues 85-90). Interestingly, the superposition of the two available X-ray structures for SULT1A3, one bound with L-dopamine (PDB ID 2A3R) [46] and one apo (PDB ID 1CJM) [47], shows a displacement of loop 1 of 4.9 Å. Thus, the substrate-binding pocket is largely open in the X-ray apo structure likewise in the modeled MD apo structures. The conformational change of loop 1 of SULT1A3 accompanying the ligand binding [14,44] is strongly supported by the holo and apo X-ray structures and the MD apo conformations.

Next, the MD structures of SULT1E1 show diverse pocket conformations, larger or smaller than the X-ray one (Figure 4G, 4H, 4I and Figure 2). SULT1A1 and SULT1E1 have some similarities regarding their binding sites plasticity. For the two isoforms, the binding pockets of the representative MD structures are either larger or smaller than those of the X-Ray structures. Thus, these isoforms can accommodate their binding sites according to the ligands size.

Some differences of the binding sites for different SULT1 isoforms observed here could explain some of the substrate specificities known for the three isoforms. Overall, the volumes of different binding site conformations of SULT1E1 are larger than those of SULT1A1. The larger pocket of SULT1E1 may facilitate its favorable interactions with large ligands. Indeed, steroids, which are large and quite rigid molecules, are specific substrates for SULT1E1. Further, all MD centroid structures for SULT1A3 have significantly larger binding pockets than the holo X-ray structure. As such, opening of the binding site may facilitate interactions with more bulky ligands than the co-crystallized dopamine. In fact, large ligands also interact with SULT1A3, for instance α-zearalenol, dobutamine or SKF38393. However, experimental SULT1A3 structures with bound large ligands would be very helpful to clarify the ligand-binding mechanism for this isoform.

Identifying the Best Multiple Receptor Conformations by Virtual Screening

Virtual Screening (VS) experiments were performed using docking-scoring approach in order to identify the protein conformations, which can better discriminate known binders from putative decoys. We ran 56 VS for all isoforms. A total of 24 VS were carried out for SULT1A1 on the 11 representative MD conformations and the X-ray structure with two compounds datasets, ChemBridge and Maybridge. Similarly, we performed 16 VS for SULT1A3 and 16 VS for SULT1E1. The best results obtained for the MD centroid conformations and the X-ray structures for each isoform are shown by enrichment graphs in Figure 5. Protein-ligand interaction energies for active centroids as predicted by docking (averaged for the X-ray and the two best performing MD structures) and compared to the experimental affinities are given for illustration in Table S1 (see the Supporting Information). As it is widely accepted, docking approaches cannot achieve an exact prediction of the binding affinities, rather they help to prioritize active molecules among a large number of compounds.

The X-Ray structure is in black and the MD structures in blue and green. 100% refers to all screened compounds including all actives and decoys. The Receiver Operating Characteristic Curve Areas (AUC) are also given. A: SULT1A1 with ChemBridge™ as decoys. B: SULT1A1 with Maybridge® as decoys. C: SULT1A3 with ChemBridge™ as decoys. D: SULT1A3 with Maybridge® as decoys. E: SULT1E1 with ChemBridge™ as decoys. F: SULT1E1 with Maybridge® as decoys.

Two MD structures for SULT1A1 achieve better enrichment results than the X-ray structure on the ChemBridge™ dataset (Figure 5A). Indeed, the structures of clusters 4 and 10 show Area Under Curve (AUC) for the Receiver Operating Characteristic (ROC) curve (not shown) larger than the X-ray structure. The early enrichments are also better for these two MD structures than for the X-ray one. Similar results are obtained on the Maybridge® dataset (Figure 5B). The best performing is the cluster 10 structure adopting slightly larger binding pocket than the holo X-ray structure allowing thus better prediction for the larger actives. In the case of SULT1A3 (Figure 5C and 5D) the VS experiments did not suggest a MD structure performing better than the holo X-ray one, yet early enrichments are slightly better for two MD structures than for the X-ray one in the case of the ChemBridge™ dataset. The MD structures and the holo X-ray for SULT1A3 show similar AUC, lower than those for SULT1A1. The best MD structures for SULT1A3, clusters 3 and 7, have binding sites twice larger than the holo X-ray one, thus, not allowing to correctly rank the small actives and prioritizing large ligands, actives or decoys. Our analysis showed a balanced representation of small and large actives for SULT1A3 in our datasets (see Dataset 1. sdf for SULT1A1, Dataset 2. sdf for SULT1A3 and Dataset 3. sdf for SULT1E1 given in the supporting information). Next, very good results have been obtained for SULT1E1 (Figure 5E and 5F). The two best MD structures demonstrate much better early enrichment and larger AUC in comparison to the X-ray structure. The best performing is the cluster 3 structure showing slightly smaller volume of the binding site than the X-ray one.

Table 3. Druggability scores for different isoforms and conformations.

Isoform	Structure	Druggability
SULT1A1	X-Ray	0.91
	Cluster 1	0.92
	Cluster 2	0.77
	Cluster 3	0.62
	Cluster 4	0.88
	Cluster 5	0.81
	Cluster 6	0.83
	Cluster 7	0.63
	Cluster 8	0.82
	Cluster 9	0.94
	Cluster 10	0.97
	Cluster 11	0.85
SULT1A3	X-Ray	0.92
	Cluster 1	0.41
	Cluster 2	0.32
	Cluster 3	0.36
	Cluster 4	0.43
	Cluster 5	0.62
	Cluster 6	0.81
	Cluster 7	0.79
SULT1E1	X-Ray	0.91
	Cluster 1	0.78
	Cluster 2	0.97
	Cluster 3	0.96
	Cluster 4	0.94
	Cluster 5	0.46
	Cluster 6	0.41
	Cluster 7	0.23

Open in a new tab

Overall considering the flexibility of the binding sites via multiple receptor conformations improved the enrichments. We achieved better discrimination for binders of SULT1A1 and SULT1E1 than for SULT1A3. The discrimination performance for SULT1A3 is lower because during the MD simulations this isoform adopted large and open binding site conformations due to the two closely placed Glu146 and Asp86 in the initial structure. On the other hand, the holo X-ray binding pocket is extremely closed, therefore not allowing correct positioning and ranking of large active molecules. In both cases, the enrichment is worsened in comparison with the other isoforms.

We also analyzed the druggability of the MD and X-ray structures. We calculated the druggability score using DogSiteScorer [48]. DogSite calculates several pockets descriptors and employs support vector machine method to return a score of druggability between 0 and 1 (0 – non-druggable, 1 - druggable). A strong druggability score (>0.9) was attributed to the X-ray structures for the three isoforms (see Table 3). Interestingly, our multiple receptor protocol successfully provides 3 MD conformations with a higher druggability score than the X-ray structures for SULT1A1 and for SULT1E1. Moreover, the conformations best discriminating the actives for SULT1A1 and SULT1E1 show druggability scores of 0.97 and 0.96, respectively. Further, lower druggability scores were obtained for the MD conformations of SULT1A3 as compared to the X-ray structure. Despite of the very high druggability score of the X-ray SULT1A3, the obtained enrichment is not very high. Thus, the druggability score is a useful indicator but is not sufficient for a final selection of the best multiple receptor conformations, a validation with known binders can be critical.

To exemplify the predicted protein-ligand interactions, we focused on the first retrieved actives for SULT1E1, known to metabolize estrogen hormones. The first actives ranked by the employed docking-scoring were the estrogen derivatives equilenin and 4-hydroestradiol for the X-ray structure and for the cluster 3 MD structure, respectively (Figure 6). We compared the best scored pose of equilenin to the bioactive conformation of the co-crystallized ligand 3,5,3',5'-tetrachloro-biphenyl-4,4'-diol. Figure 6A shows that equilenin and 3,5,3',5'-tetrachloro-biphenyl-4,4'-diol are well superimposed and that the hydroxyl group is correctly oriented in the docked pose. Indeed, the ligand hydroxyl group points toward the co-factor close to the phenylalanine gates Phe80 and Phe141 and forms a hydrogen bond with H107, which is believed to act as a catalytic base deprotonating the hydroxyl group during the sulfate transfer [49]. Docking of the best-ranked 4-hydroxyestradiol into the MD cluster 3 structure was also successful (Figure 6B), the ligand hydroxyl group, supposed to be sulfated, points toward the co-factor, close to the phenylalanine gates Phe80 and Phe141.

A: X-Ray structure of SULT1E1 with the co-crystallized ligand (3,5,3',5'-tetrachloro-biphenyl-4,4'-diol) in cyan and the best scored pose of the docked equilenin in magenta. B: Best scored pose of the best ranked compound 4-hydroxyestradiol in magenta docked into the MD Cluster 3 structure.

New QSAR models for SULT1

We considered further the molecular structures of the SULT1 active compounds in order to train QSAR classification models. For each isoform, we built two models: a model based solely on topological information of compound structures using extended connectivity fingerprint descriptors (ECFPs) [50] and another model that combined such information with the binding energies computed on the MD protein conformations that best retrieved the active compounds. Such information was then employed to train classifiers using three machine learning methods: support vector machine-based (SVM), random forest and naïve Bayes. The models obtained with the support vector machine gave slightly better performances (see Table S2). Table 4 shows the performance of the SVM classifier with and without included predicted binding energy values. For SULT1E1, we obtained an accuracy of 73.94% (the percentage of correctly predicted active compounds) in a leave-one-out (LOO) cross-validation of the training set, which was increased to 75.46% when introducing the energy values as an additional input feature of the training set. In the case of SULT1A3, the accuracy was raised from 73.80% to 78.00% when using the predicted interaction energies. The QSAR models for SULT1A1 showed lower performance; the obtained accuracy in the leave-one-out cross validation of 60.85% using ECFPs was increased to 67.28% when binding energy information was added. The lower accuracy of SULT1A1 models might be due the higher chemical diversity of the active compounds for this isoform. In all cases, the validation test performed on the external set was successful, with more than 80% of the active compounds correctly classified when using the models with the best performance. The high performance on the external sets might be due to the fact that the external sets contain compounds similar to those of the training set within a Tanimoto similarity of 0.6 (FCFP_4). Therefore our models have been here validated within a domain of applicability given by the clustering threshold.

Table 4. Performance of the SVM QSAR model for each isoform.

Isoform	Accuracy of QSAR model without binding energy information (LOO cross validation)	Accuracy of QSAR model including binding energy information (LOO cross validation)	Accuracy (external set)	Size of the training set	Size of the external set
SULT1E1	73.94%	75.46%	95.24%	120	86
SULT1A3	73.80%	78.00%	89.53%	100	66
SULT1A1	60.85%	67.28%	85.85%	66	42

Open in a new tab

Discussion

Phase II DMEs encounter drugs or other xenobiotics, in general, modifying them into more hydrophilic metabolites to be easily eliminated from the human body. However, highly reactive, mutagenic or carcinogenic metabolites can also be produced [4–8]. Phase II DMEs have up to now attracted much less attention than cytochromes P450 [3,29,51] despite of their critical roles in detoxification. For instance, UGTs and SULTs, two major DMEs, are involved in the metabolism of many clinically used drugs [3]. Inhibiting SULTs can alter their natural functions that can cause drug–drug interactions [16,29,52]. All together these data demonstrate the importance of exploring phase II DMEs. As several X-ray structures for these enzymes are available, such investigation can be carried out at the atomic level. In the present study, we aim at developing a method for prediction of interactions of these enzymes with putative ligands, drugs or xenobiotics like pollutants.

To this end, we developed a novel approach to predict small molecules binding on three main SULT1 isoforms by combination of a docking-scoring approach, which takes into consideration the flexibility of the protein binding site, and QSAR modeling. Indeed, the SULT1 active sites accommodate very diverse ligands in terms of size and chemistry, thus, it is of crucial importance to introduce possible conformational changes for ligand binding prediction. Although many experimental 3D structures of SULTs have been resolved, only a few 3D in silico studies have been reported to date [42–44]. These studies were not fully focused on the exploration of SULT to integrate the protein flexibility for predicting ligand-target interactions for a large number of compounds. Conformational changes observed in the binding sites of SULT1 family suggest that gating of loop 1 can be involved in the ligand binding [14,17,44]. Overall, our MD simulations support similar movements. However, more complex ligand-binding mechanisms might be possible for SULT1A3. It was suggested that substrates may not be completely uncoupled from binding of the cofactor [17].

The better enrichments that we obtained using the MD best structures compared to the rigid X-ray structures confirm that it is indeed critical to take into account the flexibility of the binding sites for SULT1 in order to better discriminate binders from non-binders. The conformational changes of the binding sites of SULT1, e.g. loop 1 gating, facilitate the binding of diverse ligands, which is of major importance for the SULTs role, i.e. the detoxification. We achieved very good predictive results using docking-scoring into the best MD conformations for SULT1A1 and SULT1E1. Recently, Stjernschantz and co-workers [43] screened experimentally 34 potential endocrine-disrupting compounds on the murine and human SULT1E1 to identify selective inhibitors of the human enzyme. They then docked the identified active compounds using the software GOLD [53] and performed subsequent MD simulations of the docked complexes. This process helped to explain in part the selectivity for some of the inhibitors of human SULT1E1. Using our VS docking experiments we retrieved the most potent inhibitors identified in [53] (estrogen derivatives) as the top ranked compounds, confirming the very good prediction performance of our approach for SULT1E1.

Regarding the SULTs substrate diversity and specificities, several SULTs families have been studied by hierarchical clustering of the chemical structures. Two studies, exploring the similarity for local sequences and structural features of the binding site and compound activity profiles [17,54], suggested that SULT1A1 and SULT1A3 could display different substrate specificities due to local sequence and structural differences of the binding sites. As mentioned above, two negatively charged groups are present in the binding site of SULT1A3, Asp86 and Glu146, while two hydrophobic Ala residues are located in the same positions in SULT1A1. Yet, the authors highlighted the difficulty of predicting small-molecule binding patterns for SULT family only from sequence or structural analysis [54]. It is interesting to note that we obtained better QSAR prediction for SULT1A3 than for SULT1A1 suggesting that SULT1A1 is more promiscuous than SULT1A3. These results are consistent with our MD simulations for SULT1A1 suggesting that its binding site is very flexible as adopting open or closed conformations along the MD trajectories that would facilitate the accommodation of diverse ligands. Indeed, SULT1A1 interacts with a large diversity of compounds, including simple phenols, polycyclic aromatic compounds, estrogens. Further, Schapira et al. [42] performed VS experiments using docking of 50,000 compounds (including drugs, clinical candidates and endogenous molecules) into SULT1A3 and SULT1E1 in order to explore substrate selectivity profile. They distinguished correctly the preferential substrate classes for the two isoforms, i.e. catecholamines (e.g. L-dopamine) for SULT1A3 and steroids for SULT1E1. All docking results were carried out into rigid X-ray structures extracted from X-ray structures co-crystallized with the same preferred class of compounds, however, considering the receptor flexibility would be crucial to predict binders structurally different from the co-crystallized ligands.

In order to identify structural factors of substrates controlling the sulfonation several QSAR models have been developed for SULT [55–57]. Taskinen et al. [58] built a QSAR model for SULT1A3 based on 53 catechol compounds using Partial Least Square (PLS). They found that the lipophilicity was positively correlated with sulfonation rate, but specific effects of polar functional groups were more important than the general lipophilicity. In particular, the presence of a positively charged amino group favored sulfonation, whereas the presence of a carboxylate anion strongly decreased reactivity. Finally, the authors obtained a QSAR model with Q² value of 0.72. The same group developed 3D QSAR models performing comparative molecular field analysis (CoMFA) on 95 diverse compounds, obtaining in this case a Q² value of 0.624 for the best model [55]. The two negatively charged residues Glu146 and Asp86 of the binding site explain the preference for a positive charge in the substrate. Such properties of the binding site are explicitly taken into our QSAR model via the predicted protein-ligand binding energy. We achieved 89.53% prediction accuracy for the external set of 66 active compounds for SULT1A3. In fact, the predicted binding energies improved the QSAR models accuracy for the three isoforms highlighting thus the complementarity of the structure-based and QSAR approaches.

We should note that SULTs have not been extensively experimentally screened and we did not find real decoys in publicly available chemistry databases and in several commercial collections (see the Methods section). Therefore, developing in silico models, structure-based or QSAR, to predict ligand binding for SULTs is challenging, it might be possible that some of the used putative decoys are active for some SULT isoforms. Yet, our results demonstrate that the developed approach can be successfully employed to predict ligand binding to the three SULT1s isoforms studied here. Resulting in accurate QSAR models based on structure-based methodology, our approach constitutes a major advance in the in silico prediction of ADME-Tox properties of small molecules related to interactions with phase II DMEs.

Material and Methods

Compound datasets preparation

Small molecules known to bind to SULT1A1, SULT1A3 and SULT1E1, substrates or inhibitors, were collected (Figure 7) from the databases BRENDA [59], Aureus Sciences (http://www.aureus-sciences.com/), TOXNET (http://toxnet.nlm.nih.gov/), PubChem [60] and literature [6,17,18]. We did not found experiment data for inactive molecules, thus we took putative decoy molecules those from two diverse chemical compound collections, ChemBridge™ PremiumSet™ (http://www.chembridge.com/) and the Maybridge® HitFinder™ (http://www.maybridge.com/) composed of 50000 and 14000 compounds, respectively. In order to select only drug-like molecules, all datasets of actives and putative decoys were filtered using the FAF-Drugs 2 server [61] using “soft” drug-like physicochemical properties (see Text S1 in supporting information) while the search utility to remove toxic/reactive groups was turned off. After filtering, 49496 and 13088 decoys remained from the ChemBridge™ and Maybridge® datasets, respectively. In order to select chemically diverse molecules for validation of our approach and to avoid possible over-representation of a chemical series, we performed several initial tests to cluster the actives using the fingerprints ECFP_4, ECFP_6, FCFP_4 and FCFP_4 as implemented in Pipeline Pilot v.7.5 (SciTegic, Inc/Accelrys). Finally, the filtered drug-like actives were clustered using FCFP_4 with a Tanimoto similarity criterion of 0.6. We then created two compound datasets for each SULT1 isoform taking the active centroids of each cluster for the corresponding isoform, and the decoys from the Chembridge™ et and the Maybridge® filtered datasets, respectively.

Protein structures preparation

For the three most important SULT1 isoforms, SULT1A1, SULT1A3 and SULT1E1, we selected X-ray structures with co-crystallized ligands and as complete as possible. For SULT1A1, 5 holo PDB structures are available, all sharing very similar conformations (all atom RMSD between the 5 structures vary between 0.170 and 0.239 Å). The visual analysis of these 5 structures did not show striking differences in the binding sites. For SULT1A3, only 1 holo structure is available. The second one is an apo-form with many missing residues around the cofactor binging site. For SULT1E1, only 1 holo structure is available. Thus, we took the X-ray structures co-crystallized with the ligands: p-nitrophenol (PDB ID: 1LS6 [6]), L-dopamine (PDB ID: 2A3R [46]) and 3,5,3',5'-tetrachloro-biphenyl-4,4’-diol (PDB ID: 1G3M [62]), for the isoforms SULT1A1, SULT1A3 and SULT1E1, respectively. All ligands and water molecules were removed. We kept PAP as cofactor for all MD and docking simulations. The pK_a values of titratable groups were calculated with the Finite Difference Poisson Boltzman approach [63] using the web server tool Protein Continuum Electrostatics (PCE) [64] with the default parameters (dielectric value of 4 and 80 for solute and solvent, respectively). No abnormal titration behaviors were obtained and His protonation was assigned according to the computed pKas.

Molecular dynamics simulations

For each SULT1 isoform we performed three molecular dynamic (MD) simulations using CHARMM c35b1 version [65]. We used the all atoms PARAM27 force field [66,67]. All simulations were performed using monomer structures because previous MD studies for SULT1A1 and SULT1E1 suggested identical behavior for monomers and dimers [43,44]. For each SULT1 structure we kept the cofactor PAP. Topology and parameters of PAP were assigned by using the web server SwissParam [68]. The solvation was taken into account by the Generalized Born implicit solvent function FACTS [69]. Non-bonded interactions were truncated in a cut-off distance of 12 Å with a shift function for electrostatics and switch function for the van der Waals interactions. The protein structures were initially minimized using 500 steps of steepest descent algorithm followed by 500 steps of conjugate gradient algorithm. Distances between heavy atoms and hydrogen atoms were constrained by SHAKE algorithm [70] allowing a time step of 2 fs. The system was heated during 100 ps to reach 300 K and then equilibrated during 200 ps with a temperature window of 300±10 K. The production time was 2 ns for each MD simulation run. We have 3 trajectories per isoform at different initial velocities.

Multiple receptor conformations selection

For each isoform we extracted 4500 structures from the three merged MD trajectories. SASA of the binding sites along the MD trajectories was calculated using CHARMM program and the volume was calculated using CASTp web server [71]. For each isoform the RMSD between the 4500 structures were calculated for all atoms of the binding site and of the cofactor (see Text S1 in the supporting information). We clustered different conformations of the binding sites by applying HAC on the obtained RMSD matrix using the aggregative method ward as implemented in R software [72] and a RMSD distance of at least 1.3Å. We took the centroid structure of each cluster in order to define a representative set of protein conformations for subsequent validation by virtual screening experiments.

Virtual screening experiments

First we performed preliminary docking experiments with DOCK6.0 [73], AutoDock [74] and Vina 1.1.1 [75], to dock several small and large ligands, dopamine, 4-nitrophenol, pentachlorophenol, estradiol and 3,5,3',5'-tetrachloro-biphenyl-4,4'-diol, into the three isoforms. We obtained the best docking results using Vina with a RMSD of the bioactive conformations within 1Å. Thus, for the subsequent prediction of the SULTs binders, we performed VS experiments using Vina. We used grid resolution of 1 Å, number of binding modes of 10 and exhaustiveness of 8. Fifty six VS runs were performed on X-ray structures and protein structures extracted from MD for the two datasets, the ChemBridge™ and the Maybridge®, containing 49496 and 13088 compounds, respectively. For SULT1A1, the grid size was of 24 Å for the three X, Y, Z axes and the center coordinates were set to 22.979, 105.852 and 57.607. For SULT1A3, the grid size was of 26 Å, 24 Å and 30 Å for the three X, Y, Z axes, respectively, and the center coordinates were set to 56.936, 120.268 and -0.116. For SULT1E1, the grid size was of 18 Å, 22 Å and 22 Å for the three X, Y, Z axes, respectively, and the center coordinates were set to -4.619, -16.110 and 33.948.

QSAR classification model of active compounds

The centroids of the clustered active compounds known to bind SULT1A1, SULT1A3 and SULT1E1 (Figure 7) were selected as positive sets in order to train QSAR classification models for compounds binding SULTs. The resulting size was 60 for 1A1, 50 for 1A3 and 33 for 1E1. For each positive set of active compounds, we built balanced training sets by random sampling of negatives from the list of the 13088 putative decoys of the Maybridge® library. Thus, the resulting training sets contained 120, 100, and 66 molecules for 1A1, 1A3, and 1E1, respectively (Table 4). The process was repeated 10 times for each isoform in order to compute average results from multiple random samples of the negative set. To describe topological features of the compounds we used extended connectivity fingerprints (ECFPs) [50] up to an atom vicinity of 2. To reduce the dimensionality of the resulting input feature matrix, we applied principal component analysis using the statistical package R.

Then we used three machine learning methods: a support vector machine by using the kernlab R package (ksvm function) [76], a random forest-based predictor by using the randomForest R package [77], and a naïve Bayesian predictor by using the caret R package [78]. For each of them we built two QSAR models: for the first one we used only ECFP descriptors, while for the second one the protein-ligand binding energies calculated on the best performing MD receptor conformation were added as an additional descriptor. Performance of each QSAR classification model was assessed by the percentage of correctly classified compounds in comparison to the total number of compounds in the set through leave-one-out cross-validation. In addition, we used an external validation dataset that contained all known active compounds after the FAF-Drugs 2 filtering, where the molecules identical with those of the training set were removed (resulting in 86 compounds for 1A1, 66 compounds for 1A3, 42 compounds for 1E1).

Supporting Information

Dataset S1

Active compound centroids for SULT1A1.

(SDF)

Click here for additional data file.^{(125.5KB, sdf)}

Dataset S2

Active compound centroids for SULT1A3.

(SDF)

Click here for additional data file.^{(105.9KB, sdf)}

Dataset S3

Active compound centroids for SULT1E1.

(SDF)

Click here for additional data file.^{(70.1KB, sdf)}

Figure S1

2D structure of diverse active molecules for each SULT1 isoform.

(TIFF)

Click here for additional data file.^{(160.9KB, tiff)}

Figure S2

Potential energies (kcal/mol) for SULT1A1 isoform. A: The potential energy of the MD run 1. B: The potential energy of the MD run 2. C: The potential energy of the MD run 3.

(TIFF)

Click here for additional data file.^{(2.9MB, tiff)}

Figure S3

Potential energies (kcal/mol) for SULT1A3 isoform. A: The potential energy of the MD run 1. B: The potential energy of the MD run 2. C: The potential energy of the MD run 3.

(TIFF)

Click here for additional data file.^{(2.6MB, tiff)}

Figure S4

Potential energies (kcal/mol) for SULT1E1 isoform. A: The potential energy of the MD run 1. B: The potential energy of the MD run 2. C: The potential energy of the MD run 3.

(TIFF)

Click here for additional data file.^{(2.5MB, tiff)}

Table S1

Protein-ligand interaction energies for active centroids as predicted by docking, averaged for the X-ray and the two best performing MD structures, and compared to the experimental affinities.

(DOCX)

Click here for additional data file.^{(131.9KB, docx)}

Table S2

Performance of QSAR models built by using support vector machine, random forest, and naïve Bayes machine-learning methods.

(DOCX)

Click here for additional data file.^{(51.4KB, docx)}

Text S1

Drug-like filter parameters and binding sites’ residues.

(DOCX)

Click here for additional data file.^{(71.3KB, docx)}

Acknowledgments

We thank the INSERM institute, the University Paris Diderot, and the Agence Nationale de la Recherche. We thank Dr. Anne Badel for helpful discussions.

Funding Statement

V.Y.M. was supported by the doctoral school ‘MTCE’ at the Universities Paris Descartes and Paris Diderot. P.C. was supported by Genopole (ATIGE grant) and Agence Nationale de la Recherche. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

1. Merlot C (2010) Computational toxicology--a tool for early safety evaluation. Drug Discov Today 15: 16-22. doi:10.1016/j.drudis.2009.09.010. PubMed: 19835978. [DOI] [PubMed] [Google Scholar]
2. Wang J, Urban L, Bojanic D (2007) Maximising use of in vitro ADMET tools to predict in vivo bioavailability and safety. Expert Opin Drug Metab Toxicol 3: 641-665. doi:10.1517/17425255.3.5.641. PubMed: 17916053. [DOI] [PubMed] [Google Scholar]
3. Testa B, Pedretti A, Vistoli G (2012) Reactions and enzymes in the metabolism of drugs and other xenobiotics. Drug Discov Today 17: 549-560. doi:10.1016/j.drudis.2012.01.017. PubMed: 22305937. [DOI] [PubMed] [Google Scholar]
4. Bojarová P, Williams SJ (2008) Sulfotransferases, sulfatases and formylglycine-generating enzymes: a sulfation fascination. Curr Opin Chem Biol 12: 573-581. doi:10.1016/j.cbpa.2008.06.018. PubMed: 18625336. [DOI] [PubMed] [Google Scholar]
5. Shimada T (2006) Xenobiotic-metabolizing enzymes involved in activation and detoxification of carcinogenic polycyclic aromatic hydrocarbons. Drug Metab Pharmacokinet 21: 257-276. doi:10.2133/dmpk.21.257. PubMed: 16946553. [DOI] [PubMed] [Google Scholar]
6. Gamage NU, Duggleby RG, Barnett AC, Tresillian M, Latham CF et al. (2003) Structure of a human carcinogen-converting enzyme, SULT1A1. Structural and kinetic implications of substrate inhibition. J Biol Chem 278: 7655-7662. doi:10.1074/jbc.M207246200. PubMed: 12471039. [DOI] [PubMed] [Google Scholar]
7. Surh YJ (1998) Bioactivation of benzylic and allylic alcohols via sulfo-conjugation. Chem Biol Interact 109: 221-235. doi:10.1016/S0009-2797(97)00134-8. PubMed: 9566747. [DOI] [PubMed] [Google Scholar]
8. Chapman E, Best MD, Hanson SR, Wong C-H (2004) Sulfotransferases: Structure, Mechanism, Biological Activity, Inhibition, and Synthetic Utility. Angew Chem Int Ed 43: 3526-3548 doi:10.1002/anie.200300631. PubMed: 15293241. [DOI] [PubMed] [Google Scholar]
9. Veith H, Southall N, Huang R, James T, Fayne D et al. (2009) Comprehensive characterization of cytochrome P450 isozyme selectivity across chemical libraries. Nat Biotechnol 27: 1050-1055. doi:10.1038/nbt.1581. PubMed: 19855396. [DOI] [PMC free article] [PubMed] [Google Scholar]
10. Sun H, Scott DO (2010) Structure-based drug metabolism predictions for drug design. Chem Biol Drugs Des 75: 3-17. doi:10.1111/j.1747-0285.2009.00899.x. PubMed: 19878193. [DOI] [PubMed] [Google Scholar]
11. Vedani A, Smiesko M (2009) In silico toxicology in drug discovery - concepts based on three-dimensional models. Altern Lab Anim 37: 477-496. PubMed: 20017578. [DOI] [PubMed] [Google Scholar]
12. Stoll F, Göller AH, Hillisch A (2011) Utility of protein structures in overcoming ADMET-related issues of drug-like compounds. Drug Discov Today 16: 530-538. doi:10.1016/j.drudis.2011.04.008. PubMed: 21554979. [DOI] [PubMed] [Google Scholar]
13. Meng S, Wu B, Singh R, Yin T, Morrow JK et al. (2012) SULT1A3-mediated regiospecific 7-O-sulfation of flavonoids in Caco-2 cells can be explained by the relevant molecular docking studies. Mol Pharm 9: 862-873. doi:10.1021/mp200400s. PubMed: 22352375. [DOI] [PMC free article] [PubMed] [Google Scholar]
14. Dong D, Ako R, Wu B (2012) Crystal structures of human sulfotransferases: insights into the mechanisms of action and substrate selectivity. Expert Opin Drug Metab Toxicol 8: 635-646. doi:10.1517/17425255.2012.677027. PubMed: 22512672. [DOI] [PubMed] [Google Scholar]
15. Gamage NU, Tsvetanov S, Duggleby RG, McManus ME, Martin JL (2005) The structure of human SULT1A1 crystallized with estradiol. An insight into active site plasticity and substrate inhibition with multi-ring substrates. J Biol Chem 280: 41482-41486. doi:10.1074/jbc.M508289200. PubMed: 16221673. [DOI] [PubMed] [Google Scholar]
16. Rohn KJ, Cook IT, Leyh TS, Kadlubar SA, Falany CN (2012) Potent inhibition of human sulfotransferase 1A1 by 17alpha-ethinylestradiol: role of 3'-phosphoadenosine 5'-phosphosulfate binding and structural rearrangements in regulating inhibition and activity. Drug Metab Dispos 40: 1588-1595. doi:10.1124/dmd.112.045583. PubMed: 22593037. [DOI] [PMC free article] [PubMed] [Google Scholar]
17. Allali-Hassani A, Pan PW, Dombrovski L, Najmanovich R, Tempel W et al. (2007) Structural and chemical profiling of the human cytosolic sulfotransferases. PLOS Biol 5: e97. doi:10.1371/journal.pbio.0050097. PubMed: 17425406. [DOI] [PMC free article] [PubMed] [Google Scholar]
18. Gamage N, Barnett A, Hempel N, Duggleby RG, Windmill KF et al. (2006) Human sulfotransferases and their role in chemical metabolism. Toxicol Sci 90: 5-22. PubMed: 16322073. [DOI] [PubMed] [Google Scholar]
19. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN et al. (2000) The Protein Data Bank. Nucleic Acids Res 28: 235-242. doi:10.1093/nar/28.1.235. PubMed: 10592235. [DOI] [PMC free article] [PubMed] [Google Scholar]
20. Hamon J, Whitebread S, Techer-Etienne V, Le Coq H, Azzaoui K et al. (2009) In vitro safety pharmacology profiling: what else beyond hERG? Future Med Chem 1: 645-665. doi:10.4155/fmc.09.51. PubMed: 21426031. [DOI] [PubMed] [Google Scholar]
21. Grigorov MG (2011) Analysis of time course Omics datasets. Methods Mol Biol 719: 153-172. doi:10.1007/978-1-61779-027-0_7. PubMed: 21370083. [DOI] [PubMed] [Google Scholar]
22. Gleeson MP (2008) Generation of a set of simple, interpretable ADMET rules of thumb. J Med Chem 51: 817-834. doi:10.1021/jm701122q. PubMed: 18232648. [DOI] [PubMed] [Google Scholar]
23. Nigsch F, Lounkine E, McCarren P, Cornett B, Glick M et al. (2011) Computational methods for early predictive safety assessment from biological and chemical data. Expert Opin Drug Metab Toxicol 7: 1497-1511. doi:10.1517/17425255.2011.632632. PubMed: 22050465. [DOI] [PubMed] [Google Scholar]
24. Lagorce D, Reynes C, Camproux A-C, Miteva MA, Sperandio O et al. (2011) In Silico ADME/Tox Predictions. In: Tsaioun K, Kates SA. ADMET for Medicinal Chemists. John Wiley & Sons, Inc. pp. 29-124. [Google Scholar]
25. Martiny VY, Pajeva I, Wiese M, Davis AM, Miteva MA (2013). hemoinformatic And chemogneomic Approach to ADMET In: Wang J, Urban L, editors. Predictive ADMET Integrated approaches in Drug Discovery and Development: John Wiley & Sons. [Google Scholar]
26. Gleeson MP, Modi S, Bender A, Robinson RL, Kirchmair J et al. (2012) The challenges involved in modeling toxicity data in silico: a review. Curr Pharm Des 18: 1266-1291. doi:10.2174/138161212799436359. PubMed: 22316153. [DOI] [PubMed] [Google Scholar]
27. Varnek A, Baskin I (2012) Machine learning methods for property prediction in chemoinformatics: Quo Vadis? J Chem Inf Model 52: 1413-1437. doi:10.1021/ci200409x. PubMed: 22582859. [DOI] [PubMed] [Google Scholar]
28. Michielan L, Moro S (2010) Pharmaceutical perspectives of nonlinear QSAR strategies. J Chem Inf Model 50: 961-978. doi:10.1021/ci100072z. PubMed: 20527756. [DOI] [PubMed] [Google Scholar]
29. Moroy G, Martiny VY, Vayer P, Villoutreix BO, Miteva MA (2012) Toward in silico structure-based ADMET prediction in drug discovery. Drug Discov Today 17: 44-55. doi:10.1016/j.drudis.2011.10.023. PubMed: 22056716. [DOI] [PubMed] [Google Scholar]
30. Nair R, Liu J, Soong TT, Acton TB, Everett JK et al. (2009) Structural genomics is the largest contributor of novel structural leverage. J Struct Funct Genomics 10: 181-191. doi:10.1007/s10969-008-9055-6. PubMed: 19194785. [DOI] [PMC free article] [PubMed] [Google Scholar]
31. Xie L, Bourne PE (2011) Structure-based systems biology for analyzing off-target binding. Curr Opin Struct Biol 21: 189-199. doi:10.1016/j.sbi.2011.01.004. PubMed: 21292475. [DOI] [PMC free article] [PubMed] [Google Scholar]
32. Vaz RJ, Zamora I, Li Y, Reiling S, Shen J et al. (2010) The challenges of in silico contributions to drug metabolism in lead optimization. Expert Opin Drug Metab Toxicol 6: 851-861. doi:10.1517/17425255.2010.499123. PubMed: 20565339. [DOI] [PubMed] [Google Scholar]
33. Wolber G, Langer T (2005) LigandScout: 3-D pharmacophores derived from protein-bound ligands and their use as virtual screening filters. J Chem Inf Model 45: 160-169. doi:10.1021/ci049885e. PubMed: 15667141. [DOI] [PubMed] [Google Scholar]
34. Taboureau O, Baell JB, Fernández-Recio J, Villoutreix BO (2012) Established and emerging trends in computational drug discovery in the structural genomics era. Chem Biol 19: 29-41. doi:10.1016/j.chembiol.2011.12.007. PubMed: 22284352. [DOI] [PubMed] [Google Scholar]
35. Teague SJ (2003) Implications of protein flexibility for drug discovery. Nat Rev Drug Discov 2: 527-541. doi:10.1038/nrd1129. PubMed: 12838268. [DOI] [PubMed] [Google Scholar]
36. Berger I, Guttman C, Amar D, Zarivach R, Aharoni A (2011) The molecular basis for the broad substrate specificity of human sulfotransferase 1A1. PLOS ONE 6: e26794. doi:10.1371/journal.pone.0026794. PubMed: 22069470. [DOI] [PMC free article] [PubMed] [Google Scholar]
37. Sperandio O, Mouawad L, Pinto E, Villoutreix BO, Perahia D et al. (2010) How to choose relevant multiple receptor conformations for virtual screening: a test case of Cdk2 and normal mode analysis. Eur Biophys J 39: 1365-1372. doi:10.1007/s00249-010-0592-0. PubMed: 20237920. [DOI] [PubMed] [Google Scholar]
38. Isvoran A, Badel A, Craescu CT, Miron S, Miteva MA (2011) Exploring NMR ensembles of calcium binding proteins: perspectives to design inhibitors of protein-protein interactions. BMC Struct Biol 11: 24. doi:10.1186/1472-6807-11-24. PubMed: 21569443. [DOI] [PMC free article] [PubMed] [Google Scholar]
39. Ivetac A, McCammon JA (2011) Molecular recognition in the case of flexible targets. Curr Pharm Des 17: 1663-1671. doi:10.2174/138161211796355056. PubMed: 21619526. [DOI] [PMC free article] [PubMed] [Google Scholar]
40. Sperandio O, Villoutreix BO, Miteva MA (2011) Structure-Based Virtual Screening. In: Miteva Ma, editor. In silico lead discovery. pp. 20-46 [Google Scholar]
41. Cavasotto CN, Phatak SS (2011) Docking methods for structure-based library design. Methods Mol Biol 685: 155-174. doi:10.1007/978-1-60761-931-4_8. PubMed: 20981523. [DOI] [PubMed] [Google Scholar]
42. Campagna-Slater V, Schapira M (2009) Evaluation of virtual screening as a tool for chemical genetic applications. J Chem Inf Model 49: 2082-2091. doi:10.1021/ci900219u. PubMed: 19702241. [DOI] [PubMed] [Google Scholar]
43. Stjernschantz E, Reinen J, Meinl W, George BJ, Glatt H et al. (2010) Comparison of murine and human estrogen sulfotransferase inhibition in vitro and in silico--implications for differences in activity, subunit dimerization and substrate inhibition. Mol Cell Endocrinol 317: 127-140. doi:10.1016/j.mce.2009.12.001. PubMed: 20025931. [DOI] [PubMed] [Google Scholar]
44. Cook I, Wang T, Almo SC, Kim J, Falany CN et al. (2013) The gate that governs sulfotransferase selectivity. Biochemistry 52: 415-424. doi:10.1021/bi301492j. PubMed: 23256751. [DOI] [PMC free article] [PubMed] [Google Scholar]
45. Rueda M, Bottegoni G, Abagyan R (2009) Consistent improvement of cross-docking results using binding site ensembles generated with elastic network normal modes. J Chem Inf Model 49: 716-725. doi:10.1021/ci8003732. PubMed: 19434904. [DOI] [PMC free article] [PubMed] [Google Scholar]
46. Lu JH, Li HT, Liu MC, Zhang JP, Li M et al. (2005) Crystal structure of human sulfotransferase SULT1A3 in complex with dopamine and 3'-phosphoadenosine 5'-phosphate. Biochem Biophys Res Commun 335: 417-423. doi:10.1016/j.bbrc.2005.07.091. PubMed: 16083857. [DOI] [PubMed] [Google Scholar]
47. Bidwell LM, McManus ME, Gaedigk A, Kakuta Y, Negishi M et al. (1999) Crystal structure of human catecholamine sulfotransferase. J Mol Biol 293: 521-530. doi:10.1006/jmbi.1999.3153. PubMed: 10543947. [DOI] [PubMed] [Google Scholar]
48. Volkamer A, Kuhn D, Grombacher T, Rippmann F, Rarey M (2012) Combining global and local measures for structure-based druggability predictions. J Chem Inf Model 52: 360-372. doi:10.1021/ci200454v. PubMed: 22148551. [DOI] [PubMed] [Google Scholar]
49. Kakuta Y, Petrotchenko EV, Pedersen LC, Negishi M (1998) The sulfuryl transfer mechanism. Crystal structure of a vanadate complex of estrogen sulfotransferase and mutational analysis. J Biol Chem 273: 27325-27330. doi:10.1074/jbc.273.42.27325. PubMed: 9765259. [DOI] [PubMed] [Google Scholar]
50. Carbonell P, Carlsson L, Faulon JL (2013) The stereo signature molecular descriptor. J Chem Inf Model. [DOI] [PubMed] [Google Scholar]
51. Shawahna R, Uchida Y, Declèves X, Ohtsuki S, Yousif S et al. (2011) Transcriptomic and quantitative proteomic analysis of transporters and drug metabolizing enzymes in freshly isolated human brain microvessels. Mol Pharm 8: 1332-1341. doi:10.1021/mp200129p. PubMed: 21707071. [DOI] [PubMed] [Google Scholar]
52. Eagle K (2012) Hypothesis: holiday sudden cardiac death: food and alcohol inhibition of SULT1A enzymes as a precipitant. J Appl Toxicol 32: 751-755. doi:10.1002/jat.2764. PubMed: 22678655. [DOI] [PMC free article] [PubMed] [Google Scholar]
53. Jones G, Willett P, Glen RC, Leach AR, Taylor R (1997) Development and validation of a genetic algorithm for flexible docking. J Mol Biol 267: 727-748. doi:10.1006/jmbi.1996.0897. PubMed: 9126849. [DOI] [PubMed] [Google Scholar]
54. Najmanovich RJ, Allali-Hassani A, Morris RJ, Dombrovsky L, Pan PW et al. (2007) Analysis of binding site similarity, small-molecule similarity and experimental binding profiles in the human cytosolic sulfotransferase family. Bioinformatics 23: e104-e109. doi:10.1093/bioinformatics/btl292. PubMed: 17237076. [DOI] [PubMed] [Google Scholar]
55. Sipilä J, Hood AM, Coughtrie MW, Taskinen J (2003) CoMFA modeling of enzyme kinetics: K(m) values for sulfation of diverse phenolic substrates by human catecholamine sulfotransferase SULT1A3. J Chem Inf Comput Sci 43: 1563-1569. doi:10.1021/ci034089e. PubMed: 14502490. [DOI] [PubMed] [Google Scholar]
56. Sharma V, Duffel MW (2005) A comparative molecular field analysis-based approach to prediction of sulfotransferase catalytic specificity. Methods Enzymol 400: 249-263. doi:10.1016/S0076-6879(05)00014-5. PubMed: 16399353. [DOI] [PubMed] [Google Scholar]
57. Ekuase EJ, Liu Y, Lehmler HJ, Robertson LW, Duffel MW (2011) Structure-activity relationships for hydroxylated polychlorinated biphenyls as inhibitors of the sulfation of dehydroepiandrosterone catalyzed by human hydroxysteroid sulfotransferase SULT2A1. Chem Res Toxicol 24: 1720-1728. doi:10.1021/tx200260h. PubMed: 21913674. [DOI] [PMC free article] [PubMed] [Google Scholar]
58. Taskinen J, Ethell BT, Pihlavisto P, Hood AM, Burchell B et al. (2003) Conjugation of catechols by recombinant human sulfotransferases, UDP-glucuronosyltransferases, and soluble catechol O-methyltransferase: structure-conjugation relationships and predictive models. Drug Metab Dispos 31: 1187-1197. doi:10.1124/dmd.31.9.1187. PubMed: 12920175. [DOI] [PubMed] [Google Scholar]
59. Scheer M, Grote A, Chang A, Schomburg I, Munaretto C et al. (2011) BRENDA, the enzyme information system in 2011. Nucleic Acids Res 39: D670-D676. doi:10.1093/nar/gkq1089. PubMed: 21062828. [DOI] [PMC free article] [PubMed] [Google Scholar]
60. Bolton EE, Wang Y, Thiessen PA, Bryant SH (2008) 12 PubChem: Integrated Platform of Small Molecules and Biological Activities. In: Ralph AW, David CS. Annual Reports in Computational Chemistry Chapter. Elsevier; pp. 217-241. [Google Scholar]
61. Lagorce D, Maupetit J, Baell J, Sperandio O, Tufféry P et al. (2011) The FAF-Drugs2 server: a multistep engine to prepare electronic chemical compound collections. Bioinformatics 27: 2018-2020. doi:10.1093/bioinformatics/btr333. PubMed: 21636592. [DOI] [PubMed] [Google Scholar]
62. Shevtsov S, Petrotchenko EV, Pedersen LC, Negishi M (2003) Crystallographic analysis of a hydroxylated polychlorinated biphenyl (OH-PCB) bound to the catalytic estrogen binding site of human estrogen sulfotransferase. Environ Health Perspect 111: 884-888. doi:10.1289/ehp.6056. PubMed: 12782487. [DOI] [PMC free article] [PubMed] [Google Scholar]
63. Warwicker J, Watson HC (1982) Calculation of the electric potential in the active site cleft due to alpha-helix dipoles. J Mol Biol 157: 671-679. doi:10.1016/0022-2836(82)90505-8. PubMed: 6288964. [DOI] [PubMed] [Google Scholar]
64. Miteva MA, Tufféry P, Villoutreix BO (2005) PCE: web tools to compute protein continuum electrostatics. Nucleic Acids Res 33: W372-W375. doi:10.1093/nar/gki365. PubMed: 15980492. [DOI] [PMC free article] [PubMed] [Google Scholar]
65. Brooks BR, Bruccoleri RE, Olafson BD, States DJ, Swaminathan S et al. (1983) CHARMM: A program for macromolecular energy, minimization, and dynamics calculations. J Comput Chem 4: 187-217. doi:10.1002/jcc.540040211. [Google Scholar]
66. MacKerell AD, Bashford D, Bellott, Dunbrack RL, Evanseck JD et al. (1998) All-Atom Empirical Potential for Molecular Modeling and Dynamics Studies of Proteins. J Phys Chem B 102: 3586-3616. doi:10.1021/jp973084f. [DOI] [PubMed] [Google Scholar]
67. Mackerell AD, Feig M, Brooks CL (2004) Extending the treatment of backbone energetics in protein force fields: Limitations of gas-phase quantum mechanics in reproducing protein conformational distributions in molecular dynamics simulations. J Comput Chem 25: 1400-1415. doi:10.1002/jcc.20065. PubMed: 15185334. [DOI] [PubMed] [Google Scholar]
68. Zoete V, Cuendet MA, Grosdidier A, Michielin O (2011) SwissParam: a fast force field generation tool for small organic molecules. J Comput Chem 32: 2359-2368. doi:10.1002/jcc.21816. PubMed: 21541964. [DOI] [PubMed] [Google Scholar]
69. Haberthür U, Caflisch A (2008) FACTS: Fast analytical continuum treatment of solvation. J Comput Chem 29: 701-715. doi:10.1002/jcc.20832. PubMed: 17918282. [DOI] [PubMed] [Google Scholar]
70. Ryckaert J-P, Ciccotti G, Berendsen HJC (1977) Numerical integration of the cartesian equations of motion of a system with constraints: molecular dynamics of n-alkanes. J Comput Phys 23: 327-341. doi:10.1016/0021-9991(77)90098-5. [Google Scholar]
71. Liang J, Edelsbrunner H, Woodward C (1998) Anatomy of protein pockets and cavities: measurement of binding site geometry and implications for ligand design. Protein Sci 7: 1884-1897. doi:10.1002/pro.5560070905. PubMed: 9761470. [DOI] [PMC free article] [PubMed] [Google Scholar]
72. RDevelopmentCoreTeam (2009) R : A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing. [Google Scholar]
73. Moustakas DT, Lang PT, Pegg S, Pettersen E, Kuntz ID et al. (2006) Development and validation of a modular, extensible docking program: DOCK 5. J Comput Aid Mol Des 20: 601-619. doi:10.1007/s10822-006-9060-4. [DOI] [PubMed] [Google Scholar]
74. Morris GM, Huey R, Lindstrom W, Sanner MF, Belew RK et al. (2009) AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility. J Comput Chem 30: 2785-2791. doi:10.1002/jcc.21256. PubMed: 19399780. [DOI] [PMC free article] [PubMed] [Google Scholar]
75. Trott O, Olson AJ (2010) AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J Comput Chem 31: 455-461. PubMed: 19499576. [DOI] [PMC free article] [PubMed] [Google Scholar]
76. Karatzoglou A, Smola A, Hornik K, Zeileis A (2004) kernlab - An S4 Package for Kernel Methods in R. J Stat Softw 11: 1-20. [Google Scholar]
77. Liaw A, Wiener M (2002) Classification and Regression by randomForest. R NEWS 2 3: 18-22. [Google Scholar]
78. Kuhn M (2008) Building predictive models in R using the caret package. J Stat Softw 28: 1-26.27774042 [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Dataset S1

Active compound centroids for SULT1A1.

(SDF)

Click here for additional data file.^{(125.5KB, sdf)}

Dataset S2

Active compound centroids for SULT1A3.

(SDF)

Click here for additional data file.^{(105.9KB, sdf)}

Dataset S3

Active compound centroids for SULT1E1.

(SDF)

Click here for additional data file.^{(70.1KB, sdf)}

Figure S1

2D structure of diverse active molecules for each SULT1 isoform.

(TIFF)

Click here for additional data file.^{(160.9KB, tiff)}

Figure S2

Potential energies (kcal/mol) for SULT1A1 isoform. A: The potential energy of the MD run 1. B: The potential energy of the MD run 2. C: The potential energy of the MD run 3.

(TIFF)

Click here for additional data file.^{(2.9MB, tiff)}

Figure S3

Potential energies (kcal/mol) for SULT1A3 isoform. A: The potential energy of the MD run 1. B: The potential energy of the MD run 2. C: The potential energy of the MD run 3.

(TIFF)

Click here for additional data file.^{(2.6MB, tiff)}

Figure S4

Potential energies (kcal/mol) for SULT1E1 isoform. A: The potential energy of the MD run 1. B: The potential energy of the MD run 2. C: The potential energy of the MD run 3.

(TIFF)

Click here for additional data file.^{(2.5MB, tiff)}

Table S1

Protein-ligand interaction energies for active centroids as predicted by docking, averaged for the X-ray and the two best performing MD structures, and compared to the experimental affinities.

(DOCX)

Click here for additional data file.^{(131.9KB, docx)}

Table S2

Performance of QSAR models built by using support vector machine, random forest, and naïve Bayes machine-learning methods.

(DOCX)

Click here for additional data file.^{(51.4KB, docx)}

Text S1

Drug-like filter parameters and binding sites’ residues.

(DOCX)

Click here for additional data file.^{(71.3KB, docx)}

[B1] 1. Merlot C (2010) Computational toxicology--a tool for early safety evaluation. Drug Discov Today 15: 16-22. doi:10.1016/j.drudis.2009.09.010. PubMed: 19835978. [DOI] [PubMed] [Google Scholar]

[B2] 2. Wang J, Urban L, Bojanic D (2007) Maximising use of in vitro ADMET tools to predict in vivo bioavailability and safety. Expert Opin Drug Metab Toxicol 3: 641-665. doi:10.1517/17425255.3.5.641. PubMed: 17916053. [DOI] [PubMed] [Google Scholar]

[B3] 3. Testa B, Pedretti A, Vistoli G (2012) Reactions and enzymes in the metabolism of drugs and other xenobiotics. Drug Discov Today 17: 549-560. doi:10.1016/j.drudis.2012.01.017. PubMed: 22305937. [DOI] [PubMed] [Google Scholar]

[B4] 4. Bojarová P, Williams SJ (2008) Sulfotransferases, sulfatases and formylglycine-generating enzymes: a sulfation fascination. Curr Opin Chem Biol 12: 573-581. doi:10.1016/j.cbpa.2008.06.018. PubMed: 18625336. [DOI] [PubMed] [Google Scholar]

[B5] 5. Shimada T (2006) Xenobiotic-metabolizing enzymes involved in activation and detoxification of carcinogenic polycyclic aromatic hydrocarbons. Drug Metab Pharmacokinet 21: 257-276. doi:10.2133/dmpk.21.257. PubMed: 16946553. [DOI] [PubMed] [Google Scholar]

[B6] 6. Gamage NU, Duggleby RG, Barnett AC, Tresillian M, Latham CF et al. (2003) Structure of a human carcinogen-converting enzyme, SULT1A1. Structural and kinetic implications of substrate inhibition. J Biol Chem 278: 7655-7662. doi:10.1074/jbc.M207246200. PubMed: 12471039. [DOI] [PubMed] [Google Scholar]

[B7] 7. Surh YJ (1998) Bioactivation of benzylic and allylic alcohols via sulfo-conjugation. Chem Biol Interact 109: 221-235. doi:10.1016/S0009-2797(97)00134-8. PubMed: 9566747. [DOI] [PubMed] [Google Scholar]

[B8] 8. Chapman E, Best MD, Hanson SR, Wong C-H (2004) Sulfotransferases: Structure, Mechanism, Biological Activity, Inhibition, and Synthetic Utility. Angew Chem Int Ed 43: 3526-3548 doi:10.1002/anie.200300631. PubMed: 15293241. [DOI] [PubMed] [Google Scholar]

[B9] 9. Veith H, Southall N, Huang R, James T, Fayne D et al. (2009) Comprehensive characterization of cytochrome P450 isozyme selectivity across chemical libraries. Nat Biotechnol 27: 1050-1055. doi:10.1038/nbt.1581. PubMed: 19855396. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B10] 10. Sun H, Scott DO (2010) Structure-based drug metabolism predictions for drug design. Chem Biol Drugs Des 75: 3-17. doi:10.1111/j.1747-0285.2009.00899.x. PubMed: 19878193. [DOI] [PubMed] [Google Scholar]

[B11] 11. Vedani A, Smiesko M (2009) In silico toxicology in drug discovery - concepts based on three-dimensional models. Altern Lab Anim 37: 477-496. PubMed: 20017578. [DOI] [PubMed] [Google Scholar]

[B12] 12. Stoll F, Göller AH, Hillisch A (2011) Utility of protein structures in overcoming ADMET-related issues of drug-like compounds. Drug Discov Today 16: 530-538. doi:10.1016/j.drudis.2011.04.008. PubMed: 21554979. [DOI] [PubMed] [Google Scholar]

[B13] 13. Meng S, Wu B, Singh R, Yin T, Morrow JK et al. (2012) SULT1A3-mediated regiospecific 7-O-sulfation of flavonoids in Caco-2 cells can be explained by the relevant molecular docking studies. Mol Pharm 9: 862-873. doi:10.1021/mp200400s. PubMed: 22352375. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B14] 14. Dong D, Ako R, Wu B (2012) Crystal structures of human sulfotransferases: insights into the mechanisms of action and substrate selectivity. Expert Opin Drug Metab Toxicol 8: 635-646. doi:10.1517/17425255.2012.677027. PubMed: 22512672. [DOI] [PubMed] [Google Scholar]

[B15] 15. Gamage NU, Tsvetanov S, Duggleby RG, McManus ME, Martin JL (2005) The structure of human SULT1A1 crystallized with estradiol. An insight into active site plasticity and substrate inhibition with multi-ring substrates. J Biol Chem 280: 41482-41486. doi:10.1074/jbc.M508289200. PubMed: 16221673. [DOI] [PubMed] [Google Scholar]

[B16] 16. Rohn KJ, Cook IT, Leyh TS, Kadlubar SA, Falany CN (2012) Potent inhibition of human sulfotransferase 1A1 by 17alpha-ethinylestradiol: role of 3'-phosphoadenosine 5'-phosphosulfate binding and structural rearrangements in regulating inhibition and activity. Drug Metab Dispos 40: 1588-1595. doi:10.1124/dmd.112.045583. PubMed: 22593037. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B17] 17. Allali-Hassani A, Pan PW, Dombrovski L, Najmanovich R, Tempel W et al. (2007) Structural and chemical profiling of the human cytosolic sulfotransferases. PLOS Biol 5: e97. doi:10.1371/journal.pbio.0050097. PubMed: 17425406. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B18] 18. Gamage N, Barnett A, Hempel N, Duggleby RG, Windmill KF et al. (2006) Human sulfotransferases and their role in chemical metabolism. Toxicol Sci 90: 5-22. PubMed: 16322073. [DOI] [PubMed] [Google Scholar]

[B19] 19. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN et al. (2000) The Protein Data Bank. Nucleic Acids Res 28: 235-242. doi:10.1093/nar/28.1.235. PubMed: 10592235. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B20] 20. Hamon J, Whitebread S, Techer-Etienne V, Le Coq H, Azzaoui K et al. (2009) In vitro safety pharmacology profiling: what else beyond hERG? Future Med Chem 1: 645-665. doi:10.4155/fmc.09.51. PubMed: 21426031. [DOI] [PubMed] [Google Scholar]

[B21] 21. Grigorov MG (2011) Analysis of time course Omics datasets. Methods Mol Biol 719: 153-172. doi:10.1007/978-1-61779-027-0_7. PubMed: 21370083. [DOI] [PubMed] [Google Scholar]

[B22] 22. Gleeson MP (2008) Generation of a set of simple, interpretable ADMET rules of thumb. J Med Chem 51: 817-834. doi:10.1021/jm701122q. PubMed: 18232648. [DOI] [PubMed] [Google Scholar]

[B23] 23. Nigsch F, Lounkine E, McCarren P, Cornett B, Glick M et al. (2011) Computational methods for early predictive safety assessment from biological and chemical data. Expert Opin Drug Metab Toxicol 7: 1497-1511. doi:10.1517/17425255.2011.632632. PubMed: 22050465. [DOI] [PubMed] [Google Scholar]

[B24] 24. Lagorce D, Reynes C, Camproux A-C, Miteva MA, Sperandio O et al. (2011) In Silico ADME/Tox Predictions. In: Tsaioun K, Kates SA. ADMET for Medicinal Chemists. John Wiley & Sons, Inc. pp. 29-124. [Google Scholar]

[B25] 25. Martiny VY, Pajeva I, Wiese M, Davis AM, Miteva MA (2013). hemoinformatic And chemogneomic Approach to ADMET In: Wang J, Urban L, editors. Predictive ADMET Integrated approaches in Drug Discovery and Development: John Wiley & Sons. [Google Scholar]

[B26] 26. Gleeson MP, Modi S, Bender A, Robinson RL, Kirchmair J et al. (2012) The challenges involved in modeling toxicity data in silico: a review. Curr Pharm Des 18: 1266-1291. doi:10.2174/138161212799436359. PubMed: 22316153. [DOI] [PubMed] [Google Scholar]

[B27] 27. Varnek A, Baskin I (2012) Machine learning methods for property prediction in chemoinformatics: Quo Vadis? J Chem Inf Model 52: 1413-1437. doi:10.1021/ci200409x. PubMed: 22582859. [DOI] [PubMed] [Google Scholar]

[B28] 28. Michielan L, Moro S (2010) Pharmaceutical perspectives of nonlinear QSAR strategies. J Chem Inf Model 50: 961-978. doi:10.1021/ci100072z. PubMed: 20527756. [DOI] [PubMed] [Google Scholar]

[B29] 29. Moroy G, Martiny VY, Vayer P, Villoutreix BO, Miteva MA (2012) Toward in silico structure-based ADMET prediction in drug discovery. Drug Discov Today 17: 44-55. doi:10.1016/j.drudis.2011.10.023. PubMed: 22056716. [DOI] [PubMed] [Google Scholar]

[B30] 30. Nair R, Liu J, Soong TT, Acton TB, Everett JK et al. (2009) Structural genomics is the largest contributor of novel structural leverage. J Struct Funct Genomics 10: 181-191. doi:10.1007/s10969-008-9055-6. PubMed: 19194785. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B31] 31. Xie L, Bourne PE (2011) Structure-based systems biology for analyzing off-target binding. Curr Opin Struct Biol 21: 189-199. doi:10.1016/j.sbi.2011.01.004. PubMed: 21292475. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B32] 32. Vaz RJ, Zamora I, Li Y, Reiling S, Shen J et al. (2010) The challenges of in silico contributions to drug metabolism in lead optimization. Expert Opin Drug Metab Toxicol 6: 851-861. doi:10.1517/17425255.2010.499123. PubMed: 20565339. [DOI] [PubMed] [Google Scholar]

[B33] 33. Wolber G, Langer T (2005) LigandScout: 3-D pharmacophores derived from protein-bound ligands and their use as virtual screening filters. J Chem Inf Model 45: 160-169. doi:10.1021/ci049885e. PubMed: 15667141. [DOI] [PubMed] [Google Scholar]

[B34] 34. Taboureau O, Baell JB, Fernández-Recio J, Villoutreix BO (2012) Established and emerging trends in computational drug discovery in the structural genomics era. Chem Biol 19: 29-41. doi:10.1016/j.chembiol.2011.12.007. PubMed: 22284352. [DOI] [PubMed] [Google Scholar]

[B35] 35. Teague SJ (2003) Implications of protein flexibility for drug discovery. Nat Rev Drug Discov 2: 527-541. doi:10.1038/nrd1129. PubMed: 12838268. [DOI] [PubMed] [Google Scholar]

[B36] 36. Berger I, Guttman C, Amar D, Zarivach R, Aharoni A (2011) The molecular basis for the broad substrate specificity of human sulfotransferase 1A1. PLOS ONE 6: e26794. doi:10.1371/journal.pone.0026794. PubMed: 22069470. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B37] 37. Sperandio O, Mouawad L, Pinto E, Villoutreix BO, Perahia D et al. (2010) How to choose relevant multiple receptor conformations for virtual screening: a test case of Cdk2 and normal mode analysis. Eur Biophys J 39: 1365-1372. doi:10.1007/s00249-010-0592-0. PubMed: 20237920. [DOI] [PubMed] [Google Scholar]

[B38] 38. Isvoran A, Badel A, Craescu CT, Miron S, Miteva MA (2011) Exploring NMR ensembles of calcium binding proteins: perspectives to design inhibitors of protein-protein interactions. BMC Struct Biol 11: 24. doi:10.1186/1472-6807-11-24. PubMed: 21569443. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B39] 39. Ivetac A, McCammon JA (2011) Molecular recognition in the case of flexible targets. Curr Pharm Des 17: 1663-1671. doi:10.2174/138161211796355056. PubMed: 21619526. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B40] 40. Sperandio O, Villoutreix BO, Miteva MA (2011) Structure-Based Virtual Screening. In: Miteva Ma, editor. In silico lead discovery. pp. 20-46 [Google Scholar]

[B41] 41. Cavasotto CN, Phatak SS (2011) Docking methods for structure-based library design. Methods Mol Biol 685: 155-174. doi:10.1007/978-1-60761-931-4_8. PubMed: 20981523. [DOI] [PubMed] [Google Scholar]

[B42] 42. Campagna-Slater V, Schapira M (2009) Evaluation of virtual screening as a tool for chemical genetic applications. J Chem Inf Model 49: 2082-2091. doi:10.1021/ci900219u. PubMed: 19702241. [DOI] [PubMed] [Google Scholar]

[B43] 43. Stjernschantz E, Reinen J, Meinl W, George BJ, Glatt H et al. (2010) Comparison of murine and human estrogen sulfotransferase inhibition in vitro and in silico--implications for differences in activity, subunit dimerization and substrate inhibition. Mol Cell Endocrinol 317: 127-140. doi:10.1016/j.mce.2009.12.001. PubMed: 20025931. [DOI] [PubMed] [Google Scholar]

[B44] 44. Cook I, Wang T, Almo SC, Kim J, Falany CN et al. (2013) The gate that governs sulfotransferase selectivity. Biochemistry 52: 415-424. doi:10.1021/bi301492j. PubMed: 23256751. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B45] 45. Rueda M, Bottegoni G, Abagyan R (2009) Consistent improvement of cross-docking results using binding site ensembles generated with elastic network normal modes. J Chem Inf Model 49: 716-725. doi:10.1021/ci8003732. PubMed: 19434904. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B46] 46. Lu JH, Li HT, Liu MC, Zhang JP, Li M et al. (2005) Crystal structure of human sulfotransferase SULT1A3 in complex with dopamine and 3'-phosphoadenosine 5'-phosphate. Biochem Biophys Res Commun 335: 417-423. doi:10.1016/j.bbrc.2005.07.091. PubMed: 16083857. [DOI] [PubMed] [Google Scholar]

[B47] 47. Bidwell LM, McManus ME, Gaedigk A, Kakuta Y, Negishi M et al. (1999) Crystal structure of human catecholamine sulfotransferase. J Mol Biol 293: 521-530. doi:10.1006/jmbi.1999.3153. PubMed: 10543947. [DOI] [PubMed] [Google Scholar]

[B48] 48. Volkamer A, Kuhn D, Grombacher T, Rippmann F, Rarey M (2012) Combining global and local measures for structure-based druggability predictions. J Chem Inf Model 52: 360-372. doi:10.1021/ci200454v. PubMed: 22148551. [DOI] [PubMed] [Google Scholar]

[B49] 49. Kakuta Y, Petrotchenko EV, Pedersen LC, Negishi M (1998) The sulfuryl transfer mechanism. Crystal structure of a vanadate complex of estrogen sulfotransferase and mutational analysis. J Biol Chem 273: 27325-27330. doi:10.1074/jbc.273.42.27325. PubMed: 9765259. [DOI] [PubMed] [Google Scholar]

[B50] 50. Carbonell P, Carlsson L, Faulon JL (2013) The stereo signature molecular descriptor. J Chem Inf Model. [DOI] [PubMed] [Google Scholar]

[B51] 51. Shawahna R, Uchida Y, Declèves X, Ohtsuki S, Yousif S et al. (2011) Transcriptomic and quantitative proteomic analysis of transporters and drug metabolizing enzymes in freshly isolated human brain microvessels. Mol Pharm 8: 1332-1341. doi:10.1021/mp200129p. PubMed: 21707071. [DOI] [PubMed] [Google Scholar]

[B52] 52. Eagle K (2012) Hypothesis: holiday sudden cardiac death: food and alcohol inhibition of SULT1A enzymes as a precipitant. J Appl Toxicol 32: 751-755. doi:10.1002/jat.2764. PubMed: 22678655. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B53] 53. Jones G, Willett P, Glen RC, Leach AR, Taylor R (1997) Development and validation of a genetic algorithm for flexible docking. J Mol Biol 267: 727-748. doi:10.1006/jmbi.1996.0897. PubMed: 9126849. [DOI] [PubMed] [Google Scholar]

[B54] 54. Najmanovich RJ, Allali-Hassani A, Morris RJ, Dombrovsky L, Pan PW et al. (2007) Analysis of binding site similarity, small-molecule similarity and experimental binding profiles in the human cytosolic sulfotransferase family. Bioinformatics 23: e104-e109. doi:10.1093/bioinformatics/btl292. PubMed: 17237076. [DOI] [PubMed] [Google Scholar]

[B55] 55. Sipilä J, Hood AM, Coughtrie MW, Taskinen J (2003) CoMFA modeling of enzyme kinetics: K(m) values for sulfation of diverse phenolic substrates by human catecholamine sulfotransferase SULT1A3. J Chem Inf Comput Sci 43: 1563-1569. doi:10.1021/ci034089e. PubMed: 14502490. [DOI] [PubMed] [Google Scholar]

[B56] 56. Sharma V, Duffel MW (2005) A comparative molecular field analysis-based approach to prediction of sulfotransferase catalytic specificity. Methods Enzymol 400: 249-263. doi:10.1016/S0076-6879(05)00014-5. PubMed: 16399353. [DOI] [PubMed] [Google Scholar]

[B57] 57. Ekuase EJ, Liu Y, Lehmler HJ, Robertson LW, Duffel MW (2011) Structure-activity relationships for hydroxylated polychlorinated biphenyls as inhibitors of the sulfation of dehydroepiandrosterone catalyzed by human hydroxysteroid sulfotransferase SULT2A1. Chem Res Toxicol 24: 1720-1728. doi:10.1021/tx200260h. PubMed: 21913674. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B58] 58. Taskinen J, Ethell BT, Pihlavisto P, Hood AM, Burchell B et al. (2003) Conjugation of catechols by recombinant human sulfotransferases, UDP-glucuronosyltransferases, and soluble catechol O-methyltransferase: structure-conjugation relationships and predictive models. Drug Metab Dispos 31: 1187-1197. doi:10.1124/dmd.31.9.1187. PubMed: 12920175. [DOI] [PubMed] [Google Scholar]

[B59] 59. Scheer M, Grote A, Chang A, Schomburg I, Munaretto C et al. (2011) BRENDA, the enzyme information system in 2011. Nucleic Acids Res 39: D670-D676. doi:10.1093/nar/gkq1089. PubMed: 21062828. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B60] 60. Bolton EE, Wang Y, Thiessen PA, Bryant SH (2008) 12 PubChem: Integrated Platform of Small Molecules and Biological Activities. In: Ralph AW, David CS. Annual Reports in Computational Chemistry Chapter. Elsevier; pp. 217-241. [Google Scholar]

[B61] 61. Lagorce D, Maupetit J, Baell J, Sperandio O, Tufféry P et al. (2011) The FAF-Drugs2 server: a multistep engine to prepare electronic chemical compound collections. Bioinformatics 27: 2018-2020. doi:10.1093/bioinformatics/btr333. PubMed: 21636592. [DOI] [PubMed] [Google Scholar]

[B62] 62. Shevtsov S, Petrotchenko EV, Pedersen LC, Negishi M (2003) Crystallographic analysis of a hydroxylated polychlorinated biphenyl (OH-PCB) bound to the catalytic estrogen binding site of human estrogen sulfotransferase. Environ Health Perspect 111: 884-888. doi:10.1289/ehp.6056. PubMed: 12782487. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B63] 63. Warwicker J, Watson HC (1982) Calculation of the electric potential in the active site cleft due to alpha-helix dipoles. J Mol Biol 157: 671-679. doi:10.1016/0022-2836(82)90505-8. PubMed: 6288964. [DOI] [PubMed] [Google Scholar]

[B64] 64. Miteva MA, Tufféry P, Villoutreix BO (2005) PCE: web tools to compute protein continuum electrostatics. Nucleic Acids Res 33: W372-W375. doi:10.1093/nar/gki365. PubMed: 15980492. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B65] 65. Brooks BR, Bruccoleri RE, Olafson BD, States DJ, Swaminathan S et al. (1983) CHARMM: A program for macromolecular energy, minimization, and dynamics calculations. J Comput Chem 4: 187-217. doi:10.1002/jcc.540040211. [Google Scholar]

[B66] 66. MacKerell AD, Bashford D, Bellott, Dunbrack RL, Evanseck JD et al. (1998) All-Atom Empirical Potential for Molecular Modeling and Dynamics Studies of Proteins. J Phys Chem B 102: 3586-3616. doi:10.1021/jp973084f. [DOI] [PubMed] [Google Scholar]

[B67] 67. Mackerell AD, Feig M, Brooks CL (2004) Extending the treatment of backbone energetics in protein force fields: Limitations of gas-phase quantum mechanics in reproducing protein conformational distributions in molecular dynamics simulations. J Comput Chem 25: 1400-1415. doi:10.1002/jcc.20065. PubMed: 15185334. [DOI] [PubMed] [Google Scholar]

[B68] 68. Zoete V, Cuendet MA, Grosdidier A, Michielin O (2011) SwissParam: a fast force field generation tool for small organic molecules. J Comput Chem 32: 2359-2368. doi:10.1002/jcc.21816. PubMed: 21541964. [DOI] [PubMed] [Google Scholar]

[B69] 69. Haberthür U, Caflisch A (2008) FACTS: Fast analytical continuum treatment of solvation. J Comput Chem 29: 701-715. doi:10.1002/jcc.20832. PubMed: 17918282. [DOI] [PubMed] [Google Scholar]

[B70] 70. Ryckaert J-P, Ciccotti G, Berendsen HJC (1977) Numerical integration of the cartesian equations of motion of a system with constraints: molecular dynamics of n-alkanes. J Comput Phys 23: 327-341. doi:10.1016/0021-9991(77)90098-5. [Google Scholar]

[B71] 71. Liang J, Edelsbrunner H, Woodward C (1998) Anatomy of protein pockets and cavities: measurement of binding site geometry and implications for ligand design. Protein Sci 7: 1884-1897. doi:10.1002/pro.5560070905. PubMed: 9761470. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B72] 72. RDevelopmentCoreTeam (2009) R : A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing. [Google Scholar]

[B73] 73. Moustakas DT, Lang PT, Pegg S, Pettersen E, Kuntz ID et al. (2006) Development and validation of a modular, extensible docking program: DOCK 5. J Comput Aid Mol Des 20: 601-619. doi:10.1007/s10822-006-9060-4. [DOI] [PubMed] [Google Scholar]

[B74] 74. Morris GM, Huey R, Lindstrom W, Sanner MF, Belew RK et al. (2009) AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility. J Comput Chem 30: 2785-2791. doi:10.1002/jcc.21256. PubMed: 19399780. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B75] 75. Trott O, Olson AJ (2010) AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J Comput Chem 31: 455-461. PubMed: 19499576. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B76] 76. Karatzoglou A, Smola A, Hornik K, Zeileis A (2004) kernlab - An S4 Package for Kernel Methods in R. J Stat Softw 11: 1-20. [Google Scholar]

[B77] 77. Liaw A, Wiener M (2002) Classification and Regression by randomForest. R NEWS 2 3: 18-22. [Google Scholar]

[B78] 78. Kuhn M (2008) Building predictive models in R using the caret package. J Stat Softw 28: 1-26.27774042 [Google Scholar]

PERMALINK

In Silico Mechanistic Profiling to Probe Small Molecule Binding to Sulfotransferases

Virginie Y Martiny

Pablo Carbonell

David Lagorce

Bruno O Villoutreix

Gautier Moroy

Maria A Miteva

Roles

Abstract

Introduction

Figure 1. Visualization of the human structure sulfotransferase 1A3 (PDB ID: 2A3R).

Results

Collecting SULT1 Ligands

Table 1. Examples of diverse active molecules for each SULT1 isoform assigned in different clusters.

Molecular Dynamics Simulations and Multiple Receptor Conformations

Figure 2. Solvent Accessible Surface Area (SAS) of binding pockets for the three SULT1 isoforms for one MD production.

Figure 3. Multiple receptor conformations generation.

Binding pockets characterization

Table 2. Structural characteristics of the binding sites for different isoforms and multiple receptor conformations.

Figure 4. Binding sites of different SULT1 structures.

Identifying the Best Multiple Receptor Conformations by Virtual Screening

Figure 5. Enrichment graphs obtained on the X-ray and two best performing MD structures for each isoform.

Table 3. Druggability scores for different isoforms and conformations.

Figure 6. Best scored poses for SULTE1 actives.

New QSAR models for SULT1

Table 4. Performance of the SVM QSAR model for each isoform.

Discussion

Material and Methods

Compound datasets preparation

Figure 7. Compound datasets preparation.

Protein structures preparation

Molecular dynamics simulations

Multiple receptor conformations selection

Virtual screening experiments

QSAR classification model of active compounds

Supporting Information

Acknowledgments

Funding Statement

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases