Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2015 Nov 9;10(11):e0142232. doi: 10.1371/journal.pone.0142232

Linear Interaction Energy Based Prediction of Cytochrome P450 1A2 Binding Affinities with Reliability Estimation

Luigi Capoferri 1, Marlies C A Verkade-Vreeker 1, Danny Buitenhuis 1, Jan N M Commandeur 1, Manuel Pastor 2, Nico P E Vermeulen 1, Daan P Geerke 1,*
Editor: Claudio M Soares3
PMCID: PMC4638363  PMID: 26551865

Abstract

Prediction of human Cytochrome P450 (CYP) binding affinities of small ligands, i.e., substrates and inhibitors, represents an important task for predicting drug-drug interactions. A quantitative assessment of the ligand binding affinity towards different CYPs can provide an estimate of inhibitory activity or an indication of isoforms prone to interact with the substrate of inhibitors. However, the accuracy of global quantitative models for CYP substrate binding or inhibition based on traditional molecular descriptors can be limited, because of the lack of information on the structure and flexibility of the catalytic site of CYPs. Here we describe the application of a method that combines protein-ligand docking, Molecular Dynamics (MD) simulations and Linear Interaction Energy (LIE) theory, to allow for quantitative CYP affinity prediction. Using this combined approach, a LIE model for human CYP 1A2 was developed and evaluated, based on a structurally diverse dataset for which the estimated experimental uncertainty was 3.3 kJ mol-1. For the computed CYP 1A2 binding affinities, the model showed a root mean square error (RMSE) of 4.1 kJ mol-1 and a standard error in prediction (SDEP) in cross-validation of 4.3 kJ mol-1. A novel approach that includes information on both structural ligand description and protein-ligand interaction was developed for estimating the reliability of predictions, and was able to identify compounds from an external test set with a SDEP for the predicted affinities of 4.6 kJ mol-1 (corresponding to 0.8 pK i units).

Introduction

Cytochrome P450s (CYPs) form a ubiquitous superfamily of monooxygenases characterized by the presence of a heme cofactor, that in humans plays a crucial role in phase I drug metabolism [1]. Besides being responsible for about 50% of drug clearance via metabolism, CYPs can also be responsible for prodrug activation or metabolism-dependent toxicity [2]. Furthermore, their inactivation or inhibition can alter the metabolic pathway of co-administered drugs, potentially leading to drug-drug interactions (DDI). In the past few years this has been the cause of removal from the market of several drugs [13]. While in vitro screening for CYP binders and inhibitors is well established as a mean for predicting potential (adverse) drug-drug interactions in vivo [2,3], the interest for in silico methods has recently increased as a fast preliminary screening method in the drug discovery process [4]. However, these methods are still challenged by the substrate promiscuity and large catalytic site malleability of many CYP isoforms, including e.g. drug metabolizing CYP 3A4, 2D6 and 1A2 [4,5].

The broad substrate selectivity of CYPs is due to the large flexibility of their buried catalytic site, which allows many isoforms both to interact with different classes of molecules and to bind the same molecule in multiple orientations, thus enabling the formation of different metabolites of a single compound [6]. Variation in available experimental data is often an additional element of complexity in developing predictive models for binding of ligands (i.e. substrates and/or inhibitors) to CYPs. Several assays have been developed for screening inhibitory activity of small molecules toward CYPs, and use of different types of conditions (solvent composition, substrate probe, expression system, etc.) has been demonstrated to affect the outcome of the measurements [7,8]. Furthermore, CYP inhibitors can act via different mechanisms, including (1) competitive and reversible inhibition, (2) quasi-irreversible inhibition due to coordination with the heme iron, and (3) mechanism-based inhibition, in which an intermediate or product of the catalysis covalently modifies and inactivates the enzyme [9]. Experiments that allow to identify the inhibition mechanism of a compound are not performed routinely. Usually, relative IC 50 values, measured under specific conditions, are reported in literature (instead of measuring absolute inhibition constants), and studies on the mechanism of inhibition are mostly omitted.

Despite of the major challenges in modeling CYP binding introduced above, several computational models have been proposed to model inhibition data categorically or quantitatively, in terms of IC 50 values or inhibition constants (K i) [4].

In this regard, quantitative structure-activity relationship (QSAR) models based on molecular descriptors typically can show predictivity for small series of structurally related molecules, but are not suited for structurally diverse compounds. Using homogeneous experimental data for ~1500 diverse compounds for the most relevant CYP isoforms, Gleeson et al. [10] showed that the ability to describe the extent of CYP inhibition through traditional QSAR descriptors was limited and poor, since the role played by the molecular recognition process was neglected. Mathematical models including pharmacophore elements have been designed with higher predictivity. Similarly, 3D-QSAR and structure-based models that take into account direct information about the active site were also proposed, resulting in predictive local models; exhaustive reviews on QSAR models to predict inhibition of CYPs have been published [4,11].

Some years ago, efforts started in our laboratory to develop dynamical structural models for the prediction of the free energy of binding (ΔG bind) as a measure for ligand binding affinity and, therefore, the inhibitory potency, for series of small molecules toward specific CYP isoforms. In these models, molecular dynamics (MD) simulations are performed of the CYP-ligand complex in different representative binding conformations obtained from molecular docking. To obtain quantitative binding affinity prediction, we have applied an iterative version of Linear Interaction Energy (LIE) theory [12] as introduced by Stjernschantz and Oostenbrink [13]. The high level of detail offered by this approach clearly represents a trade-off in the accuracy of prediction versus computational time expended, for which accessibility has anyway increased in the last years. Moreover, by combining and (re)weighting results from multiple short simulations starting from different protein-ligand configurations, LIE predictions for flexible proteins such as CYPs become not only more accurate but also faster [1315].

However, a general limitation to the extensive application of any structure-based model is the difficulty in estimating the reliability of a prediction, which can be easier provided for pure statistical models. In the case of ligand-based QSAR models, for instance, a prediction is considered reliable when the query compound falls within the area of multidimensional space defined by the molecular descriptors employed by the model that has already been explored by the training set. This evaluation can be carried out in different ways, according to the method applied to interpolate the space defined by the descriptors [16]. In this context, Carriò et al. [17] recently proposed an approach (ADAN) in which several similarity criteria are taken into account to analyze this space, and a query compound can be classified in a reliability category according to the number of the criteria for which the compound is considered outlier. In the current work we introduce an approach to enable reliability estimation of LIE predictions, by using criteria that involve not only the ligand description but also a comparison between protein-ligand interactions sampled by the training and query compounds during MD.

In humans, isoform CYP 1A2 represents approximately 15% of the total CYP liver content and is responsible for 5% of CYP metabolism of the currently marketed drugs [2]. Furthermore, it is responsible for metabolism of many other exogenous and endogenous compounds, such as caffeine, teophylline, steroids, aromatic and heterocyclic amines, and polycyclic aromatic hydrocarbons (PAHs). Inhibitors of CYP 1A2 are usually planar and lipophilic, and are often small-volume molecules that are neutral or weakly basic with few hydrogen bonds donors [18,19]. Consistently, the X-ray resolved structure of this enzyme [20] showed a narrow active site in which residues of helix F and helix I form two parallel substrate binding platforms that are able to allocate flat lipophilic molecules. Additionally, Thr118, Ser122, Thr124 and Asp321 on one side of the active site, and Thr223 and Asp320 on the other, define two areas in which ligands can undertake polar interactions.

Previously reported statistical models based on molecular descriptors gave good performance in qualitative prediction of CYP 1A2 inhibition [21,22], or for quantitative estimation of the inhibitory potency for local classes of compounds (flavonoids and polycyclic aromatic compounds) [23,24]. CoMFA and GRID/GOLPE models have also been developed to enable estimation of inhibitory potency estimation for sets of chemically related compounds [25,26].

In a previous study, a quantitative model for CYP 1A2 ligand-binding affinity prediction was obtained using LIE and a training set of 8 structurally diverse compounds [27]. While the choice of the initial pose in setting up the MD simulations and LIE calculations was knowledge based, the result of the investigation indicated that such technique could be successfully applied in the development of global quantitative models for prediction of CYP 1A2 binding affinities and hence, inhibition.

Here we present a comprehensive quantitative model for the prediction of the affinity (free energy) of (reversible) binding of drug-like compounds toward CYP 1A2. Our model is based on the iterative LIE method [1315] and does not require any a priori knowledge other than the CYP 1A2 crystal structure [20] and IC 50 (or K i) data for calibration. For this purpose, a database of (> 50) molecules was collected from literature and for several compounds from each source, we experimentally determined CYP 1A2 inhibition in our laboratory to evaluate the experimental uncertainty among the different sources used. The dataset was split in a training and test set, and a LIE affinity model was calibrated. Finally, we used our criteria driven approach that accounts for both the nature of the ligands and the protein-ligand interactions, in order to evaluate the quality of external LIE predictions. For that purpose, a set of analyses was performed that was considered representative for the high number of factors playing a role in the computation of LIE predictions. The results obtained suggest that this novel approach, here applied to the CYP 1A2 affinity model, can also represent a general method for estimating the reliability of predictions from other LIE models.

Methods

Chemicals

Mefenamic acid (≥99%), tacrine (≥99%), carvedilol (≥98%), nifedipine (≥98%), ellipticine, α-naphthoflavone (≥97%), ticlopidine (≥99%), 1-naphtol (≥99%), 2-naphtol (≥98%), 4-methoxy-benzaldehyde (≥98%), 2-(p-tolyl)ethylamine (≥97%) were purchased from Sigma-Aldrich (Schnelldorf, Germany); phenacetin was obtained from Brocades-ACF (Maarssen, the Netherlands). 7-Methoxyresorufin was synthesized by the method of Burke and Mayer [28] and final purity was assessed to be higher than 95%.

Enzyme expression

The plasmid containing the human CYP 1A2 cDNA and human NADPH CYP reductase was transformed into Escherichia coli strain DH5α. CYPs were expressed in 3-L flasks containing 300 mL terrific broth (TB) with 1 mM δ-aminolevulinic acid, 0.5 mM thiamine, 400 μL/L trace elements, 100 μg/mL ampicillin, 1 mM isopropyl-β-d-thiogalactopyranoside (IPTG), and 0.5 mM FeCl3. The culture media was inoculated with 3 mL overnight culture. The cells were allowed to grow for 40 h at 28°C and 125 rpm. E. coli’s were collected by centrifugation (4000 × g, 4°C, 15 min) and resuspended in 20 mL 0.1 M potassium phosphate (KPi) glycerol buffer at pH 7.4 (containing 20% glycerol v/v, 0.25 mM ethylenediaminetetraacetic acid (EDTA), and 0.1 mM dithiothreitol (DTT)). Prior to cell disruption by emulsiflex (3 repeats), cells were treated with 0.5 mg/mL lysozyme. The membranes containing the human CYP were isolated by ultracentrifugation in a Beckmann 70Ti rotor (75 min, 40,000 rpm, 4°C), resuspended in KPi-glycerol buffer, and subsequently homogenized by pottering. The CYP 1A2 concentration was determined using the method of Omura and Sato [29] and the enzyme was stored at -80°C until use.

Inhibition assay

The inhibitory activity of several compounds towards human CYP 1A2 was determined using 7-methoxyresorufin as a substrate. 7-Methoxyresorufin was used at its K m value, which was determined to be 2.5 μM (data not shown). Incubations were carried out in a total volume of 200 μL and in the presence of an NADPH regenerating system (NRS) (final concentrations of 0.5 mM NADPH, 10 mM glucose 6-phosphate, and 0.4 unit/mL glucose-6-phosphate dehydrogenase) in a black coaster 96-well plate. CYP 1A2 was pre-incubated for 5 minutes at 37°C with 0.1 M potassium phosphate buffer (pH 7.4), 7-methoxyresorufin and inhibitors, with DMSO at a final concentration of 0.5% (v/v). For the IC 50 determinations the inhibitor concentration was varied between 10 pM and 10 mM, and NRS was added to start the reaction. Resorufin formation was followed fluorimetrically in time for 10 minutes on a Victor2 1420 multilabel counter with excitation at 530 nm and emission at 572 nm. A resorufin calibration curve was used to quantify the amount of product formed. All measurements were performed in triplicate.

Set-up of MD simulations and development of a CYP 1A2 LIE model

Using (iterative) LIE, affinity prediction requires preparation and selection of ligand poses, setting up and running MD simulations, and model calibration and binding free energy calculations [15]. Computational settings and methodological details as applied in the current work are given below, together with details for the criteria used to evaluate the predictive quality of (iterative) LIE models.

Ligand preparation

For all training and test set compounds, inhibition constants K i were derived from IC 50 values from literature according to the Cheng-Prusoff equation [30], and experimental values for the free energy of binding ΔG bind were calculated as

ΔGbind=RTln(Ki). (1)

The initial structure of each ligand was generated using Molecular Operating Environment (MOE) 2012.10 [31] and subsequently neutralized and minimized in OpenBabel 2.3.2 [32] using the MMFF94 force field [33]. The geometry was further optimized by the sqm module of Ambertools [34] at the AM1-BCC [35] semi-empirical level of theory. Molecular structures and literature IC 50 values are reported in Table A in S1 File.

Docking and clustering

The crystal structure of CYP 1A2 was obtained from the Protein Data Bank (code: 2HI4) [20] and refined as described in reference 27. Each ligand was docked into the active site of CYP 1A2 using PLANTS (Protein-Ligand Ant System) version 1.2 [36], and the ChemPLP [37] scoring function. The center of the docking sphere was placed at a distance of 0.78 nm from Fe, along the vector connecting the sulphur atom of the Fe-coordinated cysteine (Cys458) and the heme iron, from which the radius was set to 1.2 nm to define the active site of the protein. Maximally 300 docked poses with mutual root-mean-square deviations (RMSDs) in atomic positions of > 0.2 nm were retained.

Coordinates of the heavy atoms were used as variables for a principal component analysis (PCA) of the docked poses. After dimensionality reduction, the scores obtained were used for subsequent k-means clustering [38]. During this analysis, an additional component or cluster was taken into account in case it would have led to a further increment of at least 5% of the explained variance in the space of the coordinates or scores, respectively. The medoids of the clusters obtained (4 to 7 per ligand) were chosen as representative binding-conformations of the ligand in the CYP 1A2 active site. These configurations were used to filter out potential non-competitive inhibitors and as starting poses for the MD simulations used in the LIE model, as described below.

Identification of (quasi-)irreversible binders

Mechanism-based inhibitors and heme coordinating agents are characterized by specific chemical groups (Table B in S1 File) that can lead to (quasi-)irreversible inhibition in case they can interact with or be activated by the heme iron [9]. The presence of these groups in the ligands was evaluated and in case the corresponding reactive atom center was found closer than 0.6 nm from the heme iron in at least one of the representative poses, a ligand was considered a quasi-irreversible or mechanism-based inhibitor and it was excluded from the dataset.

MD simulations

Every representative binding pose obtained from clustering was used as starting configuration for use in MD. Simulations were carried out using the GROMACS 4.5.5 package [39]. Topologies of the ligands were automatically created by ACPYPE [40] using the General Amber Force Field [41] as potential, while the Amber99SB [42] force field was applied to describe the protein. To model the heme group, force field parameters reported in reference [43] were used. Each complex was solvated in a dodecahedral box filled with TIP3P water [44] (~20,000 solvent molecules), and 7 Cl- ions were added to neutralize the system. The system was energy minimized and gradually heated up to 300 K in three NVT simulations of 20 ps at 100K, 200K, and 300 K, respectively, in which harmonic potentials were used to positionally restrain Cα atoms (with force constants of 10000, 5000, and 50 kJ nm-2 mol-1, respectively) and other heavy atoms (with force constants of 2000, 1000, and 10 kJ nm-2 mol-1, respectively). Subsequently, 2.5 ns of unrestrained NpT simulations were performed at 1.01325 bar and 300 K, of which the first 0.5 ns were discarded in further analyses. A leap-frog algorithm was employed for integrating the equations of motion [45]. Heavy hydrogens (with a mass of 4.032 amu) [46] were used and all bonds were constrained using the LINCS algorithm [47], allowing a time-step of 4 fs. A Berendsen thermostat [48] was employed to maintain the temperature of the system close to its reference value, using separate temperature baths for the solvent and solute degrees of freedom, with a coupling time of 0.1 ps. A Berendsen barostat [48] with a coupling time of 0.5 ps and an isothermal compressibility of 4.5×10−5 bar-1 was used to maintain the pressure close to its reference value during NpT simulations. Van der Waals and short-range electrostatic interactions were explicitly evaluated every time step for pairs of atoms within a 0.9 nm cutoff, and a grid-based neighbor list was used and updated every 2 time steps. Long-range electrostatic interactions were included by using the smooth particle mesh Ewald method [49] with a maximum fast Fourier transform grid spacing of 0.125 nm for the reciprocal space sum.

To evaluate average ligand interaction energies of the unbound ligands in water, each ligand was solvated in a dodecahedral box filled with approximately 650 TIP3P water molecules, and no counterions were added. The MD protocol was identical to the one described for the simulations of the protein-ligand complex.

LIE model

According to LIE theory [12], ΔG bind can be calculated from differences (ΔV Ele and ΔV VdW) in the ensemble-averaged electrostatic ⟨V Ele lig−surr⟩ and van der Waals interaction energies ⟨V VdW lig−surr⟩ between the ligand and its surroundings when simulated in complex with the protein, or in the free state (water). ΔG bind of the ligand to the protein can be calculated as [12]

ΔGbind=α(VligsurrVdWproteinVligsurrVdWwater)+β(VligsurrEleproteinVligsurrElewater)=αΔVVdW+βΔVEle (2)

In Eq 2, α and β are empirical parameters for the van der Waals and electrostatic contribution of the nonbonded interactions to ΔG bind. An offset parameter (γ) might also be considered, although not strictly required for calculation of relative binding affinities. The value of such parameter has been related to the hydrophobicity of the binding site [50], and in any case it has to be determined empirically as well.

When combining results from several MD simulations per ligand that start from different protein-ligand configurations, a relative contribution W i of each simulation i to the total calculated interaction energies of the ligand can be determined as [13,51]

Wi=eΔGi/kBTieΔGi/kBT, (3)

where ΔG i is the ΔG bind value for simulation i, obtained from Eq 2. Using the W i’s, ΔG bind of the ligand averaged over the independent simulations, i can be calculated as

ΔGbind=αiWiΔViVdW+βiWiΔViEle. (4)

During model training, the W i’s are obtained by applying an iterative scheme, as described by Stjernschantz and Oostenbrink [13].

Chemical similarity analysis

For each compound, a molecular fingerprint was created according to the MACCS smart pattern [52]. Similarity among the compounds was evaluated in terms of Tanimoto Scores (TSs) between pairs of fingerprints [53] using RDKit release_2014.09.1 [54]. Every training compound was compared with the other elements of the training set and the TS with the most similar compound was stored for every ligand. The lowest value was used as cut-off for the test compounds when compared to the most similar molecule of the training set.

Nonbonded energy terms distribution analysis

The set of simulations employed in the LIE model were analyzed in terms of ΔV Ele and ΔV VdW, as described in reference 15. For each simulation of the test set compounds, the Mahalanobis distance was computed from the center of the distribution of ΔV Ele and ΔV VdW values obtained for the simulations used to train the LIE model. Supposing a uniform distribution of the energies for the training set, a query compound was considered outlier within this analysis if at least one simulation showed a squared distance higher than those expected for the 95 percentile of the χ 2 distribution for one degree of freedom (i.e., ΔV Ele or ΔV VdW) [55].

Per-residue interaction energy decomposition

Nonbonded interaction energies of the ligand with specific residues surrounding the catalytic site were analyzed during the simulations of the protein-ligand complexes. Residues included in the analysis were selected on the basis of the distance of any of their atoms to the center of the catalytic site. The center of the catalytic site was defined as the center for docking and the cut-off radius to select relevant residues was 1.2 nm. Ensemble nonbonded energy contributes for each residue over the MD simulations were scaled per simulation by the weights W i obtained from the reweighting scheme described above and summed, giving a single set of contributions per residue for each ligand.

PCA was performed for the elements of the training set in the space of the residue-decomposed energy contributes, after preliminary centering of the variables. Components that explained more than 5% of the variance in the original space were taken into account. Elements of the test set were projected on the rotated space and evaluated for being score or orthogonal outliers. The critical score distance (SD crit) was calculated as square root of the 95 percentile of the χ 2 distribution for a degrees of freedom, corresponding to the number of principal components:

SDcrit=χa,0.952 (5)

The orthogonal critical distance (OD crit) was robustly measured as

ODcrit=(median(OD)2/3+MAD(OD2/3)z0.95)3/2 (6)

where MAD is the median absolute deviation and z the 95 percentile of the cumulative normal distribution [56].

Results and Discussion

Assessing the experimental uncertainty

Inhibition studies in literature typically present data for relatively small sets of structurally related compounds. To calibrate a model for the prediction of binding affinity for different classes of molecules, we gathered experimental data from three different literature sources, providing a dataset of 73 compounds. Source 1 presents IC 50 data for 14 marketed drugs that have been screened internally by Bayer HealthCare, according to FDA and UP guidelines, using phenacetin as substrate probe on pooled human liver microsomes [57]. Source 2 consists of eight drugs and common inhibitor probes of CYP 1A2, screened for the inhibitory activity against phenacetin O-deethylation on recombinant CYPs [58]. In Source 3, 51 lactones or substituted naphthalenes and quinolines were screened for their inhibitory potency in a 7-ethoxyresorufin O-dealkylation assay on recombinant CYP 1A2 [25,59]. A sample of four compounds from each dataset was screened in-house, in order to estimate the expected error for a computational model based on the experimental uncertainty. The results from this comparison between our IC 50 data and data from Sources 1–3 are presented in Table 1, and show that only for compounds 1 and 6–9 the difference in the values from the different assays was within the experimental error. Representing the affinity as ΔG bind, the root mean square deviation (RMSD) in experimental affinity is 3.3 kJ mol-1 over the full set of measurements, with deviations spanning between 0.4 and 6.9 kJ mol-1.

Table 1. Comparison of IC 50 and ΔG bind values for CYP 1A2 as determined in-house (inhouse) and gathered from literature sources (lit), expressed in μM and kJ mol-1, respectively.

ΔΔG bind refers to the difference in ΔG bind between literature and in-house determined values.

ID Compound IC 50 (inhouse) IC 50 (lit) ΔG bind (inhouse) ΔG bind (lit) ΔΔG bind
1 Mefenamic acid a 19 ± 7 13.98 -28.9 ± 1.0 -29.7 -0.8
2 Tacrine a 2.3 ± 1.0 5.2 -34.2 ± 1.2 -32.1 2.1
3 Carvedilol a 5.0 ± 0.2 5.91 -32.2 ± 0.1 -31.8 0.4
4 Nifedipine a 0.7 ± 0.2 5.74 -37.2 ± 0.7 -31.9 5.3
5 Ellipticine b 0.007 ± 0.002 0.11 ± 0.01 -48.7 ± 0.7 -41.8 ± 0.2 6.9
6 Phenacetin b 24 ± 6 28.0 ± 0.46 -28.3 ± 0.6 -27.9 ± 0.0 0.4
7 α-Naphthoflavone b 0.030 ± 0.026 0.08 ± 0.02 -45.0 ± 3.7 -42.6 ± 0.6 2.4
8 Ticlopidine b 12 ± 7 6.1 ± 0.45 -30.1 ± 1.7 -31.8 ± 0.2 -1.7
9 1-naphthol c 2.0 ± 0.7 3.2 ± 0.8 -34.5 ± 0.9 -33.3 ± 0.6 1.2
10 2-naphthol c 4.4 ± 1.2 17 ± 1.5 -32.6 ± 0.7 -29.1 ± 0.2 3.4
11 4-methoxy-benzaldehyde c 410 ± 32 270 ± 85 -21.2 ± 0.2 -22.2 ± 0.8 -1.0
12 2-(p-tolyl)ethylamine c 120 ± 33 14 ± 2.5 -24.3 ± 0.7 -29.6 ± 0.5 -5.3

aLiterature data from Ref. 57;

bLiterature data from Ref. 58;

cLiterature data from Ref. 25.

The analysis presented here provides an estimate of the variability in measured IC 50 values under the different experimental conditions adopted in the studies from which CYP 1A2 inhibition potencies have been gathered. Based on the fact that the uncertainty of any quantitative model is dependent on the uncertainty of the experimental data on which it is based, the observed experimental variability is used as a parameter to evaluate the quality of our model. According to this, a non-overfitted reliable model is expected to show root mean regression errors (RMSE) of at least 3.3 kJ mol-1 and maximum errors in regression of 6.9 kJ mol-1 (corresponding to an error in pK i of approximatively 0.6 and 1.2, respectively, Eq 1).

Restriction of the dataset to competitive binders

Compounds from the data set were docked into the CYP 1A2 catalytic site and clustered in order to obtain representative binding poses for each compound.

It is known that several classes of CYP inhibitors can act in a (quasi-)irreversible manner: aromatic nitrogens can coordinate the heme iron, while biotransformation by CYPs can lead to reactive intermediates and subsequent covalent modification and inactivation of the enzyme itself. In these cases, quantum-mechanic (QM) based corrections should be used to address all the energetic contributes involved in bond breaking and formation. To exclude compounds that would not be properly described by the molecular mechanics (MM) force fields used in MD and LIE, a preliminary filter combining structural information from docking and knowledge-based rules was developed.

The binding poses obtained from docking and clustering were inspected and compounds were excluded from further analysis when bearing chemical groups that are known to lead to quasi-irreversible or mechanism-based inhibition (Table B in S1 File), and when (any of) the reactive atom(s) was within 0.6 nm from the heme iron.

Literature data [6073] on the mechanism of inhibition are available for 14 compounds of the dataset, Table 2. In case of discordant literature data, information from the most recent reference was taken into account. Compounds that were known from literature to be a substrate were here considered as competitive inhibitors, since they can potentially compete for binding to the CYP with other substrates. Our preliminary filter was able to correctly classify 10 of the 14 inhibitors considered (Table 2), while carvedilol, ticlopidine, clopidogrel, and riluzole were wrongly classified as mechanism-based inhibitors. Both clopidogrel and riluzole are metabolized by CYP 1A2 at a chemical position that is considered to lead to mechanism-based inhibition [69,73] representing exceptions to the general rule. Furthermore, our approach showed high precision in detecting competitive inhibitors, since no false positives were detected (Table 2).

Table 2. Comparison between predicted inhibition mechanisms and experimentally determined inhibition mechanisms reported in literature.

ID Compound Predicted Reported
1 Mefenamic acid Competitive Competitive60
2 Tacrine Competitive Substrate61
3 Carvedilol Quasi-irreversible Substrate62
4 Nifedipine Competitive Competitive63
6 Phenacetin Competitive Substrate64
7 α-Naphthoflavone Competitive Competitive/uncompetitive65
8 Ticlopidine Mechanism-based Competitive66
14 Quercetin Competitive Competitive67
17 Naringenin Competitive Substrate68
19 Clopidogrel Mechanism-based Substrate69
23 Mexiletine Competitive Competitive70
24 Furafylline Mechanism-based Mechanism-based71
25 Propranolol Competitive Substrate72
26 Riluzole Mechanism-based Substrate73

While an extensive validation of our filtering approach is beyond the scope of this work, it showed to be accurate enough to be employed as a preliminary filter for building a LIE-based model. After refinement of the entire dataset, 57 compounds were predicted to be competitive inhibitors of CYP 1A2 and subsequently simulated by MD in solvent and in complex with the enzyme, starting from every of the representative binding modes obtained from docking (Table C in S1 File).

The LIE model

Compounds of the refined dataset of 57 compounds were split in a training and test set. In order to ensure a broad chemical diversity within both data sets and a homogeneous representation of the different chemical classes, the compounds were classified according to their structure in naphthalenes/quinolines (compounds 9–10, 27–43), lactones (compounds 50–59) and other small molecules (others). From the three classes 8, 8, and 19 compounds were randomly selected to train the model (Table 3), respectively, while the other 22 compounds were used as external test set. An increasing number of MD simulations per ligand was taken into account to train several LIE models, using for each of them results from the simulations with the lowest ΔG bind. Among these models, the model with lowest standard error in prediction (SDEP) obtained during leave-one-out cross-validation (LOO-CV) was based on maximally 6 simulations per compound, and showed values of 0.587 and 0.267 for α and β in Eq 4, respectively (Fig 1). The RMSE and the SDEP for LOO-CV (SDEPCV) were 4.1 and 4.3 kJ mol-1, respectively, and were close to our estimate for the uncertainty in the experimental data (3.3 kJ mol-1). The 22 compounds that were not included in the training set were used to evaluate the predictive quality of the LIE model. The SDEP obtained for the 22 molecules of the test set was 5.8 kJ mol-1, with differences from experiments between 0.0 and 11.6 kJ mol-1 (Fig 1 and Table 4).

Table 3. Calculated (ΔGbindCalc) and observed (ΔGbindObs) free energies of binding, and corresponding residuals (ΔGbindObs—ΔGbindCalc) for the training-set compounds (kJ mol-1).

ID Compound ΔGbindObs ΔGbindCalc Residual
1 Mefenamic Acid -29.7 -34.0 4.3
2 Tacrine -32.2 -29.4 -2.8
4 Nifedipine -31.9 -29.0 -2.9
6 Phenacetin -27.9 -23.6 -4.3
7 α-naphthoflavon -42.6 -41.3 -1.3
13 HET-0016 -36.1 -36.4 0.3
16 Niflumic acid -30.2 -33.1 2.9
23 Mexiletine -34.8 -27.0 -7.8
25 Propranolol -32.9 -31.8 -1.1
27 Naphthalene -19.8 -17.6 -2.3
28 1-Methylnaphthalene -24.5 -22.3 -2.1
30 2-Methylnaphthalene -24.2 -20.9 -3.3
32 2-Fluoronaphthalene -26.5 -20.9 -5.6
35 1,3-Dimethylnaphthalene -31.0 -25.1 -5.9
36 1,4-Dimethylnaphthalene -33.0 -25.7 -7.4
38 1,5Dichloronaphthalene -29.6 -26.9 -2.7
42 2,6-Dimethylnaphthalen -26.3 -24.4 -1.9
50 ε-Caprolactone -9.7 -16.3 6.6
51 γ-Valerolactone -12.2 -15.8 3.6
52 γ-Caprolactone -13.2 -17.2 4.0
53 γ-Heptalactone -15.2 -20.2 5.0
54 γ-Nonanoic-lactone -21.9 -25.4 3.6
55 γ-Decanolactone -22.6 -25.9 3.3
56 γ-Undecanolactone -25.5 -29.9 4.4
58 δ-decanolactone -22.6 -26.3 3.6
60 2-Coumarone -22.3 -21.2 -1.1
61 2-Indanone -25.3 -18.4 -6.8
62 2,3-Dihydrobenzofuran -18.5 -18.1 -0.4
64 2-Benzoxalinone -21.4 -22.2 0.8
65 Biphenyl -23.5 -23.0 -0.5
67 4-Chlorobiphenyl -26.5 -26.6 0.1
68 Butylcyclohexane -26.2 -28.4 2.2
70 γ-Phenyl-γ-butyrolactone -16.9 -22.6 5.7
72 2-(p-tolyl)ethylamine -15.2 -19.6 4.4
73 Cotinine -14.7 -22.4 7.7

Fig 1. Correlation between calculated (∆Gbind Calc) and observed (∆GbindObs) binding free energies obtained for the CYP 1A2 LIE model (Eq 4, α = 0.587 and β = 0.267).

Fig 1

The solid line indicates ideal correlation between ∆GbindObs and ∆GbindCalc, and dashed lines represent deviations between calculated and experimental values of ±5 kJ mol−1 (corresponding to an error well within 1.0 pK i units). Compounds from the training set are represented in black. Test-set compounds that were found to be outlier in 0, 1, 2, and 3 analyses are represented in green, yellow, orange, and red, respectively.

Table 4. Calculated (ΔG bind Calc) and observed (ΔGbindObs) free energies of binding (kJ mol-1), and residuals (ΔGbindObs–ΔG bind Calc) for the test-set compounds.

Results from the reliability analyses are given as well, where a score 1 in columns (A)-(D) refers to the identification of outliers according to the following analyses: (A) Chemical similarity analysis; (B) Average interaction energy distribution analysis; (C) Ligand-residue electrostatic interaction analysis; (D) Ligand-residue van der Waals interaction analysis. In the last column (Total), the total sum of the number of analyses is reported in which a compound is identified as an outlier.

Outlier identification
ID ΔG bind Obs ΔG bind Calc Residual A B C D Total
9 -33.3 -21.7 -11.6 0 0 1 0 1
10 -29.1 -22.6 -6.5 0 0 0 0 0
11 -22.2 -20.4 -1.8 0 0 0 0 0
12 -29.6 -23.2 -6.4 1 0 1 0 2
14 -32.0 -40.0 8.0 0 1 1 1 3
17 -30.0 -37.8 7.8 0 1 1 1 3
20 -28.1 -35.1 7.0 0 1 1 1 3
29 -26.4 -22.6 -3.9 0 0 0 0 0
31 -28.9 -25.6 -3.3 0 0 0 0 0
33 -29.8 -22.8 -7.0 0 0 0 0 0
34 -32.0 -26.7 -5.3 0 0 0 0 0
37 -34.1 -27.3 -6.8 0 0 0 0 0
39 -26.9 -24.9 -2.0 0 0 0 0 0
40 -28.2 -26.3 -1.9 0 0 0 0 0
41 -31.2 -25.1 -6.1 0 0 0 0 0
43 -25.8 -25.8 0.0 0 0 0 0 0
57 -26.1 -31.4 5.4 0 0 0 1 1
59 -25.5 -31.0 5.5 0 0 0 1 1
63 -20.4 -17.7 -2.8 0 0 0 0 0
66 -22.6 -25.5 2.8 0 0 0 0 0
69 -15.7 -21.7 6.0 0 0 0 0 0
71 -9.7 -15.3 5.6 1 0 0 0 1

Analysis of the relative weight W i for each protein-ligand binding pose (Eq 3, Table C in S1 File) shows that most ligands present multiple binding conformations that significantly contribute to the calculated binding free energy. It is interesting to note that for this dataset of compounds and when not considering the physical meaning of including the contribution from more than one ligand-protein binding conformation to ∆G bind, a model based on only the binding conformation with lowest interaction energies displays RMSE and SDEPCV close to the most predictive model (which includes maximally 6 poses per ligand, Table C in S1 File), with values of 4.2 kJ mol-1 and 4.4 kJ mol-1, respectively. However, it should also be noted that using a single binding pose will still require running multiple simulations to score and rank the simulations based on MD. Therefore, the final model in Fig 1 is as compute-efficient as a model based on a single highest-ranked pose only.

A previous CYP 1A2 LIE model was published by Vasanthanathan et al. using 8 compounds as training set and a single protein simulation per ligand to evaluate ΔV Ele and ΔV VdW in Eq 2 [27]. While this LIE model showed a comparable accuracy in reproducing ΔG bind (with a reported RMSE of 3.7 kJ mol-1), Vasanthanathan found that differences in the electrostatic contributes among the training set did not affect the affinity toward CYP 1A2 (β = 0) and an offset parameter γ was introduced. Despite differences in electrostatic energy contributes, all the eight ligands presented polar groups and were found to undertake hydrogen bonds with water or amino acids within the catalytic site. In the current study the extension of the dataset to lipophilic compounds with no hydrogen bond donors or acceptors allowed a more extensive exploration of the ligand-protein interaction space, indicating a significant role for the electrostatic interactions to the binding affinity towards CYP 1A2 (β = 0.267, Fig 1) and leading to an affinity model in which the offset parameter γ was set to zero.

Assessing the reliability of LIE predictions

The availability of a relatively large set of experimental inhibition data allows us to create a consistent external test set. This makes it possible to perform a more extensive validation of the model and to evaluate an approach to estimate the quality of predictions of a LIE model. Because LIE models include empirically derived parameters, it is reasonable that a prediction for a query compound can be considered reliable only if it is inferred from a region of information defined by the protein-ligand interactions that have been already explored during training of the model. Analysis of this space is required to identify the sources for the relatively large deviations of some of the predicted test-set affinities from experiment, and is performed here subsequently in terms of (i) the chemical similarity between the test set and training compounds, (ii) average protein-ligand interaction energies, and (iii) ligand-protein interactions during simulation.

Chemical similarity analysis

Chemical similarity among the training set was evaluated as TS between each pair of molecules, represented as molecular fingerprints (Fig 2). Average similarity scores among the elements of the training set was between 0.32 (compound 72) and 0.50 (compound 27). Furthermore, every training compound was found to have a TS higher than 0.45 with at least one of the other compounds of the training set. Molecules from the test set were also compared to those of the training set. Average similarity scores ranged between 0.31 (compound 10) and 0.48 (compound 57), and only compounds 12 and 71 showed for all TSs values lower than 0.45 when compared to the training set molecules.

Fig 2. Similarity matrix of the data set.

Fig 2

Heat map of the compounds included in the training and test set, colored according to percent similarity expressed in terms of Tanimoto scores (TSs) between pairs of structural fingerprints (white = 100% similarity (TS = 1.00); black = 0% similarity (TS = 0.00)).

The relatively low average TS obtained among the training set compounds confirmed a significant diversity in terms of chemical structures. An analogous similarity/diversity can be observed when comparing the test set to the training set compounds. Within the test set, compounds 12 and 71 (Table A in S1 File) were considered to be outlier in this structural analysis because of their low TS obtained when compared to the most similar compound within the training set.

The error in prediction for the two compounds was respectively -6.4 and 5.6 kJ mol-1 (Table 4), suggesting that structural analysis based on fingerprint distance is able to identify few compounds with relatively high errors in prediction when compared to the SDEPCV. However, the chemical similarity analysis was not able to detect any of the other LIE predictions with significant deviation from experiment (Fig 1).

Average interaction energy distributions

From a statistical point of view, LIE models can be considered as linear regressions in which average ensemble interaction energies ΔV Ele and ΔV VdW are used as variables (where ΔV Ele and ΔV VdW are obtained from MD simulations of the compounds in solvent and bound to the protein in the different binding configurations obtained from docking). With this in mind, it can be assumed that a prediction will not be accurate if the MD-averaged (differences in) interaction energies as obtained for a query compound do not belong to the distribution of the averaged interaction energies employed to train the model (i.e., as obtained from the MD simulations involving the training compounds) [15].

The Mahalanobis distance from the center of the distribution of training set values for ΔV Ele and ΔV VdW was measured for each simulation of the test set. Compounds 14, 17, 20 were found to show at least one simulation for which the Mahalanobis distance was larger than the value expected for the 95 percentile of the distribution of the training set (Fig 3). Interestingly, the three compounds showed errors in affinity prediction of 8.0, 7.8, and 7.0 kJ mol-1, respectively (Table 4).

Fig 3. Distribution of ΔV Ele and ΔV VdW values (Eq 2) for training-set (black circles) and test-set (white squares) MD simulations.

Fig 3

The dashed line represents the confidence for the 95 percentile of the training set distribution. The simulations from the test set that are not comprised in this interval are labeled according to the corresponding compound ID.

Whereas these three outliers were not identified by our chemical similarity analysis, the evaluation of the distance of each simulation of the query compounds from the center of the distribution of training set simulations, in the space provided by their MD-averaged interaction energies, efficiently identified three LIE predictions with too large deviation from the observed inhibitory potency. However, several other compounds with significant deviations between calculated and observed values for ΔG bind were not detected (Fig 1 and Table 4).

Per-residue analysis of ligand-protein interactions

To evaluate the space of the electrostatic and steric interactions between the protein and compounds of the training set simulations, a per-residue decomposition was performed of the average nonbonded interaction energies (〈V VdW lig-surr〉 or 〈V Ele lig-surr〉) between the ligand and the amino acids lining the active site.

A decomposition of the electrostatic energy contributes indicated that the main residues involved in the interaction with the ligands were Thr118, Asn312, Asp313, Thr124, Asp320, and the water molecules in the active site (Figure A in S1 File), while Phe226, Ala317, Gly316, Asp313, Phe260, Ile117, Phe125, Leu497, and water represented the residues with the highest average van der Waals interaction energies 〈V VdW lig-surr〉 contributes (Figure B in S1 File). These observations overlap with previous findings: analysis of the CYP 1A2 crystal structure [20] indicated that Thr118 and Thr124 lie in a hydrophilic region that is considered important for the binding of polar substrates, while Phe226, Ala317, and Gly316 form a planar surface that is important for recognizing flat molecules. The crystal structure showed also that Phe125 contributes to tight binding of the co-crystallized alpha-naphthoflavone [20]. Additionally, a recent computational study by Yang et al. [74] suggested Asn312, Asp313, and Phe260 to be critical for binding of acetominophen to CYP 1A2.

PCA on the decomposed electrostatic contributes for each compound of the training set led to identification of two principal components. The first component accounted for 71.3% of the per-residue energy contributes variability, and was dominated by the interaction with the water molecules present in the catalytic site. Interactions with Asp313 and Thr118 dominated the second component, which explains 15.2% of the overall variability (Fig 4, panels A and B). The plot of the first two PC scores showed that the compounds are clustered in three groups: (1) compounds with non-polar groups (compounds 27, 28, 30, 32, 35, 36, 38, 42, 62, 65, 67, 68, 29, 31, 33, 34, 37, 39, 40, 41, 43, 63, 66, 69) were clustered in a small region with positive scores on the first component, and scores close to 0 on the second; (2) compounds 13, 14 and 17 (bearing a high number of hydrogen bond donor groups or acceptors) showed high scores on both principal components; and (3) all other compounds were homogeneously distributed around the origin (Fig 4, panel C). Score and orthogonal outliers in the test set were detected in the rotated space (Fig 4, panels C and D). Compounds 9, 12 and 20 were orthogonal outliers, while 14 and 17 were both orthogonal and score outliers. Errors in prediction for these compounds were respectively -11.6, -6.4, 7.0, 8.0, and 7.8 kJ mol-1 (Table 4).

Fig 4. Per-residue decomposition analysis of the electrostatic interaction energies between the ligand and its surrounding in the protein-ligand simulations.

Fig 4

(A) PCA loading plot for training-set electrostatic interaction energies; (B) Active site of CYP 1A2 from the crystallographic structure; heme group (purple carbon atoms), co-crystallized ligand α-naphthoflavone (yellow carbon atoms), and amino acids with high loading on the first two PCs (in red) are explicitly represented. (C) PCA score plot for the training-set (black circles) and test-set (white squares) compounds for the first two PCs. (D) Orthogonal distance (OD) of the compounds of the training set (black circles) and test set (white squares) from the model with 2 PCs. The dashed horizontal line represents the critical orthogonal distance, calculated for the training-set distribution.

In an analogue PCA performed on the per-residue decomposition of the average van der Waals interaction energies 〈V VdW lig-surr〉, 89.5% of the multivariate variability was accounted by 4 principal components. High loading on the first principal component (45.7% of the overall variability) was shown by the interaction with residues in the center of the catalytic site (Gly316 and Ala317 in helix I, Phe226 and Leu497 on the opposite side; Fig 5, panels A and B). On the second principal component (21.6% of the overall variability), a positive loading was provided by interaction with residues in the portion of the pocket delimited by the helices F, G, and I (residues Phe226, Phe260, Asp313, Thr118) while opposite contributes were provided by the interaction with residues on the other side of the catalytic site (Water, Leu497, Ile386, Thr321; Fig 5, panels A and B). Components 3 and 4 explained 13.2% and 9.0% of the multivariate variability, respectively, but their meaning is more uncertain. Analysis of the scores showed a homogeneous distribution of the compounds, except for compounds 14 and 17, which were found to be score outliers (Fig 5, panel C). Compounds 14 and 17 appeared to be also orthogonal outliers in the evaluation of the residuals, together with compounds 20, 57, and 59 (Fig 5, panel D). Errors in prediction for these compounds were 8.0, 7.8, 7.0, 5.4, and 5.5 kJ mol-1, respectively.

Fig 5. Per-residue decomposition analysis of the van der Waals interaction energies between the ligand and its surrounding in the protein-ligand simulations.

Fig 5

(A) PCA loading plot for training-set van der Waals interaction energies; (B) Active site of CYP 1A2 from the crystallographic structure; heme group (purple carbon atoms), the co-crystallized ligand α-naphthoflavone (yellow carbon atoms), and amino acids with high loadings in the PCA are explicitly represented. Residues with high positive loadings on the first PC are depicted in green; Residues with high loadings on the second component are also represented, both for positive (blue) and negative values (red). (C) PCA score plot for the training-set (black circles) and test-set (white squares) compounds for the first two PCs. (D) Orthogonal distance (OD) of the compounds of the training set (black circles) and test set (white squares) from the model with 4 PCs. The dashed horizontal line represents the critical orthogonal distance, calculated for the training-set distribution.

The interaction of a ligand with its (off-)target protein is depending on both steric and electrostatic properties of the binding site. Within a large and malleable binding pocket such as the catalytic site of several drug-metabolizing CYPs, a ligand can bind in different conformations and in different topological parts of the pocket (i.e., with different electrostatic and steric properties). While the averaged electrostatic and van der Waals ensemble energies are typically unable to give fine details about the topology of the interactions, analysis of the nonbonded interaction energy contributes from each residue can provide more details about how ligands interact with the catalytic site. The per-residue decomposition of the interaction energies was used to identify different areas of the catalytic site that are considered important for the recognition of CYP 1A2 substrates and inhibitors, and PCA on these data was able to characterize the regions of the catalytic site that are critical for binding of specific ligands. Moreover, we could use this analysis to identify query compounds presenting a pattern of interactions that was not properly sampled by the training set (orthogonal outliers), or for which the interactions with single residues were unusually strong, as compared to what was observed within the training set (score outliers). Both groups of molecules showed absolute errors in affinity prediction between 5.4 and 11.6 kJ mol-1, which are considerably larger when compared to the errors obtained for the training set (Table 3).

Combined approach to analyse the prediction reliability

Similar to the ADAN approach of Carrió et al. for traditional QSAR methods [17], we adopted a combined approach in which the analyses presented above were used as criteria to assess the reliability of LIE predictions. These criteria were based on (A) chemical similarity analysis, (B) average interaction energy distribution analysis, (C) per-residue analysis of the ligand-protein electrostatic interactions, and (D) per-residue analysis of the ligand-protein van der Waals interactions. According to the total number of analyses (0 to 4) in which they were found to be outliers, the query compounds were thus classified in 5 categories (Fig 6). 14 test set compounds were found to be outlier in none of the analyses (category 0), and showed a SDEP of 4.6 kJ mol-1 between calculated and experimental ΔG bind, Fig 6. Four compounds (9, 57, 59, 71) were outliers in one analysis (category 1), for which the SDEP was 7.5 kJ mol-1. One compound (12), with an error in prediction of 6.4 kJ mol-1) was outlier in two analyses, while three compounds were outlier in three analyses and showed the highest SDEP of 7.6 kJ mol-1 (14, 17, 20). None of the compounds of the test set was found to be outlier in all analyses.

Fig 6. Prediction errors obtained for the external test set compounds.

Fig 6

The compounds were grouped in a category according to the number of occurrences in which they were found to be an outlier according to analyses (A)-(D) in Table 4. Horizontal lines represent the standard error (SDEP) for a given category, while the boxes represent the standard deviation around this average.

Considering the obtained results, compounds that were outlier in an increasing number of analyses were found to show larger errors in prediction. Moreover, the compounds with no deviations showed a SDEP similar to the one obtained during LOO-CV of the model (4.6 vs. 4.3 kJ mol-1), indicating that our combined analysis on the nature of the ligands and their interactions with the protein of interest can be used for assessing the reliability of predictions by protein-structural and -dynamical based models.

Conclusions

An iterative LIE model for predicting CYP 1A2 binding affinities was presented. A dataset of IC 50 values for 73 compounds characterized by a large chemical diversity was collected from three different literature sources, and the experimental uncertainty was estimated by measuring the inhibitory potency in-house for a sample of compounds from each source under the same conditions. The RMSD between in-house measured IC 50 values and literature values of 3.3 kJ mol-1 (with a maximum variation of 6.9 kJ mol-1) was used as limit to identify possibly overfitted models.

From the dataset, 35 compounds that covered a broad range of chemical diversity (with an average Tanimoto Score among pairs of compounds of 0.39) were used to calibrate our LIE model, which showed high correlation between calculated and observed values for the binding free energy (r2 = 0.68; q2 CV = 0.66). Errors in regression (RMSE) and in prediction in cross validation (SDEPCV) were comparable to the uncertainty in the experimental data (RMSE = 4.1 kJ mol-1; SDEPCV = 4.3 kJ mol-1), indicating that the MD-based approach was able to properly address the protein and ligand flexibility that are crucial in modeling CYP450-ligand interactions.

Compounds that were not included in the training set were used to evaluate the predictivity of the model. Since we provided a parameterized model, the inhibitory potency for a query (test-set) compound can only be accurately predicted if it shows sufficient similarity with the elements of the training set that determine the model. Assuming that such similarity within a structure-based model depends on both the compound and the specific interactions it undertakes with the (off-)target protein, we proposed a set of analyses to evaluate the different elements playing a role in determining a LIE-based prediction. Analysis of the chemical structure of the ligands only was very limited in detecting test-set compounds with large deviations in prediction. By also taking into account the protein-ligand interactions and by combining results from multiple analyses in a single score, it was possible to obtain good estimation of the prediction reliability. Predictions that were found to be most reliable (i.e., for which the compounds were not found to be outlier in any of the applied analyses) showed a SDEP of 4.6 kJ mol-1, which is comparable to the SDEPCV obtained, while predictions for groups of compounds with decreasing level of reliability showed a SDEP of 7.5, 6.4, and 7.6 kJ mol-1, respectively.

In this work we presented a model for CYP 1A2 binding affinity calculation using an approach based on a highly automatable and scalable protocol that has proved to be applicable in predicting binding affinities for compounds characterized by broad chemical diversity. Additionally, we provided an innovative method that was able to estimate the reliability of single MD-based predictions, thereby efficiently including information on the ligand-protein interactions. Considering the continuous increment in accessibility of computational power, the comprehensive method we proposed here represents a promising alternative to traditional quantitative structure-activity(/property) relationship models for a vast range of biological (off-)targets for which a 3D structure is available and in which protein structural and/or dynamical information is crucial to model the interaction with ligands and predict the property or properties of interest.

Supporting Information

S1 File. Per-residue decomposition of the average electrostatic interaction energies 〈V Ele〉 (Figure A).

Per-residue decomposition of the average Van der Waals interaction energies 〈V VdW(Figure B). Structures and IC 50 values from literature for the data set of compounds (Table A). Reacting chemical groups in (quasi-)irreversible CYP inhibitors (Table B). Overview of the average electrostatic 〈V Ele〉 and van der Waals 〈V VdW〉 interaction energies (Table C).

(PDF)

Acknowledgments

We gratefully acknowledge Dr. Jörg Wichard and Dr. Thomas Steger-Hartmann (Bayer HealthCare, Berlin, Germany) for supplying us with experimental data. This work was supported by the Innovative Medicines Initiative Joint Undertaking (IMI-JU) under grant agreement no. 115002 (eTOX), resources of which are composed of financial contribution from the European Union Seventh Framework Programme (FP7/20072013) and EFPIA companies in kind contribution, and by The Netherlands Organisation for Scientific Research (NWO, VIDI grant 723.012.105).

Abbreviations

PCA

principal component analysis

CYP

Cytochrome P450

CYP 1A2

Cytochrome P450 1A2

MD

molecular dynamics

LIE

linear interaction energy

TS

Tanimoto score

Data Availability

All relevant data are within the paper and its Supporting Information file.

Funding Statement

The work was supported by Innovative Medicines Initiative Joint Undertaking (IMI-JU) under grant agreement no. 115002 (eTOX), resources of which are composed of financial contribution from the European Union Seventh Framework Programme (FP7/20072013) and EFPIA companies in kind contribution; www.etoxproject.eu. The work was also supported by The Netherlands Organisation for Scientific Research (NWO, VIDI grant 723.012.105); www.nwo.nl. Except for manuscript approval from the eTOX executive committee, the funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1. Guengerich FP. Cytochrome P450s and Other Enzymes in Drug Metabolism and Toxicity. AAPS J. 2006; 8: E101–E111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Wienkers LC, Heath TG. Predicting in Vivo Drug Interactions from in Vitro Drug Discovery Data. Nat Rev Drug Discov. 2005; 4: 825–833. [DOI] [PubMed] [Google Scholar]
  • 3. Obach SR, Walsky RL, Venkatakrishnan K, Gaman EA, Houston JB, Tremaine LM. The Utility of in Vitro Cytochrome P450 Inhibition Data in the Prediction of Drug-drug Interactions. J Pharmacol Exp Ther. 2006; 316: 336–348. [DOI] [PubMed] [Google Scholar]
  • 4. Kirchmair J, Williamson MJ, Tyzack JD, Tan L, Bond PJ, Bender A, et al. Computational Prediction of Metabolism: Sites, Products, SAR, P450 Enzyme Dynamics, and Mechanisms. J Chem Inf Mod. 2012; 52: 617–648. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Stjernschantz E, Vermeulen NPE, Oostenbrink C. Computational Prediction of Drug Binding and Rationalisation of Selectivity Towards Cytochromes P450. Expert Opin Drug Metab Toxicol. 2008; 4: 513–527. 10.1517/17425255.4.5.513 [DOI] [PubMed] [Google Scholar]
  • 6. Guengerich FP. A Malleable Catalyst Dominates the Metabolism of Drugs. Proc Natl Acad Sci 2006; 103: 13565–13566. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Yuan R, Madani S, Wei X-X, Reynolds K, Huang S-M. Evaluation of Cytochrome P450 Probe Substrates Commonly Used by the Pharmaceutical Industry to Study in Vitro Drug Interactions. Drug Metab Dispos. 2002; 30: 1311–1319. [DOI] [PubMed] [Google Scholar]
  • 8. Busby W, Ackermann J, Crespi C. Effect of Methanol, Ethanol, Dimethyl Sulfoxide, and Acetonitrile on in Vitro Activities of cDNA-expressed Human Cytochromes P-450. Drug Metab Dispos 1999; 27: 246–249. [PubMed] [Google Scholar]
  • 9. Correia MA, Ortiz de Montellano PR. Inhibition of Cytochrome P450 Enzymes In: Cytochrome P450: Structure, Mechanism, and Biochemistry; 3e; Ortiz de Montellano PR, Editor; Kluwer Academic / Plenum Publishers, New York, 2005; p. 247–322. [Google Scholar]
  • 10. Gleeson MP, Davis AM, Chohan KK, Paine SW, Boyer S, Gavaghan CL, et al. Generation of In-silico Cytochrome P450 1A2, 2C9, 2C19, 2D6, and 3A4 Inhibition QSAR Models. J Comput Aid Mol Des. 2007; 21: 559–573. [DOI] [PubMed] [Google Scholar]
  • 11. Sridhar J, Liu J, Foroozesh M, Stevens CLK. Insights on Cytochrome P450 Enzymes and Inhibitors Obtained Through QSAR Studies. Molecules 2012; 17: 9283–9305. 10.3390/molecules17089283 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Aqvist J, Medina C, Samuelsson JE. A New Method for Predicting Binding Affinity in Computer-Aided Drug Design. Protein Eng. 1994; 7: 385–391. [DOI] [PubMed] [Google Scholar]
  • 13. Stjernschantz E, Oostenbrink C. Improved Ligand-protein Binding Affinity Predictions Using Multiple Binding Modes. Biophys J. 2010; 98: 2682–2691. 10.1016/j.bpj.2010.02.034 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Perić-Hassler L, Stjernschantz E, Oostenbrink C, Geerke DP. CYP 2D6 Binding Affinity Predictions Using Multiple Ligand and Protein Conformations. Int J Mol Sci. 2013; 14: 24514–24530. 10.3390/ijms141224514 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Vosmeer CR, Pool R, Van Stee MF, Peric-Hassler L, Vermeulen NPE, Geerke DP. Towards Automated Binding Affinity Prediction Using an Iterative Linear Interaction Energy Approach. Int J Mol Sci. 2014; 15: 798–816. 10.3390/ijms15010798 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Sahigara F, Mansouri K, Ballabio D, Mauri A, Consonni V, Todeschini R. Comparison of Different Approaches to Define the Applicability Domain of QSAR Models. Molecules. 2012; 17: 4791–4810. 10.3390/molecules17054791 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Carrió P, Pinto M, Ecker G, Sanz F, Pastor M. Applicability Domain ANalysis (ADAN): a Robust Method for Assessing the Reliability of Drug Property Predictions. J Chem Inf Model. 2014; 54: 1500–1511. 10.1021/ci500172z [DOI] [PubMed] [Google Scholar]
  • 18. Smith DA, Ackland MJ, Jones BC. Properties of Cytochrome P450 Isoenzymes and Their Substrates Part 1: Active Site Characteristics. Drug Discov Today. 1997; 2: 406–414. [Google Scholar]
  • 19. Ekins S, de Groot MJ, Jones JP. Pharmacophore and Three-dimensional Quantitative Structure Activity Relationship Methods for Modeling Cytochrome P450 Active Sites. Drug Metab Dispos. 2001; 29: 936–944. [PubMed] [Google Scholar]
  • 20. Sansen S, Yano JK, Reynald RL, Schoch GA, Griffin KJ, Stout CD, et al. Adaptations for the Oxidation of Polycyclic Aromatic Hydrocarbons Exhibited by the Structure of Human P450 1A2. J Biol Chem. 2007; 282: 14348–14355. [DOI] [PubMed] [Google Scholar]
  • 21. Vasanthanathan P, Taboureau O, Oostenbrink C, Vermeulen NPE, Olsen L, Jorgensen FS. Classification of Cytochrome P450 1A2 Inhibitors and Noninhibitors by Machine Learning Techniques. Drug Metab Dispos. 2009; 37: 658–664. 10.1124/dmd.108.023507 [DOI] [PubMed] [Google Scholar]
  • 22. Novotarskyi S, Sushko I, Körner R, Pandey AK, Tetko IV. A Comparison of Different QSAR Approaches to Modeling CYP450 1A2 Inhibition. J Chem Inf Model. 2011; 51: 1271–1280. 10.1021/ci200091h [DOI] [PubMed] [Google Scholar]
  • 23. Iori F, da Fonseca R, Ramos MJ, Menziani MC. Theoretical Quantitative Structure-activity Relationships of Flavone Ligands Interacting with Cytochrome P450 1A1 and 1A2 Isozymes. Bioorgan Med Chem. 2005; 13: 4366–4374. [DOI] [PubMed] [Google Scholar]
  • 24. Roy K, Roy PP. Comparative QSAR Studies of CYP1A2 Inhibitor Flavonoids Using 2D and 3D Descriptors. Chem Biol Drug Des. 2008; 72: 370–382. 10.1111/j.1747-0285.2008.00717.x [DOI] [PubMed] [Google Scholar]
  • 25. Korhonen LE, Rahnasto M, Mähönen NJ, Wittekindt C, Poso A, Juvonen RO, et al. Predictive Three-dimensional Quantitative Structure-activity Relationship of Cytochrome P450 1A2 Inhibitors. J Med Chem. 2005; 48: 3808–3815. [DOI] [PubMed] [Google Scholar]
  • 26. Sridhar J, Foroozesh M, Stevens CLK. QSAR Models of Cytochrome P450 Enzyme 1A2 Inhibitors Using CoMFA, CoMSIA and HQSAR. SAR QSAR Environ Res. 2011; 22: 681–697. 10.1080/1062936X.2011.623320 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Vasanthanathan P, Olsen L, Jørgensen FS, Vermeulen NPE, Oostenbrink C. Computational Prediction of Binding Affinity for CYP1A2-Ligand Complexes Using Empirical Free Energy Calculations. Drug Metab Dispos. 2010; 38: 1347–1354. 10.1124/dmd.110.032946 [DOI] [PubMed] [Google Scholar]
  • 28. Burke MD, Mayer RT. Differential Effects of Phenobarbitone and 3-Methylcholanthrene Induction on the Hepatic Microsomal Metabolism and Cytochrome P-450-binding of Phenoxazone and a Homologous Series of its N-Alkyl Ethers (Alkoxyresorufins). Chem Biol Interact. 1983; 45: 243–258. [DOI] [PubMed] [Google Scholar]
  • 29. Omura T, Sato R. The Carbon Monoxide-Binding Pigment of Liver Microsomes. I. Evidence for its Hemoprotein Nature. J Biol Chem. 1964; 239: 2370–2378. [PubMed] [Google Scholar]
  • 30. Cheng Y, Prusoff WH. Relationship Between the Inhibition Constant (k1) and the Concentration of Inhibitor which Causes 50 per cent Inhibition (I50) of an Enzymatic Reaction. Biochem Pharmacol. 1973; 22: 3099–3108. [DOI] [PubMed] [Google Scholar]
  • 31.Molecular Operating Environment (MOE) version 2012.10. 1010 Sherbooke St. West, Suite #910, Montreal, QC, Canada, H3A 2R7: Chemical Computing Group Inc.; 2012.
  • 32. O'Boyle NM, Banck R, James CA, Morley C, Vandermeersch T, Hutchison GR. Open Babel: An Open Chemical Toolbox. J Cheminform. 2011; 3: 33 10.1186/1758-2946-3-33 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Halgren TA. Merck Molecular Force Field. I. Basis, Form, Scope, Parameterization, and Performance of MMFF94. J Comput Chem. 1996; 17: 490–519. [Google Scholar]
  • 34. Case DA, Darden TA, Cheatham TE III, Simmerling CL, Wang J, Duke RE, et al. AMBER 13. San Francisco: University of California; 2012. [Google Scholar]
  • 35. Jakalian A, Bush BL, Jack DB, Bayly CI. Fast, Efficient Generation of High-Quality Atomic Charges. AM1-BCC Model: I. Method. 2000; 21: 132–146. [DOI] [PubMed] [Google Scholar]
  • 36. Korb O, Stützle T, Exner TE. An Ant Colony Optimization Approach to Flexible Protein-Ligand Docking. Swarm Intell. 2007; 1: 115–134. [Google Scholar]
  • 37. Korb O, Stützle T, Exner TE. Empirical Scoring Functions for Advanced Protein-Ligand Docking with PLANTS. J Chem Inf Model. 2009; 49: 84–96. 10.1021/ci800298z [DOI] [PubMed] [Google Scholar]
  • 38. MacQueen J. Some Methods for Classification and Analysis of Multivariate Observations. Proc Fifth Berkeley Symp on Math Statist and Prob. 1967; 1: 281–297. [Google Scholar]
  • 39. Hess B, Kutzner C, van der Spoel D, Lindahl E. GROMACS 4: Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation. J Chem Theory Comp. 2008; 4: 435–447. [DOI] [PubMed] [Google Scholar]
  • 40. Sousa Da Silva AW, Vranken WF. ACPYPE—AnteChamberPYthon Parser interfacE. BMC Res Notes. 2012; 5:367 10.1186/1756-0500-5-367 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Wang J, Wolf RM, Caldwell JW, Kollman PA, Case DA. Development and Testing of a General Amber Force Field. J Comput Chem. 2004; 25: 1157–1174. [DOI] [PubMed] [Google Scholar]
  • 42. Hornak V, Abel R, Okur A, Strockbine B, Roitberg A, Simmerling C. Comparison of Multiple Amber Force Fields and Development of Improved Protein Backbone Parameters. Proteins. 2006; 65: 712–725. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Shahrokh K, Orendt A, Yost GS, Cheatham TE. Quantum Mechanically Derived AMBER-compatible Heme Parameters for Various States of the Cytochrome P450 Catalytic Cycle. J Comput Chem. 2012; 33: 119–133. 10.1002/jcc.21922 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Jorgensen WL, Madura JD. Quantum and Statistical Mechanical Studies of Liquids. 25. Solvation and Conformation of Methanol in Water. J Am Chem Soc. 1983; 105: 1407–1413. [Google Scholar]
  • 45. Hockney RW. The Potential Calculation and some Applications. New York: Academic Press, inc.; 1970. [Google Scholar]
  • 46. Feenstra KA, Hess B, Berendsen HJC. Improving Efficiency of Large Time-scale Molecular Dynamics Simulations of Hydrogen-rich Systems. J Comput Chem. 1999; 20: 786–798. [DOI] [PubMed] [Google Scholar]
  • 47. Hess B, Bekker H, Berendsen HJC, Fraaije JGEM. LINCS: A Linear Constraint Solver for Molecular Simulations. J Comput Chem. 1997; 18: 1463–1472. [Google Scholar]
  • 48. Berendsen HJC, Postma JPM, Di Nola A, Haak JR. Molecular Dynamics with Coupling to an External Bath. J Chem Phys. 1984; 81: 3684–3690. [Google Scholar]
  • 49. Essmann U, Perera L, Berkowitz ML, Darden T, Lee H, Pedersen LG. A Smooth Particle Mesh Ewald Method. J Chem Phys. 1995; 103: 8577–8592. [Google Scholar]
  • 50. Carlson HA, Jorgensen WL. An Extended Linear Response Method for Determining Free Energies of Hydration. J Phys Chem. 1995; 99: 10667–10673. [Google Scholar]
  • 51. Hritz J, Oostenbrink C. Efficient Free Energy Calculations for Compounds with Multiple Stable Conformations Separated by High Energy Barriers. J Phys Chem B. 2009; 113: 12711–12720. 10.1021/jp902968m [DOI] [PubMed] [Google Scholar]
  • 52. Durant JL, Leland BA, Henry DR, Nours JG. Reoptimization of MDL Keys for Use in Drug Discovery. J Chem Inf Comput Sci. 2002; 42: 1273–1280. [DOI] [PubMed] [Google Scholar]
  • 53. Nikolova N; Jaworska J. Approaches to Measure Chemical Similarity–a Review. QSAR Comb Sci. 2003; 22: 1006–1026. [Google Scholar]
  • 54.RDKit version 2014.09.1, Open-Source Cheminformatics. http://www.rdkit.org.
  • 55. De Maesschalck R, Jouan-Rimbaud D, Massart DL. The Mahalanobis Distance. Chemometr Intell Lab. 2000; 50: 1–18. [Google Scholar]
  • 56. Hubert M, Rousseeuw PJ, Vanden Branden K. ROBPCA: A New Approach to Robust Principal Components. Technometrics. 2005; 47: 64–79. [Google Scholar]
  • 57. Bayer HealthCare databases for CYP inhibition, made available within eTOX consortium.
  • 58. Cohen L, Remley M, Raunig D, Vaz A. In Vitro Drug Interactions of Cytochrome P450: An Evaluation of Fluorogenic to Conventional Substrates. Drug Metab Dispos. 2003; 31: 1005–1015. [DOI] [PubMed] [Google Scholar]
  • 59.The value for K m for use in the Cheng-Prusoff equation was taken from: Masimirembwa CM, Otter C, Berg M, Jönsson M, Leidvik B, Jonsson E, et al. Heterologous expression and kinetic characterization of human cytochromes P-450: validation of a pharmaceutical tool for drug metabolism research. Drug Metab Dispos. 1999; 27: 1117–1122. [PubMed] [Google Scholar]
  • 60. Karjalainen MJ, Neuvonen PJ, Backman JT. In Vitro Inhibition of CYP1A2 by Model Inhibitors, Anti-inflammatory Analgesics and Female Sex Steroids: Predictability of in Vivo Interactions. Basic Clin Pharmacol. 2008; 103: 157–165. [DOI] [PubMed] [Google Scholar]
  • 61. Spaldin V, Madden S, Adams DA, Edwards RJ, Davies DS, Park BK. Determination of Human Hepatic Cytochrome P450 1A2 Activity in Vitro Use of Tacrine as an Isoenzyme-specific Probe. Drug Metab Dispos. 1995; 23: 929–934. [PubMed] [Google Scholar]
  • 62. Oldham H, Clarke S. In Vitro Identification of the Human Cytochrome P450 Enzymes Involved in the Metabolism of R(+)-and S(−)-carvedilol. Drug Metab Dispos. 1997; 25: 970–977. [PubMed] [Google Scholar]
  • 63. Katoh M, Nakajima M, Shimada N, Yamazaki H, Yokoi T. Inhibition of Human Cytochrome P450 Enzymes by 1,4-dihydropyridine Calcium Antagonists: Prediction of in Vivo Drug-drug Interactions. Eur J Clin Pharmacol. 2000; 55: 843–852. [DOI] [PubMed] [Google Scholar]
  • 64. Rendic S. Summary of Information on Human CYP Enzymes: Human P450 Metabolism Data. Drug Metab Rev. 2002; 34: 83–448. [DOI] [PubMed] [Google Scholar]
  • 65. Cho US, Park EY, Dong MS, Park BS, Kim K, Kim KH. Tight-binding Inhibition by Alpha-naphthoflavone of Human Cytochrome P450 1A2. BBA-Proteins Proteom. 2003; 1648: 195–202. [DOI] [PubMed] [Google Scholar]
  • 66. Ko JW, Desta Z, Soukhova NV, Tracy T, Flockhart DA. In Vitro Inhibition of the Cytochrome P450 (CYP450) System by the Antiplatelet Drug Ticlopidine: Potent Effect on CYP2C19 and CYP2D6. Brit J Clin Pharmacol. 2000; 49: 343–351. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67. Lautraite S, Musonda AC, Doehmer J, Edwards GO, Chipman JK. Flavonoids Inhibit Genetic Toxicity Produced by Carcinogens in Cells Expressing CYP1A2 and CYP1A1. Mutagenesis. 2002; 17: 45–53. [DOI] [PubMed] [Google Scholar]
  • 68. Breinholt VM, Offord EA, Brouwer C, Nielsen SE, Brøsen K, Friedberg T. In Vitro Investigation of Cytochrome P450-mediated Metabolism of Dietary Flavonoids. Food Chem Toxicol. 2002; 40: 609–616. [DOI] [PubMed] [Google Scholar]
  • 69. Kazui M, Nishiya Y, Ishizuka T, Hagihara K, Farid NA, Okazaki O, et al. Identification of the Human Cytochrome P450 Enzymes Involved in the Two Oxidative Steps in the Bioactivation of Clopidogrel to Its Pharmacologically Active Metabolite. Drug Metab Dispos. 2010; 38: 92–99. 10.1124/dmd.109.029132 [DOI] [PubMed] [Google Scholar]
  • 70. Wei X, Dai R, Zhai S, Thummel KE, Friedman FK, Vestal RE. Inhibition of Human Liver Cytochrome P-450 1A2 by the Class IB Antiarrhythmics Mexiletine, Lidocaine, and Tocainide. J Pharm Exp Ther. 1999; 289: 853–858. [PubMed] [Google Scholar]
  • 71. Racha JK, Rettie AE, Kunze KL. Mechanism-based Inactivation of Human Cytochrome P450 1A2 by Furafylline: Detection of a 1:1 Adduct to Protein and Evidence for the Formation of a Novel Imidazomethide Intermediate. Biochemistry-US. 1998; 37: 7407–7419. [DOI] [PubMed] [Google Scholar]
  • 72. Masubuchi Y, Hosokawa S, Horie T, Suzuki T, Ohmori S, Kitada M, et al. P450 Isozymes Involved in Propranolol Metabolism in Human Liver Microsomes. The Role of CYP2D6 as Ring-hydroxylase and CYP1A2 as N-desisopropylase. Drug Metab Dispos. 1994; 22: 909–915. [PubMed] [Google Scholar]
  • 73. Sanderink GJ, Bournique B, Stevens J, Petry M, Martinet M. Involvement of Human CYP1A Isoenzymes in the Metabolism and Drug Interactions of Riluzole in Vitro. J Pharm Exp Ther. 1997; 282: 1465–1472. [PubMed] [Google Scholar]
  • 74. Yang Y, Wong SE, Lightstone FC. Understanding a substrate's product regioselectivity in a family of enzymes: a case study of acetominophen binding in cytochrome P450s. PLoS One 2014; 9(2): e87058 10.1371/journal.pone.0087058 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 File. Per-residue decomposition of the average electrostatic interaction energies 〈V Ele〉 (Figure A).

Per-residue decomposition of the average Van der Waals interaction energies 〈V VdW(Figure B). Structures and IC 50 values from literature for the data set of compounds (Table A). Reacting chemical groups in (quasi-)irreversible CYP inhibitors (Table B). Overview of the average electrostatic 〈V Ele〉 and van der Waals 〈V VdW〉 interaction energies (Table C).

(PDF)

Data Availability Statement

All relevant data are within the paper and its Supporting Information file.


Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES