Abstract
Cytochrome P450 (P450)-mediated drug-drug interactions (DDIs) are responsible for most adverse drug interactions, and occur when 2 concurrently administered drugs inhibit, upregulate, or are substrates of the same target enzyme. A machine learning approach enables the detection of DDIs with rarely used drugs, as well as newly approved drugs. To facilitate this, we present a framework for predicting DDIs by first predicting P450 interactions for both drugs, generating a fingerprint based on the predictions in addition to the molecular structures of the drugs, and training a machine learning model to predict the overall interaction. After optimization, the model detected potential DDIs with 85% accuracy, representing an improvement over a DDI-only model (ie, a model trained on structure-based fingerprints without supporting P450 model predictions). We also present a corresponding adverse outcome pathway to allow for increased model explainability through visualizing each predicted P450 interaction, further enhancing its real-world applicability. Finally, we show the importance of the model applicability domain to DDI models by demonstrating how the performance of our model degrades as the inference set becomes dissimilar to the training data.
Significance Statement
Polypharmacotherapy (especially in older populations) results in more drugs prescribed concurrently, creating an increased risk of drug-drug interactions and adverse drug reactions. Computational tools to predict potential drug-drug interactions could accurately aid in reducing risk for the patient and be used in the early stages of drug design to avoid such undesirable molecular interactions.
Key words: Drug-drug interactions, Ensemble, Machine learning, P450
Visual Abstract
Schematic of drug-drug interaction models with supporting machine learning models. The 3 dots are intended to denote that additional data and models were also included in pretraining.
1. Introduction
The major family of enzymes responsible for phase 1 metabolism is the cytochrome P450 (P450) superfamily.1, 2, 3, 4 These enzymes are ubiquitously expressed in different tissues, such as the lung and liver, and are a first line of defense against xenobiotics.5, 6, 7 Metabolism by the main drug-metabolizing enzymes, such as CYP2C9, CYP2D6, and CYP3A4,8,9 has been modeled computationally. These models, which have included quantitative structure activity relationships, 3-dimensional pharmacophores, and docking, have used a wide range of inputs, such as molecule structure, to predict the affinity of a molecule for each P450 enzyme or its inhibition.9, 10, 11, 12, 13 Molecules can interact with each P450 enzyme depending on their shape, size, electrostatics, and molecular properties, which can result in what are termed drug–drug interactions (DDIs) when 2 or more molecules interact with the same enzyme. These DDIs may also occur with other phase 1 as well as with phase 2 enzymes14,15 and transporters impacting drug development.16
More broadly, DDIs are defined as the alteration of a drug’s activity due to the coadministration of additional drugs.17 DDIs can be classified as pharmacodynamic or pharmacokinetic. Pharmacodynamic DDIs can be synergistic, additive, or antagonistic, and occur when the pharmacological effect of one drug is changed by another. On the other hand, pharmacokinetic DDIs occur when the absorption, distribution, metabolism, or excretion of one drug is affected by another drug. DDIs can range in severity, which is typically classified as minor, moderate, or major.18 That said, DDI classification is somewhat subjective, as it is typically assessed by clinical relevance according to a physician. Minor DDIs typically do not require any changes in treatment, but major DDIs can be life-threatening and require immediate medical attention. A positive correlation between severe or multiple DDIs and death among elderly patients has been found.19 One study found that most clinical DDIs do not require additional care, as they either do not manifest or do not cause significant issues.20 Regardless, when DDIs are present in patients, they typically are severe.21
Polypharmacotherapy (the concurrent use of multiple medications) is becoming more prevalent to treat complex conditions, especially in elderly populations.22,23 A 2019 study found that within a population of 237 retirement home residents, over 87% used 5 or more drugs concurrently.22 Additionally, over 43% used 10 or more drugs concurrently. As the number of drugs increases, the likelihood of potential DDIs increases dramatically, meaning that the prescription of concurrent drugs becomes increasingly complex.24,25
In practice, massive DDI data bases, such as Micromedex Drug-Reax,26 are used to evaluate drug combinations for potential DDIs.27 However, these data bases are typically compiled from reports in the literature, meaning that a DDI must be observed first for it to be listed in the data base.28,29 In silico methods have the advantage of being able to virtually evaluate all possible combinations of a set of drugs for DDIs, even if no information is available in the literature about their interactions.30 This may be especially beneficial for applications of novel or infrequently used drugs. Additionally, “in silico” methods provide an opportunity to decrease the number of costly in vivo or in vitro DDI studies, which can save resources and time.31
We now describe using DDInter,29 as we found its structure and comprehensiveness to lend itself well to our study, as we rely on the P450-related metabolism-specific DDI subdataset. DDInter is a comprehensive DDI data base that contains over 200,000 clinically relevant DDIs between 1833 Food and Drug Administration (FDA)–approved drugs. The data base was also curated by a team of clinical pharmacists. It is intended to be used as a tool for clinicians to detect and avoid potential unwanted DDIs when prescribing medications. The data base is divided into 6 different subsets by mechanism type, such as metabolism, excretion, antagonism, and more. There is also a seventh category that captures confirmed DDIs with unknown causes. Each DDI is annotated with a severity, which can be either minor, moderate, major, or unknown. We have now used this data base as the foundation of our DDI model, before supplementing it with additional mechanism-specific information.
2. Materials and methods
We have studied the effects of P450 inhibition predictions on the performance of various DDI classification models. This involved the use of 15 datasets: 1 describing DDIs, 6 describing P450 substrates, 6 describing P450 inhibition, and 2 describing the pregnane X receptor (PXR). Supplemental Table 1 describes each dataset used in this study and its source. Furthermore, we trained and evaluated a wide variety of different model configurations on each dataset, as well as the final model combination, which involved all 15 datasets (Fig. 1). Finally, we developed adverse outcome pathways (AOPs) for each DDI prediction, aiding interpretability and boosting the applicability of our DDI model system.
Fig. 1.
Schematic of drug-drug interaction models with supporting machine learning models. The 3 dots are intended to denote that additional data and models were also included in pretraining.
2.1. DDI dataset
A subset containing 56,368 DDIs was downloaded from DDInter.29 This list of interactions involves a set of 1757 drugs, all of which are orally delivered. This subset was chosen to include all the metabolism-specific interactions found in the DDInter data base, meaning that the majority are P450-mediated interactions. The DDInter dataset maps a pair of drugs to an interaction severity, which can be either minor (no treatment change required), moderate (use caution), major (potentially life-threatening), or unknown (an interaction is present, but the severity is poorly described by the source), as defined by the federally recognized drug use data base, DRUGDEX. Additional DDIs are available from DDInter, but these are caused by a wide range of nonmetabolic mechanisms, hence our focus on the metabolism-related (P450-mediated) subset only.
Simplified Molecular Input Line Entry System (SMILES) structures for each drug were sourced from the National Institutes of Health’s Chemical Identifier Resolver data base32 or PubChem.33 If the structure for a particular drug was unavailable in both resources, all interactions involving that drug were removed from the dataset. This primarily included common foods (eg, garlic). A total of 199 of the 1757 drugs were removed from the dataset during structure identification and standardization, corresponding to 5119 of the 56,368 interactions (9%).
2.2. P450 datasets
In addition to the main DDI dataset, we selected and curated 14 datasets that describe P450 inhibition, P450 substrates, and PXR.11,34, 35, 36 The selected enzymes (CYP1A2, CYP2C9, CYP2B6, CYP2C19, CYP2D6, and CYP3A4) and associated datasets were chosen for their crucial role in drug metabolism, as P450 families 1–3 metabolize approximately 80% of drugs.7,37 Six datasets comprise P450 inhibition (IC50) or agonism (EC50) data, and 6 comprise P450 substrate data, with each dataset corresponding to 1 P450 enzyme. Finally, the 2 PXR datasets are significant due to PXR’s role in upregulating CYP3A4.38,39 Most of the molecules in these datasets do not appear in the DDI dataset, and vice versa. We also explored additional datasets and models for targets that upregulate P450 activity,40 such as the constitutive androstane receptor, glucocorticoid receptor, retinoid X receptor, SF-1 protein, and vitamin D receptor. However, these datasets and associated models did not improve DDI prediction performance, so we excluded them from this study. Induction model statistics (5-fold cross-validation [CV]) can be found in Supplemental Table 2. It is important to note that retinoid X receptor, SF-1, and vitamin D receptor are not known to regulate the major P450 enzymes involved in drug metabolism, and the CV recall for constitutive androstane receptor and glucocorticoid receptor was relatively poor, which could be the reason these models did not improve DDI prediction performance.
2.3. Data curation
Both the DDI dataset and supporting datasets described in the P450 Datasets section were curated using our proprietary data curation software, eClean. This step used RDKit version 2023.09.441 to canonicalize and neutralize the SMILES structures. Structures were desalted using a list of common salts. We also converted the standard value (IC50 or EC50) columns to negative log-molar units where applicable. Additionally, records with invalid values, units, or qualifiers were removed from the 14 supporting datasets. During this step, measurement values greater than 12 negative log-molar (ie, <1 pM) were removed on the grounds that such values are physically unreasonable. Nonpotent outliers were removed if the measurement values were less than the first quartile (Q) by 1.5 times the interquartile range (IQR; ie, <Q1–1.5 ∗ IQR) on the negative log-molar scale. Finally, structures with multiple measurement values were consolidated by first removing exact duplicates (which typically correspond to the same measurement pulled from multiple sources), then removing extreme outliers (1.5 ∗ IQR above or below Q2) from each group of duplicate structures, and finally averaging the remaining measurements.
To adapt the DDI dataset for use with binary classification models, we used a synthetic negative class generation method, where we assumed nonpresence in the training set implies noninteraction.42 Thus, we assigned the entire original DDI dataset (excluding unknown severity interactions) to the positive class. Any combination of drugs that did not explicitly appear in the DDI dataset was assigned to the negative class.
2.4. Baseline model, metrics, and molecular descriptors
To establish a baseline to quantify model performance, we opted to use a randomization technique to “scramble” the ground truth labels. By randomly permuting the labels, but not the structures, we effectively created a random model. This technique is sensitive to class imbalance, which can artificially improve the baseline statistics in similar ways to how a model may be affected. To avoid any potential issues from class imbalance, we balanced the DDI dataset by introducing the appropriate number of synthetic "no interaction" datapoints, as mentioned previously. When evaluating and selecting models, we discarded any model that performed worse than the baseline model. Additionally, we quantified any accuracy improvements by using the baseline model as the zero point (ie, if a model scored twice as far from the baseline as another model, it performed 100% better).
In all cases, we used a nested 5-fold CV with hyperparameter grid searching methods for model evaluation. Hyperparameter grids were set from previous modeling efforts and experimentally determined to be sufficient for the DDI study. When selecting model types and optimizing final models, we focused on maximizing recall, as we deemed false negatives to be significantly more harmful than false positives. Recall was defined as follows:
| (1) |
where TP is the number of true positive predictions and FN is the number of false negative predictions. Thus, maximizing recall minimizes false negatives. We also captured receiver operating characteristic (ROC) curves and associated areas under the ROC curves (AUCs) to show a comprehensive picture of model performance. ROC curves capture binary model performance by plotting the model’s true positive rate against the false positive rate as the classification threshold varies. AUC values closer to 1 (larger area) correspond to better model performance.
Models were built using either functional connectivity circular fingerprints (FCFP6) or extended connectivity circular fingerprints (ECFP6), in either 1024- or 2048-bit length, as molecular descriptors.43 All fingerprints were generated from standardized SMILES using RDKit. These descriptors capture molecular features in a bit vector based on atom connections for all structure subgraphs of a maximum diameter of 6. In this way, we can train 2-dimensional structure-aware DDI models, which allows us to perform inference on previously unseen drugs. For the 14 supporting models, we used the molecular fingerprints as generated. However, to capture interactions between pairs of drugs, we introduced an additional preprocessing step, which combines pairs of fingerprints by summing the fingerprint vectors bitwise (Supplemental Table 3). In this way, we encoded shared structural similarities between 2 drugs in a pair as input for the model, while still maintaining information about the individual drugs. Additionally, we eliminated bias due to the ordering of the molecular inputs. We also tried combining the fingerprints by appending one onto the end of the other, but we found that the summation technique was acceptable. We performed comparisons to determine the most effective molecular descriptors for the DDI task and found that the 1024-bit ECFP6 produces the best performance, although all fingerprinting methods produced similar results (Supplemental Fig. 1).
2.5. Model generation
All machine learning models were built using our Assay Central software44 with Python 3.12, scikit-learn 1.3.0, Pandas 2.1.4, and Numpy 1.26.3 on a 24-core AMD Ryzen Threadripper 3960× running Ubuntu 20.04 with 128 GB RAM. Data curation and preprocessing were performed on a 2023 MacBook Pro (M2 Pro CPU, 32 GB RAM) with Python 3.11.7, Numpy 1.24.3, and Pandas 2.1.4.
Initially, we trained and optimized a binary classification model on the full DDI dataset. We tested 4 fingerprint types (1024-bit FCFP6, 2048-bit FCFP6, 1024-bit ECFP6, and 2048-bit ECFP6) and classification model types (AdaBoost, random forest, naïve Bayes, logistic regression, XGBoost, k-nearest neighbors [KNN], support vector classification [SVC], and deep learning [DL]; grid search parameters are in Supplemental Tables 4–11, class separation data are in Supplemental Figs. 2–16). Based on the 5-fold CV recall score for each combination of model and fingerprint, we selected the core DDI model configuration 1024-bit ECFP6, fingerprint sum, and DL. Additionally, we trained and optimized separate classification models for each of the 6 inhibition datasets and the 2 PXR datasets. Classification models were also trained on the P450 substrate datasets, as these do not contain a continuous prediction value. Each model was trained using a predetermined hyperparameter grid (Supplemental Tables 4–11) that was empirically evaluated to perform well in similar quantitative structure-activity relationship tasks in previous case studies.45
To enable the DDI model to leverage the support models’ predictions, we modified the input fingerprints before training the overall DDI + P450 model, as shown in Fig. 1. Specifically, during DDI inference, each pair of drugs was fed into the n supporting models, resulting in a prediction vector of length 2n (step 1). Then, the prediction vector was concatenated with the fingerprint-sum ECFP6 fingerprints (step 2). We tried several different methods of combining fingerprints and prediction vectors, such as double and reverse concatenation (Supplemental Table 12), but we found that the method did not significantly affect the results. Next, we trained a new DDI model on the resulting augmented fingerprints (step 3) and evaluated the output (step 4) via CV performance metrics (Supplemental Fig. 1).
2.6. AOPs and molecular feature importance analysis
To increase the interpretability and usability of our P450-supported DDI model, we developed an AOP to describe the possible predicted P450 interactions that cause DDIs. Specifically, if 2 drugs are predicted to be inhibitors or substrates for the same P450 target, it is suggested that an interaction will occur. This P450 prediction mechanism allows us to verify the accuracy of our DDI model by cross-checking the P450 inhibition and substrate predictions with known P450 metabolism targets for an interacting pair of drugs. This can be performed by tracking the P450 model inference results during DDI model execution and then looking for instances where a P450 model predicts inhibition for both drugs in a pair.
Additionally, we performed a feature importance analysis to identify the molecular features that contribute the most to model prediction. This analysis further explains the overall DDI prediction and AOP by revealing the exact molecular features that the inhibition models draw from.
2.7. Real-world test set
To test our model’s generalizability to real-world uses, we tested it against a set of 30 drugs from the FDA drug list (Supplemental Table 13). All the drugs selected were approved in either 2022 or 2023, and none are present in the DDInter data base. We also selected 30 additional drugs from the DDInter data base. Our real-world test case simulated the use of a novel drug alongside an existing drug, as we hypothesize that this case is more common than using 2 novel drugs concurrently. Thus, for each of the novel drugs, we predicted interactions with each of the existing drugs using our tuned model for a total of 900 predictions. Then, we leveraged the DrugBank46 drug interaction data base to form a ground truth set to validate our model predictions. However, confirming the existence of interactions involving novel drugs was made more difficult by the limited clinical data. Thus, we could not assume that absence from the DrugBank data base implied noninteraction (as we did with DDInter to create negative class data for our training set). This limited us to only measuring true positives from our model. Only 10 P450-mediated DDIs out of the 900 predictions in our test set had confirmed DDIs according to DrugBank.
3. Results
3.1. Supporting models: P450 inhibition and PXR (upregulation)
Models trained on the individual P450 inhibition and PXR datasets were optimized and evaluated separately to produce the final supporting model ensemble used to enhance the efficiency of the overall DDI model. We evaluated several classification model types (AdaBoost, DL, KNN, random forest, SVC, and XGBoost) as well as various molecular fingerprinting algorithms (FCFP6 and ECFP6 as 1024-bit and 2048-bit versions). This parameter selection process is illustrated in Supplemental Fig. 1. The training results for the best fingerprint type, 1024-bit ECFP6, are shown in Table 1. For each P450 target, we selected the best-performing model for use in our final DDI + P450 model. From Table 1, we can see that the P450 inhibition models had good AUC scores around 0.8, with only the CYP2B6 and CYP3A4 models dropping slightly below. CYP2B6 had a significantly smaller dataset (n = 228), and CYP3A4 is known to be a large and complex target.
Table 1.
Five-fold cross validation results from training the P450 inhibition supporting models
| Dataset/Model | Accuracy | F1 | AUC | Cohen κ | MCC | Precision | Recall | Tn/Fp/Fn/Tp |
|---|---|---|---|---|---|---|---|---|
| CYP1A2/DL (n = 2449) | 0.867 | 0.638 | 0.885 | 0.561 | 0.579 | 0.800 | 0.532 | 1835/71/254/289 |
| CYP2C9/SVC (n = 7625) | 0.767 | 0.553 | 0.804 | 0.402 | 0.413 | 0.867 | 0.647 | 4768/1192/588/1077 |
| CYP2B6/SVC (n = 228) | 0.777 | 0.558 | 0.782 | 0.412 | 0.425 | 0.602 | 0.559 | 145/26/25/32 |
| CYP2C19/SVC (n = 2208) | 0.903 | 0.946 | 0.824 | 0.435 | 0.438 | 0.954 | 0.939 | 99/91/124/1894 |
| CYP2D6/SVC (n = 8198) | 0.842 | 0.671 | 0.867 | 0.568 | 0.569 | 0.644 | 0.701 | 5583/731/563/1321 |
| CYP3A4/SVC (n = 4127) | 0.725 | 0.548 | 0.785 | 0.363 | 0.381 | 0.465 | 0.685 | 2307/821/315/684 |
Fn, false negative; Fp, false positive; MCC, Matthews correlation coefficient; Tn, true negative; Tp, true positive.
Like the P450 inhibition models, the PXR models had good AUCs around 0.8 (Table 2). We found that the SVC models performed the best on both PXR datasets in terms of AUC. However, the Matthews correlation coefficient for both models was quite low, especially for the model built on the smaller dataset (n = 237).
Table 2.
Five-fold cross validation results from PXR model training
| Dataset/Model | Accuracy | F1 | AUC | Cohen κ | MCC | Precision | Recall | Tn/Fp/Fn/Tp |
|---|---|---|---|---|---|---|---|---|
| PXR-2/SVC (n = 6447) | 0.784 | 0.588 | 0.817 | 0.442 | 0.444 | 0.552 | 0.629 | 4055/808/587/997 |
| PXR-4/SVC (n = 237) | 0.755 | 0.842 | 0.790 | 0.294 | 0.322 | 0.789 | 0.906 | 23/42/16/156 |
Fn, false negative; Fp, false positive; MCC, Matthews correlation coefficient; Tn, true negative; Tp, true positive.
3.2. Supporting models: substrate
The P450 substrate models were trained in a similar manner to the inhibition datasets. For each P450 target, the model with the best recall was selected for use in the DDI + P450 model. These models had slightly lower AUCs but were still greater than 0.7 in nearly all cases (Table 3).
Table 3.
Five-fold cross validation results from the P450 substrate classification model training
| Dataset/Model | Accuracy | F1 | AUC | Cohen κ | MCC | Precision | Recall | Tn/Fp/Fn/Tp |
|---|---|---|---|---|---|---|---|---|
| CYP1A2/KNN (n = 639) | 0.695 | 0.637 | 0.751 | 0.374 | 0.375 | 0.642 | 0.635 | 272/96/99/172 |
| CYP2C9/SVC (n = 639) | 0.720 | 0.543 | 0.723 | 0.349 | 0.359 | 0.644 | 0.476 | 353/61/118/107 |
| CYP2B6/SVC (n = 639) | 0.725 | 0.469 | 0.722 | 0.285 | 0.288 | 0.428 | 0.520 | 385/104/72/78 |
| CYP2C19/SVC (n = 639) | 0.728 | 0.591 | 0.762 | 0.388 | 0.390 | 0.609 | 0.578 | 339/82/92/126 |
| CYP2D6/SVC (n = 639) | 0.714 | 0.646 | 0.781 | 0.407 | 0.410 | 0.681 | 0.619 | 289/80/103/167 |
| CYP3A4/RF (n = 639) | 0.750 | 0.828 | 0.779 | 0.367 | 0.368 | 0.845 | 0.813 | 93/71/89/386 |
Fn, false negative; Fp, false positive; MCC, Matthews correlation coefficient; RF, random forest; Tn, true negative; Tp, true positive.
3.3. DDI model
Using the DDI dataset in isolation, we built and evaluated simple classification models to predict interaction presence. After finding the best-performing model configuration (DL classification model and 1024-bit ECFP6 fingerprints), a comparison was performed between these models and the baseline model (as described in the Materials and Methods section above). Specifically, we found that these models performed better (21%) than the baseline in terms of recall (Table 4). Furthermore, the models exhibited an increase (44%) in accuracy compared with the baseline.
Table 4.
Five-fold CV training results for the DDI model with and without supporting P450 predictions. The P450 models’ CV statistics are shown in Table 3. Drug pairs absent from the training set were assumed to be noninteracting. Baseline model results are included for illustration purposes
| Model | Accuracy | F1 | AUC | Recall | Tn/Fp/Fn/Tp |
|---|---|---|---|---|---|
| DDI/DL (n = 72,030) | 0.843 | 0.835 | 0.916 | 0.816 | 30,928/4604/6724/29,774 |
| DDI + support/DL (n = 72,030) | 0.855 | 0.846 | 0.932 | 0.821 | 31624/3908/6516/29,982 |
| Baseline/DL (n = 72,030) | 0.499 | 0.515 | 0.500 | 0.524 | 16,830/18,702/17,364/19,134 |
Fn, false negative; Fp, false positive; Tn, true negative; Tp, true positive.
Table 4 also shows the CV metrics for the DDI + P450 model (which uses P450 inhibition and substrate predictions as well as molecular fingerprints as input, as described in the Materials and Methods). While the improvements over the original DDI model are small, they show a distinct increase in performance across all metrics. Specifically, we see a small (2%) increase in recall over the DDI model with respect to the baseline, and an almost 4% increase in accuracy. While small, these improvements suggest that predictions of the supporting model ensemble slightly enhance the CV performance of these DDI models.
3.4. AOPs and molecular feature importance analysis
Additionally, we developed AOPs for the P450-supported DDI model to quantify the decisions of the model based on enzyme inhibition and induction. AOPs aim to describe the P450 interactions behind the model’s predictions.47 We developed a graphical representation of the predicted P450-drug interactions to quickly identify the presence and cause of a potential interaction (Figs. 2 and 3). Each colored square in the 6 subplots at the top of the figure represents the model’s prediction for a single P450 enzyme. The type of model (inhibition or substrate) and the drugs predicted are indicated next to the squares. Black represents negative predictions, gray represents inconclusive predictions, and green represents positive predictions. The top row of squares indicates whether each set of P450 predictions suggests a likely DDI. Finally, the right-most square indicates the overall interaction prediction of the 2 drugs based on the overall DDI model. For example, for CYP1A2 in Fig. 2, drug 1 (fluconazole) is predicted to be a substrate while drug 2 (pioglitazone) is not, and both inhibition predictions are uncertain. This leads to an inconclusive DDI prediction for CYP1A2, as indicated by the gray square at the top of the subplot. However, the 2 drugs are predicted to interact via CYP2C9, CYP2C19, CYP2B6, and CYP2D6, resulting in an overall positive DDI prediction, as shown by the right-most green indicator.
Fig. 2.
Example AOP graphic for fluconazole (drug1) and pioglitazone (drug2). This drug pair has been shown to interact according to DrugBank. pred. int., predicted interaction.
Fig. 3.
Example AOP graphic for rabeprazole (drug1) and budesonide (drug2). This drug pair does not have any listed P450-mediated interactions in DrugBank, although they interact via a different mechanism. pred. int., predicted interaction.
3.5. Feature importance analysis
Understanding model reasoning is crucial to providing trustworthy and verifiable results. To achieve this goal, we demonstrate the process of determining important molecular features by performing a feature importance analysis on fluconazole, which is 1 of the 2 interacting drugs from the previous section. Specifically, to support our AOPs, we targeted the P450 models, as they are key to understanding the predictions of the overall DDI model.
For SVC models using a linear kernel (CYP2C19), we can simply inspect the coefficients of the trained P450 inhibition model. Larger coefficients correspond to more important molecular features, which we can then map to the molecular structure of the drug (Table 5). On the other hand, the DL model (CYP1A2) and the nonlinear SVC models (CYP2C9, CYP2B6, CYP2D6, and CYP3A4) require the usage of permutation importance to determine the most important molecular features.48 Permutation importance determines the importance of input features by randomly permuting each one and measuring the resulting quality of the model. Interestingly, Table 5 shows that many of the same features are used by each model, suggesting some similarities between the models. This analysis can also be used to validate DDI predictions in a clinical setting by validating the results of the underlying P450 models, further improving model interpretability.
Table 5.
Feature importance analysis of the P450 inhibition models. The top 3 features in order of importance for each model are highlighted on the example drug fluconazole
| Model | First | Second | Third |
|---|---|---|---|
| CYP1A2 inhibition | ![]() |
![]() |
![]() |
| CYP2C9 inhibition | ![]() |
![]() |
![]() |
| CYP2B6 inhibition | ![]() |
![]() |
![]() |
| CYP2C19 inhibition | ![]() |
![]() |
![]() |
| CYP2D6 inhibition | ![]() |
![]() |
![]() |
| CYP3A4 inhibition | ![]() |
![]() |
![]() |
3.6. FDA novel drug test set
The results of evaluating our model on a test set of 30 novel FDA-approved drugs and a further 30 existing drugs present in the DDInter data base were mixed. Of the 10 known interactions assembled from the test set, our best model, DL classification, detected 2 with a recall score of 0.2. While this may be significantly lower than expected based on the 5-fold CV scores during the training process, it is important to consider a similarity analysis of the novel drugs. Patterson et al49 found that, in general, 2 compounds have similar activity if they have a Tanimoto similarity of >0.85. Thus, if a given previously unseen drug has a Tanimoto similarity of >0.85 to drugs in the model training set, it is likely that the drug is within the applicability domain (AD) of the model, and an accurate prediction can be made. We, therefore, analyzed the novel drugs’ Tanimoto similarity using a maximum similarity method. For each novel drug, we found the most similar drug in the DDInter data base to produce the maximum similarity score. This metric was also calculated for each pair of DDInter drugs to calculate a baseline for comparing similarity scores. We found the average similarity between drugs in the DDInter dataset to be 0.809, whereas the average similarity between drugs in the FDA test set and the DDInter dataset was 0.761. This implies that the model may perform slightly worse when predicting interactions involving drugs from the FDA test set. However, both datasets’ average Tanimoto similarity was below the defined threshold of 0.85.
4. Discussion
A wide range of in silico DDI modeling approaches has been explored in the literature. Ferdousi et al50 described a similarity-based DDI model using the Russell-Rao similarity method, correctly identifying 72% of known DDIs in their dataset. Furthermore, the authors developed a specialized feature vector system for drug representation based on biological elements of each drug. Similarly, Li et al51 used a Bayesian network-based approach to model interactions by mapping each drug pair to a set of 6 features, such as chemical similarity or side effect similarity, to predict interacting drug combinations. This achieved an AUC of 0.95 in a dataset of DDIs, and an AUC of 0.71 in a generalization dataset of previously unseen DDIs. Such similarity-based approaches ignore drug molecular structure altogether,52 leading to the development of graph-based DDI prediction models. Typically, the drugs and other entities (ie, receptors) are represented as nodes, and the edges are interactions or other relationships. Thus, various graph algorithms, such as random walk and restart, can be applied to derive DDIs from the graph.53 Yan et al54 developed a node-based drug network to predict DDIs involving fully unknown drugs by calculating initial interaction information based on the network. The authors then used a recursive least squares algorithm to calculate the interactions. Cheng et al55 evaluated 5 different machine learning models (naïve Bayes, decision tree, KNN, logistic regression, and SVC) in DDI prediction tasks. Using data gathered from multiple sources such as DrugBank56 and MetaADEDB,57 the authors then used drug descriptors based on several molecular similarity measures, such as structural similarity and genomic similarity. Ultimately, they found that SVC was the most effective model, producing a 5-fold CV AUC of 0.67. Zhang et al58 took a slightly different approach to similarity metrics by applying network-based neighbor similarity. Based on calculated similarity, the authors used a neighbor recommender method commonly used in movie recommender systems to calculate user ratings for unseen movies. Specifically, if 2 drugs are considered similar, it is likely that they have similar interactions when exposed to the same drugs. Thus, unknown interactions for a given drug can be calculated by averaging the interaction severities on the same set of drugs. This method resulted in a maximum 5-fold CV recall score of 0.76 when trained on drug substructure data.59
DL has also been applied to modeling DDIs. Lee et al60 proposed a feed-forward network model with autoencoders to predict interactions based on several similarity metrics. They found that introducing molecular information other than structural similarity profiles produced up to a 2% increase in accuracy and over a 4% increase in recall. Hou et al61 trained deep neural networks with features generated from drug SMILES. The authors encoded the training data using one-hot encoding, where each bit represents 1 of 80 DDI types across 4 major mechanisms. The deep neural networks performed better on a holdout test set than the SVC model, with AUCs of 0.903 and 0.942, respectively. In summary, there is a wide range of different machine learning algorithms, such as network models, that leverage many types of data, such as molecular structure and even gene ontology, which have been used on public data to predict DDIs. Since each research group trains and evaluates its respective models on different data bases using various techniques, it is impossible to quantitatively compare approaches and benchmark them. Many other studies have used drug interaction and structure data from sources such as DrugBank,50,56 which is a comprehensive drug–drug target information data base, and SIDER,51,54,62 a data base of adverse drug side effects. Additionally, the OFFSIDES data base has been used,54,63 which represents a collection of ∼0.5 million off-label drug side effects, and TWOSIDES54,63 is another DDI data base.
Previous DDI modeling efforts have typically focused on applying different machine learning algorithms, such as Bayesian networks.51 Many models use molecular similarity, such as Russell-Rao similarity, to make DDI predictions.50 On the other hand, our work focuses on leveraging knowledge of drug-P450 interactions to enhance the accuracy of a DDI model. Instead of predicting DDIs using similarity between drugs, our models learn how drug structure and P450 inhibition impact the potential for DDIs.64, 65, 66, 67, 68, 69 Additionally, the unique structure of our models provides some degree of interpretability through P450 interaction modeling, which is likely vital in a clinical setting.
As shown in our studies, using P450 machine learning models to help predict DDIs marginally improves the model across several performance metrics. This may be due to a few different factors. First, others have shown70, 71, 72 that DDI typically happens between an inhibitor and a substrate for an enzyme. Predicting this type of interaction could be a weak point in our model configuration, as our substrate datasets limited us to training classification models instead of regression models. This prevented us from capturing weak versus strong substrate interaction behavior. Second, P450 induction can also cause DDIs, as the serum concentration can be decreased to the point where the drug(s) no longer have the desired effect.28,72 However, DDIs caused by induction are not common in practice.73 This means that our PXR models may have little contribution to the model performance and could be introducing noise when generalizing to unseen test sets, such as the FDA novel drugs set. Third, it is possible that the datasets used in this study cover dissimilar molecular spaces, meaning that the AD of the resulting model is highly constrained, leading to poor performance in generalization tasks.
Through the limited FDA novel drug interaction test set, we showed that DDI model generalization to real-world circumstances is not straightforward. As with any model, predictions involving input data from outside the model’s AD can result in inaccurate and unusable results. Only 2 out of 10 predictions in the novel drugs dataset were correct, despite the high recall (0.8) computed with CV. The Tanimoto similarity between the novel drugs and the DDInter drugs was also low, and the model was unable to make accurate predictions.
Conclusively showing that 2 drugs have no interaction is typically not the focus of drug toxicity studies,64 partially due to the risk involved in making the claim and the complexity, as other types of interactions could also occur.74 Instead, for most clinical applications, showing the presence of a DDI is generally reported, whereas the absence of a DDI is not, so there is a publication bias that exists. Molecules that have undergone a DDI risk assessment using in vitro data and clinical exposures (ICHM12)75 can be used to exclude clinical investigation of DDIs and, as such, could be additionally added to the no interaction dataset. Thus, training classification models can be challenging due to the lack of an established negative dataset. In our study, we assumed that absence from the training set implied noninteraction, but this obviously is not always the case.29 Alternatively, it is possible that one-class classification models can be used to achieve the desired results without needing to generate or collect data for the negative class.76
By only modeling P450 enzymes, we assume that all interactions in DDInter’s metabolism-mediated DDI dataset are caused by P450 enzymes. However, P450 enzymes are only one of many types of human drug-metabolizing enzymes that could be involved in DDIs.14,77 Our DDI model could likely detect non–P450-mediated DDIs because it learned relevant structural attributes, but such detection would not be augmented by P450 predictions. Furthermore, we only modeled the P450 enzymes with enough available data and those that are known to contribute significantly to metabolism. These P450 predictions would not be expected to augment DDI predictions when mediated by unrepresented P450 enzymes.
In future work, we could address these various shortcomings of our model. For example, we could include dose level and subsequent exposure, which might impact the accuracy of predictions. This could be via applying other machine learning methods, such as Gaussian mixture models,78 large language models,79 or positive unlabeled learning techniques, wherein the classifier is given data that belong to either “positive” or “unlabeled” classes and tasked with classifying them as “positive” or “negative” instances.80 Such approaches would allow us to determine if the accuracy of the DDI model can be improved by removing potentially incorrect training data due to the negative class generation method. While it may be possible that the number of false negatives added to the dataset is low, it is well established that the incorporation of better training data nearly always results in better accuracy. We could also leverage transporter models45,81, 82, 83 by incorporating them along with P450 models to predict DDIs. Additionally, we could validate the model in clinically relevant scenarios, such as a larger collection of recorded DDIs from molecules that were not included in our DDI prediction models and that were likely outside the AD. Finally, the AOP system presented in this paper could be expanded to include information about prediction uncertainty to paint a clearer picture of the risks involved in the drug combination. While uncertainty thresholding could be included in the models, it may be useful to expose the uncertainty figures to human operators who may be better suited to make the final risk assessment based on the patient.
In summary, P450 enzymes are key in predicting DDIs, as inhibition and induction of the enzymes can vary the drug concentration in the human body.84 We have demonstrated that the incorporation of this existing knowledge can slightly improve the accuracy of DDI models using a machine learning approach that includes input for P450 inhibition, induction, and substrate predictions. We have also evaluated model performance using recently approved FDA drugs, which demonstrates the challenges with novel molecules outside the model’s AD. Future work still needs to demonstrate the real-world prospective prediction applications and potential of such P450-enhanced DDI prediction models. This may require further increasing the accuracy of our P450 models, adding additional datasets of other enzymes and transporters, and adopting alternative machine learning techniques to expand beyond the approaches described herein. DDI predictions could also be extended to when more than 2 drugs are used simultaneously. This might be feasible by taking a pairwise approach and combining the results, but interpreting the data could be complex. This indicates there are plentiful opportunities for further research on this topic.
Conflict of interest
Sean Ekins is the owner, and Jason T. Wong, Joshua S. Harris, Thomas R. Lane, and Fabio Urbina are employees of Collaborations Pharmaceuticals, Inc.
Acknowledgments
We kindly acknowledge our colleagues at Collaborations Pharmaceuticals, Inc, for their feedback, and the anonymous reviewers for their helpful suggestions. This manuscript is dedicated to the memory of Curt M. Breneman for his collaborations and discussions, which S.E. greatly valued.
Financial support
Financial support for this research was provided by the National Institutes of Health National Institute of General Medical Sciences (NIGMS) [Grant R44GM122196-02A1] and the National Institute of Environmental Health Sciences (NIEHS) [Grant 2R44ES031038] (to S.E.). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. Financial support for this research was also provided by Defense Threat Reduction Agency [Grant WHDTRA1-19-1-0020].
Data availability
All data are provided in this publication (see Supplemental Material as a zip file).
CRediT authorship contribution statement
Jason T. Wong: Investigation, Formal analysis, Writing – original draft, Validation, Visualization, Software. Joshua S. Harris: Investigation, Formal analysis, Writing – review and editing, Software. Thomas R. Lane: Conceptualization, Data curation. Fabio Urbina: Conceptualization. Sean Ekins: Conceptualization, Funding acquisition, Project administration, Writing – original draft.
Footnotes
This article has supplemental material available at dmd.aspetjournals.org.
Supplemental material
References
- 1.Guengerich F.P. Cytochrome P450s and other enzymes in drug metabolism and toxicity. AAPS J. 2006;8(1):E101–E111. doi: 10.1208/aapsj080112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Iyer K.R., Sinz M.W. Characterization of phase I and phase II hepatic drug metabolism activities in a panel of human liver preparations. Chem Biol Interact. 1999;118(2):151–169. doi: 10.1016/s0009-2797(99)00007-1. [DOI] [PubMed] [Google Scholar]
- 3.Wilkinson G.R. Drug metabolism and variability among patients in drug response. N Engl J Med. 2005;352(21):2211–2221. doi: 10.1056/NEJMra032424. [DOI] [PubMed] [Google Scholar]
- 4.Williams R.T. 2nd ed. Chapman and Hall; 1959. Detoxication Mechanisms: The Metabolism and Detoxication of Drugs, Toxic Substances, and Other Organic Compounds. [Google Scholar]
- 5.Guengerich F.P. Cytochrome P-450 3A4: regulation and role in drug metabolism. Annu Rev Pharmacol Toxicol. 1999;39:1–17. doi: 10.1146/annurev.pharmtox.39.1.1. [DOI] [PubMed] [Google Scholar]
- 6.Zhang J.Y., Wang Y., Prakash C. Xenobiotic-metabolizing enzymes in human lung. Curr Drug Metab. 2006;7(8):939–948. doi: 10.2174/138920006779010575. [DOI] [PubMed] [Google Scholar]
- 7.Zhao M., Ma J., Li M., et al. Cytochrome P450 enzymes and drug metabolism in humans. Int J Mol Sci. 2021;22(23) doi: 10.3390/ijms222312808. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Hudelson M.G., Ketkar N.S., Holder L.B., et al. High confidence predictions of drug-drug interactions: predicting affinities for cytochrome P450 2C9 with multiple computational methods. J Med Chem. 2008;51(3):648–654. doi: 10.1021/jm701130z. [DOI] [PubMed] [Google Scholar]
- 9.Ekins S., de Groot M.J., Jones J.P. Pharmacophore and three-dimensional quantitative structure activity relationship methods for modeling cytochrome P450 active sites. Drug Metab Dispos. 2001;29(7):936–944. [PubMed] [Google Scholar]
- 10.Kato H. Computational prediction of cytochrome P450 inhibition and induction. Drug Metab Pharmacokinet. 2020;35(1):30–44. doi: 10.1016/j.dmpk.2019.11.006. [DOI] [PubMed] [Google Scholar]
- 11.Gonzalez E., Jain S., Shah P., et al. Development of robust quantitative structure-activity relationship models for CYP2C9, CYP2D6, and CYP3A4 catalysis and inhibition. Drug Metab Dispos. 2021;49(9):822–832. doi: 10.1124/dmd.120.000320. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Shoombuatong W., Prathipati P., Prachayasittikul V., et al. Towards predicting the cytochrome P450 modulation: from QSAR to proteochemometric modeling. Curr Drug Metab. 2017;18(6):540–555. doi: 10.2174/1389200218666170320121932. [DOI] [PubMed] [Google Scholar]
- 13.Lee J.H., Basith S., Cui M., Kim B., Choi S. In silico prediction of multiple-category classification model for cytochrome P450 inhibitors and non-inhibitors using machine-learning method. SAR QSAR Environ Res. 2017;28(10):863–874. doi: 10.1080/1062936X.2017.1399925. [DOI] [PubMed] [Google Scholar]
- 14.Jancova P., Anzenbacher P., Anzenbacherova E. Phase II drug metabolizing enzymes. Biomed Pap Med Fac Univ Palacky Olomouc Czech Repub. 2010;154(2):103–116. doi: 10.5507/bp.2010.017. [DOI] [PubMed] [Google Scholar]
- 15.Kiang T.K., Ensom M.H., Chang T.K. UDP-glucuronosyltransferases and clinical drug-drug interactions. Pharmacol Ther. 2005;106(1):97–132. doi: 10.1016/j.pharmthera.2004.10.013. [DOI] [PubMed] [Google Scholar]
- 16.Giacomini K.M., Galetin A., Huang S.M. The International Transporter Consortium: summarizing advances in the role of transporters in drug development. Clin Pharmacol Ther. 2018;104(5):766–771. doi: 10.1002/cpt.1224. [DOI] [PubMed] [Google Scholar]
- 17.Niu J., Straubinger R.M., Mager D.E. Pharmacodynamic drug-drug interactions. Clin Pharmacol Ther. 2019;105(6):1395–1406. doi: 10.1002/cpt.1434. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Klasco R. DRUGDEX® system (electronic version). Version 1:06–09. Thomson MICROMEDEX; 2009. [Google Scholar]
- 19.Rosas-Carrasco O., García-Peña C., Sánchez-García S., Vargas-Alarcón G., Gutiérrez-Robledo L.M., Juárez-Cedillo T. The relationship between potential drug-drug interactions and mortality rate of elderly hospitalized patients. Rev Invest Clin. 2011;63(6):564–573. [PubMed] [Google Scholar]
- 20.Choi Y.H., Lee I.H., Yang M., et al. Clinical significance of potential drug-drug interactions in a pediatric intensive care unit: a single-center retrospective study. PLoS One. 2021;16(2) doi: 10.1371/journal.pone.0246754. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.McDonnell P.J., Jacobs M.R. Hospital admissions resulting from preventable adverse drug reactions. Ann Pharmacother. 2002;36(9):1331–1336. doi: 10.1345/aph.1A333. [DOI] [PubMed] [Google Scholar]
- 22.Bartošovič I., Bartošovičová I.I., Zrubáková K., Mikus P., Ochaba R., Goljerová I. Comparison of polypharmacy and pharmacotherapy among seniors in social institutions in 2001 and 2019. Article in Czech. Vnitr Lek. 2023;69(E-2):4–9. doi: 10.36290/vnl.2023.022. [DOI] [PubMed] [Google Scholar]
- 23.Mosshammer D., Haumann H., Morike K., Joos S. Polypharmacy-an upward trend with unpredictable effects. Dtsch Arztebl Int. 2016;113(38):627–633. doi: 10.3238/arztebl.2016.0627. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Reason B., Terner M., Moses McKeag A., Tipper B., Webster G. The impact of polypharmacy on the health of Canadian seniors. Fam Pract. 2012;29(4):427–432. doi: 10.1093/fampra/cmr124. [DOI] [PubMed] [Google Scholar]
- 25.Karas S., Jr. The potential for drug interactions. Ann Emerg Med. 1981;10(12):627–630. doi: 10.1016/s0196-0644(81)80085-6. [DOI] [PubMed] [Google Scholar]
- 26.Micromedex. Micromedex Drug-Reax. https://www.micromedexsolutions.com/home/dispatch
- 27.Roblek T., Vaupotic T., Mrhar A., Lainscak M. Drug-drug interaction software in clinical practice: a systematic review. Eur J Clin Pharmacol. 2015;71(2):131–142. doi: 10.1007/s00228-014-1786-7. [DOI] [PubMed] [Google Scholar]
- 28.Tornio A., Filppula A.M., Niemi M., Backman J.T. Clinical studies on drug-drug interactions involving metabolism and transport: methodology, pitfalls, and interpretation. Clin Pharmacol Ther. 2019;105(6):1345–1361. doi: 10.1002/cpt.1435. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Xiong G., Yang Z., Yi J., et al. DDInter: an online drug-drug interaction database towards improving clinical decision-making and patient safety. Nucleic Acids Res. 2022;50(D1):D1200–D1207. doi: 10.1093/nar/gkab880. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Ai N., Fan X., Ekins S. In silico methods for predicting drug-drug interactions with cytochrome P-450s, transporters and beyond. Adv Drug Deliv Rev. 2015;86:46–60. doi: 10.1016/j.addr.2015.03.006. [DOI] [PubMed] [Google Scholar]
- 31.Safdari R., Ferdousi R., Aziziheris K., Niakan-Kalhori S.R., Omidi Y. Computerized techniques pave the way for drug-drug interaction prediction and interpretation. Bioimpacts. 2016;6(2):71–78. doi: 10.15171/bi.2016.10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.National Institutes of Health Chemical Identifier Resolver. https://cactus.nci.nih.gov/chemical/structure
- 33.Kim S., Thiessen P.A., Bolton E.E., et al. PubChem substance and compound databases. Nucleic Acids Res. 2016;44(D1):D1202–D1213. doi: 10.1093/nar/gkv951. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Zdrazil B., Felix E., Hunter F., et al. The ChEMBL Database in 2023: a drug discovery platform spanning multiple bioactivity data types and time periods. Nucleic Acids Res. 2024;52(D1):D1180–D1192. doi: 10.1093/nar/gkad1004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Zaretzki J., Rydberg P., Bergeron C., Bennett K.P., Olsen L., Breneman C.M. RS-Predictor models augmented with SMARTCyp reactivities: robust metabolic regioselectivity predictions for nine CYP isozymes. J Chem Inf Model. 2012;52(6):1637–1659. doi: 10.1021/ci300009z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.National Institutes of Health Human pregnane X receptor (PXR) small molecule agonists: summary. https://pubchem.ncbi.nlm.nih.gov/bioassay/1347033
- 37.Dudas B., Miteva M.A. Computational and artificial intelligence-based approaches for drug metabolism and transport prediction. Trends Pharmacol Sci. 2024;45(1):39–55. doi: 10.1016/j.tips.2023.11.001. [DOI] [PubMed] [Google Scholar]
- 38.Chen Y., Tang Y., Guo C., Wang J., Boral D., Nie D. Nuclear receptors in the multidrug resistance through the regulation of drug-metabolizing enzymes and drug transporters. Biochem Pharmacol. 2012;83(8):1112–1126. doi: 10.1016/j.bcp.2012.01.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Cooper B.W., Cho T.M., Thompson P.M., Wallace A.D. Phthalate induction of CYP3A4 is dependent on glucocorticoid regulation of PXR expression. Toxicol Sci. 2008;103(2):268–277. doi: 10.1093/toxsci/kfn047. [DOI] [PubMed] [Google Scholar]
- 40.Honkakoski P., Negishi M. Regulation of cytochrome P450 (CYP) genes by nuclear receptors. Biochem J. 2000;347(Pt 2):321–337. doi: 10.1042/0264-6021:3470321. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.RDKit: a software suite for cheminformatics, computational chemistry, and predictive modeling. Version 2023.09.4. 2013. https://www.rdkit.org/RDKit_Overview.pdf
- 42.Li Z., Han P., You Z.H., et al. In silico prediction of drug-target interaction networks based on drug chemical structure and protein sequences. Sci Rep. 2017;7(1) doi: 10.1038/s41598-017-10724-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Rogers D., Hahn M. Extended-connectivity fingerprints. J Chem Inf Model. 2010;50(5):742–754. doi: 10.1021/ci100050t. [DOI] [PubMed] [Google Scholar]
- 44.Lane T., Russo D.P., Zorn K.M., et al. Comparing and validating machine learning models for Mycobacterium tuberculosis drug discovery. Mol Pharm. 2018;15(10):4346–4360. doi: 10.1021/acs.molpharmaceut.8b00083. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Lane T.R., Urbina F., Zhang X., et al. Machine learning models identify new inhibitors for human OATP1B1. Mol Pharm. 2022;19(11):4320–4332. doi: 10.1021/acs.molpharmaceut.2c00662. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Knox C., Wilson M., Klinger C.M., et al. DrugBank 6.0: the DrugBank Knowledgebase for 2024. Nucleic Acids Res. 2024;52(D1):D1265–D1275. doi: 10.1093/nar/gkad976. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Vinken M. The adverse outcome pathway concept: a pragmatic tool in toxicology. Toxicology. 2013;312:158–165. doi: 10.1016/j.tox.2013.08.011. [DOI] [PubMed] [Google Scholar]
- 48.Altmann A., Tolosi L., Sander O., Lengauer T. Permutation importance: a corrected feature importance measure. Bioinformatics. 2010;26(10):1340–1347. doi: 10.1093/bioinformatics/btq134. [DOI] [PubMed] [Google Scholar]
- 49.Patterson D.E., Cramer R.D., Ferguson A.M., Clark R.D., Weinberger L.E. Neighborhood behavior: a useful concept for validation of "molecular diversity" descriptors. J Med Chem. 1996;39(16):3049–3059. doi: 10.1021/jm960290n. [DOI] [PubMed] [Google Scholar]
- 50.Ferdousi R., Safdari R., Omidi Y. Computational prediction of drug-drug interactions based on drugs functional similarities. J Biomed Inform. 2017;70:54–64. doi: 10.1016/j.jbi.2017.04.021. [DOI] [PubMed] [Google Scholar]
- 51.Li P., Huang C., Fu Y., et al. Large-scale exploration and analysis of drug combinations. Bioinformatics. 2015;31(12):2007–2016. doi: 10.1093/bioinformatics/btv080. [DOI] [PubMed] [Google Scholar]
- 52.Han K., Cao P., Wang Y., et al. A review of approaches for predicting drug-drug interactions based on machine learning. Front Pharmacol. 2021;12 doi: 10.3389/fphar.2021.814858. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Park K., Kim D., Ha S., Lee D. Predicting pharmacodynamic drug-drug interactions through signaling propagation interference on protein-protein interaction networks. PLoS One. 2015;10(10) doi: 10.1371/journal.pone.0140816. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Yan C., Duan G., Zhang Y., Wu F.X., Pan Y., Wang J. In: Cai Z., Skums P., Li M., editors. Vol. 11490. Cham; Springer: 2019. IDNDDI: an integrated drug similarity network method for predicting drug-drug interactions; pp. 89–99. (Bioinformatics Research and Applications. ISBRA 2019. Lecture Notes in Computer Science). [Google Scholar]
- 55.Cheng F., Zhao Z. Machine learning-based prediction of drug-drug interactions by integrating drug phenotypic, therapeutic, chemical, and genomic properties. J Am Med Inform Assoc. 2014;21(e2):e278–e286. doi: 10.1136/amiajnl-2013-002512. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Knox C., Law V., Jewison T., et al. DrugBank 3.0: a comprehensive resource for 'omics' research on drugs. Nucleic Acids Res. 2011;39(Database issue):D1035–D1041. doi: 10.1093/nar/gkq1126. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Cheng F., Li W., Wang X., et al. Adverse drug events: database construction and in silico prediction. J Chem Inf Model. 2013;53(4):744–752. doi: 10.1021/ci4000079. [DOI] [PubMed] [Google Scholar]
- 58.Zhang W., Chen Y., Liu F., Luo F., Tian G., Li X. Predicting potential drug-drug interactions by integrating chemical, biological, phenotypic and network data. BMC Bioinformatics. 2017;18(1):18. doi: 10.1186/s12859-016-1415-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Vilar S., Harpaz R., Uriarte E., Santana L., Rabadan R., Friedman C. Drug-drug interaction through molecular structure similarity analysis. J Am Med Inform Assoc. 2012;19(6):1066–1074. doi: 10.1136/amiajnl-2012-000935. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Lee G., Park C., Ahn J. Novel deep learning model for more accurate prediction of drug-drug interaction effects. BMC Bioinformatics. 2019;20(1):415. doi: 10.1186/s12859-019-3013-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Hou X., You J., Hu P. February 22–24, 2019. Predicting drug-drug interactions using deep neural network. Paper presented at: Proceedings of the 2019 11th International Conference on Machine Learning and Computing. Zhuhai, China. [DOI] [Google Scholar]
- 62.Kuhn M., Campillos M., Letunic I., Jensen L.J., Bork P. A side effect resource to capture phenotypic effects of drugs. Mol Syst Biol. 2010;6:343. doi: 10.1038/msb.2009.98. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Tatonetti N.P., Ye P.P., Daneshjou R., Altman R.B. Data-driven prediction of drug effects and interactions. Sci Transl Med. 2012;4(125) doi: 10.1126/scitranslmed.3003377. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Zakharov A.V., Varlamova E.V., Lagunin A.A., et al. QSAR modeling and prediction of drug-drug interactions. Mol Pharm. 2016;13(2):545–556. doi: 10.1021/acs.molpharmaceut.5b00762. [DOI] [PubMed] [Google Scholar]
- 65.Jin W., Stokes J.M., Eastman R.T., et al. Deep learning identifies synergistic drug combinations for treating COVID-19. Proc Natl Acad Sci U S A. 2021;118(39) doi: 10.1073/pnas.2105070118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Kwon S., Bae H., Jo J., Yoon S. Comprehensive ensemble in QSAR prediction for drug discovery. BMC Bioinformatics. 2019;20(1):521. doi: 10.1186/s12859-019-3135-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Wu J., Liu B., Chan W.K.B., et al. Precise modelling and interpretation of bioactivities of ligands targeting G protein-coupled receptors. Bioinformatics. 2019;35(14):i324–i332. doi: 10.1093/bioinformatics/btz336. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Feng H., Zhang L., Li S., et al. Predicting the reproductive toxicity of chemicals using ensemble learning methods and molecular fingerprints. Toxicol Lett. 2021;340:4–14. doi: 10.1016/j.toxlet.2021.01.002. [DOI] [PubMed] [Google Scholar]
- 69.Nembri S., Grisoni F., Consonni V., Todeschini R. In silico prediction of cytochrome P450-drug interaction: QSARs for CYP3A4 and CYP2C9. Int J Mol Sci. 2016;17(6):914. doi: 10.3390/ijms17060914. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Lin Y., Lu P., Tang C., et al. Substrate inhibition kinetics for cytochrome P450-catalyzed reactions. Drug Metab Dispos. 2001;29(4 Pt 1):368–374. [PubMed] [Google Scholar]
- 71.Lee K.S., Kim S.K. Direct and metabolism-dependent cytochrome P450 inhibition assays for evaluating drug-drug interactions. J Appl Toxicol. 2013;33(2):100–108. doi: 10.1002/jat.1720. [DOI] [PubMed] [Google Scholar]
- 72.Ogu C.C., Maxa J.L. Drug interactions due to cytochrome P450. Proc (Bayl Univ Med Cent) 2000;13(4):421–423. doi: 10.1080/08998280.2000.11927719. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Guengerich F.P. In: Enzyme Systems that Metabolise Drugs and Other Xenobiotics. Anderson D., Waters M.D., Marrs T.C., Ioannides C., editors. Wiley; 2001. Cytochrome P450. [DOI] [Google Scholar]
- 74.International Transporter Consortium, Giacomini K.M., Huang S.M., et al. Membrane transporters in drug development. Nat Rev Drug Discov. 2010;9(3):215–236. doi: 10.1038/nrd3028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.ICH Drug interaction studies M12. https://database.ich.org/sites/default/files/ICH_M12_Step4_Guideline_2024_0521_0.pdf
- 76.Kang S. Using binary classifiers for one-class classification. Expert Syst Appl. 2022;187 doi: 10.1016/j.eswa.2021.115920. [DOI] [Google Scholar]
- 77.Bardhi K., Coates S., Watson C.J.W., Lazarus P. Cannabinoids and drug metabolizing enzymes: potential for drug-drug interactions and implications for drug safety and efficacy. Expert Rev Clin Pharmacol. 2022;15(12):1443–1460. doi: 10.1080/17512433.2022.2148655. [DOI] [PubMed] [Google Scholar]
- 78.Ilonen J., Paalanen P., Kamarainen J.K., Kalviainen H. August 20–24, 2006. Gaussian mixture pdf in one-class classification: computing and utilizing confidence values. Paper presented at: 18th International Conference on Pattern Recognition (ICPR'06) Hong Kong, China. [Google Scholar]
- 79.Snyder S.H., Vignaux P.A., Ozalp M.K., et al. The Goldilocks paradigm: comparing classical machine learning, large language models, and few-shot learning for drug discovery applications. Commun Chem. 2024;7(1):134. doi: 10.1038/s42004-024-01220-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Bekker J., Davis J. Learning from positive and unlabeled data: a survey. Mach Learn. 2020;109(4):719–760. doi: 10.1007/s10994-020-05877-5. [DOI] [Google Scholar]
- 81.Miller S.R., Zhang X., Hau R.K., et al. Predicting drug interactions with human equilibrative nucleoside transporters 1 and 2 using functional knockout cell lines and Bayesian modeling. Mol Pharmacol. 2021;99(2):147–162. doi: 10.1124/molpharm.120.000169. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Sandoval P.J., Zorn K.M., Clark A.M., Ekins S., Wright S.H. Assessment of substrate dependent ligand interactions at the organic cation transporter OCT2 using six model substrates. Mol Pharmacol. 2018;94(3):1057–1068. doi: 10.1124/mol.117.111443. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Martinez-Guerrero L.J., Morales M., Ekins S., Wright S.H. Lack of influence of substrate on ligand interaction with the human multidrug and toxin extruder, MATE1. Mol Pharmacol. 2016;90(3):254–264. doi: 10.1124/mol.116.105056. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Guengerich F.P. Inhibition of cytochrome P450 enzymes by drugs-molecular basis and practical applications. Biomol Ther (Seoul) 2022;30(1):1–18. doi: 10.4062/biomolther.2021.102. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All data are provided in this publication (see Supplemental Material as a zip file).






















