Abstract
Many adverse drug reactions (ADRs) are caused by drug-drug interactions (DDIs), meaning they arise from concurrent use of multiple medications. Detecting DDIs using observational data has at least three major challenges: (1) The number of potential DDIs is astronomical; (2) Associations between drugs and ADRs may not be causal due to observed or unobserved confounding; and (3) Frequently co-prescribed drug pairs that each independently cause an ADR do not necessarily causally interact, where causal interaction means that at least some patients would only experience the ADR if they take both drugs. We address (1) through data mining algorithms pre-filtering potential interactions, and (2) and (3) by fitting causal interaction models adjusting for observed confounders and conducting sensitivity analyses for unobserved confounding. We rank candidate DDIs robust to unobserved confounding more likely to be real. Our rigorous approach produces far fewer false positives than past applications that ignored (2) and (3).
1. Introduction
Adverse drug reactions (ADRs) lead to nearly 74,000 emergency room visits and 195,000 hospitalizations each year in the United States1,2. Many ADRs are brought about by drug-drug interactions (DDIs), meaning that they arise as a result of concurrent use of multiple medications. Concurrent medication use is particularly common among the elderly3. DDIs are difficult to identify from clinical trials, which typically only randomize one treatment at a time and have sample sizes too low to reliably detect interaction effects for rare outcomes. Thus, DDI detection must depend on large scale observational data analysis.
In the United States, the Food Drug Administration (FDA) has implemented the FDA Adverse Event Reporting System (FAERS) database4. FAERS is a system gathering adverse event reports, medication error reports and product quality complaints resulting in adverse events submitted to the FDA by healthcare professionals (such as physicians, pharmacists, nurses, and others), consumers (such as patients, family members, lawyers, and others) and manafacturers (such as pharmaceutical companies). FAERS has been a critical source of data for pharmaco-vigilance analytics5 mostly focusing on detecting adverse effects of individual drugs. In this work, we propose to use FAERS to establish and detect DDIs.
Identifying DDIs causing an adverse drug reaction from observational databases such as FAERS poses at least three main challenges. First, the number of potential candidate drug-drug interactions is extremely large, which is computationally challenging. Second, associations between drugs and ADRs in observational data may not be causal due to observed or unobserved confounding. For example, a candidate DDI with two interacting drugs A1 and A2 causing ADR Y (as shown in Figure 1) may have both observed and unobserved confounders. In FAERS, observed confounders comprise co-prescribed drugs (C1 in Figure 1) and comorbidities captured as indications of co-prescribed drugs (C2 in Figure 1). Unobserved confounders include demographic factors, unrecorded indicators of health status such as vital signs or lab tests, and more. Third, pairs of drugs that each independently cause an ADR and are frequently co-prescribed do not necessarily causally interact, where causal interaction means that there are at least some patients who would only experience the ADR if they take both drugs.
Figure 1:
Possible confounders (C1,C2 and U) for a potential DDI A1&A2 ⇒ Y.
We address the first challenge of too many candidate DDIs by using data mining algorithms to pre-filter. Most existing studies6 in this area stop here and ignore the remaining challenges. Many of the candidate DDIs these studies flag are not causal, since they did not adjust for confounding. We address the remaining challenges by fitting causal interaction models to adjusting for observed confounders and conducting sensitivity analyses for possible unobserved confounding. Candidate DDIs robust to strong unobserved confounding in the sensitivity analyses are ranked as more likely to be real.
Past applications in this field have produced a very large number of false positives. For example, another popular DDI detection algorithm based on FAERS data (Twosides7) produces thousands of DDIs with ‘high confidence’ of causing each outcome we considered. This is medically implausible, as DDIs are not that common. Our approach is considerably more conservative than Twosides, i.e. less likely to generate false positives, for two reasons. First, while Twosides employs ad hoc heuristics to identify associations, we explicitly target a metric that implies causal interaction. Second, while Twosides assumes that its matching procedure is sufficient to remove confounding, our sensitivity analysis allows for the likely eventuality that we have failed to fully adjust for confounding. While validation is a challenge in this area because there do not exist reliable sources of ground truth about DDIs, it is more justifiable to devote limited resources to further investigate the few candidate DDIs flagged by our approach than the many flagged by alternative less rigorous approaches with high false positive rates.
2. Related Work
Significant research efforts have focused on detecting and predicting DDIs. One group of methods leverages various biochemical and molecular drug/target data to measure drug-drug similarities and score/predict DDIs. These data include chemical structures8,9, target information10, compound-target docking scores11, and drug side effects12. There have also been efforts at predicting DDIs by integrating molecular and pharmacological data13,14. Another group of methods analyze medical literature and/or electronic medical records to extract potential DDIs. Various computational techniques such as traditional regression models15, association rule mining6,16,17, disproportionality ratios18,19, biclustering20, and text mining21 have been used to detect DDIs in a fully data-driven manner. Some of these methods7,22 consider issues of confounding. However, none of the data-driven methods explicitly targets a causal interaction effect or incorporates sensitivity analysis to acknowledge that some confounding will be unobserved.
3. Methods
We divide our approach into four stages. Stage 1 entails pre-filtering to identify potential drug-drug-ADR triplets in which the two drugs may interact to cause the ADR. In stage 2, for each triplet identified in Stage 1, we identify a set of potential confounding variables that are associated with either of the drugs or the outcome. In stage 3, we estimate the causal interaction effects in the triplets identified in stage 1 adjusting for the possible confounders identified in stage 2. In stage 4, we perform sensitivity analyses to assess the robustness of the causal interaction effect estimates obtained in stage 3 to possible unobserved confounding. In the following, we refer to drug-drug-ADR triplets in which the drugs actually interact to cause the ADR as DDIs, and we refer to triplets in which the drugs may or may not interact to cause the ADR as candidate DDIs.
3.1. Data and Notation
The FAERS database comprises self-reported suspected adverse reactions. Each self-report includes the adverse outcome (suspected to be a reaction to some medication), all medications taken by the patient leading up to the adverse outcome, and the indications for which all the listed medications were prescribed. A strength of such self-reported data is that we can have high confidence that reported drugs were actually taken, in contrast to EHR data where non-adherence is a concern.
For a given candidate DDI, let A1, A2, and Y denote binary random variables over the space of reports indicating presence of the first drug of interest, the second drug of interest, and the outcome of interest, respectively. Let C be a random vector denoting additional possible confounding variables (i.e. presence of indications and other drugs) to be adjusted for when estimating the effect of A1 and A2 on Y . We will use upper case letters to refer to random variables and lower case letters to refer to their particular realizations.
If a report is for an adverse outcome other than the ADR of interest, Y = 0. A shortcoming of the dataset is that when Y = 0 we do not actually know for sure that the patient who is the subject of the report did not also experience the ADR of interest in addition to whatever outcome motivated the report. However, if the ADR of interest is rare in the population, this will not significantly bias our results.
We adopt the counterfactual framework for causal inference23 in which it is assumed that corresponding to each possible treatment assignment (a1, a2) of A1 and A2 there exists a counterfactual outcome Y (a1, a2) which is the outcome that would have been observed had the patient actually received treatment assignment (a1, a2). We say that a mechanistic interaction between A1 and A2 is present if for at least some patients Y (1, 1) = 1 but Y (1, 0) = Y (0, 1) = Y (0, 0) = 024, i.e. if at least some patients would only experience the outcome if they were to receive both treatments. Our goal will be to identify mechanistic interactions, since compared to other possible definitions of a DDI they are most relevant to public health.
3.2. Pre-filtering
We prefiltered candidate DDIs according to the following criteria:
Rare outcome: P (Y = 1) ≤ .01
Unadjusted statistical interaction: We only considered candidate DDIs with unadjusted statistical interaction .
Sufficient power: We required power ≥ 0.8 to test whether the unadjusted statistical interaction was greater than 2 at a .01 level assuming that the true unadjusted statistical interaction was equal to the larger of the observed unadjusted statistical interaction or 3.
The rare outcome assumption is necessary to be able to approximate relevant quantities using a case-control design25. The other two assumptions are simply to restrict candidate DDIs to those both more likely to be real and have sufficient support in the data to estimate their interaction effects. The unadjusted statistical interaction between drugs in almost all true DDIs will be greater than 2 for reasons that will become clear below. And effect estimates for candidate DDIs with higher power are less likely to be pure noise arising from sampling variability. In addition, for a given ADR, we also curated the set of drugs that are prescribed for the treatment of the same ADR as indication using the SIDER26 database. We removed these drugs from the potential candidate DDIs to cause the ADR under consideration.
3.3. Identification of Possible Confounders
In a careful study of an individual candidate DDI, confounder selection would be based on expert knowledge of the causes of the outcome and drugs of interest27. In this work, we are designing an automated algorithm to assess large numbers of candidate DDIs, and an automated confounder selection approach cannot require expert knowledge about individual candidate DDIs. For each drug and outcome included in each candidate DDI, we identify the variables (i.e. medications and indications) that are most strongly associated with the it (e.g., C1 and C2 of Figure 1). In particular, we find up to 50 drugs and indications that are associated with the ADR under consideration, and another 50 indications that are related to each drug of the DDI. Thus, we generated a list of up to 150 variables comprising the potential confounding variables for each candidate DDI and then we adjust for these confounders while estimating causal interaction effects as described in the following subsection. We use mutual information to identify candidate confounders of each DDI.
3.4. Causal Interaction Estimation
In this step, we want to find the mechanistic interaction between two drugs after adjusting for all the observerd confounders obtained from the previous step. In particular, we compute the Relative Excess Risk due to Interaction (RERI) of a particular DDI, which is defined as
| (1) |
where RRa0a1 denotes P (Y (a0, a1) = 1)/P (Y (a0, a0) = 1). The RERI is an important quantity because an RERI greater than 2 implies a mechanistic interaction24. We cannot directly estimate the RERI as limitations of the FAERS dataset force us to adopt a case-control design. However, for rare outcomes,
| (2) |
where ORa0a1 denotes . Note that ORa0a1 and therefore RERIOR are counterfactual quantities. If we specify a logistic regression model
| (3) |
and assume that C contains all common causes of A1 and Y and all common causes of A2 and Y , then
| (4) |
Hence we can estimate RERIOR by plugging in the parameter estimates of our logistic regression (3) into (4).
3.5. Sensitivity Analysis
It is a very strong assumption that C is sufficient to adjust for all confounding between both treatments and the outcome. Causal sensitivity analyses24,28,29 posit that there exists an unobserved confounder U not included in C as in Figure 1. For a range of parameter values determining the prevalence of U at different levels of the treatments and the strength of U’s association with the outcome, we can compute the RERIOR estimate we would have obtained had we been able to adjust for U assuming the causal graph in Figure 1. If U-adjusted RERIOR estimates are large for a wide range of sensitivity parameters governing the impact of hypothetical unobserved confounder U , this is an indication that these estimates are robust to unobserved confounding. (The level of robustness to confounding is determined by a complex function of the dependencies between treatments, observed confounders, and the outcome, i.e. robustness is not simply equivalent to a large estimated effect.)
Define assumed constant over a1,a2,c. And let ρa1a2 denote P (U = 1|A1 = a1, A2 = a2,C = c) assumed constant over c. Then the U −adjusted estimate of RERIOR is
| (5) |
For each candidate DDI, we compute over a grid of λ and ρ values spanning [1, 4] by increments of .5 for λ and [0, 1] by increments of .25 for ρ11,ρ10,ρ01, and ρ00. We refer to the resulting grid of U-adjusted RERIOR estimates as the ‘sensitivity grid’ for a candidate DDI. We also estimate by bootstrap a covariance matrix for estimates over all candidate DDIs and values of λ and ρ.
3.6. Ranking Candidate DDIs Based on Sensitivity Analysis Output
We consider multiple schemes for ranking candidate DDIs based on their sensitivity grids and the confidence intervals for the entries of those grids.
Confounding naive approach: Rank candidate DDIs by their RERI estimates
Conservative confounding naive approach: Rank candidate DDIs by the lower bound of the 0.95 confidence interval of the RERI estimate.
Confounding aware approach: Rank candidate DDIs by the pth (e.g. 0.25) quantile of the sensitivity grid.
Conservative confounding aware approach: Rank candidate DDIs by the lower bound of the 0.95 confidence interval for the pth (e.g. 0.25) quantile of the sensitivity grid.
4. Experiments
We applied our approach to detect DDIs causing Acute Renal Failure (ARF), Acute Myocardial Infarction (AMI), and Acute Liver Injury (ALI) in the FAERS database. We used the OMOP definition of UMLS concept ids to map the FAERS data to these three ADRs. Our dataset comprised 6,293,657 total reports collected from Q4 of 2012 to Q4 of 2014. 13,138 were regarding instances of ARF, 25,045 were regarding instances of AMI, and 12,911 were regarding instances of ALI. The pre-filtering process reduced these numbers to just 69 candidate DDIs for ARF, 60 for AMI, and 32 for ALI. For each outcome, we report all four scores described in the previous subsection for top candidate DDIs.
4.1. Results
Of the 69 candidate DDIs for ARF remaining after pre-filtering, after confounding adjustment only 8 had estimated RERIs ≥ 2 (Table 1(a)). The 0.25 quantile of the sensitivity grid was ≥ 2 for 6 of those 8 (Table 1(b)), and the 0.1 quantile of the sensitivity grid was ≥ 2 for 5 of the 8. (For illustration, Figure 2 depicts the distribution of interaction effect estimates on ARF over the grid of sensitivity parameters for dabigatran and diltiazem. Note that under some sensitivity parameters, the RERI would actually be larger, i.e. if the confounding lessened the effect estimate.) For only 1 of the 8 was the 0.95 confidence interval lower bound of the RERI point estimate ≥ 2. Table 2 shows all four scores mentioned in Section 3.6 (i.e., the RERI point estimate, the lower 95% CI of RERI estimate, the .25 quantile of the sensitivity grid, and the lower 95% CI of the .25 quantile of the sensitivity grid) for the top 15 candidate DDIs ranked by the conservative confounding aware approach, i.e, the lower bound of the 95% CI for the 0.25 quantile of the sensitivity grid. No candidate DDI has the lower bound of the 0.25 quantile of its sensitivity grid ≥ 2 (Table 2). Hence, there was not a single candidate DDI that we would confidently declare to mechanistically cause ARF.
Table 1.
(a) Top 15 DDIs of ARF ranked by the .25 quantile of sensitivity grid
| Drug1 | Drug2 | RERI |
| Lisinopril | Sodium Phosphate, Dibasic |
192.7 |
| Heparin | Mannitol | 10.7 |
| Potassium Chloride |
dabigatran etexilate | 4.6 |
| Lisinopril | dabigatran etexilate | 3.1 |
| carvedilol | dabigatran etexilate | 2.8 |
| Allopurinol | dabigatran etexilate | 2.4 |
| Diltiazem | dabigatran etexilate | 2.2 |
| dabigatran etexilate |
Spironolactone | 2.2 |
| Albuterol | dabigatran etexilate | 1.9 |
| Acetaminophen | dabigatran etexilate | 1.6 |
| Digoxin | dabigatran etexilate | 1.6 |
| Aspirin | dabigatran etexilate | 1.6 |
| Simvastatin | dabigatran etexilate | 1.6 |
| Metformin | Indapamide | 1.5 |
| Bisoprolol | Bumetanide | 1.5 |
(b) Top 15 DDIs of ARF ranked by the .25 quantile of sensitivity grid
| Drug1 | Drug2 | Sensitivity adjusted RERI |
| Lisinopril | Sodium Phosphate, Dibasic |
140.4 |
| Heparin | Mannitol | 8.8 |
| Potassium Chloride |
dabigatran etexilate | 3.9 |
| Lisinopril | dabigatran etexilate | 2.7 |
| carvedilol | dabigatran etexilate | 2.4 |
| Allopurinol | dabigatran etexilate | 2.2 |
| Diltiazem | dabigatran etexilate | 2 |
| dabigatran etexilate |
Spironolactone | 1.9 |
| Albuterol | dabigatran etexilate | 1.7 |
| Acetaminophen | dabigatran etexilate | 1.5 |
| Digoxin | dabigatran etexilate | 1.5 |
| Aspirin | dabigatran etexilate | 1.5 |
| Simvastatin | dabigatran etexilate | 1.5 |
| Metformin | Indapamide | 1.4 |
| Heparin | Sodium Bicarbonate | 1.4 |
Figure 2:
Distribution of U-adjusted RERI (of Diltiazen and Dabigatran on ARF) Estimates Over Sensitivity Parameters
Table 2:
Top 15 DDIs for ARF ranked by the lower .95 CI of .25 quantile of sensitivity grid
| Drug 1 | Drug 2 | RERI | Lower .95 CI RERI | Sensitivity adjusted RERI | Sensitivity adjusted RERI (Lower .95 CI) |
| Potassium Chloride | dabigatran etexilate | 4.6 | 0.9 | 3.9 | 1.4 |
| Diltiazem | dabigatran etexilate | 2.2 | 0.6 | 2.0 | 1.0 |
| Digoxin | dabigatran etexilate | 1.6 | 0.6 | 1.5 | 0.9 |
| Allopurinol | dabigatran etexilate | 1.9 | 0.8 | 2.2 | 0.9 |
| Oxycodone | Metoprolol | 1.5 | 0.7 | 1.3 | 0.8 |
| Acetaminophen | lansoprazole | 1.6 | 0.7 | 1.4 | 0.8 |
| dabigatran etexilate | Omeprazole | 1.4 | 0.6 | 1.0 | 0.8 |
| levothyroxine | dabigatran etexilate | 1.3 | 0.7 | 1.3 | 0.7 |
| Simvastatin | Citalopram | 1.6 | 0.6 | 1.5 | 0.7 |
| Ondansetron | Hydromorphone | 1.4 | 0.5 | 1.4 | 0.6 |
| Aspirin | Insulin Glargine | 1.6 | 0.8 | 1.0 | 0.6 |
| Potassium Chloride | Esomeprazole | 1.5 | 0.5 | 0.8 | 0.6 |
| Warfarin | Bisoprolol | 1.4 | 0.7 | 1.1 | 0.5 |
| Ciprofloxacin | Furosemide | 1.2 | 0.6 | 1.0 | 0.5 |
| Fentanyl | Furosemide | 1.0 | 0.4 | 0.9 | 0.5 |
Our results were qualitatively different from those obtained by Twosides. Twosides ranked each candidate DDI on a scale from 1 to 5 of confidence that the DDI is real. 3,716 of the 17,945 candidate DDIs potentially causing ARF considered by Twosides were given a score of 5. Since strong DDIs are fairly rare, we believe that this is an implausibly large proportion of drug pairs to mechanistically interact and cause ARF. We would argue our method provided more measured evidence of interaction effects.
Table 3 contains the the top 15 candidate DDIs for Acute Myocardial Infarction (AMI) ranked by the lower 95% CI of the 0.25% estimate of the sensitive grid. Only one out of the 60 initial candidate DDIs (bisoprolol and nilotinib) had a score ≥ 2 by this conservative measure. By contrast, out of 28,683 candidate DDIs for AMI considered by TwoSides, 5,119 were reported to have the highest confidence score of 5 and 19,328 a confidence score ≥ 3. Again, since true causal DDIs are rare, such a large number of DDIs indicates a preponderance of false positive results.
Table 3:
Top 15 DDIs for AMI ranked by the lower .95 CI of .25 quantile of sensitivity grid
| Drug 1 | Drug 2 | RERI | Lower .95 CI RERI | Sensitivity adjusted RERI | Sensitivity adjusted RERI (Lower .95 CI) |
| bisoprolol | nilotinib | 11.1 | 2.8 | 9.0 | 2.2 |
| ivabradine | furosemide | 1.6 | 1.4 | 1.5 | 1.3 |
| ivabradine | omeprazole | 2.2 | 1.3 | 1.9 | 1.2 |
| insulin glargine | testosterone | 1.5 | 1.2 | 1.4 | 1.1 |
| insulin, isophane | furosemide | 1.5 | 1.2 | 1.4 | 1.1 |
| bisoprolol | sitagliptin | 1.4 | 1.1 | 1.3 | 1.1 |
| sitagliptin | lansoprazole | 1.2 | 1.1 | 1.1 | 1.1 |
| sitagliptin | esomeprazole | 1.2 | 1.1 | 1.1 | 1.1 |
| allopurinol | darbepoetin alfa | 1.4 | 1.1 | 1.2 | 1.0 |
| glimepiride | lansoprazole | 1.3 | 1.1 | 1.2 | 1.0 |
| enalapril | ezetimibe | 1.8 | 1.1 | 1.6 | 1.1 |
| darbepoetin alfa | lansoprazole | 1.7 | 1.1 | 1.5 | 1.0 |
| furosemide | aluminum hydroxide | 1.1 | 1.0 | 1.1 | 1.0 |
| esomeprazole | torsemide | 1.1 | 1.0 | 1.1 | 1.0 |
| colchicine | pantoprazole | 1.1 | 1.0 | 1.1 | 1.0 |
ALI results followed a similar pattern (Table 4), where only one candidate DDI (efavirenz and rifampin) out of the 32 candidate DDIs had lower .95 CI of senstivity adjusted RERI ≥ 2. In contrast, out of 8,479 total candidate DDIs for ALI considered by TwoSides, 1,180 had confidence score 5 and 5,469 had confidence ≥ 3. Once again, this is an implausibly large number.
Table 4:
Top 15 DDIs for ALI ranked by the lower .95 CI of .25 quantile of sensitivity grid
| Drug 1 | Drug 2 | RERI | Lower .95 CI RERI | Sensitivity adjusted RERI | Sensitivity adjusted RERI (Lower .95 CI) |
| efavirenz | rifampin | 279.0 | 110.2 | 223.4 | 88.3 |
| lamivudine | efavirenz | 1.4 | 1.3 | 1.3 | 1.2 |
| lamivudine | tenofovir disoproxil | 1.4 | 1.2 | 1.3 | 1.2 |
| temozolomide | levetiracetam | 1.3 | 1.1 | 1.2 | 1.1 |
| tenofovir disoproxil | efavirenz | 1.2 | 1.1 | 1.2 | 1.0 |
| amoxicillin | clindamycin | 1.1 | 1.1 | 1.1 | 1.0 |
| heparin | esomeprazole | 1.1 | 1.0 | 1.0 | 1.0 |
| tramadol | ramipril | 1.1 | 1.0 | 1.0 | 1.0 |
| tramadol | enoxaparin | 1.1 | 1.0 | 1.0 | 1.0 |
| acyclovir | tacrolimus | 1.0 | 1.0 | 1.0 | 1.0 |
| acetaminophen | cefazolin | 1.1 | 1.0 | 1.0 | 1.0 |
| enoxaparin | esomeprazole | 1.0 | 1.0 | 1.0 | 1.0 |
| ramipril | lansoprazole | 1.0 | 1.0 | 1.0 | 1.0 |
| atenolol | lansoprazole | 1.0 | 1.0 | 1.0 | 1.0 |
| aspirin | ceftriaxone | 1.0 | 1.0 | 1.0 | 1.0 |
5. Discussion
One contribution of this work is to emphasize the importance of causality in DDI detection. Prior DDI detection work has not explicitly targeted drug pairs that mechanistically interact to cause adverse outcomes. Yet mechanistic interaction effects (in which at least some instances of the outcome would only occur if both drugs are taken) are most relevant to public health. Hence, we focus on finding drug-drug-outcome triplets with high RERI, since high RERI implies mechanistic interaction.
Another contribution of this work is to point out the importance of sensitivity analysis to DDI detection. It is very unlikely that observed variables are sufficient to adjust for confounding of any given interaction effect. Especially in a limited dataset like FAERS, common causes of each drug and the outcome will almost certainly be omitted. In the absence of sensitivity analysis, this will lead to false positives and overconfidence that DDIs are truly causal. Sensitivity analysis allows us to incorporate the assumption that we are not actually fully adjusting for confounding into our estimates of interaction effects, hence reducing overconfidence and false positive rates.
Even our sensitivity analysis does not account for all possible sources of bias from analyses of FAERS data. For example, selection on reporting of the adverse event may induce collider bias30. We may also induce bias by adjusting for certain covariates if the assumed causal structure of Figure 1 does not hold and so-called M-structures31 are present or certain covariates are actually influenced by the treatments.
Still, because of our conservative and rigorous approach to causal interaction and confounding, we believe that the candidate DDIs that we surfaced are worthy of further investigation. Each of the candidates that we surfaced with >2 lower confidence bounds on sensitivity adjusted RERI estimates have plausible biomedical mechanisms. In our candidate AMI DDI, one drug (bisoprolol) is known to increase concentration of the other (nilotinib)32. In our candidate DDI for ALI, one drug (efavirenz) is known to cause ALI on its own and higher doses are required when the other drug (rifampin) is being used concurrently33. When alternative screening approaches detect thousands of potential DDIs, it is difficult to justify devoting resources to following up on any one in particular.
The methods we apply in this paper are well known in epidemiology but for some reason have not taken hold in the data mining community that has been most active in DDI detection. We demonstrate that careful consideration of causality and bias can fit neatly into a data mining framework and be applied in automated screening tasks at a large scale.
Figures & Table
References
- 1.Lounkine Eugen, Keiser Michael J, Whitebread Steven, Mikhailov Dmitri, Hamon Jacques, Jenkins Jeremy L, Lavan Paul, Weber Eckhard, Doak Allison K, Côté Serge, et al. Large-Scale prediction and testing of drug activity on side-effect targets. Nature. 2012;486(7403):361–367. doi: 10.1038/nature11159. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Ernst Frank R, Grizzle Amy J. Drug-Related morbidity and mortality: updating the cost-of-illness model. Journal of the American Pharmaceutical Association (1996) 2001;41(2):192–199. doi: 10.1016/s1086-5802(16)31229-3. [DOI] [PubMed] [Google Scholar]
- 3.United States National Center for Health Statistics. Health Number of prescription drugs used in the past 30 days by percentage of the usa population (age-adjusted estimates): Prescription drug use in the past 30 days, by sex, age, race, and hispanic origin: United states, selected years 1988–1994 through 2007–2010. Trends Pharmacol Sci. 2012.
- 4.United States National Center for Health Statistics. Health The fda adverse event reporting system (faers) FDA. 2015.
- 5.Harpaz Rave, DuMouchel William, LePendu Paea, Bauer-Mehren Anna, Ryan Patrick, Shah Nigam H. Performance of pharmacovigilance signal-detection algorithms for the fda adverse event reporting system. Clinical Pharmacology & Therapeutics. 2013;93(6):539–546. doi: 10.1038/clpt.2013.24. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Harpaz Rave, Chase Herbert S, Friedman Carol. BMC bioinformatics. volume 11. BioMed Central; 2010. Mining multi-item drug adverse effect associations in spontaneous reporting systems; p. S7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Tatonetti Nicholas, Patrick Ye, Daneshjou Roxana, Altman Russ. Data-Driven prediction of drug effects and interactions. Science translational medicine. 2012;4(125) doi: 10.1126/scitranslmed.3003377. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Dey Sanjoy, Luo Heng, Fokoue Achille, Hu Jianying, Zhang Ping. Predicting adverse drug reactions through interpretable deep learning framework. BMC bioinformatics. 2018;19(21):476. doi: 10.1186/s12859-018-2544-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Vilar Santiago, Harpaz Rave, Uriarte Eugenio, Santana Lourdes, Raul Rabadan, Carol Friedman. Drug—drug interaction through molecular structure similarity analysis. Journal of the American Medical Informatics Association. 2012;19(6):1066–1074. doi: 10.1136/amiajnl-2012-000935. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Huang Jialiang, Niu Chaoqun, Green Christopher, Yang Lun, Mei Hongkang, Han Jing-Dong. Systematic prediction of pharmacodynamic drug-drug interactions through protein-protein-interaction network. PLoS Comput Biol. 2013;9(3):e1002998. doi: 10.1371/journal.pcbi.1002998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Luo Heng, Zhang Ping, Huang Hui, Huang Jialiang, Kao Emily, Shi Leming, He Lin, Yang Lun. Ddi-cpi, a server that predicts drug–drug interactions through implementing the chemical–protein interactome. Nucleic Acids Research. 42(W1) doi: 10.1093/nar/gku433. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Zhang Ping, Wang Fei, Hu Jianying, Sorrentino Robert. Label propagation prediction of drug-drug interactions based on clinical side effects. Scientific Reports. 2015;5:12339. doi: 10.1038/srep12339. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Gottlieb Assaf, Stein Gideon, Oron Yoram, Ruppin Eytan, Sharanb Roded. Indi: a computational frame-work for inferring drug interactions and their associated recommendations. Mol Syst Biol. 2012;8:592. doi: 10.1038/msb.2012.26. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Cheng Feixiong, Zhao Zhongming. Machine learning-based prediction of drug–drug interactions by integrating drug phenotypic, therapeutic, chemical, and genomic properties. Journal of the American Medical Informatics Association. 2014;21(e2):e278–e286. doi: 10.1136/amiajnl-2013-002512. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Han X, Quinney SK, Wang Z, Zhang P, Duke J, Desta Z, Elmendorf JS, Flockhart DA, Li L. Identification and mechanistic investigation of drug-drug interactions associated with myopathy: A translational approach. Clinical Pharmacology Therapeutics. 2015;98(3):321–327. doi: 10.1002/cpt.150. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Cai Ruichu, Liu Mei, Hu Yong, Melton Brittany L, Matheny Michael E, Xu Hua, Duan Lian, Waitman Lemuel R. Identification of adverse drug-drug interactions through causal association rule discovery from spontaneous adverse event reports. Artificial intelligence in medicine. 2017;76:7–15. doi: 10.1016/j.artmed.2017.01.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Ning Xia, Zhang Chi, Wang Kai, Zhao Zhongming, Mathé Ewy. The international conference on intelligent biology and medicine 2019: computational methods for drug interactions. BMC Medical Informatics and Decision Making. 2020;20(2):1–2. doi: 10.1186/s12911-020-1051-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Noren Niklas, Sundberg Rolf, Bate Andrew, Edwards Ralph. A statistical methodology for drug–drug interaction surveillance. Stat Med. 2008;27(16):3057–3070. doi: 10.1002/sim.3247. [DOI] [PubMed] [Google Scholar]
- 19.Iyer Srinivasan, Harpaz Rave, LePendu Paea. Anna Bauer-Mehren, and Nigam Shah. Mining clinical text for signals of adverse drug-drug interactions. J Am Med Inform Assoc. 2014;21(2):353–62. doi: 10.1136/amiajnl-2013-001612. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Harpaz R, Perez H, Chase HS, Rabadan R, Hripcsak G, Friedman C. Biclustering of adverse drug events in the fda’s spontaneous reporting system. Clin Pharmacol Ther. 2011;89(2):243–250. doi: 10.1038/clpt.2010.285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Yan Su, Jiang Xiaoqian, Chen Ying. IEEE Int Conf Bioinformatics Biomed Proceedings. volume 2013. IEEE; 2013. Text mining driven drug-drug interaction detection; pp. 349–355. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Cai Ruichu, Liu Mei, Hu Yong, Melton Brittany, Matheny Michael, Xu Hua, Duan Lian, Waitman Lemuel. Identification of adverse drug-drug interactions through causal association rule discovery from spontaneous adverse event reports. Artificial intelligence in medicine. 2017;76 doi: 10.1016/j.artmed.2017.01.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Rubin Donald. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology. 1974;66(5):688. [Google Scholar]
- 24.VanderWeele Tyler, Knol Mirjam. A tutorial on interaction. Epidemiological Methods. 2014;3(1):33–72. [Google Scholar]
- 25.VanderWeele Tyler. Explanation in causal inference: methods for mediation and interaction. Oxford University Press; 2015. [Google Scholar]
- 26.Kuhn Michael, Letunic Ivica, Jensen Lars Juhl, Bork Peer. The sider database of drugs and side effects. Nucleic acids research. 2016;44(D1):D1075–D1079. doi: 10.1093/nar/gkv1075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.VanderWeele Tyler. Principles of confounder selection. European Journal of Epidemiology. 2019;34 doi: 10.1007/s10654-019-00494-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Rosenbaum Paul, Rubin Donald. Assessing sensitivity to an unobserved binary covariate in an observational study with binary outcome. Journal of the Royal Statistical Society: Series B (Methodological) 1983;45(2) [Google Scholar]
- 29.Ding Peng, VanderWeele Tyler. Sensitivity analysis without assumptions. Epidemiology. 2016;27(3) doi: 10.1097/EDE.0000000000000457. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Hernan Miguel, Hernandez-Diaz Sonia, Robins James. A structural approach to selection bias. Epidemiology. 2004. pp. 615–625. [DOI] [PubMed]
- 31.Liu Wei, Brookhart Alan, Schneeweiss Sebastian, Mi Xiaojuan, Setoguchi Soko. Implications of m bias in epidemiologic studies: a simulation study. American journal of epidemiology. 2012;176(10):938–948. doi: 10.1093/aje/kws165. [DOI] [PubMed] [Google Scholar]
- 32.Haouala A, Widmer N, Montemurro M, Buclin T, Decosterd L. Drug interactions with the tyrosine kinase inhibitors imatinib, dasatinib, and nilotinib. Blood. 2011;117(8):75–87. doi: 10.1182/blood-2010-07-294330. [DOI] [PubMed] [Google Scholar]
- 33.Lopez-Cortes LF, Ruiz-Valderas R, Viciana P, Alarcon-Gonzalez A, Gomez-Mateos J, Leon-Jimenez E, Sarasanacenta M, Lopez-Pua Y, Pachon J. Pharmacokinetic interactions between efavirenz and rifampicin in hiv-infected patients with tuberculosis. Clinical Pharmacokinetics. 2002;41(9):681–690. doi: 10.2165/00003088-200241090-00004. [DOI] [PubMed] [Google Scholar]


