Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2019 Feb 1.
Published in final edited form as: Clin Pharmacol Ther. 2018 Feb;103(2):177–179. doi: 10.1002/cpt.949

The next generation of drug safety science: coupling detection, corroboration, and validation to discover novel drug effects and drug-drug interactions

Nicholas P Tatonetti 1,2,3,4
PMCID: PMC6005687  NIHMSID: NIHMS926452  PMID: 29313964

Introduction

Rare adverse drug reactions and drug-drug interactions are difficult to detect in randomized trials and impossible to prove using observational studies. We must ascribe to a new way of conducting research that has the efficiency of a retrospective analysis and the rigor of a prospective trial. This can be achieved by integrating observational data from humans with laboratory experiments in model systems. The former establishes clinical significance and the latter supports causality.

Background

Adverse drug reactions are a leading cause of morbidity and mortality around the world. The occurrence of severe and unexpected side effects reduces the usefulness of pharmaceutical treatments. Some reactions are common across all who are treated. However, often, adverse reactions may only occur in a subpopulation of patients. Drug-drug interactions are an example of this. The patients taking either drug alone will not experience the reaction while the subset of patients exposed to both will. Pharmacogenetic variants that modulate drug response or co-morbidities that increase the risk of adverse reactions are additional examples. The challenge, of course, is discovering these subpopulation risk factors without any prior knowledge of them.

Detection of subpopulation-specific adverse reactions is statistically challenging. Clinical trials do not have the power to study drug-drug interactions or to evaluate specific subgroups. On the other hand, retrospective studies of real world clinical data have the diversity and the size to detect these patterns, but they also have unavoidable biases and systematic errors1 that produce dubious discoveries (Box 1). A new approach to drug safety research that takes the best aspects of both methods is needed -- combining the validity of a prospective trial with the opportunity for discovery provided by observational data.

Box 1. Data mining in a forest of bias.

A simple thought experiment illustrates the difficulty introduced by biases in observational data. Consider a small forest populated by red birds and blue birds. Suppose that we would like to answer the question of which bird is more prevalent in this small forest. We decide to venture into this forest and count what we see. Upon analysis of our data, we find that we observed five times as many blue birds as red birds. It is tempting to conclude that there are more blue birds than red birds in our hypothetical forest, however, this would be incorrect. In fact, we cannot make any reliable conclusions from these data. To explain, let me now reveal something about these birds. Blue birds are friendly and curious birds. When they hear people enter the forest they fly to them to investigate and show off. Red birds, on the other hand, are scared and grumpy birds. When they hear people enter the forest they fly away or hide. The behaviors of these birds, therefore, changes - or confounds - what we can observe, making the data we collected useless for the question we wanted to answer. This is not a hypothetical problem. This is exactly what occurs every day in every clinic and hospital when patient data are collected in the EHR. What we can observe is dictated by the behaviors of the patients, doctors, and nurses. Patients do not admit themselves to the hospital at random and neither do physicians prescribe medications that way.

I propose the use of a three-step methodology (Figure 1) to discover and validate novel adverse drug reactions and drug-drug interactions. The first step, detection, is to data mine a large observational resource, like the FDA's Adverse Event Reporting System (FAERS), for unexpected associations between drugs and adverse events. This procedure will produce thousands of significant hypotheses and, as is true in any data mining experiment, a large proportion of those are expected to be false discoveries. There are many signal detection and statistical data mining algorithms to choose from when detecting new drug-adverse event hypotheses2. The precise choice of method will depend on the adverse event outcome of interest and the balance desired between false positives and novel discoveries.

Figure 1. The three steps to more rigorous and efficient drugs safety surveillance.

Figure 1

Step 1. Detection. Modern signal detection and statistical data mining method are used to identify new drug safety and drug-drug interaction signals. Methods include traditional approaches, like disproportionality analysis with statistical corrections, and newer methods, like supervised machine learning and pattern detection. This step produces a lot of statistically significant associations but also a lot of false discoveries. Step 2. Corroboration. Mined hypotheses are evaluated against an independent dataset and evaluated for plausibility. These additional data could score drug effect hypotheses by their molecular connection to the known effect or provide additional clinical evidence from alternative resources. Any hypotheses that does not corroborate is removed from consideration, greatly reduces the number of false discoveries. At the end of this stage only a few to dozens of the strongest hypotheses remain. Step 3. Validation. Corroborated hypotheses are validated using a model system. Model systems may be molecular assays (e.g. chemical-protein binding affinity), cellular systems, or animal models, depending on which model is best suited for the predicted adverse reaction outcome. Drug effects that validate in all three steps have demonstrated clinical importance in humans (steps 1 and 2) and have evidence of causality (step 3). This process is both more efficient than a clinical trial and more rigorous than a retrospective analysis.

The second step, corroboration, is to integrate an independent resource (e.g. chemical network biology data or electronic health records) to prioritize the data mined hypotheses based on their plausibility. A drug that can be molecularly connected to the adverse event is more likely to be true than one that cannot. Chemical informatics methods and systems pharmacology models that use drug binding and pathway data can be used to find these molecular connections3,4. Likewise, a drug effect that can be independently replicated in electronic health records is more likely than one that cannot. This step is also performed on already collected data using computational methods and therefore can be applied to all of the mined hypotheses from the first step. This will reduce the number of significant hypotheses from thousands to hundreds or tens. The limitation here is that these models are specific to the adverse reaction being studied. For example, the same molecular model -- or electronic health record phenotype -- used for studying arrhythmias will likely not apply to glucose metabolism.

In the final step, validation, a model system is identified to experimentally test the strongest hypotheses. This is the most challenging step as many adverse reactions do not map clearly to experimental systems. It is also the most important since a hypothesis is only as good as it is falsifiable. Ideally, the experiment will be efficient with a straightforward interpretation, like a protein affinity assay. Often, however, a more complex experiment in a cellular or animal model is needed. In either case, this is both an efficient and ethical method for validating adverse drug reactions compared to launching a prospective clinical trial. These three sources of evidence, from detection to corroboration to validation, provide a compelling case for the newly discovered effect.

Discovery in practice

This strategy of detection, corroboration, and validation is not hypothetical, but one that my colleagues and I have used successfully. First, to show that co-medication of paroxetine and pravastatin leads to increased blood glucose5 and later to prove that the combination of ceftriaxone and lansoprazole prolongs the QT interval6. In both cases, we used Latent Signal Detection7 to mine unexpected drug-drug interactions from FAERS. Latent Signal Detection is a supervised machine learning algorithm that can identify associations with adverse reactions even if they are not explicitly reported. We trained one model to identify glucose dysregulation5 and the other for acquired long QT syndrome (LQTS)7. Hundreds of putative drug-drug interactions were hypothesized for each effect. In our experience, as many as 90–95% of these hypotheses will be false positives. Underscoring the importance of a computational method to corroborate these putative drug effects for plausibility.

To corroborate, or refute, the data mined hypotheses we used an independent analysis of electronic health record data. For glucose dysregulation (GD) we pulled glucose lab results run before and after exposure to the drugs, individually and in combination. Likewise, for LQTS we extracted the corrected QT value from the electrocardiogram reports of patients exposed to either or both drugs. In both, the analysis we used was efficient and could be applied to each of the mined hypotheses relatively quickly, enabling us to eliminate most of the hypotheses as implausible. At the end of this stage, we have corroborated drug-drug interactions with evidence of a clinical effect in humans from two independent analyses based on two different observational datasets. Of those that remained, we prioritized them by their effect sizes and potential for clinical impact for experimental validation.

The final step, validation, is used to prove that these associations are due to the combination of drug exposures and not because of some confounding variable. We attempted validation on the top prediction from each model (paroxetine and pravastatin for GD and ceftriaxone and lansoprazole for LQTS). In neither case, did we know the mechanism of the interaction and we took two different strategies for validation. For paroxetine and pravastatin, we used an insulin resistant mouse model, giving us the greatest chance to observe the hypothesized effect regardless of if the interaction occurred at glucose transport, insulin secretion, or gluconeogenesis. For ceftriaxone and lansoprazole, we chose a cellular assay of the drugs' effect on the hERG channel. This potassium ion channel is essential in heart rhythm and the blocking of this channel is the mechanism of action for all drugs known to prolong the QT interval. In both cases, the drug-drug interaction validated5,6 and in both cases it was the only predicted DDI we tested -- resource limitations prohibited the evaluation of additional hypotheses.

The three steps of detection, corroboration, and validation produce findings that are reproducible. Our discovery that co-medication of paroxetine and pravastatin increases blood glucose has been subsequently reproduced in rats8 and humans9 by independent groups. Similarly, in response to our publication of ceftriaxone and lansoprazole prolonging the QT interval, additional case reports have surfaced in support of the interaction10.

Discussion

The drug safety community is facing an insurmountable challenge when trying to accurately identify rare adverse reactions and drug combination effects using current pharmacovigilance methods. There are simply too many covariates, strata, and interactions for these effects to be discovered in the clinical trial phase. Retrospective methods are sensitive to biases and produce far too many false positives. The three-step process of detection and corroboration in human data coupled with validation in experimental model systems is a balance of the two -- more efficient than randomized controlled trials and more rigorous than observational analysis. The strategy of integrating computational analysis with experiments is not unique to drug safety science. Others have used similar approaches in drug repositioning, drug target prediction, and drug design.

Ultimately, the goal of any pharmacovigilance study is to alter clinical practice to improve patient safety. This is often achieved through actions taken by regulatory agencies like the FDA and the European Medicines Agency. These agencies would, of course, prefer evidence sourced from randomized controlled trials when issuing warnings or requiring withdrawals. In many cases, however, the data are not available. Discoveries supported by the three levels of evidence I present here should be considered adequate to make product safety decisions. The human data from spontaneous reporting systems and electronic health records establishes the clinical importance of the effect (or the lack thereof if that is what the data show). The prospective experiment in a model system establishes that the clinical effect observed is being caused by the hypothesized agent. This combination of clinical significance and causality evidence is as strong as a well-powered randomized trial and represents the most viable path forward for meaningful and impactful drug safety science.

Acknowledgments

Funding

Author is funded by NIGMS R01 GM107145 and NCATS OT3 TR002027.

Footnotes

Conflict of Interest

Author is a paid advisor to Advera Health, Inc. Author declares no other conflicts of interest.

References

  • 1.Hripcsak G, Albers DJ. Next-generation phenotyping of electronic health records. J Am Med Inform Assoc. 2013;20:117–121. doi: 10.1136/amiajnl-2012-001145. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.van Puijenbroek EP, et al. A comparison of measures of disproportionality for signal detection in spontaneous reporting systems for adverse drug reactions. Pharmacoepidemiol Drug Saf. 2002;11:3–10. doi: 10.1002/pds.668. [DOI] [PubMed] [Google Scholar]
  • 3.Lorberbaum T, et al. Systems Pharmacology Augments Drug Safety Surveillance. Clin. Pharmacol. Ther. 2014;97:151–158. doi: 10.1002/cpt.2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Jacunski A, Tatonetti NP. Connecting the Dots: Applications of Network Medicine in Pharmacology and Disease. Clin. Pharmacol. Ther. 2013;94:659–669. doi: 10.1038/clpt.2013.168. [DOI] [PubMed] [Google Scholar]
  • 5.Tatonetti NP, et al. Detecting Drug Interactions From Adverse-Event Reports: Interaction Between Paroxetine and Pravastatin Increases Blood Glucose Levels. Clin. Pharmacol. Ther. 2011;90:133–142. doi: 10.1038/clpt.2011.83. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Lorberbaum T, et al. Coupling Data Mining and Laboratory Experiments to Discover Drug Interactions Causing QT Prolongation. Journal of the American College of Cardiology. 2016;68:1756–1764. doi: 10.1016/j.jacc.2016.07.761. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Lorberbaum T, Sampson KJ, Woosley RL, Kass RS, Tatonetti NP. An Integrative Data Science Pipeline to Identify Novel Drug Interactions that Prolong the QT Interval. Drug Saf. 2016;39:433–441. doi: 10.1007/s40264-016-0393-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Li F, et al. Co-administration of paroxetine and pravastatin causes deregulation of glucose homeostasis in diabetic rats via enhanced paroxetine exposure. Acta Pharmacol. Sin. 2014;35:792–805. doi: 10.1038/aps.2014.24. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.An L, Ravindran PP, Renukunta S, Denduluri S. Co-medication of pravastatin and paroxetine-a categorical study. J Clin Pharmacol. 2013;53:1212–1219. doi: 10.1002/jcph.151. [DOI] [PubMed] [Google Scholar]
  • 10.Lazzerini PE, Bertolozzi I, Rossi M. Combination Therapy With Ceftriaxone and Lansoprazole, Acquired Long QT Syndrome, and Torsades de Pointes Risk. doi: 10.1016/j.jacc.2016.11.090. (Journal of the …, 2017).at< http://www.onlinejacc.org/content/69/14/1876.full?sso=1&sso_redirect_count=5&access_token=>. [DOI] [PubMed]

RESOURCES