Abstract
Purpose of Review
An important component of the Food and Drug Administration’s Sentinel Initiative is the active post-market risk identification and analysis (ARIA) system, which utilizes semi-automated, parameterized computer programs to implement propensity-score adjusted and self-controlled risk interval designs to conduct targeted surveillance of medical products in the Sentinel Distributed Database. In this manuscript, we review literature relevant to the development of these programs and describe their application within the Sentinel Initiative.
Recent Findings
These quality-checked and publicly available tools have been successfully used to conduct rapid, replicable, and targeted safety analyses of several medical products. In addition to speed and reproducibility, use of semi-automated tools allows investigators to focus on decisions regarding key methodological parameters. We also identified challenges associated with the use of these methods in distributed and prospective datasets like the Sentinel Distributed Database, namely uncertainty regarding the optimal approach to estimating propensity scores in dynamic data among data partners of heterogeneous size.
Summary
Future research should focus on the methodological challenges raised by these applications as well as developing new modular programs for targeted surveillance of medical products.
Keywords: surveillance, pharmacoepidemiology, Sentinel, ARIA, targeted, drug safety
Introduction
Targeted surveillance using multiple large databases is an increasingly important part of post-market medical product safety in the United States [1, 2]. Such active, semi-automated surveillance is characterized by evaluation of specific exposure-outcome relations of interest in longitudinal, routinely-collected healthcare data, and provides safety information that complements passive surveillance or spontaneous reporting systems. Spontaneous reporting systems -- the Vaccine Adverse Event Reporting System (VAERS) and the Food and Drug Administration (FDA) Adverse Event Reporting System (FAERS) for vaccines and drugs, respectively -- have traditionally been the backbone of medical product safety monitoring, but they are limited by selective reporting [3]. Further, and most importantly, they lack denominator data to allow for accurate quantification of the magnitude of associations of interest.
Longitudinal databases that capture routinely-collected healthcare utilization information have long been available to evaluate pharmacoepidemiologic questions of interest. However, single databases often lack the needed sample size to identify rare safety issues that also elude detection in relatively small clinical trial populations, necessitating multi-database studies in the post-market setting. Traditionally, multi-database or multi-site studies required data use agreements to be signed among all participating databases to allow their data to be centralized for analysis.
A newer model has emerged – the distributed data network [4, 5] – in which multi-site, routinely-collected, longitudinal observational data are used without requiring databases to share patient-level data with each other. One of the first distributed surveillance systems for medical products in the United States was the Vaccine Safety Datalink (VSD), initiated in 1990 [6–9]. The Centers for Disease Control and Prevention (CDC) partnered with four health maintenance organizations across the country who agreed to provide de-identified information on vaccine usage, basic demographic characteristics, and medical records to the CDC, but did not share their data with other databases. That model would go on to be replicated in other national networks.
In 2008, the FDA launched the Sentinel Initiative to create a national, actively-monitored, and distributed data system to evaluate the safety of regulated medical products in the United States [10•, 11]. The Sentinel Infrastructure, a component of the Sentinel Initiative (see Figure 1), uses a common data model to facilitate distributed analyses across multiple data partners who might otherwise format data differently [12]. The distributed data network includes 17 observational databases that cover 223 million lives with 43 million patients currently accruing new data [10, 13, 14]. Data in the Sentinel Infrastructure can also support analyses performed with customized programming according to fully developed protocols, including large observational analyses [15, 16] and cluster randomized trials [17].
Contained within the Sentinel Initiative is the congressionally-mandated active post-market risk identification and analysis (ARIA) system [18]. While the same underlying electronic data model supports both the ARIA system and the greater Sentinel Infrastructure, the distinguishing feature of ARIA is that it is limited to pre-programmed, reusable, quality-controlled and parameterized computer programs or queries. Thus, ARIA can rapidly support public health safety surveillance, as no de novo programming is required.
In brief, these queries or computer programs work by executing on each participating data partner's data that have been formatted into the Sentinel Common Data Model behind their firewalls, and then returning non-identifiable aggregate information to a central analysis center for final synthesis [19••]. In addition to leveraging the interoperability of electronic health information in a multi-site setting, such a public-private partnership allows data partners to maintain control of their data and protect patient privacy [4, 5].
Here, we focus on two programs used within ARIA for assessing associations between medical products and health outcomes of interest: the propensity-score (PS) adjusted active comparator analysis and the self-controlled risk interval design. We first discuss prior work leading to their development as pre-programmed analysis tools before moving on to describing their applications within the Sentinel System, highlighting unique challenges associated with their use in conducting safety surveillance across distributed databases.
Propensity Score Adjusted Analyses
In this section, we describe literature relevant to the development of the PS adjusted active comparator analysis tool in ARIA. Summary scores have a long history as a confounding adjustment technique in observational designs because they summarize patient characteristics into a single number [20–24], enabling efficient adjustment for a large number of potential confounders while preserving patient privacy and limiting transfer of patient-level data. Two types of summary scores are currently used: PSs, [25], which model the relationship between potential confounders and the exposure, and disease risk scores (DRSs) [26], which model the relationship between potential confounders and the outcome.
Both scores can be advantageous in the setting of newly marketed medications [27, 28]. A DRS-based approach may be preferable in the very early post-market period when there are too few exposed patients or too few outcomes to support PS modeling or traditional outcome regression models [22]. In particular, a DRS can be developed in an historical cohort of comparator patients and the resultant coefficients can be used to adjust for confounding in the analysis cohort without the need for fitting a statistical model in this group. When PS models can be fit in the analysis cohort, they tend to perform at least as well as DRS-based approaches [29]. PS approaches have the additional benefit that covariate balance after adjustment is easily displayed in a traditional baseline covariate characteristics table. For these reasons, PS based methods were implemented as a parameterized ARIA targeted surveillance tool.
Adjusting for a large number of covariates in a PS potentially allows an investigator to partially control for important confounders that are unmeasured or incompletely captured in electronic healthcare databases, such as frailty, through proxy adjustment. For example, variables that might not be direct risk factors of an outcome, such as oxygen tank usage, may in fact be correlated with underlying frailty and worth including in a PS model. Such proxy adjustment is central to the high-dimensional propensity score (hd-PS) approach, which is an algorithm that empirically identifies large numbers of potential confounding variables for inclusion in a PS model by ranking them based on their potential to cause bias [30].
Propensity scores in small samples
In the early post-market period, it is in the interest of patients and regulators to detect safety issues as soon as possible. However, in this period there will typically be few outcomes and few users of the new medical product relative to a comparator. Few new users of the target medical product can be problematic for PS methods, and especially for hd-PS, which typically includes hundreds of potential confounders in the PS model. To address this concern, the performance of hd-PS in settings where there are few exposed patients or outcome events was evaluated by re-sampling from four new user cohorts identified in administrative claims data to create increasingly smaller cohorts [31]. In these down-sampled cohorts, the authors examined the performance of hd-PS and sought to define the optimal number of empirically identified covariates to include. When there were greater than 50 exposed patients with an event, the hd-PS algorithm that ranked potential confounders by their relationships with both exposure and outcome, termed the bias-based algorithm, performed well in terms of confounding adjustment. When there were fewer than 50 exposed events, ranking covariates for inclusion based only on their association with exposure yielded better performance.
Propensity scores in distributed data settings
Compared with analyses in single databases, performing analyses in multiple databases improves statistical power, which can enable more rapid identification of potential medical product safety issues. A challenge of the distributed data setting is attaining confounding control while simultaneously minimizing or eliminating sharing of patient-level data. Three recent studies compared various methods for achieving this goal, which included comparing full patient-level data sharing, cell-aggregated sharing, distributed regression analysis, meta-analysis, and PS-based methods [27, 32, 33]. These evaluations concluded that PS methods provided adequate flexibility, confidentiality, and analytic integrity in distributed data settings. There are several ways of employing PS-based methods in distributed environments. In its simplest form, individual data partners estimate a PS within their own data and share only the patient-level PS, along with a randomly generated patient identifier, exposure and outcome status, and relevant subgroup indicators with a central analysis hub like the Sentinel Operations Center. The feasibility of this patient-level PS-sharing approach was demonstrated in a cohort of new users of clopidogrel and proton pump inhibitors versus new users of clopidogrel alone monitored for risk of myocardial infarction. Data were shared among four data partners without compromising patient privacy by using PSs [32]. Since the development of the ARIA targeted surveillance program for PS adjustment, further adaptations of the program have employed a risk set-based [34] approach that enables a PS-adjusted analysis in a distributed setting without sharing patient-level PSs [35]. The risk-set based approach has been shown to provide statistically equivalent results to a pooled analysis and has been recently applied to actual safety assessments in the Sentinel Distributed Database [15, 16].
Propensity scores in prospective analyses
While PS methods have demonstrated utility in distributed data, prospective analyses, where data periodically update over time, provide a unique set of challenges. To assess PS performance in prospective data, a recent study developed an approach to prospective, targeted safety monitoring and evaluated it using three drug-outcome pairs with known or suspected safety issues: paroxetine and severe upper gastrointestinal bleed, lisinopril and angioedema, and ciprofloxacin and Achilles tendon rupture [36]. The authors divided a single dataset into sequential monitoring periods to mimic the prospective accumulation of data, and 1:1 matched new users of the study drugs within each monitoring period on a PS estimated only among those new users in each period. Two alerting algorithms were compared: a maximized sequential probability ratio test [37] and a method that generated an alert if the effect estimate exceeded a predefined threshold for more than 3 consecutive monitoring periods. Alerts were generated by at least one of the alerting algorithms in each example over the follow-up period, demonstrating the feasibility of such a PS-matched prospective monitoring system to quickly identify adverse events. It is important to note that while this assessment emulated prospective analyses, the dataset was static and therefore did not account for issues related to changing data over time that can occur when using dynamic datasets in the Sentinel Distributed Database.
Propensity scores in both distributed settings and prospective analyses
Given the success of PS methods in both distributed data and prospective analyses separately, the next step was to assess PS methods in distributed, prospective data simultaneously. One of the concerns regarding automated safety surveillance systems in multiple databases is the opportunity for many false positives or false negatives [38]. As a first step in evaluating these concerns, a PS-based semi-automated safety monitoring approach was utilized to assess the safety of three medications across three claims databases [39]. Data from each database were divided into three-month intervals to mimic prospective accumulation of data, and new users of the three study drugs were 1:1 PS matched to new users of comparator products within each interval. The three drug-outcome relationships were rosuvastatin and rhabdomyolysis, rosuvastatin and diabetes mellitus, and telithromycin and hepatotoxicity. As expected based on prior research, none of these three examples generated a safety alert, providing some reassurance about the system's robustness against false positives. By sharing only PSs with a central data analysis center, confounding was mitigated without compromising patient privacy.
This semi-automated program served as the precursor program to the PS-adjusted active comparator analysis tool utilized in ARIA, and a subsequent paper described the program in more detail and discussed the steps required to conduct prospective semi-automated surveillance using PS matching in distributed databases [19••]. The PS adjustment tool has several functions and includes each of the features noted above. When tested on retrospective data for nine drug-outcome pairs, which included positive and negative controls, the tool generated results consistent in direction and magnitude with expectations for each. Alerts were generated for all positive controls when power was adequate, which included the three known positive associations described above. These findings further demonstrated the ability of the system to avoid generating false positives and negatives.
The PS adjustment precursor program was also applied to a true prospective safety monitoring scenario in which data accumulated in real time to evaluate the risk of hemorrhagic and ischemic events in patients taking prasugrel versus clopidogrel during the first two years of prasugrel's market availability in the US [40•]. Using the prototype, patients were matched on hd-PS and followed for outcomes using dynamic data that were updated on a bi-monthly basis. In general, results were consistent with those from randomized trials. While the first monitoring period had to be lengthened to accumulate enough users to validly estimate the hd-PS, the tool performed well and demonstrated the feasibility of this type of analysis in a prospective, distributed database setting like Sentinel. In light of these successful applications, the semi-automated PS adjusted active comparator design tool was officially implemented into the Sentinel Initiative as part of ARIA [41, 42].
Applications within the Sentinel Distributed Database
The PS adjustment tool in use by ARIA in the Sentinel Initiative can perform propensity score matching (both 1:1 and 1:n) and propensity score stratification [43]. It has been used to perform both retrospective safety assessments as well as prospective safety assessments within the Sentinel Distributed Database. Retrospective safety assessments are one-time analyses in which the analysis is performed using all existing data. Retrospective safety assessments included the following drug-outcome pairs: angiotensin converting enzyme inhibitors (ACEIs) vs. beta blockers on angioedema [44•], clindamycin vs. penicillin on risk of C. difficile infection, glyburide vs. glipizide on hypoglycemia [45], warfarin vs. statins on bleeding, dabigatran vs. warfarin on bleeding/acute myocardial infarction, apixaban vs. warfarin on gastrointestinal hemorrhage and stroke, niacin vs. fenofibrate on bleeding and stroke, and levetiracetam vs. topiramate or lamotrigine on agranulocytosis. The ACEI, clindamycin, glyburide, and warfarin vs. statins assessments were all test cases in which elevated outcome risk associated with the primary exposure was expected. These assessments were performed as empirical test cases as opposed to novel regulatory questions or considerations.
The PS adjustment tool has also been used for one prospective safety assessment, in which data regularly accumulate in real time and are re-analyzed after each update. The prospective queries targeted rivaroxaban vs. warfarin on gastrointestinal hemorrhage and stroke risk. A full description of each retrospective and prospective query has been provided elsewhere [46••]. A high-level summary of each query is available in Table 1.
Table 1.
Query | Outcomes | #DPs Returned/ Sent |
Query Period1 |
---|---|---|---|
| |||
Test cases of known associations | |||
| |||
ACEI vs. beta-blockers | Angioedema | 13/17 | 01/01/2001–09/30/2013 |
| |||
Clindamycin vs. oral penicillins | C.Difficile infection | 13/14 | 01/01/2006–12/31/2013 |
| |||
Glyburide vs. Glipizide | Hypoglycemia | 7/13 | 01/01/2008–12/31/2014 |
| |||
Warfarin vs. statins | Bleed | 14/14 | 01/01/2012 – 09/30/2015 |
| |||
One-time drug safety assessments | |||
| |||
Dabigatran vs. warfarin | ICH | 4/4 | 11/01/2010–12/31/2013 |
GI Bleed | |||
Ischemic Stroke | |||
AMI | |||
| |||
Apixaban vs. warfarin | GI Bleed | 4/4 | 02/01/2013–05/31/2015 |
ICH | |||
Stroke | |||
| |||
Niacin vs. fenofibrate | GI Bleed | 4/4 | 01/01/2007–05/31/2013 |
ICH | |||
Stroke | |||
| |||
Levetiracetam vs. lamotrigine/topiramate | Agranulocytosis | 10/17 | 01/01/2000–10/31/2013 |
| |||
Prospective drug safety assessments | |||
| |||
Rivaroxaban vs. warfarin | Ischemic Stroke | 4/4 | 11/01/2011–4/30/2015 |
ICH | |||
GI Bleed |
ACEI – Angiotensin-Converting Enzyme Inhibitor; AMI - Acute Myocardial Infarction; DP – Data Partner; GI - Gastrointestinal; ICH - Intracranial hemorrhage
Query period represents available data across all Data Partners included in that query
Self-Controlled Risk Interval Design
The self-controlled risk interval design is a specific application of the self-controlled case series design in which eligible patients are those exposed to a medical product of interest and are indexed on exposure. This contrasts with a case-crossover study, in which eligible patients are indexed upon occurrence of the outcome. However, both types of self-controlled designs control for all time-invariant confounding so most complicating factors in self-controlled designs involve adding additional model terms to account for time-varying confounding [47]. While the self-controlled case series design has been used extensively in epidemiological research [48–51], the self-controlled risk interval design was first used in the VSD and the Post-licensure Immunization Safety Monitoring system [52] for vaccine safety studies [53–60]. In this design, an exposed person acts as his or her own control and the comparison occurs between a period when the person is thought to be at heightened risk for experiencing a health outcome of interest (i.e., the risk window) and a period without heightened risk (i.e., the comparison window). Post-exposure comparison windows (as compared to pre-exposure comparison windows) are typically used because experiencing the outcome is often presumed to affect the person’s likelihood of receiving a subsequent vaccination (i.e., the healthy vaccinee effect) [61, 62]. This design was also used in the Sentinel System before it was standardized into a pre-parameterized usable program for use in ARIA [63•–65]. Of these prior uses, two were retrospective analyses and one was prospective sequential surveillance. Like the PS adjusted tool described above, a prototype of the pre-parameterized program was tested in a dynamic data setting to ensure its performance with a known association between an exposure-outcome pair [66].
Since the conversion of the self-controlled risk interval design into a semi-automated program, it has been planned to prospectively monitor the safety of nine-valent human papilloma virus vaccine [67] and has been used retrospectively to assess seizures following ranolazine exposure [68].
Discussion
We described applications of semi-automated, targeted evaluations of particular exposure-outcome pairs of interest in the Sentinel Distributed Databases using one of two quality-checked, parameterized computer programs: the PS adjustment tool and the self-controlled risk interval design tool. Both of these epidemiologic designs were well-studied in single database settings and were adapted for multi-site distributed database uses when minimizing transfer of patient-level data is of paramount importance. When these programs have been piloted on known medical product-outcome associations, they have correctly generated safety alerts and avoided false positives while maintaining patient privacy. The successful performance of the program allowed investigators to confidently apply it to true safety analyses to support post-market medical product regulatory decisions.
The successful development and application of pre-written, semi-automated surveillance programs for both drugs and vaccines has highlighted several of their strengths. The primary advantage of converting these commonly used study designs into semi-automated, pre-written programs is that safety analyses can be performed more quickly than if these designs had to be repeatedly programmed de novo. In the immediate post-approval setting, where important adverse effects may be unknown due to their rarity or differences in patient populations from the pre-market trials, rapidly identifying adverse events is critical to patient safety. Another strength of pre-programmed tools is that by automating programming, investigators have more time to spend considering key clinical and epidemiological decision points such as the relevant confounders and statistical analysis plan. Pre-programmed tools also ensure results are consistent and reproducible by preventing coding errors that can occur from ad hoc programming.
The applications described above have also identified certain challenges when conducting targeted surveillance in distributed databases. First, the constituent databases of the Sentinel Distributed Database are of heterogeneous size. When performing PS estimation, the model may fail to converge at a subset of smaller databases as a result of small sample size. The best approach for estimating PSs in small data partners within distributed data networks is still unknown, and such problems may be an inherent limitation of distributed databases like Sentinel, if small data partners are included. An alternative in such situations could be to instead use a DRS for confounding adjustment, especially if the outcome of interest is more common than new use of the target therapy. In settings where outcomes are also rare, use of historical data from the time period immediately prior to the introduction of the target therapy may be preferable for DRS development [22]. Databases of heterogeneous size are not as challenging to deal with when using a self-controlled risk interval design, however only databases with recorded outcomes are informative to the analysis.
Second, sequential analyses of prospectively accumulating data present a distinct set of methodological challenges. The Sentinel Distributed Database is a dynamic database and data are continually refreshed over time. Claims adjustments and late-arriving data can result in changes to a patient’s exposure, outcome, or covariate status over time. For example, a patient who experiences the outcome of interest in the first monitoring period may not be classified as such until the second monitoring period. However, sequential statistical analyses, first used in the clinical trial setting, depend (or are anchored on) data in prior monitoring periods remaining stable and constant [69]. Historically, in the VSD, follow-up time for prospectively monitored patients was limited to a pre-defined and short observational window. Because of this, data were allowed to stabilize following the completion of the observational window before they were ready for analysis. A notable exception to this stabilization occurs with influenza vaccine safety surveillance when partially-accrued data must be analyzed in order to have actionable information during the ongoing influenza season [54, 60, 65, 69].
In the pre-parameterized self-controlled risk interval design tool in ARIA, patients are ascertained in the analytic dataset only after they have had the opportunity to complete their full observation window. Thus, a patient’s relevant data are contained within a single snapshot of the dynamic dataset. However, outcomes that have longer time-to-onset (on the order of months) and that further require an investigator to follow the same patient over multiple data refresh periods are particularly challenging. In PS adjusted analyses, follow-up time is often “as treated,” i.e., as long as the patient is actively exposed to the medical product. Therefore, it is not feasible to prospectively monitor a patient’s entire relevant follow-up in a single snapshot of the dynamic dataset. Adapting sequential statistical analyses to cope with dynamic data over multiple data refreshes in an observational database is the subject of ongoing methods work.
While PS-based methods have been successful, alternative methods for analyzing distributed data networks while maintaining patient privacy are available. The PS adjustment tool utilizes matching and stratification, but in theory could be extended to allow for inverse probability of treatment weighting. The tool could also be extended to alternative summary scores such as the DRS, which would offer advantages in situations where outcomes are more common than new initiators of the medical product of interest as discussed above [22].
A distinct method which does not rely on covariate summary scores is distributed regression analysis, in which data partners run regression models locally and then share only intermediate summary statistics with a central analysis hub. Distributed regression has been proven to provide identical results as a fully pooled analysis [27, 70, 71]. Like the other programs developed in Sentinel, distributed regression requires a common data model and shares only aggregated information that preserves patient privacy. Efforts to automate this methodology in the Sentinel System are underway.
Conclusions
In conclusion, the development of semi-automated, parameterized computer programs that perform inferential analyses in multi-site distributed databases continues to evolve, allowing for timely and reproducible evaluations of particular exposure-outcome pairs. These programs are quality-checked and available to the public. They are also currently in routine in use as part of the FDA’s ARIA system. Future research should focus on the methodological challenges raised by these applications, as well as on developing new modular programs for targeted surveillance of medical products.
Key Points.
Semi-automated, pre-programmed tools for implementing propensity score-adjusted active comparator and self-controlled risk interval designs have been successfully applied to multiple prospective safety analyses in the Sentinel Distributed Database
Public availability of these tools enables rapid and consistent analyses across multiple data sites
Future research should focus on addressing challenges to implementing these designs in distributed, prospective databases like Sentinel
Acknowledgments
Mini-Sentinel is a pilot project sponsored by the U.S. Food and Drug Administration (FDA) to inform and facilitate development of a fully operational active surveillance system, the Sentinel System, for monitoring the safety of FDA-regulated medical products. Mini-Sentinel is one piece of the Sentinel Initiative, a multi-faceted effort by the FDA to develop a national electronic system that will complement existing methods of safety surveillance. Mini-Sentinel Collaborators include Data and Academic Partners that provide access to health care data and ongoing scientific, technical, methodological, and organizational expertise. The Mini-Sentinel Coordinating Center is funded by the FDA through the Department of Health and Human Services (HHS) Contract number HHSF223200910006I. This project was also supported in part by NIH U01EB023683.
Footnotes
Human and Animal Rights
All reported studies/experiments with human or animal subjects performed by the authors have been previously published and complied with all applicable ethical standards (including the Helsinki declaration and its amendments, institutional/national research committee standards, and international/national/institutional guidelines).
Conflict of Interest
John G. Connolly, Catherine A. Panozzo, Noelle Cocoros, Meijia Zhou, and Judith C. Maro each declare no potential conflicts of interest.
Sengwee Toh reports grants from U.S. Food and Drug Administration during the conduct of the study.
Shirley V. Wang reports grants from Sentinel Initiative, during the conduct of the study; personal fees from Aetion, Inc., grants from Novartis, grants from Agency for Healthcare Research and Quality outside the submitted work.
Joshua J. Gagne reports grants from US FDA, during the conduct of the study; grants from Novartis Pharmaceuticals Corporation, grants from Eli Lilly and Company, personal fees from Aetion, Inc, personal fees from Optum, Inc., outside the submitted work.
Candace C. Fuller reports grants from U.S. Food and Drug Administration during the conduct of the study.
References
Recently Published Papers of Particular Interest Have Been Highlighted as:
• Of importance
•• Of major importance
- 1.Behrman RE, Benner JS, Brown JS, McClellan M, Woodcock J, Platt R. Developing the Sentinel System--a national resource for evidence development. N Engl J Med. 2011 Feb 10;364(6):498–9. doi: 10.1056/NEJMp1014427. [DOI] [PubMed] [Google Scholar]
- 2.Platt R, Wilson M, Chan KA, Benner JS, Marchibroda J, McClellan M. The new Sentinel Network--improving the evidence of medical-product safety. N Engl J Med. 2009 Aug 13;361(7):645–7. doi: 10.1056/NEJMp0905338. [DOI] [PubMed] [Google Scholar]
- 3.Coloma PM, Trifirò G, Patadia V, Sturkenboom M. Postmarketing safety surveillance : where does signal detection using electronic healthcare records fit into the big picture? Drug Saf. 2013 Mar;36(3):183–97. doi: 10.1007/s40264-013-0018-x. [DOI] [PubMed] [Google Scholar]
- 4.Maro JC, Platt R, Holmes JH, Strom BL, Hennessy S, Lazarus R, et al. Design of a national distributed health data network. Ann Intern Med. 2009 Sep 1;151(5):341–4. doi: 10.7326/0003-4819-151-5-200909010-00139. [DOI] [PubMed] [Google Scholar]
- 5.Brown JS, Holmes JH, Shah K, Hall K, Lazarus R, Platt R. Distributed health data networks: a practical and preferred approach to multi-institutional evaluations of comparative effectiveness, safety, and quality of care. Med Care. 2010 Jun;48(6 Suppl):S45–51. doi: 10.1097/MLR.0b013e3181d9919f. [DOI] [PubMed] [Google Scholar]
- 6.Chen RT, Glasser JW, Rhodes PH, Davis RL, Barlow WE, Thompson RS, et al. Vaccine Safety Datalink project: a new tool for improving vaccine safety monitoring in the United States. The Vaccine Safety Datalink Team. Pediatrics. 1997 Jun;99(6):765–73. doi: 10.1542/peds.99.6.765. [DOI] [PubMed] [Google Scholar]
- 7.Fahey KR. The Pioneering Role of the Vaccine Safety Datalink Project (VSD) to Advance Collaborative Research and Distributed Data Networks. EGEMS Wash DC. 2015;3(1):1195. doi: 10.13063/2327-9214.1195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Baggs J, Gee J, Lewis E, Fowler G, Benson P, Lieu T, et al. The Vaccine Safety Datalink: a model for monitoring immunization safety. Pediatrics. 2011 May;127(Suppl 1):S45–53. doi: 10.1542/peds.2010-1722H. [DOI] [PubMed] [Google Scholar]
- 9.McNeil MM, Gee J, Weintraub ES, Belongia EA, Lee GM, Glanz JM, et al. The Vaccine Safety Datalink: successes and challenges monitoring vaccine safety. Vaccine. 2014 Sep 22;32(42):5390–8. doi: 10.1016/j.vaccine.2014.07.073. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10•.Ball R, Robb M, Anderson SA, Dal Pan G. The FDA’s sentinel initiative--A comprehensive approach to medical product surveillance. Clin Pharmacol Ther. 2016 Mar;99(3):265–8. doi: 10.1002/cpt.320. [DOI] [PubMed] [Google Scholar]
- 11.Platt R, Carnahan R. The U.S. Food and Drug Administration’s Mini-Sentinel Program. Pharmacoepidemiol Drug Saf. 2012 Jan 1;21:1–303. doi: 10.1002/pds.2343. [DOI] [PubMed] [Google Scholar]
- 12.Curtis LH, Weiner MG, Boudreau DM, Cooper WO, Daniel GW, Nair VP, et al. Design considerations, architecture, and use of the Mini-Sentinel distributed data system. Pharmacoepidemiol Drug Saf. 2012 Jan 1;21:23–31. doi: 10.1002/pds.2336. [DOI] [PubMed] [Google Scholar]
- 13.Platt R. FDA’s Mini-Sentinel Program to Evaluate the Safety of Marketed Medical Products: Progress and Direction. [cited 2017 Jun 15];Presentation to the Brookings Institution on January 31, 2013. [Internet] Available from: https://www.brookings.edu/wp-content/uploads/2013/01/Richard-Platt-Presentation.pdf.
- 14. [cited 2017 Jun 29];Snapshot of Database Statistics | Sentinel System [Internet] Available from: https://www.sentinelinitiative.org/sentinel/snapshot-database-statistics.
- 15.Toh S, Reichman ME, Houstoun M, Ross Southworth M, Ding X, Hernandez AF, et al. Comparative risk for angioedema associated with the use of drugs that target the renin-angiotensin-aldosterone system. Arch Intern Med. 2012 Nov 12;172(20):1582–9. doi: 10.1001/2013.jamainternmed.34. [DOI] [PubMed] [Google Scholar]
- 16.Toh S, Hampp C, Reichman ME, Graham DJ, Balakrishnan S, Pucino F, et al. Risk for Hospitalized Heart Failure Among New Users of Saxagliptin, Sitagliptin, and Other Antihyperglycemic Drugs: A Retrospective Cohort Study. Ann Intern Med. 2016 Jun 7;164(11):705–14. doi: 10.7326/M15-2568. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. [cited 2017 Jun 29];Implementation of a Randomized Controlled Trial to Improve Treatment with Oral Anticoagulants in Patients with Atrial Fibrillation (IMPACT-AFib) | Sentinel System [Internet] Available from: https://www.sentinelinitiative.org/content/implementation-randomized-controlled-trial-improve-treatment-oral-anticoagulants-patients.
- 18.Food and Drug Administration Amendments Act of 2007 [Internet]. Sect. 905, 110–85 Sep 27, 2007. Available from: https://www.gpo.gov/fdsys/pkg/PLAW-110publ85/html/PLAW-110publ85.htm.
- 19••.Gagne JJ, Wang SV, Rassen JA, Schneeweiss S. A modular, prospective, semi-automated drug safety monitoring system for use in a distributed data environment. Pharmacoepidemiol Drug Saf. 2014 Jun;23(6):619–27. doi: 10.1002/pds.3616. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Stürmer T, Wyss R, Glynn RJ, Brookhart MA. Propensity scores for confounder adjustment when assessing the effects of medical interventions using nonexperimental study designs. J Intern Med. 2014 Jun;275(6):570–80. doi: 10.1111/joim.12197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Glynn RJ, Schneeweiss S, Stürmer T. Indications for propensity scores and review of their use in pharmacoepidemiology. Basic Clin Pharmacol Toxicol. 2006 Mar;98(3):253–9. doi: 10.1111/j.1742-7843.2006.pto_293.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Glynn RJ, Gagne JJ, Schneeweiss S. Role of disease risk scores in comparative effectiveness research with emerging therapies. Pharmacoepidemiol Drug Saf. 2012 May;21(Suppl 2):138–47. doi: 10.1002/pds.3231. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Arbogast PG, Ray WA. Use of disease risk scores in pharmacoepidemiologic studies. Stat Methods Med Res. 2009 Feb;18(1):67–80. doi: 10.1177/0962280208092347. [DOI] [PubMed] [Google Scholar]
- 24.Wyss R, Glynn RJ, Gagne JJ. A Review of Disease Risk Scores and Their Application in Pharmacoepidemiology. Curr Epidemiol Rep. 2016 Dec 1;3(4):277–84. [Google Scholar]
- 25.Rosenbaum PR, Rubin DB. The Central Role of the Propensity Score in Observational Studies for Causal Effects. Biometrika. 1983;70(1):41–55. [Google Scholar]
- 26.Hansen BB. The prognostic analogue of the propensity score. Biometrika. 2008 Jun 1;95(2):481–8. [Google Scholar]
- 27.Toh S, Gagne JJ, Rassen JA, Fireman BH, Kulldorff M, Brown JS. Confounding adjustment in comparative effectiveness research conducted within distributed research networks. Med Care. 2013 Aug;51(8 Suppl 3):S4–10. doi: 10.1097/MLR.0b013e31829b1bb1. [DOI] [PubMed] [Google Scholar]
- 28.Toh S, Reichman ME, Houstoun M, Ding X, Fireman BH, Gravel E, et al. Multivariable confounding adjustment in distributed data networks without sharing of patient-level data. Pharmacoepidemiol Drug Saf. 2013 Nov 1;22(11):1171–7. doi: 10.1002/pds.3483. [DOI] [PubMed] [Google Scholar]
- 29.Kumamaru H, Gagne JJ, Glynn RJ, Setoguchi S, Schneeweiss S. Comparison of high-dimensional confounder summary scores in comparative studies of newly marketed medications. J Clin Epidemiol. 2016 Aug;76:200–8. doi: 10.1016/j.jclinepi.2016.02.011. [DOI] [PubMed] [Google Scholar]
- 30.Schneeweiss S, Rassen JA, Glynn RJ, Avorn J, Mogun H, Brookhart MA. High-dimensional propensity score adjustment in studies of treatment effects using health care claims data. Epidemiol Camb Mass. 2009 Jul;20(4):512–22. doi: 10.1097/EDE.0b013e3181a663cc. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Rassen JA, Glynn RJ, Brookhart MA, Schneeweiss S. Covariate selection in high-dimensional propensity score analyses of treatment effects in small samples. Am J Epidemiol. 2011 Jun 15;173(12):1404–13. doi: 10.1093/aje/kwr001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Rassen JA, Solomon DH, Curtis JR, Herrinton L, Schneeweiss S. Privacy-maintaining propensity score-based pooling of multiple databases applied to a study of biologics. Med Care. 2010 Jun;48(6 Suppl):S83–89. doi: 10.1097/MLR.0b013e3181d59541. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Toh S, Shetterly S, Powers JD, Arterburn D. Privacy-preserving analytic methods for multisite comparative effectiveness and patient-centered outcomes research. Med Care. 2014 Jul;52(7):664–8. doi: 10.1097/MLR.0000000000000147. [DOI] [PubMed] [Google Scholar]
- 34.Fireman B, Lee J, Lewis N, Bembom O, van der Laan M, Baxter R. Influenza vaccination and mortality: differentiating vaccine effects from bias. Am J Epidemiol. 2009 Sep 1;170(5):650–6. doi: 10.1093/aje/kwp173. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. [cited 2017 Jun 29];Mini-Sentinel Methods Propensity Score Matching Tool Enhancements Report.pdf [Internet] Available from: https://www.sentinelinitiative.org/sites/default/files/Methods/Mini-Sentinel_Methods_Propensity-Score-Matching-Tool-Enhancements-Report_0.pdf.
- 36.Wahl PM, Gagne JJ, Wasser TE, Eisenberg DF, Rodgers JK, Daniel GW, et al. Early steps in the development of a claims-based targeted healthcare safety monitoring system and application to three empirical examples. Drug Saf. 2012 May 1;35(5):407–16. doi: 10.2165/11594770-000000000-00000. [DOI] [PubMed] [Google Scholar]
- 37.Kulldorff M, Davis RL, Kolczak M, Lewis E, Lieu T, Platt R. A Maximized Sequential Probability Ratio Test for Drug and Vaccine Safety Surveillance. Seq Anal. 2011 Jan 21;30(1):58–78. [Google Scholar]
- 38.Yih WK, Kulldorff M, Fireman BH, Shui IM, Lewis EM, Klein NP, et al. Active surveillance for adverse events: the experience of the Vaccine Safety Datalink project. Pediatrics. 2011 May;127(Suppl 1):S54–64. doi: 10.1542/peds.2010-1722I. [DOI] [PubMed] [Google Scholar]
- 39.Gagne JJ, Glynn RJ, Rassen JA, Walker AM, Daniel GW, Sridhar G, et al. Active safety monitoring of newly marketed medications in a distributed data network: application of a semi-automated monitoring system. Clin Pharmacol Ther. 2012 Jul;92(1):80–6. doi: 10.1038/clpt.2011.369. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40•.Gagne JJ, Rassen JA, Choudhry NK, Bohn RL, Patrick AR, Sridhar G, et al. Near-real-time monitoring of new drugs: an application comparing prasugrel versus clopidogrel. Drug Saf. 2014 Mar;37(3):151–61. doi: 10.1007/s40264-014-0136-0. [DOI] [PubMed] [Google Scholar]
- 41. [cited 2017 Jun 29];Level 2 Modular Program Queries | Sentinel System [Internet] Available from: https://www.sentinelinitiative.org/sentinel/routine-querying-tools/level-2-modular-program-queries.
- 42. [cited 2017 Jun 29];Mini-Sentinel PROMPT Users Guide [Internet] Available from: https://www.sentinelinitiative.org/sites/default/files/Methods/Mini-Sentinel_PROMPT_Users-Guide_0.pdf.
- 43. [cited 2017 Jun 16];Surveillance Tools | Sentinel System [Internet] Available from: https://www.sentinelinitiative.org/sentinel/surveillance-tools.
- 44•.Gagne JJ, Han X, Hennessy S, Leonard CE, Chrischilles EA, Carnahan RM, et al. Successful Comparison of US Food and Drug Administration Sentinel Analysis Tools to Traditional Approaches in Quantifying a Known Drug-Adverse Event Association. Clin Pharmacol Ther. 2016 Nov;100(5):558–64. doi: 10.1002/cpt.429. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Zhou M, Wang SV, Leonard CE, Gagne JJ, Fuller C, Hampp C, et al. Sentinel Modular Program for Propensity-Score Matched Cohort Analyses: Application to Glyburide, Glipizide, and Serious Hypoglycemia. Epidemiol Camb Mass. 2017 Jul 4; doi: 10.1097/EDE.0000000000000709. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46••. [cited 2017 Jun 16];Developments, Applications, And Methodological Challenges to The Use Of Propensity Score Matching Approaches In FDA’s Sentinel Program | Sentinel System [Internet] Available from: https://www.sentinelinitiative.org/sentinel/methods/developments-applications-and-methodological-challenges-use-propensity-score.
- 47.Li L, Kulldorff M, Russek-Cohen E, Kawai AT, Hua W. Quantifying the impact of time-varying baseline risk adjustment in the self-controlled risk interval design. Pharmacoepidemiol Drug Saf. 2015 Dec;24(12):1304–12. doi: 10.1002/pds.3885. [DOI] [PubMed] [Google Scholar]
- 48.Gault N, Castañeda-Sanabria J, De Rycke Y, Guillo S, Foulon S, Tubach F. Self-controlled designs in pharmacoepidemiology involving electronic healthcare databases: a systematic review. BMC Med Res Methodol. 2017 Feb 8;17(1):25. doi: 10.1186/s12874-016-0278-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Petersen I, Douglas I, Whitaker H. Self controlled case series methods: an alternative to standard epidemiological study designs. BMJ. 2016 Sep 12;354:i4515. doi: 10.1136/bmj.i4515. [DOI] [PubMed] [Google Scholar]
- 50.Hallas J, Pottegård A. Use of self-controlled designs in pharmacoepidemiology. J Intern Med. 2014 Jun;275(6):581–9. doi: 10.1111/joim.12186. [DOI] [PubMed] [Google Scholar]
- 51.Baker MA, Lieu TA, Li L, Hua W, Qiang Y, Kawai AT, et al. A vaccine study design selection framework for the postlicensure rapid immunization safety monitoring program. Am J Epidemiol. 2015 Apr 15;181(8):608–18. doi: 10.1093/aje/kwu322. [DOI] [PubMed] [Google Scholar]
- 52.Nguyen M, Ball R, Midthun K, Lieu TA. The Food and Drug Administration’s Post-Licensure Rapid Immunization Safety Monitoring program: strengthening the federal vaccine safety enterprise. Pharmacoepidemiol Drug Saf. 2012 Jan;21(Suppl 1):291–7. doi: 10.1002/pds.2323. [DOI] [PubMed] [Google Scholar]
- 53.Tse A, Tseng HF, Greene SK, Vellozzi C, Lee GM VSD Rapid Cycle Analysis Influenza Working Group. Signal identification and evaluation for risk of febrile seizures in children following trivalent inactivated influenza vaccine in the Vaccine Safety Datalink Project, 2010–2011. Vaccine. 2012 Mar 2;30(11):2024–31. doi: 10.1016/j.vaccine.2012.01.027. [DOI] [PubMed] [Google Scholar]
- 54.Yih WK, Lee GM, Lieu TA, Ball R, Kulldorff M, Rett M, et al. Surveillance for adverse events following receipt of pandemic 2009 H1N1 vaccine in the Post-Licensure Rapid Immunization Safety Monitoring (PRISM) System, 2009–2010. Am J Epidemiol. 2012 Jun 1;175(11):1120–8. doi: 10.1093/aje/kws197. [DOI] [PubMed] [Google Scholar]
- 55.Greene SK, Rett M, Weintraub ES, Li L, Yin R, Amato AA, et al. Risk of confirmed Guillain-Barre syndrome following receipt of monovalent inactivated influenza A (H1N1) and seasonal influenza vaccines in the Vaccine Safety Datalink Project, 2009–2010. Am J Epidemiol. 2012 Jun 1;175(11):1100–9. doi: 10.1093/aje/kws195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.McCarthy NL, Gee J, Lin ND, Thyagarajan V, Pan Y, Su S, et al. Evaluating the safety of influenza vaccine using a claims-based health system. Vaccine. 2013 Dec 5;31(50):5975–82. doi: 10.1016/j.vaccine.2013.10.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Kawai AT, Li L, Kulldorff M, Vellozzi C, Weintraub E, Baxter R, et al. Absence of associations between influenza vaccines and increased risks of seizures, Guillain-Barré syndrome, encephalitis, or anaphylaxis in the 2012–2013 season. Pharmacoepidemiol Drug Saf. 2014 May;23(5):548–53. doi: 10.1002/pds.3575. [DOI] [PubMed] [Google Scholar]
- 58.Kawai AT, Martin D, Kulldorff M, Li L, Cole DV, McMahill-Walraven CN, et al. Febrile Seizures After 2010–2011 Trivalent Inactivated Influenza Vaccine. Pediatrics. 2015 Oct;136(4):e848–855. doi: 10.1542/peds.2015-0635. [DOI] [PubMed] [Google Scholar]
- 59.Duffy J, Weintraub E, Hambidge SJ, Jackson LA, Kharbanda EO, Klein NP, et al. Febrile Seizure Risk After Vaccination in Children 6 to 23 Months. Pediatrics. 2016 Jul;138(1) doi: 10.1542/peds.2016-0320. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Li R, Stewart B, McNeil MM, Duffy J, Nelson J, Kawai AT, et al. Post licensure surveillance of influenza vaccines in the Vaccine Safety Datalink in the 2013–2014 and 2014–2015 seasons. Pharmacoepidemiol Drug Saf. 2016 Aug;25(8):928–34. doi: 10.1002/pds.3996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Nelson JC, Jackson ML, Weiss NS, Jackson LA. New strategies are needed to improve the accuracy of influenza vaccine effectiveness estimates among seniors. J Clin Epidemiol. 2009 Jul;62(7):687–94. doi: 10.1016/j.jclinepi.2008.06.014. [DOI] [PubMed] [Google Scholar]
- 62.Jackson LA, Nelson JC, Benson P, Neuzil KM, Reid RJ, Psaty BM, et al. Functional status is a confounder of the association of influenza vaccine and risk of all cause mortality in seniors. Int J Epidemiol. 2006 Apr;35(2):345–52. doi: 10.1093/ije/dyi275. [DOI] [PubMed] [Google Scholar]
- 63•.Yih WK, Lieu TA, Kulldorff M, Martin D, McMahill-Walraven CN, Platt R, et al. Intussusception risk after rotavirus vaccination in U.S. infants. N Engl J Med. 2014 Feb 6;370(6):503–12. doi: 10.1056/NEJMoa1303164. [DOI] [PubMed] [Google Scholar]
- 64.Yih WK, Greene SK, Zichittella L, Kulldorff M, Baker MA, de Jong JLO, et al. Evaluation of the risk of venous thromboembolism after quadrivalent human papillomavirus vaccination among US females. Vaccine. 2016 Jan 2;34(1):172–8. doi: 10.1016/j.vaccine.2015.09.087. [DOI] [PubMed] [Google Scholar]
- 65.Yih WK, Kulldorff M, Sandhu SK, Zichittella L, Maro JC, Cole DV, et al. Prospective influenza vaccine safety surveillance using fresh data in the Sentinel System. Pharmacoepidemiol Drug Saf. 2016 May;25(5):481–92. doi: 10.1002/pds.3908. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66. [cited 2017 Jun 29];Mini-Sentinel PROMPT Self Control Design Tool Technical Users Guide.pdf [Internet] Available from: https://www.sentinelinitiative.org/sites/default/files/Methods/Mini-Sentinel_PROMPT_Self-Control-Design-Tool_Technical-Users-Guide_0.pdf.
- 67. [cited 2017 Jun 15];Sequential Analysis of Gardasil 9 Safety Surveillance Plan | Sentinel System [Internet] Available from: https://www.sentinelinitiative.org/vaccines-blood-biologics/assessments/sequential-analysis-gardasil-9-safety-surveillance-plan.
- 68. [cited 2017 Jun 16];Ninth Annual Sentinel Initiative Public Workshop | Margolis Center for Health Policy. Slides 216–230. [Internet] Available from: https://healthpolicy.duke.edu/events/ninth-annual-sentinel-initiative-public-workshop.
- 69.Greene SK, Kulldorff M, Yin R, Yih WK, Lieu TA, Weintraub ES, et al. Near real-time vaccine safety surveillance with partially accrued data. Pharmacoepidemiol Drug Saf. 2011 Jun;20(6):583–90. doi: 10.1002/pds.2133. [DOI] [PubMed] [Google Scholar]
- 70.Wu Y, Jiang X, Kim J, Ohno-Machado L. Grid Binary LOgistic REgression (GLORE): building shared models without sharing data. J Am Med Inform Assoc JAMIA. 2012 Oct;19(5):758–64. doi: 10.1136/amiajnl-2012-000862. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Analysis of Integrated Data without Data Integration. CHANCE. 2004 Jun 1;17(3):26–9. [Google Scholar]