Skip to main content
. 2018 Jul 2;11:295–304. doi: 10.2147/JMDH.S160029

Table 2.

Overview of the strengths and limitations of different types of real world data sources2,2325

Source Key characteristics Strengths Limitations
Supplements to registration RCTs • Additional data such as patient-reported outcomes, medical resource use, and costs gathered alongside standard, clinically focused registration RCTs
• May provide evidence on treatment patterns for common events
• Randomized design • Restricted patient population
• Carefully controlled clinical setting for data collection
• Protocol-driven resource use
• Lack of statistical power to detect events other than specified key end points
• Relatively short time frame
Practical/pragmatic clinical trials • Simple trials involving prospective, randomized study designs but with larger and more diverse patient populations than conventional RCTs
• Often focusing on obtaining policy-relevant outcomes data
• Broad patient population
• Randomized design
• Sufficient statistical power to establish significant differences in
key outcomes
• Resource use less likely to be protocol driven
• Increased cost of data collection due to larger number of patients and clinical settings involved
• Potential for reduced data quality (missing data, data entry/coding errors)
• Lack of standardization across settings
Patient/disease registries • Observational, prospective, cohort studies assessing real world safety and effectiveness, quality of care/provider performance, and cost-effectiveness
• Often conducted to collect postauthorization marketing safety data (to address specific safety concerns or to satisfy regulatory requirements)
• Larger and more diverse patient groups than RCTs
• Reflect real world outcomes, as well as treatment patterns and clinical decision making
• Longer time frame than RCTs
• Nonrandomized design
• Visit schedules not required/data not collected at fixed intervals
• Potential for reduced data quality (missing data, data entry/coding errors)
• Lack of standardization across settings
• Risk factors/outcomes may change during follow-up
• Statistical adjustments may be required to address confounding/imbalance
• Causality cannot be confirmed
Administrative data (claims databases) • Retrospective, longitudinal, and cross-sectional analyses of clinical and economic outcomes at patient level
• Claims data are collected primarily for reimbursement, but databases may also contain some clinical diagnosis/procedure information and details on related resource use and costs
• Large size of databases allows for identification of outcomes of patients with rare events
• Analyses can be performed at low cost and over a short time frame
• Nonrandomized design
• Potential for reduced data quality (missing data, data entry/coding errors)
• Limited comprehensive clinical data across health care settings
• Lack of distinction between costs and charges
Population health surveys • Designed to collect descriptions of health status and well-being, health care utilization, treatment patterns, and health care expenditures from patients, providers, or individuals in the general population • Provide unique contributions about generalizability of treatments and their impacts, and about use of and expenditures for health services
• Methodologically rigorous, relying on complex sample survey designs
• Lacking relevant data on specific products
• Data subject to issues of subjectivity and recall bias
EHRs/other technologies capturing real-time clinical treatment and outcomes • Used for medical chart reviews to produce specific information on the real world use of specific tests or medications for particular conditions • Important sources for RWD from a wide range of clinical settings throughout the world
• Expansion of electronic data capture is lowering the cost of the medical chart reviews
• May contain detailed, longitudinal information, including patient-level, disease-specific symptoms
• High-end statistical analysis tools required to transform the information for research purposes

Abbreviations: EHR, electronic health record; RCT, randomized controlled trial; RWD, real world data.