Author manuscript; available in PMC: 2020 Jan 1.
Published in final edited form as: Pac Symp Biocomput. 2020;25:695–706.

Table 4.

Characteristics of the five claims datasets at Janssen Research and Development, Johnson & Johnson.

| Dataset | CCAE | JMDC | MDCD | MDCR | Optum |
| --- | --- | --- | --- | --- | --- |
| Number of subjects | 64,222 | 1,976 | 59,861 | 69,164 | 62,348 |
| Median age (years) | 43 | 42 | 35 | 71 | 47 |
| % Female | 69.21 | 36.69 | 73.82 | 68.08 | 69.68 |
| Number of outcomes: acute myocardial infarction (AMI) | 155 | 2 | 438 | 1,207 | 360 |
| % AMI | 0.24 | 0.10 | 0.73 | 1.75 | 0.58 |
| % Obesity | 7.15 | 0.71 | 16.54 | 6.71 | 9.62 |
| % Alcohol dependence | 7.15 | 1.01 | 16.54 | 6.71 | 9.62 |
| % Hypertensive disorder | 20.81 | 14.37 | 31.80 | 57.70 | 32.96 |
| % Major depressive disorder | 4.17 | 3.88 | 3.55 | 3.16 | 3.34 |
| % Type 2 diabetes mellitus | 7.49 | 2.83 | 14.63 | 21.83 | 12.71 |
| % Hyperlipidemia | 20.96 | 19.23 | 22.00 | 43.21 | 33.85 |
* The full names of the five claims datasets are CCAE (IBM MarketScan® Commercial), JMDC (Japanese Medical Data Center), MDCD (IBM MarketScan® Medicaid), MDCR (IBM MarketScan® Medicare), and Optum (Optum© De-Identified Clinformatics).
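
As a quick consistency check on the table, the "% of AMI" row can be recomputed from the AMI outcome counts and the number of subjects in each dataset. The sketch below is illustrative only: the numbers are transcribed from Table 4, and the dictionary and function names (`subjects`, `ami_cases`, `reported_pct`, `ami_percent`) are hypothetical names introduced here, not part of the original study's code.

```python
# Values transcribed from Table 4 (assumed to be transcribed correctly).
subjects = {"CCAE": 64222, "JMDC": 1976, "MDCD": 59861, "MDCR": 69164, "Optum": 62348}
ami_cases = {"CCAE": 155, "JMDC": 2, "MDCD": 438, "MDCR": 1207, "Optum": 360}
reported_pct = {"CCAE": 0.24, "JMDC": 0.10, "MDCD": 0.73, "MDCR": 1.75, "Optum": 0.58}


def ami_percent(dataset: str) -> float:
    """Return the AMI outcome rate as a percentage of subjects in the dataset."""
    return 100.0 * ami_cases[dataset] / subjects[dataset]


if __name__ == "__main__":
    for name in subjects:
        computed = ami_percent(name)
        # Print the recomputed rate next to the value reported in the table.
        print(f"{name}: computed {computed:.2f}%, reported {reported_pct[name]:.2f}%")
```

Under these assumptions, each recomputed rate should agree with the reported value to two decimal places, which suggests the percentages are taken over all subjects in each dataset.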