Skip to main content
. Author manuscript; available in PMC: 2014 Sep 26.
Published in final edited form as: Cell. 2013 Sep 26;155(1):10.1016/j.cell.2013.08.030. doi: 10.1016/j.cell.2013.08.030

Table 1. The clinical record datasets utilized in this study.

This table provides a brief description, the ICD encoding type, and the size of each dataset. The MED dataset, highlighted in red, was used for comparison and was not included in the full meta-analysis.

Dataset Description Encoding Type Number of unique patients
CU Columbia University, 1985-2003, New York, NY ICD9 1,505,822
DK Denmark; database covering most of the country's population ICD10 6,214,312
NYPH New York Presbyterian Hospital and Columbia University; 2004-present, New York, NY ICD9 767,978
SU Stanford University, San Francisco, CA ICD9 806,369
TX University of Texas at Houston, Houston, TX ICD9 1,599,528
UC University of Chicago, Chicago, IL ICD9 146,989
USA MarketScan insurance claims dataset ICD9 99,143,849
MED Medicare database ICD9 13,039,018

Total: 123,223,865