Table 1. Summary of Data Sets Used for the Study.
EHR Site | Total Sample Size | % Male | Median Age (in decade) | Case Size Range | Genotyping Platform | Number of SNPs Pre-imputation | Number of SNPs Post-imputation | Number of SNPs after filtering | Number of Diagnosis Codes |
---|---|---|---|---|---|---|---|---|---|
Geisinger MyCode® | 3024 | 53.0 | 40 | Min = 11; Max = 1898; Median = 32 | Illumina Human OmniExpress | 729,078 | 38,054,243 | 95,448 | 477 |
Vanderbilt BioVU | 2899 | 45.4 | 60 | Min = 11; Max = 1056; Median = 31 | Illumina 660 | 558,980 | 38,041,351 | 87,690 | 380 |