Skip to main content
. 2018 Dec 27;26(3):219–227. doi: 10.1093/jamia/ocy164

Table 2.

U.S. Census variables, sources, definitions, and transformations used for imputing missing stratification information

Stratification Variable U.S. Census Variable Source (table, variable names and or numbers) Description Variable Transformation
Race/Ethnicity Hispanic or Latino Origin by Race ACS 2008–2012 5-year summary file (B03002; 001-021) Number overall and of each race (White alone, Black or African American alone, American Indian / Alaska Native alone, Asian alone, Native Hawaiian / Other Pacific Islander, Some other race alone, Two or more races, White alone not Hispanic or Latino, Hispanic or Latino, Two races including some other race, two races excluding some other race / three or more races) by ethnicity (Hispanic, not Hispanic). Marginal distributions of race were defined as White (003, 013), Black or African American (004, 014), Asian (006, 016), American Indian/Alaska Native (005, 015), Native Hawaiian/Pacific Islander (007, 017), Other (008, 009, 018, 019). Marginal distributions of ethnicity were defined as: Not Hispanic/Latino (002), Hispanic/Latino (012)
Education Sex by educational attainment for the population 25 years and over ACS 2008-2012 5-year summary file (B15002; 001-035) Number of each educational attainment group (no schooling, nursery to fourth grade, 5th and 6th, 7th -8th, 9th, 10th, 11th, 12th with no diploma, HS grad/GED/Alternative, some college less than 1 year, some college one or more years and no degree, associate’s degree, bachelor’s degree, master’s degree, professional school degree, doctorate) by gender for those who are 25 or older. Marginal distributions of education were defined as: <12 (003–010, 020–027), 12 − <16 (011–014, 028–031), ≥ 16 (015–018, 032–035).
Rurality LSAD10 2010 Census urban area criteria 75=urbanized area (50 000 or more), 76=urban cluster (2500 to 50 000), missing=rural. 75 or 76 (suburban/urban), missing (rural)