Skip to main content
. 2023 Jun 25;58(5):1045–1055. doi: 10.1111/1475-6773.14197

TABLE 2.

Validation of the Medicare EDB and RTI race codes compared to beneficiary self‐reported race and Hispanic ethnicity, among all survey respondents (N = 1,043,559).

Classification by EDB race code a Classification by RTI race code a
Sensitivity b Specificity c PPV d NPV e Sensitivity b Specificity c PPV d NPV e
White 97.6% 69.6% 89.8% 91.3% 96.6% 93.3% 97.5% 90.9%
Black 96.0% 99.1% 92.4% 99.5% 95.6% 99.4% 95.1% 99.5%
Hispanic 28.3% 99.8% 96.0% 90.9% 85.9% 98.8% 90.7% 98.1%
AANHPI 56.1% 99.8% 92.2% 98.3% 72.1% 99.7% 89.4% 98.9%
AIAN 8.7% 99.9% 74.3% 97.7% 8.7% 99.9% 75.1% 97.7%
Other 5.1% 98.1% 1.4% 99.5% 3.0% 99.2% 1.9% 99.5%

Note: Data Sources: Data linked between Medicare Master Beneficiary Summary File (MBSF), Medicare Advantage Health Outcomes Survey (HOS), and the Medicare Advantage Consumer Assessment of Healthcare Providers and Systems (CAHPS) from 2015 to 2017.

Abbreviations: AANHPI, non‐Hispanic Asian American or Native Hawaiian Pacific Islander; AIAN, non‐Hispanic American Indian Alaska Native; EDB, Enrollment Database; NPV, negative predictive value; PPV, positive predictive value; RTI, Research Triangle Institute; N, number of beneficiaries; White, non‐Hispanic White; Black, non‐Hispanic Black; Hispanic, Hispanic any race; Other, non‐Hispanic Other race.

a

We take a broad interpretation of the Medicare race codes and validate the predictive performance of the EDB and RTI race codes for each race/ethnicity category by measuring how accurately a beneficiary in our study sample is binarily classified as either in or out of that category. Because the current Medicare race/ethnicity codes do not allow for multiracial identity, this allows us to measure predictive performance for beneficiaries that self‐report more than one race category. See eTable 2 (Supplemental material) for the full confusion matrices used to generate these metrics and Table 3 for validation of the predictive performance of the EDB and RTI race codes among survey respondents self‐reporting only one race category.

b

Sensitivity is defined as true positives/(true positives + false negatives).

c

Specificity is defined as true negatives/(true negatives + false positives).

d

Positive predictive value is defined as true positives/(true positives + false positives).

e

Negative predictive value is defined as true negatives/(true negatives + false negatives).