Skip to main content
JAMA Network logoLink to JAMA Network
. 2023 Sep 21;6(9):e2330847. doi: 10.1001/jamanetworkopen.2023.30847

Intersectional Disparities in Emergency Medicine Residents’ Performance Assessments by Race, Ethnicity, and Sex

Elle Lett 1,2,3,, Nguyen Khai Tran 4, Nkemjika Nweke 5, Mytien Nguyen 6, Jung G Kim 7, Eric Holmboe 8, William McDade 8, Dowin Boatright 9
PMCID: PMC10514741  PMID: 37733347

Key Points

Question

Are their disparities by ethnoracial identity and sex in Accreditation Council for Graduate Medical Education Milestones ratings for emergency medicine residents?

Findings

This retrospective cohort analysis of 16 634 assessments of 2708 emergency medicine residents in 128 training programs found evidence of sex-specific ethnoracial disparities in ratings on the Milestones assessments. These disparities increased over time across multiple Milestones assessments and were most severe for female residents of ethnoracial groups that are underrepresented in medicine (URM).

Meaning

These findings suggest that intersectional disparities in Milestones assessment ratings of emergency medicine residents may be an intervenable barrier to equitable representation, particularly for female residents of ethnoracial groups that are URM, specifically in emergency medicine.

Abstract

Importance

Previous studies have demonstrated sex-specific disparities in performance assessments among emergency medicine (EM) residents. However, less work has focused on intersectional disparities by ethnoracial identity and sex in resident performance assessments.

Objective

To estimate intersectional sex-specific ethnoracial disparities in standardized EM resident assessments.

Design, Setting, and Participants

This retrospective cohort study used data from the Association of American Medical Colleges and the Accreditation Council for Graduate Medical Education Milestones (Milestones) assessments to evaluate ratings for EM residents at 128 EM training programs in the US. Statistical analyses were conducted in June 2020 to January 2023.

Exposure

Training and assessment environments in EM residency programs across comparison groups defined by ethnoracial identity (Asian, White, or groups underrepresented in medicine [URM], ie, African American/Black, American Indian/Alaska Native, Hispanic/Latine, and Native Hawaiian/Other Pacific Islander) and sex (female/male).

Main Outcomes and Measures

Mean Milestone scores (scale, 0-9) across 6 core competency domains: interpersonal and communications skills, medical knowledge, patient care, practice-based learning and improvement, professionalism, and system-based practice. Overall assessment scores were calculated as the mean of the 6 competency scores.

Results

The study sample comprised 128 ACGME-accredited programs and 16 634 assessments for 2708 EM residents of which 1913 (70.6%) were in 3-year and 795 (29.4%) in 4-year programs. Most of the residents were White (n = 2012; 74.3%), followed by Asian (n = 477; 17.6%), Hispanic or Latine (n = 213; 7.9%), African American or Black (n = 160; 5.9%), American Indian or Alaska Native (n = 24; 0.9%), and Native Hawaiian or Other Pacific Islander (n = 4; 0.1%). Approximately 14.3% (n = 386) and 34.6% (n = 936) were of URM groups and female, respectively. Compared with White male residents, URM female residents in 3-year programs were rated increasingly lower in the medical knowledge (URM female score, −0.47; 95% CI, −0.77 to −0.17), patient care (−0.18; 95% CI, −0.35 to −0.01), and practice-based learning and improvement (−0.37; 95% CI, −0.65 to −0.09) domains by postgraduate year 3 year-end assessment; URM female residents in 4-year programs were also rated lower in all 6 competencies over the assessment period.

Conclusions and Relevance

This retrospective cohort study found that URM female residents were consistently rated lower than White male residents on Milestone assessments, findings that may reflect intersectional discrimination in physician competency evaluation. Eliminating sex-specific ethnoracial disparities in resident assessments may contribute to equitable health care by removing barriers to retention and promotion of underrepresented and minoritized trainees and facilitating diversity and representation among the emergency physician workforce.


This retrospective cohort study evaluates Milestone assessments among emergency medicine residents in 128 US programs to identify sex-specific ethnoracial disparities.

Introduction

Health equity is a priority for many national governing health care bodies including the US Centers for Disease Control and Prevention,1 the American Medical Association,2 and the American College of Physicians.3 Yet achieving a diverse health care workforce that is representative of the populations served remains a critical challenge. The benefits of a representative physician workforce include improved care access for individuals from minoritized ethnoracial groups or economically disadvantaged populations,4,5,6 research innovation relevant to marginalized groups,7,8,9 and improved clinical learning environments.10 These benefits are substantial in the emergency department where complex sociostructural factors determine which populations and health conditions are disproportionately represented. Emergency medicine (EM) physicians provide safety-net care to patients who may be otherwise excluded or neglected by the health care system, including patients who are uninsured,11 experiencing homelessness,12 or have comorbid psychiatric illnesses with alcohol and substance use disorders.13 An empathic and diverse emergency physicianship representative of the demographic composition of the populations they serve is necessary to optimize care.

Disparities in performance assessments represent a manifestation of discrimination, including institutional racism, creating a barrier to advancement for minoritized trainees. In undergraduate medical education (UME), studies have demonstrated both racial and gendered disparities in Medical Student Performance Evaluations. Assessments of female medical students and medical students of ethnoracial groups that are underrepresented in medicine (URM) were less likely to include competency-based descriptions and were more likely to include descriptions of personality traits compared with assessment of male residents who were not URM.14 A similar study showed that evaluations for Black medical students were less likely to emphasize exceptional ability, and for female medical students, more likely to include descriptions of personality and emotional characteristics.15 These studies indicate that disparities in UME evaluations relegate the strengths of racially minoritized and female trainees to psychosocial and emotional traits while de-emphasizing technical skills and competencies in comparison with White and male peers. Racial disparities in UME performance evaluations extend beyond the narrative evaluations to application elements used by residency program directors to make admissions decisions,16 including clinical clerkship grades and standardized summative evaluations,17 and Alpha Omega Alpha honor society membership.18

Disparities in performance assessments persist into graduate medical education and have been demonstrated in standardized resident milestone assessments for internal19,20 and EM.21,22 Similar to the UME findings, these studies show that residents who are female19,21,22 or from a URM group20 are consistently rated as less skilled than their male and non-URM counterparts. However, previous studies have been limited because they addressed racial and gender disparities23,24 separately in assessments and in only a fraction of training programs.

Invoking intersectionality,25 we argue that concomitantly estimating these disparities’ simultaneous and interactive influences is critical. This study sought to extend the previous literature by performing a retrospective cohort study of sex-specific ethnoracial discrimination in assessments of EM residents using the Accreditation Council for Graduate Medical Education (ACGME) Milestones (hereafter, Milestones) data for academic year 2014 to 2015 through academic year 2017 to 2018.

Methods

Study Participants and Setting

This retrospective cohort study was an analysis of Milestones assessments for residents in ACGME-accredited EM training programs from academic year 2014 to 2015 through academic year 2017 to 2018. Ratings are based on standardized Milestones reported to the ACGME and linked to resident demographic data provided by the Association of American Medical Colleges (AAMC). These characteristics include self-reported ethnoracial identity, binary sex, and United States Medical Licensing Examination Step 2 Clinical Knowledge (USMLE Step 2 CK) scores.

This study followed the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) reporting guidelines for cohort studies. The study was reviewed and deemed exempt from obtaining informed consent by the Yale University institutional review because it used only deidentified data.

Exclusion Criteria and Analytic Cohort

We merged ACGME and AAMC data and excluded 308 residents whose records were not present in both sources. The initial data set comprised 24 718 assessments of 4283 residents at 209 EM programs in the US (eTable 1 and eFigure 1 in Supplement 1). We also excluded residents whose ethnoracial data were missing, either because the AAMC does not provide information for individuals who are not US citizens or permanent residents (n = 541) or for an unknown reason (n = 56). We excluded an additional 332 residents with unavailable USMLE Step 2 CK scores. Moreover, given that our study goal was to estimate sex-specific ethnoracial disparities, which requires a racially diverse pool of trainees, we also excluded programs that did not have at least 1 Asian and 1 URM trainee during the study period. By excluding residents from the programs that lacked diversity, we sought to avoid potentially biased results at the expense of a more highly powered analysis. This final exclusion criteria reduced the number of assessments included in the study from 20 343 to 16 634, residents from 3354 to 2708, and EM programs from 188 to 128 (eFigure 1 in Supplement 1).

ACGME Milestones Assessments

The Milestone assessments are “competency-based developmental outcomes (eg, knowledge, skills, attitudes, and performance) that can be demonstrated progressively by residents/fellows from the beginning of their education through graduation to the unsupervised practice of their specialties.”26 Emergency medicine residents are assessed twice during the academic year (midyear and year-end) across 23 subcompetencies grouped into 6 core ACGME competency domains: patient care, medical knowledge, systems-based practice, practice-based learning and improvement, professionalism, and interpersonal and communication skills (eTable 2 in Supplement 1). Each subcompetency was scored from 0 to 9 across 5 Milestones levels.

Conceptual Model

We drew on the theoretical framework of intersectionality for this study. Intersectionality was institutionalized in academia by the work of legal scholar, Kimberlé Crenshaw,27 and sociologist, Patricia Collins,28 among others. It has roots that date back as early as the social movements of the early 19th century.29,30,31 Intersectionality posits that sociostructural systems are interactive and that individuals with multiply marginalized identities are subject to their co-occurring and synergistic processes, manifesting ubiquitously, including in clinical learning environments. We were particularly interested in 2 of these systems—racism and sexism—exposure to which are imperfectly captured32 by ethnoracial and binary sex data provided by AAMC. This study quantifies sex-specific ethnoracial disparities in EM residency assessments which may be, in part, a manifestation of interpersonal, institutional, or structural racism and sexism.

Ethnoracial Groupings

We divided the EM residents into 3 ethnoracial groups: Asian, URM, and White. In the data, residents were able to self-identify with multiple ethnoracial groups. For those who selected more than 1 group, any URM identity was prioritized, defined per AAMC guidelines as African American or Black, Hispanic or Latine, American Indian or Alaska Native, and Native Hawaiian or Other Pacific Islander. If an individual selected Asian and White, they were included as Asian in our analyses. It is important to note that race and ethnicity are conceptualized as a proxy for exposure to structural, institutional, and interpersonal racism and ethnicism, rather than an essentialist biological or cultural variable that would drive differential resident performance. We used the term minoritized ethnoracial groups, rather than minority per Harper,33 noting that social identities and their corresponding statuses are context dependent and that individuals are minorities only within institutions structured to facilitate the overrepresentation and privilege of Whiteness. Similarly, underrepresentation in this study and in medicine broadly, is defined according to coarse racial taxonomies that do not capture within-group heterogeneity. In particular, the Asian group comprises many subpopulations that may be over- or underrepresented in medicine. We make note that although some of these groups may not be underrepresented, they are all minoritized in that they are harmed by manifestations of systemic racism in the US.

Statistical Analyses

We used hierarchical linear mixed-effects models to estimate sex-specific ethnoracial disparities in Milestone scores. The primary outcomes included the mean competency scores for the 6 competencies being assessed. Our secondary outcome was the mean assessment score for each Milestone assessment in its entirety. This choice in secondary outcome equally weighted each competency (ie, providing equal weight to patient care and medical knowledge). We fit separate models for 3-year and 4-year programs. Each model included fixed-effects for ethnoracial group (Asian, URM, White), binary sex (female/male), and time (half-years of training) as well as pairwise and tertiary interactions among those variables. The interaction terms estimated separate trajectories for each ethnoracial group−sex cross-strata and quantified the differences in those trends at each time point. We had a 3-level model with assessments (level 1) nested within EM residents (level 2) that are nested within programs (level 3). The final model had random intercepts for both residents and programs and a random slope for residents with respect to time. We also adjusted for USMLE Step 2 CK score as a measure of baseline medical knowledge prior to residency, and academic year for temporal trends in resident assessments given that each program length had 2 staggered cohorts (eTables 3 and 4 in Supplement 1).

We present linear contrasts and 95% CIs for each ethnoracial group and sex cross-strata relative to White male residents at the same time point per competency. The 95% CIs that do not include zero correspond to statistically significant results for a 2-sided hypothesis test with a type I error rate of α = .05. Because this study was the first, to our knowledge, to estimate intersectional disparities in resident milestone evaluations, it was exploratory in nature; we did not adjust for multiple comparisons.34 The reference group was purposely selected as comparing individuals from marginalized groups with the most privileged group that is least likely to be subject to ethnoracial- or sex-based discrimination. All analyses were conducted in R, version 4.2.1 (The R Foundation for Statistical Computing) and mixed-effects models were fit using the lme4 package.35 Data analyses were performed between June 2020 and January 2023.

Results

Of the 2708 EM residents in our analysis, 1913 (70.6%) were in 3-year programs and 795 (29.4%) in 4-year programs (Table). Most of EM residents in the sample were White (n = 2012; 74.3%), followed by Asian (n = 477; 17.6%), Hispanic or Latine (n = 213; 7.9%), African American or Black (n = 160; 5.9%), American Indian or Alaska Native (n = 24; 0.9%), and Native Hawaiian or Other Pacific Islander (n = 4; 0.1%). Approximately 14.3% (n = 386) and 34.6% (n = 936) of the sample were URM residents and female, respectively. The median USMLE Step 2 CK score was 243 (IQR, 232-253). We found that 60 programs (28.7% of all programs and 3.2% of programs with assessments that met our exclusion criteria) did not have Asian or URM residents in their training program (eFigure 1 in Supplement 1).

Table. Baseline Characteristics of Included Emergency Medicine Residents (n = 2708).

Characteristic Residents, No. (%)
Ethnoracial identitya
African American or Black 160 (5.9)
Asian 477 (17.6)
Hispanic or Latine 213 (7.9)
American Indian or Alaska Native 24 (0.9)
Native Hawaiian or Other Pacific Islander 4 (0.1)
White 2012 (74.3)
Ethnoracial group
Asian 465 (17.2)
URM 386 (14.3)
White 1857 (68.6)
Sex
Female 936 (34.6)
Male 1772 (65.4)
Intersectional group
Asian, female 158 (5.8)
Asian, male 307 (11.3)
URM, female 160 (5.9)
URM, male 226 (8.3)
White, female 618 (22.8)
White, male 1239 (45.8)
Program lengthb
3 y 1913 (70.6)
4 y 795 (29.4)
USMLE Step 2 CK score, median (IQR) 243 (232-253)

Abbreviations: URM, underrepresented in medicine; USMLE Step 2 CK, United States Medical Licensing Examination Step 2 Clinical Knowledge.

a

Residents were able to self-identify as more than 1 ethnoracial group.

b

Number of unique residents with at least 1 evaluation in a program of the corresponding duration.

Milestone Competency Scores

Residents in the 3-year programs were rated higher than those in 4-year programs at the mid-year assessment for postgraduate year 1 (PGY1), with similar rates of change in the mean scores over the assessment period (Figure 1; eFigure 2 in Supplement 1). However, we found that competency scores differed across ethnoracial groups, sex, and program length. In the 3-year programs, ethnoracial group−sex cross-strata had comparable ratings in all competency domains at the PGY1 midyear assessment (Figure 1; eTable 5 in Supplement 1). In the 4-year programs, URM female residents received the highest scores for all competencies except for medical knowledge in PGY1 midyear assessment (Figure 2; eTable 6 in Supplement 1). Disparities began to emerge in PGY2, with Asian and URM residents of both sexes receiving increasingly lower ratings than White male residents for all competencies at PGY2 year-end assessment. In the 4-year programs, only URM female residents received increasingly lower scores than White male residents in all competencies. The largest differences were observed in medical knowledge, particularly between URM male residents (mean [SD], 5.94 [1.45]) and White male residents (mean [SD], 6.66 [1.06]) in 3-year programs at PGY3 end-of-year, and between URM female residents (mean [SD], 3.72 [1.03]) and White male residents (mean [SD], 4.49 [1.10]) in 4-year programs at PGY2 year-end assessments.

Figure 1. Competency Score Trajectories by Ethnoracial Group and Sex Cross-Strata, 3-Year Emergency Medicine Residency Programs.

Figure 1.

Changes in the mean scores for 6 different Accreditation Council for Graduate Medical Education competencies among US emergency medicine residents in 3-y programs. Mean competency scores for each postgraduate year (PGY) time point were calculated as the mean of each subcompetency score for each corresponding competency for each racial/ethnic group and sex cross-strata. URM indicates underrepresented in medicine.

Figure 2. Competency Score Trajectories by Ethnoracial Group and Sex Cross-Strata, 4-Year Emergency Medicine Residency Programs.

Figure 2.

Changes in the mean scores for 6 different Accreditation Council for Graduate Medical Education competencies among US emergency medicine residents in 4-y programs. Mean competency scores for each postgraduate year (PGY) time point were calculated as the mean of each subcompetency score for each corresponding competency for each racial/ethnic group and sex cross-strata. URM indicates underrepresented in medicine.

In adjusted models, except for among White female residents, we observed worsening disparities in milestone scores for minoritized EM residents (Figure 3; eTables 7 and 8 in Supplement 1). Similar to the unadjusted estimates at PGY1 midyear, URM male and female residents in 3-year programs were initially rated comparably with White male residents; however, they were rated increasingly lower than White male residents in 3 of 6 competencies by PGY3 year-end. This included significantly lower scores in these domains: medical knowledge (URM male residents, −0.46 [95% CI, −0.72 to −0.20]; URM female residents, −0.47 [95% CI, −0.77 to −0.17]), patient care (URM male residents, −0.16 [95% CI, −0.31 to −0.01]; URM female residents, −0.18 [95% CI, −0.35 to −0.01]), and practice-based learning and improvement (URM male residents, −0.29 [95% CI, −0.53 to −0.05]; URM female residents, −0.37 [95% CI, −0.65 to −0.09]). Similar patterns were observed for URM female residents in 4-year programs across all competencies to a greater degree, but this was not observed for URM male residents. We also noted differential trends between 3- and 4-year programs for Asian EM residents. Asian male residents in 3-year programs received significantly lower scores in the interpersonal and communication skills domain than White male residents by PGY3 year-end (−0.17; 95% CI, −0.31 to −0.03), yet there were no differences over the assessment period in 4-year programs. Asian female residents in 3-year programs experienced increasing gaps in medical knowledge scores by PGY3 year-end (−0.46; 95% CI, −0.77 to −0.15), whereas in 4-year programs, the disparity in these scores narrowed for Asian female residents from PGY1 midyear (−0.29; 95% CI, −0.55 to −0.03) to PGY4 year-end (−0.14; 95% CI, −0.62 to 0.33).

Figure 3. Intersectional Disparities in Competency Scores by Ethnoracial Group and Sex Cross-Strata.

Figure 3.

A, Linear contrasts and corresponding 95% CIs comparing each ethnoracial group and sex cross-strata to White male residents in 3-y programs. B, 4-y programs for each postgraduate year (PGY) time point. Mixed-effects models included a random intercept for individual residents and programs as well as a random slope for time. The model also adjusted for United States Medical Licensing Examination Step 2 Clinical Knowledge Step 2 CK scores as fixed effects. URM indicates underrepresented in medicine.

Overall Assessment Scores

Consistent with competency domain scores, the mean overall Milestone scores displayed similar patterns at PGY1 midyear and throughout the assessment period (eFigure 3 in Supplement 1). We also observed comparable trends in the total scores across ethnoracial group−sex cross-strata. Among residents in 3-year programs, scores were similar between ethnoracial group−sex at PGY1 midyear, but in 4-year programs, only URM female residents (mean [SD], 2.29 [0.90]) received higher scores than White male residents (mean [SD], 2.01 [0.72]) (eFigure 4 and eTable 9 in Supplement 1). Moreover, all minoritized residents, except for White female residents, experienced progressively lower ratings by PGY2 midyear, regardless of whether they were in 3- or 4-year programs. After model adjustment, we observed no differences in the overall Milestone scores between minoritized and White male residents from PGY1 midyear to PGY2 midyear in both 3- and 4-year programs (Figure 4; eTable 10 in Supplement 1). However, by PGY3 year-end, Asian female residents (−0.22; 95% CI, −0.41 to −0.03), URM male residents (−0.21; 95% CI, −0.45 to −0.05), and URM female residents (−0.25; 95% CI, −0.43 to −0.07) in 3-year programs received lower ratings than White male residents. In 4-year programs, only URM female residents were rated increasingly lower than White male residents from PGY3 midyear (−0.25; 95% CI, −0.45 to −0.05) to PGY4 year-end (−0.39; 95% CI, −0.64 to −0.15). Similar trends were not observed for White female residents in either program length and to a limited extent for Asian male residents in 3-year programs and Asian female residents and URM male residents in 4-year programs.

Figure 4. Association of Intersectional Disparities on Assessment Score Trajectory.

Figure 4.

Linear contrasts and corresponding 95% CIs of each ethnoracial group and sex cross-strata compared with White male residents in 3-y and 4-y programs for each postgraduate year (PGY) time point. Mixed-effects models included a random intercept for individual residents and programs as well as a random slope for time. The model also adjusted for United States Medical Licensing Examination Step 2 Clinical Knowledge scores as fixed effects. URM indicates underrepresented in medicine.

Discussion

Our study quantified intersectional sex-specific ethnoracial disparities in Milestone assessments among EM residents. Compared with White male residents, URM female residents consistently received lower assessment scores with disparate rating gaps increasing over time. In 3-year programs, URM female residents were evaluated approximately 0.25 points lower than White male residents, which equated to approximately 2 months of additional training based on an average annual increase in milestone scores of 1.47 points. The disparities were even more pronounced for URM female residents in 4-year programs where the difference of 0.39 milestone points was equivalent to 3 additional months of training, considering an annual increase of 1.28 points. Additionally, we found that URM female residents in 4-year programs received increasingly lower assessment scores for all competency domains, which was only exhibited in 3 of 6 competencies in 3-year programs. The URM male residents had similarly disparate trends in assessments although the gap with respect to White male residents was blunted. In comparison, assessments for Asian male and female residents were significantly lower in only a subset of competencies (interpersonal communication skills and medical knowledge) at later assessment windows, and in specific, with program lengths. Notably, there were no statistically significant differences in assessments between White male and female residents in our analyses in either 3- or 4-year programs. Overall, disparities were more pronounced in 4-year programs in comparison with 3-year programs, which we attribute to the increased duration of exposure to factors such as bias and discrimination that may contribute to assessment score differences across groups.

Previous studies of disparities in resident assessments have focused on gendered19,21 and/or racial disparities in isolation,36 with more emphasis on the former. Our study improves on the prior literature by applying an intersectional framework to the investigation of EM resident assessment disparities. Under this framework, and in practice, the manifestations of ethnoracial and sex bias that may underlie these assessment disparities are inextricably linked; URM female residents are not subject to discrimination on the basis of their sex separate from their race or ethnicity. Through our analyses, we found that URM female residents compared with URM male and White female residents, experienced the most severe disparities; this is a novel finding that would have been obscured by a nonintersectional design. In a previous study of 8 EM residency programs, Dayal and colleagues21 found that male residents scored approximately 0.15 Milestone levels higher than female residents, which they equated to 3 to 4 months of training.21 However, this result was not replicated in a larger follow-up study,23 consistent with our intersectional analyses that did not show a statistically significant difference between White male and White female assessment scores. This suggests that the gender disparities observed by Dayal and colleagues21 may have been driven by gendered racial disparities, specific to female residents subject to the interaction of racism and sexism. This distinction highlights the importance of intersectional studies in accurately describing the effects of discrimination.

In comparison, studies on racial disparities within residency programs have focused on experiences of interpersonal racism rather than estimating inequities in standardized performance assessments.36,37,38 These studies demonstrated that URM trainees experience more discrimination than their peers, and that this is often on the basis of their race. However, these studies generally reported experiences across race and gender strata separately and did not specifically evaluate the experiences of those at the intersections (ie, URM female residents). To our knowledge, this is the first study to evaluate racial disparities in performance assessments, done so with the additional theoretical rigor of intersectionality. Beyond these conceptual improvements, our longitudinal study of EM residents in more than 120 programs provides stronger and more generalizable evidence of resident assessment disparities than previous studies with a cross-sectional design19 or with smaller sample sizes.21

Discrimination in the clinical learning environment affects retention and promotion of female, URM, and URM female physicians, who are critical for diversifying the health care workforce. Importantly, potential sex-specific ethnoracial discrimination in the form of assessment disparities, may also directly affect the psychosocial well being of minoritized trainees. Several studies have demonstrated an association between experiences of discrimination and mistreatment with attrition,39 burnout,40,41 and depression42 among medical students and physicians who are female and/or from minoritized ethnoracial backgrounds. These studies indicate that bias and discrimination may function as root causes43 leading to adverse health outcomes for URM, female, and URM female trainees. As such, eliminating the sex-specific ethnoracial in assessments as demonstrated in this study may contribute to maintaining the emotional and mental health of trainees and potentially improve patient health through a positive feedback mechanism that includes physician well-being, retention, and promotion of minoritized trainees, and more equitable health care provision.

Limitations

This study has limitations. Demographic data are an imperfect proxy of exposure to systemic discrimination,32 and intersectional sex-specific ethnoracial disparities in assessments only capture a subset of the total discrimination that URM, female, and URM female residents may experience during training. Additionally, this study only had access to binary sex data, which incompletely captures the full spectrum of gender identity and does not allow the disaggregation of transgender and nonbinary trainees. This limitation precludes the observation of any disparities in resident assessments among transgender and nonbinary EM residents, despite evidence that experiences of discrimination among this group are high.44

Notably, we excluded nearly one-third of EM programs because they did not include at least 1 URM and 1 Asian trainee during the study period (eFigure 1 in Supplement 1). This limitation reflects the absence of ethnoracial representation in EM45,46 and highlights the continued need for diversifying the health care workforce, consistent with the ACGME accreditation standards across all specialties that are URM.47 Additionally, our exclusion of international medical school graduates limits the generalizability of our results and precludes our ability to robustly assess the contribution of xenophobia to disparities in EM resident assessments.

Conclusions

This retrospective cohort study of ACGME Milestone assessments for 2708 EM residents in 128 programs found significant evidence of sex-specific racial and ethnic disparities in performance assessments throughout the training period. These disparities were most severe for URM female and Asian female residents, followed by URM male compared with White male residents. Eliminating sex-specific ethnoracial disparities in resident assessments may contribute to equitable health care by removing barriers to retention and promotion of underrepresented and minoritized trainees, thereby facilitating the diversity of the emergency physician workforce.

Achieving health equity for minoritized ethnoracial groups and other marginalized populations is an urgent public health priority. Escalating public attention on long-standing health inequities that disproportionately affect Black, Indigenous, Latine, and low-income communities, and the intersections of these in the US has amplified our collective commitment to improving representation in health care as a lever for catalyzing health justice.

Funding/Support: This study was supported through grants to Dr Boatright from the US National Institutes of Health (No. R21MD013481) and the Emergency Medicine Foundation.

Supplement 1.

eTable 1. Resident Characteristics (AAMC) and Evaluations (ACGME) Data Merge

eTable 2. ACGME Emergency Resident Milestones

eTable 3. Residents Per Academic Year, Per Training Year

eTable 4. Residents Per Evaluation Period

eTable 5. Mean Competency Scores Over the Evaluation Period for US Emergency Residents in Three-Year Programs by Each Ethnoracial Group and Sex Cross-Strata

eTable 6. Mean Competency Scores Over the Evaluation Period for US Emergency Residents in Four-Year Programs by Each Ethnoracial Group and Sex Cross-Strata

eTable 7. Adjusted Differences in Mean Competency Scores Over the Evaluation Period for US Emergency Residents in Three-Year Programs by Each Ethnoracial Group and Sex Cross-Strata

eTable 8. Adjusted Differences in Mean Competency Scores Over the Evaluation Period for US Emergency Residents in Four-Year Programs by Each Ethnoracial Group and Sex Cross-Strata

eTable 9. Mean Evaluation Scores for Three-Year and Four-Year Programs Over the Study Period by Each Ethnoracial Group and Sex Cross-Strata

eTable 10. Adjusted Differences in Mean Evaluation Scores for Three-Year and Four-Year Programs Over the Study Period by Each Ethnoracial Group and Sex Cross-Strata

eFigure 1. Analytic Cohort Development and Exclusion Criteria

eFigure 2. Mean Competency Scores for the Overall Sample

eFigure 3. Mean Assessment Score for the Overall Sample

eFigure 4. Assessment Score Trajectories by Ethnoracial Group and Sex Cross-Strata

Supplement 2.

Data Sharing Statement

References

  • 1.US Centers for Disease Control and Prevention . Health Equity. Published March 3, 2022. Accessed June 18, 2022. https://www.cdc.gov/chronicdisease/healthequity/index.htm
  • 2.Maybank A, De Maio F, Lemos D, Derige DN. Embedding racial justice and advancing health equity at the American Medical Association. Am J Med. 2022;135(7):803-805. doi: 10.1016/j.amjmed.2022.01.058 [DOI] [PubMed] [Google Scholar]
  • 3.American College of Physicians. Diversity, equity, and inclusion: Who We Are. Accessed July 16, 2022. https://www.acponline.org/about-acp/who-we-are/diversity-equity-and-inclusion
  • 4.Moy E, Bartman BA. Physician race and care of minority and medically indigent patients. JAMA. 1995;273(19):1515-1520. doi: 10.1001/jama.1995.03520430051038 [DOI] [PubMed] [Google Scholar]
  • 5.Komaromy M, Grumbach K, Drake M, et al. The role of black and Hispanic physicians in providing health care for underserved populations. N Engl J Med. 1996;334(20):1305-1310. doi: 10.1056/NEJM199605163342006 [DOI] [PubMed] [Google Scholar]
  • 6.Marrast LM, Zallman L, Woolhandler S, Bor DH, McCormick D. Minority physicians’ role in the care of underserved patients: diversifying the physician workforce may be key in addressing health disparities. JAMA Intern Med. 2014;174(2):289-291. doi: 10.1001/jamainternmed.2013.12756 [DOI] [PubMed] [Google Scholar]
  • 7.Cohen JJ. The consequences of premature abandonment of affirmative action in medical school admissions. JAMA. 2003;289(9):1143-1149. doi: 10.1001/jama.289.9.1143 [DOI] [PubMed] [Google Scholar]
  • 8.Hofstra B, Kulkarni VV, Munoz-Najar Galvez S, He B, Jurafsky D, McFarland DA. The diversity-innovation paradox in science. Proc Natl Acad Sci U S A. 2020;117(17):9284-9291. doi: 10.1073/pnas.1915378117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.King TE Jr, Dickinson TA, DuBose TD Jr, et al. The case for diversity in academic internal medicine. Am J Med. 2004;116(4):284-289. doi: 10.1016/j.amjmed.2003.12.015 [DOI] [PubMed] [Google Scholar]
  • 10.Whitla DK, Orfield G, Silen W, Teperow C, Howard C, Reede J. Educational benefits of diversity in medical school: a survey of students. Acad Med. 2003;78(5):460-466. doi: 10.1097/00001888-200305000-00007 [DOI] [PubMed] [Google Scholar]
  • 11.Zhou RA, Baicker K, Taubman S, Finkelstein AN. The uninsured do not use the emergency department more—they use other care less. Health Aff (Millwood). 2017;36(12):2115-2122. doi: 10.1377/hlthaff.2017.0218 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Salhi BA, White MH, Pitts SR, Wright DW. Homelessness and emergency medicine: a review of the literature. Acad Emerg Med. 2018;25(5):577-593. doi: 10.1111/acem.13358 [DOI] [PubMed] [Google Scholar]
  • 13.Suen LW, Makam AN, Snyder HR, et al. National prevalence of alcohol and other substance use disorders among emergency department visits and hospitalizations: NHAMCS 2014-2018. J Gen Intern Med. 2022;37(10):2420-2428. doi: 10.1007/s11606-021-07069-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Rojek AE, Khanna R, Yim JWL, et al. Differences in narrative language in evaluations of medical students by gender and under-represented minority status. J Gen Intern Med. 2019;34(5):684-691. doi: 10.1007/s11606-019-04889-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Ross DA, Boatright D, Nunez-Smith M, Jordan A, Chekroud A, Moore EZ. Differences in words used to describe racial and gender groups in medical student performance evaluations. PLoS One. 2017;12(8):e0181659. doi: 10.1371/journal.pone.0181659 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Teherani A, Hauer KE, Fernandez A, King TEJ Jr, Lucey C. How small differences in assessed clinical performance amplify to large differences in grades and awards: a cascade with serious consequences for students underrepresented in medicine. Acad Med. 2018;93(9):1286-1292. doi: 10.1097/ACM.0000000000002323 [DOI] [PubMed] [Google Scholar]
  • 17.Low D, Pollack SW, Liao ZC, et al. Racial/ethnic disparities in clinical grading in medical school. Teach Learn Med. 2019;31(5):487-496. doi: 10.1080/10401334.2019.1597724 [DOI] [PubMed] [Google Scholar]
  • 18.Boatright D, Ross D, O’Connor P, Moore E, Nunez-Smith M. Racial disparities in medical student membership in the Alpha Omega Alpha Honor Society. JAMA Intern Med. 2017;177(5):659-665. doi: 10.1001/jamainternmed.2016.9623 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Klein R, Ufere NN, Rao SR, et al. ; Gender Equity in Medicine workgroup . Association of gender with learner assessment in graduate medical education. JAMA Netw Open. 2020;3(7):e2010888. doi: 10.1001/jamanetworkopen.2020.10888 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Boatright D, Anderson N, Kim JG, et al. Racial and ethnic differences in internal medicine residency assessments. JAMA Netw Open. 2022;5(12):e2247649. doi: 10.1001/jamanetworkopen.2022.47649 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Dayal A, O’Connor DM, Qadri U, Arora VM. Comparison of male vs female resident milestone evaluations by faculty during emergency medicine residency training. JAMA Intern Med. 2017;177(5):651-657. doi: 10.1001/jamainternmed.2016.9616 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.See A, Pallaci M, Aluisio AR, et al. Assessment of implicit gender bias during evaluation of procedural competency among emergency medicine residents. JAMA Netw Open. 2022;5(2):e2147351. doi: 10.1001/jamanetworkopen.2021.47351 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Santen SA, Yamazaki K, Holmboe ES, Yarris LM, Hamstra SJ. Comparison of male and female resident milestone assessments during emergency medicine residency training: a national study. Acad Med. 2020;95(2):263-268. doi: 10.1097/ACM.0000000000002988 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Hauer KE, Jurich D, Vandergrift J, et al. Gender differences in milestone ratings and medical knowledge examination scores among internal medicine residents. Acad Med. 2021;96(6):876-884. doi: 10.1097/ACM.0000000000004040 [DOI] [PubMed] [Google Scholar]
  • 25.Collins PH, Bilge S. Intersectionality. John Wiley & Sons; 2020. [Google Scholar]
  • 26.Accreditation Council for Graduate Medical Education (ACGME) . Frequently Asked Questions: Milestones. Published online September 2015. Accessed July 25, 2022. https://www.acgme.org/globalassets/milestonesfaq.pdf
  • 27.Crenshaw K. Demarginalizing the Intersection of Race and Sex: A Black Feminist Critique of Antidiscrimination Doctrine, Feminist Theory and Antiracist Politics. Univ Chic Leg Forum. 1989. Accessed July 26, 2023 https://chicagounbound.uchicago.edu/cgi/viewcontent.cgi?article=1052&context=uclf [Google Scholar]
  • 28.Collins PH. Black Feminist Thought: Knowledge, Consciousness, and the Politics of Empowerment. Routledge; 1990. [Google Scholar]
  • 29.Truth S. Ain’t I a Woman? Ohio Women’s Rights Convention, May 1851. Accessed July 26, 2023. https://thehermitage.com/wp-content/uploads/2016/02/Sojourner-Truth_Aint-I-a-Woman_1851.pdf
  • 30.The Combahee River Collective. The Combahee River Collective Statement, 1977. Accessed July 26, 2023. https://www.blackpast.org/african-american-history/combahee-river-collective-statement-1977/
  • 31.Hancock AM. Intersectionality: An Intellectual History. Oxford; 2016. doi: 10.1093/acprof:oso/9780199370368.001.0001 [DOI] [Google Scholar]
  • 32.Lett E, Asabor E, Beltrán S, Cannon AM, Arah OA. Conceptualizing, contextualizing, and operationalizing race in quantitative health sciences research. Ann Fam Med. 2022;20(2):157-163. doi: 10.1370/afm.2792 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Harper SR, Am I. My brother’s teacher? black undergraduates, racial socialization, and peer pedagogies in predominantly white postsecondary contexts. Rev Res Educ. 2013;37(1):183-211. doi: 10.3102/0091732X12471300 [DOI] [Google Scholar]
  • 34.Althouse AD. Adjust for multiple comparisons? it’s not that simple. Ann Thorac Surg. 2016;101(5):1644-1645. doi: 10.1016/j.athoracsur.2015.11.024 [DOI] [PubMed] [Google Scholar]
  • 35.Bates D, Mächler M, Bolker B, Walker S. Fitting linear mixed-effects models using lme4. J Stat Softw. 2015;67(1):1-48. doi: 10.18637/jss.v067.i01 [DOI] [Google Scholar]
  • 36.Yuce TK, Turner PL, Glass C, et al. National evaluation of racial/ethnic discrimination in US surgical residency programs. JAMA Surg. 2020;155(6):526-528. doi: 10.1001/jamasurg.2020.0260 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Hu YY, Ellis RJ, Hewitt DB, et al. Discrimination, abuse, harassment, and burnout in surgical residency training. N Engl J Med. 2019;381(18):1741-1752. doi: 10.1056/NEJMsa1903759 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Lall MD, Bilimoria KY, Lu DW, et al. Prevalence of discrimination, abuse, and harassment in emergency medicine residency training in the US. JAMA Netw Open. 2021;4(8):e2121706. doi: 10.1001/jamanetworkopen.2021.21706 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Nguyen M, Chaudhry SI, Desai MM, et al. Association of mistreatment and discrimination with medical school attrition. JAMA Pediatr. 2022;176(9):935-937. doi: 10.1001/jamapediatrics.2022.1637 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Dyrbye LN, West CP, Sinsky CA, et al. Physicians’ experiences with mistreatment and discrimination by patients, families, and visitors and association with burnout. JAMA Netw Open. 2022;5(5):e2213080. doi: 10.1001/jamanetworkopen.2022.13080 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Teshome BG, Desai MM, Gross CP, et al. Marginalized identities, mistreatment, discrimination, and burnout among US medical students: cross sectional survey and retrospective cohort study. BMJ. 2022;376:e065984. doi: 10.1136/bmj-2021-065984 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Anderson N, Lett E, Asabor EN, et al. The association of microaggressions with depressive symptoms and institutional satisfaction among a national cohort of medical students. J Gen Intern Med. 2022;37(2):298-307. doi: 10.1007/s11606-021-06786-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Phelan JC, Link BG. Is racism a fundamental cause of inequalities in health? Annu Rev Sociol. 2015;41(1):311-330. doi: 10.1146/annurev-soc-073014-112305 [DOI] [Google Scholar]
  • 44.Dimant OE, Cook TE, Greene RE, Radix AE. Experiences of transgender and gender nonbinary medical students and physicians. Transgend Health. 2019;4(1):209-216. doi: 10.1089/trgh.2019.0021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Lett E, Orji WU, Sebro R. Declining racial and ethnic representation in clinical academic medicine: a longitudinal study of 16 US medical specialties. PLoS One. 2018;13(11):e0207274. doi: 10.1371/journal.pone.0207274 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Boatright D, Branzetti J, Duong D, et al. Racial and ethnic diversity in academic emergency medicine: how far have we come? next steps for the future. AEM Educ Train. 2018;2(suppl 1):S31-S39. doi: 10.1002/aet2.10204 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Accreditation Council for Graduate Medical Education . Diversity, Equity, and Inclusion. Accessed January 4, 2023. https://www.acgme.org/initiatives/diversity-equity-and-inclusion/

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement 1.

eTable 1. Resident Characteristics (AAMC) and Evaluations (ACGME) Data Merge

eTable 2. ACGME Emergency Resident Milestones

eTable 3. Residents Per Academic Year, Per Training Year

eTable 4. Residents Per Evaluation Period

eTable 5. Mean Competency Scores Over the Evaluation Period for US Emergency Residents in Three-Year Programs by Each Ethnoracial Group and Sex Cross-Strata

eTable 6. Mean Competency Scores Over the Evaluation Period for US Emergency Residents in Four-Year Programs by Each Ethnoracial Group and Sex Cross-Strata

eTable 7. Adjusted Differences in Mean Competency Scores Over the Evaluation Period for US Emergency Residents in Three-Year Programs by Each Ethnoracial Group and Sex Cross-Strata

eTable 8. Adjusted Differences in Mean Competency Scores Over the Evaluation Period for US Emergency Residents in Four-Year Programs by Each Ethnoracial Group and Sex Cross-Strata

eTable 9. Mean Evaluation Scores for Three-Year and Four-Year Programs Over the Study Period by Each Ethnoracial Group and Sex Cross-Strata

eTable 10. Adjusted Differences in Mean Evaluation Scores for Three-Year and Four-Year Programs Over the Study Period by Each Ethnoracial Group and Sex Cross-Strata

eFigure 1. Analytic Cohort Development and Exclusion Criteria

eFigure 2. Mean Competency Scores for the Overall Sample

eFigure 3. Mean Assessment Score for the Overall Sample

eFigure 4. Assessment Score Trajectories by Ethnoracial Group and Sex Cross-Strata

Supplement 2.

Data Sharing Statement


Articles from JAMA Network Open are provided here courtesy of American Medical Association

RESOURCES