Abstract
Background
To determine whether gender-based differences may be present in letters of recommendation written for ophthalmology residency applicants.
Methods
All applications submitted through SF Match to the UCLA Stein Eye Institute Residency Training Program from the 2017–2018 application cycle were analyzed using validated text analysis software (Linguistic Inquiry and Word Count (Austin, TX)). The main outcome measures were differences in language use in letters of recommendation by gender of applicant.
Results
Of 440 applicants, 254 (58%) were male and 186 (42%) were female. The two gender groups had similar United States Medical Licensing Exam (USMLE) Step 1 scores, undergraduate grade point averages (uGPA’s), proportions of underrepresented minority (URM) applicants and Gold Humanism Honor Society members, numbers of academic and service activities listed, and gender distributions of their letter writers (all P values > 0.05). However, letters written for male applicants were determined to use more “authentic” words than those written for female applicants (mean difference, 0.800; 95% CI, 0.001–1.590; P = 0.047). Letters written for male applicants also contained more “leisure” words (mean difference, 0.056; 95% CI, 0.008–0.104; P = 0.023) and fewer “feel” words (mean difference, 0.033; 95% CI, 0.001–0.065; P = 0.041) and “biological processes” words (mean difference, 0.157; 95% CI, 0.017–0.297; P = 0.028).
Conclusions
There were gender differences detected in recommendation letters in ophthalmology consistent with prior studies from other fields. Awareness of these differences may improve residency selection processes.
Keywords: Ophthalmology, Residency, Gender, Application, Bias
Background
In the past few years, ophthalmology residency selection has become a highly competitive process. The Ophthalmology Residency Match Summary Report 2017 revealed a mean United States Medical Licensing Examination (USMLE) Step 1 score of 243 (interquartile range, 236–252) for applicants who successfully matched into residency programs, compared to the United States and Canada mean (SD) of 229 (20). In 2017, 22% of ophthalmology residency applicants did not match into any program.
Competitive residency programs, including ophthalmology programs, rely heavily on cognitive measures in their selection processes to recruit top-tier medical school graduates [1]. However, cognitive metrics, including USMLE Step 1 scores, have been poor predictors of success in residency. In 2002, the Accreditation Council for Graduate Medical Education identified six Core Competencies that residents must master in their training. Imbedded in those competencies are many non-cognitive factors, such as interpersonal and communication skills, which can be evaluated through letters of recommendation.
Thought to provide insight into applicants’ non-cognitive traits, letters of recommendation written on behalf of applicants can grasp the attention of selection committees. An online survey completed by program directors, chairpersons, and members of resident selection committees from 65 United States ophthalmology residency programs showed that 83% of programs considered letters of recommendation to be among the most important component of applications [2]. It has also been demonstrated that letters of recommendation were the most important factor for screening and evaluating applicants for medical schools, graduate programs, psychology faculty positions, internships, and military training programs [3–7]. Selection committees rely on letters of recommendation to provide information on applicants’ motivations, qualifications, and past performances, and to reinforce the information provided by the applicants themselves [7–9].
Recent research has demonstrated gender biases in letters of recommendation in medical specialties, including diagnostic radiology, emergency medicine, urology, and general and transplant surgery [10–14]. These studies used text analysis software programs, such as Linguistic Inquiry and Word Count (Austin, TX), and determined that letter writers tended to use different words to describe male and female applicants. Gender biases may be especially relevant in the field of ophthalmology, as women accounted for only 44% of ophthalmology residents in 2014 and 23% of the total number of practicing ophthalmologists in 2015 [15]. The purpose of this study is to explore gender-based differences in recommendation letters written for ophthalmology residency applicants.
Methods
This study was exempt from review by the Institutional Review Board because only de-identified data were used for analysis. All 440 residency applications submitted through SF Match to the UCLA Stein Eye Institute Residency Training Program from the 2017–2018 application cycle were included in this study. Each application was reviewed to extract demographic information, namely gender and self-reported underrepresented minority (URM) status. The Public Services and Activities section of each application was reviewed to count the number of academic and service activities listed. Academic activities included tutoring, mentoring, and research-related activities. Service activities included volunteering in clinics and working on community projects. Academic achievement data collected included USMLE Step 1 scores and undergraduate grade point averages (uGPAs). Finally, Gold Humanism Honor Society member status in medical school was noted, and the gender of the letter writer for each letter of recommendation was documented.
Letters of recommendation were prepared for analysis by using Adobe Acrobat Pro™ (San Jose, CA) to remove all headings, greetings, and signatures. The latest (2015) version of Linguistic Inquiry and Word Count (LIWC) was used to analyze the main text of each letter. For each letter, LIWC2015 outputted a number for each word category, with larger values corresponding to increased use of words in that particular category.
LIWC has been validated and used in studies in other scientific fields to detect gender biases in letters of recommendation, as well as to predict numerous outcome measurements, including social judgments, psychological adjustments, personality changes, personalities, and health [10–14, 16–22]. LIWC2015 contains nearly 90 output variables, including 4 summary language variables (analytical thinking, clout, authenticity, and emotional tone), 41 categories related to psychological constructs (including affect, cognition, biological processes, and drives), and 6 categories related to personal concerns (including work, home, and leisure activities). LIWC2015 reads each word of the text being analyzed and increments the output variable(s) that contain(s) the word. For example, the word “cried” is included in 5 output variables: sadness, negative emotion, overall affect, verbs, and past variable. Nearly 6400 words and word stems are included in LIWC2015, and each word or word stem was placed under the appropriate output variable(s).
The results of the analyses performed by LIWC2015 were collected in a Microsoft Excel spreadsheet (Redmond, WA) and subsequently analyzed using Stata/MP 15 (College Station, TX). For each LIWC2015 output category, a two-sample, two-tailed t test was performed with respect to applicant gender under the settings of unpaired data, unequal variances, and Welch’s approximation.
Results
A total of 440 applicants (254 males and 186 females) and 1318 letters of recommendation were included in this study.
There were no differences between male and female applicants in USMLE Step 1 scores (mean difference, 1.82; 95% CI, − 4.46 to 0.830; P = 0.18), UGPAs (mean difference, 0.014; 95% CI, − 0.058 to 0.030; P = 0.54), proportion of URM applicants (mean difference, 0.053; 95% CI, − 0.008 to 0.114; P = 0.087), proportion of Gold Humanism Honor Society members (mean difference, 0.027; 95% CI, − 0.080 to 0.026; P = 0.32), number of academic activities (mean difference, 0.276; 95% CI, − 0.006 to 0.558; P = 0.055), number of service activities (mean difference, 0.373; 95% CI, − 0.162 to 0.908; P = 0.24), and gender distribution of their letter writers (mean difference, 0.024; 95% CI, − 0.079 to 0.031; P = 0.25). Table 1 summarizes these results. Due to lack of differences in application profiles between male and female applicants, subgroup analyses of above parameters were not pursued. Subgroup analysis of the letters with respect to URM status was not pursued because URM applicants had lower USMLE Step 1 scores (mean difference, 12.35; 95% CI, 7.06 to 17.65; P < 0.001) and UGPAs (mean difference, 0.153; 95% CI, 0.061 to 0.246; P = 0.002) compared to non-URM applicants, and these differences would be potential confounders.
Table 1.
Measure | Male Applicants, Mean (SD) | Female Applicants, Mean (SD) | Difference in Means (95% CI) | P Value |
---|---|---|---|---|
USMLE Step 1 score | 244.7 (13.61) | 242.89 (14.11) | 1.82 (−4.46 to 0.830) | 0.18 |
UGPA | 3.74 (0.23) | 3.76 (0.22) | 0.014 (−0.058 to 0.030) | 0.54 |
Proportion of URM applicants | 0.087 (0.282) | 0.140 (0.348) | 0.053 (−0.008 to 0.114) | 0.087 |
Proportion of Gold Humanism Honor Society members | 0.102 (0.304) | 0.075 (0.264) | 0.027 (−0.080 to 0.026) | 0.32 |
Number of academic activities | 1.24 (1.48) | 1.52 (1.49) | 0.276 (−0.006 to 0.558) | 0.055 |
Number of service activities | 5.20 (2.94) | 5.57 (2.74) | 0.373 (−0.162 to 0.908) | 0.24 |
Proportion of female letter writers | 0.725 (0.489) | 0.701 (0.514) | 0.024 (−0.079 to 0.031) | 0.25 |
Letters of recommendation written for male applicants were determined to be more “authentic” than those written for female applicants, with a mean difference of 0.800 (95% CI, 0.001–1.590; P = 0.047). Letters written for male applicants also contained more “leisure” words (such as “cook,” “chat,” and “movie”) than those written for female applicants, with a mean difference of 0.056 (95% CI, 0.008–0.104; P = 0.023). Finally, letters written for female applicants contained more “feel” words (such as “feels” and “touch”) and “biological processes” words (such as “eat” and “pain”) than those written for male applicants, with mean differences of 0.033 and 0.157, respectively (95% CIs, 0.001–0.065 and 0.017–0.297; P = 0.041 and 0.028, respectively). Table 2 summarizes the results of these four LIWC2015 output variables with gender-based differences.
Table 2.
LIWC Output Variable | Male Applicants, Mean (SD) | Female Applicants, Mean (SD) | Difference in Means (95% CI) | P Value |
---|---|---|---|---|
Authentic | 8.46 (7.53) | 7.66 (7.00) | 0.800 (0.001–1.590) | 0.047 |
Leisure | 0.48 (0.45) | 0.42 (0.43) | 0.056 (0.008–0.104) | 0.023 |
Feel | 0.28 (0.28) | 0.31 (0.30) | 0.033 (0.001–0.065) | 0.041 |
Biological Processes | 3.29 (1.21) | 3.45 (1.33) | 0.157 (0.017–0.297) | 0.028 |
Discussion
This study used validated text analysis software to analyze a large volume of recommendation letters written for ophthalmology residency applicants. We sought to determine if applicant gender influenced letter writers’ language use. Male and female applicants had no significant differences in USMLE Step 1 scores, uGPAs, proportions of URM applicants and Gold Humanism Honor Society members, numbers of academic and service activities listed, and gender distributions of their letter writers. These findings suggest that male and female applicants had achieved similar levels of success in both the academic and service settings, and applied for residency under similar circumstances. However, analysis of recommendation letters in our study revealed differences between letters written for male applicants and those written for female applicants in the following LIWC2015 output variables: authentic, leisure, feel, and biological processes.
In our study, letters of recommendation written for male ophthalmology residency applicants were determined, based on language style and use, to be more “authentic” than those written for female applicants. This result was also found in a recent study on gender biases in letters of recommendation written for urology residency applicants [10]. Creation of the “authentic” variable in LIWC2015 was inspired by literature on “reality monitoring” that suggested that stories based on real experiences are told and written in a qualitatively different manner from those that are falsified [23–25]. Specifically, true stories were more likely to be characterized by complexity and positivity, therefore implying a possible advantage for male applicants over female applicants.
Second, in our study, letters written for female applicants contained fewer “leisure” words than those written for male applicants, which may be a commentary on work ethic consistent with previous research suggesting that recommenders tend to emphasize women’s work ethic rather than ability or talent [26]. A past study on letters of recommendation written for successful applicants to faculty positions at a large medical school in the United States found that letters written for female applicants contained more grindstone adjectives, such as “committed” and “tireless,” than those written for male applicants. Furthermore, the study reported that compared to their male counterparts, female applicants were described with fewer adjectives that suggest ability and talent, such as “analytical” and “genius.” Another study on letters of recommendation written for applicants to chemistry and biochemistry faculty positions at a large research university in the United States found that letters of recommendation written for female applicants contained fewer standout adjectives, such as “magnificent,” “wonderful, and “superb,” than those written for male applicants [18].
Finally, letters written for female applicants contained more “feel” and “biological processes” words than those written for male applicants. Examples of “feel” words are “feels” and “touch,” and examples of “biological processes” words are “eat” and “pain.” It is uncertain what impact this may have on residency selection processes, but gender stereotypes that arose from traditional gender-based divisions of social roles may have contributed to these categorical word usage differences [27].
The main limitation to this study is that although LIWC2015 provided an objective measure of differential language use in the letters of recommendation, it cannot take context into account. Therefore, no conclusions other than differences in word selection can be made from the programmed analysis of the letters. Another limitation is that the letters were not interpreted by program directors, which may be more practical; however, that analysis would be non-objective and infeasible given the large number of ophthalmology residency applicants and letters. Future studies could examine the context in which the words were used and present results to program directors for interpretation on how the differences in word usage may impact how applicants are viewed and ranked by residency selection committees.
Conclusions
In conclusion, although similar levels of success were achieved in both academic and service settings by male and female ophthalmology residency applicants, their letters of recommendation differed in word usage. The differences were consistent with gender observations previously reported in medicine and other scientific fields. Awareness of areas of potential gender biases is an important first step towards better selection processes in which decisions are made based on applicants’ achievements and qualifications.
Acknowledgements
The authors would like to thank Debbie Sato, Stein Eye academic programs administrator, for her administrative contributions to this project.
Abbreviations
- LIWC
Linguistic Inquiry and Word Count
- uGPA
undergraduate grade point average
- URM
Underrepresented minority
- USMLE
United States Medical Licensing Exam
Authors’ contributions
SKO deidentified the data and formatted them for analysis. FL interpreted the data and was a major contributor in writing the manuscript. LKG, SLP, and JBR made substantial contributions to the design of the work and interpretation of data. IT oversaw the project and was a major contributor in writing the manuscript. All authors read and approved the final manuscript.
Funding
This work was supported by unrestricted funds from Research to Prevent Blindness to Stein Eye Institute, UCLA. They had no role in data collection, analysis, interpretation or in writing the manuscript.
Availability of data and materials
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
Ethics approval and consent to participate
This study was exempt from review by the Institutional Review Board because only de-identified data were used for analysis.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Footnotes
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Fei Lin, Email: flin@mednet.ucla.edu.
Soo Kyung Oh, Email: osk1403@ucla.edu.
Lynn K. Gordon, Email: gordon@jsei.ucla.edu
Stacy L. Pineles, Email: pineles@jsei.ucla.edu
Jamie B. Rosenberg, Email: jarosenb@montefiore.org
Irena Tsui, Email: itsui@jsei.ucla.edu.
References
- 1.Lee A, Golnik K, Oetting T, et al. Re-engineering the resident applicant selection process in ophthalmology: a literature review and recommendations for improvement. Surv Ophthalmol. 2008;53(2):164–176. doi: 10.1016/j.survophthal.2007.12.007. [DOI] [PubMed] [Google Scholar]
- 2.Johnson M, Elam C, Edwards J, et al. Medical school admission committee members’ evaluations of and impressions from recommendation letters. Acad Med. 1998;73(10 Suppl):S41–S43. doi: 10.1097/00001888-199810000-00040. [DOI] [PubMed] [Google Scholar]
- 3.Landrum R, Jeglum E, Cashin J. The decision-making process of graduate admissions committees in psychology. J Soc Behav Pers. 1994;9:239–248. [Google Scholar]
- 4.Sheehan EP, McDevitt TM, Ross HC. Looking for a job as a psychology professor? Factors affecting applicant success. Teach Psychol. 1998;25:8–11. doi: 10.1207/s15328023top2501_3. [DOI] [Google Scholar]
- 5.Lopez SJ, Oehlert ME, Moberly RL. Selection criteria for American Psychological Association-accredited internship programs: a survey of training directors. Prof Psychol Res Pr. 1996;27(5):518–520. doi: 10.1037/0735-7028.27.5.518. [DOI] [Google Scholar]
- 6.McCarthy JM, Goffin RD. Improving the validity of letters of recommendation: an investigation of three standardized reference forms. Mil Psychol. 2001;13:199–222. doi: 10.1207/S15327876MP1304_2. [DOI] [Google Scholar]
- 7.Tommasi GW, Williams KB, Nordstrom CR. Letters of recommendation: what information captures HR Professionals’ attention? J Bus Psychol. 1998;13(1):5–18. doi: 10.1023/A:1022962814637. [DOI] [Google Scholar]
- 8.Brem C, Lampman C, Johnson M. Preparation of applications for academic positions in psychology. Am Psychol. 1995;50:533–537. doi: 10.1037/0003-066X.50.7.533. [DOI] [Google Scholar]
- 9.Akos P, Kretchmar J. Gender and ethnic bias in letters of recommendation: considerations for school counselors. Prof Sch Couns. 2018;20(1):102–113. [Google Scholar]
- 10.Filippou P, Mahajan S, Deal A, et al. The Presence of Gender Bias in Letters of Recommendations Written for Urology Residency Applicants. Urology. 2019;134:56–61. doi: 10.1016/j.urology.2019.05.065. [DOI] [PubMed] [Google Scholar]
- 11.Grimm LJ, Redmond RA, Campbell JC, Rosette AS. Gender and Racial Bias in Radiology Residency Letters of Recommendation. J Am Coll Radiol. 2019. In press. [DOI] [PubMed]
- 12.Turrentine FE, Dreisbach CN, St Ivany AR, Hanks JB, Schroen AT. Influence of gender on surgical residency Applicants’ recommendation letters. J Am Coll Surg. 2019;228(4):356–365. doi: 10.1016/j.jamcollsurg.2018.12.020. [DOI] [PubMed] [Google Scholar]
- 13.Hoffman A, Grant W, McCormick M, Jezewski E, Matemavi P, Langnas A. Gendered differences in letters of recommendation for transplant surgery fellowship applicants. J Surg Educ. 2019;76(2):427–432. doi: 10.1016/j.jsurg.2018.08.021. [DOI] [PubMed] [Google Scholar]
- 14.Li S, Fant AL, McCarthy DM, Miller D, Craig J, Kontrick A. Gender differences in language of standardized letter of evaluation narratives for emergency medicine residency applicants. AEM Educ Train. 2017;1(4):334–339. doi: 10.1002/aet2.10057. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Xierali IM, Nivet MA, Wilson MR. Current and future status of diversity in ophthalmologist workforce. JAMA Ophthalmol. 2016;134(9):1016–1023. doi: 10.1001/jamaophthalmol.2016.2257. [DOI] [PubMed] [Google Scholar]
- 16.Berry DS, Pennebaker JW, Mueller JS, Hiller WS. Linguistic bases of social perception. Personal Soc Psychol Bull. 1997;23:526–537. doi: 10.1177/0146167297235008. [DOI] [Google Scholar]
- 17.Madera JM, Hebl MR, Martin RC. Gender and letters of recommendation for academia: Agentic and communal differences. J Appl Psychol. 2009;94(6):1591–1599. doi: 10.1037/a0016539. [DOI] [PubMed] [Google Scholar]
- 18.Schmader T, Whitehead J, Wysocki VH. A linguistic comparison of letters of recommendation for male and female chemistry and biochemistry job applicants. Sex Roles. 2007;57(7–8):509–514. doi: 10.1007/s11199-007-9291-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Rude S, Gortner E, Pennebaker J. Language use of depressed and depression-vulnerable college students. Cogn Emot. 2004;18(8):1121–1133. doi: 10.1080/02699930441000030. [DOI] [Google Scholar]
- 20.Pennebaker JW, Lay TC. Language use and personality during crises: analyses of mayor Rudolph Giuliani's press conferences. J Res Pers. 2002;36(3):271–282. doi: 10.1006/jrpe.2002.2349. [DOI] [Google Scholar]
- 21.Pennebaker JW, King LA. Linguistic styles: language use as an individual difference. J Pers Soc Psychol. 1999;77(6):1296–1312. doi: 10.1037/0022-3514.77.6.1296. [DOI] [PubMed] [Google Scholar]
- 22.Pennebaker JW, Mayne TJ, Francis ME. Linguistic predictors of adaptive bereavement. J Pers Soc Psychol. 1997;72(4):863–871. doi: 10.1037/0022-3514.72.4.863. [DOI] [PubMed] [Google Scholar]
- 23.Pennebaker JW, Boyd RL, Jordan K, Blackburn K. The development and psychometric properties of LIWC2015. Austin: University of Texas at Austin; 2015. [Google Scholar]
- 24.Johnson MK, Raye CL. Reality monitoring. Psychol Rev. 1981;88(1):67–85. doi: 10.1037/0033-295X.88.1.67. [DOI] [Google Scholar]
- 25.Vrij A, Edward K, Roberts KP, Bull R. Detecting deceit via analysis of verbal and nonverbal behavior. J Nonverbal Behav. 2000;24(4):239–263. doi: 10.1023/A:1006610329284. [DOI] [Google Scholar]
- 26.Trix F, Psenka C. Exploring the color of glass: letters of recommendation for female and male medical faculty. Discourse Soc. 2003;14:191–220. doi: 10.1177/0957926503014002277. [DOI] [Google Scholar]
- 27.Eagly AH, Wood W, Diekman AB. The Developmental Social Psychology of Gender. 1. Mahwah, NJ: Lawrence Erlbaum Associates; 2000. Social role theory of sex differences and similarities: A current appraisal; pp. 123–174. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.