The COVID-19 pandemic has shed light on long-standing health inequities and systemic racism, with Black and Hispanic communities suffering disproportionately across the US. These disparities are seen across different conditions and glaucoma is no exception. In the United States, glaucoma disproportionately affects racial minorities. We know that race/ethnicity modulates risk for glaucoma significantly; Black individuals are four to five times more likely to have glaucoma than White individuals.1 Black patients with glaucoma are also more likely to develop vision impairment or blindness.2 Similarly, Hispanic individuals are two to three times more likely to have glaucoma than White individuals and are estimated to have a higher undiagnosed case rate compared to their White counterparts.3–5 As medicine becomes progressively predictive and personalized, these disparities in risk and severity of disease must be considered.
These statistics likely represent a complex interplay of socioeconomic, political (e.g., systemic racism) and biologic variables. This, together with the complex and clinically heterogenous nature of the disease, make glaucoma an ideal candidate for individualized and precision medicine approaches to better screen, diagnose, and monitor progression. The diagnosis and determination of glaucoma disease progression relies on data, in particular imaging methods such as fundus photographs and optical coherence tomography (OCT) scans. In recent years, machine learning, a highly effective tool for automated image processing, has been increasingly researched and applied extensively in glaucoma (Figure 1). Despite the known racial/ethnic disparities in glaucoma outcomes, imaging databases used to study and characterize glaucoma in machine learning approaches do not have appropriate representation of Black and Hispanic individuals who are most at risk.
Practicing equitable care in the age of precision medicine requires us to have a keen understanding of how each individual is affected by disease. While machine learning can be used to screen and diagnose glaucoma, we must include and accurately represent characteristics that modulate disease risk, such as race/ethnicity, in the data used to train models. However, this is not currently the standard practice. For example, large population-based studies with imaging data such as the UK Biobank are currently being used heavily in exploratory research. These studies primarily include data from a majority European or of European descent participants.6–9 Similarly, a review of publicly available datasets for ophthalmological imaging found that 75 of the 94 open access datasets are from patients in Asia, North America, and Europe, with little representation of patients from low- and middle-income countries.10 74% of these datasets did not report any demographic data, like age, sex, and race/ethnicity.10
To better understand how demographically representative the latest machine learning glaucoma research has been, we reviewed recent glaucoma and machine learning publications, using PubMed search terms including “glaucoma” and “machine learning” and also included publications selected based on expert knowledge. We included manuscripts that used machine learning on images (fundus photographs, OCT scans) to classify glaucoma or characterize glaucomatous structures and excluded studies with fewer than 500 participants or published before 2016. Of the latest literature reporting on glaucoma and machine learning, 59% did not report race and/or ethnicity (Figure 2). Of the 11 studies with patient data solely from North America, we found that only 6 reported race and/or ethnicity. Of these 6 studies, 4 used data from the Duke Glaucoma Registry and had between 17.7% - 43.2% “African-American” representation.11–14 The other 2 studies report 22% - 45.5% “Black” or “Black/African descent” representation in various samples of their data.15,16 Only one study reported inclusion of a Hispanic population; this study had only 1.5% Hispanic representation.11 The remaining 10 studies had patient data from Asia, Europe, or multiple continents; only 3 of those studies reported race and/or ethnicity (Figure 2).
While the field of glaucoma and machine learning is progressing, our brief review demonstrates that the imaging databases currently used to fuel its research are not representative of the population in the US and importantly the populations most at risk of disease. Most studies from the US do not even report race/ethnicity data; the few that do report such data and include proportionate representation of Black populations were largely from a single dataset. While our findings may not be surprising given the historically biased nature of large datasets in ophthalmology, they certainly are problematic.
Developing and training experimental machine learning models on a limited or unknown population may decrease the generalizability of these algorithms for glaucoma detection. Glaucoma has a faster onset, more severe presentation, and more rapid progression in Black populations and remains relatively understudied in Hispanic populations.17 There are also differences in optic disc size and shape, cup-to-disc ratios, vascular patterns, and retinal appearance amongst populations of different races and ethnicities, as well as disparities in socioeconomic status, access to healthcare, and exposures to systemic racism which may accelerate biological aging.18–23 Given these differences, we believe that training models on images in White populations may lead to more missed diagnoses and poorer care in Black and Hispanic populations.
Indeed, biased data has already been shown to limit predictive power of machine learning algorithms in broader populations. Prediction models for assessing risk of coronary heart disease based on the Framingham Heart Study, a study of White middle-class individuals, performed poorly in Japanese, Hispanic, and Native-American populations.24 Another study showed that commercial prediction algorithms trained in majority White populations incorrectly underestimated Black patients’ healthcare “risk” scores, in comparison to equally sick White patients.25 Similarly, a deep learning model trained on fundus photographs from primarily European-descent and Asian-descent populations performed poorly for detection of glaucoma in a primarily African-descent population.18 Conversely, when algorithms are trained on balanced and representative datasets they perform well. For example, an algorithm to diagnose common skin conditions that was trained on a dataset with appropriate representation of minority groups had similar performance across different races.26 Another study found that a deep learning model trained on knee X-rays with appropriate representation of Black patients was able to reduce previously unexplained racial disparities in pain scores seen in osteoarthritis.27
Current databases widely used in machine learning research for glaucoma are under representative of minority groups - those who are at the highest risk for severe glaucoma. It is clear that this approach can negatively impact the predictive power of precision medicine approaches for screening and diagnosis of glaucoma in these at-risk populations. This in turn will compound already poorer outcomes for Black and Hispanic communities across the US. As machine learning in glaucoma progresses, we call for an urgent commitment to increase the diversity of datasets and improve race/ethnicity reporting. This is a crucial step towards halting the propagation of inequities and ensuring just and quality care for all patients suffering from glaucoma.
Financial support:
none
Footnotes
Disclosures:
Conflicts of interest: no conflicting relationship exists for any author
References
- 1.Tielsch JM, Sommer A, Katz J, et al. Racial variations in the prevalence of primary open-angle glaucoma. The Baltimore Eye Survey. JAMA 1991;266:369–374. [PubMed] [Google Scholar]
- 2.Bramley T. Impact of Vision Loss on Costs and Outcomes in Medicare Beneficiaries With Glaucoma. Arch Ophthalmol 2008;126:849. [DOI] [PubMed] [Google Scholar]
- 3.Gupta P, Zhao D, Guallar E, et al. Prevalence of Glaucoma in the United States: The 2005–2008 National Health and Nutrition Examination Survey. Invest Ophthalmol Vis Sci 2016;57:2905–2913. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Varma R, Ying-Lai M, Francis BA, et al. Prevalence of open-angle glaucoma and ocular hypertension in Latinos: the Los Angeles Latino Eye Study. Ophthalmology 2004;111:1439–1448. [DOI] [PubMed] [Google Scholar]
- 5.Nathan N, Joos KM. Glaucoma Disparities in the Hispanic Population. Semin Ophthalmol 2016;31:394–399. [DOI] [PubMed] [Google Scholar]
- 6.Fry A, Littlejohns TJ, Sudlow C, et al. Comparison of Sociodemographic and Health-Related Characteristics of UK Biobank Participants With Those of the General Population. Am J Epidemiol 2017;186:1026–1034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Ikram MA, Brusselle GGO, Murad SD, et al. The Rotterdam Study: 2018 update on objectives, design and main results. Eur J Epidemiol 2017;32:807–850. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Loeffler M, Engel C, Ahnert P, et al. The LIFE-Adult-Study: objectives and design of a population-based cohort study with 10,000 deeply phenotyped adults in Germany. BMC Public Health 2015;15:691. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Panchapakesan J, Mitchell P, Tumuluri K, et al. Five year incidence of cataract surgery: the Blue Mountains Eye Study. Br J Ophthalmol 2003;87:168–172. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Khan SM, Liu X, Nath S, et al. A global review of publicly available datasets for ophthalmological imaging: barriers to access, usability, and generalisability. The Lancet Digital Health 2020:S2589750020302405. [DOI] [PubMed] [Google Scholar]
- 11.Mariottoni EB, Datta S, Dov D, et al. Artificial Intelligence Mapping of Structure to Function in Glaucoma. Transl Vis Sci Technol 2020;9:19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Medeiros FA, Jammal AA, Thompson AC. From Machine to Machine: An OCT-Trained Deep Learning Algorithm for Objective Quantification of Glaucomatous Damage in Fundus Photographs. Ophthalmology 2019;126:513–521. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Thompson AC, Jammal AA, Berchuck SI, et al. Assessment of a Segmentation-Free Deep Learning Algorithm for Diagnosing Glaucoma From Optical Coherence Tomography Scans. JAMA Ophthalmol 2020;138:333–339. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Mariottoni EB, Jammal AA, Urata CN, et al. Quantification of Retinal Nerve Fibre Layer Thickness on Optical Coherence Tomography with a Deep Learning Segmentation-Free Approach. Sci Rep 2020;10:402. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Maetschke S, Antony B, Ishikawa H, et al. A feature agnostic approach for glaucoma detection in OCT volumes. PLoS One 2019;14:e0219126. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Christopher M, Belghith A, Bowd C, et al. Performance of Deep Learning Architectures and Transfer Learning for Detecting Glaucomatous Optic Neuropathy in Fundus Photographs. Sci Rep 2018;8:16685. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Salowe R, Salinas J, Farbman NH, et al. Primary Open-Angle Glaucoma in Individuals of African Descent: A Review of Risk Factors. J Clin Exp Ophthalmol 2015;6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Christopher M, Nakahara K, Bowd C, et al. Effects of Study Population, Labeling and Training on Glaucoma Detection Using Deep Learning Algorithms. Transl Vis Sci Technol 2020;9:27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Lee RY, Kao AA, Kasuga T, et al. Ethnic variation in optic disc size by fundus photography. Curr Eye Res 2013;38:1142–1147. [DOI] [PubMed] [Google Scholar]
- 20.Girkin CA. Primary Open-Angle Glaucoma in African Americans: International Ophthalmology Clinics 2004;44:43–60. [DOI] [PubMed] [Google Scholar]
- 21.Orr P, Barrón Y, Schein OD, et al. Eye care utilization by older Americans: the SEE Project. Salisbury Eye Evaluation. Ophthalmology 1999;106:904–909. [DOI] [PubMed] [Google Scholar]
- 22.Chae DH, Wang Y, Martz CD, et al. Racial discrimination and telomere shortening among African Americans: The Coronary Artery Risk Development in Young Adults (CARDIA) Study. Health Psychol 2020;39:209–219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Wurster P, Harris A, Gonzalez AC, et al. Risk Factors for Open-angle Glaucoma in Persons of Latin American Descent. J Glaucoma 2020;29:217–225. [DOI] [PubMed] [Google Scholar]
- 24.D’Agostino RB, Grundy S, Sullivan LM, et al. Validation of the Framingham Coronary Heart Disease Prediction Scores: Results of a Multiple Ethnic Groups Investigation. JAMA 2001;286:180. [DOI] [PubMed] [Google Scholar]
- 25.Obermeyer Z, Powers B, Vogeli C, Mullainathan S. Dissecting racial bias in an algorithm used to manage the health of populations. Science 2019;366:447–453. [DOI] [PubMed] [Google Scholar]
- 26.Liu Y, Jain A, Eng C, et al. A deep learning system for differential diagnosis of skin diseases. Nat Med 2020;26:900–908. [DOI] [PubMed] [Google Scholar]
- 27.Pierson E, Cutler DM, Leskovec J, et al. An algorithmic approach to reducing unexplained pain disparities in underserved populations. Nat Med 2021;27:136–140. [DOI] [PubMed] [Google Scholar]