Skip to main content
Ophthalmology Science logoLink to Ophthalmology Science
editorial
. 2021 Jun 25;1(2):100036. doi: 10.1016/j.xops.2021.100036

Big Data and Artificial Intelligence in Ophthalmology: Where Are We Now?

Cecilia S Lee , James D Brandt, Aaron Y Lee
PMCID: PMC9560652  PMID: 36249294

The successful marriage of big data and artificial intelligence (AI) has been one of the greatest steps forward in medical research this century. The term big data, defined to encompass both large quantities and diverse types of data, was coined in the 1990s,1 and the term AI emerged even earlier: the first academic conference on AI occurred in 1956.2 However, these concepts did not become revolutionary drivers of medical research until recently, when all types of health care settings began passively collecting massive amounts of medical data. Electronic health records (EHRs) data and medical imaging data are major examples of these types of routinely collected big data. At the same time, multiomics have given rise to complex datasets with multidimensional data for each patient. These dramatic changes in data collection have enabled AI to progress rapidly toward solving challenging medical research questions.

Ophthalmology has been at the forefront of AI research, in particular machine learning and deep learning approaches, because of the ubiquitous availability of noninvasive, rapid, and relatively inexpensive ophthalmic imaging. These advances in ophthalmic imaging have provided the big data required for successful applications of computer vision advances.3,4 The Intelligent Research in Sight (IRIS®) Registry, the largest clinical database in the world led by the American Academy of Ophthalmology, is an additional example of why ophthalmology is uniquely positioned for big data questions.5,6 Both big data and AI research are data driven, rather than based on the traditional observation-to-hypothesis approach, allowing discoveries outside the boundaries of traditional research methodology. Examples include novel biomarkers in the midst of multiomics datasets,7 subtle associations that would not be detectable with smaller scale datasets,8,9 and identification of imaging features that were undetected with human eyes, such as the hyporeflective outer retinal band associated with delayed rod-mediated dark adaptation, an established marker of early age-related macular degeneration.10 This process drives the generation of new hypotheses and better insights into disease mechanisms.

Some of the many exciting study results include novel associations between progression of glaucoma and pulmonary function. This was an unexpected finding from multimodal machine learning models that combined retinal imaging, demographics, and systemic variables from the UK Biobank database, which achieved an area under the receiver operating characteristic curve of 0.97 in predicting progression of glaucoma.11 Additional examples include deep learning algorithms to detect drusen, neovascular age-related macular degeneration, and diabetic macular edema after training with 101 418 OCT images of the retina from 5761 patients,12 creating OCT angiography images from structural OCT images alone,13 forecasting future visual field deficits from a single baseline visual field,14 and fully automated, open-sourced quantification of oculoplastics measurements using routine facial photographs.15

Despite these numerous successes, several inherent and data-specific limitations must be kept in mind before interpreting study results. First, statistical significance becomes less meaningful when handling millions of data points because even minor differences will show statistical significance. To address this, clinical significance and biological plausibility should be emphasized when designing the study and analyzing their results. Second, a clear understanding of the data used to train AI algorithms is critical because the resulting algorithms may not be generalizable to the intended populations.16 Third, validation of both big data and AI study results in independent study cohorts are key.17

As this field of research continues to advance, other critical issues remain. Although incorporating data from digital imaging systems is one thing, incorporating data from clinical notes and manually entered simple data like visual acuity and IOP is another, especially because today’s clinical chart is filled with entry errors and imported text.18 How can data cleaning methods be applied effectively and accurately to large EHR datasets such as the IRIS® Registry? How do we assess the validity of the EHR data? What are scalable statistical methods for a dataset of similar size and extent as the IRIS® Registry? How do we define clinically significant differences, rather than simply identifying statistical significance, in the setting of large datasets? Despite the data-rich ophthalmic environment, we remain hampered by limited interoperability between diagnostic systems and EHRs.19 How do we incentivize manufacturers of new diagnostic methods and EHRs to share data reliably and seamlessly from the outset? While doing so, how do we protect the privacy of participants when we continue to add complex linkages between many dimensional datasets?20 What are the ethical implications of big data and AI research? How do we standardize the datasets and promote data sharing?21,22 Who is responsible for data breaches or failures of clinically deployed algorithms?

Ophthalmology Science is pleased to announce the Big Data & AI Special Issue with the hope of providing a home for studies in this growing field. In this special issue, we are particularly interested in research that highlights novel methodologies, such as nontraditional statistical approaches that can be applied to big data and machine learning or deep learning studies. Another goal is to initiate discussions of current challenges and potential strategies to overcome them. Some examples include standardization of the datasets, data sharing processes, data privacy, and obstacles to transferring research findings into clinical care. Understanding both the potential and the limitations of big data and AI approaches through an array of diverse studies and commentaries is the primary goal of this Ophthalmology Science special issue.

As vision science researchers and clinicians, we are uniquely positioned to take advantage of big data and AI research. As editors, we are grateful for this opportunity and look forward to engaging actively with the scientific community through this remarkable special issue.

Footnotes

Disclosure(s): All authors have completed and submitted the ICMJE disclosures form.

The author(s) have made the following disclosure(s): J.D.B.: Consultant – Adverum, Thea; Financial support – Santen; Lecturer – Genentech; Equity owner – Glaukos

A.Y.L.: Consultant – Genentech/Roche, Topcon, Verana Health, Johnson & Johnson, Gyroscope; Financial support – United States Food and Drug Administration, Lowry Medical Research Institute, Santen, Carl Zeiss Meditec, Microsoft, Novartis; Lecturer – Topcon

Supported by the National Eye Institute, National Institutes of Health, Bethesda, Maryland (grant nos.: K23EY029246 and R01AG060942; Latham Vision Research Innovation Award, Seattle, Washington (A.Y.L.); and Research to Prevent Blindness, Inc, New York, New York (unrestricted grant). The sponsors or funding organizations had no role in the design or conduct of this research. This article does not reflect the opinions of the Food and Drug Administration.

References

  • 1.Lohr S. The origins of “big data”: an etymological detective story. The New York Times. https://bits.blogs.nytimes.com/2013/02/01/the-origins-of-big-data-an-etymological-detective-story/. 01.02.13 Available at: Accessed 27.05.21.
  • 2.Buchanan B.G. A (very) brief history of artificial intelligence. AIMag. 2005;26(4) 53–53. [Google Scholar]
  • 3.Yanagihara R.T., Lee C.S., Ting D.S.W., Lee A.Y. Methodological challenges of deep learning in optical coherence tomography for retinal diseases: a review. Transl Vis Sci Technol. 2020;9(2):11. doi: 10.1167/tvst.9.2.11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Wang S.Y., Pershing S., Lee A.Y. AAO Taskforce on AI and AAO Medical Information Technology Committee. Big data requirements for artificial intelligence. Curr Opin Ophthalmol. 2020;31(5):318–323. doi: 10.1097/ICU.0000000000000676. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Chiang M.F., Sommer A., Rich W.L., et al. The 2016 American Academy of Ophthalmology IRIS® Registry (Intelligent Research in Sight) database: characteristics and methods. Ophthalmology. 2018;125(8):1143–1148. doi: 10.1016/j.ophtha.2017.12.001. [DOI] [PubMed] [Google Scholar]
  • 6.Parke D.W., II, Lum F., Rich W.L. The IRIS® Registry: Purpose and perspectives. Ophthalmologe. 2017;114(Suppl 1):1–6. doi: 10.1007/s00347-016-0265-1. [DOI] [PubMed] [Google Scholar]
  • 7.Zhang G., Xue Z., Yan C., et al. A novel biomarker identification approach for gastric cancer using gene expression and DNA methylation dataset. Front Genet. 2021;12:644378. doi: 10.3389/fgene.2021.644378. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Lee C.S., Owen J.P., Yanagihara R.T., et al. Smoking is associated with higher intraocular pressure regardless of glaucoma: a retrospective study of 12.5 million patients using the Intelligent Research in Sight (IRIS®) Registry. Ophthalmol Glaucoma. 2020;3(4):253–261. doi: 10.1016/j.ogla.2020.03.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Owen JP, Blazes M, Lacy M, et al. Refractive outcomes after immediate sequential vs delayed sequential bilateral cataract surgery: a cohort analysis. JAMA Ophthalmol. In press. [DOI] [PMC free article] [PubMed]
  • 10.Lee A.Y., Lee C.S., Blazes M.S., et al. Exploring a structural basis for delayed rod-mediated dark adaptation in age-related macular degeneration via deep learning. Transl Vis Sci Technol. 2020;9(2):62. doi: 10.1167/tvst.9.2.62. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Mehta P., Petersen C.A., Wen J.C., et al. Automated detection of glaucoma with interpretable machine learning using clinical data and multi-modal retinal. Am J Ophthalmol. 2021 May 2;S0002-9394(21) doi: 10.1016/j.ajo.2021.04.021. 00220-8. In press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Faes L., Wagner S.K., Fu D.J., et al. Automated deep learning design for medical image classification by health-care professionals with no coding experience: a feasibility study. Lancet Digit Health. 2019;1(5):e232–e242. doi: 10.1016/S2589-7500(19)30108-6. [DOI] [PubMed] [Google Scholar]
  • 13.Lee C.S., Tyring A.J., Wu Y., et al. Generating retinal flow maps from structural optical coherence tomography with artificial intelligence. Sci Rep. 2019;9(1):5694. doi: 10.1038/s41598-019-42042-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Wen J.C., Lee C.S., Keane P.A., et al. Forecasting future Humphrey visual fields using deep learning. PLoS One. 2019;14(4) doi: 10.1371/journal.pone.0214875. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Van Brummen A., Owen J.P., Spaide T., et al. PeriorbitAI: artificial intelligence automation of eyelid and periorbital measurements. Am J Ophthalmol. 2021 May 16;S0002-9394(21) doi: 10.1016/j.ajo.2021.05.007. 00281-6. In press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Cahan E.M., Hernandez-Boussard T., Thadaney-Israni S., Rubin D.L. Putting the data before the algorithm in big data addressing personalized healthcare. NPJ Digit Med. 2019;2:78. doi: 10.1038/s41746-019-0157-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Lee A.Y., Yanagihara R.T., Lee C.S., et al. Multicenter, head-to-head, real-world validation study of seven automated artificial intelligence diabetic retinopathy screening systems. Diabetes Care. 2021;44(5):1168–1175. doi: 10.2337/dc20-1877. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Henriksen B.S., Goldstein I.H., Rule A., et al. Electronic health records in ophthalmology: source and method of documentation. Am J Ophthalmol. 2020;211:191–199. doi: 10.1016/j.ajo.2019.11.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.McKay KM, Apostolopoulos N, Dahrouj M, et al. Assessing the uniformity of uveitis clinical concepts and associated ICD-10 codes across health care systems sharing the same electronic health records system. JAMA Ophthalmol. In press. [DOI] [PMC free article] [PubMed]
  • 20.Tom E., Keane P., Blazes M., et al. Protecting data privacy in the age of AI-enabled ophthalmology. Trans Vis Sci Tech. 2020;9(2):36. doi: 10.1167/tvst.9.2.36. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Lee A.Y., Campbell J.P., Hwang T.S., et al. Recommendations for standardization of images in ophthalmology. Ophthalmology. 2021;128(7):969–970. doi: 10.1016/j.ophtha.2021.03.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Lee C.S., Lee A.Y., Holland G.N., et al. Big data and uveitis. Ophthalmology. 2016;123(11):2273–2275. doi: 10.1016/j.ophtha.2016.08.037. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Ophthalmology Science are provided here courtesy of Elsevier

RESOURCES