Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2021 Feb 10.
Published in final edited form as: Ophthalmol Retina. 2019 Apr;3(4):291–293. doi: 10.1016/j.oret.2018.12.008

Demystifying the Jargon: The Bridge between Ophthalmology and Artificial Intelligence

Aaron S Coyner 1, J Peter Campbell 2, Michael F Chiang 1,2
PMCID: PMC7874933  NIHMSID: NIHMS1666081  PMID: 31014678

Abstract

Publications related to artificial intelligence (AI) and machine learning have risen exponentially in the past 5 years in the medical literature, including a number of articles involving retinal disease.14 The mathematical theories beneath machine learning methods have been around for decades but in most cases were too computationally intense to implement by hand.5,6 Recent advances in computer central processing units and graphics processing units have enabled the application of these models to solve real-world problems.7,8 This has led to rapid advances in the fields of AI and machine learning, specifically deep learning, and to a growing number of medical and ophthalmic applications.13,9 As with the rapid evolution of any new technologies, there can be confusion about new terminologies, and AI is no exception. Though they are often used interchangeably, the terms “AI,” “machine learning,” “deep learning,” and “neural networks” are not synonymous and may be confusing for ophthalmologists to distinguish. Below, we attempt to define these terms in a manner accessible to both ophthalmologists and vision researchers (Figure 1).

Definitions

Artificial Intelligence

AI is a broad category for all of the methods described herein.4,8 In general, there are two main types of AI. General AI describes algorithms, which are typically implanted in machines, and have the ability to “replicate human thought, emotion, and reason (and remain, for now, in the realm of science fiction).”4 The focus of this article is narrow AI, which comprises algorithms “that perform specific tasks as well as, or better than, humans” and are not necessarily implanted in robotlike beings.4 Within AI, there are several broad approaches, a few of which are described below; each approach has advantages, disadvantages, and ideal applications, which are beyond the scope of this brief overview.

Machine Learning

Non-AI diagnostic scoring systems and risk models have existed for many years. Often based on empirical data, scales such as the Early Treatment for Diabetic Retinopathy Study scale assign weights to predetermined disease features that algebraically sum to a final score of some prognostic significance.10 Machine learning is a category within AI, of algorithms in which the computer (rather than the human) “learns” the complex relationships and patterns from data that are used to generate some “output,” which may be a diagnosis, severity score, or treatment recommendation.11 These algorithms include very simple to extremely complex models, such as linear and logistic regression, naive Bayes classifiers, support vector machines, decision trees and random forests, and deep convolutional neural networks.11 Though the relative weights of features are determined by the computer (rather than the human) in machine learning algorithms, they are defined (and, therefore, somewhat interpretable) and based only on features that are provided to the computer a priori.

Deep Learning

Deep learning is a specific category of machine learning in which features are not assumed or input by the programmer. Rather, deep learning algorithms use data sets to create their own abstract features, which are then used to perform classification or regression.1215 For example, Gulshan et al.2 demonstrated that a deep learning network could classify diabetic retinopathy, in agreement with the Early Treatment for Diabetic Retinopathy Study scale, using only retinal fundus images as input and the consensus diagnoses of multiple clinicians as the “class labels,” without being told that features such as microaneurysms, intraretinal hemorrhages, or neovascularization were important. Although these methods are powerful, a significant caveat is that their generated features remain opaque and uninterpretable. Thus, these methods are often referred to as “black boxes.” However, it has now become clear that, with enough labeled data, deep learning methods can approach or exceed human performance for a variety of tasks, including medical image assessment.13,9,1517

Artificial and Convolutional Neural Networks

Artificial neural networks (ANNs) are a specific type of machine learning algorithm that were modeled after biological neural networks to take advantage of Hebbian theory.11,12,18 Much like biological neural networks, ANNs contain many nodes (analogous to cell bodies) that communicate with other nerve cells via weights (analogous to axons and dendrites). This type of feed-forward network takes advantage of the “nerves that fire together, wire together” postulate, in which synapses between neurons are strengthened when their neurons have correlated outputs.18 A special case of the ANN is the convolutional neural network (CNN). In a simple ANN architecture, all nodes in a layer are connected to all other nodes in the next layer.11,13 For image recognition tasks, this is not ideal; instead, a CNN feeds patches of an image to specific nodes in the next layer of nodes, thereby preserving the spatial context from which a feature was extracted.14,15,19 In this case, filters are trained to extract specific features from images (e.g., vertical lines, triangles) and mark their location on a feature map.19 A deep CNN then uses the feature map as input for the next layer, which uses new filters to create a new feature map.12,14,15,19 As this process continues, extracted features become highly abstract but highly useful for prediction. For this reason, deep CNNs have become particularly common for image-based diagnostic systems in ophthalmology.1,2

Generative Adversarial Network

As opposed to discriminative algorithms (discussed above), generative algorithms predict–or generate–new data from class labels.20 A generative adversarial network is a learning procedure that pits a generative CNN against a discriminative CNN.20 For example, imagine we’d like to train a CNN to produce realistic retinal fundus images. To do so, we train a CNN to discriminate (recognize) retinas and another CNN to, initially, generate (create) random images. The discriminator alerts the generator as to how similar its images are to those of an actual retina. As training progresses, the generator learns key features of the retina and begins producing true-to-life retinal fundus images.

Black Box

Deep learning methods are often referred to as “black boxes” because an input goes into the box, and an output emerges, but it is not exactly clear what happens in between. In contrast, simple linear algorithms which, while not always as powerful as deep learning methods, are highly interpretable: one can examine the coefficients by which each feature is multiplied and discover important predictors for the task at hand. With deep learning, a complex series of matrix multiplication and abstract image filters makes the interpretability of this task significantly harder.21,22

Heatmap

Generating heatmaps, also known as “class activation maps,” is a method that attempts to address the “black box” dilemma.21,22 This simple technique highlights areas on images that contributed to their classification labels.22 In the case of a retinal image detector, one might expect that a heatmap would show the optic nerve, macula, and retinal vasculature (or some combination thereof) to be highlighted, assuming that these areas contain the most critical information within the image. Improved methods of explaining the inner workings of deep learning networks using heatmaps are an active area of research.

Conclusions

Ophthalmological diagnosis is heavily dependent on the interpretation of images, which is often subjective and qualitative. AI has potential to disrupt this paradigm by providing objective data to assist with diagnosis. As such, it will be important for clinicians to recognize the potential benefits of AI for improving the quality and delivery of ophthalmic care, as well as its limitations and possible pitfalls. Understanding the terminology in this article is an important first step.

Figure 1.

Figure 1.

A Venn diagram of artificial intelligence approaches.

Acknowledgments

Supported by grants T15LM007088, R01EY19474, P30EY010572, and K12EY027720 from the National Institutes of Health (Bethesda, Maryland) and by unrestricted departmental funding from Research to Prevent Blindness (New York, New York). No funding organizations had any role in the design or conduct of this research.

Footnotes

Financial Disclosures: The authors made the following disclosures: M.F.C.: Scientific advisory board — Clarity Medical Systems; Consultant — Novartis; Initial member — Inteleretina, LLC.

References

  • 1.Brown JM, Campbell JP, Beers A, et al. Automated diagnosis of plus disease in retinopathy of prematurity using deep convolutional neural networks. JAMA Ophthalmol. 2018;136: 803–810. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Gulshan V, Peng L, Coram M, et al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA. 2016;316: 2402–2410. [DOI] [PubMed] [Google Scholar]
  • 3.Rajpurkar P, Irvin J, Zhu K, et al. CheXNet: Radiologist-level pneumonia detection on chest x-rays with deep learning [published online ahead of print Dec 25] arXiv. 2017:1711.05225. [Google Scholar]
  • 4.Jones LD, Golan D, Hanna SA, Ramachandran M. Artificial intelligence, machine learning and the evolution of healthcare: A bright future or cause for concern? Bone Joint Res. 2018;7: 223–225. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Bayes M, Price M. An essay towards solving a problem in the doctrine of chances. By the late Rev. Mr. Bayes, F. R. S. communicated by Mr. Price, in a letter to John Canton, A.M.F. R.S. Philos Trans R Soc Lond. 1763;53:370–418. [Google Scholar]
  • 6.Markov AA. An example of statistical investigation of the text Eugene Onegin concerning the connection of samples in chains. Sci Context. 2007;19:591–600. [Google Scholar]
  • 7.Wang H, Peng H, Chang Y, Liang D. A survey of GPU-based acceleration techniques in MRI reconstructions. Quant Imaging Med Surg. 2018;8:196–208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Turing AM. Computing machinery and intelligence. Mind. 1950;49:433–460. [Google Scholar]
  • 9.Coyner AS, Swan R, Campbell J, et al. Automated fundus image quality assessment in retinopathy of prematurity using deep convolutional neural networks. Ophthalmology Retina. In Press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Relhan N, Flynn HW Jr. The Early Treatment Diabetic Retinopathy Study historical review and relevance to today’s management of diabetic macular edema. Curr Opin Ophthalmol. 2017;28:205–212. [DOI] [PubMed] [Google Scholar]
  • 11.Hastie T, Tibshirani R, Friedman JH. The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd ed, corrected 7th printing. New York, NY: Springer; 2009. [Google Scholar]
  • 12.LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015;521:436–444. [DOI] [PubMed] [Google Scholar]
  • 13.Hinton GE, Osindero S, Teh YW. A fast learning algorithm for deep belief nets. Neural Comput. 2006;18:1527–1554. [DOI] [PubMed] [Google Scholar]
  • 14.Hinton GE, Salakhutdinov RR. Reducing the dimensionality of data with neural networks. Science. 2006;313:504–507. [DOI] [PubMed] [Google Scholar]
  • 15.Krizhevsky A, Sutskever I, Hinton GE. ImageNet classification with deep convolutional neural networks. Adv Neural Inf Process Syst. 2012;25:1097–1105. [Google Scholar]
  • 16.De Fauw J, Ledsam JR, Romera-Paredes B, et al. Clinically applicable deep learning for diagnosis and referral in retinal disease. Nat Med. 2018;24:1342–1350. [DOI] [PubMed] [Google Scholar]
  • 17.Simonyan KZ, Andrew. Very deep convolutional networks for large-scale image recognition [published online ahead of print Apr 10] arXiv. 2015:1409.1556. [Google Scholar]
  • 18.Hebb DO. The Organization of Behavior: A Neuropsychological Theory [book review] Sci Ed. 1950;34:336–337. [Google Scholar]
  • 19.Zeiler MD, Fergus R. Visualizing and understanding convolutional. arXiv. 2013:1311.2901. In Press. [Google Scholar]
  • 20.Goodfellow IJ, Pouget-Abadie J, Mirza M, et al. Generative adversarial networks. arXiv. 2014:1406.2661. In Press. [Google Scholar]
  • 21.Zhang Q, Zhu S-C. Visual interpretability for deep learning: A survey. arXiv. 2018:1802.00614. In Press. [Google Scholar]
  • 22.Zhou B, Khosla A, Lapedriza A, Oliva A, Torralba A. Learning deep features for discriminative localization. arXiv. 2015:1512.04150. In Press. [Google Scholar]

RESOURCES