Recommendations for initial diabetic retinopathy screening of diabetic patients using large language model-based artificial intelligence in real-life case scenarios

Nikhil Gopalakrishnan; Aishwarya Joshi; Jay Chhablani; Naresh Kumar Yadav; Nikitha Gurram Reddy; Padmaja Kumari Rani; Ram Snehith Pulipaka; Rohit Shetty; Shivani Sinha; Vishma Prabhu; Ramesh Venkatesh

2024 Jan 24;10:11. doi: 10.1186/s40942-024-00533-9

Nikhil Gopalakrishnan ¹, Aishwarya Joshi ¹, Jay Chhablani ², Naresh Kumar Yadav ¹, Nikitha Gurram Reddy ³, Padmaja Kumari Rani ³, Ram Snehith Pulipaka ⁴, Rohit Shetty ⁵, Shivani Sinha ⁶, Vishma Prabhu ¹, Ramesh Venkatesh ¹,✉

PMCID: PMC10809735 PMID: 38268046

Abstract

Purpose

To study the role of artificial intelligence (AI) in identifying key risk factors for diabetic retinopathy (DR) screening and to develop recommendations for newly detected diabetes mellitus (DM) cases based on the opinions of clinicians and large language model (LLM)-based AI platforms.

Methods

Five clinicians and three AI applications were given 20 AI-generated hypothetical case scenarios to assess DR screening timing. We calculated inter-rater agreements between clinicians, AI-platforms, and the “majority clinician response” (defined as the maximum number of identical responses provided by the clinicians) and “majority AI-platform” (defined as the maximum number of identical responses among the 3 distinct AI). Scoring was used to identify risk factors of different severity. Three, two, and one points were given to risk factors requiring screening immediately, within a year, and within five years, respectively. After calculating a cumulative screening score, categories were assigned.

Results

Agreement among clinicians, among AI platforms, and between the “majority clinician response” and the “majority AI response” was fair in each case (κ value: 0.21–0.40). Uncontrolled DM and systemic co-morbidities required immediate screening, while family history of DM and a co-existing pregnancy required screening within a year. The absence of these risk factors required screening within 5 years of DM diagnosis. Screening scores in this study were between 0 and 10. Cases with screening scores of 0–2 needed screening within 5 years, 3–5 within 1 year, and 6–12 immediately.

Conclusion

Based on the findings of this study, AI could play a critical role in DR screening of newly diagnosed DM patients by developing a novel DR screening score. Future studies would be required to validate the DR screening score before it could be used as a reference in real-life clinical situations.

Clinical trial registration

Not applicable.

Supplementary Information

The online version contains supplementary material available at 10.1186/s40942-024-00533-9.

Keywords: New cases, Diabetes, Screening, Diabetic retinopathy, Artificial intelligence

Introduction

Diabetes mellitus (DM) is a worldwide epidemic that causes a variety of complications in the human body [1]. DM-related vascular complications usually develop after a few years, and many individuals, especially those from middle- and low-income countries, do not undergo annual checks for DM, leaving a large group undiagnosed [2]. Diabetic retinopathy (DR) is one of the many serious eye-related complications of DM [3, 4]. DM patients are screened for DR to identify and treat sight-threatening DR (proliferative DR and/or diabetic macular edema) and to recommend follow-up for those without DR or with non-proliferative DR without diabetic macular edema [5, 6]. Several population-based studies have found that the prevalence of DR varies widely around the world, with countries in the Middle East, North Africa, and the Western Pacific having the highest prevalence and countries in South and Central America having the lowest [7]. In comparison to Western countries, the prevalence of DR in India is low, with estimates ranging from 5 to 16%. The most recent publication from the SMART India Study group found a national prevalence of 12.5% for DR and 4.0% for sight-threatening DR [6, 8]. This is despite the fact that India has the world’s second-highest number of people with DM [9]. The main reason cited for this uneven distribution of DR cases worldwide is the different screening strategies followed by different countries [10]. Furthermore, the personnel conducting the DR screening, the DR classification used, and the presence of other systemic co-morbidities all have an impact on determining the exact prevalence of DR [11, 12].

Other than the retina specialists, the initial DR screening for newly detected cases of DM is usually carried out by other ophthalmologists, and non-ophthalmologists such as optometrists and diabetologists using dilated fundoscopy or teleophthalmology tools such as mydriatic and non-mydriatic fundus cameras [10]. There are differences in the initial timing for DR screening even among ophthalmologists. These distinctions are primarily due to the area (urban/rural) in which they practice, the type of institution to which they are affiliated, and the types of patients they screen [13]. Diabetes patients have a large urban-rural divide, which hinders disease understanding and prevents routine screening as per established protocols. A streamlined strategy for initial DR screening would assist medical screening staff and patients in determining when to be screened.

AI has been debated for its potential benefits and drawbacks in medicine, including ophthalmology. Several studies have used fundus photos and deep machine learning AI for DR screening [14–16]. However, there are concerns about data acquisition, bias in data, bias in identifying ground truth, difficulty comparing different algorithms, challenges in machine learning, its application in different groups of people, and human barriers to AI adoption in health care [17]. A large language model (LLM) or natural language processing algorithm is a form of generative AI that uses massive data sets to understand, summarize, generate, and predict new text-based content [18]. Many such open source LLM-based generative AI algorithms are currently freely and easily available, including OpenAI’s ChatGPT3.5v and ChatGPT4.0v, Google’s BARD, Microsoft’s Bing AI, and others [19]. Most researchers and clinicians believe that AIs based on LLM could help reduce physician burden if integrated into the electronic health record [20].

In countries with a large population and a low ophthalmologist-to-patient ratio, retina specialists screening all newly detected diabetic patients with dilated fundus examination would be demanding, not enhance the yield of DR cases, and reduce the ophthalmologist’s time for other patients [21]. We believe that AI can simplify screening recommendations for non-ophthalmologists like medical internists and DM specialists, as well as general ophthalmologists, to guide newly diagnosed DM patients to retina specialists for DR screening. We found no literature on LLM-based AI for DR screening in newly diagnosed DM.

Thus, the primary goal of this study was to investigate the role of AI in establishing a streamlined method for determining the appropriate timing of initial screening for DR with the help of ophthalmologists and various AI platforms.

Methods

This was a prospectively conducted questionnaire-based study. The study commenced by requesting ChatGPT 3.5v (OpenAI, San Francisco, CA, USA) to generate 20 hypothetical clinical case scenarios pertaining to DM and the necessity and timing for an initial dilated fundus examination conducted by an expert retina specialist. This was accomplished by utilizing various combinations and permutations of the specified keywords, including age, gender, duration, type and control of DM, obesity, kidney disease, blood pressure, cholesterol, tobacco use, pregnancy status, and family history of DM (Supplement 1).

The clinical case scenarios were subsequently distributed to a group of five retina specialists/clinicians, each with more than five years of clinical experience in the field of retina and DR screening. These retina specialists have professional experience in various organizational settings, including government hospitals, independent private practices, tertiary eye care hospitals serving both free and paying patients, and tertiary eye care corporate hospitals exclusively serving paying patients. The responses provided by the clinicians to the clinical case scenarios were collected using a 3-point multiple-choice format, where each clinician was required to select only one response that best addressed the appropriate timing for DR screening in each clinical case scenario. The three response options were whether the DR screening should be done immediately, within one year, or within five years. The ‘majority clinician response’ for each specific case scenario was determined by identifying the maximum number of identical responses provided by the clinicians.

Subsequently, the exact same set of clinical case scenarios with options was presented to three AI platforms: ChatGPT 3.5v, ChatGPT 4.0v, and Bing AI. ChatGPT 3.5v, ChatGPT 4.0v, and Bing AI were last trained in January 2022, April 2023, and sometime in 2021, respectively. The text was entered into each AI platform, with a specific request to provide the most appropriate single response for each clinical case scenario. The responses were generated using the same set of multiple-choice options that were presented to the clinicians. The query was formulated to imply that a clinician was asking about the most suitable time for a patient’s first dilated retinal examination, rather than a patient seeking advice from the AI on when to schedule a visit to a retina specialist for a dilated fundus examination. The AI did not receive any feedback after each case scenario, and the case descriptions were entered sequentially without initiating a new chat session. Figure 1 depicts the prompt used to generate an opinion as well as the AI response. The formal responses for each case scenario were documented based on the outcomes generated by the AI platforms. The ‘majority AI response’ for each specific case scenario was determined by identifying the highest count of identical responses among the three distinct AI platforms. For each individual clinical case scenario, the ‘majority clinician response’ was compared to the ‘majority AI response’ for agreement.

Fig. 1. Prompt applied to request an opinion from the ChatGPT3.5v AI platform, accompanied by the AI’s response to a specific scenario from the study

Based on the responses obtained from the clinicians and different AI platforms, a consensus was reached on the ‘most common’ response for each individual case scenario for determining the optimal timing for initial screening for diabetic patients with dilated fundus examination by a retina specialist. The determination of the ‘majority response’ involved identifying the response with the highest frequency among the clinicians and AI platforms. Specifically, a maximum of eight responses were considered, consisting of five from the clinicians and three from the various AI platforms.
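
The majority-response step described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name majority_response and the example responses are hypothetical and not taken from the study data, and the text does not say how ties (for example, a 2-2-1 split among five clinicians) were resolved, so the sketch simply returns every tied option rather than choosing one.

```python
from collections import Counter


def majority_response(responses):
    """Return (modal responses, count) for one case scenario.

    More than one modal response means a tie; how the study handled
    ties is not stated, so resolving them is left to the caller.
    """
    if not responses:
        raise ValueError("at least one response is required")
    counts = Counter(responses)
    top = max(counts.values())
    winners = sorted(r for r, c in counts.items() if c == top)
    return winners, top


# Hypothetical responses for a single case scenario (not study data).
clinicians = ["immediately", "immediately", "within 1 year",
              "immediately", "within 5 years"]
ai_platforms = ["immediately", "within 1 year", "within 1 year"]

clinician_majority = majority_response(clinicians)
ai_majority = majority_response(ai_platforms)
# The overall 'majority response' pools all eight responses.
overall_majority = majority_response(clinicians + ai_platforms)

print(clinician_majority, ai_majority, overall_majority)
```

In this made-up example the clinician and AI majorities differ, which is the kind of case counted as a disagreement when the two majority responses are compared.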

The next stage of this study involved the development of a scoring system for DR screening. This scoring system aimed to assist healthcare professionals in considering the diverse risk factors associated with the development of DR. The scoring system was based on the responses provided by the clinicians and the outputs generated by the different AI platforms in response to the clinical case scenarios. Six risk factors were identified from the clinical case scenarios used in the questionnaire that appeared to be relevant in determining the right timing for DR screening. These include: (1) the patient’s age; (2) the type of diabetes; (3) diabetes control; (4) the presence of concurrent systemic conditions such as obesity, high BMI, renal disease, hypertension, dyslipidaemia, and tobacco use; (5) familial predisposition to diabetes; and (6) pregnancy status. Points were assigned to each risk factor according to the urgency of DR screening it implied. Risk factors that require prompt screening were assigned three points each, risk factors that require screening within a year were assigned two points each, and risk factors that require screening within five years were assigned one point each. In the absence of a risk factor, a score of 0 was assigned to it. A cumulative DR screening score was then computed for each case, and the scores were classified into three categories according to the range of scores observed for each screening timing.

Considering the nature of the study, it was exempted from institutional review board approval.

Statistical analysis

The inter-rater reliability agreements between the different clinicians, between the different AI platforms, and between the ‘majority clinician response’ and the ‘majority AI response’ were calculated on DATAtab: Online Statistics Calculator (DATAtab e.U. Graz, Austria. URL https://datatab.net) using Fleiss Kappa and Cohen’s Kappa analysis. The Kappa result is interpreted as follows: κ values ≤ 0 indicate no agreement, 0.01–0.20 none to slight, 0.21–0.40 fair, 0.41–0.60 moderate, 0.61–0.80 substantial, and 0.81–1.00 almost perfect agreement [22].
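
For readers who want to reproduce this kind of analysis without the online calculator, the two statistics and the interpretation bands above can be sketched in plain Python. This is a minimal illustration using the standard textbook formulas, not the DATAtab implementation; the function names and the toy ratings are hypothetical, and both functions assume that not all ratings fall into a single category (otherwise the denominator is zero).

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labelling the same cases."""
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    # Chance agreement from each rater's marginal label frequencies.
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    return (observed - expected) / (1 - expected)


def fleiss_kappa(ratings):
    """Fleiss' kappa; ratings holds one list of labels per case,
    with the same number of raters for every case."""
    n_cases, n_raters = len(ratings), len(ratings[0])
    totals = Counter()
    agreement = 0.0
    for case in ratings:
        counts = Counter(case)
        totals.update(counts)
        agreement += (sum(c * c for c in counts.values()) - n_raters) / (
            n_raters * (n_raters - 1))
    observed = agreement / n_cases
    expected = sum((t / (n_cases * n_raters)) ** 2 for t in totals.values())
    return (observed - expected) / (1 - expected)


def interpret(kappa):
    """Map a kappa value to the agreement bands quoted in the text."""
    if kappa <= 0:
        return "no agreement"
    bands = ((0.20, "none to slight"), (0.40, "fair"),
             (0.60, "moderate"), (0.80, "substantial"))
    for upper, label in bands:
        if kappa <= upper:
            return label
    return "almost perfect"


# Toy ratings (I = immediately, Y = within 1 year, F = within 5 years).
a = ["I", "I", "Y", "F"]
b = ["I", "Y", "Y", "F"]
print(cohen_kappa(a, b), interpret(cohen_kappa(a, b)))
print(fleiss_kappa([["I", "I", "I"], ["Y", "Y", "Y"]]))
```

Keeping the interpretation bands in a separate function makes it easy to check any reported κ against the scale independently of how it was computed.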

Results

In the first phase of the study, the inter-rater reliability calculated by the Fleiss kappa test showed that there was a fair agreement between the 5 clinicians with κ = 0.25. The Fleiss Kappa showed that there was a fair agreement between ChatGPT 3.5v, ChatGPT 4.0v and Bing AI with κ = 0.29. There was complete agreement between the ‘majority clinician response’ and the ‘majority AI response’ in 45% (n = 9) of the real-life clinical case scenarios. The Cohen’s Kappa showed that there was a fair agreement between ‘majority clinician response’ and ‘majority AI response’ with κ = 0.32. The inter-rater reliability agreements (κ) between the individual AI platforms and the ‘majority clinician response’ for ChatGPT 3.5v, ChatGPT 4.0v, and Bing AI were 0.24, 0.37, and 0.25, respectively.

We noted six risk factors that appeared to be relevant in determining the right timing for DR screening based on clinicians’ and different AI platforms’ responses to a set of 20 hypothetical AI-generated real-life clinical case scenarios. Individuals with poorly controlled or uncontrolled diabetes, as well as those with systemic co-morbidities, required prompt screening and were thus assigned a score of 3 points for each risk factor. Individuals with a family history of diabetes and pregnant women with diabetes were required to be screened within a year and were given two points for each risk factor. Additionally, individuals without the aforementioned risk factors required screening within the first five years of DM diagnosis. The patient’s age and type of diabetes had little influence on the need for immediate or early screening strategies for DR. As a result, patients over the age of 45 or with type 2 diabetes were assigned a score of one for each criterion, whereas patients under the age of 45 or with type 1 diabetes were not assigned any points. The DR screening score ranged from 0 to 10 for each clinical case scenario in this study, and three categories were formed based on these DR screening scores: (a) scores between 0 and 2 for cases requiring screening within 5 years, (b) scores between 3 and 5 for cases requiring screening within 1 year, and (c) scores between 6 and 12 for cases requiring immediate screening (Table 1).

Table 1.

Evaluation of risk factors and calculation of diabetic retinopathy screening scores for individual clinical case scenarios

Case No.	Age	Type of DM	DM control	Systemic co-morbidities	Family History of DM	Pregnancy	Majority Response	DR Screening Score
1	46	2	Poor	Yes	No	No	Immediate	8
2	30	1	Good	No	Yes	No	Within 5 years	2
3	55	2	Poor	Yes	No	No	Immediate	8
4	28	2	Good	No	Yes	Yes	Within 1 year	5
5	50	2	Poor	Yes	Yes	No	Immediate	10
6	35	2	Good	No	No	No	Within 5 years	1
7	60	2	Poor	Yes	Yes	No	Immediate	10
8	40	2	Good	Yes	Yes	Yes	Immediate	8
9	48	2	Poor	Yes	No	No	Immediate	8
10	25	1	Good	No	No	No	Within 5 years	0
11	55	2	Poor	Yes	Yes	No	Immediate	10
12	33	2	Good	No	No	Yes	Within 1 year	3
13	52	2	Poor	Yes	No	No	Immediate	8
14	38	1	Good	No	Yes	No	Within 5 years	2
15	58	2	Poor	Yes	Yes	No	Immediate	10
16	29	1	Poor	No	Yes	Yes	Immediate	7
17	46	2	Poor	Yes	No	No	Immediate	8
18	22	1	Good	No	No	No	Within 5 years	0
19	56	2	Poor	Yes	Yes	No	Immediate	10
20	31	2	Good	No	No	Yes	Within 1 year	3

Abbreviations: DM– diabetes mellitus; DR– diabetic retinopathy
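
The scoring rule described in the Results can be written as a short Python sketch. The point values and category cut-offs follow the text and reproduce the scores in Table 1; the function and variable names are hypothetical, and the handling of a patient aged exactly 45 is an assumption, since the text only distinguishes patients over and under 45.

```python
def dr_screening_score(age, dm_type, poor_control, comorbidities,
                       family_history, pregnant):
    """Cumulative DR screening score (0-12) from the six risk factors."""
    score = 0
    if age > 45:          # assumption: a patient aged exactly 45 scores 0
        score += 1
    if dm_type == 2:      # type 2 DM scores 1, type 1 scores 0
        score += 1
    if poor_control:      # poorly controlled or uncontrolled DM
        score += 3
    if comorbidities:     # any systemic co-morbidity, counted once
        score += 3
    if family_history:    # family history of DM
        score += 2
    if pregnant:
        score += 2
    return score


def screening_category(score):
    """Map a cumulative score to the recommended screening timing."""
    if not 0 <= score <= 12:
        raise ValueError("score must be between 0 and 12")
    if score <= 2:
        return "Within 5 years"
    if score <= 5:
        return "Within 1 year"
    return "Immediate"


# A few rows of Table 1: (case no., age, type of DM, poor control,
# co-morbidities, family history, pregnancy, score listed in the table).
TABLE_1_SAMPLE = [
    (1, 46, 2, True, True, False, False, 8),
    (4, 28, 2, False, False, True, True, 5),
    (8, 40, 2, False, True, True, True, 8),
    (10, 25, 1, False, False, False, False, 0),
    (16, 29, 1, True, False, True, True, 7),
]

for case_no, *factors, table_score in TABLE_1_SAMPLE:
    score = dr_screening_score(*factors)
    print(case_no, score, screening_category(score), score == table_score)
```

With every risk factor present the rule yields 1 + 1 + 3 + 3 + 2 + 2 = 12, which is why the highest category extends to 12 even though the highest score observed among the 20 scenarios was 10.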

Discussion

With the support of AI and clinicians, this one-of-a-kind study identifies risk factors of varying significance that may be important and relevant in determining the timing of DR screening in a newly diagnosed case of DM. The study also includes a screening score that may help non-ophthalmologists and even ophthalmologists from other specialties decide when to refer patients for DR screening to a trained retina specialist.

The prevalence of DM and, consequently, DR, as well as the availability of medical personnel and retina imaging tools for screening, differ by geographic area [7]. A number of risk factors, which may act independently or interact with one another, influence the DR screening of newly diagnosed diabetic cases. The American Diabetes Association’s (ADA) recommendations for DR screening in newly diagnosed DM cases are the most widely accepted guidelines worldwide. According to the ADA, screening recommendations for DM patients are primarily based on two risk factors: type of DM and pregnancy status [5]. Community-based studies have identified over 12 risk factors that can hasten the development or progression of DR over time [23]. Therefore, the ADA guidelines appear to be overly simplified and inadequate. Also, it is not always possible for a retina specialist to inquire about and diagnose these risk factors using various laboratory tests in real-time situations. As a result, developing a strategy and scoring system based on a few key risk factors that is acceptable and routinely followed in clinical practice is becoming increasingly important. Individual national screening strategies have been developed to determine the timing of retina screening, the personnel who will conduct the screening, and the manner in which the screening must be performed based on disease prevalence and other risk factors [10]. These recommendations are intended to serve as a guide for ophthalmologists rather than for referring DM specialists. There is no uniform strategy for DR screening, even among retina specialists. Even in the current study, we found only a fair level of agreement among retina specialists (κ = 0.25). To address this, we used the ‘majority clinician response,’ i.e., the most frequent response was taken as the preferred timing for screening, establishing the most preferred practice pattern followed by clinicians.

Several latest-generation chatbots developed using LLM-based generative AI applications have demonstrated promising results in generalizing to previously unseen tasks, including medical question-answering requiring scientific expert knowledge [24–26]. To formulate an answer, an LLM must understand the medical context, recall and interpret relevant medical information, and produce a response in a text-based format. Although reported performance in ophthalmology has been mixed, LLMs appear to have potential for use in eye health care applications. LLM-based generative AI with ChatGPT and ChatGPT 4.0v has been used in retina for a variety of indications, including International Classification of Diseases (ICD) coding for various case encounters [27, 28]. AI’s current role in DR is limited to preventive care, i.e., screening [14, 17]. According to the ADA, AI can be used as an alternative to traditional screening methods in DR [29]. In practice, this means screening retinal images for the presence or absence of DR or sight-threatening DR [30]. However, AI should not be used in patients who have known DR, have received prior DR treatment, or have symptoms of vision impairment. Different chatbot applications respond to the same situations in different ways [31]. Even in the current study, the different AI platforms showed only fair agreement (κ = 0.29) on the same clinical case scenarios. To address this issue, the ‘majority AI response’ was selected as the most preferred time for DR screening based on AI. To improve both the precision and speed of responses, the AI platform must receive the most up-to-date information and have real-time access to the internet. In this study, we observed that ChatGPT 4.0v performed better and had closer agreement with clinician responses than the other two AI platforms.

DR is a retinal complication of prolonged DM that affects the retinal microvasculature [32]. As a result, the longer the duration of DM, the higher the risk of developing DR or sight-threatening DR is usually considered to be. Individual national screening guidelines for DR, as well as global guidelines developed by the International Council of Ophthalmology, have identified uncontrolled DM and the presence of hypertension and other systemic co-morbidities as risk factors that could alter the course of DR [10]. This study identified six factors based on clinicians’ and AI’s responses. According to the findings of this study, patients with poorly controlled blood sugar levels or those with co-existing systemic co-morbidities required immediate DR screening, whereas those with a family history of DM or diabetic patients who were pregnant were recommended for screening within a year. The absence of any of these risk factors made these cases less urgent, and screening within 5 years was considered sufficient. The ADA guidelines instruct patients with type 2 diabetes to undergo screening immediately after diagnosis because many of them have had the disease for a long time before being diagnosed. Patients with type 1 diabetes, on the other hand, must be screened within 5 years of disease diagnosis [5]. Based on this, we assigned one point each to age over 45 years and type 2 diabetes. In this study, we found that the patient’s age and type of diabetes had less of an impact on the timing of DR screening in the absence of other risk factors such as poor DM control, co-existing systemic comorbidities, pregnancy, and a family history of DM. As a result, this study identified risk factors of varying significance that may influence the progression of DR and, consequently, the timing of screening in newly diagnosed cases of DM.

The study has some limitations. The number of hypothetical clinical case scenarios could have been larger, and the scenarios could have been presented to a larger number of clinicians and AI applications. The study placed little emphasis on comparing the responses of various AI applications. Another limitation of this study was that when developing the case scenarios, the race of the patient was not taken into account, and the questions were only presented to Indian ophthalmologists. Presenting the clinical case scenarios to international ophthalmologists would have increased the study’s global acceptability. The term ‘majority’ clinician and AI responses used in this study may mislead readers and be misinterpreted as the best response from the entire field of retina specialists and AI platforms. The recommendations made in this study are based on expert opinions and clinically unvalidated LLMs that have not consistently performed well in ophthalmologic field expertise in many other studies. Furthermore, inter-rater agreement was at best only fair between LLMs and ophthalmologists, as well as between ophthalmologists themselves. As a result, without proper clinical validation, the study recommendations may be true and beneficial to some patients but not to all. Nonetheless, the study has the advantage of developing a simplified screening strategy for non-ophthalmologists to send newly detected diabetic cases for retina screening by combining real-world clinical experience with up-to-date AI information. After validation, these screening recommendations could be integrated into the hospital’s electronic medical record system, alerting ophthalmologists and non-ophthalmologists to refer patients with newly detected DM for timely DR screening and thus helping to reduce the number of unnecessary referrals to retina specialists of patients who are less likely to have any form of DR.

In conclusion, AI has the potential to be highly influential in the screening of DR in newly diagnosed patients with DM by creating an innovative DR screening score. Further studies are necessary for validating the DR screening score prior to its application as a guide in practical clinical scenarios.

Electronic supplementary material

Below is the link to the electronic supplementary material.

40942_2024_533_MOESM1_ESM.docx (14.6 KB, docx)

Supplementary Material 1: Clinical case scenarios generated by ChatGPT 3.5v

Acknowledgements

None.

Abbreviations

DM: diabetes mellitus
DR: diabetic retinopathy
AI: artificial intelligence
LLM: large language model
ADA: American Diabetes Association

Author contributions

RV, JC– conceptualising the study, data acquisition, analysing the data, statistics and results, interpreting the findings, writing & reviewing the manuscript. VP, RPK, SSH, RAS, NKY– Clinicians responding to the clinical case scenarios. NG, AISH, NR– collecting the AI responses and collating and analysing the clinician responses. RS– critically reviewing the manuscript.

Funding

No funds, grants, or other support were received.

Data availability

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

Declarations

Research involving animal participants

“This article does not contain any studies with animals performed by any of the authors.”

Ethics approval and consent to participate

The study was exempted from seeking further approvals from the Institutional research board and Ethics committee.

Consent for publication

As this was a questionnaire-based study, a waiver was obtained from the IRB and EC of the institution regarding consent for publication.

Competing interests

The authors have no relevant financial or non-financial interests to disclose.

Plant reproducibility

Not applicable.

Gel and blots/image manipulation

Not applicable.

Footnotes

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1.International Diabetes Federation. IDF diabetes atlas [Internet] 2021. cited 2023 Nov 15. Available from: https://diabetesatlas.org/.
2.Deshpande AD, Harris-Hayes M, Schootman M. Epidemiology of diabetes and diabetes-related complications. Phys Ther. 2008;88:1254–64. doi: 10.2522/ptj.20080020. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Sayin N. Ocular complications of diabetes mellitus. WJD. 2015;6:92. doi: 10.4239/wjd.v6.i1.92. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Vieira-Potter VJ, Karamichos D, Lee DJ. Ocular complications of diabetes and therapeutic approaches. Biomed Res Int. 2016;2016:1–14. doi: 10.1155/2016/3801570. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Solomon SD, Chew E, Duh EJ, Sobrin L, Sun JK, VanderBeek BL, et al. Diabetic Retinopathy: A position Statement by the American Diabetes Association. Diabetes Care. 2017;40:412–8. doi: 10.2337/dc16-2641. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Raman R, Ramasamy K, Rajalakshmi R, Sivaprasad S, Natarajan S. Diabetic retinopathy screening guidelines in India: All India Ophthalmological Society diabetic retinopathy task force and Vitreoretinal Society of India Consensus Statement. Indian J Ophthalmol. 2021;69:678–88. doi: 10.4103/ijo.IJO_667_20. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Teo ZL, Tham Y-C, Yu M, Chee ML, Rim TH, Cheung N, et al. Global prevalence of Diabetic Retinopathy and Projection of Burden through 2045: systematic review and Meta-analysis. Ophthalmology. 2021;128:1580–91. doi: 10.1016/j.ophtha.2021.04.027. [DOI] [PubMed] [Google Scholar]
8.Raman R, Vasconcelos JC, Rajalakshmi R, Prevost AT, Ramasamy K, Mohan V, et al. Prevalence of diabetic retinopathy in India stratified by known and undiagnosed diabetes, urban–rural locations, and socioeconomic indices: results from the SMART India population-based cross-sectional screening study. The Lancet Global Health. 2022;10:e1764–73. doi: 10.1016/S2214-109X(22)00411-9. [DOI] [PubMed] [Google Scholar]
9.Pradeepa R, Mohan V. Epidemiology of type 2 diabetes in India. Indian J Ophthalmol. 2021;69:2932–8. doi: 10.4103/ijo.IJO_1627_21. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Das T, Takkar B, Sivaprasad S, Thanksphon T, Taylor H, Wiedemann P, et al. Recently updated global diabetic retinopathy screening guidelines: commonalities, differences, and future possibilities. Eye (Lond) 2021;35:2685–98. doi: 10.1038/s41433-021-01572-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
11. Kumar S, Kumar G, Velu S, Pardhan S, Sivaprasad S, Ruamviboonsuk P, et al. Patient and provider perspectives on barriers to screening for diabetic retinopathy: an exploratory study from southern India. BMJ Open. 2020;10:e037277. doi: 10.1136/bmjopen-2020-037277.
12. Kuo J, Liu JC, Gibson E, Rao PK, Margolis TP, Wilson B, et al. Factors Associated with adherence to Screening guidelines for Diabetic Retinopathy among Low-Income Metropolitan patients. Mo Med. 2020;117:258–64.
13. Moudgil T, Bains BK, Bandhu S, Kanda N. Preferred practice pattern of physicians regarding diabetic retinopathy in diabetes mellitus patients. Indian J Ophthalmol. 2021;69:3139–43. doi: 10.4103/ijo.IJO_1339_21.
14. Lim JI, Regillo CD, Sadda SR, Ipp E, Bhaskaranand M, Ramachandra C, et al. Artificial Intelligence Detection of Diabetic Retinopathy: Subgroup comparison of the EyeArt System with ophthalmologists’ dilated examinations. Ophthalmol Sci. 2023;3:100228. doi: 10.1016/j.xops.2022.100228.
15. Shamsan A, Senan EM, Ahmad Shatnawi HS. Predicting of diabetic retinopathy development stages of fundus images using deep learning based on combined features. PLoS ONE. 2023;18:e0289555. doi: 10.1371/journal.pone.0289555.
16. Wang Y-L, Yang J-Y, Yang J-Y, Zhao X-Y, Chen Y-X, Yu W-H. Progress of artificial intelligence in diabetic retinopathy screening. Diabetes Metab Res Rev. 2021;37:e3414. doi: 10.1002/dmrr.3414.
17. Raman R, Dasgupta D, Ramasamy K, George R, Mohan V, Ting D. Using artificial intelligence for diabetic retinopathy screening: policy implications. Indian J Ophthalmol. 2021;69:2993–8. doi: 10.4103/ijo.IJO_1420_21.
18. Types of artificial intelligence. https://www.javatpoint.com/types-of-artificial-intelligence.
19. Open Source Large Language Models (LLM) [Internet]. [cited 2023 Nov 15]. Available from: https://spotintelligence.com/2023/06/05/open-source-large-language-models/.
20. Yu P, Xu H, Hu X, Deng C. Leveraging generative AI and large Language models: a Comprehensive Roadmap for Healthcare Integration. Healthc (Basel) 2023;11:2776. doi: 10.3390/healthcare11202776.
21. Awan H, Khan MD, Felch W, Spivey B, Taylor H, Resnikoff S, et al. Status of Ophthalmic Education and the Eye Health Workforce in South Asian Association for Regional Cooperation Countries. Asia-Pacific J Ophthalmol. 2014;3:74–82. doi: 10.1097/APO.0000000000000037.
22. McHugh ML. Interrater reliability: the kappa statistic. Biochem Med (Zagreb) 2012;22:276–82. doi: 10.11613/BM.2012.031.
23. Yin L, Zhang D, Ren Q, Su X, Sun Z. Prevalence and risk factors of diabetic retinopathy in diabetic patients: a community based cross-sectional study. Med (Baltim) 2020;99:e19236. doi: 10.1097/MD.0000000000019236.
24. Raimondi R, Tzoumas N, Salisbury T, Di Simplicio S, Romano MR, et al.; North East Trainee Research in Ophthalmology Network (NETRiON). Comparative analysis of large language models in the Royal College of Ophthalmologists fellowship exams. Eye [Internet]. 2023 [cited 2023 Nov 18]; Available from: https://www.nature.com/articles/s41433-023-02563-3.
25. Lin JC, Younessi DN, Kurapati SS, Tang OY, Scott IU. Comparison of GPT-3.5, GPT-4, and human user performance on a practice ophthalmology written examination. Eye [Internet]. 2023 [cited 2023 Nov 18]; Available from: https://www.nature.com/articles/s41433-023-02564-2.
26. Antaki F, Touma S, Milad D, El-Khoury J, Duval R. Evaluating the performance of ChatGPT in Ophthalmology. Ophthalmol Sci. 2023;3:100324. doi: 10.1016/j.xops.2023.100324.
27. Ong J, Hariprasad SM, Chhablani J. ChatGPT and GPT-4 in Ophthalmology: applications of large Language Model Artificial intelligence in Retina. Ophthalmic Surg Lasers Imaging Retina. 2023;54:557–62. doi: 10.3928/23258160-20230926-01.
28. Ong J, Kedia N, Harihar S, Vupparaboina SC, Singh SR, Venkatesh R, et al. Applying large language model artificial intelligence for retina international classification of diseases (ICD) coding. J Med Artif Intell. 2023;6:21–1. doi: 10.21037/jmai-23-106.
29. Lanzetta P, Sarao V, Scanlon PH, Barratt J, Porta M, Bandello F, et al. Fundamental principles of an effective diabetic retinopathy screening program. Acta Diabetol. 2020;57:785–98. doi: 10.1007/s00592-020-01506-8.
30. Grzybowski A, Singhanetr P, Nanegrungsunk O, Ruamviboonsuk P. Artificial Intelligence for Diabetic Retinopathy Screening using Color retinal photographs: from development to Deployment. Ophthalmol Ther. 2023;12:1419–37. doi: 10.1007/s40123-023-00691-3.
31. ChatGPT vs. Bing vs. Google Bard: Which AI Is the Most Helpful? [Internet]. [cited 2023 Nov 18]. Available from: https://www.cnet.com/tech/services-and-software/chatgpt-vs-bing-vs-google-bard-which-ai-is-the-most-helpful.
32. Alali NM, Albazei A, Alotaibi HM, Almohammadi AM, Alsirhani EK, Alanazi TS, et al. Diabetic Retinopathy and Eye Screening: Diabetic patients Standpoint, their practice, and barriers; a cross-sectional study. JCM. 2022;11:6351. doi: 10.3390/jcm11216351.

Associated Data

Supplementary Materials

40942_2024_533_MOESM1_ESM.docx (14.6 KB, docx)

Supplementary Material 1: Clinical case scenarios generated by ChatGPT version 3.5

Data Availability Statement

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.
