Skip to main content
Journal of Diabetes Science and Technology logoLink to Journal of Diabetes Science and Technology
. 2023 Dec 4;18(2):287–301. doi: 10.1177/19322968231215350

Machine Learning and Deep Learning Techniques Applied to Diabetes Research: A Bibliometric Analysis

Maira García-Jaramillo 1, Carolina Luque 1, Fabian León-Vargas 2,
PMCID: PMC10973853  PMID: 38047451

Abstract

Background:

The use of machine learning and deep learning techniques in the research on diabetes has garnered attention in recent times. Nonetheless, few studies offer a thorough picture of the knowledge generation landscape in this field. To address this, a bibliometric analysis of scientific articles published from 2000 to 2022 was conducted to discover global research trends and networks and to emphasize the most prominent countries, institutions, journals, articles, and key topics in this domain.

Methods:

The Scopus database was used to identify and retrieve high-quality scientific documents. The results were classified into categories of detection (covering diagnosis, screening, identification, segmentation, among others), prediction (prognosis, forecasting, estimation), and management (treatment, control, monitoring, education, telemedicine integration). Biblioshiny and RStudio were used to analyze the data.

Results:

A total of 1773 articles were collected and analyzed. The number of publications and citations increased substantially since 2012, with a notable increase in the last 3 years. Of the 3 categories considered, detection was the most dominant, followed by prediction and management. Around 53.2% of the total journals started disseminating articles on this subject in 2020. China, India, and the United States were the most productive countries. Although no evidence of outstanding leadership by specific authors was found, the University of California emerged as the most influential institution for the development of scientific production.

Conclusion:

This is an evolving field that has experienced a rapid increase in productivity, especially over the last years with exponential growth. This trend is expected to continue in the coming years.

Keywords: diabetes, artificial intelligence, machine learning, deep learning, bibliometric analysis, algorithms

Introduction

In recent years, there has been a significant increase in the global prevalence of diabetes. According to the latest edition of the Diabetes Atlas 1 published by the International Diabetes Association (IDF), there were 537 million adults (aged 20-79 years) living with diabetes worldwide in 2021. It is estimated that this number will increase by 46% by 2045.

Diabetes management involves the development and use of technological advancements aimed at aiding patients and clinical decision-making processes. One such advancement is the integration of diabetes technology with artificial intelligence (AI) methods, which has demonstrated promising results in the prevention, detection, and treatment of diabetes.2-5 These AI-driven approaches use computational algorithms to analyze complex data sets, identify patterns, and develop predictive models that can contribute to early diagnosis, personalized treatment, and effective management strategies for people with diabetes.6-8 Over time, AI-based tools have also been used to enhance aspects of diabetes education, such as prediction, dietary and exercise guidance, carbohydrate counting, insulin dose guidance, monitoring of complications, and self-control. 9 These advancements highlight the potential of AI to revolutionize the approach to diagnosing, treating, and educating individuals about this disease.

A growing body of research employing AI methods can be found in the literature in various domains of diabetes research. Reviews and bibliometric analyses have been conducted, focusing on areas such as diabetes management,10,11 diabetes prediction,12,13 prediction of diabetes complications,8,14 and kidney disease,15,16 among others.17,18 A considerable number of articles have been dedicated to the application of ML techniques in the detection of eye disease-related complications.19-22 However, a comprehensive bibliometric analysis encompassing the use of AI techniques in various aspects of diabetes research, including detection, prediction, and management, has been notably absent from the literature until now. Addressing this gap in the literature through a bibliometric review is crucial for anyone interested in identifying the most promising research areas, trends, and directions within this rapidly evolving field. Furthermore, bibliometric analysis enables the quantification of publication patterns and citation metrics, providing valuable information of the growth and influence of scientific knowledge. 23

Therefore, the primary goal of this study is to perform a bibliometric analysis of research papers published in the field of machine learning (ML) and deep learning (DL) techniques applied to diabetes research from 2000 to 2022. This involves statistical analysis of published literature to identify global research trends and networks, highlighting key countries, institutions, journals, articles, citations, and research topics. The aim is to provide insights for researchers and other stakeholders involved in knowledge generation to gain a deeper understanding of the current state and evolution of this field.

Methods

A. Search Strategy

Different databases can be used to obtain bibliometric data. Web of Science (WoS) and Scopus are well-established and widely used databases in the fields of technology and health for conducting bibliometric analyses.24,25 In this study, after conducting an initial search using both databases, it was observed that some relevant articles on the subject were not found in the WoS results. This was likely due to the absence of the author’s keywords information in the records associated with these articles. Therefore, Scopus was selected as the preferred database for conducting this bibliometric analysis. Scopus compiles scientific journals, conference proceedings, and international books. A rigorous selection of content and profiling algorithms guarantees that this database only indexes documents of high scientific quality. 26

The search criteria for the bibliometric analysis included terms such as “diabetes” and all its conjugations, and words associated with AI. These words were searched for in the title, abstract, and keywords. Moreover, to refine the search scope, the most employed ML and DL techniques were identified based on Huang et al. 8 These techniques were examined using the authors’ keywords. The search was restricted to English articles and reviews published between 2000 and 2022. As a result, 1773 documents were obtained.

Most of the documents retrieved correspond to articles (90.87%), and the rest to reviews. The publications come from 661 different journal sources. The search and selection criteria for the bibliometric analysis are illustrated in Figure 1, which also provides a concise summary of the descriptive and network analyses.

Figure 1.

Figure 1.

Flowchart of the methodology used for the search, screening, and bibliometric analysis.

B. Screening Strategy

As a result of a thorough examination of several search strings, the most relevant one was carefully selected; therefore, no articles needed to be removed from the data set. In addition, a comprehensive check was performed, and no duplicate articles were found. One adjustment made to the data set involved reclassifying articles that were initially published online in 2022 but had a publication date listed as 2023.

In this study, 3 main categories were considered to better structure the bibliometric analysis: detection, prediction, and management. Similar categories have been used in other studies analyzing the main AI contributions in specific health topics.11,27-29 This categorization seeks to organize the database based on the main purpose addressed in each article.

The detection category encompassed articles bearing fundamental objectives such as detection, early detection, diagnosis, screening, classification, risk grading, severity assessment, segmentation, discrimination, stratification, identification, and quantification. The Prediction category incorporated articles with primarily goals including prediction, prognosis, forecasting, and estimation. Ultimately, the Management category comprised articles focusing on management, treatment, control, education, monitoring, and the integration of telemedicine strategies. Articles that did not meet the criteria for these 3 categories were classified as “Others.” Manual classification of the 1773 articles was performed based on the information presented within the titles and abstracts. The Detection category amassed the highest number of articles (957), followed by the Prediction category (455), and the Management category (149). The Others category (212 articles) encompassed most review articles and instances of ML and DL applications in various health domains. These articles did not primarily focus on diabetes but used diabetes data/cohorts in their research. Therefore, we retained this category in the bibliometric analysis to obtain a comprehensive understanding of the overall publication landscape.

C. Bibliometric Analysis

A bibliometric analysis was conducted on 1773 articles to provide insight into the research landscape of ML and DL applied to diabetes. This analysis comprised both descriptive and network components. The descriptive analysis focused on several aspects, including publication trends, the main sources of publications, author productivity, and an examination of the most frequently cited documents. The network analysis identified the prominent countries and institutions in this field and explored co-occurrence patterns among author keywords.

The growth rate of publications was calculated as follows:

Growthrate=((p1p2)1n1)×100 (1)

where p1 corresponds to the number of publications in the last year, p2 is the number of publications in the first year, and n is the number of years. 22

Network analysis was used to examine the scientific, social, intellectual, and conceptual structures of scientific production in the field of interest. 30 Networks exhibit the research constituents or objects (e.g., authors, institutions, countries) and their relationships as an interconnected system in which the vertices (nodes) represent the objects and the edges (links) draw the relationships between them. In network analysis, the centrality metrics of the vertices (e.g., weighted degree, betweenness, closeness, and PageRank) are computed to provide empirical evidence of their relative importance in the network. 24

The weighted degree corresponds to the number of edge incidents in the node multiplied by the strength of each link. Betweenness centrality measures the ability of a node to carry information between groups of unconnected nodes. A high betweenness centrality for a node indicates that other nodes are dependent on it to access information. 31 Closeness is a measure of how close nodes are to each other, that is, the ability of nodes to effectively transport information to nearby nodes. Finally, PageRank is an alternative measure of the impact of a node in a network. A high PageRank indicates that a node is of “high quality” or “high domain” in the research. 32

Both descriptive and network analyses were conducted using R version 4.3.1, RStudio 2023.06.1+524, and Biblioshiny. Biblioshiny is a web-based application that provides a user-friendly interface for the Bibliometrix R package. 33 Microsoft Excel was also used for the document categorization and counting tasks.

For network analysis, the Louvain algorithm option was also considered in Biblioshiny to identify disjoint subsets of vertices with distinctive relational patterns.34-36 The associated parameters were set to “number of nodes = 100” (those with the highest normalized degree), “minimum number of edges = 1,” “normalization = association,” “remove isolated nodes = yes,” ‘layout = Auto,” and “repulsion force = 0.1.” Additional details can be found in Aria and Cuccurullo’s study.33,37

Descriptive Analysis

Publication Trends

Studies on ML and DL techniques applied to diabetes research began in the 2000s and have progressively increased from 2012 to 2022, with a substantial increase in the last 3 years (see Figure 2). In 2022, the highest level of publications was obtained, with 575 studies. The annual growth rate of publications on the subject was 33.49% (Equation 1), with an average citation per document of 20.71. The highest peak in the citation frequency per year occurred in 2020, with 7236 citations. After 2020, the annual citation decreased to 2513 in 2022. This drop in citation frequency is due to more recent articles having a shorter exposure time to the public than previous ones.

Figure 2.

Figure 2.

Total number of publications and average citations per year in the field, including productivity over time for detection, prediction, management, and other categories.

When examining the annual production by categories (Figure 2), it is evident that investigations within Detection have increased notably since 2012, while those classified in Prediction, Management, and Others have increased since 2017. The annual growth rate of studies within the Detection category was 30.01% (2000–2022), Prediction 22.83% (2001–2022), Others 21.84% (2002–2022), and Management 16.57% (2001–2022).

Since 2020, there has been an annual publication of more than 80 articles within the Detection and Prediction categories, which are the most active areas of research. Specifically, within the Detection category, there were 147 publications in 2020, 203 in 2021, and 322 in 2022, with average citations per document of 27.33, 12.34, and 4.33, respectively. In the case of Prediction, there were 89, 100, and 150 articles in the last 3 years with 21.26, 10.54, and 4.33 citations on average per document, respectively. Within the Detection category, articles published in 2018 received the highest annual citation count of 4434, whereas for the Prediction category, this milestone occurred in 2020 with 1892 citations.

Featured Research Journals

From 2000 to 2022, 661 journal sources were dedicated to research involving the application of ML and DL techniques in diabetes. Notably, the most influential journals, in terms of the number of published articles, accounted for 17% of the total studies. The top 10 most productive journals in the field are outlined in Table 1. These journals possess impact factors ranging from 1 to 9 and have accumulated total citations spanning 200 to 2000.

Table 1.

Top 10 Most Relevant Journals.

Journals Publisher Impact factor (2022) Total citations Documents per category Total
D P M O
IEEE Access IEEE 4.820 1963 42 12 4 2 60
Sensors MDPI 3.900 474 23 8 8 3 42
Computers in biology and medicine Elsevier 6.698 1167 30 4 4 0 38
Applied sciences (Switzerland) MDPI 2.838 625 18 5 5 3 31
IEEE Journal of biomedical and health informatics IEEE 8.330 1031 9 10 6 3 28
Journal of diabetes science and technology Sage 1.142 839 6 8 8 6 28
Diagnostics MDPI 3.610 228 17 5 5 3 24
Computer methods and programs in biomedicine Elsevier 7.027 895 9 8 2 1 20
Multimedia tools and applications Springer 3.600 341 15 3 0 2 20
Artificial intelligence in medicine Elsevier 8.520 696 8 9 0 1 18

Abbreviations: D, Detection; M, Management; O, Others; P, Prediction.

The number of active journals disseminating scientific knowledge on this subject reveals a notable increase. Between 2000 and 2010, there were only 46 journals; however, from 2011 to 2017, an additional 101 journals emerged. Moreover, over the past 5 years, 443 new research journals have been activated, while 53.2% of the total journals started disseminating articles in 2020 to 2022.

The most relevant journals published the first studies before 2020. For example, IEEE Access first published on the topic in 2018, Sensors in 2009, Computers in Biology and Medicine in 2013, Applied Sciences in 2018, IEEE Journal of Biomedical and Health Informatics in 2014, Journal of Diabetes Science and Technology in 2008, Diagnostics in 2019, Computer Methods and Programs in Biomedicine in 2005, Multimedia Tools and Applications in 2019, and Artificial Intelligence in Medicine in 2008.

Most papers published in IEEE Access (70%), Sensors (54.8%), Computers in Biology and Medicine (78.9%), Applied Sciences (58.1%), Diagnostics (70.8%), and Multimedia Tools and Applications (75%) are related to the Detection category, as indicated in Table 1. Conversely, studies that encompass both the Detection and Prediction categories are prominent in the IEEE Journal of Biomedical and Health Informatics (67.9%), Computer Methods and Programs in Biomedicine (85%), and Artificial Intelligence in Medicine (94.4%). In the Journal of Diabetes Science and Technology, articles related to the Prediction and Management categories predominate (57.1%).

Author Productivity Insights

A total of 6910 authors actively participated in research related to the application of ML and DL techniques in diabetes. The average collaboration cooperation index (i.e., the number of authors per article) is 5.25. It is noteworthy that only 3% of the investigations were attributed to a single author, underscoring the rarity of publications without co-authorship in this field.

Furthermore, within the pool of retrieved authors, there appears to be no distinct leadership pattern. This is likely because the application of ML and DL techniques in diabetes research is a relatively recent and rapidly evolving area of study involving numerous academics from various disciplines. Consequently, the concentration of scientific production within a select group of authors has yet to be observed.

Most Cited Documents

Table 2 presents the top 3 most highly cited articles within each of the Detection, Prediction, and Management categories. This includes the reference, title, purpose, primary ML and DL methods employed, journal, and total citations (TC) for each article.

Table 2.

Most Cited Documents by Category.

Category Ref (year) Title Purpose Methods Journal TC
Detection Kermany et al 38 Identifying medical diagnoses and treatable diseases by image-based deep learning Develop a diagnostic tool using deep learning to screen patients with common treatable retinal diseases that can cause blindness. TL Cell 2239
Abramoff et al 39 Improved Automated Detection of Diabetic Retinopathy on a Publicly Available Dataset Through Integration of Deep Learning Automate the detection of diabetic retinopathy. CNN Investigative ophthalmology & visual science 628
Sinthanayothin et al 40 Automated detection of diabetic retinopathy on digital fundus images Pre-process retinal digital images, localize the major retinal landmarks, and recognize diabetic pathologies. MLP neural network Diabetic medicine 420
Prediction Zou et al 41 Predicting Diabetes Mellitus With Machine Learning Techniques Predict diabetes mellitus. DT, RF and NN Frontiers in genetics 407
Sneha and Gangil 42 Analysis of diabetes mellitus for early prediction using optimal features selection Identify the important attributes that contribute to the problem of diabetes and develop a technique for early prediction of diabetes using these attributes. DT, RF, SVM, NB, KNN Journal of Big data 173
Hasan et al 43 Diabetes Prediction Using Ensembling of Different Machine Learning Classifiers Create a framework for diabetes prediction using an ensemble of different machine learning classifiers. KNN, DT, RF, AdaBoost, NB, and XGBoost IEEE Access 163
Management Contreras and Vehi 10 Artificial intelligence for diabetes management and decision support: literature review Review the literature on the application of AI to diabetes management and decision support, and propose a functional taxonomy for this topic. FL, RL, FNN, ANN, RF, SVR, RA, Kernel, EA, DT, NB, NLP, K-means, KNN, Bagging, Bayes Journal of medical Internet research 220
Fagherazzi and Ravaud 2 Digital diabetes: Perspectives for diabetes prevention, management and research Explore the potential of digital technologies, Big Data, and artificial intelligence in diabetes prevention, management, and research. Not disclosed Diabetes and metabolism 78
Stein and Brooks 44 A Fully Automated Conversational Artificial Intelligence for Weight Loss: Longitudinal Observational Study Among Overweight and Obese Adults Examine the impact of conversational AI on weight loss and meal healthiness, as well as to assess user engagement and acceptance of the weight loss health coach AI. Not disclosed JMIR diabetes 75

Abbreviations: AI, artificial intelligence; ANN, artificial neural networks; CNN, convolutional neural networks; DT, decision tree; EA, evolutionary algorithms; FL, fuzzy logic; FNN, fuzzy neural networks; KNN, k-nearest neighbor; MLP, multilayer perceptron; NB, naïve Bayes; NN, neural network; RA, regression algorithms; RF, random forest; RL, reinforcement learning; SVM, support vector machines; SVR, support vector regression; TC, total citations; TL, transfer learning.

The 3 articles with the highest number of citations in the entire database are situated within the Detection category,38-40 which is likely due to the larger volume of articles within this category compared with the others. Here, the prominent methods include transfer learning (TL), convolutional neural networks (CNNs), and multilayer perceptron (MLP) neural networks (NNs), all of which are closely related to image processing. However, no specific technical details are provided for each of these methods.

In contrast, articles within the Prediction category presented a greater amount of technical information regarding the methods used.41-43 Decision tree (DT) and random forest (RF) techniques were commonly employed across the 3 most frequently cited articles in this category. Zou et al 41 additionally used the NN approach, while Sneha and Gangil 42 incorporated the support vector machines (SVM), naïve Bayes (NB), and k-nearest neighbor (KNN) methods. Hasan et al 43 also employed the latter 2 methods, along with the AdaBoost and XGBoost techniques.

Articles within the Management category2,10,44 did not mention specific methods. This is because 2 of the most cited articles appear to be literature reviews,2,10 emphasizing the potential of using AI techniques in diabetes management rather than directly comparing these techniques. However, the article by Contreras and Vehi 10 provides insights into the most frequently employed techniques in 7 categories related to diabetes management. These categories encompass several methods, including fuzzy logic (FL), reinforcement learning (RL), fuzzy neural networks (FNN), artificial neural networks (ANN), support vector regression (SVR), regression algorithms (RAs), kernel methods, evolutionary algorithms (EAs), natural language processing (NLP), K-means clustering, KNNs, bagging, NB, RFs, and DTs.

Network Analysis

Countries Collaboration

The country of the corresponding author reveals that research concerning the use of ML and DL techniques in the context of diabetes encompasses approximately 76 distinct countries. However, data on the country of origin are absent in 15% of the collected documents. The findings reveal a pronounced concentration of research activities in Asia (57.7%), followed by Europe (19.6%) and North America (15%). Conversely, a more modest presence was observed in Africa (3.1%), Oceania (2.8%), and South America (1.8%). Figure 3 illustrates the collaborative relationships among countries.

Figure 3.

Figure 3.

Social structure. Countries’ collaboration world map. The greater the color intensity, the greater the centrality of the node associated with the country.

China (18.1%) and India (15.8%) lead in research output in this field, followed by the United States (14.85%). These 3 countries remain in the ranking when examining productivity by category (Detection, Prediction, and Management).

India stands out with 180 collaborations involving 51 nations. Authors from India primarily collaborate with researchers from the United States (12%), Saudi Arabia (9%), the United Kingdom (9%), South Korea (7%), and Malaysia (6%). China, on the contrary, has 206 collaborations with 42 countries, with significant partnerships with the United States (25%), the United Kingdom (11%), Australia (7%), Singapore (5%), and India (4%). The United States records 242 collaborations with 52 countries, emphasizing strong connections with the United Kingdom (8%), Canada (6%), South Korea (6%), Australia (5%), and Spain (5%). Overall, 28.26% of the articles in this field resulted from international collaborations.

Figure 4 illustrates the top 5 countries with the highest productivity over time. China has significantly increased its publication output starting from 2011, surpassing both the United States and India in research productivity by 2021, thus becoming the leading country in this field. From 2000 to 2022, articles produced by China, the United States, and India have amassed a total of 4841, 4542, and 4100 citations, respectively, with an average of 18.70, 23.70, and 14.10 citations per article. Other countries have received a total number of citations that is less than 1800.

Figure 4.

Figure 4.

Top 5 countries with the highest productivity over time. (a) Absolute values and (b) Values in logarithmic scale.

Institutional Collaboration

The assessment of the institutions’ contributions was conducted by examining the affiliations of authors, leading to the evaluation of 3043 institutions. This evaluation was performed by counting the number of articles associated with each institution. The top 5 most relevant institutions in this regard are Imperial College London (The United Kingdom) and the University of California (The United States), each with 22 articles, followed by Mansoura University (Egypt) and the National University of Singapore, both with 21 articles, and Capital Medical University (China) with 20 articles.

It is worth emphasizing that the research affiliated with any institution mentioned is relatively recent. For instance, Imperial College London has been actively involved in studies in this field since 2015, and all these institutions have reached their peak productivity levels in the last 2 years (2021-2022).

Figure 5 shows the institutional collaboration network. Nodes represent institutions, and the connection between nodes indicates institutional collaboration. The thickness of the edges indicates the frequency of collaboration. The thicker the edge, the greater the number of collaborations between the connected institutions.

Figure 5.

Figure 5.

Social structure. Collaboration network between institutions. Nodes represent institutions, and the connection between nodes indicates institutional collaboration. The size of the circle indicates the degree of the node.

The network of collaborations between institutions exhibits 10 clusters with sizes ranging from 3 to 12. The normalized degree metric reveals 7 main nodes detailed in Table 3. According to the centrality measures of the network’s vertices, the University of California is the most important (influential) and key institution for developing scientific production in this field. This institution presents the highest normalized degree, betweenness, and PageRank of the entire network. The University of California has a direct connection with 3 institutions: China Medical University (China), Arizona State University (The United States), and Aalborg University (Denmark). However, it connects indirectly with all other network nodes, except for the group formed by Iranian institutions (Shahid Beheshti University of Medical Sciences, Tehran University of Medical Sciences, and Islamic Azad University). In fact, the highest closeness metric corresponds to the Tehran University of Medical Sciences (0.5).

Table 3.

Predominant Institutions According to Measures of Centrality.

Institution Normalized degree Betweenness Closeness PageRank
University of California 1.000 1.137.873 0.0051 0.0407
Imperial College London 0.904 96.974 0.0034 0.0179
Capital Medical University 0.827 415.256 0.0046 0.0277
Singapore Eye Research Institute 0.808 349.023 0.0048 0.0348
Harvard Medical School 0.788 241.017 0.0040 0.0216
National University of Singapore 0.750 674.370 0.0051 0.0374
Mansoura University 0.731 186.696 0.0035 0.0257

Author Keywords and Trending Topics

Author-provided keywords facilitate the discernment of thematic patterns that serve as focal points for research inquiry. 45 The collected documents gathered 3852 different keywords. The construction of the co-occurrence network-based author’s keywords favors the identification of semantic similarity between terms and reveals knowledge structures around the area of interest.46,47 To elaborate the network, a dictionary of synonyms was inserted to unify terms corresponding to the same concept (e.g., “neural network” and “neural networks”). The parameters described at the end of the Methods section were used to elaborate the network observed in Figure 6. This network of co-occurrences includes the 100 most common keywords (2.6%), providing an overview of the research and a visualization of the interconnected subdomains.

Figure 6.

Figure 6.

Author’s keyword co-occurrence network. Nodes represent keywords, and the connection between nodes indicates the coexistence of terms in the same document. The size of the circle indicates the degree of the node.

Figure 6 shows the author’s keywords (nodes) grouped into 2 communities (clusters, groups, or subdomains). The coexistence of terms in the same cluster indicates that they have a similar context and relationship. The edges of the network determine the coexistence of terms in the same document.

The red cluster comprises 63 terms. Nodal centrality metrics (normalized degree, betweenness, closeness, and PageRank) point to 3 important nodes: machine learning, diabetes diagnostics, and artificial neural networks. The blue cluster includes the remaining 37 nodes and stands out: diabetic retinopathy, deep learning, and artificial intelligence. Specifically, the PageRank metric distinguishes the terms machine learning (0.097) and diabetic retinopathy (0.087) as the dominant node of each set. Table 4 shows the 10 most prominent terms within each cluster about diabetes research and the ML and DL methods/ techniques/algorithms key in the field.

Table 4.

Top Influential Author Keywords in Each Subdomain According to the PageRank Metric.

Cluster 1 (Red)
Cluster 2 (Blue)
Term PageRank Term PageRank
Diabetes application Diabetes diagnosis 0.066 Diabetic retinopathy 0.087
Diabetes mellitus 0.024 Retinal fundus images 0.015
Type 2 diabetes 0.021 Optical coherence tomography 0.011
Type 1 diabetes 0.014 Screening 0.009
Gestational diabetes 0.007 Diabetic macular edema 0.009
Diabetic foot 0.007 Age-related macular degeneration 0.008
Diagnosis 0.006 Retina 0.007
Diabetic nephropathy 0.006 Ophthalmology 0.007
Diabetes prediction 0.006 Telemedicine 0.007
Blood glucose 0.006 Glaucoma 0.006
Method/technique/algorithm Machine learning 0.097 Deep learning 0.067
Artificial neural networks 0.039 Artificial intelligence 0.049
Support vector machine 0.027 Convolutional neural network 0.032
Classification 0.023 Image processing 0.010
Prediction 0.013 Transfer learning 0.010
Data mining 0.011 Deep neural networks 0.008
Random forest 0.010 Feature extraction 0.007
Logistic regression 0.009 Segmentation 0.006
Feature selection 0.009 Computer-aided diagnosis 0.005
Decision tree 0.008 Image classification 0.004

The terminology associated with cluster 1 in Table 4 indicates ML techniques related to the study of diabetes, types of diabetes, and diabetes complications such as diabetic foot and diabetic nephropathy. Artificial neural networks, support vector machines, random forest, logistic regression, and decision tree algorithms are predominant. These techniques are mainly used to predict and classify variables of interest (supervised learning) through feature selection.48-50 Both processes are subfields of data mining.51,52

However, the terminology associated with cluster 2 highlights the relationship between DL techniques (a subfield of AI and machine learning53,54) and diabetes complications related to eye diseases. In this cluster, convolutional neural networks, transfer learning, and deep neural networks are the predominant algorithms. These techniques are mainly applied to image classification and segmentation processes (supervised and unsupervised learning) and computer-aided diagnosis by feature extraction.55-57

The trending topic analysis was conducted using the author-provided keywords. Bibliometric parameters encompassing a minimum frequency of 10 occurrences and a threshold of 5 occurrences per year were applied. Figure 7 depicts the prominent terms over time from 2010 to 2022. The term medical diagnosis holds the longest enduring trend, spanning 2010 to 2020. Post-2020, the authors have incorporated additional terms that denote refined diagnostic contexts within the keywords. Notable instances include diabetic retinopathy, diabetes diagnosis, and diabetic foot.

Figure 7.

Figure 7.

Trending topics in the field.

Figure 7 also illustrates a rise in the prevalence of algorithms on the application of ML and DL techniques within diabetes research, a trend manifesting in authors’ keywords since 2014. Notably, the term artificial neural network gained traction from 2014 to 2021, with its height occurring in 2019 at a frequency of 227 instances. By the years 2021-2022, a subtype of ANN known as convolutional neural network has garnered prominence. This latter term observed substantial uptake starting in 2020 and reached its peak at 203 occurrences in 2021. Moreover, prominent terms including artificial intelligence (294), deep learning (400), diabetic retinopathy (598), and machine learning (646) have exhibited a robust surge over the last 2 years, asserting their dominion in 2021.

Discussion

To the best of our knowledge, this study represents the first comprehensive bibliometric analysis of the application of ML and DL techniques within the field of diabetes research. In contrast to some bibliometric analyses that have focused on specific subdomains or applications, our analysis takes a broader perspective, encompassing the entire spectrum of ML and DL research in the context of diabetes.

We observed a noteworthy surge in interest within this field, particularly since 2012, and a notable increase in the last 3 years (see Figure 2). This can be attributed to the widespread dissemination and recognition of the enormous potential of AI in healthcare applications.58-60 In addition, it is closely tied to the ongoing evolution and refinement of these cutting-edge techniques.

A remarkable level of productivity was observed within the Detection category, which primarily focused on diabetes retinopathy and eye diseases. This can be attributed to various factors, including advancements in medical imaging technology, a heightened awareness of the significance of early detection in preventing vision loss, and the availability of extensive data sets for training and validating ML and DL models.61-63 Furthermore, progress in ML and DL techniques, such as the integration of convolutional NNs with SVMs or RF to automatically extract features from retinal images and enhance pattern classification tasks,22,64 has contributed to increased attention and productivity. The reader can refer to Table A in the Supplemental Appendix to observe the most common techniques found in each category.

Similarly, albeit with an important time lag, productivity within the Prediction category has experienced a substantial increase since 2017 (see Figure 2). Research in this subdomain has primarily focused on predicting diabetes, employing well-established ML techniques such as RF, logistic regression, SVMs, decision trees and NB (see Table A). These techniques are well-suited for modeling relationships between variables in this context, as they are methods capable of handling structured and unstructured data and incorporating categorical and quantitative variables, such as blood glucose levels, blood pressure, body mass index, and family history.

In the Management category, a similar trend in productivity was observed since 2017 compared to the Prediction category, although with notably fewer articles. The most common machine learning techniques observed within this category include logistic regression, RFs, SVMs, linear regression, and RL (see Table A). The relatively low number of articles in this category, as compared with the others, may be attributed to a combination of factors, such as research priorities, funding availability, clinical requirements, and prevailing trends in the field of diabetes from 2000 to 2022. It is essential to acknowledge that maturity in the use of ML and DL techniques can indeed impact the distribution of articles across the categories considered. As the scientific community accumulates experience and grows more confident in the applicability and precision of these methods, it is probable that their application expands into a range of areas, spanning from early detection to advanced management and treatment. This is a common behavior for the adoption of new technologies in medical research. 8 Therefore, greater growth in the coming years is expected to be seen in this subdomain. Some potential applications that currently do not utilize AI techniques could include earlier prediction of glucose excursions from continuous monitoring data, building upon existing research, 65 or reducing the impact of hypoglycemic events by leveraging machine learning to improve basal-bolus treatments.66,67 This maturity could also extend to automated insulin delivery systems that can leverage the capabilities of ML and DL techniques on large volumes of data and their computational power to infer complementary models and control strategies directly from experience. This would allow the learning of complex behaviors that are difficult to model analytically.

Regarding research journals, it was observed that the top 10 are qualified in the first- and second-highest quartiles in Scopus (8 in Q1, and 2 in Q2). Half of them operate with subscription access and the other as open access with different levels ranging from bronze to gold. Therefore, there is no preferred type of research journal to publish between subscription and open access, although most articles have been published in top-quality journals. The reader can refer to Table B in the Supplemental Appendix to observe the above details.

In terms of country productivity, there is a clear current dominance by China, surpassing the United States and India since 2021. It is worth noting India’s presence as the second most productive country currently, which led in this research field globally from 2015 to 2021. Other countries, such as the United Kingdom and South Korea, have shown a notable increase in productivity since 2019 but still lag significantly behind the top 3. It is evident that in this research field, the results of substantial investments in AI by these leading countries are clearly reflected, with a considerable surge since 2012 (Figure 4).

Regarding the network of institutions (Figure 5), there is no clear majority leadership in the field. The percentage of documents associated with each institution ranges from 0.06% to 1.24%. In general, we observe a well-connected network, facilitating interconnection among institutions for the vast majority, except for the cluster formed by Iranian institutions. This cluster does not engage in external collaborations, leading to the highest closeness metric value. Their main studies have focused on the classification and prediction of type 2 diabetes and complications such as nephropathy.

Finally, Figure 7 allows us to observe the trends in the field over the years from 2010 to 2022. Two terms that are no longer relevant today, exudates and medical diagnosis, have given way to a greater maturity of the topic. From 2018 to 2021, a significant increase was observed in terms such as retinal imaging, data mining, logistic regression, or artificial neural networks. In other words, there was a greater specification of machine learning techniques, with an application in retinopathy that began to dominate this field (see Figure 6). This behavior, which is also observed in the widespread use of ML and DL techniques in diabetes research (Figure 2), is consistent with recent research that reports a notable increase in studies on AI techniques and their applications in areas of health sciences since 2018.68-70

Meanwhile, from 2019 to the present, new more relevant terms have been emerging focused on the prediction, classification and diagnosis of diabetes using established machine learning techniques such as support vector machines. However, since 2020, there has been a strong emphasis on an approach that includes deep learning techniques such as convolutional neural networks, applied mainly to the detection and classification of eye diseases related to complications such as retinopathy. This group of terms represents the most outstanding trend in research throughout the entire period considered in our bibliometric analysis.

Since 2021 new terms such as transfer learning, digital health, diabetic foot, or image segmentation have appeared, indicating future trends with greater impact in this field. In particular, transfer learning has gained traction as one of the machine learning strategies, thanks to its usefulness in taking a model previously trained on a task and using it as a starting point for training a new model on a related task. This is particularly useful when fitting pretrained models with large data sets using limited available data to learn specific new tasks.

Algorithms for Predicting Complications of Diabetes

Of the 455 articles within the Prediction category, 67 were dedicated to predicting complications associated with diabetes. Among these, 58 articles addressed a single complication, including 23 on retinopathy, 14 on nephropathy, 9 on cardiopathy, 7 on neuropathy, 4 on diabetic foot, and 1 on diabetic ketoacidosis. The remaining 9 articles addressed multiple complications, with 2 being literature reviews. Please refer to Table C in the Supplemental Appendix for a list of these selected articles.

The most cited article in this list was published in the Journal of Diabetes Science and Technology in 2018. 71 It presented a machine learning approach to predict the onset of retinopathy, neuropathy, or nephropathy based on electronic health records of nearly one thousand patients with type 2 diabetes from Italy. They employed a combination of random forest and logistic regression techniques, achieving a model accuracy of up to 0.838. Similar studies were reported in the other articles of Table C. These studies primarily focused on diabetes complications using databases that ranged from hundreds to thousands of patients with type 2 diabetes. The preferred techniques included recurrent neural networks, Bayesian networks, classification and regression trees, ensemble models, support vector machines, decision trees, fuzzy neural networks, and XGBoost. In most of the complications addressed, accuracy percentages greater than 70% were observed.

The scope of these results demonstrates how the use of AI, especially machine learning, has become an essential tool in diabetes research. These advances allow for more accurate prediction of complications, which can have a significant impact on the care, treatment, and quality of life of patients with diabetes. The diversity of techniques and approaches also reflects the maturity and versatility of ML in this ever-evolving field.

Limitations

This study has identified several limitations that warrant attention.

First, the presence of multiple authors sharing the same name but not being the same individual presented a challenge when identifying the most prominent authors in the field and constructing co-citation networks.

Second, the absence of comprehensive details in some articles regarding the implementation of machine learning techniques hindered our ability to assess and compare the methods employed. These articles lacked crucial information such as parameterization, methodological approaches, training, validation, and test data.

Finally, our research scope was restricted due to the exclusion of conference proceedings and books, as well as articles published in languages other than English. This could have inadvertently resulted in the omission of pertinent studies conducted in alternative formats and languages. Future research should consider broadening the search criteria to encompass a more comprehensive range of relevant research.

It is also important to acknowledge that the scope of our bibliometric analysis may have been affected by limitations imposed on downloaded data due to potential subscription restrictions on scientific databases.

Conclusions

A bibliometric analysis was conducted to examine the use of ML and DL techniques in the field of diabetes research, based on scientific publications from 2000 to 2022. Descriptive and network analyses revealed a significant increase in productivity, particularly in recent years with exponential growth. China has become one of the major contributors to research productivity, surpassing the United States and India by 2021. India, on the contrary, has demonstrated the highest level of collaboration, primarily with the United States, Saudi Arabia, and the United Kingdom.

In terms of institutions, the University of California has emerged as the most influential and key institution in the advancement of scientific production in this field. It maintains direct connections with China Medical University (China), Arizona State University (United States), and Aalborg University (Denmark).

When analyzing the authors, it was not possible to establish predominant authors or collaboration networks due to the presence of multiple authors with the same name.

By examining the author’s keywords, 2 main clusters were identified. The first one predominantly focused on machine learning and diabetes diagnosis, whereas the second one emphasized diabetic retinopathy and deep learning. This suggests that deep learning techniques are commonly employed because of the specific characteristics of the data used.

Finally, a categorical analysis was performed to evaluate the application of the ML and DL techniques. The results indicate a mastery of these techniques in the areas of Detection, followed by Prediction and Management. Among the most widely used techniques in these categories, RF, and SVM are in the top 3. Logistic regression also stands out in the Prediction and Management categories, while CNNs are predominantly used in Detection. Overall, this area of research presents significant potential for continued growth in the coming years, especially in the subdomain of Management.

Supplemental Material

sj-docx-1-dst-10.1177_19322968231215350 – Supplemental material for Machine Learning and Deep Learning Techniques Applied to Diabetes Research: A Bibliometric Analysis

Supplemental material, sj-docx-1-dst-10.1177_19322968231215350 for Machine Learning and Deep Learning Techniques Applied to Diabetes Research: A Bibliometric Analysis by Maira García-Jaramillo, Carolina Luque and Fabian León-Vargas in Journal of Diabetes Science and Technology

Footnotes

Abbreviations: AI, artificial intelligence; ANN, artificial neural networks; CNN, convolutional neural networks; DL, deep learning; DT, decision tree; EA, evolutionary algorithms; FL, fuzzy logic; FNN, fuzzy neural networks; KNN, k-nearest neighbor; ML, machine learning; MLP, multilayer perceptron; NB, naïve Bayes; NN, neural network; RA, regression algorithms; RF, random forest; RL, reinforcement learning; SVM, support vector machines; SVR, support vector regression; TC, total citations; TL, transfer learning.

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding: The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was funded by Universidad Ean (grant no. TO_P01_2022) and Universidad Antonio Nariño (grant no. 2022033).

ORCID iDs: Maira García-Jaramillo Inline graphic https://orcid.org/0000-0001-6008-9068

Fabian León-Vargas Inline graphic https://orcid.org/0000-0002-1839-2036

Supplemental Material: Supplemental material for this article is available online.

References

  • 1. International Diabetes Federation. IDF Diabetes Atlas. 10th ed. Brussels, Belgium. 2021. https://www.diabetesatlas.org [Google Scholar]
  • 2. Fagherazzi G, Ravaud P. Digital diabetes: perspectives for diabetes prevention, management and research. Diabetes Metab. 2019;45(4):322-329. [DOI] [PubMed] [Google Scholar]
  • 3. Sriram RD, Reddy SSK. Artificial intelligence and digital tools: future of diabetes care. Clin Geriatr Med. 2020;36(3):513-525. [DOI] [PubMed] [Google Scholar]
  • 4. Rashid MM, Askari MR, Chen C, Liang Y, Shu K, Cinar A. Artificial intelligence algorithms for treatment of diabetes. Algorithms. 2022;15(9):299. [Google Scholar]
  • 5. Ansari RM, Harris MF, Hosseinzadeh H, Zwar N. Application of artificial intelligence in assessing the self-management practices of patients with type 2 diabetes. Healthcare. 2023;11(6):903. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Goecks J, Jalili V, Heiser LM, Gray JW. How machine learning will transform biomedicine. Cell. 2020;181(1):92-101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Maniruzzaman M, Rahman MJ, Ahammed B, Abedin MM. Classification and prediction of diabetes disease using machine learning paradigm. Health Inf Sci Syst. 2020;8(1):7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Huang J, Yeung AM, Armstrong DG, et al. Artificial intelligence for predicting and diagnosing complications of diabetes. J Diabetes Sci Technol. 2023;17(1):224-238. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Li J, Huang J, Zheng L, Li X. Application of artificial intelligence in diabetes education and management: present status and promising prospect. Front Public Health. 2020;8:173. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Contreras I, Vehi J. Artificial intelligence for diabetes management and decision support: literature review. J Med Internet Res. 2018;20(5):e10775. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Chaki J, Ganesh S, Cidham S, Theertan S. Machine learning and artificial intelligence based Diabetes Mellitus detection and self-management: a systematic review. J King Saud Univ Comput Inf Sci. 2022;34(6):3204-3225. [Google Scholar]
  • 12. Jaiswal V, Negi A, Pal T. A review on current advances in machine learning based diabetes prediction. Prim Care Diabetes. 2021;15(3):435-443. [DOI] [PubMed] [Google Scholar]
  • 13. Tasin I, Nabil TU, Islam S, Khan R. Diabetes prediction using machine learning and explainable AI techniques. Health Technol Lett. 2022;10(1-2):1-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Alghamdi T. Prediction of diabetes complications using computational intelligence techniques. Appl Sci. 2023;13:3030. [Google Scholar]
  • 15. Park S, Park BS, Lee YJ, et al. Artificial intelligence with kidney disease: a scoping review with bibliometric analysis, PRISMA-ScR. Medicine. 2021;100(14):e25422. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Verma A, Chitalia VC, Waikar SS, Kolachalama VB. Machine learning applications in nephrology: a bibliometric analysis comparing kidney studies to other medicine subspecialities. Kidney Med. 2021;3(5):762-767. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Jimma B. Artificial intelligence in healthcare: a bibliometric analysis. Telemat Inform Rep. 2023;9:100041. [Google Scholar]
  • 18. Shaikh AK, Alhashmi SM, Khalique N, Khedr AM, Raahemifar K, Bukhari S. Bibliometric analysis on the adoption of artificial intelligence applications in the e-health sector. Digit Health. 2023;9:1-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Cuadros J. The real-world impact of artificial intelligence on diabetic retinopathy screening in primary care. J Diabetes Sci Technol. 2021;15(3):664-665. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Wang R, Zuo G, Li K, et al. Systematic bibliometric and visualized analysis of research hotspots and trends on the application of artificial intelligence in diabetic retinopathy. Front Endocrinol. 2022;13:1036426. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Shao A, Jin K, Li Y, Lou L, Zhou W, Ye J. Overview of global publications on machine learning in diabetic retinopathy from 2011 to 2021: bibliometric analysis. Front Endocrinol. 2022;13:1032144. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Feng H, Chen J, Zhang Z, Lou Y, Zhang S, Yang W. A bibliometric analysis of artificial intelligence applications in macular edema: exploring research hotspots and Frontiers. Front Cell Dev Biol. 2023;11:1174936. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Donthu N, Kumar S, Mukherjee D, Pandey N, Lim WM. How to conduct a bibliometric analysis: an overview and guidelines. J Bus Res. 2021;133:285-296. [Google Scholar]
  • 24. León-Vargas F, Arango Oviedo JA, Luna Wandurraga HJ. Two decades of research in artificial pancreas: insights from a bibliometric analysis. J Diabetes Sci Technol. 2022;16(2):434-445. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Pranckutė R. Web of Science (WoS) and Scopus: the titans of bibliographic information in today’s academic world. Publications. 2021;9(1):12. [Google Scholar]
  • 26. Baas J, Schotten M, Plume AM, Cote G, Karimi R. Scopus as a curated, high-quality bibliometric data source for academic research in quantitative science studies. MIT Press J. 2020;1(1):377-386. [Google Scholar]
  • 27. Gunasekeran DV, Ting DSW, Tan GSW, Wong TY. Artificial intelligence for diabetic retinopathy screening, prediction and management. Curr Opin Ophthalmol. 2020;31(5):357-365. [DOI] [PubMed] [Google Scholar]
  • 28. Tulloch J, Zamani R, Akrami M. Machine learning in the prevention, diagnosis and management of diabetic foot ulcers: a systematic review. IEEE Access. 2020;8:198977-199000. [Google Scholar]
  • 29. Isaksen JL, Baumert M, Hermans ANL, Maleckar M, Linz D. Artificial intelligence for the detection, prediction, and management of atrial fibrillation. Herzschrittmacherther Elektrophysiol. 2022;33(1):34-41. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Omotehinwa TO. Examining the developments in scheduling algorithms research: a bibliometric approach. Heliyon. 2022;8(5):e09510. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Kolaczyk ED, Chua DB, Barthélemy M. Group betweenness and co-betweenness: inter-related notions of coalition centrality. Soc Netw. 2009;31(3):190-203. [Google Scholar]
  • 32. Ding Y, Yan E, Frazho A, Caverlee J. PageRank for ranking authors in co-citation networks. J Am Soc Inform Sci Technol. 2009;60(11):2229-2243. [Google Scholar]
  • 33. Aria M, Cuccurullo C. bibliometrix: an R-tool for comprehensive science mapping analysis. J Informetr. 2017;11(4):959-975. [Google Scholar]
  • 34. Newman ME. Finding community structure in networks using the eigenvectors of matrices. Phys Rev E Stat Nonlin Soft Matter Phys. 2006;74(3, pt 2):036104. [DOI] [PubMed] [Google Scholar]
  • 35. Blondel VD, Guillaume JL, Lambiotte R, Lefebvre E. Fast unfolding of communities in large networks. J Stat Mech. 2008;2008(10):P10008. [Google Scholar]
  • 36. Zhang J, Fei J, Song X, Feng J. An improved Louvain algorithm for community detection. Math Probl Eng. 2021;2021:1485592. [Google Scholar]
  • 37. Aria M, Cuccurullo C. Package “bibliometrix.” Reference manual. 2023. https://cran.r-project.org/web/packages/bibliometrix/bibliometrix.pdf
  • 38. Kermany DS, Goldbaum M, Cai W, et al. Identifying medical diagnoses and treatable diseases by image-based deep learning. Cell. 2018;172(5):1122-1131. [DOI] [PubMed] [Google Scholar]
  • 39. Abramoff MD, Lou Y, Erginay A, et al. Improved automated detection of diabetic retinopathy on a publicly available dataset through integration of deep learning. Invest Ophthalmol Vis Sci. 2016;57(13):5200-5206. [DOI] [PubMed] [Google Scholar]
  • 40. Sinthanayothin C, Boyce JF, Williamson TH, et al. Automated detection of diabetic retinopathy on digital fundus images. Diabet Med. 2002;19(2):105-112. [DOI] [PubMed] [Google Scholar]
  • 41. Zou Q, Qu K, Luo Y, Yin D, Ju Y, Tang H. Predicting diabetes mellitus with machine learning techniques. Front Genet. 2018;9:515. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Sneha N, Gangil T. Analysis of diabetes mellitus for early prediction using optimal features selection. J Big Data. 2019;6(1):13. [Google Scholar]
  • 43. Hasan MK, Alam MA, Das D, Hossain E, Hasan M. Diabetes prediction using ensembling of different machine learning classifiers. IEEE Access. 2020;8:76516-76531. [Google Scholar]
  • 44. Stein N, Brooks K. A fully automated conversational artificial intelligence for weight loss: longitudinal observational study among overweight and obese adults. JMIR Diabetes. 2017;2(2):e8590. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Tripathi M, Kumar S, Sonker SK, Babbar P. Occurrence of author keywords and keywords plus in social sciences and humanities research: a preliminary study. COLLNET J Scientometr Inf Manag. 2018;12(2):215-232. [Google Scholar]
  • 46. Bordag S. A comparison of co-occurrence and similarity measures as simulations of context. In: Gelbukh A, ed. International Conference on Intelligent Text Processing and Computational Linguistics. Berlin, Germany: Springer; 2008:52-63. [Google Scholar]
  • 47. Radhakrishnan S, Erbis S, Isaacs JA, Kamarthi S. Novel keyword co-occurrence network-based methods to foster systematic reviews of scientific literature. PLoS One. 2017;12(3):e0172778. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Kotsiantis S. Feature selection for machine learning classification problems: a recent overview. Artif Intell Rev. 2011;42(1):157-176. [Google Scholar]
  • 49. Kakoly IJ, Hoque MR, Hasan N. Data-driven diabetes risk factor prediction using machine learning algorithms with feature selection technique. Sustainability. 2023;15(6):4930. [Google Scholar]
  • 50. Masood F, Masood J, Zahir H, Driss K, Mehmood N, Farooq H. Novel approach to evaluate classification algorithms and feature selection filter algorithms using medical data. J Comput Cogn Eng. 2023;2(1):57-67. [Google Scholar]
  • 51. Pati A, Parhi M, Pattanayak BK. A review on prediction of diabetes using machine learning and data mining classification techniques. Int J Biomed Eng Technol. 2023;41(1):83-109. [Google Scholar]
  • 52. Rastogi R, Bansal M. Diabetes prediction model using data mining techniques. Meas Sens. 2023;25:100605. [Google Scholar]
  • 53. Jakhar D, Kaur I. Artificial intelligence, machine learning & deep learning: definitions and differences. Clin Exp Dermatol. 2020;45(1):131-132. [DOI] [PubMed] [Google Scholar]
  • 54. Latif J, Xiao C, Imran A, Tu S. Medical imaging using machine learning and deep learning algorithms: a review. Paper presented at the 2019 2nd International Conference on Computing, Mathematics and Engineering Technologies (iCoMET) 2019; 30-31 January 2019; Sukkur, Pakistan. [Google Scholar]
  • 55. Bhandari S, Pathak S, Jain SA. A literature review of early-stage diabetic retinopathy detection using deep learning and evolutionary computing techniques. Arch Comput Methods Eng. 2023;30(2):799-810. [Google Scholar]
  • 56. Önal MN, Güraksin GE, Duman R. Convolutional neural network-based diabetes diagnostic system via iridology technique. Multimed Tools Appl. 2023;82(1):173-194. [Google Scholar]
  • 57. Usman TM, Saheed YK, Ignace D, Nsang A. Diabetic retinopathy detection using principal component analysis multi-label feature extraction and classification. Int J Cogn Comput Eng. 2023;4:78-88. [Google Scholar]
  • 58. Bohr A, Memarzadeh K. The rise of artificial intelligence in healthcare applications. In: Bohr A, Memarzadeh K, eds. Artificial Intelligence in Healthcare. Cambridge, MA: Academic Press; 2020:25-60. [Google Scholar]
  • 59. Davenport T, Kalakota R. The potential for artificial intelligence in healthcare. Future Healthc J. 2019;6(2):94-98. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60. Yoo I, Alafaireet P, Marinov M, et al. Data mining in healthcare and biomedicine: a survey of the literature. J Med Syst. 2012;36(4):2431-2448. [DOI] [PubMed] [Google Scholar]
  • 61. Castiglioni I, Rundo L, Codari M, et al. AI applications to medical images: from machine learning to deep learning. Phys Med. 2021;83:9-24. [DOI] [PubMed] [Google Scholar]
  • 62. Panayides AS, Amini A, Filipovic ND, et al. AI in medical imaging informatics: current challenges and future directions. IEEE J Biomed Health Inform. 2020;24(7):1837-1857. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63. Kauppi T, Kalesnykiene V, Kamarainen J-K, et al. The DIARETDB1 diabetic retinopathy database and evaluation protocol. Paper presented at the British Machine Vision Conference; 10-13 September 2007; Warwick, England. [Google Scholar]
  • 64. Poly T, Islam M, Walther B, Lin M, Li Y. Artificial intelligence in diabetic retinopathy: bibliometric analysis. Comput Methods Programs Biomed. 2023;231:107358. [DOI] [PubMed] [Google Scholar]
  • 65. Calm R, García-Jaramillo M, Vehí J, Bondia J, Tarín C, García-Gabín W. Prediction of glucose excursions under uncertain parameters and food intake in intensive insulin therapy for type 1 diabetes mellitus. Annu Int Conf IEEE Eng Med Biol Soc. 2007;2007:1770-1773. [DOI] [PubMed] [Google Scholar]
  • 66. Gómez AM, Vallejo S, Ardila F, et al. Impact of a basal-bolus insulin regimen on metabolic control and risk of hypoglycemia in patients with diabetes undergoing peritoneal dialysis. J Diabetes Sci Technol. 2018;12(1):129-135. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67. Gómez AM, Imitola Madero A, Henao Carrillo DC, et al. Hypoglycemia incidence and factors associated in a cohort of patients with type 2 diabetes hospitalized in general ward treated with basal bolus insulin regimen assessed by continuous glucose monitoring. J Diabetes Sci Technol. 2020;14(2):233-239. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68. Guo Y, Hao Z, Zhao S, Gong J, Yang F. Artificial intelligence in healthcare: bibliometric analysis. J Med Internet Res. 2020;22(7):e18228. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69. Li Y, Xu Z, Wang X, Wang X. A bibliometric analysis on deep learning during 2007–2019. Int J Mach Learn Cyber. 2020;11:2807-2826. [Google Scholar]
  • 70. Kumari J, Kumar E, Kumar D. A Structured analysis to study the role of machine learning and deep learning in the healthcare sector with big data analytics. Arch Computat Methods Eng. 2023;30:3673-3701. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71. Dagliati A, Marini S, Sacchi L, et al. Machine learning methods to predict diabetes complications. J Diabetes Sci Technol. 2018;12(2):295-302. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

sj-docx-1-dst-10.1177_19322968231215350 – Supplemental material for Machine Learning and Deep Learning Techniques Applied to Diabetes Research: A Bibliometric Analysis

Supplemental material, sj-docx-1-dst-10.1177_19322968231215350 for Machine Learning and Deep Learning Techniques Applied to Diabetes Research: A Bibliometric Analysis by Maira García-Jaramillo, Carolina Luque and Fabian León-Vargas in Journal of Diabetes Science and Technology


Articles from Journal of Diabetes Science and Technology are provided here courtesy of Diabetes Technology Society

RESOURCES