Abstract
Objective
To understand the prevalence and associated risk factors of lower extremity arterial disease (LEAD) in Chinese diabetic patients and to construct a risk prediction model.
Methods
Data from the Diabetes Complications Warning Dataset of the China National Population Health Science Data Center were used. Logistic regression analysis was employed to identify related factors, and machine learning algorithms were used to construct the risk prediction model.
Results
The study population consisted of 3,000 patients, with 476 (15.9%) having LEAD. Multivariate regression analysis indicated that male gender, atherosclerosis, carotid artery stenosis, fatty liver, hematologic diseases, endocrine disorders, and elevated glycosylated serum proteins were independent risk factors for LEAD. The risk prediction models constructed using Logistic regression and MLP algorithms achieved moderate discrimination performance, with AUCs of 0.73 and 0.72, respectively.
Conclusion
Our study identified the risk factors for LEAD in Chinese diabetic patients, and the constructed risk prediction model can aid in the diagnosis of LEAD.
Introduction
Lower extremity arterial disease (LEAD) is significantly associated with diabetes, with the risk increasing as the severity of diabetes worsens [1]. By 2035, the number of diabetes patients is expected to increase rapidly to approximately 600 million people [2]. Diabetic patients are at a higher risk of developing LEAD compared to non-diabetic patients [3, 4]. Consequently, the prevalence of lower extremity arterial disease in diabetic patients (LEADDP) is expected to increase significantly in the near future. However, statistics indicate that over 60% of LEAD patients may be asymptomatic, leading to early stages of the disease often being overlooked [5]. Compared to non-diabetic LEAD patients, diabetic peripheral neuropathy may mask LEAD symptoms, making the diagnosis of LEADDP more challenging. Current guidelines recommend initial screening for LEAD based on patient interviews and clinical examinations, using the Fontaine or Rutherford scales for assessment [6, 7]. This diagnosis heavily relies on the expertise of specialists and examination conditions, and the diagnostic performance is not ideal [8].
LEAD increases the risk of foot ulcers and lower limb amputations, significantly impacting the quality of life and treatment for diabetic patients, while also imposing a substantial economic burden on society [9]. Therefore, identifying risk factors and accurately predicting LEADDP risk is crucial for the early initiation of prevention and treatment in high-risk patients. In recent years, constructing risk prediction models using machine learning algorithms has become increasingly popular in medical research. These algorithms can automatically select informational variables and capture nonlinear relationships between variables, enhancing predictive capabilities.
The primary aim of this study is to describe the prevalence of LEAD in diabetic patients and to identify potential risk factors. Subsequently, we trained machine learning models to predict LEADDP risk by integrating all available clinical objective information.
Data and methods
In this study, we used the Diabetes Complications Warning Dataset obtained from the China National Population Health Science Data Center to analyze the significant risk factors for LEADDP and to construct a risk prediction model. This dataset includes general information, vital signs, laboratory measurements, and comorbidities for 3,000 diabetic patients, totaling 81 variables (S1 Table). It encompasses traditional risk factors [1, 8]: dyslipidemia, hypertension, cardiovascular diseases, kidney disease, inflammation, obesity, male gender, aging, and infections. Due to the complex pathogenesis of LEADDP and the unclear associated risk factors, we also included more information on comorbidities and additional laboratory indicators in order to identify new risk factors and avoid omissions. We aimed to construct a risk prediction model that can be used by non-specialists; hence, treatment factors were not included.
In this study, we utilized traditional statistical methods to describe the cohort’s demographic and health characteristics. For normally distributed continuous variables, independent sample t-tests or one-way ANOVA were employed to compare the means ± standard deviations between groups. Categorical variables were presented as frequencies (percentages), and the chi-square test was used to assess differences between groups. LEADDP was taken as the dependent variable, while general information, vital signs, laboratory measurements, and comorbidities were considered independent variables to explore the influence of various risk factors. Based on the results of the univariate analysis, factors with a p-value less than 0.1 were included in the multivariate analysis. We examined the correlations between all variables and visualized data with strong associations (Fig 1). Before training the model, we excluded incomplete cases from the dataset, resulting in the removal of 15 cases. After this preprocessing step, a total of 2,985 cases were included in the model. The collinearity of the included factors was examined. The dataset was randomly split into training and validation sets in a 7:3 ratio, with LEADDP as the dependent variable. The training set included 2,089 cases, with 296 cases of LEADDP and 1,793 cases of non-LEADDP. The validation set included 896 cases, with 127 cases of LEADDP and 769 cases of non-LEADDP (Table 1). We used the training set data to construct the LEADDP risk prediction model and validated it using the validation set data.
Fig 1. Heatmap of variable correlations.
Dark blue areas indicate strong negative correlations, while dark red areas indicate strong positive correlations. Light colors show weaker correlations. Abbreviations: AS, Atherosclerosis; FLD, Fatty Liver Disease; HBA1C, Hemoglobin A1c; TP, Total Protein.
Table 1. Characteristics of samples in the training and validation sets.
| NO LEADDP | LEADDP | Total | |
|---|---|---|---|
| Training set | 1793 | 296 | 2089 |
| Validation set | 769 | 127 | 896 |
| Total | 2562 | 423 | 2985 |
Results
Among the 3,000 diabetic patients included in this study, 476 (15.9%) had LEADDP. The average age was 57.8 years, with 37.5% being female. Significant differences were observed between patients who had LEADDP and those who did not in terms of general information, vital signs, laboratory tests, and comorbidities (Table 2).
Table 2. Baseline characteristics according to the presence of LEADDP.
| Category/variable | NO LEADDP | LEADDP | P value | |
|---|---|---|---|---|
| Sex, n (%) | male | 1560 (61.8) | 314 (66) | 0.089 |
| female | 964 (38.2) | 162 (34) | ||
| Age, years | 57.51±11.28 | 59.3±10.37 | 0.001 | |
| Height, meter | 166.51±7.03 | 166.34±8.54 | 0.665 | |
| Weight, kg | 73±13.00 | 73.56±12.43 | 0.392 | |
| SBP, mmHg | 138.22±20.84 | 141.18±21.72 | 0.005 | |
| DBP, mmHg | 80.40±12.01 | 80.82±11.57 | 0.48 | |
| Heart Rate, n/min | 79.88±26.81 | 69.64±35.03 | 0.107 | |
| BMI, kg/m2 | 26.25±3.82 | 26.58±3.61 | 0.088 | |
| Hypertention, n (%) | no | 835(33.1) | 119(25) | 0.001 |
| yes | 1689(66.9) | 357(75) | ||
| Hyperlipidemia, n (%) | no | 1973(78.3) | 371(77.9) | 0.952 |
| yes | 551(21.8) | 105(22.1) | ||
| AS, n (%) | no | 1420(56.3) | 37(7.8) | <0.0001 |
| yes | 1104(43.7) | 439(92.2) | ||
| Cerebral Apoplexty, n (%) | no | 2351(93.1) | 425(89.3) | 0.004 |
| yes | 173(6.9) | 51(10.7) | ||
| Carotid Artery Stenosis, n (%) | no | 2440(96.7) | 431(90.5) | <0.0001 |
| yes | 84(3.3) | 45(9.5) | ||
| FLD, n (%) | no | 1793(71) | 270(56.7) | <0.0001 |
| yes | 731(29) | 206(43.3) | ||
| Cirrhosis, n (%) | no | 2483(98.4) | 470(98.7) | 0.689 |
| yes | 41(1.6) | 6(1.3) | ||
| CLD, n (%) | no | 2191(968) | 400(84) | 0.109 |
| yes | 333(13.2) | 76(16) | ||
| Pancreatic Disease, n (%) | no | 2482(98.3) | 470(98.7) | 0.563 |
| yes | 42(1.7) | 6(1.3) | ||
| Biliary Tract Disease, n (%) | no | 2190(86.8) | 383(80.5) | <0.0001 |
| yes | 334(13.2) | 93(19.5) | ||
| Nephropathy, n (%) | no | 1520(60.2) | 203(42.6) | <0.0001 |
| yes | 1004(39.8) | 273(57.4) | ||
| Renal Faliure, n (%) | no | 2375(94.1) | 442(92.9) | 0.347 |
| yes | 149(5.9) | 34(7.1) | ||
| Nervous System Disease, n (%) | no | 2382(94.4) | 44(92.9) | 0.202 |
| yes | 142(5.6) | 34(7.1) | ||
| CHD, n (%) | no | 1693(67.1) | 322(67.6) | 0.832 |
| yes | 831(32.9) | 154(32.4) | ||
| MI, n (%) | no | 2363(93.6) | 447(93.9) | 0.839 |
| yes | 161(6.4) | 29(6.1) | ||
| CHF, n (%) | no | 2351(93.1) | 437(91.8) | 0.329 |
| yes | 173(6.9) | 39(8.2) | ||
| Arrhythmias, n (%) | no | 2392(94.8) | 434(91.2) | 0.003 |
| yes | 132(5.2) | 42(8.8) | ||
| Respiratory System Disease, n (%) | no | 2121(84) | 407(85.5) | 0.451 |
| yes | 403(16) | 69(14.5) | ||
| Hematonosis, n (%) | no | 2175(86.2) | 381(80) | 0.001 |
| yes | 349(13.8) | 95(20) | ||
| Rheumatic Immunity, n (%) | no | 2432(96.4) | 465(97.7) | 0.169 |
| yes | 92(3.6) | 11(2.3) | ||
| Pregnant, n (%) | no | 2515(99.6) | 475(99.8) | 0.715 |
| yes | 9(0.4) | 1(0.2) | ||
| Endocrine Disease, n (%) | no | 1773(70.2) | 225(47.3) | <0.0001 |
| yes | 751(29.8) | 251(52.7) | ||
| MEN, n (%) | no | 2430(96.3) | 462(97.1) | 0.426 |
| yes | 94(3.7) | 14(2.9) | ||
| PCOS, n (%) | no | 2521(99.9) | 476(100) | 1 |
| yes | 3(0.1) | 0(0) | ||
| Digestive Carcinoma, n (%) | no | 2385(94.5) | 462(97.1) | 0.022 |
| yes | 139(5.5) | 14(2.9) | ||
| Urologic Neoplasms, n (%) | no | 2496(98.9) | 473(99.4) | 0.462 |
| yes | 28(1.1) | 3(0.6) | ||
| Gynecolgical Tumor, n (%) | no | 2434(96.4) | 467(98.1) | 0.068 |
| yes | 90(3.6) | 9(1.9) | ||
| Breast Tumor, n (%) | no | 2515(99.6) | 475(99.8) | 0.715 |
| yes | 9(0.4) | 1(0.2) | ||
| Lung Tumor, n (%) | no | 2478(98.2) | 467(98.1) | 1 |
| yes | 46(1.8) | 9(1.9) | ||
| Intracranial Tumor, n (%) | no | 2512(99.5) | 472(99.2) | 0.493 |
| yes | 12(0.5) | 4(0.8) | ||
| Other Tumor, n (%) | no | 2312(91.6) | 442(92.9) | 0.366 |
| yes | 212(8.4) | 34(7.1) | ||
| GLU, mmol/L | 8.40±3.93 | 8.69±3.72 | 0.139 | |
| GLU_2H, mmol/L | 14.52±4.58 | 15.81±4.46 | 0.001 | |
| HBA1C, % | 7.70±1.71 | 8.29±1.79 | <0.0001 | |
| GSP, μmol/L | 224.11±79.15 | 237.69±79.13 | 0.005 | |
| TG, mmol/L | 2.03±1.65 | 1.99±1.50 | 0.704 | |
| TC, mmol/L | 4.60±1.38 | 4.69±1.61 | 0.233 | |
| HDL_C, mmol/L | 1.07±0.32 | 1.06±0.30 | 0.762 | |
| LDL_C, mmol/L | 2.84±1.09 | 2.95±1.38 | 0.048 | |
| FBG, g/L | 7.73±37.90 | 8.69±42.44 | 0.621 | |
| UPR_24, g/24h | 1.44±1.50 | 1.08±1.41 | <0.0001 | |
| BU, mmol/L | 7.07±4.97 | 7.37±4.91 | 0.222 | |
| SCR, μmol/L | 106.24±119.97 | 108.29±114.39 | 0.731 | |
| UCR, mmol/24h | 5.49±2.66 | 6.61±12.16 | 0.021 | |
| SUA, μmol/L | 328.86±103.68 | 327.81±93.56 | 0.838 | |
| HB, g/L | 131.77±23.35 | 131.4±21.94 | 0.75 | |
| CP, nmol/L | 2.21±1.53 | 2.17±2.1 | 0.777 | |
| INS, μU/ml | 14.71±39.04 | 11.61±13.54 | 0.213 | |
| PCV | 0.39±0.06 | 0.38±0.06 | 0.493 | |
| PLT, 109/L | 219.18±73.16 | 213.34±63.99 | 0.105 | |
| ESR, mm/h | 27.08±29.49 | 23.3±25.81 | 0.056 | |
| TBILI, μmol/L | 11.02±12.35 | 10.79±7.14 | 0.698 | |
| DBILI, μmol/L | 3.64±9.67 | 2.99±2.16 | 0.147 | |
| TP, g/L | 65.7±7.43 | 64.36±7.51 | <0.0001 | |
| ALB, g/L | 39.53±5.832 | 38.67±6.1 | 0.004 | |
| LDH_L, U/L | 174.29±64.78 | 171.48±54.01 | 0.377 | |
| ALT, U/L | 24.47±29.252 | 21.83±15.86 | 0.055 | |
| AST, U/L | 20.74±24.03 | 18.26±10.06 | 0.028 | |
| GGT, U/L | 43.3±73.3 | 38.844±53.28 | 0.209 | |
| ALP, U/L | 75.72±47.7 | 72.31±30.28 | 0.138 | |
| LP_A, mg/dl | 30.81±43.58 | 36.84±53.03 | 0.661 | |
| PL, mmol/L | 2.46±0.60 | 2.46±0.69 | 0.972 | |
| PT, s | 13.15±1.28 | 13.1±1.19 | 0.413 | |
| PTA | 99.19±18.88 | 98.37±19.07 | 0.387 | |
| APTT, s | 36.74±7.33 | 36.42±5.61 | 0.361 | |
| FIBRIN, g/L | 13.96±25.29 | 16.12±30.71 | 0.833 | |
| ALB_CR, mg/g | 149.66±230.06 | 154.12±260.96 | 0.748 | |
| LPS, U/L | 164.58±425.35 | 157.88±300.38 | 0.808 | |
| CA199, U/L | 30.7±226.85 | 18.21±16.1 | 0.247 | |
| CRP, mg/L | 1.26±2.88 | 0.85±2.15 | 0.008 | |
| TH2, ng/L | 0.45±0.1 | 0.35±0.11 | 0.155 | |
| IBILI, μmol/L | 7.37±4.35 | 7.78±5.59 | 0.072 | |
| GLO, g/L | 26.18±4.95 | 25.68±4.56 | 0.044 | |
Note: Data were expressed as mean ± SD or n (%).
Abbreviations: SBP, Systolic Blood Pressure; DBP, Diastolic Blood Pressure; BMI, Body Mass Index; CHD, Coronary Heart Disease; MI, Myocardial Infarction; CHF, Congestive Heart Failure; AS, Atherosclerosis; FLD, Fatty Liver Disease; CLD, Chronic Liver Disease; PCOS, Polycystic Ovary Syndrome; MEN, Multiple Endocrine Neoplasia; GLU, Glucose; GLU_2H, 2-hour Postprandial Glucose; HBA1C, Hemoglobin A1c; GSP, Glycated Serum Protein; TG, Triglycerides; TC, Total Cholesterol; HDL_C, High-Density Lipoprotein Cholesterol; LDL_C, Low-Density Lipoprotein Cholesterol; FBG, Fibrinogen; UPR_24, 24-hour Urinary Protein; BUN, Blood Urea Nitrogen; BU, Blood Urea; SCR, Serum Creatinine; UCR, Urine Creatinine; SUA, Serum Uric Acid; HB, Hemoglobin; CP, C-Peptide; INS, Insulin; PCV, Packed Cell Volume; PLT, Platelets; ESR, Erythrocyte Sedimentation Rate; TBILI, Total Bilirubin; DBILI, Direct Bilirubin; TP, Total Protein; ALB, Albumin; LDH_L, Lactate Dehydrogenase; ALT, Alanine Aminotransferase; AST, Aspartate Aminotransferase; GGT, Gamma-Glutamyl Transferase; ALP, Alkaline Phosphatase; LP_A, Lipoprotein(a); PL, Phospholipids; PT, Prothrombin Time; PTA, Prothrombin Activity; APTT, Activated Partial Thromboplastin Time; FIBRIN, Fibrinogen; ALB_CR, Albumin/Creatinine Ratio; LPS, Lipase; CA199, Cancer Antigen 19–9; CRP, C-Reactive Protein; TH2, T Helper Cell 2; IBILI, Indirect Bilirubin; GLO, Globulin.
LEADDP patients were predominantly male, older in age, and had higher proportions of hypertension, stroke, carotid artery stenosis, fatty liver, biliary diseases, kidney disease, arrhythmias, hematologic diseases, and other endocrine disorders. They also exhibited poorer glycemic control, with the majority having atherosclerosis in other areas. Additionally, LEADDP patients had higher levels of low-density lipoprotein and urinary creatinine, and lower levels of urinary microalbumin, total protein, serum albumin, aspartate aminotransferase, C-reactive protein, and globulin. Other laboratory indicators showed no statistically significant differences. In the multivariable analysis, all variables that showed a strong association with LEADDP in the univariate analysis (P < 0.1) were included. The results indicated that gender, atherosclerosis, carotid artery stenosis, fatty liver, hematologic diseases, endocrine disorders, and glycosylated serum protein were significantly associated with LEADDP (Table 3).
Table 3. Results of the multivariate analysis.
| P value | OR (95% CI) | |
|---|---|---|
| Sex | 0.018 | 1.422 (1.062, 1.904) |
| AS | <0.001 | 19.911 (12.349, 32.103) |
| Carotid Artery Stenosis | 0.002 | 2.198 (1.345, 3.591) |
| FLD | 0.004 | 1.518 (1.140, 2.023) |
| Hematonosis | <0.001 | 2.094 (1.467, 2.989) |
| Endocrine Disease | <0.001 | 2.155 (1.623, 2.862) |
| GSP | 0.014 | 1.002 (1.000, 1.004) |
Abbreviations: AS, Atherosclerosis; FLD, Fatty Liver Disease; GSP, Glycated Serum Protein.
We constructed five LEADDP risk prediction models using Python, specifically the Logistic regression, decision tree, random forest, k-nearest neighbors, and neural network algorithms. We plotted the ROC curves for each algorithm (Fig 2). Among them, the risk prediction models based on the Logistic regression and MLP algorithms achieved the best performance, with AUCs of 0.73 and 0.72, respectively.
Fig 2. AUC curves of various risk prediction models.
Discussion
In our study, approximately 15.9% of Chinese diabetic patients had LEAD. Male gender, atherosclerosis, carotid artery stenosis, fatty liver, hematologic diseases, other endocrine disorders, and glycated serum protein were found to be independently associated with the prevalence of LEADDP. The risk prediction model constructed using Logistic regression and MLP algorithms achieved the best performance, with moderate discriminative ability.
In our study, approximately 15.9% of Chinese diabetic patients had LEADDP, whereas another study from the southern coastal region of China reported a prevalence of about 4.9% [10]. The reason for this difference is not yet clear but may be related to regional variations. Our study data are from a medical center in northern China, where the climate is colder compared to the south. The diagnosis of LEAD primarily relies on ABI (Ankle-Brachial Index). At low temperatures, the pressure in the distal arteries of the lower limbs significantly decreases, which reduces the ABI value [11]. Therefore, we believe this difference may be due to the influence of different climates in the regions where the studies were conducted. Additionally, there are significant differences in dietary habits, lifestyles, and economic levels among people in different provinces of China, which may, to some extent, affect the development of diabetes and its complications. Existing research data indicate that the prevalence of diabetes varies across different provinces in China [12]. However, there have been no epidemiological reports specifically on LEADDP in China.
The risk of LEAD increases with age, a finding also supported by our study [8]. However, existing research indicates that the number of peripheral arterial disease (PAD) cases is higher in females than in males across all age groups [13]. Since females have lower ABI values than males, the actual prevalence of PAD in females might be higher than current estimates [14, 15]. However, in our study, we found that there were more men than women among diabetic patients with concurrent LEAD. A study on LEAD in the diabetic population also reached similar conclusions [16]. Males have a higher risk of LEADDP compared to females, with male diabetic patients being about 1.4 times more likely to develop LEADDP than female diabetic patients. This may be related to the unique pathophysiological processes in diabetic patients.
Diabetic vascular complications share common features, with their main pathological manifestations being endothelial dysfunction and atherosclerosis [17, 18]. The relationship between diabetic macrovascular complications and LEADDP is evident and confirmed in our study. Additionally, we found that carotid artery stenosis can be considered a risk factor for LEADDP. The risk of LEADDP in patients with carotid artery stenosis is 2.198 times higher than in those without. Because symptoms are more evident, diabetic patients with carotid artery stenosis are more likely to seek medical help. Therefore, emphasizing the predictive value of carotid artery stenosis will aid in the diagnosis and early treatment of LEADDP. Due to the widespread vascular complications of diabetes, cardiovascular disease cannot be ignored [17]. However, in our study, there were no significant differences in the prevalence of complications such as coronary heart disease, myocardial infarction, and chronic heart failure. Our study shows that hypertension is associated with LEADDP, but not all types of hypertension. Consistent with some studies, increased systolic pressure is associated with LEADDP, while differences in diastolic pressure are not statistically significant [19, 20]. This suggests that we should pay attention to the LEAD risk in diabetic patients with increased pulse pressure.
Glycemic control is considered an important measure to prevent the risk of LEADDP. Hyperglycemia promotes oxidative stress, glycoxidation, and systemic inflammation, damaging the endothelial cells of the arterial wall, leading to lipid deposition and the development of atherosclerosis [21]. Some studies have shown a link between glycemic control levels and the incidence of LEAD, indicating that poor glycemic control may lead to the development of LEADDP [10, 22, 23]. Other studies, however, have found little association between blood glucose levels and LEADDP [24, 25]. The reasons for these differences are unclear, possibly involving differences in study populations or data analysis methods. In our study, LEAD patients had poorer blood glucose control compared to those without LEAD. Although fasting blood glucose levels did not show significant differences, the postprandial 2-hour blood glucose levels were significantly elevated. Some studies have suggested that insulin resistance is an important cause of LEADDP and can be harmful even with normal blood glucose levels [26, 27]. Studies have also shown that arterial vascular damage can begin before blood glucose levels increase, with elevated insulin levels potentially being a contributing factor [21]. This may indicate that poor glycemic control and LEADDP are not the cause but the result. This may suggest a deeper connection between glycemic control and the occurrence of LEADDP, warranting further research.
Lipid abnormalities play a crucial role in the development of atherosclerosis. Unlike in the past, where only low-density lipoprotein was emphasized, both low-density lipoprotein and triglyceride abnormalities should now be considered [28]. For LEADDP, a recent study has shown that plasma concentrations of HDL-C, TC/HDL-C, and apolipoproteins are associated with the incidence of LEADDP [29]. Other studies have not found a link between lipid parameters and the incidence of LEADDP [23, 30]. These inconsistencies may be due to lipid-lowering treatments in the study populations [31]. In our study, low-density lipoprotein was significantly elevated in the LEADDP group, but other lipid indicators, particularly triglycerides, showed no statistically significant differences. Elevated low-density lipoprotein levels indicate dyslipidemia. In our study, patients with both LEAD and fatty liver had an increased risk of LEAD, and the difference in BMI was not significant. In some studies, obesity is considered a risk factor for the development of LEAD [32, 33]. However, obesity does not cause LEADDP in all populations [34]. This suggests that obesity may not be a direct risk factor for LEADDP, but rather an indicator of dyslipidemia.
Impaired renal function is not a traditional risk factor for LEAD, but studies have shown that it is associated with an increased risk of atherosclerosis, cardiovascular events, and LEAD [10, 35, 36]. In our study, individuals with LEADDP exhibited a higher prevalence of kidney disease and significantly reduced kidney function compared to those without LEADDP. Univariate analysis showed that concurrent kidney disease, UPR_24, and UCR were associated with LEADDP events, but this association was excluded after adjusting for potential confounding factors.
Certain endocrine diseases other than diabetes can also increase the risk of LEADDP. Hypothyroidism is associated with an increased risk of atherosclerosis and may cause peripheral vascular constriction [37, 38]. Patients with Cushing’s syndrome have elevated cortisol levels, leading to hypertension, hyperglycemia, and lipid abnormalities, increasing the risk of atherosclerosis [39]. In our study, we found that having other endocrine diseases increases the risk of LEADDP.
In this study, hematologic diseases were considered one of the risk factors for LEADDP. Possible mechanisms include coagulation disorders, abnormal platelet counts or function, and increased blood viscosity. Abnormalities in fibrinogen, thrombin formation, and fibrin degradation occur not only in acute thrombotic complications but may also occur in stable forms of LEAD, as research suggests [40]. Platelet abnormalities and increased blood viscosity may lead to the development and exacerbation of LEAD [41]. Currently, no studies have analyzed the correlation between hematologic diseases and LEADDP. This finding opens new avenues for research into LEADDP.
Machine learning can assist clinicians in making diagnoses, thereby reducing their workload, increasing the sensitivity of disease diagnosis, avoiding missed diagnoses, and lowering healthcare costs [42]. The pathogenesis of LEADDP is complex, and current research does not clearly define its risk factors. Machine learning algorithms can learn from existing data and identify relationships between independent and dependent variables. This capability makes them well-suited for constructing risk prediction models for LEADDP [43]. Previous studies have employed machine learning algorithms to develop risk prediction models for LEAD, achieving relatively good predictive performance [44, 45]. However, these studies did not specifically focus on diabetic patients, and thus could not capture the unique characteristics of LEAD occurrence in this population. The application of machine learning algorithms in LEAD risk prediction models specifically targeting diabetic patients remains relatively limited. In our study, we aimed to develop a preliminary machine learning model to predict LEAD in diabetic patients. Our results indicated that Logistic Regression and MLP were the most effective algorithms, exhibiting the highest AUC and superior statistical performance. Generally, the AUC ranges from 0.5 to 1.0, with values between 0.5 and 0.7 indicating low discrimination ability, values between 0.7 and 0.9 indicating moderate discrimination ability, and values above 0.9 indicating high discrimination ability [46]. In our study, the accuracy of the model’s predictive ability was evaluated by calculating the area under the ROC curve. The AUC values of the risk prediction models based on Logistic regression and MLP were 0.73 and 0.72, respectively, indicating that these two models achieved moderate discrimination ability. This model is built using easily collectable objective clinical data, and it can be directly integrated with data from medical systems without the need for secondary collection. This allows for improved LEAD risk prediction without increasing the workload of medical personnel.
This study identified risk factors for LEAD in the Chinese diabetic population, further explored the mechanisms related to LEADDP, and developed a relatively user-friendly risk prediction model using machine learning algorithms. The model can assist specialists in identifying high-risk LEADDP patients. The limitations of this study include the following. First, the data used in this study were obtained from the China National Population Health Science Data Center, and while the data were collected by reliable medical centers, the comorbidities in the dataset were not clearly defined, which may pose limitations for broader application. Second, for the sake of model usability, this study did not include treatment factors, so the potential effects of medications on the outcomes have not been explored. Third, as this is a cross-sectional study, the research design does not allow us to establish causal relationships between the identified risk factors. Finally, the data primarily come from northern China, which may limit its generalizability to other regions.
Conclusion
In conclusion, our study identified several important risk factors for LEAD in the Chinese diabetic population. These findings contribute to a deeper understanding of the pathogenesis of LEADDP. The accumulation of risk factors further increases the risk of disease, highlighting the urgent need for early diagnosis and intervention. The LEADDP risk prediction models constructed using machine learning can assist in early screening, reducing adverse outcomes and long-term risks for patients.
Supporting information
Abbreviations: SBP, Systolic Blood Pressure; DBP, Diastolic Blood Pressure; BMI, Body Mass Index; CHD, Coronary Heart Disease; MI, Myocardial Infarction; CHF, Congestive Heart Failure; AS, Atherosclerosis; FLD, Fatty Liver Disease; CLD, Chronic Liver Disease; PCOS, Polycystic Ovary Syndrome; MEN, Multiple Endocrine Neoplasia; GLU, Glucose; GLU_2H, 2-hour Postprandial Glucose; HBA1C, Hemoglobin A1c; GSP, Glycated Serum Protein; TG, Triglycerides; TC, Total Cholesterol; HDL_C, High-Density Lipoprotein Cholesterol; LDL_C, Low-Density Lipoprotein Cholesterol; FBG, Fibrinogen; UPR_24, 24-hour Urinary Protein; BUN, Blood Urea Nitrogen; BU, Blood Urea; SCR, Serum Creatinine; UCR, Urine Creatinine; SUA, Serum Uric Acid; HB, Hemoglobin; CP, C-Peptide; INS, Insulin; PCV, Packed Cell Volume; PLT, Platelets; ESR, Erythrocyte Sedimentation Rate; TBILI, Total Bilirubin; DBILI, Direct Bilirubin; TP, Total Protein; ALB, Albumin; LDH_L, Lactate Dehydrogenase; ALT, Alanine Aminotransferase; AST, Aspartate Aminotransferase; GGT, Gamma-Glutamyl Transferase; ALP, Alkaline Phosphatase; LP_A, Lipoprotein(a); PL, Phospholipids; PT, Prothrombin Time; PTA, Prothrombin Activity; APTT, Activated Partial Thromboplastin Time; FIBRIN, Fibrinogen; ALB_CR, Albumin/Creatinine Ratio; LPS, Lipase; CA199, Cancer Antigen 19–9; CRP, C-Reactive Protein; TH2, T Helper Cell 2; IBILI, Indirect Bilirubin; GLO, Globulin.
(PDF)
Acknowledgments
We thank the China National Population Health Science Data Center for providing data support. We also appreciate the constructive comments from the editors and each reviewer during the revision of our manuscript.
Data Availability
The data that support the findings of this study are openly available in the Diabetes Complications Warning Dataset at https://www.ncmi.cn/phda/dataDetails.do?id=CSTR:A0006.11.A0005.201905.000282-V1.0.
Funding Statement
This work was supported by the Shandong Provincial Natural Science Foundation (ZR2022MH268). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Fowkes FGR, Aboyans V, Fowkes FJI, McDermott MM, Sampson UKA, Criqui MH. Peripheral artery disease: epidemiology and global perspectives. Nat Rev Cardiol. 2017;14(3):156–170. doi: 10.1038/nrcardio.2016.179 [DOI] [PubMed] [Google Scholar]
- 2.Guariguata L, Whiting DR, Hambleton I, Beagley J, Linnenkamp U, Shaw JE. Global estimates of diabetes prevalence for 2013 and projections for 2035. Diabetes Res Clin Pract. 2014;103(2):137–149. doi: 10.1016/j.diabres.2013.11.002 [DOI] [PubMed] [Google Scholar]
- 3.Jarrett RJ. Type 2 (non-insulin-dependent) diabetes mellitus and coronary heart disease-chicken, egg or neither? Diabetologia. 1984;26(2):99–102. doi: 10.1007/BF00281114 [DOI] [PubMed] [Google Scholar]
- 4.Stern MP. Diabetes and cardiovascular disease. The “common soil” hypothesis. Diabetes. 1995;44(4):369–374. doi: 10.2337/diab.44.4.369 [DOI] [PubMed] [Google Scholar]
- 5.Li J, Li Y, Ma W, Liu Y, Yin X, Xie C, et al. Association of Time in Range levels with Lower Extremity Arterial Disease in patients with type 2 diabetes. Diabetes Metab Syndr. 2020;14(6):2081–2085. doi: 10.1016/j.dsx.2020.09.028 [DOI] [PubMed] [Google Scholar]
- 6.American Diabetes Association. 11. Microvascular Complications and Foot Care: Standards of Medical Care in Diabetes-2021. Diabetes Care. 2021;44(Suppl 1):S151–S167. doi: 10.2337/dc21-S011 [DOI] [PubMed] [Google Scholar]
- 7.Rutherford RB, Baker JD, Ernst C, Johnston KW, Porter JM, Ahn S, et al. Recommended standards for reports dealing with lower extremity ischemia: revised version. J Vasc Surg. 1997;26(3):517–538. doi: 10.1016/s0741-5214(97)70045-4 [DOI] [PubMed] [Google Scholar]
- 8.Nativel M, Potier L, Alexandre L, Baillet-Blanco L, Ducasse E, Velho G, et al. Lower extremity arterial disease in patients with diabetes: a contemporary narrative review. Cardiovasc Diabetol. 2018;17(1):138. doi: 10.1186/s12933-018-0781-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Buso G, Aboyans V, Mazzolai L. Lower extremity artery disease in patients with type 2 diabetes. Eur J Prev Cardiol. 2019;26(2_suppl):114–124. doi: 10.1177/2047487319880044 [DOI] [PubMed] [Google Scholar]
- 10.Chen T, Xiao S, Chen Z, Yang Y, Yang B, Liu N. Risk factors for peripheral artery disease and diabetic peripheral neuropathy among patients with type 2 diabetes. Diabetes Res Clin Pract. 2024;207:111079. doi: 10.1016/j.diabres.2023.111079 [DOI] [PubMed] [Google Scholar]
- 11.Carter SA. The effect of cooling on toe systolic pressures in subjects with and without Raynaud’s syndrome in the lower extremities. Clin Physiol. 1991;11(3):253–261. doi: 10.1111/j.1475-097x.1991.tb00456.x [DOI] [PubMed] [Google Scholar]
- 12.Zuo H, Shi Z, Hussain A. Prevalence, trends and risk factors for the diabetes epidemic in China: a systematic review and meta-analysis. Diabetes Res Clin Pract. 2014. Apr;104(1):63–72. doi: 10.1016/j.diabres.2014.01.002 [DOI] [PubMed] [Google Scholar]
- 13.Roth GA, Mensah GA, Johnson CO, Addolorato G, Ammirati E, Baddour LM, et al. Global Burden of Cardiovascular Diseases and Risk Factors, 1990–2019: Update From the GBD 2019 Study. J Am Coll Cardiol. 2020;76(25):2982–3021. doi: 10.1016/j.jacc.2020.11.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Aboyans V, Criqui MH, McClelland RL, Allison MA, McDermott MM, Goff DC Jr, et al. Intrinsic contribution of gender and ethnicity to normal ankle-brachial index values: the Multi-Ethnic Study of Atherosclerosis (MESA). J Vasc Surg. 2007;45(2):319–327. doi: 10.1016/j.jvs.2006.10.032 [DOI] [PubMed] [Google Scholar]
- 15.Collins TC, Suarez-Almazor M, Bush RL, Petersen NJ. Gender and peripheral arterial disease. J Am Board Fam Med JABFM. 2006;19(2):132–140. doi: 10.3122/jabfm.19.2.132 [DOI] [PubMed] [Google Scholar]
- 16.Mohammedi K, Woodward M, Hirakawa Y, Zoungas S, Williams B, Lisheng L, et al. Microvascular and Macrovascular Disease and Risk for Major Peripheral Arterial Disease in Patients With Type 2 Diabetes. Diabetes Care. 2016. Oct;39(10):1796–803. doi: 10.2337/dc16-0588 Epub 2016 Jul 25. [DOI] [PubMed] [Google Scholar]
- 17.Li Y, Liu Y, Liu S, Gao M, Wang W, Chen K, et al. Diabetic vascular diseases: molecular mechanisms and therapeutic strategies. Signal Transduct Target Ther. 2023;8(1):152. doi: 10.1038/s41392-023-01400-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Zhou X, Yu L, Zhao Y, Ge J. Panvascular medicine: an emerging discipline focusing on atherosclerotic diseases. Eur Heart J. 2022;43(43):4528–4531. doi: 10.1093/eurheartj/ehac448 [DOI] [PubMed] [Google Scholar]
- 19.Meijer WT, Grobbee DE, Hunink MG, Hofman A, Hoes AW. Determinants of peripheral arterial disease in the elderly: the Rotterdam study. Arch Intern Med. 2000;160(19):2934–2938. doi: 10.1001/archinte.160.19.2934 [DOI] [PubMed] [Google Scholar]
- 20.Newman AB, Siscovick DS, Manolio TA, Polak J, Fried LP, Borhani NO, et al. Ankle-arm index as a marker of atherosclerosis in the Cardiovascular Health Study. Cardiovascular Heart Study (CHS) Collaborative Research Group. Circulation. 1993;88(3):837–845. doi: 10.1161/01.cir.88.3.837 [DOI] [PubMed] [Google Scholar]
- 21.Haas AV, McDonnell ME. Pathogenesis of Cardiovascular Disease in Diabetes. Endocrinol Metab Clin North Am. 2018. Mar;47(1):51–63. doi: 10.1016/j.ecl.2017.10.010 . [DOI] [PubMed] [Google Scholar]
- 22.Adler AI, Stevens RJ, Neil A, Stratton IM, Boulton AJM, Holman RR. UKPDS 59: hyperglycemia and other potentially modifiable risk factors for peripheral vascular disease in type 2 diabetes. Diabetes Care. 2002;25(5):894–899. doi: 10.2337/diacare.25.5.894 [DOI] [PubMed] [Google Scholar]
- 23.Althouse AD, Abbott JD, Forker AD, Bertolet M, Barinas-Mitchell E, Thurston RC, et al. Risk factors for incident peripheral arterial disease in type 2 diabetes: results from the Bypass Angioplasty Revascularization Investigation in type 2 Diabetes (BARI 2D) Trial. Diabetes Care. 2014;37(5):1346–1352. doi: 10.2337/dc13-2303 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Norman PE, Davis WA, Bruce DG, Davis TME. Peripheral arterial disease and risk of cardiac death in type 2 diabetes: the Fremantle Diabetes Study. Diabetes Care. 2006;29(3):575–580. doi: 10.2337/diacare.29.03.06.dc05-1567 [DOI] [PubMed] [Google Scholar]
- 25.Mohammedi K, Woodward M, Hirakawa Y, Zoungas S, Williams B, Lisheng L, et al. Microvascular and Macrovascular Disease and Risk for Major Peripheral Arterial Disease in Patients With Type 2 Diabetes. Diabetes Care. 2016;39(10):1796–1803. doi: 10.2337/dc16-0588 [DOI] [PubMed] [Google Scholar]
- 26.Yaribeygi H, Farrokhi FR, Butler AE, Sahebkar A. Insulin resistance: Review of the underlying molecular mechanisms. J Cell Physiol. 2019;234(6):8152–8161. doi: 10.1002/jcp.27603 [DOI] [PubMed] [Google Scholar]
- 27.Welsh GI, Hale LJ, Eremina V, Jeansson M, Maezawa Y, Lennon R, et al. Insulin signaling to the glomerular podocyte is critical for normal kidney function. Cell Metab. 2010;12(4):329–340. doi: 10.1016/j.cmet.2010.08.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Libby P. The changing landscape of atherosclerosis. Nature. 2021;592(7855):524–533. doi: 10.1038/s41586-021-03392-8 [DOI] [PubMed] [Google Scholar]
- 29.Bertrand C, Saulnier PJ, Potier L, Croyal M, Blanchard V, Gand E, et al. Plasma concentrations of lipoproteins and risk of lower-limb peripheral artery disease in people with type 2 diabetes: the SURDIAGENE study. Diabetologia. 2021;64(3):668–680. doi: 10.1007/s00125-020-05326-x [DOI] [PubMed] [Google Scholar]
- 30.Eshcol J, Jebarani S, Anjana RM, Mohan V, Pradeepa R. Prevalence, incidence and progression of peripheral arterial disease in Asian Indian type 2 diabetic patients. J Diabetes Complications. 2014;28(5):627–631. doi: 10.1016/j.jdiacomp.2014.04.013 [DOI] [PubMed] [Google Scholar]
- 31.Yeo TC, Chan YH, Low LP, Venketasubramanian N, Lim SC, Tay JC, et al. Risk factor profile and treatment patterns of patients with atherothrombosis in Singapore: insight from the REACH Registry. Ann Acad Med Singapore. 2008;37(5):365–371. [PubMed] [Google Scholar]
- 32.Criqui MH, Matsushita K, Aboyans V, Hess CN, Hicks CW, Kwan TW, et al. Lower Extremity Peripheral Artery Disease: Contemporary Epidemiology, Management Gaps, and Future Directions: A Scientific Statement From the American Heart Association. Circulation. 2021;144(9):e171–e191. doi: 10.1161/CIR.0000000000001005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Gerhard-Herman MD, Gornik HL, Barrett C, Barshes NR, Corriere MA, Drachman DE, et al. 2016 AHA/ACC Guideline on the Management of Patients With Lower Extremity Peripheral Artery Disease: Executive Summary: A Report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines. Circulation. 2017;135(12):e686–e725. doi: 10.1161/CIR.0000000000000470 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Heffron SP, Dwivedi A, Rockman CB, Xia Y, Guo Y, Zhong J, et al. Body mass index and peripheral artery disease. Atherosclerosis. 2020;292:31–36. doi: 10.1016/j.atherosclerosis.2019.10.017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Mann JFE, Gerstein HC, Pogue J, Bosch J, Yusuf S, for the HOPE Investigators. Renal Insufficiency as a Predictor of Cardiovascular Outcomes and the Impact of Ramipril: The HOPE Randomized Trial. Ann Intern Med. 2001;134(8):629–636. doi: 10.7326/0003-4819-134-8-200104170-00007 [DOI] [PubMed] [Google Scholar]
- 36.Checheriţă IA, Smarandache D, David C, Ciocâlteu A, Ion DA, Lascăr I. Vascular calcifications in chronic kidney disease—clinical management. Romanian J Morphol Embryol Rev Roum Morphol Embryol. 2012;53(1):7–13. [PubMed] [Google Scholar]
- 37.Stabouli S, Papakatsika S, Kotsis V. Hypothyroidism and hypertension. Expert Rev Cardiovasc Ther. 2010;8(11):1559–1565. doi: 10.1586/erc.10.141 [DOI] [PubMed] [Google Scholar]
- 38.Biondi B, Klein I. Hypothyroidism as a risk factor for cardiovascular disease. Endocrine. 2004;24(1):1–13. doi: 10.1385/ENDO:24:1:001 [DOI] [PubMed] [Google Scholar]
- 39.Ferriere A, Tabarin A. Cushing’s syndrome: Treatment and new therapeutic approaches. Best Pract Res Clin Endocrinol Metab. 2020;34(2):101381. doi: 10.1016/j.beem.2020.101381 [DOI] [PubMed] [Google Scholar]
- 40.Lassila R. Role and management of coagulation disorders in peripheral arterial disease. Scand J Surg SJS Off Organ Finn Surg Soc Scand Surg Soc. 2012;101(2):94–99. doi: 10.1177/145749691210100204 [DOI] [PubMed] [Google Scholar]
- 41.Salhiyyah K, Forster R, Senanayake E, Abdel-Hadi M, Booth A, Michaels JA. Pentoxifylline for intermittent claudication. Cochrane Database Syst Rev. 2015;9(9):CD005262. doi: 10.1002/14651858.CD005262.pub3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Hampe N, Wolterink JM, van Velzen SGM, Leiner T, Išgum I. Machine Learning for Assessment of Coronary Artery Disease in Cardiac CT: A Survey. Front Cardiovasc Med. 2019;6. doi: 10.3389/fcvm.2019.00172 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Santos MS, Abreu PH, García-Laencina PJ, Simão A, Carvalho A. A new cluster-based oversampling method for improving survival prediction of hepatocellular carcinoma patients. J Biomed Inform. 2015;58:49–59. doi: 10.1016/j.jbi.2015.09.012 [DOI] [PubMed] [Google Scholar]
- 44.Wang X, Yang Y, Zhang J, Zang S. Development and validation of a prediction model for the prolonged length of stay in Chinese patients with lower extremity atherosclerotic disease: a retrospective study. BMJ Open. 2023. Feb 9;13(2):e069437. doi: 10.1136/bmjopen-2022-069437 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Zhang L, Ma Y, Li Q, Long Z, Zhang J, Zhang Z, et al. Construction of a novel lower-extremity peripheral artery disease subtype prediction model using unsupervised machine learning and neutrophil-related biomarkers. Heliyon. 2024. Jan 6;10(2):e24189. doi: 10.1016/j.heliyon.2024.e24189 ; PMCID: PMC10827514 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Pearce J, Ferrier S. Evaluating the predictive performance of habitat models developed using logistic regression. Ecol Model. 2000;133(3):225–245. doi: 10.1016/S0304-3800(00)00322-7 [DOI] [Google Scholar]


