Skip to main content
Digital Health logoLink to Digital Health
. 2023 Jul 28;9:20552076231191055. doi: 10.1177/20552076231191055

The deep learning algorithm estimates chest radiograph-based sex and age as independent risk factors for future cardiovascular outcomes

Hao-Chun Liao 1, Chin Lin 2,3, Chih-Hung Wang 4,5, Wen-Hui Fang 2,6,
PMCID: PMC10388631  PMID: 37529539

Abstract

Objectives

Chest X-rays (CXRs) convey much illegible physiological information that deep learning model (DLM) has been reported interpreting successfully. Since the electrocardiogram age established by DLM was revealed as a heart biological marker, we hypothesize that CXR age has similar potential to describe the heart and lung states. Therefore, we developed a DLM to predict sex and age through CXR and analyzed its relation with future cardiovascular diseases (CVD).

Methods

A total of 90,396 CXRs aged 20 to 90 were collected and separated into a development set with 53,102 CXRs and demographic information pairs, a tuning set with 7073 pairs, an internal validation set with 17,364 pairs, and an external validation set with 12,857 pairs. The study trained DLM with development set for estimating age and sex and compared them to actual information.

Results

The mean absolute errors of predicted age were 4.803 and 4.313 years in the internal and external validation sets, respectively. The area under the curve of sex analysis was 0.9993 and 0.9988 in the internal and external validation sets, respectively. Patients whose CXR age was 5 years older than chronologic age lead to higher risk of all-cause mortality (hazard ratio (HR): 2.42, 95% confidence interval (CI): 2.00–2.92), cardiovascular (CV)-cause mortality (HR: 7.57, 95% CI: 4.55–12.60), new-onset heart failure (HR: 2.07, 95% CI: 1.56–2.76), new-onset chronic kidney disease (HR: 1.73, 95% CI: 1.46–2.05), new-onset acute myocardial infarction (HR: 1.80, 95% CI: 1.12–2.92), new-onset stroke (HR: 1.45, 95% CI: 1.10–1.90), new-onset coronary artery disease (HR: 1.26, 95% CI: 1.04–1.52), and new-onset atrial fibrillation (HR: 1.43, 95% CI: 1.01–2.02).

Conclusions

Using DLM to predict CXR age provided additional information for future CVDs. Older CXR age is an accessible risk classification tool for clinician use.

Keywords: Artificial intelligence, chest x-ray, deep learning, cardiovascular disease, heart disease risk factors

Key points

  1. Existed cardiovascular disease (CVD) risk stratification tools lack of evidence and data acquisition requires invasive procedures.

  2. Older deep learning model (DLM)-predicted chest X-ray age shows higher risk of mortality and CVD incidence.

  3. Older predicted age is an accessible risk classification tool to manage CVD.

Introduction

Cardiovascular disease (CVD) management is a global issue. International guidelines suggest the necessity of an ongoing basis over the age of 40 to make an assessment of CVD risk over a 10-year period, 1 which is calculated by age, sex, blood pressure, lipid profile, and smoking history to score the 10-year risk of fatal CVD.2,3 Several management and drug treatments are recommended as soon as possible in patients at high risk for CVD. 4 Although the high-risk individuals showed more benefits from risk factor interventions, most deaths in the community came from people with lower risk levels because they were more numerous than high-risk individuals, 2 which means that more potential patients can be screened out with risk stratification tools in different assessment directions. However, existing global cardiovascular risk assessments are generally not accurate enough and lack high-quality evidence of direct reductions in CVD morbidity or mortality.58 In addition, <30% of them are available in the electronic medical records database, usually due to a lack of data on cholesterol. 9 Therefore, a risk stratification tool with different and approachable methods is necessary.

Chest X-ray (CXR) is low-cost, approachable and the most popular examinations for initial assessment. CXR shows overall chest images, including lung and cardiac vascular systems. 10 The utility of CXR examination (CXR) in clinical now even at annual health examinations, 11 but even outpatient department and pre-operation study, 12 , 13 especially chest and cardiology surgery. A deep learning model (DLM) also showed enough ability on CXR to screen diseases such as pneumothorax, 14 opacity, nodules or masses, fractures or even COVID-19, 15 and CVD such as heart failure, 16 aortic dissection, 17 and left ventricular dysfunction. 18 These methods show successful application; however, they are often used to explore the classification of disease rather than the overall overview of the body. 19

Recently, physical age defined by DLM was reported as a cardiovascular risk factor, demonstrating the ability of electrocardiogram based age to predict CV-caused mortality and new-onset CVDs. 20 Compared with existing methods, it does not need to combine several kinds of information, so it is more suitable as a risk stratification tool. The CXR can directly show the lung and cardiac vascular systems of one person. 21 As age increases, senescence causes structural and functional changes in the pulmonary circulation, which affect pulmonary pressure, pulmonary hemodynamics, and gas exchange. 22 Hemodynamic changes in pulmonary arterial pressure and pulmonary wedge pressure are found after 45 years.22,23 In addition, the reduction of the alveolar-capillary interface and sclerosis of the pulmonary vasculature contribute to the reduction of pulmonary capillary blood volume with age. 24 Age-related pulmonary vascular remodeling, with increased muscular content of the pulmonary arteries and increases in arterial and venous wall thickness, demonstrates that ageing is associated with increased stiffness of the pulmonary vessels.22,25,26 Although these changes are indistinguishable to clinicians, information in CXR can be obtained by observing the changes in size, diameter, calcification, and other changes of large vessels.27,28 Several cardiovascular abnormalities such as cardiomyopathy, pericardial disease, valvular abnormalities, congenital abnormalities have their own unique characteristics on CXR. 29 Even indistinguishable vessels can be used to identified diseases such as pneumothorax and vascular infiltration by density and distribution. 30 Therefore, CXR, which can observe the above phenomenon, shows its potential ability to predict the above ageing vascular changes. In a previous study, CXR-predicted age was also reported and used for survival, 31 and its visualization also showed cardiac related. 32 We therefore tried to explore the extensive application of CXR to predict age through CV ageing change.

According to the above statement, CXR can record cardiovascular change, and there is precedent for CXR use in DLM. Due to the success of DLM defined-physical age, residuals between chronological age, and CXR age may imply information about underlying excessive ageing cardiovascular factors, which may help clinician improve the accuracy of CVD prediction. We established a retrospective cohort study and developed a DLM to predict chronological age to test the hypothesis. We also compared the correlations between chronological age, CXR age, and measured cardiovascular factors. Finally, we used CXR age to predict future CVD mortality and incidence to verify the usefulness of residuals between chronological age and CXR age.

Method

Data source and population

The database used in this research is from Tri-Service General Hospital, Taipei, Taiwan and was ethically approved by its institutional review board (IRB NO. C202105150). The CXR that was used was selected using the following specifications: (1) data from January 1, 2010 to December 31, 2020; (2) excluded people younger than 20 years old and older than 90 years old; (3) came from outpatient department and health examination center to limit the bias affected by acute stage change (Supplemental Figure 1). Each patient had only a single CXR with randomized choice of a time point in life, which may be representative. 33

Databases were divided into development, tuning, internal validation, and external validation sets, and their characteristic and laboratory results were analyzed and are shown in Table 1. The process of database separation is shown in Figure 1. Cases were followed up from the date of the CXR exam to cardiovascular events or February 28, 2021, based on which came first. Other details were presented in Supplemental material.

Table 1.

Patient characteristics and laboratory results in the development, tuning, internal validation, and external validation sets.

External validation set (n = 12857) Internal validation set (n = 17364) Development set (n = 53102) Tuning set (n = 7073) p Value
Gender (male) 6187(48.1%) 10088(58.1%) 26051(49.1%) 3266(46.2%) <0.001
Age (years) 63.7 ± 15.4 52.8 ± 16.7 54.8 ± 16.7 60.4 ± 15.7 <0.001
BMI (kg/m2) 24.5 ± 4.0 24.7 ± 4.0 24.2 ± 4.1 24.2 ± 4.2 <0.001
SBP (mmHg) 130.2 ± 20.3 126.3 ± 18.9 126.0 ± 20.4 128.4 ± 23.7 <0.001
DBP (mmHg) 76.9 ± 13.3 76.8 ± 12.7 76.3 ± 12.7 76.6 ± 13.1 <0.001
Disease history
 AMI 250(1.9%) 278(1.6%) 439(0.8%) 223(3.2%) <0.001
 CAD 3854(30.0%) 3257(18.8%) 6458(12.2%) 1775(25.1%) <0.001
 STK 2033(15.8%) 1411(8.1%) 3029(5.7%) 830(11.7%) <0.001
 HF 1566(12.2%) 1349(7.8%) 2083(3.9%) 828(11.7%) <0.001
 AF 831(6.5%) 644(3.7%) 967(1.8%) 428(6.1%) <0.001
 DM 4264(33.2%) 3117(18.0%) 8788(16.5%) 1892(26.7%) <0.001
 HTN 7053(54.9%) 5769(33.2%) 14190(26.7%) 3137(44.4%) <0.001
 CKD 3593(27.9%) 2530(14.6%) 7346(13.8%) 1921(27.2%) <0.001
 HLP 7136(55.5%) 7536(43.4%) 16936(31.9%) 2958(41.8%) <0.001
 COPD 4357(33.9%) 3722(21.4%) 8803(16.6%) 1668(23.6%) <0.001
Laboratory test
 GLU (mg/dL) 117.3 ± 51.1 107.0 ± 38.3 107.6 ± 37.3 114.5 ± 43.7 <0.001
 HbA1c (%) 6.2 ± 1.2 5.9 ± 1.0 6.0 ± 1.1 6.2 ± 1.2 <0.001
 TG (mg/dL) 130.6 ± 82.2 121.9 ± 75.7 122.7 ± 79.3 125.1 ± 75.5 <0.001
 TC (mg/dL) 176.9 ± 39.8 181.4 ± 38.2 181.7 ± 39.6 178.1 ± 40.7 <0.001
 LDL (mg/dL) 105.0 ± 33.1 110.0 ± 32.6 109.4 ± 33.3 106.1 ± 34.3 <0.001
 HDL (mg/dL) 50.7 ± 14.5 52.2 ± 14.2 52.7 ± 14.6 51.7 ± 14.5 <0.001
 eGFR (mL/min) 78.1 ± 27.0 89.7 ± 25.3 89.7 ± 24.4 80.3 ± 28.5 <0.001
 BUN (mg/dL) 19.8 ± 13.9 16.8 ± 11.2 16.4 ± 10.0 19.4 ± 14.3 <0.001
 AST (U/L) 24.1 ± 22.1 23.2 ± 26.3 22.9 ± 21.0 23.9 ± 26.6 <0.001
 ALT (U/L) 22.9 ± 26.9 24.0 ± 26.1 22.7 ± 24.3 22.5 ± 28.4 <0.001
 Alb (g/dL) 4.1 ± 0.6 4.3 ± 0.5 4.2 ± 0.5 4.1 ± 0.6 <0.001
 Hb (mg/dL) 12.8 ± 2.1 13.6 ± 2.0 13.3 ± 2.0 12.8 ± 2.1 <0.001
 WBC (103/μL) 6.7 ± 2.7 6.4 ± 2.3 6.4 ± 2.6 6.6 ± 2.7 <0.001
 PLT (103/μL) 225.9 ± 79.3 234.5 ± 70.7 238.4 ± 75.9 231.2 ± 81.7 <0.001
 CRP (mg/L) 1.7 ± 3.3 1.2 ± 2.6 1.4 ± 2.9 1.7 ± 3.3 <0.001
 Na (mEq/L) 138.6 ± 3.6 139.2 ± 3.0 139.1 ± 3.1 138.7 ± 3.5 <0.001
 K (mEq/L) 4.1 ± 0.5 4.1 ± 0.4 4.1 ± 0.4 4.1 ± 0.5 <0.001
 Cl (mEq/L) 103.3 ± 4.0 103.3 ± 3.4 103.3 ± 3.5 103.3 ± 4.0 0.399
 Mg (mg/dL) 2.1 ± 0.3 2.2 ± 0.2 2.1 ± 0.2 2.1 ± 0.3 <0.001
 Ca (mg/dL) 9.3 ± 0.6 9.3 ± 0.5 9.3 ± 0.5 9.3 ± 0.6 <0.001

All the p values of the characteristics were <0.001, except chloride. The mean age ranged from 52.8 to 63.7 years.

AF: atrial fibrillation; Alb: albumin; ALT: alanine aminotransferase; AMI: acute myocardial infarction; AST: aspartate aminotransferase; BMI: body mass index; BUN: blood urea nitrogen; Ca: calcium; CAD: coronary artery disease; CKD: chronic kidney disease; Cl: chloride; COPD: chronic obstructive pulmonary disease; CRP, C-reactive protein; DBP: diastolic blood pressure; DM: diabetes mellitus; eGFR: estimated glomerular filtration rate; GLU: glucose; Hb: hemoglobin; HbA1c: glycated hemoglobin; HDL: high-density lipoprotein cholesterol; HF: heart failure; HLP: hyperlipidemia; HTN: hypertension; K: potassium; LDL: low-density lipoprotein cholesterol; Mg: magnesium; Na: sodium; PLT: platelet; SBP: systolic blood pressure; STK: stroke, TC: total cholesterol; TG: triglyceride; WBC: white blood cell count.

Figure 1.

Figure 1.

Development, tuning, internal validation, and external validation set generation. Schematic representation of the dataset creation and analysis strategy, which was devised to assure a robust and reliable dataset for training, validating, and testing of the network. Once a patient's data were placed in one of the datasets, that individual's data were used only in that set, avoiding “cross-contamination” among the development, tuning, and validation sets. External validation sets were collected only for patients who visited the Jingjhou branch, while others were not. The details of the flow chart and how each of the datasets was used are described in the Methods section.

Observation variables

Our main observational variables included disease and laboratory characteristics. The definitions of the variables were described in Supplemental material.

Regarding all-cause mortality, the status of whether the patient was dead or alive was collected from electronic medical records and updated by each hospital, and survival time was calculated by the time interval from death to the date of CXR. Furthermore, to limit the bias of incomplete records, the last medical record for the patient's visit was used to correct their data. For CV-cause mortality, patients whose cause of death was related to arrhythmia, acute coronary syndrome, stroke, and heart failure (HF) were included.

DLM and statistical analysis

CXR data preprocessing and DLM details were showed in Supplemental material. Several statistical tools used were also presented in Supplemental material. The above analyses were performed on the software environment R version 3.4.4. The significance level was set to p < 0.05.

Results

Performance of the DLM to predict age and sex

The overall correlation and the explained variance (R-square), which are presented in Figure 2, were used to calculate the mean absolute error on predicting due to its property of continuity. In the internal validation set, the mean absolute error of actual age and CXR age was 4.803 years, and the explained variance (R-square) by CXR age on chronological age in the validation set was 84.5% (r = 0.919), as shown in Figure 2A. In the external validation set, the mean absolute error of actual age and CXR age was 4.313 years, and the explained variance (R-square) by CXR age on chronological age in the validation set was 89.1% (r = 0.944), as shown in Figure 2D. The confusion matrixes in Figure 2B and 2E visualize the distribution.

Figure 2.

Figure 2.

The association between chronological age and CXR age and the sensitivity and specificity of sexual prediction in the internal and external validation sets. (A) Scatter plots of chronologic age and CXR age in the internal validation set. Red points represent the highest density, followed by yellow, green light blue, and dark blue. Perfect model performance would fall only along the diagonal line. (B) The confusion matrix in the internal validation set categorized by 15 years old. The squared weighted kappa value was 0.76 in this analysis. (C) AUC curve of sexual prediction in the internal validation set. (D) Scatter plots of chronologic age and CXR age in the external validation set. Red points represent the highest density, followed by yellow, green light blue, and dark blue. Perfect model performance would fall only along the diagonal line. (E) The confusion matrix in the external validation set categorized by 15 years old. The squared weighted kappa value was 0.76 in this analysis. (F) AUC curve of sexual prediction in the external validation set. AUC: area under curve; CXR: chest X-ray.

We also used CXR to predict sex, with the internal validation set showing sensitivity of 98.9%, specificity of 98.7%, and area under curve (AUC) of 0.9993 on Figure 2C, and the external validation set showing sensitivity of 98.6%, specificity of 98.3%, and AUC of 0.9988 on Figure 2F.

Analysis of the characteristics and risk contributions in patients with unequal actual and predicted ages

Patients whose CXR age was more than 5 years old than the actual age (higher residual) had a higher prevalence of acute myocardial infarction, stroke, coronary artery disease, heart failure, atrial fibrillation, diabetes mellitus (DM), hypertension (HTN), and chronic kidney disease (CKD) but not chronic obstructive pulmonary disease (COPD). They also had worsened performance on laboratory data, including high-density lipoprotein (HDL), estimated glomerular filtration rate, blood urea nitrogen, aspartate aminotransferase, albumin, hemoglobin, white blood cells and platelets, as shown in Figure 3. In comparison, patients whose CXR ages were less than 5 years old than the actual age (lower residual) showed a better performance regardless of disease prevalence or laboratory data. Risk effect analysis of patient characteristics on new-onset cardiovascular-related disease was conducted by the Cox proportional hazard model, 34 adjusted by chronologic age in Figure 3. If the patient has the diseases listed above, the probability of CV mortality will be relatively high, and not to our surprise, the probability of other cardiovascular-related diseases in the future, except for a few outcomes, was roughly statistically significant, with a similar trend presented in the laboratory data.

Figure 3.

Figure 3.

Analysis of the characteristics and risk contributions in patients with inconsistent chronological age and CXR age in the validation set. We selected patient characteristics for risk analysis based on the outcome of interest. Bar plots represent the mean or proportion where appropriate and corresponding 95% conference intervals, which are adjusted by gender and chronological age via linear or logistic regression, based on patient characteristics in patients with lower to higher residuals. (CXR age—chronological age) Significant tests are based on the trend test (*: p for trend < 0.05; **: p for trend < 0.01; ***: p for trend < 0.001), and the effects were the difference per group. Risk analysis plots showed the hazard ratios, calculated by the change in values among each residual group. The colors red, gray, and blue represent positive, nonsignificant, and negative impacts on the corresponding outcomes, respectively. CXR: chest X-ray.

Analysis of the long-term outcomes in patients with unequal actual and predicted ages

We analyzed the long-term disease incidence in Figure 4. As above, we separated the CXR age >5, between 5 and −5, and <5 groups, presenting the relationship between residual age and disease. Patients who had high residuals showed statistically significant long-term incidence rates for all-cause mortality, CV-cause mortality, new-onset HF, new-onset CKD, new-onset acute myocardial infarction (AMI), new-onset stroke (STK), new-onset coronary artery disease (CAD), and new-onset atrial fibrillation (AF) in the internal validation set, and the external validation set also showed the same tendency for all-cause mortality, new-onset HF, new-onset CKD, and new-onset AF. Except for the CAD group in the external validation, the other groups still showed a varied tendency between the high residual and the incidence rate. In contrast, the results presented by the low-residual group were roughly the opposite of those of the high-residual group.

Figure 4.

Figure 4.

The comparison of positive, near zero, and negative inconsistent chronological age and CXR age for long-term outcomes of interest in each validation set. These line charts were compared with three lines, red, blue, and green, representing CXR ages >5, between 5 and −5, and <5, respectively, to show the difference in long-term incidence and adverse events in each group. The number of patients in the risk plot is shown under each line chart for assistance. (A) Analysis of the internal validation set. (B) Analysis of the external validation set. CXR: chest X-ray.

The continuous association between unequal age and outcome is presented in Figure 5. Except for the DM, AMI, and CAD groups in the external validation set, roughly speaking, the patients who had high-residual age tended to have high all-mortality, CV-cause mortality and CVD, and those who had low-residual age tended to have low all-mortality, CV-cause mortality, and CVD.

Figure 5.

Figure 5.

The comparison of higher and lower inconsistent chronological age and CXR age for outcomes of interest with hazard ratio in each validation set. Continuous association of the difference between chronological age and ECG age on each outcome. The solid line and dashed line are the point estimation and corresponding 95% conference interval, respectively. The solid lines red and blue represent the internal validation set and external validation set, respectively. For example, the figure in the upper left corner shows that whether it's an internal or external validation set, as the difference between chronological age and ECG age increases, the hazard ration of the overall mortality rate tends to increase. CXR: chest X-ray; ECG: electrocardiogram.

Discussion

In this retrospective cohort study, we established a CXR-based DLM to detect chronological age and tried to use it to predict mortality and some CVDs. In our study, the mean absolute error between CXR age and chronological age was <5 years, and these results suggest that the residuals themselves cause acceptable excessive random errors, proving the ability of predicting age. In patients with poor physical conditions and comorbidities, the CXR age was higher than their actual age. We further used CXR age to predict future cardiovascular-related diseases and survival age and found that patients with higher-residual age had a significantly higher incidence of future complications, which shows that the DLM can help in stratifying patients with high risk.

The area under the curve of sex was more than 0.99, so there was no doubt about the ability of the DLM to predict sex. Although it is of little clinical use, because of its high accuracy, it has a certain degree of applicability in forensic medicine, 35 identification of unidentified acute trauma patients, and even filling in missing data in large public datasets.

According to previous studies and our study, the DLM has the ability to predict the age of humans in normal body status via CXR.31,32 If a person has CVD comorbidities or worse laboratory data, the residual between CXR age and chronological age presents corresponding changes in our study. Previous studies used gradient-weighted class-activation maps to visualize areas of DLM analysis. One found that the DLM focused on the mediastinum, the cardiac silhouette, and the aortic knob, 31 and another found that the cervical spine, thoracic spines, first ribs, aortic arch, heart region, rib cage, and soft tissue of the thoracic wall and flank were highlighted, 32 hinting at the relationship between heart appearance and CXR age. Furthermore, we analyzed the prevalence of COPD and its relationship with CXR age and showed no consistent association, which may imply that CXR age is not related to lung fibrosis or lung parenchyma, corresponding with a previous study. 31 Theoretically, lung cancer is the lung disease that mainly affects mortality. The mean doubling time of lung cancer was 166.3 days, 36 which only had an increasing prevalence rate with age but not a gradual progression with age. In addition, pulmonary diseases related to CVD, such as acute pulmonary edema, were excluded from the case and did not affect the judgment. In contrast, CVD tends to progress gradually with age, making CVD and age development strongly associated. Furthermore, we also found a significant correlation between CXR age and CVD by analyzing the terms on the CXR report (Supplemental P.6). Overall, we can conclude from the above that CXR age is related to heart status, and we proposed a better way to evaluate cardiovascular status.

High-residual CXR age shows a tendency toward high CVD prevalence and poor laboratory data, and only two parameters, single CXR and age, were used for CVD risk stratification. Ageing affects cardiovascular systems in a multifaceted way. These include tissue hypoperfusion, vascular degeneration, remodeling, lumen enlargement, wall thickening, 37 left ventricular hypertrophy and fibroblast proliferation, resulting in decreased cardiac output and increased fibrotic tissue. 38 In addition, increased oxidative stress, inflammation, apoptosis, and overall myocardial deterioration and degeneration are associated with structural damage, such as cardiac hypertrophy and the development of fibrosis in aged hearts, which usually lead to eventual heart dysfunction.39,40 It affects the left ventricle asymmetrically, mainly the interventricular septum, and leads to a redistribution of cardiac muscle. 41 These pathological changes correlate with the visualization zone in previous studies and may be detected by DLM and establish the features.31,32 Age is another parameter that is an independent risk factor for CVD and is widely used in several international risk stratification tools. 42 We used it as a benchmark of ageing to distinguish between pathological and physiological changes. In comparison, the common CVD risk stratification tool Framingham Heart Score consists of age, sex, smoking history, total cholesterol, HDL cholesterol and systolic blood pressure, 43 and some parameters, such as family history or social deprivation, are recommended as risk factors.4244 CXR age directly observes the cardiac image and is related to poor laboratory data and physical condition, corresponding to traditional tools, which use indirect laboratory data and vital signs to evaluate its ageing degree. Both have a reasonable mechanism and can complement each other. However, a lack of high-quality evidence on CVD morbidity or mortality,58 missing data due to expense and many parameters make it difficult to use in medical databases, 9 and the definition of family or social history is difficult to define. Conversely, the simple, minimal information needed and the most importantly, more precise CXR age is an approachable CVD risk stratification tool that we can use (the comparison was showed in supplemental Table 1).

To the best of our knowledge, this is the first study to use DLM-based CXR age to predict future cardiovascular-related diseases. Previous studies only analyzed risk factors and primary outcome only mortality without additional analysis, 31 other just analyzing heart failure. 45 Both were trained with small or public databases, which are well known that there are inaccuracies in labels and lower quality. 46 We analyzed high residuals and dug deeper into the reasons behind it, finding statistically significant risk difference on almost all complications. However, new-onset DM and HTN showed no obvious outcome. We suppose that because new-onset DM and HTN are usually caused not by insufficient existing heart function, they mostly affect small blood vessels first, and they take a long time to affect the heart. For CXR age, which directly observes the shape of the heart, it is possible to observe no change. In previous studies, the pathogenesis of HTN involved the interaction of genetic and environmental factors, including abnormal fluid volume regulation, increased vasoconstriction, and vessel wall remodeling. 47 However, it is a slow progression and requires several years to progress, so HTN is usually associated with later rather than early CAD. 48 For DM, there was no difference between patients with DM and non-DM on vessel screening by coronary angiography within 5 years of diagnosis. 49 As a result, CXR age has the ability to predict CVD mortality and incidence, which can present its abnormality on CXR.

CXR age is an approachable CVD risk stratification tool that can help clinician with early screenings who require further surveys, especially in the passive electronic medical records screening. The most important thing is that CXR age is not a new intervention, but only analyzes the original information from routine examination, allowing the doctor to give the human body state an overall score, which can be used to guide people regarding lifestyle improvements and further interventions without the need for a comprehensive inspection.

Limitation

Several limitations were found in our work. First, our research is hospital-based and only includes a physical examination center and outpatient department, which may show a limitation to the community. Second, it is uncertain which CXR characteristics were analyzed by the DLM. Previous research used Gradient-weighted Class Activation Mapping (Grad-CAMs) for analysis, but a certain relationship is still unconfirmed.31,32 Third, the laboratory characteristics were taken from the physical examination center and outpatient department, which may have received treatment. Taking the lipid profile as an example, we found that the result of low-density lipoprotein (LDL) was the opposite of our residual age. This may be related to the management of dyslipidemia. Since LDL is regarded as a major cardiovascular risk factor, European Society of Cardiology guidelines suggest controlling LDL-C levels lower than approximately 100 mg/dL with statins, adjusting for CV risk. 50 In contrast, low HDL is not often clinically interfered with by using drugs, causing HDL to correlate to residual age. Therefore, this factor should be considered in future studies. Finally, in our study, the internal validation and the external validation showed a different outcome on AMI, STK, CAD or CV-cause mortality. We hypothesize that this may be related to the characteristics of the hospital. As an external validation, it only has a respiratory care ward and nursing home, and there is no general ward. Further validation should be performed in more databases. Therefore, there is insufficient evidence to directly apply our model to foreign data, particularly in countries with different disease distributions, as it may yield inconsistent results. However, it is important to emphasize that our article primarily focuses on the correlation between the predicted physiological age based on CXRs and acute age. We believe that this conclusion holds true for individuals from diverse ethnic backgrounds.

Conclusion

Our study provides a protocol for using deep learning to analyze physiological age from conventional chest images as a clinical tool. It can lead to more accurate risk assessment classification models, and thus more accurate medical treatments can be given. Through further research, we may be able to understand the clinical role of deep learning to analyze the overall information of chest images and prove its usefulness in evaluation and treatment.

Supplemental Material

sj-docx-1-dhj-10.1177_20552076231191055 - Supplemental material for The deep learning algorithm estimates chest radiograph-based sex and age as independent risk factors for future cardiovascular outcomes

Supplemental material, sj-docx-1-dhj-10.1177_20552076231191055 for The deep learning algorithm estimates chest radiograph-based sex and age as independent risk factors for future cardiovascular outcomes by Hao-Chun Liao, Chin Lin, Chih-Hung Wang and Wen-Hui Fang in DIGITAL HEALTH

Nonstandard Abbreviations and Acronyms

AF

atrial fibrillation

AI

artificial intelligence

Alb

albumin

ALT

alanine aminotransferase

AMI

acute myocardial infarction

AST

aspartate aminotransferase

AUC

area under curve

BMI

body mass index

BUN

blood urea nitrogen

Ca

calcium

CAD

coronary artery disease

CHF

congestive heart failure

CI

conference interval

CKD

chronic kidney disease

Cl

chloride

COPD

chronic obstructive pulmonary disease

Cr

creatinine

CVD

cardiovascular disease

CXR

chest X-ray

CXR age

CXR-based age

DLM

deep learning model

DM

diabetes mellitus

ECG

electrocardiogram

eGFR

estimated glomerular filtration rate

GLU

glucose

Hb

hemoglobin

HbA1c

glycated hemoglobin

HDL

high-density lipoprotein cholesterol

HR

hazard ratio

HTN

hypertension

K

potassium

LDL

low-density lipoprotein cholesterol

Na

sodium

PLT

platelet

ROC

receiver operating characteristic

TC

total cholesterol

TG

triglyceride

TG

triglycerides

WBC

white blood cell count.

Footnotes

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding: The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the National Defense Medical Center, Taiwan, Tri-Service General Hospital, Ministry of Science and Technology, Taiwan (grant number NSTC112-2321-B-016-003, TSGH-B-112016, MND-MAB-C13-112052).

Supplemental material: Supplemental material for this article is available online.

References

  • 1.Zabihi M, Rad AB, Katsaggelos AK, et al. "Detection of atrial fibrillation in ECG hand-held devices using a random forest classifier." 2017 Computing in Cardiology (CinC). IEEE, 2017.
  • 2.Piepoli MF, Hoes AW, Agewall S, et al. 2016 European guidelines on cardiovascular disease prevention in clinical practice: the sixth joint task force of the European Society of Cardiology and other societies on cardiovascular disease prevention in clinical practice (constituted by representatives of 10 societies and by invited experts) developed with the special contribution of the European Association for Cardiovascular Prevention & Rehabilitation (EACPR). Eur Heart J 2016; 37: 2315–2381. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Arnett DK, Blumenthal RS, Albert MA, et al. 2019 ACC/AHA guideline on the primary prevention of cardiovascular disease: a report of the American College of Cardiology/American Heart Association task force on clinical practice guidelines. Circulation 2019; 140: e596–e646. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Hearts technical package for cardiovascular disease management in primary health care: risk based CVD management. Geneva: World Health Organization; 2020. Licence: CC BY-NC-SA 3.0 IGO.
  • 5.Collins DR, Tompson AC, Onakpoya IJ, et al. Global cardiovascular risk assessment in the primary prevention of cardiovascular disease in adults: systematic review of systematic reviews. BMJ Open 2017; 7: e013650. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Liew SM, Doust J, Glasziou P. Cardiovascular risk scores do not account for the effect of treatment: a review. Heart 2011; 97: 689–697. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Studziński K, Tomasik T, Krzysztoń J, et al. Effect of using cardiovascular risk scoring in routine risk assessment in primary prevention of cardiovascular disease: an overview of systematic reviews. BMC Cardiovasc Disord 2019; 19: 11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Karmali KN, Persell SD, Perel P, et al. Risk scoring for the primary prevention of cardiovascular disease. Cochrane Database Syst Rev 2017; 3: Cd006887. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Poplin R, Varadarajan AV, Blumer K, et al. Prediction of cardiovascular risk factors from retinal fundus photographs via deep learning. Nat Biomed Eng 2018; 2: 158–164. [DOI] [PubMed] [Google Scholar]
  • 10.Bhatia RS, Dorian P. Screening for cardiovascular disease risk with electrocardiography. JAMA Intern Med 2018; 178: 1163–1164. [DOI] [PubMed] [Google Scholar]
  • 11.Watanabe Y, Nakagawa T, Fukai K, et al. Descriptive study of chest X-ray examination in mandatory annual health examinations at the workplace in Japan. PLoS One 2022; 17: e0262404. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Jindal S, Gombar S, Jain K. Is routine preoperative chest X-ray: an underutilized tool in asymptomatic patients! Ann Card Anaesth 2018; 21: 460–461. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Kovacevic M, Goranovic T, Markic A, et al. Usefulness of routine chest X-ray in preoperative evaluation of patients undergoing non-cardiopulmonary surgery: a prospective observational study: 1ap5-5. Eur J Anaesthesiol, EJA 2012; 29: 16. [Google Scholar]
  • 14.Majkowska A, Mittal S, Steiner DF, et al. Chest radiograph interpretation with deep learning models: assessment with radiologist-adjudicated reference standards and population-adjusted evaluation. Radiology 2020; 294: 421–431. [DOI] [PubMed] [Google Scholar]
  • 15.Rahaman MM, Li C, Yao Y, et al. Identification of COVID-19 samples from chest X-ray images using deep learning: a comparison of transfer learning approaches. J Xray Sci Technol 2020; 28: 821–839. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Matsumoto T, Kodera S, Shinohara H, et al. Diagnosing heart failure from chest X-ray images using deep learning. Eur Heart J 2020; 41. [DOI] [PubMed] [Google Scholar]
  • 17.Liu W-T, Lin C-S, Tsao T-P, et al. A deep-learning algorithm-enhanced system integrating electrocardiograms and chest X-rays for diagnosing aortic dissection. Can J Cardiol 2021; 38(2): 160–168. [DOI] [PubMed] [Google Scholar]
  • 18.Hsiang C-W, Lin C, Liu W-C, et al. Detection of left ventricular systolic dysfunction using an artificial intelligence–enabled chest X-ray. Can J Cardiol 2022; 38(6): 763–773. [DOI] [PubMed] [Google Scholar]
  • 19.Irvin J, Rajpurkar P, Ko M, et al. Chexpert: a large chest radiograph dataset with uncertainty labels and expert comparison. Proc AAAI Conf Artificial Intelligence 2019; 33(No. 01): 590–597. [Google Scholar]
  • 20.Chang C-H, Lin C-S, Luo Y-S, et al. Electrocardiogram-based heart age estimation by a deep learning model provides more information on the incidence of cardiovascular disorders. Front Cardiovasc Med 42. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Klein JS, R-d-CMASAtCRAFIHJ. Kubik-Huch RA, von Schulthess GK, editors. Diseases of the chest, breast, heart and vessels 2019–2022: diagnostic and interventional imaging. Cham (CH): Springer; 2019, Chapter 1, pp. 30–40. Available from: https://www.ncbi.nlm.nih.gov/books/NBK553874/ doi: 10.1007/978-3-030-11149-6_1. [PubMed]
  • 22.Taylor Bryan J, Bruce D. Johnson. The pulmonary circulation and exercise responses in the elderly. Semin Respir Crit Care Med 2010; 31: 528–538. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Ghali JK, Liao Y, Cooper RSet al. et al. Changes in pulmonary hemodynamics with aging in a predominantly hypertensive population. Am J Cardiol 1992; 70: 367–370. [DOI] [PubMed] [Google Scholar]
  • 24.Chang SC, Chang HI, Liu SY, et al. Effects of body position and age on membrane diffusing capacity and pulmonary capillary blood volume. Chest 1992; 102: 139–142. [DOI] [PubMed] [Google Scholar]
  • 25.Plank L, James J, Wagenvoort CA. Caliber and elastin content of the pulmonary trunk. Arch Pathol Lab Med 1980; 104: 238–241. [PubMed] [Google Scholar]
  • 26.Gozna ER, Marble AE, Shaw Aet al. et al. Age-related changes in the mechanics of the aorta and pulmonary artery of man. J Appl Physiol 1974; 36: 407–411. [DOI] [PubMed] [Google Scholar]
  • 27.Torun E, Kacmaz F. Clinical implications of chest X-ray parameters in evaluating patients with cardiac dyspnea. Eurasian J Med 2008; 40: 133–136. [PMC free article] [PubMed] [Google Scholar]
  • 28.Sahin H, Stark P. Diagnostic utility of chest radiography in predicting long-standing systemic arterial hypertension. Aorta (Stamford) 2017; 5: 168–172. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Lauzier PT, Chow BJW. Artificial intelligence detection of left ventricular systolic dysfunction using chest X-rays: prospective validation, please. Can J Cardiol 2022; 38: 720–722. [DOI] [PubMed] [Google Scholar]
  • 30.Boyars M. Chest roentgenography for pulmonary evaluation. In: Walker HK, Hall WD, Hurst JW. (eds) Clinical methods: the history, physical, and laboratory examinations. 3rd ed. Boston: Butterworths, 1990. Chapter 48. Available from: Https://www.Ncbi.Nlm.Nih.Gov/books/nbk370/. [PubMed] [Google Scholar]
  • 31.Raghu VK, Weiss J, Hoffmann U, et al. Deep learning to estimate biological age from chest radiographs. JACC: Cardiovascular Imaging 2021; 14(11): 2226–2236. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Yang C-Y, Pan Y-J, Chou Y, et al. Using deep neural networks for predicting age and sex in healthy adult chest radiographs. J Clin Med 2021; 10: 4431. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Raghunath S, Ulloa Cerna AE, Jing L, et al. Prediction of mortality from 12-lead electrocardiogram voltage data using a deep neural network. Nat Med 2020; 26: 886–891. [DOI] [PubMed] [Google Scholar]
  • 34.Сох D. Regression models and life-tables. Journal of 1972. [Google Scholar]
  • 35.Li D, Lin CT, Sulam Jet al. et al. Deep learning prediction of sex on chest radiographs: a potential contributor to biased algorithms. Emergency Radiology 2022. [DOI] [PubMed] [Google Scholar]
  • 36.Arai T, Kuroishi T, Saito Y, et al. Tumor doubling time and prognosis in lung cancer patients: evaluation from chest films and clinical follow-up study. Japanese lung cancer screening research group. Jpn J Clin Oncol 1994; 24: 199–204. [PubMed] [Google Scholar]
  • 37.Lakatta EG, Levy D. Arterial and cardiac aging: major shareholders in cardiovascular disease enterprises: part II: the aging heart in health: links to heart disease. Circulation 2003; 107: 346–354. [DOI] [PubMed] [Google Scholar]
  • 38.North BJ, Sinclair DA. The intersection between aging and cardiovascular disease. Circ Res 2012; 110: 1097–1108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Curtis AB, Karki R, Hattoum Aet al. et al. Arrhythmias in patients ≥80 years of age: pathophysiology, management, and outcomes. J Am Coll Cardiol 2018; 71: 2041–2057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Rodgers JL, Jones J, Bolleddu SI, et al. Cardiovascular risks associated with gender and aging. J Cardiovasc Dev Dis 2019; 6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Steenman M, Lande G. Cardiac aging and heart disease in humans. Biophys Rev 2017; 9(2): 131–137. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Dhingra R, Vasan RS. Age as a risk factor. Med Clin North Am 2012; 96: 87–91. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.D'Agostino RB, Sr, Vasan RS, Pencina MJ, et al. General cardiovascular risk profile for use in primary care: the Framingham heart study. Circulation 2008; 117: 743–753. [DOI] [PubMed] [Google Scholar]
  • 44.Jackson R. Updated New Zealand cardiovascular disease risk-benefit prediction guide. Br Med J 2000; 320: 709–710. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Ieki H, Ito K, Saji M, et al. Deep learning-based chest X-ray age serves as a novel biomarker for cardiovascular aging. bioRxiv. 2021.
  • 46.Çallı E, Sogancioglu E, van Ginneken B, et al. Deep learning for chest X-ray analysis: a survey. Medical Image Analysis 2021; 72: 102125. [DOI] [PubMed] [Google Scholar]
  • 47.MacKenzie A. Endothelium-derived vasoactive agents, at1 receptors and inflammation. Pharmacol Ther 2011; 131: 187–203. [DOI] [PubMed] [Google Scholar]
  • 48.Milane A, Abdallah J, Kanbar R, et al. Association of hypertension with coronary artery disease onset in the Lebanese population. Springerplus 2014; 3: 533. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Srinivasan MP, Kamath PK, Bhat NM, et al. Severity of coronary artery disease in type 2 diabetes mellitus: does the timing matter? Indian Heart J 2016; 68: 158–163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Reiner Z, Catapano AL, De Backer G, et al. ESC/EAS guidelines for the management of dyslipidaemias: the task force for the management of dyslipidaemias of the European Society of Cardiology (ESC) and the European Atherosclerosis Society (EAS). Eur Heart J 2011; 32: 1769–1818. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

sj-docx-1-dhj-10.1177_20552076231191055 - Supplemental material for The deep learning algorithm estimates chest radiograph-based sex and age as independent risk factors for future cardiovascular outcomes

Supplemental material, sj-docx-1-dhj-10.1177_20552076231191055 for The deep learning algorithm estimates chest radiograph-based sex and age as independent risk factors for future cardiovascular outcomes by Hao-Chun Liao, Chin Lin, Chih-Hung Wang and Wen-Hui Fang in DIGITAL HEALTH


Articles from Digital Health are provided here courtesy of SAGE Publications

RESOURCES