Skip to main content
European Heart Journal Open logoLink to European Heart Journal Open
. 2023 Nov 8;3(6):oead113. doi: 10.1093/ehjopen/oead113

Fully automated coronary artery calcium quantification on electrocardiogram-gated non-contrast cardiac computed tomography using deep-learning with novel Heart-labelling method

Daigo Takahashi 1, Shinichiro Fujimoto 2,, Yui O Nozaki 3, Ayako Kudo 4, Yuko O Kawaguchi 5, Kazuhisa Takamura 6, Makoto Hiki 7, Eisuke Sato 8, Nobuo Tomizawa 9, Hiroyuki Daida 10,11, Tohru Minamino 12,2
Editor: Frank A Flachskampf
PMCID: PMC10683040  PMID: 38035036

Abstract

Aims

To develop an artificial intelligence (AI)-model which enables fully automated accurate quantification of coronary artery calcium (CAC), using deep learning (DL) on electrocardiogram (ECG)-gated non-contrast cardiac computed tomography (gated CCT) images.

Methods and results

Retrospectively, 560 gated CCT images (including 60 synthetic images) performed at our institution were used to train AI-model, which can automatically divide heart region into five areas belonging to left main (LM), left anterior descending (LAD), circumflex (LCX), right coronary artery (RCA), and another. Total and vessel-specific CAC score (CACS) in each scan were manually evaluated. AI-model was trained with novel Heart-labelling method via DL according to the manual-derived results. Then, another 409 gated CCT images obtained in our institution were used for model validation. The performance of present AI-model was tested using another external cohort of 400 gated CCT images of Stanford Center for Artificial Intelligence of Medical Imaging by comparing with the ground truth. The overall accuracy of the AI-model for total CACS classification was excellent with Cohen’s kappa of k = 0.89 and 0.95 (validation and test, respectively), which surpasses previous research of k = 0.89. Bland-Altman analysis showed little difference in individual total and vessel-specific CACS between AI-derived CACS and ground truth in test cohort (mean difference [95% confidence interval] were 1.5 [−42.6, 45.6], −1.5 [−100.5, 97.5], 6.6 [−60.2, 73.5], 0.96 [−59.2, 61.1], and 7.6 [−134.1, 149.2] for LM, LAD, LCX, RCA, and total CACS, respectively).

Conclusion

Present Heart-labelling method provides a further improvement in fully automated, total, and vessel-specific CAC quantification on gated CCT.

Keywords: Coronary artery calcium, Computed tomography, Artificial intelligence, Deep learning

Graphical Abstract

Graphical abstract.

Graphical abstract

Introduction

Coronary artery calcium (CAC) score (CACS) is widely used for risk assessment of coronary artery disease (CAD)1 and has been integrated into several clinical guidelines. Numerous previous studies have demonstrated that the conventional quantitative evaluation by CAC scoring on electrocardiogram (ECG)-gated non-contrast cardiac computed tomography (gated CCT) serves a great value of predicting CAD.2,3 Recent research has also revealed the prognostic impact bestowed by branch-wise CAC assessment.4–6 However, to calculate CACS in every single patient for CAD screening is too inefficient because it requires experts’ manual identification of coronary calcium lesions.

On the other hand, in recent years, artificial intelligence (AI) technologies have made great strides and demonstrated their efficacy in several medical fields. Especially in medical diagnostic imaging, deep learning has a big potential to assist in detecting lesions or abnormalities.7–9 Although some software has already been developed to quantify total CACS full-automatically,10,11 identification of localized calcification on each coronary vessel is still challenging. The aim of this study is to develop a further accuracy-improved AI-model for both total and vessel-specific CAC scoring on gated CCT. The key contributions of our study are:

  1. Explainable AI-model for Agatston scoring; performs heart labelling into coronary artery perfusion regions followed by well-proven rule-based calcium detection with CT thresholding of 130 Hounsfield units (HU) rather than black box approach of direct calcification detection

  2. Diverse training and validation cohorts, also containing normal CCTs without calcification

  3. Test on external public dataset with accuracy better than state of the art.

Methods

This retrospective study was approved by our institutional review board in accordance with local ethics procedures, and the requirement for written informed consent was waived.

Study design and population

Three independent cohorts were studied, including a total of 1369 patients who underwent gated CCT.

Training cohort

Study flow chart model training is shown in Figure 1A. We retrospectively studied 500 consecutive patients who had no prior history of heart disease and underwent coronary CT angiography (CCTA) for suspected of CAD in our institution between August 2019 and July 2020. In every patient, gated CCT scans were also obtained just before CCTA for routine CAC scoring. An additional 60 synthetic gated CCT images were generated from the other CCTA cases (detailed later); therefore, a total of 560 gated CCT images were used for model training.

Figure 1.

Figure 1

(A) Study flow chart of training cohort. A total of 560 gated CCT images were used for model training, including 60 synthetic gated CCT generated from CCTA images. (B) Study flow chart of validation and test cohort. For validation, as 21 gated CCT images were excluded due to its poor image quality or artefacts, a total of 409 gated CCT images were enrolled. For test, 50 scans were excluded because of several reasons, and a total of 400 gated CCT images were used. The performance of the model was assessed by comparing AI-derived CACS with manual CACS, or ground truth segmentation (validation cohort and test cohort, respectively) in both total and vessel-specific levels.

Validation and test cohort

Study flow chart of model validation and test is shown in Figure 1B. For performance validation of the AI-model, 430 consecutive patients who had no prior history of heart disease and underwent gated CCT and CCTA for clinical indication for suspected CAD in our institution between August 2020 and August 2021 were enrolled. Twenty-one scans were excluded due to its poor image quality or artefacts; thus, a total of 409 gated CCT images were used for model validation. The performance of the model was assessed by comparing AI-derived CACS with manually-derived CACS in both total and vessel-specific levels. The validation cohort played an integral role in monitoring and fine-tuning the model during training to avoid overfitting.

For performance testing of the AI-model, we studied another 450 gated CCT datasets that Stanford University has made available online for free (Center for Artificial Intelligence in Medicine and Imaging; AIMI).12 To be used for CACS analysis, these datasets consist of the images obtained from the patients with at least some calcification in their coronary arteries. Fifty scans were excluded because of metal implants, severe artefacts, or data corruption (14, 23, and 13, respectively); then, a total of 400 gated CCT images were enrolled. We compared AI-derived CACS with the disclosed ground truth segmentation. The test cohort was employed only for evaluation purposes, and at no point was it used to influence or impact the training process.

Imaging protocol

All the patients in the internal training and validation cohorts were scanned using a 320-row CT (Aquillion ONE, Canon medical systems, Tokyo, Japan). Prospective gated CCT was obtained just before CCTA, at 75% of R-R interval, with a tube current of 50–250 mA (depending on body mass index), and voltage of 120 KVp. The slice thickness was 3.0 mm. Images of the external test cohort were obtained with a same voltage and slice thickness; however, the other detailed conditions are not disclosed.

Manual coronary artery calcium detection, calculation, and categorization

Vessel-specific CAC identification was performed by an expert on an axial view of gated CCT images, and each calcification was labelled using one of the four labels [left main (LM), left anterior descending (LAD), left circumflex (LCX), and right coronary artery (RCA)] using dedicated software (SYNAPSE VINCENT, Fujifilm, Japan). Complementary information from sagittal and coronal views was also used as a reference. As described in conventional studies, the threshold for a calcific lesion was set at CT value of 130 HU, having area ≥ 1 mm2.13,14 Then CACS was calculated using the method of Agatston,3 and the scores were categorized into five risk groups as previously described: (0) <1, (1) 1–10, (2) 11–100, (3) 101–400, and (4) >400.15,16 Patients in test cohort had at least a little coronary calcification; thus they were stratified into four latter groups. In training cohort, second manual CAC segmentation was performed 3 months after the initial analysis and also by another expert to assess intra- and inter-observer agreement to certify reproducibility of CAC quantification.

Artificial intelligence algorithm methodology

Heart-labelling method

Conventionally, AI-based CAC scoring algorithms detect calcifications on gated CCT images directly.10,11 An ideal and explainable approach would be to extract and label coronary arteries in CCT images using the AI, followed by calcium extraction using threshold of 130 HU. However, the extraction of coronary arteries is challenging due to poor visibility in thick slice scan (3 mm). Thus, we propose a more practical yet explainable approach of extracting vessel perfusion regions by segmenting CCT images into five regions; four perfusion regions of LM, LAD, LCX, and RCA and another (e.g. aorta, valves, ventricles; the other area or structures inside the mediastinum.). Figure 2A shows gated CCT image, and Figure 2B shows heart segmented into five regions. We refer to this method as ‘Heart-labelling method’.

Figure 2.

Figure 2

Heart-labelling method. (A) Gated CCT image. Calcifications are localized in LM, LAD, LCX and another (aorta) region. (B) Heart labelling into vessel perfusion regions and other region (green). (C) Sparse labels; only calcified regions are labelled. (D) Coronary labels. In 3D coronary CT angiography, whole coronary arteries are labelled (left). The synthetic image is generated by transforming contrast enhanced image (mid) into non-contrast image (right). (E) Non-coronary labels. Non-coronary label generation. Original CCT image (left), labelling by CT value threshold (middle), removal of the coronary artery region by applying multiple erosion operations followed by dilation which removes any thin tube-like structures (right).

Heart-labelling from sparse labels

Only calcified regions are labelled in the ground truth as shown in Figure 2C. From the context of Heart-labelling into perfusion regions, these labels are sparse labels; only a part of perfusion region is labelled. In the field of DL, training using such sparse labels is conducted by masking out unlabelled regions while training.17 Masking out means AI is trained to correctly output only the labelled regions, while the output at unlabelled regions is ignored. The hypothesis is that, if we consider the entire dataset, the labelled calcified regions are distributed uniformly across various regions in the perfusion area, allowing the AI to learn underlying perfusion area.

Additional anatomical labels for better accuracy

The assumption that calcified regions are representative of the entire perfusion region is not always true and requires very large datasets. Because regions that fall outside of the defined calcified labels would typically be disregarded, allowing the AI to attribute potentially arbitrary probabilities to those uncertain areas, the use of sparse labels alone might produce noisy labels. So, to further guide the AI to learn the underlying perfusion area, we create additional anatomical labels.

First, we create a small dataset in which entire coronary arteries are labelled. From 60 scans of CCTA, synthetic gated CCT images were generated using CycleGAN.18 Our implementation is derived from a recently proposed non-contrast to contrast image translation using CycleGAN.19 The coronary arteries clearly visible in CCTA were manually labelled and transferred to CCT images as shown in Figure 2D.

Second, to guide the AI to recognize the non-coronary region, also referred to as other region in this work, we automatically generate the labels for it. The labels are generated automatically using simple thresholding and morphology operations. As shown in Figure 2E, within the heart region, the areas with CT value greater than 10 HU are extracted. To remove coronary artery region, morphology operations of multiple erosions followed by dilation were applied so that it can remove any tube-like thin structures.

Convolutional neural network

Figure 3 shows a training flow using 3D fully convolutional neural network (FCNN) derived from U-Net architecture,20 which was trained to segment the heart into five regions: LMT, LAD, LCX, RCA, and another. It takes two input channels, 3D CT volume and binary mask of heart region, and six output channels, comprised of probability maps corresponding to LMT, LAD, LCX, RCA, another, and outside heart classes. Three-dimensional FCNN consists of an encoder and a decoder, each of which consists of several convolutional filters through which the image is processed to produce the output. The weights of the convolution filters are updated during training so that the loss (computed as difference between AI output and ground truth) is minimized. In this work, we use multi-class dice loss function. To make the AI robust to various scanning protocols (resolution, noise levels, position), we apply data augmentation during the training. The details of loss function and data augmentation are provided in Supplementary material online, File  S1.

Figure 3.

Figure 3

AI-algorithm using U-Net architecture. AI-algorithm consists of two channel input of a plain CCT image and a heart mask, and six channel outputs of LM, LAD, LCX, RCA, another, and outside heart classes. Three-dimensional fully convolutional neural network consists of an encoder and a decoder, which have convolutional filters, and the weights of the convolution layers are updated point by point so that the loss is minimized.

Statistical analysis

All statistical analysis was performed using JMP 14.2 (SAS Institute Inc., Cary, NC, USA). For the training cohort, intra- and inter-observer variabilities were assessed using Pearson’s correlation coefficient and Bland–Altman (B-A) analysis. For validation and test cohorts, the risk category performance of the AI-model was assessed by the sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), accuracy, and kappa value. The concordance of total and vessel-specific CACS between AI prediction and ground truth were assessed by B-A plot and correlation metrics using the method of Pearson. Kolmogorov–Smirnov (K-S) test was also used to analyse the similarity of distributions of AI-derived CACS and ground truth. Normally distributed quantitative variables are presented with mean and 95% CI. Non-normally distributed data was presented with median and interquartile range (IQR). A statistically significant difference was defined as a two-sided P-value <0.05.

Results

Baseline characteristics

Characteristics of all patients included in training and validation cohort are shown in Table 1. The mean age was 63.5 ± 13.3 and 65.8 ± 11.7 years, and 38.8% and 43.8% were female. The mean body mass index was 23.4 ± 3.7 and 23.5 ± 3.3. The median CACS was 32.6 and 22.8, respectively.

Table 1.

Baseline characteristics of the training and validation cohort

Training cohort (n = 500) Validation cohort (n = 409)
Age (years) 63.5 ± 13.3 65.8 ± 11.7
Sex
 Female, n (%) 194 (38.8) 179 (43.8)
 Male, n (%) 306 (61.2) 230 (56.2)
Body mass index (kg/m2) 23.4 ± 3.7 23.5 ± 3.3
Risk factor
 Hypertension, n (%) 263 (52.6) 235 (58.4)
 Dyslipidemia, n (%) 265 (53.0) 202 (49.4)
 Diabetes mellitus, n (%) 144 (28.8) 118 (28.9)
 Family history of CAD, n (%) 132 (26.4) 80 (19.6)
 Current smoking, n (%) 71 (14.2) 71 (17.4)
Blood test
 LDL-Cholesterol (mg/dL) 109.9 ± 32.2 112.5 ± 49.9
 HDL-Cholesterol (mg/dL) 56.1 ± 15.9 58.2 ± 17.3
 Triglyceride (mg/dL) 137.2 ± 93.6 143.9 ± 124.1
 Hemoglobin A1c (%) 6.3 ± 1.1 6.2 ± 1.1
 Serum creatinine (mg/dL) 0.78 [0.65, 0.87] 0.77 [0.64, 0.89]
Agatston score
 Total 32.6 [0, 200.9] 22.8 [0, 205.4]
 LM 0 [0, 0] 0 [0, 0]
 LAD 10.1 [0, 99.0] 5.7 [0, 97.5]
 LCX 0 [0, 12.1] 0 [0, 12.3]
 RCA 0 [0, 33.2] 0 [0, 33.3]

CAD, coronary artery disease; HDL, high density lipoprotein; LAD, left anterior descending; LCX, left circumflex; LDL, low density lipoprotein; LM, left main; RCA, right coronary artery

Intra- and inter-observer reproducibility

Figure 4A and 4B represent intra- and inter-observer agreement of CACS in training cohort, and we found excellent correlations (r = 0.9999, P < 0.0001; r = 0.9987, P < 0.0001, respectively). The graphical illustration in the form of B-A plots was also shown.

Figure 4.

Figure 4

Reproducibility of CAC scoring. (A) Intra-observer agreement in total CACS quantification. Correlation plot between first analysis (x-axis) and second analysis (y-axis). The Pearson’s correlation coefficient was r = 0.9999. Also shown is an inset with a magnified view of scores in the range 0–400. In Bland–Altman plot, red dashed line shows mean difference (−0.2) with the two blue dashed lines representing upper and lower limit of agreement (LoA) of ±1.96SD ([−11.9, 11,4]). (B) Inter-observer agreement in total CACS quantification. Correlation plot between analysis by expert A (x-axis) and B (y-axis). The Pearson’s correlation coefficient was r = 0.9987. Also shown is an inset with a magnified view of scores in the range 0–400. In B-A plot, red dashed line shows mean difference (2.1) with the two blue dashed lines representing upper and lower limit of agreement (LoA) of ±1.96SD [(−51.2, 55.4).

Categorical assessment of total coronary artery calcium scoring

Table 2a represents that the overall accuracy of the AI prediction in the internal validation cohort. Total CACS classification was excellent with Cohen’s kappa of k = 0.89. The classification accuracy was 91.7% (375/409), with 24 false-positive cases. The sensitivity, specificity, PPV, and NPV were 100%, 82.7%, 91.8%, and 100%, respectively.

Table 2.

Confusion matrix of CAC risk category

(a) Validation cohort
AI prediction
Ground truth Risk category [0] [1] [2] [3] [4]
[0] 115 15 9 0 0
[1] 0 38 5 0 0
[2] 0 0 84 2 0
[3] 0 0 1 71 2
[4] 0 0 0 0 67
(b) Test cohort
AI prediction
Ground truth Risk category [0] [1] [2] [3] [4]
[0] N/A N/A N/A N/A N/A
[1] N/A 57 5 1 0
[2] N/A 0 126 4 1
[3] N/A 0 1 92 2
[4] N/A 0 0 0 111

Risk category: CAC score belongs to; [0], <1; [1], 1–10; [2], 10–100; [3], 100–400; [4], >400.

AI, artificial intelligence; CAC, coronary artery calcium.

Similarly, results of AI prediction in external test cohort can be found in Table 2b. Since patients without any CAC are not enrolled in this cohort, total CACS is categorized into only four groups [(1) 1–10, (2) 10–100, (3) 100–400, and (4) >400]. The accuracy was still excellent [96.5% (386/400)] with Cohen’s kappa of k = 0.95.

Accuracy in total and vessel-specific coronary artery calcium scoring

Figure 5 represents the comparison of the total CACS between AI-derived score and ground truth that was performed. We found excellent correlation between two methods, in both validation and test cohort (r = 0.995, P < 0.0001; r = 0.994, P < 0.0001, respectively). Bland–Altman plots are also shown, suggesting little difference between two methods {mean difference [95%CI (confidence interval)]: 9.8 [−66.9, 88.5] and 9.6 [−134.1, 149.2], respectively}. Kolmogorov–Smirnov (K-S) test showed no statistically significant difference in the distribution of CACS between two methods (P = 0.998 and 0.979, respectively). Vessel-specific results are shown in Figure 6. In validation cohort, AI-derived CACS excellently correlated with manual CACS (r = 0.875, 0.992, 0.937, and 0.988 for LM, LAD, LCX, and RCA, respectively), and B-A plots graphically represented little difference between two methods in vessel level [mean difference (95%CI) were −1.9 (−43.1, 39.3), 1.1 (−39.0, 45.3), 2.8 (−45.4–51.0), and 3.9 (−49.6, 57.2), in the same order]. Correspondingly, excellent correlation and little difference were shown in test cohort (r = 0.852, 0.982, 0.976, and 0.995; mean difference [95%CI] were 1.5 [−42.6, 45.6], −1.5 [−100.5, 97.5], 6.6 [−60.2, 73.5], and 1.0 [−59.2, 61.1] for LM, LAD, LCX, and RCA, respectively).

Figure 5.

Figure 5

Performance of AI-derived total CAC scoring. For total CACS prediction, there was excellent correlation between AI-derived CACS and manual CACS (ground truth), in both validation and test cohort (r = 0.995, P < 0.0001; r = 0.994, P < 0.0001, respectively). B-A plots are also shown, suggesting little difference between two methods (mean difference [95%CI (confidence interval)]: 9.8 [−66.9, 88.5] and 9.6 [−134.1, 149.2], respectively). Kolmogorov-Smirnov (K-S) test showed no statistically significant difference in the distribution of CACS between two methods (P = 0.998 and 0.979, respectively).

Figure 6.

Figure 6

Performance of AI-derived vessel-specific CAC scoring. In validation cohort, AI-derived CACS excellently correlated with manual CACS (r = 0.875, 0.992, 0.937, and 0.988 for LM, LAD, LCX, and RCA, respectively), and B-A plots graphically represented little difference between two methods in vessel level (mean difference [95%CI] were −1.9 [−43.1, 39.3], 1.1 [−39.0, 45.3], 2.8 [−45.4–51.0], and 3.9 [−49.6, 57.2], in the same order). Correspondingly, excellent correlation and little difference were shown in test cohort (r = 0.852, 0.982, 0.976, and 0.995; mean difference [95%CI] were 1.5 [−42.6, 45.6], −1.5 [−100.5, 97.5], 6.6 [−60.2, 73.5], and 1.0 [−59.2, 61.1] for LM, LAD, LCX, and RCA, respectively).

Discussion

In present study, we have developed an improved comprehensive AI-model for automated CAC scoring on gated CCT at total and vessel-specific levels. Although it is theoretically possible to develop AI-models to calculate other indices of CAC (e.g. volume score, mass score), the present study focused on the Agatston score, which is the simplest, most widely used, and evidence-rich assessment method. By adopting a novel Heart-labelling method, AI-derived CAC scoring has achieved the following outcomes (compared to manual scoring): (i) excellent agreement in classification of CACS risk categories and (ii) excellent consistency in total and vessel-specific absolute CACS.

Fully automated CACS risk category classification has been validated in several research so far.21,22 In this study, our model achieved excellent accuracy in CACS classification with Cohen’s kappa k = 0.89 and 0.95 (validation and test cohorts, respectively). One recent research has demonstrated the great agreement in automated CACS classification of 79 patients with Cohen’s kappa k = 0.89.23 In the present model, a relatively large number of patients were analysed (n = 409 and 400), still maintaining or surpassing the previous reported model. In the validation cohort, there were 24 false positives which impaired overall result. Greater accuracy was achieved in the test cohort probably because it did not contain the case with CAC = 0.

Similarly, when treated as a continuous value, excellent CACS consistency was achieved between AI prediction and ground truth in both validation and test cohort (=0.993, P < 0.0001; r = 0.903, P < 0.0001, respectively). The reason why the latter one was lower is probably because the validation cohort includes 139 (33.9%) CAC-absent patients and 115 out of 139 CAC-absent cases (82.7%) were completely correctly analysed (as CAC = 0), which contributed to improving the overall accuracy. The mean differences were 9.8 and 9.6, which means AI prediction have tendency of slight overestimation.

Moreover, the incremental value of evaluating vessel-specific CAC has been demonstrated in several reports. It has been revealed that the presence and high burden of left main CAC are independently associated with greater cardiovascular and total mortality,4,5 and that the CAC of RCA is significantly associated with coronary heart disease.6 Another research has showcased that the presence of CAC in the proximal dominant coronary artery is associated with major coronary heart disease events (hazard ratio: 2.59).24 From this point of view, it can be said that vessel-specific CAC evaluation enables further risk assessment in addition to routine CAC scoring.

However, fully automated accurate CAC quantification of each coronary vessel has been a task with room for improvement, generally because of following problems: (i) misclassification between LM, LAD, and LCX, (ii) false positives from the calcification at aortic root and annulus, and (iii) false positives from non-coronary calcium (e.g. mitral annulus, pulmonary arteries). In this study, we adopted Heart-labelling method as one of the frameworks, which greatly improved the accuracy of AI prediction. The present Heart-labelling method can understand the underlying anatomy of heart using our novel training technique of using not only sparse labels but also anatomical labels. The latter one is developed for more accurate elimination of non-coronary calcium. In general, complex deep-learning models are black boxes that cannot explain the process how it judges and elicits the final output. This is considered one of the problems of current AI trends, however, our Heart-labelling method reasonably overcomes the issue.

Although we found excellent performance of AI-derived CAC scoring in test cohort, some drawback cases should be mentioned (Figure 7). There were still several false positives due to misrecognition of non-coronary calcium, such as (a) pericardial calcification, (b) pulmonary trunk, (c) calcified mediastinal lymph node, and (d) mitral annulus calcification. We consider that model training was not enough to cover these kinds of cases, which is not frequent in clinical practice. (e) was a unique false negative case in which CAC in the proximal LAD was not detected by the model. It is presumably because the extraordinary LAD course, which initially descends in a caudal direction and then ascends cranial direction, was not learned sufficiently in the model training. In terms of high sensitivity of CAC scoring, false positives are acceptable rather than false negatives. We consider that in order to get over these shortcomings, more numbers of varied cases should be integrated into model training.

Figure 7.

Figure 7

Drawbacks. There are several failure cases of the present AI-model, false positives of; (A) pericardial calcification, (B) pulmonary trunk, (C) calcified mediastinal lymph node, and (D) mitral annulus calcification, and false negative of (E) proximal LAD.

This study has some important limitations. First, detailed conditions for acquiring CCT images in the test cohort are unclear. Future studies need to enrol various cases with different vendors and conditions. Secondly, because the test cohort did not include calcium-absent patients, the accuracy of AI prediction may be overestimated. However, even in validation cohort including 139 (34.0%) calcium-absent cases, perfect sensitivity and NPV were found, which means it can provide robust performance in terms of screening test in clinical practice. Related to this issue, we should remark that we only validated per-patient sensitivity, not per-lesion sensitivity. Thirdly, the exclusion criteria for the images with poor quality or artefacts can be inconsistent. The manual readers determined to get rid of each image which was not to be analysed properly, and a total of 71 images were excluded. Finally, there are no consistent rules or post-processing methods for the AI to correctly segment calcifications that span multiple vascular beds. In particular, the segmentation of CAC at LM bifurcation may be clinically important, thus further refinement is needed in future studies.

In conclusion, the present Heart-labelling method provides a further improvement in fully automated, total and vessel-specific CAC detection on gated CCT.

Supplementary Material

oead113_Supplementary_Data

Acknowledgements

The AI-model was developed by Fujifilm Corporation, Tokyo, Japan. The authors are grateful to Yosuke Kogure, RT, Hidekazu Inage, RT, Hideyuki Sato, RT, Takuya Yokota, RT and Hikaru Kudo, RT for technical assistance in this study.

Contributor Information

Daigo Takahashi, Department of Cardiovascular Biology and Medicine, Juntendo University Graduate School of Medicine, 2-1-1 Hongo Bunkyo-ku, Tokyo 113-8421, Japan.

Shinichiro Fujimoto, Department of Cardiovascular Biology and Medicine, Juntendo University Graduate School of Medicine, 2-1-1 Hongo Bunkyo-ku, Tokyo 113-8421, Japan.

Yui O Nozaki, Department of Cardiovascular Biology and Medicine, Juntendo University Graduate School of Medicine, 2-1-1 Hongo Bunkyo-ku, Tokyo 113-8421, Japan.

Ayako Kudo, Department of Cardiovascular Biology and Medicine, Juntendo University Graduate School of Medicine, 2-1-1 Hongo Bunkyo-ku, Tokyo 113-8421, Japan.

Yuko O Kawaguchi, Department of Cardiovascular Biology and Medicine, Juntendo University Graduate School of Medicine, 2-1-1 Hongo Bunkyo-ku, Tokyo 113-8421, Japan.

Kazuhisa Takamura, Department of Cardiovascular Biology and Medicine, Juntendo University Graduate School of Medicine, 2-1-1 Hongo Bunkyo-ku, Tokyo 113-8421, Japan.

Makoto Hiki, Department of Cardiovascular Biology and Medicine, Juntendo University Graduate School of Medicine, 2-1-1 Hongo Bunkyo-ku, Tokyo 113-8421, Japan.

Eisuke Sato, Department of Radiological Technology, Faculty of Health Science, Juntendo University, 2-1-1 Hongo Bunkyo-ku, Tokyo 113-8421, Japan.

Nobuo Tomizawa, Department of Radiology, Juntendo University Graduate School of Medicine, 2-1-1 Hongo Bunkyo-ku, Tokyo 113-8421, Japan.

Hiroyuki Daida, Department of Cardiovascular Biology and Medicine, Juntendo University Graduate School of Medicine, 2-1-1 Hongo Bunkyo-ku, Tokyo 113-8421, Japan; Department of Radiological Technology, Faculty of Health Science, Juntendo University, 2-1-1 Hongo Bunkyo-ku, Tokyo 113-8421, Japan.

Tohru Minamino, Department of Cardiovascular Biology and Medicine, Juntendo University Graduate School of Medicine, 2-1-1 Hongo Bunkyo-ku, Tokyo 113-8421, Japan.

Lead author biography

Inline graphicDr. Daigo Takahashi, MD, PhD, is a cardiologist working at Juntendo University Hospital, Tokyo, Japan. He completed his medical training at Juntendo University Shizuoka Hospital. Through the career in acute cardiovascular care, he found great value in less invasive diagnostic methods, and began dedicating himself to research using cardiovascular imaging, especially in cardiac computed tomography using artificial intelligence.

Data availability

Most of the data underlying this article cannot be shared publicly due to the privacy of individuals that participated in the study. The data will be shared on reasonable request to the corresponding author. However, one portion of the datasets analysed in the current study is available in Center for Artificial Intelligence in Medicine & Imaging, at https://stanfordaimi.azurewebsites.net/datasets/e8ca74dc-8dd4-4340-815a-60b41f6cb2aa.

Supplementary material

Supplementary material is available at European Heart Journal Open online.

Funding

This research is partially funded by Fujifilm corporation, Tokyo, Japan and Japan Society for the Promotion of Science KAKENHI (grant no. JP21K08088).

References

  • 1. Knuuti  J, Wijns  W, Saraste  A, Capodanno  D, Barbato  E, Funck-Brentano  C, Prescott  E, Storey  RF, Deaton  C, Cuisset  T, Agewall  S, Dickstein  K, Edvardsen  T, Escaned  J, Gersh  BJ, Svitil  P, Gilard  M, Hasdai  D, Hatala  R, Mahfoud  F, Masip  J, Muneretto  C, Valgimigli  M, Achenbach  S, Bax  JJ; ESC Scientific Document Group . 2019 ESC guidelines for the diagnosis and management of chronic coronary syndromes. Eur Heart J  2020;41:407–477. [DOI] [PubMed] [Google Scholar]
  • 2. Greenland  P, Bonow  RO, Brundage  BH, Budoff  MJ, Eisenberg  MJ, Grundy  SM, Lauer  MS, Post  WS, Raggi  P, Redberg  RF, Rodgers  GP, Shaw  LJ, Taylor  AJ, Weintraub  WS; American College of Cardiology Foundation Clinical Expert Consensus Task Force (ACCF/AHA Writing Committee to Update the 2000 Expert Consensus Document on Electron Beam Computed Tomography); Society of Atherosclerosis Imaging and Prevention; Society of Cardiovascular Computed Tomography . ACCF/AHA 2007 clinical expert consensus document on coronary artery calcium scoring by computed tomography in global cardiovascular risk assessment and in evaluation of patients with chest pain. J Am Coll Cardiol  2007;49:378–402. [DOI] [PubMed] [Google Scholar]
  • 3. Agatston  AS, Janowitz  WR, Hildner  FJ, Zusmer  NR, Viamonte  M, Detrano  R. Quantification of coronary artery calcium using ultrafast computed tomography. J Am Coll Cardiol  1990;15:827–832. [DOI] [PubMed] [Google Scholar]
  • 4. Lahti  SJ, Feldman  DI, Dardari  Z, Mirbolouk  M, Orimoloye  OA, Osei  AD, Graham  G, Rumberger  J, Shaw  L, Budoff  MJ, Rozanski  A, Miedema  MD, Al-Mallah  MH, Berman  D, Nasir  K, Blaha  MJ. The association between left main coronary artery calcium and cardiovascular-specific and total mortality: the Coronary Artery Calcium Consortium. Atherosclerosis  2019;286:172–178. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Tota-Maharaj  R, Joshi  PH, Budoff  MJ, Whelton  S, Zeb  I, Rumberger  J, Al-Mallah  M, Blumenthal  RS, Nasir  K, Blaha  MJ. Usefulness of regional distribution of coronary artery calcium to improve the prediction of all-cause mortality. Am J Cardiol  2015;115:1229–1234. [DOI] [PubMed] [Google Scholar]
  • 6. Blaha  MJ, Budoff  MJ, Tota-Maharaj  R, Dardari  ZA, Wong  ND, Kronmal  RA, Eng  J, Post  WS, Blumenthal  RS, Nasir  K. Improving the CAC score by addition of regional measures of calcium distribution: multi-ethnic study of atherosclerosis. JACC Cardiovasc Imaging  2016;9:1407–1416. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Hosny  A, Parmar  C, Quackenbush  J, Schwartz  LH, Aerts  HJWL. Artificial intelligence in radiology. Nat Rev Cancer  2018;18:500–510. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Lakhani  P, Sundaram  B. Deep learning at chest radiography: automated classification of pulmonary tuberculosis by using convolutional neural networks. Radiology  2017;284:574–582. [DOI] [PubMed] [Google Scholar]
  • 9. Shin  HC, Roth  HR, Gao  M, Lu  L, Xu  Z, Nogues  I, Yao  J, Mollura  D, Summers  RM. Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and transfer learning. IEEE Trans Med Imaging  2016;35:1285–1298. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Sartoretti  T, Gennari  AG, Sartoretti  E, Skawran  S, Maurer  A, Buechel  RR, Messerli  M. Fully automated deep learning powered calcium scoring in patients undergoing myocardial perfusion imaging. J Nucl Cardiol  2023;30:313–320. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Zhang  N, Yang  G, Zhang  W, Wang  W, Zhou  Z, Zhang  H, Xu  L, Chen  Y. Fully automatic framework for comprehensive coronary artery calcium scores analysis on non-contrast cardiac-gated CT scan: total and vessel-specific quantifications. Eur J Radiol  2021;134:109420. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. COCA . Coronary calcium and chest CT’s. Available from: https://stanfordaimi.azurewebsites.net/datasets/e8ca74dc-8dd4-4340-815a-60b41f6cb2aa.
  • 13. Divakaran  S, Cheezum  MK, Hulten  EA, Bittencourt  MS, Silverman  MG, Nasir  K, Blankstein  R. Use of cardiac CT and calcium scoring for detecting coronary plaque: implications on prognosis and patient management. Br J Radiol  2015;88:20140594. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Hecht  HS. Coronary artery calcium scanning: past, present, and future. JACC Cardiovasc Imaging  2015;8:579–596. [DOI] [PubMed] [Google Scholar]
  • 15. Rumberger  JA, Brundage  BH, Rader  DJ, Kondos  G. Electron beam computed tomographic coronary calcium scanning: a review and guidelines for use in asymptomatic persons. Mayo Clin Proc  1999;74:243–252. [DOI] [PubMed] [Google Scholar]
  • 16. van der Bijl  N, Joemai  RM, Geleijns  J, Bax  JJ, Schuijf  JD, de Roos  A, Kroft  LJM. Assessment of Agatston coronary artery calcium score using contrast-enhanced CT coronary angiography. AJR Am J Roentgenol  2010;195:1299–1305. [DOI] [PubMed] [Google Scholar]
  • 17. Çiçek  Ö, Abdulkadir  A, Lienkamp  SS, Brox  T, Ronneberger  O. 3D U-Net: learning dense volumetric segmentation from sparse annotation. In: Çiçek  Ö, Abdulkadir  A, Lienkamp  SS, Brox  T, Ronneberger  O, eds. Medical Image Computing and Computer-Assisted Intervention—MICCAI 2016. MICCAI 2016. Lecture Notes in Computer Science, Vol. 9901. Cham: Springer International Publishing; 2016. [Google Scholar]
  • 18. Zhu  J-Y, Park  T, Isola  P, Efros  AA. Unpaired image-to-image translation using cycle-consistent adversarial networks 2017 March 01, 2017:[arXiv:1703.10593 p.]. Available from: https://ui.adsabs.harvard.edu/abs/2017arXiv170310593Z.
  • 19. Sandfort  V, Yan  K, Pickhardt  PJ, Summers  RM. Data augmentation using generative adversarial networks (CycleGAN) to improve generalizability in CT segmentation tasks. Sci Rep  2019;9:16884. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Ronneberger  O, Fischer  P, Brox  T. U-Net: convolutional networks for biomedical image segmentation 2015 May 01, 2015:[arXiv:1505.04597 p.]. Available from: https://ui.adsabs.harvard.edu/abs/2015arXiv150504597R.
  • 21. Sandstedt  M, Henriksson  L, Janzon  M, Nyberg  G, Engvall  J, De Geer  J, Alfredsson  J, Persson  A. Evaluation of an AI-based, automatic coronary artery calcium scoring software. Eur Radiol  2020;30:1671–1678. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Wang  W, Wang  H, Chen  Q, Zhou  Z, Wang  R, Wang  H, Zhang  N, Chen  Y, Sun  Z, Xu  L. Coronary artery calcium score quantification using a deep-learning algorithm. Clin Radiol  2020;75:237.e11–237.e16. [DOI] [PubMed] [Google Scholar]
  • 23. Eng  D, Chute  C, Khandwala  N, Rajpurkar  P, Long  J, Shleifer  S, Khalaf  MH, Sandhu  AT, Rodriguez  F, Maron  DJ, Seyyedi  S, Marin  D, Golub  I, Budoff  M, Kitamura  F, Takahashi  MS, Filice  RW, Shah  R, Mongan  J, Kallianos  K, Langlotz  CP, Lungren  MP, Ng  AY, Patel  BN. Automated coronary calcium scoring using deep learning with multicenter external validation. NPJ Digit Med  2021;4:88. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Ferencik  M, Pencina  KM, Liu  T, Ghemigian  K, Baltrusaitis  K, Massaro  JM, D’Agostino  RB, O’Donnell  CJ, Hoffmann  U. Coronary artery calcium distribution is an independent predictor of incident major coronary heart disease events: results from the Framingham heart study. Circ Cardiovasc Imaging  2017;10:e006592. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

oead113_Supplementary_Data

Data Availability Statement

Most of the data underlying this article cannot be shared publicly due to the privacy of individuals that participated in the study. The data will be shared on reasonable request to the corresponding author. However, one portion of the datasets analysed in the current study is available in Center for Artificial Intelligence in Medicine & Imaging, at https://stanfordaimi.azurewebsites.net/datasets/e8ca74dc-8dd4-4340-815a-60b41f6cb2aa.


Articles from European Heart Journal Open are provided here courtesy of Oxford University Press on behalf of the European Society of Cardiology

RESOURCES