Editorial
Korean Circ J. 2019 Nov 4;50(1):85–87. doi: 10.4070/kcj.2019.0314

Machine Learning: a New Opportunity for Risk Prediction

Osung Kwon 1, Wonjun Na 2, Young-Hak Kim 2
PMCID: PMC6923232  PMID: 31854158

Globally, cardiovascular disease (CVD) remains the major cause of mortality and morbidity, and identifying people at risk of CVD is the cornerstone of clinical cardiology.1) Accordingly, current guidelines for primary prevention of CVD recommend algorithms to identify asymptomatic patients on the basis of their predicted risk.2),3) These established algorithms are typically developed using multivariate regression models with a limited number of well-established risk factors and generally assume that all such factors are related to the CVD outcomes in a linear fashion, with limited or no interactions between the different factors.4),5) Owing to their restrictive modeling assumptions and limited number of predictors, the existing algorithms generally exhibit suboptimal predictive performance.4),5)
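
As an illustration of the modeling assumptions described above, a conventional risk equation can be written with a purely additive linear predictor. The sketch below assumes a Cox-type proportional hazards form, which the text does not specify beyond "multivariate regression models"; the symbols $h_0$, $x_j$, $\beta_j$, and $\gamma_{jk}$ are notation introduced here for illustration only.

```latex
% Assumed form: proportional hazards model with p risk factors x_1, ..., x_p.
% Each factor enters linearly and there are no interaction terms.
h(t \mid x) = h_0(t)\,\exp\!\left(\sum_{j=1}^{p} \beta_j x_j\right)

% Relaxing the "no interactions" assumption adds pairwise product terms,
% which must be specified in advance by the analyst:
h(t \mid x) = h_0(t)\,\exp\!\left(\sum_{j=1}^{p} \beta_j x_j
              + \sum_{j<k} \gamma_{jk}\, x_j x_k\right)
```

In the first form, a one-unit change in any $x_j$ multiplies the hazard by the same factor $\exp(\beta_j)$ regardless of the values of the other factors, which is the restriction that more flexible ML models are meant to relax.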

Along with the emergence of big data, machine learning (ML) provides an alternative approach to established prediction modelling that may address the current limitations. Accumulated medical data and digitalized clinical information enable ML to verify hypotheses generated from conventional statistical analysis and to agnostically discover new predictors of CVD risk. Recently, deep learning (DL), a branch of ML, has become increasingly popular in the medical research community because of its excellent performance in different domains and its rapid methodological improvements.6) DL extends artificial neural networks by using more layers, which permit higher levels of abstraction and improved predictions from data.7) To date, it is the leading ML tool in medical image analysis, with promising results.8) More recently, by virtue of large biomedical training datasets and advanced computing power, DL has been applied to the development of risk prediction models using electronic health data.6)

Against this background, Cho et al. investigated the additional discriminative accuracy of a time-series DL algorithm using repeated-measures data for identifying people at high risk of CVD, in comparison with the Cox proportional hazards regression model.9) The authors found that the time-series DL algorithm showed greater discriminative accuracy than the Cox model approaches. This study extends the potential of DL from models that predict outcomes on the basis of data from specific time points to models that predict future events in complex time-varying datasets. The study used large datasets from a national health screening program and the national health insurance claims database in South Korea for development and validation. Furthermore, prospective cohort data from the Rotterdam Study were used for external validation to assess generalizability across ethnicities. This design followed the recommendations of the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis statement,10) which strengthens the reliability of the study results. In addition, ML, particularly neural networks, is sometimes called a “black box” because its outputs are difficult to interpret. To address this, the authors assessed the attribute rank of the risk predictors in the DL model.
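
Discriminative accuracy for time-to-event models is commonly summarized with a concordance statistic. The following is a minimal sketch of Harrell's C-index in plain Python, offered only to make the concept concrete; it is not the evaluation code of Cho et al., and the function name `concordance_index` and the toy follow-up data are hypothetical. Pairs with tied follow-up times are skipped as a simplification.

```python
from itertools import combinations


def concordance_index(times, events, risk_scores):
    """Harrell's C: share of comparable pairs ranked correctly by risk.

    times       -- follow-up time for each subject
    events      -- 1 if the event was observed, 0 if censored
    risk_scores -- model output; higher means higher predicted risk
    """
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        # Let `a` be the subject with the shorter follow-up time.
        a, b = (i, j) if times[i] <= times[j] else (j, i)
        # A pair is comparable only if the earlier subject had an event.
        # Tied times are skipped here (a simplification).
        if times[a] == times[b] or not events[a]:
            continue
        comparable += 1
        if risk_scores[a] > risk_scores[b]:
            concordant += 1.0   # earlier event had the higher risk score
        elif risk_scores[a] == risk_scores[b]:
            concordant += 0.5   # tied scores count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Toy data (hypothetical): four subjects, one censored at time 6.
times = [2, 4, 6, 8]
events = [1, 1, 0, 1]
model_a = [0.9, 0.7, 0.3, 0.1]   # ranks every comparable pair correctly
model_b = [0.9, 0.1, 0.3, 0.7]   # misranks two of the five comparable pairs

print(concordance_index(times, events, model_a))
print(concordance_index(times, events, model_b))
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect ranking, so comparing two models on the same cohort amounts to comparing these values, ideally with confidence intervals.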

Data derived from hospital information systems are characterized by enormous volume, heterogeneity, and complexity, and DL could provide a solution for analyzing this kind of complex data. However, as the authors acknowledge in their limitations, only 6 variables, all of which are already well-known strong cardiovascular risk factors, were used to develop the risk prediction model, which limits both the value of the study and the benefit of using DL. Considering the purpose of the study, which was to confirm the superior analytic performance of DL relative to that of Cox regression, the study serves as an attempt to navigate the challenges of DL in developing cardiovascular risk prediction models. Further studies using a large number of diverse variables are required to validate the predictive performance of DL.

DL introduces exciting new opportunities for precision medicine, including risk stratification and future event prediction. Attempts to apply DL methods to patient care and clinical research are already planned or underway. Despite the current hurdles to the application of DL for cardiac risk stratification, inspiration and dispassionate effort should lead to the development of more reliable and robust models for realizing personalized cardiovascular health care.

Footnotes

Conflict of Interest: The authors have no financial conflicts of interest.

Author Contributions:
  • Conceptualization: Kwon O, Kim YH.
  • Writing - original draft: Kwon O, Na W.
  • Writing - review & editing: Kim YH.

The contents of the report are the authors' own views and do not necessarily reflect the views of the Korean Circulation Journal.

References

  • 1. Roth GA, Johnson C, Abajobir A, et al. Global, regional, and national burden of cardiovascular diseases for 10 causes, 1990 to 2015. J Am Coll Cardiol. 2017;70:1–25. doi: 10.1016/j.jacc.2017.04.052.
  • 2. Goff DC, Jr, Lloyd-Jones DM, Bennett G, et al. 2013 ACC/AHA guideline on the assessment of cardiovascular risk: a report of the American College of Cardiology/American Heart Association task force on practice guidelines. J Am Coll Cardiol. 2014;63:2935–2959. doi: 10.1016/j.jacc.2013.11.005.
  • 3. Authors/Task Force Members, Piepoli MF, Hoes AW, et al. 2016 European Guidelines on cardiovascular disease prevention in clinical practice: the sixth joint task force of the European Society of Cardiology and other societies on cardiovascular disease prevention in clinical practice (constituted by representatives of 10 societies and by invited experts) developed with the special contribution of the European Association for Cardiovascular Prevention & Rehabilitation (EACPR). Atherosclerosis. 2016;252:207–274. doi: 10.1016/j.atherosclerosis.2016.05.037.
  • 4. Siontis GC, Tzoulaki I, Siontis KC, Ioannidis JP. Comparisons of established risk prediction models for cardiovascular disease: systematic review. BMJ. 2012;344:e3318. doi: 10.1136/bmj.e3318.
  • 5. Alaa AM, Bolton T, Di Angelantonio E, Rudd JH, van der Schaar M. Cardiovascular disease risk prediction using automated machine learning: a prospective study of 423,604 UK Biobank participants. PLoS One. 2019;14:e0213653. doi: 10.1371/journal.pone.0213653.
  • 6. Miotto R, Wang F, Wang S, Jiang X, Dudley JT. Deep learning for healthcare: review, opportunities and challenges. Brief Bioinform. 2018;19:1236–1246. doi: 10.1093/bib/bbx044.
  • 7. LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015;521:436–444. doi: 10.1038/nature14539.
  • 8. Litjens G, Ciompi F, Wolterink JM, et al. State-of-the-art deep learning in cardiovascular image analysis. JACC Cardiovasc Imaging. 2019;12:1549–1565. doi: 10.1016/j.jcmg.2019.06.009.
  • 9. Cho IJ, Sung JM, Kim HC, et al. Development and external validation of a deep learning algorithm for prognostication of cardiovascular outcomes. Korean Circ J. 2019;50:72–84. doi: 10.4070/kcj.2019.0105.
  • 10. Collins GS, Reitsma JB, Altman DG, Moons KG. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. Ann Intern Med. 2015;162:55–63. doi: 10.7326/M14-0697.

Articles from Korean Circulation Journal are provided here courtesy of The Korean Society of Cardiology
