Abstract
Unexpected insights and practical advances in cardiovascular disease (CVD) are being discovered by rapidly advancing developments in supercomputers and machine learning (ML) software algorithms. These have been accelerated during the COVID-19 pandemic, and the resulting CVD translational implications of ML are steering new measures of prevention and treatment, new tools for objective clinical diagnosis, and even opportunities for rethinking basic foundations of CVD nosology. As the usual cardiovascular specialist may not be familiar with these tools, the editor has invited this brief overview.
Keywords: Cardiovascular disease, Clinical diagnosis, Machine learning, Prediction models
Unexpected insights and practical advances in cardiovascular disease (CVD) are being discovered by rapidly advancing developments in supercomputers and machine learning (ML) software algorithms. These have been accelerated during the COVID-19 pandemic, and the resulting CVD translational implications of ML are steering new measures of prevention and treatment, new tools for objective clinical diagnosis, and even opportunities for rethinking basic foundations of CVD nosology [1], [2], [3], [4], [5]. As the usual cardiovascular specialist may not be familiar with these tools, the editor has invited this brief overview.
1. Types of machine learning
ML is a subcategory within the broad discipline of Artificial Intelligence (AI). AI is the general concept of employing machines and mathematical algorithms that aspire to ultimately make computers behave with the capabilities of the human mind. AI is a lofty moving-target goal spanning a wide umbrella of technological advances that includes various subgroups such as ML which are more practical and less existential. ML uses computer programs in a dynamic self-altering process to automatically improve outcomes through iterative self-training experiences of exploring data patterns [6]. There are four basic flavors of ML: unsupervised, supervised, reinforcement learning, and deep-learning.
-
•
Unsupervised ML models learn from clustering and association patterns of unlabeled input data without human intervention. Here, the output delivers de novo meaningful insights to the table with minimal a priori assumptions. Such models include multivariate principal component analysis (PCA), k-means clustering, and hierarchical clustering. PCA is a method of reducing an unmanageably large number of complex variables in a dataset down to only two or three meaningful variables that still retain most of the original information. K-means clustering sorts through huge pools of raw data to form multiple groups each clustered around a common center value. Hierarchical clustering builds a branching tree of data organized by degree of similarity nested within a branch or comparisons between branches. One medical example of unsupervised ML is predicting or managing heart disease based on anomalies found within a battery of patient characteristics such as age, body mass index, sex, lab values, lifestyle, etc. Another example is the exploration of Fibonacci “Golden Ratio” relationships that occur in nature and throughout anatomy and physiology; for instance, invoking ML to look for fractal patterns has been proposed in interpreting transthoracic echocardiography data to unmask branching arrangements of coronary artery lesions responsible for ST-segment elevation myocardial infarctions, in order to assess abnormalities in cardiac dimensions and to examine blood pressure patterns that may predict decompensation in congestive heart failure patients [7], [8].
-
•
Supervised ML involves labeled data based on predictive models supplied by a human operator. Here, an algorithm arrives at useful insights based on self-learning from previous data sets. For example, support vector machine (SVM) models [9] can classify ischemic heart disease as simple binary states of “good or bad” based on template electrocardiograms (ECGs) or on ejection fraction data measured by single photon emission tomography and echocardiography images. Other supervised models incorporate Bayesian networks, random forests, or least absolute shrinkage and selection operator (LASSO) regressions.
-
•
Reinforcement ML engages algorithms that either discover errors or bolster findings based on reward feedback from human intervention. This approach is being deployed for use in drug prescription recommendations and in imaging problems.
-
•
Deep-learning ML can be either supervised or unsupervised in enabling a computer to teach itself by exposure to vast datasets using multiple layers of artificial neural network computational nodes. Such algorithms can stitch together small samples of data scalable into a big picture with a predictable pattern. For example, deep-learning algorithms that combine retinal blood vessel images with clinical lab values can detect various CVD risk factors with accuracy comparable to human experts [10].
2. CVD machine learning on the street
Collectively, ML algorithms can use clinical and preclinical data of electrocardiography and echocardiography interpretations, MRI and computed tomography (CT) imaging diagnoses, scintigraphic myocardial perfusion imaging, and lab chemistry values to address predictions or diagnoses of stroke, heart failure, arrhythmias, coronary artery disease, acute myocardial infarction, hospital recidivism, transplant decisions, and other CVD issues requiring intervention decisions [11], [12]. Many ML algorithms have been shown to validate such risk prediction models reliably, yielding receiver operating characteristic area under the curve (ROC AUC) values in the range of ~0.7 to 0.9 [11], [12].
While numerous ML algorithms can explain the pathophysiological “why” underlying a given disorder, other approaches such as deep-learning engage “black-box” mathematical models that are of lesser value because they are devoid of extrapolation to actual medical insights. ML is gaining traction as an invaluable tool for advancing academic CVD basic science research discoveries, although literature meta-analyses reveal that considerably more work is required to translate ML algorithms into everyday clinical practices of reliable precision cardiology and patient management outside of major academic medical research institutions [2].
To improve accuracy and to validate results, large databases from high quality clinical trials are necessary to train the models. This is especially important for the elusive holy grail of assessing CVD risk in slowly emerging prodromic patients or mining risk assessment in electronic health records of asymptomatic patients. While the Framingham Risk Score was included in the American College of Cardiology/American Heart Association 2010 guideline [13], this traditional model is arguably not optimal because its incomplete variable set overestimates false positive CVD risk. To enhance diagnosis, the American Heart Association's Institute for Precision Cardiovascular Medicine [14] introduced patient genomic data into the mix as a laudable contributing factor.
Recent advances in microbiome metagenomics are contributing additional useful dimensions in applying ML to CVD. For example, assessing multifaceted gut microbiome–host interactions in mood disorders comorbid with CVD has surprisingly uncovered pathophysiological causal relationships that tie together heretofore strange bedfellow clinics: cardiology, psychiatry, and gastroenterology [15]. This is opening doors to integrate patient management across these traditionally siloed disciplines. One such illustration is the application of ML that unraveled a new putative phenotype of CVD denoted “depressive-hypertension” [15]. Here, hypertensive patients comorbid with depression harbor a unique signature of gastrointestinal microbial species' functional genomics in a model that disrupts central control of both blood pressure and mood. Further in these patients, ML analyses uninterestingly implied that oral ACE inhibitor drugs in the gut lumen exhibit antibiotic pharmacology that competitively inhibits key enzymes of proinflammatory gut bacterial cell wall biosynthesis associated with depression and hypertension [15]. Future expansion of these ML studies may lead to new or repurposed drugs to treat resistant hypertension.
3. Bioethical considerations
Unprecedented integration of facial expression, other “body language”, and volumes of lab data present unlimited opportunities for patient management. For example, imaging of facial expressions among patients in the critical care setting who may be too ill or sedated to express their discomfort has the potential to improve their management. The COVID-19 pandemic mandated virtual outpatient visit sessions revealing latent opportunities to apply ML to decipher facial expressions for the video images that would otherwise have not been captured during an in-person visit with a masked patient.
Nevertheless, making accurate predictions based on extensive use of personal biometric data raises the poignant issue that ML scientists and clinicians must rigorously address many ethical challenges. Attention must be paid to minimalizing biases and skewed representations relating to sex/gender, race, stereotypes, and physical limitations identities. Furthermore, in the light of the American Medical Association policy on AI in healthcare [16], ultimately clinicians must continue to treat patients humanistically regardless of the use of objective tools at their disposal. Emergence of journals such as Intelligence-Based Medicine (https://www.sciencedirect.com/journal/intelligence-based-medicine) shall facilitate this task.
4. Rolling out machine learning for CVD
ML holds much promise for the future of cardiology investigations and clinical translations. However, before ML becomes commonplace with FDA medical device approval, a considerable number of knowledge gaps exist. For example: i) numerous technological and societal/ethical challenges remain concerning standardization of gathering CVD data; and of ii) assessing model performances [12], [17]. Once the models are established as robust and ethically unbiased, then iii) infrastructure matters must be addressed such as supercomputer processing requirements for ambulatory wearable sensors, clinic-based online cloud needs, massive data storage requirements, real-time needs for interventional procedures, and ethical/social implications of equitable data access. Nevertheless, the future is bright for AI/ML as applied to the cardiovascular system.
Funding
Dr. Pepine receives support from NIH/NHLBI grants R01 HL132448 and R01 HL146158.
Declaration of competing interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
References
- 1.Cuocolo R., Perillo T., De Rosa E., Ugga L., Petretta M. Current applications of big data and machine learning in cardiology. J. Geriatr. Cardiol. 2019;16:601–607. doi: 10.11909/j.issn.1671-5411.2019.08.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Krittanawong C., Virk H.U.H., Bangalore S., Wang Z., Johnson K.W., Pinotti R., Zhang H., Kaplin S., Narasimhan B., Kitai T., Baber U., Halperin J.L., Tang W.H.W. Machine learning prediction in cardiovascular diseases: a meta-analysis. Sci. Rep. 2020;10:16057. doi: 10.1038/s41598-020-72685-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Seetharam K., Shrestha S., Sengupta P.P. Artificial intelligence in cardiovascular medicine. Curr. Treat. Options Cardiovasc. Med. 2019;21:25. doi: 10.1007/s11936-019-0728-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Lopez-Jimenez F., Attia Z., Arruda-Olson A.M., Carter R., Chareonthaitawee P., Jouni H., Kapa S., Lerman A., Luong C., Medina-Inojosa J.R., Noseworthy P.A., Pellikka P.A., Redfield M.M., Roger V.L., Sandhu G.S., Senecal C., Friedman P.A. Artificial intelligence in cardiology: present and future. Mayo Clin. Proc. 2020;95:1015–1039. doi: 10.1016/j.mayocp.2020.01.038. [DOI] [PubMed] [Google Scholar]
- 5.Johnson K.W., Torres Soto J., Glicksberg B.S., Shameer K., Miotto R., Ali M., Ashley E., Dudley J.T. Artificial intelligence in cardiology. J. Am. Coll. Cardiol. 2018;71:2668–2679. doi: 10.1016/j.jacc.2018.03.521. [DOI] [PubMed] [Google Scholar]
- 6.Jordan M.I., Mitchell T.M. Machine learning: trends, perspectives, and prospects. Science. 2015;349:255–260. doi: 10.1126/science.aaa8415. [DOI] [PubMed] [Google Scholar]
- 7.Gibson C.M., Gibson W.J., Murphy S.A., Marble S.J., McCabe C.H., Turakhia M.P., Kirtane A.J., Karha J., Aroesty J.M., Giugliano R.P., Antman E.M. And TIMI study group, Association of the Fibonacci Cascade with the distribution of coronary artery lesions responsible for ST-segment elevation myocardial infarction. Am. J. Cardiol. 2003;92:595–597. doi: 10.1016/s0002-9149(03)00731-8. [DOI] [PubMed] [Google Scholar]
- 8.Yetkin E., Ozturk S., Cuglan B., Turhan H. Golden ratio in congestive heart failure: a promising proportion for prognosis and decompensation. Cardiol. J. 2020;27:904–905. doi: 10.5603/CJ.2020.0177. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Yadav S.S., Jadhav S.M., Bonde R.G., Chaudhari S.T. 2020 3rd International Conference on Communication System, Computing and IT Applications (CSCITA), 3-4 April 2020. 2020. Automated cardiac disease diagnosis using support vector machine; pp. 56–61. [DOI] [Google Scholar]
- 10.Cheung C.Y., Xu D., Cheng C.-Y., Sabanayagam C., Tham Y.-C., Yu M., Rim T.H., Chai C.Y., Gopinath B., Mitchell P., Poulton R., Moffitt T.E., Caspi A., Yam J.C., Tham C.C., Jonas J.B., Wang Y.X., Song S.J., Burrell L.M., Farouque O., Li L.J., Tan G., Ting D.S.W., Hsu W., Lee M.L., Wong T.Y. A deep-learning system for the assessment of cardiovascular disease risk via the measurement of retinal-vessel calibre. Nat. Biomed. Eng. 2020 doi: 10.1038/s41551-020-00626-4. [DOI] [PubMed] [Google Scholar]
- 11.Weng S.F., Reps J., Kai J., Garibaldi J.M., Qureshi N. Can machine-learning improve cardiovascular risk prediction using routine clinical data? PLoS One. 2017;12 doi: 10.1371/journal.pone.0174944. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Ranka S., Reddy M., Noheria A. Artificial intelligence in cardiovascular medicine. Curr. Opin. Cardiol. 2021;36:26–35. doi: 10.1097/HCO.0000000000000812. [DOI] [PubMed] [Google Scholar]
- 13.Greenland P., Alpert J.S., Beller G.A., Benjamin E.J., Budoff M.J., Fayad Z.A., Foster E., Hlatky M.A., Hodgson J.M., Kushner F.G., Lauer M.S., Shaw L.J., Smith S.C., Jr., Taylor A.J., Jr., Weintraub W.S., Jr., Wenger N.K., Jr., Jacobs A.K., Jr., American College of Cardiology Foundation/American Heart Asscoiation Task Force on Practice Guidelines 2010 ACCF/AHA guideline for assessment of cardiovascular risk in asymptomatic adults: executive summary: a report of the American College of Cardiology Foundation/American Heart Association Task Force on Practice Guidelines. Circulation. 2010;122:2748–2764. doi: 10.1161/CIR.0b013e3182051bab. [DOI] [PubMed] [Google Scholar]
- 14.Houser S.R. The American Heart Association's new institute for precision cardiovascular medicine. Circulation. 2016;134:1913–1914. doi: 10.1161/CIRCULATIONAHA.116.022138. [DOI] [PubMed] [Google Scholar]
- 15.Stevens B.R., Pepine C.J., Richards E.M., Kim S., Raizada M.K. Depressive-hypertension: A proposed human endotype of brain/gut microbiome dysbiosis. Am. Heart J. 2021:100540. doi: 10.1016/j.ahj.2021.05.002. [DOI] [PubMed] [Google Scholar]
- 16.Crigger E., Khoury C. Making policy on augmented intelligence in health care. AMA J. Ethics. 2019;21:E188–E191. doi: 10.1001/amajethics.2019.188. [DOI] [PubMed] [Google Scholar]
- 17.US_FDA Artificial Intelligence/Machine Learning (AI/ML)-based_Software as a Medical Device (SaMD) action plan. 2021. https://www.fda.gov/media/145022/download
