Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2024 Jul 24.
Published in final edited form as: Heart Fail Clin. 2023 Apr 7;19(3):391–405. doi: 10.1016/j.hfc.2023.03.001

The Emerging Role of Artificial Intelligence in Valvular Heart Disease

Caroline Canning a,b, James Guo a,b, Akhil Narang a,b, James D Thomas a,b, Faraz S Ahmad a,b,c,*
PMCID: PMC11267973  NIHMSID: NIHMS2001655  PMID: 37230652

Abstract

Valvular heart disease (VHD) is a morbid condition in which timely identification and evidence-based treatments can lead to improved outcomes. Artificial intelligence broadly refers to the ability for computers to perform tasks and problem solve like the human mind. Studies applying AI to VHD have used a variety of structured (eg, sociodemographic, clinical) and unstructured (eg, electrocardiogram, phonocardiogram, and echocardiograms) and machine learning modeling approaches. Additional researches in diverse populations, including prospective clinical trials, are needed to evaluate the effectiveness and value of AI-enabled medical technologies in clinical care for patients with VHD.

Author Keywords: Artificial intelligence, Deep learning, Heart failure, Machine learning, Natural language processing, Valvular heart disease

Introduction

Valvular heart disease (VHD) has been found in 14% of patients referred for echocardiographic workup of suspected heart failure (HF)1 and 21% of patients hospitalized for HF.2 The epidemiology of VHD varies around the world. In high-income countries, VHD is predominantly due to degenerative disease, whereas in low-income and middle-income countries, VHD is predominantly due to rheumatic heart disease.3 Even with this variance, the global incidence of VHD has increased by 45% in the last 30 years due to an aging population,4 contributing to burden of degenerative disease.4, 5 Consequently, the proportion of HF secondary to VHD is likely to increase 3 Aortic and mitral valvulopathies predominate in prevalence, particularly calcific aortic valve disease and degenerative mitral valve disease.1, 3 Substantial advances in the management of severe VHD have been made in recent decades with the development of transcatheter approaches to valve repair and replacement. Transcatheter aortic and mitral valve procedures have increased by nearly 5-fold and 10-fold, respectively, in recent years with growing favorable outcomes.6, 7 Tricuspid transcatheter valve repair and replacement are also emerging therapies and are currently under investigation.8, 9

VHD is a ripe area for the development and testing of artificial intelligence (AI) or machine learning (ML) approaches. A systematic review of AI/ML in cardiovascular medicine found that less than 3% of studies were applied in this disease area.10 Cardiovascular medicine is increasingly digital, with widespread integration of electronic health records (EHRs) in clinical practice, repositories of cardiovascular diagnostic data including echocardiograms, electrocardiogram (ECG), phonocardiograms (PCGs), and cardiac computed tomography (CT) and MRI. The vast amount of data coupled with the increased availability of high-performance computing will be conducive to the continued growth in AI and ML research and application development to improve detection, risk prediction, and management of VHD.

AI broadly refers to the ability for computers to perform tasks and problem solve like the human mind. ML is a subset of AI that emerged from the fields of computer science and statistics. ML relies on input, specifically large collections of data, to recognize patterns, entities, and the connections between them. The discipline of ML explores the analysis and construction of algorithms that can learn from and make predictions based on data. ML algorithms build a mathematical model based on sample data, known as training data, to make predictions or decisions without being explicitly programmed to do so.11 An increasingly utilized subset of ML is deep learning (DL), or artificial neural networks (ANNs). This specifically refers to algorithms that are modeled after the human brain containing artificial “neurons” arranged in successive layers culminating in a decision function. This relationship is illustrated in Fig. 1.

Fig. 1.

Fig. 1.

Artificial intelligence and machine learning. Relationship between artificial intelligence, machine learning, and “deep” learning. CNN, convolutional neural network; DNN, deep neural network; GAN, generative adversarial network; KNN, k-nearest neighbor; PCA, principal components analysis; RF, random forests; RNN, recurrent neural network; SVM, Support vector machines; VAE, variational autoencoder. (From Wehbe RM, Khan SS, Shah SJ, Ahmad FS. Predicting High-Risk Patients and High-Risk Outcomes in Heart Failure. Heart Fail Clin. 2020;16(4):387–407.)

Typically, an ML dataset will be split into 3 categories: training, validation, and testing. The training subset is used to fit an ML model. The validation subset is to test the model and tweak certain hyperparameters because it is trained to optimize performance. Finally, the test subset is used to evaluate the fit, or model performance of the ML model. It is important that the test subset be segregated from the rest of the data so it cannot be included in the training and validation.

Two main approaches, supervised and unsupervised learning, have been widely applied to a variety of clinical problems and tasks in biomedical science. Supervised learning requires human input to label input and output training data. A model will learn the relationship between the inputs and outputs, and will adjust to make the correct answer. Therefore, supervised models are generally used for classification or regression (predicting outcomes). Unsupervised learning identifies intrinsic patterns within data without human intervention, and the output of the model is generally patterns or trends. These models generally include clustering, association tasks, dimensionality reduction, or factorization. Unsupervised learning can be used in exploratory data analysis to inform supervised learning tasks. Other types of learning frameworks, such as reinforcement learning, have been applied in health care. Several reviews have described in detail different ML approaches, applications, strengths, and limitations for addressing problems in health care more broadly and within cardiovascular medicine.10, 1217

In this review, we highlight areas AI/ML methods and their application may address the following unmet needs in the care of patients with VHD: (1) screening for VHD, (2) assessment of severity of disease, (3) risk prediction, and (4) phenomapping ( Fig. 2 ). Later, we discuss some of the challenges and future directions of AI/ML in VHD.

Fig. 2.

Fig. 2.

Clinical applications of AI/ML in valvular heart disease. Four main tasks for AI/ML in valvular heart disease are reviewed in this article: (1) screening to valvular heart disease, (2) assessment of severity of disease, (3) risk prediction, and (4) phenomapping. For each task, a variety of data types and models have been applied, predominantly in retrospective datasets. Each of these has the potential to lead to improvements in clinical care in 4 main areas but additional, prospective clinical studies are needed.

Screening for valvular heart disease

Several studies have shown that VHD remains underdiagnosed.1822 In the United States alone, national prevalence of VHD affects 2.5% of the population.19 Most patients are not screened for VHD until they report the onset of symptoms even though most cases remain asymptomatic for many years.19, 22 This presents an opportunity for early detection before the lesion becomes more severe, causes symptoms, or leads to ventricular dilation or dysfunction. More studies are calling for early identification and treatment of VHD,23, 24 and thus timely identification has implications for disease management and improvement of patient outcomes.2527 The gold standard of detecting VHD is via echocardiogram18 but due to cost and resource limitations it is usually reserved for either symptomatic patients or those for whom there is high clinical suspicion of VHD based on physical examination. Other cardiac examinations that are more widely indicated can help recognize patients with asymptomatic VHD that should be referred for an echo. Below we explore examples of studies that are using ML strategies to help identify patients with VHD through auscultation and ECG. Although these examples show that these technologies can identify patients with underdiagnosed VHD, prospective randomized controlled trials and cost-effectiveness analyses are needed to inform recommendations from guideline-writing organizations, such as various cardiovascular societies and the US Preventive Services Task Force.

Auscultation

The stethoscope was invented in 1818 by René Laennec. It has remained the most inexpensive and accessible modality to detect VHD as heart sounds can reveal the cause of a lesion as well as the severity. Although every clinician learns to auscultate as a foundational part of the physical examination, the ability to detect and interpret murmurs varies widely.28, 29 Thus, researchers have explored ways to aid clinicians in this task.

Attempts to classify pathologic condition in PCGs, or visual plot representation of heart sounds, has been performed for more than 50 years. This was accelerated by the 2016 PhysioNet/Computing in Cardiology Challenge,30 which published an open-access database of 3126 heart sound recordings. Researchers were tasked with identifying PCGs as normal, abnormal, or poor signal quality. The 3 top performing entrants used ensemble methods, or combination of different classifiers to improve accuracy. However, each team used different classifiers, demonstrating that different approaches could have comparable performance. 31 The feature extraction stage, which transforms the data into derived values, or features, to reduce redundancy was deemed key to effective models.31

Digital stethoscopes have enabled the recording and automated classification of heart sounds. These stethoscopes work by converting acoustic sound to electronic signals, which can be amplified for the examiner.32 Many models can transmit the information to a mobile app or computer in the form of an audio file or PCG.32

One AI-enabled digital stethoscope that detects and characterizes murmurs in adults and pediatric patients has received Food and Drug Administration (FDA) clearance and Conformite Europeenne (CE) mark. This was supported by a study from Chorba and colleagues,33 who created and tested a deep convolutional neural network (CNN) model that classifies PCGs in 3 categories: heart murmur, no murmur, or inadequate signal. To determine the accuracy of the signal quality and murmur detection performance, the algorithm output was compared with annotations from 3 cardiologists. Algorithm performance had a sensitivity and specificity for detecting murmurs of 76.3% and 91.4%. The study also used the gold standard echocardiogram to detect “clinically significant” (more than moderate) aortic stenosis (AS) and mitral regurgitation (MR). Detection of AS at the aortic position was 93.2% sensitive and 86.0% specific. The detection of MR at the mitral position was 66.2% sensitive and 94.6% specific.

Electrocardiogram

Besides auscultation, ECG is another relatively inexpensive and ubiquitous noninvasive test to evaluate the heart. In current clinical practice, valvular diseases are not directly diagnosed on ECG but evidence of structural heart disease, such as left ventricular hypertrophy, can provide clues for VHD leading to left ventricular remodeling. Several studies have examined whether DL can be applied to ECGs to detect VHD in patients.

The ability to detect and predict the future development of severe AS is of interest because the condition is associated with a mortality rate of 40% to 50% within 1 year without valve replacement.34, 35 Multiple groups of investigators have trained a specific neural network called a CNN, which are often used for image classification and process an image in a grid-like fashion, using ECG-Echo pairs. Kwon and colleagues36 used 39,371 ECGs from patients at a cardiology teaching hospital to train a CNN to detect AS, and then performed external validation on the algorithm with 10,865 ECGs from patients at a community hospital, which showed generalizability with an area under the curve (AUC) of 0.861, sensitivity 0.80, and specificity 0.78. They also used a saliency map, which focused on the T wave, to highlight the parts of the ECG that were used to predict presence of AS. This type of technique has been used to increase confidence for clinicians to accept the prediction results because the user sees that the algorithm is using an important feature to make a prediction rather than some artifact in the background. Results from Cohen Shelley and colleagues37 reinforce these findings. The authors trained a CNN to identify moderate-to-severe AS in 258,607 adults with ECG-TTE pairs, with a comparable AUC of 0.85, sensitivity of 0.78, and specificity of 0.74 when using just ECG morphology alone. Their saliency map similarly highlighted the TP segment as an area of importance. However, several studies have raised concerns about the accuracy and reliability of saliency maps for DL algorithms and emphasize the need for prospective testing of algorithms to generate the evidence base to support adoption over the reliance of saliency maps and other techniques to increase confidence of end-users.38, 39

Because the overall incidence of VHD is relatively low, the rECHOmmend study40 combined moderate-to-severe valvular disease (AS, AR, MS, MR, tricuspid regurgitation [TR]) with structural changes of reduced EF less than 50%, or ventricular septal thickness greater than 15 mm to achieve increased positive predictive value (PPV) for their models. Authors marked ECGs as positive for a given condition if it was acquired within a year before the patient’s first positive echocardiogram for the label. Then they trained a CNN to predict presence or absence of any of the 7 labels with ECG tracings and a range of input data including demographics, laboratories, findings, and ECG measurements. Although the model performed best with all inputs, the area under the receiver operating characteristic (AUROC) remained high (0.91) with age, sex, and ECG tracing data. This potentially enables more targeted referral for echocardiography to help detect underdiagnosed VHD, although prospective testing is needed to determine the clinical utility and cost-effectiveness of these approaches.

Assessment of severity for valvular heart disease

The studies that explore severity predominantly use echocardiography as an input, as echocardiography remains the primary modality for diagnosis and evaluation of VHD.18 Echocardiograms are one of the least expensive and most accessible forms of cardiac imaging.4143 Between 2001 and 2011, there were more than 7 million echocardiograms performed annually in the United States, with its use increasing by more than 3% yearly.41 AI has been used for a variety of tasks related to echocardiography,4447 including view acquisition, view classification,48 quality assessment,49 automation of measurements and observations,50, 51 and disease identification, such as cardiac amyloidosis52 and hypertrophic cardiomyopathy.5355

The use of AI to diagnose VHDs via echocardiographic images represents an emerging area of research. For example, Yang and colleagues56 developed a 3-stage DL framework that evaluated view classification, screening for VHDs (aortic and mitral lesions), and when positive, performed segmentation to provide metrics related to severity. The investigators used 2 different approaches to detect regurgitant and stenotic lesions. A CNN was trained to detect color in the left ventricular outflow tract (LVOT) or left atrium for AR and MR, respectively. It also measured anatomic structures to localize the regurgitation in time and space. For stenotic lesions, the algorithm identified the PLAX-2D and A4C views as input for the diagnosis and then calculated valve area with Doppler to calculate mitral stenosis (MS), and the Vmax and mean pressure gradients for AS. Overall, the DL model provided varying agreement when compared with measurements by 2 physician echocardiographers with Intersection of Union (IoU) metrics. IoU is also referred to as the Jaccard Index, is a common metric used in image segmentation. The lowest IoU was for AR jet 0.63, whereas the highest IoU was for the left atrial area 0.86. Next, the model achieved high performance for identifying valvular stenosis and regurgitation with AUC for MS 0.95, MR 0.97, AS 0.91, and AR 0.97 using retrospective data. Finally, the study concluded that the accuracy of the degree of valve lesion severity was within the bounds of normal practice when compared with 2 experienced physicians. They also validated their results in prospective, consecutive cases (n=1374) to demonstrate generalizability and adequate performance outside of the dataset used for training. This study describes in detail the development and testing of a pipeline for a complex set of tasks and posted their code in a public repository, both of which are essential to promoting reproducibility and advances in this field.

In a study by Sengupta and colleagues,57 used both unsupervised and supervised ML classifiers to predict high-severity or low-severity AS. Their model was trained on echocardiographic data from 1052 patients with AS and derived high-severity and low-severity groups of AS, they validated their association of severity with other markers of AS such as AV calcium score on CT, markers of myocardial damage on CMR, and biomarkers such as brain natriuretic peptide (BNP). The ML model was able to correctly classify 99% of patients with definitive echocardiographic features of severe AS into the high-severity group. Notably, it was also able to reclassify the remaining patients with either nonsevere AS or inconclusive discordant echocardiographic findings without additional tests. When compared with conventional strategies for AS severity classification, the ML severity classification improved discrimination (integrated discrimination improvement: 0.07; 95% CI: 0.02–0.12) and reclassification (net reclassification improvement: 0.17; 95% CI: 0.11–0.23) for the outcome of aortic valve replacement (AVR) at 5 years. The investigators show that ML can analyze echocardiographic measurements to accurately classify AS severity in most patients with potential to optimize timing of AVR.

Recent preliminary results from a commercial AI-echo company sought to detect and characterize aortic stenosis purely from B-mode images without spectral or color Doppler. Trained on 30,000 echoes (parasternal long axis and short axis and apical 5-chamber views), a test set demonstrated detection of moderate-or-severe AS with sensitivity of 91% and specificity of 94%.58 Another commercial AI company developed the ability to identify severe AS with clinical characteristics and echocardiogram measurements without LVOT data. The model performed equally independent of AS gradients.59 An increasing number of companies are developing technology in this space and will be subject to FDA review.

In addition to classifying VHD severity, AI can be used to acquisition of images by personnel without earlier ultrasound experience to obtain diagnostic quality echoes.60 With further development and validation for the destination of VHD, this use case of AI could be scaled to resource-limited settings without access to trained sonographers to better detect conditions such as rheumatic heart disease.

Besides using echocardiogram images and videos, natural language processing (NLP) has been used to extract valve severity from echocardiogram reports given the challenges with capturing and using structured data generated by echocardiographers and the limited accuracy of International Classification of Diseases (ICD) coding.61, 62 NLP can be rule-based or use ML. Solomon and colleagues61 developed an NLP algorithm with the goal of making sure that all patients with VHD are appropriately followed. The NLP algorithm had a 99% PPV and 99% NPV for identifying prevalent AS. They found that among those classified as having AS by the NLP, only 64.6% of patients had an ICD diagnosis code of AS. They also found that the NLP-derived hemodynamic parameters were concordant with the physician designated levels of severity. Identifying and recording a patient’s diagnosis in the medical record and developing care pathways can help increase rates of appropriate follow-up in accordance to clinical guidelines.18

Another strategy to make sure a diagnosis is not missed is through clinical decision support (CDS). CDS are tools embedded in a clinical workflow, such as the EHR, to help improve decision-making. A multicenter study of 35 institutions used 1,147,157 echocardiograms to build a tool to help identify patients who may mistakenly not have received a diagnosis of severe AS called the Diagnostic Precision Algorithm.63 The tool uses echocardiographic parameters to determine if patients with similar measurements would be likely to have severe AS, then it prioritizes patients for review based on how likely these valve measurements would be considered severe. The NLP is used to detect that the echocardiogram report was not categorized as severe AS. Overall, the Diagnostic Precision Algorithm was able to predict the proportion of severe AS versus the actual proportion with an average error of 2.1% in the prediction model with left ventricular ejection fraction (LVEF) and an error of 2.2% without LVEF. As the authors highlight, the algorithm demonstrated high accuracy and yields good potential for application to assist physicians to follow-up with patients with potentially undiagnosed severe AS.18

Alternatively to clinician CDS, similar to HF, workflows could be created to either support a population health approach in which a member of the clinical team reviews cases and works with the patients’ primary team to ensure adherence to guideline recommends or directly nudge the patient to discuss evidence-based care for VHD with their clinician.64, 65

Risk prediction

Risk prediction models are equations that use patient risk factors to estimate the probability of a health-care outcome. Well-known risk-prediction models include the Well’s Criteria66 for pulmonary embolism or the Quick SOFA (qSOFA) score67 for identifying high-risk mortality from sepsis in critically ill patients. ML is just one tool among many that can be used to create these models. In VHD, ML has been applied largely in the area of predicting outcomes and response to AVR rather than determining the risk of developing VHD.

The discrepancy of the aortic valve area and low transvalvular gradient makes the management of patients with low gradient AS challenging.68 Namasivayam and colleagues69 developed the Aortic Stenosis Risk (ASteRisk) with the goal of predicting outcomes within 5 years of follow-up for patients with moderate-to-severe AS that would be interpretable and also effective in patients with low-gradient AS. Using bootstrap Lasso regression (Least Absolute Shrinkage and Selection Operator), they selected a subset of variables for a logistic regression model that minimized prediction error (AUC = 0.74). They limited the ASteRisk score to include 9 readily available variables that include a mix of echocardiogram measurements and clinical history to maximize the ease of use in practice. Some of the risk factors included aortic valve area, mean gradient, energy loss, HF, hyperlipidemia, and CKD. They found the ASteRisk model superior to predicting all-cause mortality or AVR in moderate-to-severe AS patients for years 2 to 5, versus a baseline model that used more conventional risk factors (mean transvalvular gradient, aortic valve area, age, and LVEF). The model also worked in patients with low-gradient AS. They tested generalizability on patients from another institution that identified patients at high risk of death at 1 to 5 years. Additional external validation and, ideally, prospective testing, will help elucidate the clinical usefulness of this model for clinical decision-making.

CMR can provide information about not only the structure and function of the heart but also the degree of myocardial fibrosis.70 Kwak and colleagues set out to investigate if how markers of damage on CMR could predict prognosis in severe AS and help guide timing for AVR.71 Investigators used a supervised ML method called Random Forest, a technique that outputs the average result of multiple decision trees, to determine the predictors of death in patients with severe AS undergoing AVR. The study used clinical, echocardiogram, and CMR variables but remained focused on 4 CMR variables, which emerged as most predictive of all cause mortality: extracellular volume fraction (ECV%), late gadolinium enhancement (LGE%), right ventricular ejection fraction (RVEF), and indexed left ventricular end-diastolic volume (LVEDVi). They analyzed the parameters of each variable associated with markedly worse prognosis and built an AS-CMR risk score, in which a point was given for ECV% greater than 27%, REVF 50 or lesser or greater than 80%, LGE% greater than 2%, and LVEDVi 55 or lesser or greater than 80 mL/m2 . Noting the importance of RVEF as an important prognostic marker, authors posited that these patients may benefit from TAVR, where RV function is generally maintained, versus SAVR, which is associated with RV dysfunction after surgery.

Phenomapping

Many have posited that ML will be a critical tool in achieving personalized medicine, where medical decisions are tailored to an individual’s phenotype. VHDs, even of the same severity, are heterogenous syndromes similar to other cardiovascular conditions. ML methods have been used in multiple studies to define phenogroups, or groups of patients with similar clinical characteristics, and differential risk of adverse outcomes.

In aortic stenosis, valve area, transaortic gradient, and maximum aortic velocity are the primary quantitative measurements to stratify severity. One study of patients with severe aortic stenosis used 3 unsupervised, independent methods to identify 4 phenogroups.72 Then, the investigators used a supervised algorithm to predict survival outcomes by each cluster. Each group had an increased amount of myocardial damage and mortality, regardless of the severity of AS. Therefore, authors concluded cardiac remodeling and progression of systolic and diastolic dysfunction plays a large role in prognosis. Guidelines recommend patients with symptomatic, severe AS undergoing evaluation for aortic valve replacement18 but these results suggest cardiac remodeling may be another factor during the evaluation for timing of intervention in asymptomatic patients.72

Several other studies of AS have used unsupervised methods to identify clusters with different rates of adverse events after valve replacement. Lachmann and colleagues73 used unsupervised clustering among patients undergoing TAVR for severe AS, identifying 4 clusters with variations of pulmonary hypertension, LVEF, and prevalence of other valvular defects. Then, investigators tested the ability to reproduce the phenogroups prospectively by categorizing the patients into clusters with an ANN, following their clinical course, and measuring the different rates of adverse events between groups. Kwak and colleagues74 grouped 398 patients with moderate-to-severe AS into 3 clusters that were broadly defined as patients with (1) cardiac dysfunction, (2) comorbidities (particularly ESRD), and (3) healthy AS with neither cardiac dysfunction nor comorbidities. Although AS severity between clusters did not differ, mortality rate was significantly different between clusters. During a median 2.4-year follow-up, the comorbidities cluster had the highest mortality rate of 19.8% versus the healthy AS cluster had the lowest mortality rate of 6.0% during the 2.4 years.

Phenomapping has also produced valuable insights about MR. There are many causes of secondary MR.75 Bartko and colleagues76 focused on phenomapping in sMR in patients with HF with reduced ejection fraction (HFrEF) using echocardiogram measurements and clinical variables. Principal component analysis, which is an unsupervised learning technique for dimensionality reduction, identifies the most informative variables in the data. Investigators found 4 phenogroups with different morphologies of varying LV and LA size and dilation. Each cluster had stepwise progression in sMR severity but regurgitant fraction peaked in Cluster 3, which was characterized by small LV and extensive LA dilation. The approach suggested that clinicians should consider not only LV remodeling but also LA and mitral valve annulus. Notably, authors published a simplified calculator that clinicians can use to identify which cluster an individual patient belongs to. Further translating these phenogroups into practice, the investigators remarked that these phenotypes may help explain difference in outcome from 2 landmark MitraClip (Abbot Laboratories) clinical trials, and the need to better understand patient selection and MR phenogroups.77, 78

The 2020 ACC/AHA Guidelines for Management of Patients with Valvular Heart Disease noted thresholds for interventions are expanding because of randomized clinical trials.18 Whether phenogroup mapping will provide clinically meaningful information and ultimately inform the decision-making process for valvular interventions remain unanswered questions.

Discussion

We have discussed some of the unmet needs in the early detection, diagnosis, and management of VHD that may be addressed by AI-enabled technologies. Examples include screening for VHD with ECG or PCG, improving the classification of the severity of specific types of VHD, and phenomapping and risk prediction to identify subgroups at differential risk for worsening disease and cardiovascular events ( Tables 1 and 2 , see Fig. 2 ). Most research currently leverages data from a single modality or multimodal tabular data but a growing area AI and VHD research in the coming years will be developing models leveraging a combination of unstructured and structured, multimodal, longitudinal data.79 Research will also likely increase training models using self-supervised learning, which has outperformed other DL algorithms for several language tasks and now is being more widely tested in computer vision, signal analytics, and other data types.14

Table 1.

Artificial intelligence applications included in this review

Valve Valvulopathy Screening for VHD Severity of Disease Risk Stratification Phenomapping
Aortic Stenosis
Regurgitation
x
x
x
x
x x
Mitral Stenosis
Regurgitation
x
x
x
x

x
Tricuspid Stenosis
Regurgitation

x
Not present
Pulmonary Stenosis
Regurgitation
Not present

Table 2.

Highlighted artificial intelligence/machine learning algorithms in valvular heart disease

Category Title Authors ML Approach Model Training Data Objective Key Findings
Screening for VHD rECHOmmend: An ECG-Based Machine Learning Approach for Identifying Patients at Increased Risk of Undiagnosed Structural Heart Disease Detectable by Echocardiography Ulloa-Cerna AE, Jing L, Pfeifer JM, et al. Convolutional Neural Network with XGBoost 2,925,925 ECGs from 631,710 patients, clinical demographics (age, sex), laboratories, vitals Detect AS, AR, MS, MR, TR, reduced EF <50% or ventricular septum thickness >15 mm Model using just ECG tracing, age, sex had AUC = 0.91 at sensitivity 0.90, with a PPV of 0.42 for clinically meaningful disease. Validated at 10 other sites, with AUCs 0.79–0.93
Severity of Disease Automated Analysis of Doppler Echocardiographic Videos as a Screening Tool for Valvular Heart Diseases Yang Feifei, Chen Xiaotian, Lin Xixiang, et al. Convolutional Neural Network 1,335 echocardiogram Doppler videos and images Echocardiogram view classification, segmentation, identification of stenosis vs regurgitation, and grading of severity In retrospective dataset for identifying aortic and mitral valve stenosis and regurgitation all accuracy ≥0.87, sensitivities ≥0.82, specificities ≥0.88. For prospective cohort, model performance similar for stenosis, and slightly decreased for regurgitation
Risk Stratification Markers of Myocardial Damage Predict Mortality in Patients With Aortic Stenosis Kwak S, Everett RJ, Treibel TA, et al. Random Survival Forest CMR measurements, echocardiogram measurements, demographics, clinical variables from 440 patients Identify markers of myocardial fibrosis in CMR that are prognostic in AS for patients undergoing AVR Created the AS-CMR score to identify patients at high risk of mortality post-AVR. Patient gets one point for ECV % >27%, RVEF ≤50 or >80%, LGE % >2%, LVEDVi ≤55 or >80 mL/m2. Each point has statistically significant difference in 3 y mortality
Phenomapping Principal Morphomic and Functional Components of Secondary Mitral Regurgitation Bartko PE, Heitzinger G, Spinka G, et al. Principal Component Analysis Morphology and functional parameters from echocardiograms (32 variables) from 383 patients Identify different phenogroups of secondary MR patients with HFrEF Identified four clusters of patients with atrial and ventricular morphologies, and regurgitant fractions. Each cluster had statistically significant survival times. Published an online tool to calculate for individual patient

Abbreviations: AR, aortic regurgitation; AS, aortic stenosis; AVR, aortic valve replacement; CMR, cardiovascular magnetic resonance; ECG, electrocardiogram; ECV%, extracellular volume fraction; EF, ejection fraction; HfrEF, heart failure with reduced ejection fraction; LGE%, late gadolinium enhancement; LVEDVi, indexed left ventricular end-diastolic volume; MR, mitral regurgitation; MS, mitral stenosis; RVEF, right ventricular ejection fraction; TR, tricuspid regurgitation.

Another expanding area of research will be the application of large language models. ChatGPT and BioGPT, released by OpenAI and Microsoft in late 2022, are transformer-based language models that have caught the attention of researchers and lay-people alike but have yet to be applied to VHD research. The application of large language models to VHD and health care more broadly must be taken when using these limitations of these models as noted by the ChatGPT developers: “ChatGPT sometimes writes plausible-sounding but incorrect or nonsensical answers.”80

The application of AI to VHD remains an emerging area of research and application development. The Food and Drug Administration has 3 specific pathways for the approval of AI/ML-based medical technologies and requires the use of “locked” algorithms—algorithms that remain static over time—for use as part of clinical care.81 However, beyond FDA approval, often several additional steps are necessary before broader adoption of the AI/ML based medical technology into clinical practice and incorporation into clinical guidelines. For example, the accurate identification of prevalent VHD with ECG or PCG represents an initial step. Prospective clinical trials evaluating the clinical effectiveness and cost-effectiveness of screening strategies are an important step for adoption into clinical guidelines. Furthermore, implementation studies examining barriers and facilitators to and costs of implementation in hospitals and clinics are also essential to broader adoption of these technologies.82, 83 Finally, several factors unrelated to the algorithms themselves (known as dataset shift) can lead to decreased model performance over time.84 There remains a need for the development of robust governance systems to monitor model performance once deployed into clinical settings.

Strategies to better identify and reduce bias in ML models is an active area of investigation in the ML community. Several studies have shown that ML models can embed aspects of structural racism, gender, and implicit bias into their algorithms and potentially exacerbate health inequities.8588 Addressing these biases during the training, testing, and validation of AI/ML technologies for VHD will be critical to realize the potential benefit of these technologies across diverse populations and as a potential lever to advance equity.

In addition to concerns with biases in algorithms and deprecation in model performance when applied to new populations or over time, the “black box” nature of many algorithms remains a challenge to adoption. This issue is particularly salient in cardiovascular imaging as CNNs, which are black box algorithms, are widely used in studies. Although some clinicians may remain skeptical of AI-enabled technologies without the use of methods to identify the primary features contributing to a prediction, several experts have demonstrated the limitations of these methods.38, 39 Ultimately, similar to medications, the mechanism of action may be less important in the presence of high-quality evidence of benefit generated by randomized controlled trials. 38

Guidelines are emerging to promote transparency in reporting and evaluating AI/ML clinical trials in a standardized fashion. The Consolidated Standards of Reporting Trials–Artificial Intelligence provides general recommendations for reporting of AI clinical trials in medicine that have been endorsed by several scientific journals.89 The Proposed Recommendations for Cardiovascular Imaging-Related Machine Learning Evaluation (PRIME) checklist provides best practice strategies to promote standardization and reduce bias more specifically related to cardiovascular imaging.90 Guidelines for the use of AI for echocardiography and other specific modalities are likely to develop in the near future in light of the growing impact of AI and myriad challenges, including clinical trial reporting standardization, bias reduction, and the understanding of medicolegal consequences of using AI/ML in practice.91 Adherence to reporting guidelines and sharing code and datasets when able would contribute to greater transparency, rigor, and reproducibility and acceleration research and application development in this area.90, 9295

In other areas of cardiovascular medicine, such as screening for atrial fibrillation or left ventricular dysfunction,37, 9698 and in other specialties,99 prospective cohort studies and clinical trials have helped form the foundation of the required evidence base for the scaling of these technologies. Similar studies in VHD will be essential to advancing the use of AI for improving outcomes for patients with VHD.

Summary

VHD is a morbid condition in which AI has the potential to improve screening, diagnosis, phenomapping, and risk prediction. Currently, most of the research and applications remain relatively early in the development and implementation lifecycle. Future research should prioritize adherence to reporting guidelines, sharing code and datasets when able, external validation in diverse populations, measuring and reducing bias in algorithms, prospective evaluation in clinical trials, and development of systems for ongoing monitoring of model performance.

Key Points.

  • Valvular heart disease is a morbid condition with several unmet needs.

  • Machine learning has the potential to improve the care of patients with valvular heart disease by facilitating screening, improving severity classification, identifying phenogroups, and improving risk prediction.

  • Applications of machine learning in valvular heart disease are early in development and require additional testing.

Clinics Care Points.

  • ML not only has the potential to improve the care of patients with VHD but also has pitfalls and limitations.

  • ML approaches may in the future play an important role in the care of patients with VHD but AI-applications require rigorous testing, including prospective evaluation in clinical trials.

Funding

Dr Ahmad was supported by grants from the National Institutes of Health / National Heart, Lung, and Blood Institute (K23HL155970) and the American Heart Association (AHA number 856917).

Footnotes

Disclosure

F.S. Ahmad receives consulting fees from Pfizer and Livongo Teladoc outside of this study. C. Canning is an independent contractor at Tempus.

References

  • 1.Marciniak A, Glover K, Sharma R. Cohort profile: prevalence of valvular heart disease in community patients with suspected heart failure in UK. BMJ Open 2017;7(1):e012240. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Philbin EF, DiSalvo TG. Prediction of hospital readmission for heart failure: development of a simple risk score based on administrative data. Rev Port Cardiol 1999;18(9):855–6. [PubMed] [Google Scholar]
  • 3.Coffey S, Roberts-Thomson R, Brown A, et al. Global epidemiology of valvular heart disease. Nat Rev Cardiol 2021;18(12):853–64. [DOI] [PubMed] [Google Scholar]
  • 4.Chen J, Li W, Xiang M. Burden of valvular heart disease, 1990–2017: Results from the Global Burden of Disease Study 2017. J Glob Health 2020;10(2):20404. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Baumgartner H, Falk V, Bax JJ, et al. 2017 ESC/ EACTS Guidelines for the management of valvular heart disease. Eur Heart J 2017;38(36):2739–91. [DOI] [PubMed] [Google Scholar]
  • 6.Carroll JD, Mack MJ, Vemulapalli S, et al. STS-ACC TVT Registry of Transcatheter Aortic Valve Replacement. J Am Coll Cardiol 2020;76(21):2492–516. [DOI] [PubMed] [Google Scholar]
  • 7.Mack M, Carroll JD, Thourani V, et al. Transcatheter Mitral Valve Therapy in the United States: A Report From the STS-ACC TVT Registry. J Am Coll Cardiol 2021;78(23):2326–53. [DOI] [PubMed] [Google Scholar]
  • 8.Goldberg YH, Ho E, Chau M, et al. Update on Transcatheter Tricuspid Valve Replacement Therapies. Front Cardiovasc Med 2021;8:619558. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Fam NP, von Bardeleben RS, Hensey M, et al. Transfemoral Transcatheter Tricuspid Valve Replacement With the EVOQUE System. JACC Cardiovasc Interv 2021;14(5):501–11. [DOI] [PubMed] [Google Scholar]
  • 10.Friedrich S, Groß S, Kö nig IR et al. Applications of artificial intelligence/machine learning approaches in cardiovascular medicine: a systematic review with recommendations. Eur Heart J Digit Health 2021;2(3):424–36. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Beam AL, Kohane IS. Big Data and Machine Learning in Health Care. JAMA 2018;319(13): 1317–8. [DOI] [PubMed] [Google Scholar]
  • 12.Zhou J, Du M, Chang S, et al. Artificial intelligence in echocardiography: detection, functional evaluation, and disease diagnosis. Cardiovasc Ultrasound 2021;19(1):29. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Sermesant M, Delingette H, Cochet H, et al. Applications of artificial intelligence in cardiovascular imaging. Nat Rev Cardiol 2021;18(8):600–9. [DOI] [PubMed] [Google Scholar]
  • 14.Krishnan R, Rajpurkar P, Topol EJ. Self-supervised learning in medicine and healthcare. Nat Biomed Eng 2022. 10.1038/s41551-022-00914-1. Published online August 11. [DOI] [PubMed] [Google Scholar]
  • 15.Acosta JN, Falcone GJ, Rajpurkar P, et al. Multimodal biomedical AI. Nat Med 2022;28(9):1773–84. [DOI] [PubMed] [Google Scholar]
  • 16.Rajpurkar P, Chen E, Banerjee O, et al. AI in health and medicine. Nat Med 2022;28(1):31–8. [DOI] [PubMed] [Google Scholar]
  • 17.Ahmad FS, Luo Y, Wehbe RM, et al. Advances in Machine Learning Approaches to Heart Failure with Preserved Ejection Fraction. Heart Fail Clin 2022;18(2):287–300. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Otto CM, Nishimura RA, Bonow RO, et al. 2020 ACC/ AHA Guideline for the Management of Patients With Valvular Heart Disease: A Report of the American College of Cardiology/American Heart Association Joint Committee on Clinical Practice Guidelines. Circulation 2021;143(5):e72–227. [DOI] [PubMed] [Google Scholar]
  • 19.Nkomo VT, Gardin JM, Skelton TN, et al. Burden of valvular heart diseases: a population-based study. Lancet 2006;368(9540):1005–11. [DOI] [PubMed] [Google Scholar]
  • 20.Alexander KM, Orav J, Singh A, et al. Geographic Disparities in Reported US Amyloidosis Mortality From 1979 to 2015: Potential Underdetection of Cardiac Amyloidosis. JAMA Cardiol 2018;3(9):865–70. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.d’Arcy JL, Coffey S, Loudon MA, et al. Large-scale community echocardiographic screening reveals a major burden of undiagnosed valvular heart disease in older people: the OxVALVE Population Cohort. Eur Heart J. Published online 2016. Available at: https://academic.oup.com/eurheartj/article-abstract/37/47/3515/2844994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Maron MS, Hellawell JL, Lucove JC, et al. Occurrence of Clinically Diagnosed Hypertrophic Cardiomyopathy in the United States. Am J Cardiol 2016; 117(10):1651–4. [DOI] [PubMed] [Google Scholar]
  • 23.Kang DH, Park SJ, Lee SA, et al. Early Surgery or Conservative Care for Asymptomatic Aortic Stenosis. N Engl J Med 2020;382(2):111–9. [DOI] [PubMed] [Google Scholar]
  • 24.Carabello BA. Aortic Valve Replacement Should Be Operated on Before Symptom Onset. Circulation 2012;126(1):112–7. [DOI] [PubMed] [Google Scholar]
  • 25.Ross J Jr, Braunwald E. Aortic stenosis. Circulation 1968;38(1 Suppl):61–7. [DOI] [PubMed] [Google Scholar]
  • 26.Cheitlin MD, Gertz EW, Brundage BH, et al. Rate of progression of severity of valvular aortic stenosis in the adult. Am Heart J 1979;98(6):689–700. [DOI] [PubMed] [Google Scholar]
  • 27.Curtis JP, Sokol SI, Wang Y, et al. The association of left ventricular ejection fraction, mortality, and cause of death in stable outpatients with heart failure. J Am Coll Cardiol 2003;42(4):736–42. [DOI] [PubMed] [Google Scholar]
  • 28.Dobrow RJ, Calatayud JB, Abraham S, et al. A study of physician variation in heart-sound interpretation. Med Ann Dist Columbia 1964;33:305–8. [PubMed] [Google Scholar]
  • 29.Etchells E, Bell C, Robb K. Does This Patient Have an Abnormal Systolic Murmur? JAMA 1997;277(7): 564–71. [PubMed] [Google Scholar]
  • 30.Clifford GD, Liu C, Moody B, et al. Classification of normal/abnormal heart sound recordings: The PhysioNet/Computing in Cardiology Challenge 2016. In: 2016 computing in cardiology conference (CinC). 2016. p. 609–12. [Google Scholar]
  • 31.Clifford GD, Liu C, Moody B, et al. Recent advances in heart sound analysis. Physiol Meas 2017;38(8): E10–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Swarup S, Makaryus AN. Digital stethoscope: technology update. Med Devices 2018;11:29–36. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Chorba JS, Shapiro AM, Le L, et al. Deep Learning Algorithm for Automated Cardiac Murmur Detection via a Digital Stethoscope Platform. J Am Heart Assoc 2021;10(9):e019905. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Carabello BA. Transcatheter aortic-valve implantation for aortic stenosis in patients who cannot undergo surgery. Curr Cardiol Rep 2011;13(3): 173–4. [DOI] [PubMed] [Google Scholar]
  • 35.Ben-Dor I, Pichard AD, Gonzalez MA, et al. Correlates and causes of death in patients with severe symptomatic aortic stenosis who are not eligible to participate in a clinical trial of transcatheter aortic valve implantation. Circulation 2010;122(11 Suppl): S37–42. [DOI] [PubMed] [Google Scholar]
  • 36.Kwon JM, Lee SY, Jeon KH, et al. Deep Learning-Based Algorithm for Detecting Aortic Stenosis Using Electrocardiography. J Am Heart Assoc 2020;9(7): e014717. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Cohen-Shelly M, Attia ZI, Friedman PA, et al. Electrocardiogram screening for aortic valve stenosis using artificial intelligence. Eur Heart J 2021;42(30): 2885–96. [DOI] [PubMed] [Google Scholar]
  • 38.Ghassemi M, Oakden-Rayner L, Beam AL. The false hope of current approaches to explainable artificial intelligence in health care. Lancet Digit Health 2021;3(11):e745–50. [DOI] [PubMed] [Google Scholar]
  • 39.Saporta A, Gui X, Agrawal A, et al. Benchmarking saliency methods for chest X-ray interpretation. Nat Mach Intell 2022;4(10):867–78. [Google Scholar]
  • 40.Ulloa-Cerna AE, Jing L, Pfeifer JM, et al. rECHOmmend: An ECG-Based Machine Learning Approach for Identifying Patients at Increased Risk of Undiagnosed Structural Heart Disease Detectable by Echocardiography. Circulation 2022;146(1):36–47. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Papolos A, Narula J, Bavishi C, et al. Hospital Use of Echocardiography: Insights From the Nationwide Inpatient Sample. J Am Coll Cardiol 2016;67(5):502–11. [DOI] [PubMed] [Google Scholar]
  • 42.Douglas PS, Khandheria B, Stainback RF, et al. ACCF/ASE/ACEP/AHA/ASNC/SCAI/SCCT/SCMR 2008 appropriateness criteria for stress echocardiography: a report of the American College of Cardiology Foundation Appropriateness Criteria Task Force, American Society of Echocardiography, American College of Emergency Physicians, American Heart Association, American Society of Nuclear Cardiology, Society for Cardiovascular Angiography and Interventions, Society of Cardiovascular Computed Tomography, and Society for Cardiovascular Magnetic Resonance endorsed by the Heart Rhythm Society and the Society of Critical Care Medicine. J Am Coll Cardiol 2008;51(11):1127–47. [DOI] [PubMed] [Google Scholar]
  • 43.Matulevicius SA, Rohatgi A, Das SR, et al. Appropriate use and clinical impact of transthoracic echocardiography. JAMA Intern Med 2013;173(17): 1600–7. [DOI] [PubMed] [Google Scholar]
  • 44.Zhang J, Gajjala S, Agrawal P, et al. Fully automated echocardiogram interpretation in clinical practice. Circulation 2018;138(16):1623–35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Ghorbani A, Ouyang D, Abid A, et al. Deep Learning Interpretation of Echocardiograms. doi: 10.1101/681676. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Tromp J, Bauer D, Claggett BL, et al. A formal validation of a deep learning-based automated workflow for the interpretation of the echocardiogram. Nat Commun 2022;13(1):6776. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Ouyang D, He B, Ghorbani A, et al. Video-based AI for beat-to-beat assessment of cardiac function. Nature 2020;580(7802):252–6. 10.1038/s41586-020-2145-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Madani A, Arnaout R, Mofrad M, et al. Fast and accurate view classification of echocardiograms using deep learning. NPJ Digit Med 2018;1. 10.1038/s41746-017-0013-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Abdi AH, Luong C, Tsang T, et al. Automatic Quality Assessment of Echocardiograms Using Convolutional Neural Networks: Feasibility on the Apical Four-Chamber View. IEEE Trans Med Imaging 2017;36(6): 1221–30. [DOI] [PubMed] [Google Scholar]
  • 50.Asch FM, Poilvert N, Abraham T, et al. Automated Echocardiographic Quantification of Left Ventricular Ejection Fraction Without Volume Measurements Using a Machine Learning Algorithm Mimicking a Human Expert. Circ Cardiovasc Imaging 2019;12(9): e009303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Diller GP, Babu-Narayan S, Li W, et al. Utility of machine learning algorithms in assessing patients with a systemic right ventricle. Eur Heart J Cardiovasc Imaging 2019;20(8):925–31. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Goto S, Mahara K, Beussink-Nelson L, et al. Artificial intelligence-enabled fully automated detection of cardiac amyloidosis using electrocardiograms and echocardiograms. Nat Commun 2021;12(1):2726. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Goto S, Solanki D, John JE, et al. Multinational Federated Learning Approach to Train ECG and Echocardiogram Models for Hypertrophic Cardiomyopathy Detection. Circulation 2022;146(10): 755–69. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Duffy G, Cheng PP, Yuan N, et al. High-Throughput Precision Phenotyping of Left Ventricular Hypertrophy With Cardiovascular Deep Learning. JAMA Cardiol 2022;7(4):386–95. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Soto JT, Weston Hughes J, Sanchez PA, et al. Multimodal deep learning enhances diagnostic precision in left ventricular hypertrophy. Eur Heart J Digit Health 2022;3(3):380–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Feifei Yang, Xiaotian Chen, Xixiang Lin, et al. Automated Analysis of Doppler Echocardiographic Videos as a Screening Tool for Valvular Heart Diseases. JACC Cardiovasc Imaging 2022;15(4): 551–63. [DOI] [PubMed] [Google Scholar]
  • 57.Sengupta Partho P, Shrestha S, Kagiyama N, et al. A Machine-Learning Framework to Identify Distinct Phenotypes of Aortic Stenosis Severity. JACC Cardiovasc Imaging 2021;14(9):1707–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Poilvert N, Goldstein S, Lang RM, et al. Abstract 14356: Machine Learning for Detection of Presence and Severity of Aortic Stenosis From B-mode Ultrasound Images: Results of a Blinded Clinical Trial. Circulation 2022;146(Suppl_1):A14356. [Google Scholar]
  • 59.Playford D, Bordin E, Mohamad R, et al. Enhanced Diagnosis of Severe Aortic Stenosis Using Artificial Intelligence: A Proof-of-Concept Study of 530,871 Echocardiograms. JACC Cardiovasc Imaging 2020;13(4):1087–90. [DOI] [PubMed] [Google Scholar]
  • 60.Narang A, Bae R, Hong H, et al. Utility of a Deep-Learning Algorithm to Guide Novices to Acquire Echocardiograms for Limited Diagnostic Use. JAMA Cardiol 2021;6(6):624–32. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Solomon MD, Tabada G, Allen A, et al. Large-scale identification of aortic stenosis and its severity using natural language processing on electronic health records. Cardiovasc Digit Health J 2021;2(3):156–63. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Strom JB, Xu J, Sun T, et al. Characterizing the Accuracy of International Classification of Diseases, Tenth Revision Administrative Claims for Aortic Valve Disease. Circulation 2022;15(10). 10.1161/circoutcomes.122.009162. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Thomas JD, Petrescu OM, Moualla SK, et al. Artificial intelligence to assist physicians in identifying patients with severe aortic stenosis. Intelligence-Based Medicine 2022;6:100059. [Google Scholar]
  • 64.Allen LA, Venechuk G, McIlvennan CK, et al. An Electronically Delivered Patient-Activation Tool for Intensification of Medications for Chronic Heart Failure With Reduced Ejection Fraction: The EPIC-HF Trial. Circulation 2021;143(5):427–37. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Cheema Baljash, Kannan Mutharasan R Sharma Aditya, et al. Augmented Intelligence to Identify Patients With Advanced Heart Failure in an Integrated Health System. JACC (J Am Coll Cardiol): Advances 2022;1(4):1–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Wells PS, Anderson DR, Rodger M, et al. Excludingpulmonary embolism at the bedside without diagnostic imaging: management of patients with suspected pulmonary embolism presenting to the emergency department by using a simple clinical model and d-dimer. Ann Intern Med 2001;135(2):98–107. [DOI] [PubMed] [Google Scholar]
  • 67.Seymour CW, Liu VX, Iwashyna TJ, et al. Assessment of Clinical Criteria for Sepsis: For the Third International Consensus Definitions for Sepsis and Septic Shock (Sepsis-3). JAMA 2016;315(8): 762–74. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Clavel MA, Magne J, Pibarot P. Low-gradient aortic stenosis. Eur Heart J 2016;37(34):2645–57. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Namasivayam M, Myers PD, Guttag JV, et al. Predicting outcomes in patients with aortic stenosis using machine learning: the Aortic Stenosis Risk (ASteRisk) score. Open Heart 2022;9(1). 10.1136/openhrt-2022-001990. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Lee SP, Lee W, Lee JM, et al. Assessment of diffuse myocardial fibrosis by using MR imaging in asymptomatic patients with aortic stenosis. Radiology 2015;274(2):359–69. [DOI] [PubMed] [Google Scholar]
  • 71.Kwak S, Everett RJ, Treibel TA, et al. Markers of Myocardial Damage Predict Mortality in Patients With Aortic Stenosis. J Am Coll Cardiol 2021;78(6): 545–58. [DOI] [PubMed] [Google Scholar]
  • 72.Bohbot Y, Raitière O, Guignant P, et al. Unsupervised clustering of patients with severe aortic stenosis: A myocardial continuum. Arch Cardiovasc Dis. Published online September 2022;29. 10.1016/j.acvd.2022.06.007. [DOI] [PubMed] [Google Scholar]
  • 73.Lachmann M, Rippen E, Schuster T, et al. Subphenotyping of Patients With Aortic Stenosis by Unsupervised Agglomerative Clustering of Echocardiographic and Hemodynamic Data. JACC Cardiovasc Interv 2021; 14(19):2127–40. [DOI] [PubMed] [Google Scholar]
  • 74.Kwak S, Lee Y, Ko T, et al. Unsupervised Cluster Analysis of Patients With Aortic Stenosis Reveals Distinct Population With Different Phenotypes and Outcomes. Circ Cardiovasc Imaging 2020;13(5): e009707. [DOI] [PubMed] [Google Scholar]
  • 75.Ouyang D, Thomas JD. Characterizing Mitral Regurgitation With Precision Phenotyping and Unsupervised Learning. JACC Cardiovasc Imaging 2021; 14(12):2301–2. [DOI] [PubMed] [Google Scholar]
  • 76.Bartko PE, Heitzinger G, Spinka G, et al. Principal Morphomic and Functional Components of Secondary Mitral Regurgitation. JACC Cardiovasc Imaging 2021;14(12):2288–300. [DOI] [PubMed] [Google Scholar]
  • 77.Stone GW, Lindenfeld J, Abraham WT, et al. Transcatheter Mitral-Valve Repair in Patients with Heart Failure. N Engl J Med 2018;379(24):2307–18. [DOI] [PubMed] [Google Scholar]
  • 78.Obadia JF, Messika-Zeitoun D, Leurent G, et al. Percutaneous Repair or Medical Treatment for Secondary Mitral Regurgitation. N Engl J Med 2018; 379(24):2297–306. [DOI] [PubMed] [Google Scholar]
  • 79.Kline A, Wang H, Li Y, et al. Multimodal machine learning in precision health: A scoping review. NPJ Digit Med 2022;5(1):171. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.OpenAI. ChatGPT. Optimizing Language Models for Dialogue. OpenAI 2022. Available at: https://openai.com/blog/chatgpt/. Accessed February 25, 2023. [Google Scholar]
  • 81.Benjamens S, Dhunnoo P, Mesko B. The state of artificial intelligence-based FDA-approved medical devices and algorithms: an online database. NPJ Digit Med 2020;3:118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Li RC, Asch SM, Shah NH. Developing a delivery science for artificial intelligence in healthcare. NPJ Digit Med 2020;3:107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Marwaha JS, Landman AB, Brat GA, et al. Deploying digital health tools within large, complex health systems: key considerations for adoption and implementation. NPJ Digit Med 2022;5(1):13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Finlayson SG, Subbaswamy A, Singh K, et al. The Clinician and Dataset Shift in Artificial Intelligence. N Engl J Med 2021;385(3):283–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Gianfrancesco MA, Tamang S, Yazdany J, et al. Potential Biases in Machine Learning Algorithms Using Electronic Health Record Data. JAMA Intern Med 2018;178(11):1544–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Hajian S, Bonchi F, Castillo C. Algorithmic Bias: From Discrimination Discovery to Fairness-aware Data Mining. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. KDD ‘16. Association for Computing Machinery; 2016. p. 2125–6. [Google Scholar]
  • 87.Larrazabal AJ, Nieto N, Peterson V, et al. Gender imbalance in medical imaging datasets produces biased classifiers for computer-aided diagnosis. Proc Natl Acad Sci U S A 2020;117(23):12592–4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Obermeyer Z, Powers B, Vogeli C, et al. Dissecting racial bias in an algorithm used to manage the health of populations. Science 2019;366(6464):447–53. [DOI] [PubMed] [Google Scholar]
  • 89.Liu X, Cruz Rivera S, Moher D, et al. , CONSORT-AI Working Group. Reporting guidelines for clinical trial reports for interventions involving artificial intelligence: the CONSORT-AI extension. Nat Med 2020; 26(9):1364–74. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Sengupta PP, Shrestha S, Berthon B, et al. Proposed Requirements for Cardiovascular Imaging-Related Machine Learning Evaluation (PRIME): A Checklist: Reviewed by the American College of Cardiology Healthcare Innovation Council. JACC Cardiovasc Imaging 2020;13(9):2017–35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Tseng AS, Lopez-Jimenez F, Pellikka PA. Future Guidelines for Artificial Intelligence in Echocardiography. J Am Soc Echocardiogr 2022;35(8): 878–82. [DOI] [PubMed] [Google Scholar]
  • 92.Collins GS, Dhiman P, Andaur Navarro CL, et al. Protocol for development of a reporting guideline (TRIPOD-AI) and risk of bias tool (PROBAST-AI) for diagnostic and prognostic prediction model studies based on artificial intelligence. BMJ Open 2021; 11(7):e048008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Vasey B, Nagendran M, Campbell B, et al. Reporting guideline for the early-stage clinical evaluation of decision support systems driven by artificial intelligence: DECIDE-AI. Nat Med 2022;28(5):924–33. [DOI] [PubMed] [Google Scholar]
  • 94.Liu Y, Chen PHC, Krause J, et al. How to Read Articles That Use Machine Learning: Users’ Guides to the Medical Literature. JAMA 2019;322(18): 1806–16. [DOI] [PubMed] [Google Scholar]
  • 95.Stevens LM, Mortazavi BJ, Deo RC, et al. Recommendations for Reporting Machine Learning Analyses in Clinical Research. Circ Cardiovasc Qual Outcomes 2020;13(10):e006556. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Perez MV, Mahaffey KW, Hedlin H, et al. Large-Scale Assessment of a Smartwatch to Identify Atrial Fibrillation. N Engl J Med 2019;381(20):1909–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Attia ZI, Harmon DM, Dugan J, et al. Prospective evaluation of smartwatch-enabled detection of left ventricular dysfunction. Nat Med 2022;28(12):2497–503. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Yao X, Rushlow DR, Inselman JW, et al. Artificial intelligence-enabled electrocardiograms for identification of patients with low ejection fraction: a pragmatic, randomized clinical trial. Nat Med 2021;27(5):815–9. [DOI] [PubMed] [Google Scholar]
  • 99.Plana D, Shung DL, Grimshaw AA, et al. Randomized Clinical Trials of Machine Learning Interventions in Health Care: A Systematic Review. JAMA Netw Open 2022;5(9):e2233946. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES