An explainable artificial intelligence approach for predicting cardiovascular outcomes using electronic health records

Sergiusz Wesołowski; Gordon Lemmon; Edgar J Hernandez; Alex Henrie; Thomas A Miller; Derek Weyhrauch; Michael D Puchalski; Bruce E Bray; Rashmee U Shah; Vikrant G Deshmukh; Rebecca Delaney; H Joseph Yost; Karen Eilbeck; Martin Tristani-Firouzi; Mark Yandell

doi:10.1371/journal.pdig.0000004

. 2022 Jan 18;1(1):e0000004. doi: 10.1371/journal.pdig.0000004

An explainable artificial intelligence approach for predicting cardiovascular outcomes using electronic health records

Sergiusz Wesołowski ^1,^#, Gordon Lemmon ^1,^#, Edgar J Hernandez ¹, Alex Henrie ¹, Thomas A Miller ², Derek Weyhrauch ², Michael D Puchalski ², Bruce E Bray ^3,⁴, Rashmee U Shah ³, Vikrant G Deshmukh ⁵, Rebecca Delaney ⁶, H Joseph Yost ⁷, Karen Eilbeck ⁶, Martin Tristani-Firouzi ^2,^8,^*, Mark Yandell ^1,^*

Editor: Mecit Can Emre Simsekler⁹

¹Department of Human Genetics and Utah Center for Genetic Discovery, University of Utah, Salt Lake City, UT, United States of America

²Division of Pediatric Cardiology, University of Utah School of Medicine, Salt Lake City, UT, United States of America

³Division of Cardiovascular Medicine, University of Utah School of Medicine, Salt Lake City, UT, United States of America

⁴University of Utah, Biomedical Informatics, Salt Lake City, UT 84108, United States of America

⁵University of Utah Health Care CMIO Office, Salt Lake City, UT, United States of America

⁶Department of Population Health Sciences, University of Utah, Salt Lake City, UT, United States of America

⁷Molecular Medicine Program, University of Utah, Salt Lake City, UT, United States of America

⁸Nora Eccles Harrison CVRTI, University of Utah School of Medicine, Salt Lake City, UT, United States of America

⁹Khalifa University of Science and Technology, UNITED ARAB EMIRATES

I have read the journal’s policy and the authors of this manuscript have the following competing interests: GL, VD, MY own shares in Backdrop Health, there are no financial ties regarding this research.

^✉

* E-mail: Martin.Tristani@utah.edu (MTF); myandell@genetics.utah.edu (MY)

Contributed equally.

Roles

Sergiusz Wesołowski: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

Gordon Lemmon: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft

Edgar J Hernandez: Data curation, Formal analysis, Methodology, Software, Visualization, Writing – original draft

Alex Henrie: Data curation, Formal analysis, Software, Validation, Visualization

Thomas A Miller: Conceptualization, Data curation, Investigation, Validation, Writing – original draft

Derek Weyhrauch: Investigation, Validation, Visualization, Writing – original draft

Michael D Puchalski: Data curation, Project administration, Resources, Writing – original draft

Bruce E Bray: Conceptualization, Data curation, Formal analysis, Investigation, Validation, Visualization, Writing – original draft

Rashmee U Shah: Data curation, Validation, Visualization, Writing – original draft

Vikrant G Deshmukh: Conceptualization, Data curation, Investigation, Methodology, Project administration, Validation

Rebecca Delaney: Validation, Writing – original draft

H Joseph Yost: Conceptualization, Funding acquisition, Project administration, Supervision, Validation, Writing – original draft

Karen Eilbeck: Conceptualization, Funding acquisition, Project administration, Validation, Visualization, Writing – original draft

Martin Tristani-Firouzi: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Supervision, Visualization, Writing – original draft

Mark Yandell: Conceptualization, Formal analysis, Funding acquisition, Methodology, Project administration, Software, Supervision, Visualization, Writing – original draft

Mecit Can Emre Simsekler: Editor

PMCID: PMC8975108 NIHMSID: NIHMS1779599 PMID: 35373216

Abstract

Understanding the conditionally-dependent clinical variables that drive cardiovascular health outcomes is a major challenge for precision medicine. Here, we deploy a recently developed massively scalable comorbidity discovery method called Poisson Binomial based Comorbidity discovery (PBC), to analyze Electronic Health Records (EHRs) from the University of Utah and Primary Children’s Hospital (over 1.6 million patients and 77 million visits) for comorbid diagnoses, procedures, and medications. Using explainable Artificial Intelligence (AI) methodologies, we then tease apart the intertwined, conditionally-dependent impacts of comorbid conditions and demography upon cardiovascular health, focusing on the key areas of heart transplant, sinoatrial node dysfunction and various forms of congenital heart disease. The resulting multimorbidity networks make possible wide-ranging explorations of the comorbid and demographic landscapes surrounding these cardiovascular outcomes, and can be distributed as web-based tools for further community-based outcomes research. The ability to transform enormous collections of EHRs into compact, portable tools devoid of Protected Health Information solves many of the legal, technological, and data-scientific challenges associated with large-scale EHR analyses.

Introduction

The application of data-science methods to electronic health record (EHR) databases promises a new, global perspective on human health, with widespread applications for outcomes research and precision medicine initiatives. However, unmet technological challenges still exist [1–3]^[. One is the need for improved means for ab initio discovery of comorbid clinical variables in the context of confounding demographic variables at scale. Moreover, how best to tease apart the intertwined impacts of multiple comorbidities and demographic variables on patient health remains a daunting challenge [1, 3–9].

We used a massively-scalable comorbidity discovery method called Poisson Binomial based Comorbidity (PBC) discovery [10] to search Electronic Health Records (EHRs) from the University of Utah and Primary Children’s Hospital for comorbid diagnoses, procedures, and medications. In this context, we refer to co-occurring medical diagnoses, procedures and medications using the single blanket term, comorbidity. PBC can also discover temporal relationships and quantify transition rates between various comorbidities. The result is a disease network, devoid of Protected Health Information (PHI), that is well-suited for powering downstream outcomes research.

Although comorbidity discovery is a necessary first step towards enabling outcomes research, it is not an end in itself. Comorbidities do not exist as isolated pairs, rather they combine to create a complex web of influence on any given outcome. While PBC is powered to discover that web, harnessing it for outcomes research requires a separate computational machinery, one capable of calculating the joint contributions of multiple, conditionally dependent variables on an outcome, so called multimorbidity calculations [1,3,11–13]. Moreover, because researchers seek not merely to predict outcomes, but also to measure the contributions of factors driving them, ‘explainable’ solutions [14–22], rather than black box approaches are required. We have adapted Probabilistic Graphical Models (PGMs) [2,22–27] to address these needs.

PGMs are well suited for outcomes research. Contrary to other methods, e.g. generalized linear models (with or without mixed effects), PGMs are capable of: (1) discovering and modelling any number of multilevel dependencies between variables, (2) capturing non-additive or non-multiplicative interactions, and (3) their application does not require excluding nor imputing missing data [28]. Moreover, PGMs model the full joint probability function governing relationships in the data, and thus do not necessitate a dichotomy between response and input variables. Rather, PGMs are capable of answering a prediction query for any variables conditioned on any set of inputs included in the model.

Using these computational technologies, we mined the EHRs of over 1.6 million University of Utah and Primary Children’s Hospital patients, including over 500,000 mother-child pairs, for comorbid diagnoses, procedures, medications, and lab tests driving diverse cardiovascular health outcomes, focusing on three areas: heart transplant, sinoatrial node dysfunction, and congenital heart disease. Our results illuminate the comorbid and demographic landscapes surrounding these key cardiovascular outcomes in the US intermountain west, and demonstrate how our approach can inform health care disparities with precise, quantitative results in the context of a specific health care system.

Results

PBC is well powered for discovery of cardiovascular comorbidities

Table 1 demonstrates the utility of the PBC [10] approach for discovery, by comparing the power of PBC versus a standard stratification approach (followed by χ²) to detect the well documented comorbid relationship between atrial fibrillation (AF) and acute cerebrovascular disease (stroke) [29,30]. Table 1 provides a power analysis as a function of corpus size and number of demographic variables. The effects of stratifying the data for χ² analysis, versus adding them to the PBC calculation, can be observed as one proceeds down the table columns. Results for three different starting cohort sizes are shown. Note how stratification lowers the strength of p-values as a function of the size of the stratum. This effect is exacerbated when more than a few potentially confounding variables are controlled for, and stratification quickly results in cohorts that are too small for discovery activities, as the comorbidities fail to achieve statistical significance. For example, using a starting corpus of 9,525 records, stratification followed by χ² analysis fails to detect the well-known comorbid relationship between AF and Stroke for female patients aged 50–59 when white ancestry is included in the stratum description. By contrast, the PBC approach maintains power across all comparisons. For more on these points, see [10].

Table 1. PBC is well powered for comorbidity discovery on demographically complex datasets, unlike stratification.

Atrial Fibrillation and Acute Cerebrovascular Disease
Features	PBC p-value			χ2 p-value
Features	N = 1,538,059	N = 95,407	N = 9,525	N = 1,538,05	N = 95,407	N = 9,525
no features	1e-31020	1e-1715	1e-203	1e-31020	1e-1715	1e-203
+sex	1e-31017	1e-1955	1e-215	1e-16657	1e-1125	1e-147
+age	1e-25448	1e-1589	1e-200	1e-1304	1e-88.3	1e-13.1
+ancestry	1e-14381	1e-628	1e-73.1	1e-15.72	1	1
+ethnicity	1e-11357	1e-806	1e-110	1e-12.25	1	1
+insurance	1e-11533	1e-771	1e-83	1e-2.68	1	1
+span	1e-11325	1e-698	1e-84.1	1e-1.75	1	1

Open in a new tab

Progressively smaller random samples were drawn from the Utah EHR corpus, such that each cohort is a subset of this larger precursor. N = the number of subjects in each cohort under consideration. Cells in the table contain p-values for the association between Atrial Fibrillation and Acute Cerebrovascular Disease (stroke), as calculated by PBC or χ2 (for stratification). P-values less than the Bonferroni corrected alpha (1e-9.5) are shown in light blue, while cells that do not pass the significance threshold are red. Stratum filters apply to the features’ column, row by row as follows: no filters, female, 50–59 years of age, white, non-Hispanic, commercial insurance, minimum of 2 years of medical history.

Comorbidities of heart transplant

We evaluated every pairwise combination of diagnoses, procedures, and medications mentioned in our EHR corpus for comorbid associations, using PBC [10] to adjust on a patient-by-patient basis for the potentially confounding demographic variables shown in Fig 1. Fig 2A summarizes the results of this computation as a patient disease network. The network provides a visual overview of the entire EHR corpus, wherein every node (state) is a diagnosis, procedure, or medication, and edges denote Bonferroni significant comorbid relations between terms. Given a node of interest, heart transplant, for example, its comorbid diagnoses and associated procedures and medications can be recovered by following edges to that node back to their terms.

Fig 1 — Demographic variables used in the comorbidity discovery process are displayed on the y-axis. The percent of all diagnoses, procedures, and medications influenced by a given demographic feature is displayed on the x-axis. For example, sex influences 42.2% percent of diagnoses, procedures, and medications in the Utah EHR corpus; ancestry influences 27.4% and EHR exposure 100%. EHR exposure includes subject age, length of medical record history, number of visits. See article [10] for details. Features were selected using L1 regularization.

Fig 2 — **Panel A.** Graphical representation of the Patient Disease Network. 39,055 ICD 10 diagnosis codes, 5,716 CPT procedure codes, and 1,764 RxNorm medication codes comprising 50 million comorbidities are represented by the map. To render the patient disease network more readily interpretable, we utilized Minimum Description Length clustering, so that nodes with similar comorbidity patterns lay near to one another in the network. The comorbidities of Heart Transplant are labeled red for reference purposes. See Methods for details. **Panel B.** Term trajectory for Adult Heart Transplant. Nodes represent diagnosis (black), procedures (red), and medications (blue). Edges are temporally ordered comorbidities (Bonferroni alpha = 10E-9.5), arrows denote direction. Edges are labeled with transition probabilities (e.g. patient flux). For example, an adult patient with viral myocarditis has a 17% chance of developing a heart failure diagnosis, and a 4.9% chance of undergoing heart transplantation. See Methods for additional details and **S5 Table** for code references for the highlighted terms.

The transition probabilities associated with each edge provide means to calculate the pairwise contributions of each term to the outcome’s observed (marginal) frequency in the EHR corpus. This provides a way to intuit an outcome’s comorbidity landscape, and calculate the expected flux of patients through that region of the network. These patient ‘trajectories’ provide a framework for cost prediction and service allocation activities. For example, the trajectory for adult heart transplant (2B) tracks the time course of diagnoses, procedures and medication use preceding and following heart transplantation. Thus, one can follow the trajectory of ischemic heart disease, flowing through the diagnosis of heart failure, cardiogenic shock, administration of the vasoactive medication milrinone, and culminating in heart transplantation with subsequent downstream complications. Crucially, this methodology provides precise measures of patient flux between these nodes.

Multimorbidity network for heart transplant supports conditional outcome risk calculations

Although trajectories provide intuitive and useful overviews of the comorbidity landscape, effective outcomes research requires calculating the joint contributions of conditionally dependent multimorbid terms on an outcome. We leverage Probabilistic Graphical Models as an explainable AI solution for this computationally intensive task. Fig 3A illustrates a multimorbidity network derived from a temporalized Probabilistic Graphical Model for the predisposing comorbidities of adult heart transplant presented in Fig 2B. Because the edges in a multimorbidity network denote conditional dependencies between terms, rather than transition probabilities, the multimorbidity network’s topology is necessarily different from the trajectory topology shown in Fig 2B. The PGM provides easy means to calculate outcomes risk for any combination of variables in it. For example, a prior diagnosis of cardiomyopathy (non-ischemic) increases the risk of heart transplantation 86±35 fold, whereas a diagnosis of viral myocarditis confers a 59±21 fold increase in risk. The strongest single variable for heart transplant risk is the use of the vasoactive medication milrinone, which increases risk 175±30 fold. Note that we are not suggesting milrinone causes heart transplant—rather that the prescription of milrinone in a patient’s medical record is a powerful predictor of future heart transplant.

Fig 3 — **Panel A.** PGM for Adult Transplant. N = 1.6 million individuals. The clinical variables were chosen based on Bonferroni-corrected ICD10 and RXnorm billing codes significantly associated (preceding) with heart transplant. Each node represents a diagnosis, procedure, or medication code and each edge represents a conditional dependence between nodes. For detailed description of the clinical variables, please refer to **S5 Table**. **Panel B**. PGM for Pediatric Transplant. N = 26,458 individuals. Clinical variable terms represent terms in the Primary Children’s Hospital echocardiographic database or CCS billing codes when available. For detailed description of the clinical variables, please refer to the **S5 Table**. DCM: Dilated cardiomyopathy; Norwood: Norwood surgery; HLHS: hypoplastic left heart syndrome; Glenn: Glenn surgery; Fontan: Fontan surgery; AVSD: atrioventricular septal defect; ASD: Atrial septal defect; BAV: Bicuspid aortic valve; Coarctation: Coarctation of the aorta; VSD: Ventricular septal defect. Heart Transplant is highlighted in orange. For A and B, the target node (heart transplant) is colored red and nodes with direct connections to the target (ie, within the Markov blanket) are circled red. Values in Tables represent mean ± STD.

The utility of PGMs for outcomes research is best illustrated by their application to problems of complex multimorbid outcomes analyses, where conditional dependencies of these variables interact to further modulate risk for the outcome under study. For example, we can explore the role of heart disease etiology on transplant risk in the context of milrinone infusion. Thus, a cardiomyopathy patient requiring milrinone has a 407±101 fold increased risk for heart transplant. Likewise, a patient with viral myocarditis requiring milrinone therapy has a 346±93 fold increased risk for heart transplant; while milrinone use in a patient with ischemic heart disease confers a 205±28 fold increased risk of heart transplant. Moreover, while both cardiomyopathy and ischemic heart disease have similar increased risks for heart transplant in isolation (86±35 fold and 64±14 fold, respectively), cardiomyopathy patients who require milrinone therapy are at far greater risk for heart transplant than patients with ischemic heart disease requiring milrinone. Additional conditional queries conducted with the PGM are presented in Fig 3A. This list is by no means exhaustive—the PGM is capable of answering an astonishing number of queries—3²⁵ to be precise. We encourage the reader to explore these by following the link to the corresponding web application https://pbc.genetics.utah.edu/lemmon2021/bayes. In this context, the explainable nature of PGMs lays the foundation for massively parallel testing of novel hypotheses between multiple, complex clinical variables of interest.

The comorbidity landscape for pediatric heart transplant is dramatically different from that of adults, as it includes a large contribution from congenital heart defects (CHD) and palliative surgical procedures. Fig 3B presents a multimorbidity network for 13 common CHD terms defined by echocardiogram and identified by PBC as comorbid with pediatric heart transplant. A prior diagnosis of dilated cardiomyopathy (DCM), defined as genetic/idiopathic DCM, increases a child’s risk for heart transplant 102.2±33.6-fold, over the marginal probability of transplant. Among single ventricle forms of CHD, patients with hypoplastic left heart syndrome (HLHS) are at the greatest risk for heart transplant (56.8±17.8-fold), as compared to tricuspid atresia (17.1±11.8-fold) or laterality defects (25.8-fold ± 8.5). Again, the utility of PGMs for complex multimorbid analyses is highlighted by the ability to calculate the additional risk for heart transplant in a child with a laterality defect, if that child also requires the Norwood surgery (51.3±10.5-fold).

Multimorbidity network for sinoatrial node dysfunction supports multimorbidity risk calculations for a range of clinical and demographic health predictors

Fig 4A extends the investigations to include the impacts of these same pediatric heart surgeries in the context of various CHD phenotypes on a different clinical outcome, sinoatrial node dysfunction (SND). The Fontan surgery dominates the landscape of pediatric SND, increasing the risk 19.6±6.4-fold over the marginal probability of SND. Moreover, Fontan surgery is the only clinical variable with a direct connection to SND; the other clinical variables connect indirectly to SND via the Fontan node. Thus, the relative risk of SND for specific forms of single ventricle CHD (HLHS, tricuspid atresia, unbalanced AVSD) following the Fontan surgery are similar (Fig 4), indicating that the Fontan surgery itself is the primary indicator of future SND, rather than the underlying form of CHD that required the procedure. Collectively, the preceding analyses demonstrate how multiple nets can be used in tandem to address complex multimorbidity outcomes questions.

Multimorbidity networks also provide powerful means to investigate the impacts of various demographic factors upon outcomes. The net in Fig 4B models the multimorbid landscape surrounding SND in adult patients. As SND and AF are both risk factors for each other [31], we temporalized the network (see Methods) to analyze clinical variables that precede SND. The ancestry and ethnicity nodes enable explorations of demographic impacts upon SND and its comorbidities. Thus, in the University of Utah Hospital system, a Hispanic patient with AF has a 61±6 fold increased risk of SND, compared to 30±1 fold risk for white ancestry and 40±7 fold risk for African Americans. These results underscore the potential of our approach to inform ethnic/racial health care disparities with precise, quantitative results, and in the context of a specific health care system. Moreover, these findings illustrate how our approach can empower these discussions despite demographic skews in the underlying EHR corpus (see S2 and S3 Tables); an important finding for the Utah health system.

Multimorbidities of congenital malformations augmented by maternal health data

The impact of maternal health on health outcomes in the child is an area of intense investigation. The Multimorbidity network shown in Fig 5A places a child’s risk for congenital malformations in the context of a maternal diagnosis of pregnancy-induced hypertension (HTN-PREG) during that pregnancy, leveraging outcomes data for over 130,000 births at the University of Utah Hospital system over the last 15 years. HTN-PREG elevates the risk of cardiac and circulatory congenital anomalies 1.83±0.03-fold, an effect not due to maternal age differences (S1 Fig). The multimorbidity network also illuminates the strong dependencies between clinical variables and allows for quantitative assessments of risk. For example, a diagnosis of Down Syndrome is associated with a 25.9±0.8-fold increased risk for a congenital cardiac anomaly (S4A Table). Moreover, a child with a congenital cardiac anomaly is a priori 9.2±0.9-fold more likely to have a nervous system anomaly than baseline (S4B Table). The impact of maternal health on a child’s risk of CHD is further explored in Fig 5B. Our ability to seamlessly combine and compute upon maternal/child EHR data highlights the extensibility of our approach to study health outcomes across generations in order to define the impacts of maternal health on childhood outcomes.

Web-based outcomes calculators

We repackaged the multimorbidity networks as stand-alone web-based outcomes calculators. This allows users to interact with a multimorbidity network as an ‘app’, whereby they can use slider buttons to toggle values of its states and to select an outcome of interest. These web-apps are available here: https://pbc.genetics.utah.edu/lemmon2021/bayes/bayes.

Methods

Ethics statement

Human subjects approval for this study was obtained following review by the University of Utah Institutional Review Board, IRB_00095807 under a waiver of consent and authorization. Patient data was not anonymized prior to the start of the study. All authors completed Human Subjects research requirements.

Utah data resource

The University of Utah maintains an Enterprise Data Warehouse (EDW)–a central storage and search facility for all clinical data collected from all affiliated University hospitals and clinics across the Intermountain West. SQL queries were used to aggregate data from various tables and collect the following information: (1) gender, ancestry, ethnicity, and age for each patient; (2) list of patient visits, along with visit dates, and medical terms associated with each visit, including diagnostic codes, procedure codes, and medications ordered. ICD9 and ICD10 diagnosis codes consist of 18,000 and 142,000 codes respectively, while procedural codes (CPT) include around 10,000 codes. In all, we collected records for 1.6 million patients, 21 million visits and 166 million diagnosis (DX), procedure (PX) and medication (RX) codes. See S1–S5 Tables for additional details.

We combined these data with the Primary Children’s Hospital’s database of echocardiographic variables (diagnoses, ventricular function, valve gradients, chamber/vessel sizes, etc.) dating back to 2006 for 65,618 probands, 44,254 of which also appear longitudinally in the University’s EDW. These data contain 529,317 mother-child pairs with EHR data, 14,155 of which include a child with echo data, allowing us to study maternal contributions to congenital heart disease (CHD). Collectively, these data comprise the Utah Data Resource (UDR). For the purposes of computation, custom encryption is applied to the UDR to produce data free of protected health information (PHI) and unintelligible without its cyphers. We can then generate statistics on this PHI free data in a variety of compute environments, decrypting the results on PHI approved machines.

In this analysis, a patient’s diagnoses are inferred via billing codes. Thus, the investigations and risk calculations presented herein reflect medical practice within the University of Utah Hospital network and Primary Children’s Healthcare. How closely they approximate underlying universal (‘true’) risks is still unknown. Moving forward, we note that the methods described below provide powerful means for large-scale cross institutional comparisons aimed at discovering differences in medical practice and billing trends.

Patient disease network

We used a Poisson Binomial based methodology called PBC [10] to discover comorbidities within our EHR corpus. Standard methods such as stratification seek to control for confounding variables through ‘stratifying’ by age and gender (for instance) and calculating comorbidity statistics for each strata, under the restrictive assumption a every patient in a stratum has the same probability of manifesting each morbidity. However this approach fails to scale, since the use of many confounding variables leads to strata too small to detect a statistical significance comorbidity. In contrast, PBC models the effects of age, gender, race, ethnicity, insurance type, and the length and density of each patient’s medical record. These input features are used to determine per-patient probabilities for each medical term, using a Poisson binomial test. The result is much greater statistical power [10].

PBC was used to find significant connections among every possible combination of ICD diagnoses, procedures, and RxNorm medication terms, thereby creating a patient disease network [10]. Patient disease network is a term borrowed from Capobianco et. al³ and denotes a network comprising all significant connections among diagnoses, procedures and medications (Bonferroni p-value cutoff 10E-9.48). We only considered terms appearing in at least 15 patients. This filter reduced the number of unique terms to 39,055 ICD10 diagnosis codes, 5,716 CPT procedure codes, and 1,764 RxNorm medication codes. We used Minimum Description Length clustering [32] to visualize the data, so that nodes with similar combinations of edges would lay near one another in the network. We also determined the patient flux between every pair of nodes. The result is shown in Fig 2A, which provides a visual representation of our patient disease network for the entire EHR corpus.

In keeping with previous work [13,33–36] on patient disease networks, we refer to a sub-portion of the network, focused on a single outcome as a trajectory, or term trajectory. Fig 2B shows a trajectory for adult heart transplant. Trajectories provide means to display additional features of the network, such as transition probabilities (which correspond to patient flux between nodes), and the marginal frequencies of outcomes and comorbid terms within the EHR corpus. Collectively, this information allows for better intuition of the disease landscape surrounding an outcome. The trajectory is also a useful starting point for cost and service allocation calculations.

Multimorbidity networks

While trajectories describe transition probabilities between two comorbid terms, they provide no means to determine the combined effects of multiple comorbid diagnoses, and associated clinical procedures and medications upon an outcome. We have employed Probabilistic Graphical Models (PGMs) to overcome this limitation. We learned the structures of the PGMs using the python3 package “pomegranate” [28], which provides a Bayesian Information Criterion (BIC)-based DP-A* exact structure search algorithm [37,38,46]. The exact search algorithm explores the entire applicable space of conditional dependencies in order to discover the optimal network structure for the data. Parameter learning for this optimal network is accomplished using the loopy belief propagation algorithm [39]. We use the same package for our inference and multimorbidity risk calculations. The visual interpretation was designed using the graph_tool [40] Python3 package and D3.js Java library.

For each Probabilistic Graphical Model, a maximum of 25 comorbid features were selected using PBC and validated by experts in the medical field (TAM, DW, MDP, BEB, RUS, MTF). Features that were judged to be of clinical relevance, importance or interest for the field under study were selected and used as inputs to learn the PGM structure and infer risk. These selected features became the inputs used to learn the PGM structure and infer risk. The patient’s features were described in a categorical data format, (e.g. indicating the ancestry, ethnicity, or insurance type) or “present/absent” binary variables in case of medical diagnoses and procedures. A continuous feature (e.g. age, BMI, blood pressure) were discretized based on established clinical thresholds. Because the PGMs only present the facts about the data, PGMs themselves cannot discover or infer the temporal order of the events (unless specified as a Dynamic PGM). To overcome this issue, for our temporalized PGMs we have imposed the order (discovered using PBC; see [10] for additional details.) on the EHR extraction process prior to learning the Probabilistic Graphical Model structure. When trained on temporalized data, PGMs are forced to learn temporal conditional probabilities. Missing data are handled inherently by the Probabilistic Graphical Model structure learning process. That is, no patients were excluded due to missing data and no missing data was imputed. The resulting temporalized structures we call multimorbidity networks.

Probabilistic Graphical Models represent conditional dependencies in the dataset as a directed acyclic graph (DAG); however, it is important not to confuse directionality with causality or temporal ordering. In keeping with best practice, the multimorbidity networks are visualized in their undirected, moralized form, in which every node is connected to its Markov blanket. A single constructed multimorbidity Network provides an inference engine capable of answering O(3ⁿ) personalized conditional risk queries, where n denotes the number of features describing a patient’s condition, and the base of the exponent is 3, because in case of binary health records data there are three states for each node that can be specified: present, absent, or status unknown.

Confidence values

Risk estimates derived from Probabilistic Graphical Models are maximum likelihood estimates given the optimal structure under the BIC and an assumed uniform prior probability of any distinct EHR. To obtain standard deviation values for these estimates, we created 100 nets in parallel [41] from bootstrap replicates of the same data used to create Figs 3, 4 and 5. We then queried the resulting replicate nets, and calculated standard deviations of risks of outcomes of interest.

Discussion

The ability to model dependencies among multiple risk factors is crucial for meaningful outcomes research. Unfortunately, traditional techniques, such as logistic regression, have limited ability to capture so-called ‘conditional dependencies’ between variables, which are the heart and soul of multimorbid analyses. Although mixture and generalized linear models with mixed effects can (in principle) overcome this weakness, these techniques are limited because a new model must be designed for every question. Neural nets provide one possible alternative. Although they can account for non-linear interactions in the data and are scalable [7], Neural nets are often referred to as ‘black boxes’ (i.e., lacking explainability) [14,15,20,21,42–46] due to the difficulties in determining precisely how and why different input variables were used to produce the outputs.

Because we sought not merely to predict outcomes, but also to understand the relationships between multiple clinical variables and outcomes, we selected an ‘explainable’ AI solution, rather than a black box approach. Probabilistic Graphical Model-based [23–25,46] multimorbidity networks offer best-practice solutions to this problem. Moreover, they effectively model data without recourse to a fixed decision protocol (e.g decision trees), and are resilient to missing/unknown data. Crucially, the contributions of different combinations of variables to an outcome can be precisely and easily determined.

Explainability comes at a cost; unlike Neural nets, which are incredibly scalable, multimorbidity networks can model a maximum of only 30 or so variables at once [28,37,38]. It is therefore necessary to pre-identify high impact variables when modeling an outcome, a need fulfilled by PBC [10]. We argue that the ability to rigorously investigate interrelations among 30 or so primary determinants represents a giant step toward understanding cardiovascular disease.

Our results illustrate how multimorbidity networks provide explainable solutions for understanding the joint impacts of diagnoses, medications, and medical procedures on cardiovascular health outcomes. We emphasize that the necessarily brief results reported here hardly exhaust the contents of these machineries. Consider that a multimorbidity network with n nodes supports ~3ⁿ possible queries. The net shown in Fig 4B, for example, supports ~3¹⁴ different queries—a number that gives some indication both of the complexity of the data being extracted from the EHR corpus by our approach, and the value of these multimorbidity networks to further outcomes research.

Conclusion

The analyses presented here provide a first step toward a global description of heart disease and associated comorbidities across the USA intermountain west. However, the map we seek resides not so much in the results reported here, as it does in the products of our analyses: the PGM multimorbidity networks. As we have explained, these networks support multitudes of queries, and when used in combination, support both wide-ranging and focused explorations of a disease landscape. Given the right datasets, we have shown that the approach can provide new insights, such as the mother-child cross-generational cardiovascular multimorbidities we described. However, our approach also has limitations. Our exact approach allows us to model at most ~30 health conditions at a time. In future work we would like to relax this limiting factor by allowing approximate solutions that enable us to scale up the complexity of the multimorbidity networks to thousands of health conditions. Another area for innovation regards incorporation of continuous variables, as current software packages do not allow us to incorporate such variables at scale, however there is no theoretical limitation preventing their use in a PGM framework.

A major strength of our approach is that these outcomes machineries can be redistributed as web-based tools. Indeed, the multimorbidity Networks described here have been made available online [pbc.genetics.utah.edu/lemmon2021/bayes], with the hope that the wider scientific community will find them useful for their own outcomes research. The ability to transform enormous collections of EHR data into compact, portable machines for outcomes research, with no exchange of PHI, solves many of the legal, technological, and data-scientific challenges associated with large-scale EHR analyses.

Supporting information

S1 Fig. Distribution density plot of mother’s age at pregnancy, with and without hypertension complicating pregnancy.

Blue line: mothers with diagnosis of hypertension complicating pregnancy (N = 11,523 mothers). Red line: mothers without diagnosis of hypertension complicating pregnancy (N = 113,491 mothers).

(TIF)

Click here for additional data file.^{(3.9MB, tif)}

S1 Table. Overview of Utah Data Resource.

(TIF)

Click here for additional data file.^{(2.6MB, tif)}

S2 Table. Demographic variables and the Utah Data Resource.

(TIF)

Click here for additional data file.^{(5.5MB, tif)}

S3 Table. Multimorbidity Landscape of Sinoatrial Node Dysfunction (SND) in adults.

Risk and fold-change risk estimates calculated from the multimorbidity network in Fig 4B main text. For detailed description of the clinical variables, please refer to S5 Table.

(TIF)

Click here for additional data file.^{(4.5MB, tif)}

S4 Table. Risks of Cardiac or Nervous System Congenital Anomalies as a Function of Comorbid Clinical Variables.

Risk of cardiac (Panel A) or nervous system (Panel B) congenital anomalies given the presence of specific clinical variables. Baseline risk and fold change risk calculated using the multimorbidity network in Fig 5 of the main text. For example, a child with a known diagnosis of Down Syndrome has a 25.9-fold increased risk of a cardiac congenital anomaly over the marginal risk of cardiac anomaly. HTN-PREG, hypertension complicating pregnancy (AKA pregnancy-induced hypertension). For detailed description of the clinical variables please refer to S5 Table.

(TIF)

Click here for additional data file.^{(100.6KB, tif)}

S5 Table. Reference table for EHR coding.

(TIF)

Click here for additional data file.^{(19MB, tif)}

Acknowledgments

We thank Barry Moore, Jacob Shreiber, Jerry Rudisin, Sepideh Ebadi, Edward B. Clark and members of the University of Utah EDW, UPDB and Utah Center for High Performance Computing for insightful discussions, facilitating access to medical records and familial relationships, and computational support.

Data Availability

We obtained medical records from the University of Utah and Primary Children’s Hospital under an IRB that waived consent (see ethics statement). We refer to this cross-institution extract as the Utah Data Resource. Because the aggregate is comprised of exact dates and other protected patient information, the data cannot be made publicly available. Information regarding how qualified researchers might apply for data access can be found here https://irb.utah.edu/about/contact/. However, All Probabilistic Graphical Models described in this paper are available through the web using the following link: https://pbc.genetics.utah.edu/lemmon2021/bayes/.

Funding Statement

This research was supported by the AHA Children’s Strategically Focused Research Network grant (17SFRN33630041) (https://professional.heart.org/en/research-programs/strategically-focused-research/strategically-focused-research-networks) and the Nora Eccles Treadwell Foundation. RD’s effort was supported by the National Institutes of Health under Ruth L. Kirschstein National Research Service Award T32 HL007576 from the National Heart, Lung, and Blood Institute (https://grants.nih.gov/grants/oer.htm). GL was supported by NRSA training grant T32H757632 (https://researchtraining.nih.gov/programs/training-grants/T32). SW was supported by NRSA training grant T32DK110966-04 (https://researchtraining.nih.gov/programs/training-grants/T32). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

1.Valderas J. M., Starfield B., Sibbald B., Salisbury C. & Roland M. Defining Comorbidity: Implications for Understanding Health and Health Services. Ann. Fam. Med. 7, 357–363 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Kraisangka J. et al. Bayesian Network vs. Cox’s Proportional Hazard Model of PAH Risk: A Comparison. in Artificial Intelligence in Medicine (eds. Riaño D., Wilk S. & ten Teije A.) 139–149 (Springer International Publishing, 2019). doi: 10.1007/s11906-019-0950-y [DOI] [Google Scholar]
3.Capobianco E. & Lio P. Comorbidity: a multidimensional approach. Trends Mol. Med. 19, 515–521 (2013). doi: 10.1016/j.molmed.2013.07.004 [DOI] [PubMed] [Google Scholar]
4.Guo M. et al. Analysis of disease comorbidity patterns in a large-scale China population. BMC Med. Genomics 12, 177 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Hu J. X., Thomas C. E. & Brunak S. Network biology concepts in complex disease comorbidities. Nat. Rev. Genet. 17, 615–629 (2016). doi: 10.1038/nrg.2016.87 [DOI] [PubMed] [Google Scholar]
6.Akram P. & Liao L. Prediction of comorbid diseases using weighted geometric embedding of human interactome. BMC Med. Genomics 12, 161 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Rank N. et al. Deep-learning-based real-time prediction of acute kidney injury outperforms human predictive performance. Npj Digit. Med. 3, 1–12 (2020). doi: 10.1038/s41746-019-0211-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Gutiérrez-Sacristán A. et al. comoRbidity: an R package for the systematic analysis of disease comorbidities. Bioinformatics 34, 3228–3230 (2018). doi: 10.1093/bioinformatics/bty315 [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Moni M. A. & Liò P. comoR: a software for disease comorbidity risk assessment. J. Clin. Bioinforma. 4, 8 (2014). doi: 10.1186/2043-9113-4-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Lemmon G., Wesolowski S., Henrie A., Tristani-Firouzi M. & Yandell M. A Poisson binomial-based statistical testing framework for comorbidity discovery across electronic health record datasets. Nat. Comput. Sci. 1, 694–702 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Aguado A., Moratalla-Navarro F., López-Simarro F. & Moreno V. MorbiNet: multimorbidity networks in adult general population. Analysis of type 2 diabetes mellitus comorbidity. Sci. Rep. 10, 2416 (2020). doi: 10.1038/s41598-020-59336-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Xu H., Moni M. A. & Lio P. CytoCom: A Cytoscape app to visualize, query and analyse disease comorbidity networks. Bioinforma. Oxf. Engl. 31, (2014). doi: 10.1093/bioinformatics/btu731 [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Ronzano F., Gutiérrez-Sacristán A. & Furlong L. I. Comorbidity4j: a tool for interactive analysis of disease comorbidities over large patient datasets. Bioinformatics 35, 3530–3532 (2019). doi: 10.1093/bioinformatics/btz061 [DOI] [PubMed] [Google Scholar]
14.Barredo Arrieta A. et al. Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI. Inf. Fusion 58, 82–115 (2020). [Google Scholar]
15.Amann J. et al. Explainability for artificial intelligence in healthcare: a multidisciplinary perspective. BMC Med. Inform. Decis. Mak. 20, 310 (2020). doi: 10.1186/s12911-020-01332-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Anguita-Ruiz A., Segura-Delgado A., Alcalá R., Aguilera C. M. & Alcalá-Fdez J. eXplainable Artificial Intelligence (XAI) for the identification of biologically relevant gene expression patterns in longitudinal human studies, insights from obesity research. PLoS Comput. Biol. 16, e1007792 (2020). doi: 10.1371/journal.pcbi.1007792 [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Gordon L., Grantcharov T. & Rudzicz F. Explainable Artificial Intelligence for Safe Intraoperative Decision Support. JAMA Surg. 154, 1064–1065 (2019). doi: 10.1001/jamasurg.2019.2821 [DOI] [PubMed] [Google Scholar]
18.Lamy J.-B., Sekar B., Guezennec G., Bouaud J. & Séroussi B. Explainable artificial intelligence for breast cancer: A visual case-based reasoning approach. Artif. Intell. Med. 94, 42–53 (2019). doi: 10.1016/j.artmed.2019.01.001 [DOI] [PubMed] [Google Scholar]
19.Lauritsen S. M. et al. Explainable artificial intelligence model to predict acute critical illness from electronic health records. Nat. Commun. 11, 3852 (2020). doi: 10.1038/s41467-020-17431-x [DOI] [PMC free article] [PubMed] [Google Scholar]
20.London A. J. Artificial Intelligence and Black-Box Medical Decisions: Accuracy versus Explainability. Hastings Cent. Rep. 49, 15–21 (2019). doi: 10.1002/hast.973 [DOI] [PubMed] [Google Scholar]
21.Wang H. et al. Predicting Hospital Readmission via Cost-Sensitive Deep Learning. IEEE/ACM Trans. Comput. Biol. Bioinform. 15, 1968–1978 (2018). doi: 10.1109/TCBB.2018.2827029 [DOI] [PubMed] [Google Scholar]
22.Arora P. et al. Bayesian Networks for Risk Prediction Using Real-World Data: A Tool for Precision Medicine. Value Health 22, 439–445 (2019). doi: 10.1016/j.jval.2019.01.006 [DOI] [PubMed] [Google Scholar]
23.Neuberg L. G. CAUSALITY: MODELS, REASONING, AND INFERENCE, by Judea Pearl, Cambridge University Press, 2000. Econom. Theory 19, 675–685 (2003). [Google Scholar]
24.Pearl, J. Reverend bayes on inference engines: a distributed hierarchical approach. in Proceedings of the Second AAAI Conference on Artificial Intelligence 133–136 (AAAI Press, 1982).
25.Pearl J. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. (Morgan Kaufmann Publishers Inc., 1988). [Google Scholar]
26.McLachlan S., Dube K., Hitman G. A., Fenton N. E. & Kyrimi E. Bayesian networks in healthcare: Distribution by medical condition. Artif. Intell. Med. 107, 101912 (2020). doi: 10.1016/j.artmed.2020.101912 [DOI] [PubMed] [Google Scholar]
27.Oniśko A. & Druzdzel M. J. Impact of precision of Bayesian network parameters on accuracy of medical diagnostic systems. Artif. Intell. Med. 57, 197–206 (2013). doi: 10.1016/j.artmed.2013.01.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Schreiber, J. Pomegranate: fast and flexible probabilistic modeling in python. ArXiv171100137 Cs Stat (2018).
29.Wolf P. A., Dawber T. R., Thomas H. E. & Kannel W. B. Epidemiologic assessment of chronic atrial fibrillation and risk of stroke: the Framingham study. Neurology 28, 973–977 (1978). doi: 10.1212/wnl.28.10.973 [DOI] [PubMed] [Google Scholar]
30.Wolf P. A., Abbott R. D. & Kannel W. B. Atrial fibrillation as an independent risk factor for stroke: the Framingham Study. Stroke 22, 983–988 (1991). doi: 10.1161/01.str.22.8.983 [DOI] [PubMed] [Google Scholar]
31.John R. M. & Kumar S. Sinus Node and Atrial Arrhythmias. Circulation 133, 1892–1900 (2016). doi: 10.1161/CIRCULATIONAHA.116.018011 [DOI] [PubMed] [Google Scholar]
32.Rissanen J. Modeling by shortest data description. Automatica 14, 465–471 (1978). [Google Scholar]
33.Agusti A. & Faner R. Lung function trajectories in health disease. Lancet Respir. Med. 7, 358–364 (2019). doi: 10.1016/S2213-2600(18)30529-0 [DOI] [PubMed] [Google Scholar]
34.Burckhardt P., Nagin D. S. & Padman R. Multi-Trajectory Models of Chronic Kidney Disease Progression. AMIA Annu. Symp. Proc. AMIA Symp. 2016, 1737–1746 (2016). [PMC free article] [PubMed] [Google Scholar]
35.Reed E. & Corner J. Defining the illness trajectory of metastatic breast cancer. BMJ Support. Palliat. Care 5, 358–365 (2015). doi: 10.1136/bmjspcare-2012-000415 [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Siggaard T. et al. Disease trajectory browser for exploring temporal, population-wide disease progression patterns in 7.2 million Danish patients. Nat. Commun. 11, 4952 (2020). doi: 10.1038/s41467-020-18682-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Koivisto M. & Sood K. Exact Bayesian Structure Discovery in Bayesian Networks. J. Mach. Learn. Res. 5, 549–573 (2004). [Google Scholar]
38.Yuan, C., Malone, O. & Wu, X. Learning optimal Bayesian networks using A* search. in In Proceedings of the 22nd International Joint Conference on Artificial Intelligence (2011).
39.Weiss Y. Correctness of Local Probability Propagation in Graphical Models with Loops. Neural Comput. 12, 1–41 (2000). doi: 10.1162/089976600300015880 [DOI] [PubMed] [Google Scholar]
40.The graph-tool python library. (2014) doi: 10.6084/m9.figshare.1164194.v14 [DOI]
41.GNU Scientific Library Reference Manual—Read online. https://www.e-booksdirectory.com/details.php?ebook=3457.
42.Payrovnaziri S. N. et al. Explainable artificial intelligence models using real-world electronic health record data: a systematic scoping review. J. Am. Med. Inform. Assoc. JAMIA 27, 1173–1185 (2020). doi: 10.1093/jamia/ocaa053 [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Rajkomar A. et al. Scalable and accurate deep learning for electronic health records. Npj Digit. Med. 1, 18 (2018). doi: 10.1038/s41746-018-0029-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Franz, L., Shrestha, Y. R. & Paudel, B. A Deep Learning Pipeline for Patient Diagnosis Prediction Using Electronic Health Records. ArXiv200616926 Cs (2020).
45.Miotto R., Li L., Kidd B. A. & Dudley J. T. Deep Patient: An Unsupervised Representation to Predict the Future of Patients from the Electronic Health Records. Sci. Rep. 6, 26094 (2016). doi: 10.1038/srep26094 [DOI] [PMC free article] [PubMed] [Google Scholar]
46.Heckerman, D., Geiger, D. & Chickering, D. M. Learning Bayesian Networks: The Combination of Knowledge and Statistical Data. ArXiv13026815 Cs (2015).

PLOS Digit Health. doi: 10.1371/journal.pdig.0000004.r001

Decision Letter 0

Henry Horng-Shing Lu, Mecit Can Emre Simsekler

Transfer Alert

This paper was transferred from another journal. As a result, its full editorial history (including decision letters, peer reviews and author responses) may not be present.

11 Oct 2021

PDIG-D-21-00066

An Explainable Artificial Intelligence Approach for Predicting Cardiovascular Outcomes using Electronic Health Records

PLOS Digital Health

Dear Dr. Wesolowski,

Thank you for submitting your manuscript to PLOS Digital Health. After careful consideration, we feel that it has merit but does not fully meet PLOS Digital Health’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Dec 10 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at digitalhealth@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pdig/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

We look forward to receiving your revised manuscript.

Kind regards,

Mecit Can Emre Simsekler, Ph.D.

Academic Editor

PLOS Digital Health

Journal Requirements:

1. We ask that a manuscript source file is provided at Revision. Please upload your manuscript file as a .doc, .docx, .rtf or .tex. If you are providing a .tex file, please upload it under the item type ‘LaTeX Source File’ and leave your .pdf version as the item type ‘Manuscript’.

2. Please provide separate figure files in .tif or .eps format only, and remove any figures embedded in your manuscript file. If you are using LaTeX, you do not need to remove embedded figures.

For more information about figure files please see our guidelines: https://journals.plos.org/digitalhealth/s/figures

3. We have noticed that you have uploaded supporting information but you have not included a list of legends. Please add a full list of legends for all supporting information files (including figures, table and data files) after the references list.

4. Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

Additional Editor Comments (if provided):

For the general readership of the journal, it would be better if you could re-organize the section headings in a typical order, e.g., introduction, methods, results, discussion and conclusions. Accordingly, that would be great if you could add a short conclusion section highlighting the limitations of the study and directions for future research.

Please also ensure that the link on ‘data availability’ section you provided is accessible.

https://pbc.genetics.utah.edu/lemmon2021/bayes/bayes

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Does this manuscript meet PLOS Digital Health’s publication criteria? Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe methodologically and ethically rigorous research with conclusions that are appropriately drawn based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: I don't know

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available (please refer to the Data Availability Statement at the start of the manuscript PDF file)?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception. The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS Digital Health does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The authors used a comorbidity discovery method to analyse EHRs for comorbid diagnoses, procedures and medications. I believe that the paper fits well with the journal’s scope. I have a few suggestions for improving the understandability of the manuscript for a broader range of readers.

I understand that some of the authors developed the method, and the manuscript refers to the author’s previous paper where they explain the PBC. Yet, still, I would advise authors to provide more information on the method used.

I see that the authors highlighted some of the pros of using PBC, but could you please expand it? Why PBC? For instance, can’t we discover temporal relationships and quantify transition rates between comorbidities using other methods? What does PBC add to other methods?

Could you please check the links provided in the manuscript?; the links do not work.

Did your study provide some significant findings that have not been discovered yet? What does it add to the current knowledge on cardiovascular comorbidities? Would you please highlight accordingly? How can these findings be useful in practice?

Could you please check Figure 4B? Some texts are overlapped.

The authors might also suggest some directions for future research.

Reviewer #2: Overall this is a throughout and interesting approach to identifying how multiple conditionally dependent variables can predict certain cardiovascular outcomes using PBC. I especially found Table 1 to drive the central point that PBC can be a powerful statistical tool in smaller sample sizes as compared with chi-squared analyses for the same variables. The other illustrations are easy to follow and support the central arguments made in the text. Overall the case is well made for the utility of PBC derived multi morbidity networks in analyzing large EHR datasets.

There is an issue with figure 4B. I believe it is a duplicated image of figure 5A. The description of Figure 4B is totally dissimilar from what is depicted in the graphic. Please consider editing figure 4B prior to publication.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

Do you want your identity to be public for this peer review? If you choose “no”, your identity will remain anonymous but your review may still be made public.

For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLOS Digit Health. 2022 Jan 18;1(1):e0000004. doi: 10.1371/journal.pdig.0000004.r002

Author response to Decision Letter 0

11 Nov 2021

Attachment

Submitted filename: Response to Reviewers.docx

Click here for additional data file.^{(9KB, docx)}

PLOS Digit Health. doi: 10.1371/journal.pdig.0000004.r003

Decision Letter 1

Henry Horng-Shing Lu, Mecit Can Emre Simsekler

17 Nov 2021

An Explainable Artificial Intelligence Approach for Predicting Cardiovascular Outcomes using Electronic Health Records

PDIG-D-21-00066R1

Dear Dr. Wesolowski,

We're pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you'll receive an e-mail detailing the required amendments. When these have been addressed, you'll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at https://www.editorialmanager.com/pdig/ click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they'll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact digitalhealth@plos.org.

Kind regards,

Mecit Can Emre Simsekler, Ph.D.

Academic Editor

PLOS Digital Health

Additional Editor Comments (optional):

Reviewers' comments:

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Fig. Distribution density plot of mother’s age at pregnancy, with and without hypertension complicating pregnancy.

Blue line: mothers with diagnosis of hypertension complicating pregnancy (N = 11,523 mothers). Red line: mothers without diagnosis of hypertension complicating pregnancy (N = 113,491 mothers).

(TIF)

Click here for additional data file.^{(3.9MB, tif)}

S1 Table. Overview of Utah Data Resource.

(TIF)

Click here for additional data file.^{(2.6MB, tif)}

S2 Table. Demographic variables and the Utah Data Resource.

(TIF)

Click here for additional data file.^{(5.5MB, tif)}

S3 Table. Multimorbidity Landscape of Sinoatrial Node Dysfunction (SND) in adults.

Risk and fold-change risk estimates calculated from the multimorbidity network in Fig 4B main text. For detailed description of the clinical variables, please refer to S5 Table.

(TIF)

Click here for additional data file.^{(4.5MB, tif)}

S4 Table. Risks of Cardiac or Nervous System Congenital Anomalies as a Function of Comorbid Clinical Variables.

(TIF)

Click here for additional data file.^{(100.6KB, tif)}

S5 Table. Reference table for EHR coding.

(TIF)

Click here for additional data file.^{(19MB, tif)}

Attachment

Submitted filename: Response to Reviewers.docx

Click here for additional data file.^{(9KB, docx)}

Data Availability Statement

[pdig.0000004.ref001] 1.Valderas J. M., Starfield B., Sibbald B., Salisbury C. & Roland M. Defining Comorbidity: Implications for Understanding Health and Health Services. Ann. Fam. Med. 7, 357–363 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000004.ref002] 2.Kraisangka J. et al. Bayesian Network vs. Cox’s Proportional Hazard Model of PAH Risk: A Comparison. in Artificial Intelligence in Medicine (eds. Riaño D., Wilk S. & ten Teije A.) 139–149 (Springer International Publishing, 2019). doi: 10.1007/s11906-019-0950-y [DOI] [Google Scholar]

[pdig.0000004.ref003] 3.Capobianco E. & Lio P. Comorbidity: a multidimensional approach. Trends Mol. Med. 19, 515–521 (2013). doi: 10.1016/j.molmed.2013.07.004 [DOI] [PubMed] [Google Scholar]

[pdig.0000004.ref004] 4.Guo M. et al. Analysis of disease comorbidity patterns in a large-scale China population. BMC Med. Genomics 12, 177 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000004.ref005] 5.Hu J. X., Thomas C. E. & Brunak S. Network biology concepts in complex disease comorbidities. Nat. Rev. Genet. 17, 615–629 (2016). doi: 10.1038/nrg.2016.87 [DOI] [PubMed] [Google Scholar]

[pdig.0000004.ref006] 6.Akram P. & Liao L. Prediction of comorbid diseases using weighted geometric embedding of human interactome. BMC Med. Genomics 12, 161 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000004.ref007] 7.Rank N. et al. Deep-learning-based real-time prediction of acute kidney injury outperforms human predictive performance. Npj Digit. Med. 3, 1–12 (2020). doi: 10.1038/s41746-019-0211-0 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000004.ref008] 8.Gutiérrez-Sacristán A. et al. comoRbidity: an R package for the systematic analysis of disease comorbidities. Bioinformatics 34, 3228–3230 (2018). doi: 10.1093/bioinformatics/bty315 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000004.ref009] 9.Moni M. A. & Liò P. comoR: a software for disease comorbidity risk assessment. J. Clin. Bioinforma. 4, 8 (2014). doi: 10.1186/2043-9113-4-8 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000004.ref010] 10.Lemmon G., Wesolowski S., Henrie A., Tristani-Firouzi M. & Yandell M. A Poisson binomial-based statistical testing framework for comorbidity discovery across electronic health record datasets. Nat. Comput. Sci. 1, 694–702 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000004.ref011] 11.Aguado A., Moratalla-Navarro F., López-Simarro F. & Moreno V. MorbiNet: multimorbidity networks in adult general population. Analysis of type 2 diabetes mellitus comorbidity. Sci. Rep. 10, 2416 (2020). doi: 10.1038/s41598-020-59336-1 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000004.ref012] 12.Xu H., Moni M. A. & Lio P. CytoCom: A Cytoscape app to visualize, query and analyse disease comorbidity networks. Bioinforma. Oxf. Engl. 31, (2014). doi: 10.1093/bioinformatics/btu731 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000004.ref013] 13.Ronzano F., Gutiérrez-Sacristán A. & Furlong L. I. Comorbidity4j: a tool for interactive analysis of disease comorbidities over large patient datasets. Bioinformatics 35, 3530–3532 (2019). doi: 10.1093/bioinformatics/btz061 [DOI] [PubMed] [Google Scholar]

[pdig.0000004.ref014] 14.Barredo Arrieta A. et al. Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI. Inf. Fusion 58, 82–115 (2020). [Google Scholar]

[pdig.0000004.ref015] 15.Amann J. et al. Explainability for artificial intelligence in healthcare: a multidisciplinary perspective. BMC Med. Inform. Decis. Mak. 20, 310 (2020). doi: 10.1186/s12911-020-01332-6 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000004.ref016] 16.Anguita-Ruiz A., Segura-Delgado A., Alcalá R., Aguilera C. M. & Alcalá-Fdez J. eXplainable Artificial Intelligence (XAI) for the identification of biologically relevant gene expression patterns in longitudinal human studies, insights from obesity research. PLoS Comput. Biol. 16, e1007792 (2020). doi: 10.1371/journal.pcbi.1007792 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000004.ref017] 17.Gordon L., Grantcharov T. & Rudzicz F. Explainable Artificial Intelligence for Safe Intraoperative Decision Support. JAMA Surg. 154, 1064–1065 (2019). doi: 10.1001/jamasurg.2019.2821 [DOI] [PubMed] [Google Scholar]

[pdig.0000004.ref018] 18.Lamy J.-B., Sekar B., Guezennec G., Bouaud J. & Séroussi B. Explainable artificial intelligence for breast cancer: A visual case-based reasoning approach. Artif. Intell. Med. 94, 42–53 (2019). doi: 10.1016/j.artmed.2019.01.001 [DOI] [PubMed] [Google Scholar]

[pdig.0000004.ref019] 19.Lauritsen S. M. et al. Explainable artificial intelligence model to predict acute critical illness from electronic health records. Nat. Commun. 11, 3852 (2020). doi: 10.1038/s41467-020-17431-x [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000004.ref020] 20.London A. J. Artificial Intelligence and Black-Box Medical Decisions: Accuracy versus Explainability. Hastings Cent. Rep. 49, 15–21 (2019). doi: 10.1002/hast.973 [DOI] [PubMed] [Google Scholar]

[pdig.0000004.ref021] 21.Wang H. et al. Predicting Hospital Readmission via Cost-Sensitive Deep Learning. IEEE/ACM Trans. Comput. Biol. Bioinform. 15, 1968–1978 (2018). doi: 10.1109/TCBB.2018.2827029 [DOI] [PubMed] [Google Scholar]

[pdig.0000004.ref022] 22.Arora P. et al. Bayesian Networks for Risk Prediction Using Real-World Data: A Tool for Precision Medicine. Value Health 22, 439–445 (2019). doi: 10.1016/j.jval.2019.01.006 [DOI] [PubMed] [Google Scholar]

[pdig.0000004.ref023] 23.Neuberg L. G. CAUSALITY: MODELS, REASONING, AND INFERENCE, by Judea Pearl, Cambridge University Press, 2000. Econom. Theory 19, 675–685 (2003). [Google Scholar]

[pdig.0000004.ref024] 24.Pearl, J. Reverend bayes on inference engines: a distributed hierarchical approach. in Proceedings of the Second AAAI Conference on Artificial Intelligence 133–136 (AAAI Press, 1982).

[pdig.0000004.ref025] 25.Pearl J. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. (Morgan Kaufmann Publishers Inc., 1988). [Google Scholar]

[pdig.0000004.ref026] 26.McLachlan S., Dube K., Hitman G. A., Fenton N. E. & Kyrimi E. Bayesian networks in healthcare: Distribution by medical condition. Artif. Intell. Med. 107, 101912 (2020). doi: 10.1016/j.artmed.2020.101912 [DOI] [PubMed] [Google Scholar]

[pdig.0000004.ref027] 27.Oniśko A. & Druzdzel M. J. Impact of precision of Bayesian network parameters on accuracy of medical diagnostic systems. Artif. Intell. Med. 57, 197–206 (2013). doi: 10.1016/j.artmed.2013.01.004 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000004.ref028] 28.Schreiber, J. Pomegranate: fast and flexible probabilistic modeling in python. ArXiv171100137 Cs Stat (2018).

[pdig.0000004.ref029] 29.Wolf P. A., Dawber T. R., Thomas H. E. & Kannel W. B. Epidemiologic assessment of chronic atrial fibrillation and risk of stroke: the Framingham study. Neurology 28, 973–977 (1978). doi: 10.1212/wnl.28.10.973 [DOI] [PubMed] [Google Scholar]

[pdig.0000004.ref030] 30.Wolf P. A., Abbott R. D. & Kannel W. B. Atrial fibrillation as an independent risk factor for stroke: the Framingham Study. Stroke 22, 983–988 (1991). doi: 10.1161/01.str.22.8.983 [DOI] [PubMed] [Google Scholar]

[pdig.0000004.ref031] 31.John R. M. & Kumar S. Sinus Node and Atrial Arrhythmias. Circulation 133, 1892–1900 (2016). doi: 10.1161/CIRCULATIONAHA.116.018011 [DOI] [PubMed] [Google Scholar]

[pdig.0000004.ref032] 32.Rissanen J. Modeling by shortest data description. Automatica 14, 465–471 (1978). [Google Scholar]

[pdig.0000004.ref033] 33.Agusti A. & Faner R. Lung function trajectories in health disease. Lancet Respir. Med. 7, 358–364 (2019). doi: 10.1016/S2213-2600(18)30529-0 [DOI] [PubMed] [Google Scholar]

[pdig.0000004.ref034] 34.Burckhardt P., Nagin D. S. & Padman R. Multi-Trajectory Models of Chronic Kidney Disease Progression. AMIA Annu. Symp. Proc. AMIA Symp. 2016, 1737–1746 (2016). [PMC free article] [PubMed] [Google Scholar]

[pdig.0000004.ref035] 35.Reed E. & Corner J. Defining the illness trajectory of metastatic breast cancer. BMJ Support. Palliat. Care 5, 358–365 (2015). doi: 10.1136/bmjspcare-2012-000415 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000004.ref036] 36.Siggaard T. et al. Disease trajectory browser for exploring temporal, population-wide disease progression patterns in 7.2 million Danish patients. Nat. Commun. 11, 4952 (2020). doi: 10.1038/s41467-020-18682-4 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000004.ref037] 37.Koivisto M. & Sood K. Exact Bayesian Structure Discovery in Bayesian Networks. J. Mach. Learn. Res. 5, 549–573 (2004). [Google Scholar]

[pdig.0000004.ref038] 38.Yuan, C., Malone, O. & Wu, X. Learning optimal Bayesian networks using A* search. in In Proceedings of the 22nd International Joint Conference on Artificial Intelligence (2011).

[pdig.0000004.ref039] 39.Weiss Y. Correctness of Local Probability Propagation in Graphical Models with Loops. Neural Comput. 12, 1–41 (2000). doi: 10.1162/089976600300015880 [DOI] [PubMed] [Google Scholar]

[pdig.0000004.ref040] 40.The graph-tool python library. (2014) doi: 10.6084/m9.figshare.1164194.v14 [DOI]

[pdig.0000004.ref041] 41.GNU Scientific Library Reference Manual—Read online. https://www.e-booksdirectory.com/details.php?ebook=3457.

[pdig.0000004.ref042] 42.Payrovnaziri S. N. et al. Explainable artificial intelligence models using real-world electronic health record data: a systematic scoping review. J. Am. Med. Inform. Assoc. JAMIA 27, 1173–1185 (2020). doi: 10.1093/jamia/ocaa053 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000004.ref043] 43.Rajkomar A. et al. Scalable and accurate deep learning for electronic health records. Npj Digit. Med. 1, 18 (2018). doi: 10.1038/s41746-018-0029-1 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000004.ref044] 44.Franz, L., Shrestha, Y. R. & Paudel, B. A Deep Learning Pipeline for Patient Diagnosis Prediction Using Electronic Health Records. ArXiv200616926 Cs (2020).

[pdig.0000004.ref045] 45.Miotto R., Li L., Kidd B. A. & Dudley J. T. Deep Patient: An Unsupervised Representation to Predict the Future of Patients from the Electronic Health Records. Sci. Rep. 6, 26094 (2016). doi: 10.1038/srep26094 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pdig.0000004.ref046] 46.Heckerman, D., Geiger, D. & Chickering, D. M. Learning Bayesian Networks: The Combination of Knowledge and Statistical Data. ArXiv13026815 Cs (2015).

PERMALINK

An explainable artificial intelligence approach for predicting cardiovascular outcomes using electronic health records

Sergiusz Wesołowski

Gordon Lemmon

Edgar J Hernandez

Alex Henrie

Thomas A Miller

Derek Weyhrauch

Michael D Puchalski

Bruce E Bray

Rashmee U Shah

Vikrant G Deshmukh

Rebecca Delaney

H Joseph Yost

Karen Eilbeck

Martin Tristani-Firouzi

Mark Yandell

Roles

Abstract

Introduction

Results

PBC is well powered for discovery of cardiovascular comorbidities

Table 1. PBC is well powered for comorbidity discovery on demographically complex datasets, unlike stratification.

Comorbidities of heart transplant

Fig 1. Percent of medical terms influenced by various demographic features.

Fig 2. Patient Disease Network for the Utah Data Resource.

Multimorbidity network for heart transplant supports conditional outcome risk calculations

Fig 3. Multimorbidity Landscape of Heart Transplant.

Multimorbidity network for sinoatrial node dysfunction supports multimorbidity risk calculations for a range of clinical and demographic health predictors

Fig 4. Multimorbidity Landscape of Sinoatrial Node Dysfunction (SND).

Multimorbidities of congenital malformations augmented by maternal health data

Fig 5. Impact of maternal health on congenital anomalies in the child.

Web-based outcomes calculators

Methods

Ethics statement

Utah data resource

Patient disease network

Multimorbidity networks

Confidence values

Discussion

Conclusion

Supporting information

Acknowledgments

Data Availability

Funding Statement

References

Decision Letter 0

Henry Horng-Shing Lu

Mecit Can Emre Simsekler

Roles

Transfer Alert

Author response to Decision Letter 0

Decision Letter 1

Henry Horng-Shing Lu

Mecit Can Emre Simsekler

Roles

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases