Assessment of a Deep Learning Model to Predict Hepatocellular Carcinoma in Patients With Hepatitis C Cirrhosis

George N Ioannou; Weijing Tang; Lauren A Beste; Monica A Tincopa; Grace L Su; Tony Van; Elliot B Tapper; Amit G Singal; Ji Zhu; Akbar K Waljee

doi:10.1001/jamanetworkopen.2020.15626

. 2020 Sep 1;3(9):e2015626. doi: 10.1001/jamanetworkopen.2020.15626

Assessment of a Deep Learning Model to Predict Hepatocellular Carcinoma in Patients With Hepatitis C Cirrhosis

George N Ioannou ^1,², Weijing Tang ³, Lauren A Beste ⁴, Monica A Tincopa ⁵, Grace L Su ^5,⁶, Tony Van ⁷, Elliot B Tapper ^5,⁶, Amit G Singal ⁸, Ji Zhu ^3,⁷, Akbar K Waljee ^5,^6,^7,^✉

¹Division of Gastroenterology, Department of Medicine, Veterans Affairs Puget Sound Healthcare System and University of Washington, Seattle

²Research and Development, Veterans Affairs Puget Sound Healthcare System, Seattle, Washington

³Department of Statistics, University of Michigan, Ann Arbor

⁴Division of General Internal Medicine, Department of Medicine, Veterans Affairs Puget Sound Healthcare System and University of Washington, Seattle

⁵Michigan Medicine, Division of Gastroenterology and Hepatology, Department of Internal Medicine, Ann Arbor

⁶VA Ann Arbor Health Services Research and Development Center of Clinical Management Research, Ann Arbor, Michigan

⁷Michigan Integrated Center for Health Analytics and Medical Prediction (MiCHAMP), Ann Arbor

⁸Division of Gastroenterology, Department of Medicine, University of Texas Southwestern, Dallas

Accepted for Publication: June 21, 2020.

Published: September 1, 2020. doi:10.1001/jamanetworkopen.2020.15626

^✉

Corresponding Author: Akbar K. Waljee, MD, MSc, Michigan Medicine, Division of Gastroenterology and Hepatology, Department of Internal Medicine, 2215 Fuller Rd, Gastroenterology 111D, Ann Arbor, MI 48105 (awaljee@med.umich.edu).

Author Contributions: Dr Waljee had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis. Drs Ioannou and Tang and joint first authors.

Concept and design: Ioannou, Tincopa, Su, Tapper, Zhu, Waljee.

Acquisition, analysis, or interpretation of data: Ioannou, Tang, Beste, Van, Tapper, Singal, Zhu, Waljee.

Drafting of the manuscript: Ioannou, Tang, Beste, Su, Van, Zhu, Waljee.

Critical revision of the manuscript for important intellectual content: Ioannou, Tang, Beste, Tincopa, Su, Tapper, Singal, Zhu, Waljee.

Statistical analysis: Ioannou, Tang, Van, Zhu, Waljee.

Obtained funding: Ioannou, Su.

Administrative, technical, or material support: Waljee.

Supervision: Ioannou, Zhu, Waljee.

Conflict of Interest Disclosures: Dr Ioannou reported receiving grants from the National Institutes of Health National Cancer Institute and the US Department of Veterans Affairs Clinical Science Research and Development during the conduct of the study. Dr Su reported receiving grants from the Veterans Administration during the conduct of the study. Dr Tapper reported receiving personal fees from Axcella, Novo Nordisk, Kaleido, Allergan, Novartis, Mallinckrodt, and Rebiotix; receiving grants and personal fees from Bausch; and receiving grants from Gilead outside the submitted work. Dr Singal reported receiving grants and personal fees from Gilead and Abbvie outside the submitted work. Drs Waljee and Zhu reporting receiving support from the Michigan Integrated Center for Health Analytics and Medical Prediction in the Institute for Healthcare Policy and Innovation at the University of Michigan Medical School. Dr Waljee reported receiving grants from the US Department of Defense and the US Department of Veterans Affairs Health Service Research and Development during the conduct of the study. No other disclosures were reported.

Funding/Support: Drs Waljee, Beste, Ioannou, Su are funded by IIR 16-024 from the United States (U.S.) Department of Veterans Affairs Health Services Research and Development Service.

Role of the Funder/Sponsor: The funders had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication.

Disclaimer: The views expressed in this article are those of the authors and do not necessarily represent the views of the Department of Veterans Affairs or the US government.

^✉

Corresponding author.

PMCID: PMC7489819 PMID: 32870314

Key Points

Question

Can deep learning recurrent neural network (RNN) models using raw longitudinal data extracted directly from electronic health records outperform conventional regression models in predicting the risk of developing hepatocellular carcinoma (HCC)?

Findings

This prognostic study included 48 151 patients with hepatitis C virus (HCV)–related cirrhosis in the national Veterans Health Administration who had at least 3 years of follow-up after the diagnosis of cirrhosis. Deep learning RNN models outperformed conventional linear regression models and could be used to identify patients with HCV-related cirrhosis at high risk of developing HCC.

Meaning

The findings of this study suggest that RNN models could have multiple applications in clinical practice and could be applied to HCC outreach and surveillance strategies.

This prognostic study examines whether deep learning recurrent neural network (RNN) models that use raw longitudinal data extracted directly from electronic health records outperform conventional regression models in predicting the risk of developing hepatocellular carcinoma (HCC).

Abstract

Importance

Deep learning, a family of machine learning models that use artificial neural networks, has achieved great success at predicting outcomes in nonmedical domains.

Objective

To examine whether deep learning recurrent neural network (RNN) models that use raw longitudinal data extracted directly from electronic health records outperform conventional regression models in predicting the risk of developing hepatocellular carcinoma (HCC).

Design, Setting, and Participants

This prognostic study included 48 151 patients with hepatitis C virus (HCV)–related cirrhosis in the national Veterans Health Administration who had at least 3 years of follow-up after the diagnosis of cirrhosis. Patients were identified by having at least 1 positive HCV RNA test between January 1, 2000, to January 1, 2016, and were followed up from the diagnosis of cirrhosis to January 1, 2019, for the development of incident HCC. A total of 3 models predicting HCC during a 3-year period were developed and compared, as follows: (1) logistic regression (LR) with cross-sectional inputs (cross-sectional LR); (2) LR with longitudinal inputs (longitudinal LR); and (3) RNN with longitudinal inputs. Data analysis was conducted from April 2018 to August 2020.

Exposures

Development of HCC.

Main Outcomes and Measures

Area under the receiver operating characteristic curve, area under the precision-recall curve, and Brier score.

Results

During a mean (SD) follow-up of 11.6 (5.0) years, 10 741 of 48 151 patients (22.3%) developed HCC (annual incidence, 3.1%), and a total of 52 983 samples (51 948 [98.0%] from men) were collected. Patients who developed HCC within 3 years were older than patients who did not (mean [SD] age, 58.2 [6.6] years vs 56.9 [6.9] years). RNN models had superior mean (SD) area under the receiver operating characteristic curve (0.759 [0.009]) and mean (SD) Brier score (0.136 [0.003]) than cross-sectional LR (0.689 [0.009] and 0.149 [0.003], respectively) and longitudinal LR (0.682 [0.007] and 0.150 [0.003], respectively) models. Using the RNN model, the samples with the mean (SD) highest 51% (1.5%) of HCC risk, in which 80% of all HCCs occurred, or the mean (SD) highest 66% (1.2%) of HCC risk, in which 90% of all HCCs occurred, could potentially be targeted. Among samples from patients who achieved sustained virologic response, the performance of the RNN models was even better (mean [SD] area under receiver operating characteristic curve, 0.806 [0.025]; mean [SD] Brier score, 0.117 [0.007]).

Conclusions and Relevance

In this study, deep learning RNN models outperformed conventional LR models, suggesting that RNN models could be used to identify patients with HCV-related cirrhosis with a high risk of developing HCC for risk-based HCC outreach and surveillance strategies.

Introduction

Patients with chronic hepatitis C virus (HCV) infection have a high risk of developing hepatocellular carcinoma (HCC). The risk of HCC increases among patients with HCV infection when they develop advanced fibrosis or cirrhosis. Conversely, the risk decreases after HCV eradication,^1,2,3,4,5 which is becoming increasingly common. Many other factors are known to be associated with increased risk of HCC among patients with HCV or cirrhosis, including low platelet count, increased aspartate transaminase–to–alanine aminotransferase (AST:ALT) ratio, male sex, and older age.⁶ Among patients with HCV-related cirrhosis, the annual risk of HCC varies from less than 1% to more than 5%, depending on a number of readily available, well-described risk factors. Conventional regression models have recently been developed to estimate the risk of HCC in patients with HCV according to the presence or absence of cirrhosis, response to antiviral treatment, and a small number of routinely available baseline clinical characteristics.⁶

HCC risk prediction in patients with HCV infection is particularly difficult because it can fluctuate over time. The development of cirrhosis and the eradication of HCV represent major transition points at which HCC risk changes drastically and abruptly. In addition, HCC risk changes more gradually as patients age or as portal hypertension worsens, liver stiffness increases, or platelet count declines over a period of years. Therefore, longitudinal models that can adequately capture the changes in these predictive factors over time could be ideally suited for HCC risk prediction. Recent advances in deep learning models have been shown to help learn feature representations of data and improve model performance in different domains, such as computer vision and natural language processing. Deep learning models have also been successfully applied to health care to predict clinical events,⁷ disease classification,⁸ and electronic health record (EHR) data augmentation.⁹ Among them, various types of model architectures have been used, such as feedforward neural networks, recurrent neural networks (RNNs), and convolutional neural networks.¹⁰ The recurrent mechanism used in RNNs can help capture temporal dynamics and long-term information over time; RNNs can also handle longitudinal data with varying lengths of follow-up. Therefore, RNNs are powerful and popular models for processing sequential data, such as time series, longitudinal clinical events, and clinician’s text notes. Our aim was to develop and examine the prediction accuracy of a deep learning model based on RNNs for predicting progression to HCC in a cohort of patients with chronic HCV infection in the Veterans Health Administration (VHA). Furthermore, we aimed to compare the performance of a deep learning RNN model with conventional logistic regression (LR) models.

Methods

Data Source

The VHA is the largest integrated health care system treating patients with HCV in the United States.¹¹ The VHA uses a single comprehensive electronic health care information network that integrates all care applications into a single, common database. We obtained data on all patients with chronic HCV in the VHA system using the Corporate Data Warehouse, a national, continually updated repository of health care data.¹² The study was approved by the institutional review boards of the Puget Sound and Ann Arbor VA Healthcare Systems. Patient consent was waived based on the utility of secondary data analysis given the number of patients involved. This study followed the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) reporting guideline.

Study Population

Using VHA Corporate Data Warehouse data, we identified 280 418 patients with at least 1 positive HCV RNA test during the 16-year period from January 1, 2000, to January 1, 2016, and retrospectively followed up their EHRs in the VHA system until January 1, 2019. We excluded 203 573 patients (72.6%) who were never diagnosed with cirrhosis because HCC screening is only recommended for patients with HCV after they develop advanced fibrosis or cirrhosis; we excluded an additional 3680 patients (1.3%) in whom the diagnosis of HCC preceded the diagnosis of cirrhosis. The diagnosis of cirrhosis was based on the presence of the International Classification of Diseases, Ninth Revision (ICD-9) or International Statistical Classification of Diseases and Related Health Problems, Tenth Revision (ICD-10) codes for cirrhosis or complications of cirrhosis (ie, gastroesophageal varices, encephalopathy, nonmalignant ascites, hepatorenal syndrome, hepatopulmonary syndrome) (eTable 1 in the Supplement), recorded at least twice in any inpatient or outpatient encounter. This approach has been validated and widely used in VHA-based studies by us^{4,13,14,15,16,17,18,19,20} and others.^21,22,23 The earliest date that any of these ICD-9 or ICD-10 codes were recorded was considered the date of cirrhosis diagnosis. Finally, because our aim was to develop longitudinal models predicting the development of HCC during a 3-year period, we excluded 25 014 patients (8.9%) who had less than 3 years of available follow-up from the diagnosis of cirrhosis to their last visit in the VA system. This resulted in a final analytic sample of 48 151 patients with HCV-related cirrhosis and at least 3 years of follow-up after the diagnosis of cirrhosis, of whom 10 741 (22.3%) developed HCC during follow-up.

Diagnosis of HCC

The diagnosis of HCC was based on the presence of ICD-9 code 155.0 or ICD-10 code C22.0 (the VHA began using ICD-10 codes on October 1, 2015), recorded at least twice. The ICD-9 code–based definition of HCC using VHA records has been shown to have a positive predictive value of 84% to 94% compared with medical record extraction^23,24,25 and has been widely used by us^17,18,20,26 and other investigators.^27,28,29

Predictor Variables

We used 2 types of predictor variables for HCC prediction, as follows: (1) 4 baseline predictors, which do not change over time, ie, age at cirrhosis diagnosis, sex, race, and HCV genotype; and (2) 27 longitudinal predictors, which may change over time and are available at multiple times during follow-up, including development of cirrhosis, achievement of sustained virologic response (SVR), body mass index (calculated as weight in kilograms divided by height in meters squared), and 24 laboratory blood tests (bilirubin, AST, AST–upper limit of normal (ULN) ratio, ALT, ALT:ULN ratio, α-fetoprotein, α-fetoprotein–ULN ratio, alkaline phosphatase, alkaline phosphatase–ULN ratio, albumin, AST:ALT ratio, fibrosis-4 (FIB-4) score,³⁰ AST-platelet ratio index (APRI), blood urea nitrogen, creatinine, glucose, international normalized ratio, hemoglobin, white blood cell count, platelet count, sodium, potassium, chloride, and total protein). The development of cirrhosis is a longitudinal indicator, which starts with 0 and changes to 1 at the date of diagnosis of cirrhosis. Therefore, it indicates not only whether a patient developed cirrhosis but also how long the patient had cirrhosis. To determine SVR, we identified all antiviral treatment regimens (interferon and/or direct-acting antiviral agents) and whether they resulted in SVR, defined as a serum HCV RNA viral load below the lower limit of detection performed at least 12 weeks after the end of treatment.³¹ Also, to identify any patients who might have achieved SVR as a result of antiviral treatment received outside the VHA, we defined SVR in patients with a prior positive HCV viral load who had subsequent persistent negative viral loads.

Rationale for Cases and Controls Used in Model Building

We wanted to simulate the clinically relevant scenario in which a physician would like to estimate the probability that a specific patient with cirrhosis will develop HCC within the following 3 years from the time of the clinic visit using all available information at the time of the clinic visit. To do this we analyzed cases and controls by sampling random clinic visits as follows (Figure 1A).

Figure 1. — A, Patients with hepatitis C virus infection who had a diagnosis of cirrhosis and at least 3 years of follow-up from the time of diagnosis of cirrhosis to their last follow-up visit in the Veterans Healthcare Administration (VHA) were identified. Patients who developed hepatocellular carcinoma (HCC) within 3 years of time t after the development of cirrhosis were designated cases, and those who did not were designated controls. All data available at or before time t were used as predictors of the development of cirrhosis within 3 years of time t. The first and third examples are for patients who developed HCC during follow-up; the second example is for a patient who did not develop HCC during follow-up. B, Schematic comparison of the 3 different models we developed to predict HCC development (ie, model 1, logistic regression using cross-sectional baseline data at time t; model 2, logistic regression using human-designed longitudinal data prior to time t; and model 3, recurrent neural networks using raw longitudinal data prior to time t). C, Model structure of longitudinal recurrent neural network under 1 representative splitting.

Cases

We identified 10 738 patients who developed HCC during follow-up and had at least 1 visit after the diagnosis of cirrhosis within 3 years before the diagnosis of HCC. We randomly sampled 1 visit (time t) for each patient. Thus, we obtained 10 738 case samples in which HCC was diagnosed within 3 years of the sampled visit (time t) (Figure 1A).

Controls

For patients who did not develop HCC during follow-up (n = 37 410), we randomly sampled 1 visit (time t) after the diagnosis of cirrhosis that was at least 3 years before the date of last follow-up for each patient to ensure that no HCC developed within 3 years (Figure 1A). In addition, we wanted to capture visits of patients who developed HCC more than 3 years after time t as controls. We identified 4835 patients who had more than 3 years of follow-up from the date of cirrhosis diagnosis to the date of HCC diagnosis and included 1 control visit that occurred more than 3 years before the diagnosis of HCC (Figure 1A). Thus, we obtained 42 245 control samples, in which HCC was not diagnosed within 3 years of the sampled visit (time t). This resulted in 52 983 total samples from 48 151 patients.

Model Building and Feature Extraction

We developed and compared 3 different models predicting the probability of developing HCC within 3 years after time t (Figure 1B). First, we developed LR models using only the baseline, cross-sectional (cross-sectional LR) values of each of the predictors immediately prior to time t, which assumes the linearity of these variables and log odds. Sex, race, and HCV genotype were modeled as dummy categorical variables. Laboratory tests were modeled as continuous variables. We imputed missing values by the mean of nonmissing entries in the training data.

Second, we developed LR models as described earlier; however, we included the following 5 summary statistics for each of the longitudinal predictors (longitudinal LR), designed to capture longitudinal information available prior to time t: minimum, maximum, minimum of slope, maximum of slope, and total variation. The slope is defined as the ratio of difference of longitudinal predictor and the time gap between 2 consecutive visits. The total variation is defined as the mean of absolute value of slopes. We performed feature selection for the cross-sectional LR and longitudinal LR models using the lasso approach.³²

Third, we developed an RNN model that can use information from both the baseline predictors and the raw longitudinal predictors from the first visit in the VA until the time of visit (Figure 1C). We imputed missing values by filling the missing entries at the first visit by the mean of nonmissing entries of training data and by replacing the remaining missing entries at any time by looking backward, ie, the latest available nonmissing values prior to this time.

In contrast to conventional LR, which requires specific feature extraction, RNNs can handle a varied number of visits and irregular time gaps between 2 consecutive visits. It can also automatically learn features that are useful for prediction. Specifically, we used gated recurrent units,³³ an improved version of standard RNN that can better store long-term information. After combining temporal information from longitudinal predictors and time-invariant information from baseline predictors, we constructed a classifier using feedforward neural networks.³⁴ We used the Rectified Linear Unit (ReLU)³⁵ as the nonlinear activation function and used the sigmoid activation to return a risk probability between 0 and 1 in the output layer. To prevent overfitting, a dropout layer was added after a nonlinear activation.³⁶ All parameters were optimized through minimizing the binary cross-entropy loss with the Adam stochastic algorithm.³⁷

Statistical Analysis

We randomly split the cohort into a training set (90%) and a testing set (10%). We fit 3 models using the training set and evaluated prediction performances using the same testing set. We repeated this procedure 10 times and reported the mean performance characteristics on the testing set over 10 random splits. We report 2-sided P values of the paired sample t test when comparing the performance characteristics between LR models and the RNN model. Statistical significance was set at P < .05.

Performance characteristics to evaluate the models’ discrimination were assessed based on the area under the receiver operating characteristic curve (AUROC) and the area under the precision-recall curve (AUPRC).³⁸ We used the Brier score³⁹ to compare overall accuracy; a Brier score of 0 signifies perfect accuracy. Furthermore, we provided the mean of predicted probability and the proportion of actual positive outcomes for 3 risk categories according to thirds of predicted risk for each model to calibrate the probabilities; a well-calibrated model should obtain similar results for each category.

For the LR models with lasso penalty, a hyperparameter tuning was done by first identifying the optimal penalty coefficient based on a 5-fold cross-validation, and then the model was fit with the selected coefficient using the training set. For the RNN model, we fixed some hyperparameters to reduce the computational cost (hidden layers set at 2; dropout rate, 0.2; batch size, 256). We searched for the optimal hidden sizes of model structures because they are more sensitive in prediction performance based on our evaluation. We performed LR models without lasso penalty using the Scikit-learn library⁴⁰ in Python version 3.45.7 and LR models with lasso penalty by the glmnet R version 3.6.1 package⁴¹ (R Project for Statistical Computing). The numerical implementation of the RNN model was in PyTorch version 1.1.⁴²

Results

Characteristics of the Population Used for Model Building

Of 52 983 samples, most came from men (51 948 [98.0%]), as expected in a VHA population, with representation from multiple racial and ethnic groups (Table 1). Compared with control samples from patients who did not develop HCC within 3 years of the sampled visit, patients who developed HCC within 3 years were older when they were diagnosed with cirrhosis (mean [SD] age, 56.9 [6.9] years vs 58.2 [6.6] years), were more likely to have genotype 3 HCV (3256 [7.7%] vs 1212 [11.3%]), were less likely to achieve SVR (5680 [13.4%] vs 1192 [11.1%]), had been diagnosed with cirrhosis for a longer time (mean [SD] duration of cirrhosis diagnosis, 1.97 [2.61] years vs 2.72 [3.30] years), had higher serum AST, ALT, and bilirubin levels (mean [SD] AST: 71.7 [47.2] U/L vs 87.1 [49.5] U/L [to convert to microkatals per liter, multiply by 0.0167]; mean [SD] ALT, 70.1 [71.0] U/L vs 74.9 [58.5] U/L [to convert to microkatals per liter, multiply by 0.0167]; mean [SD] bilirubin, 1.2 [1.5] mg/dL vs 1.5 [1.5] mg/dL [to convert to millimoles per liter, multiply by 17.104]), had higher FIB-4 and APRI scores (mean [SD] FIB-4 score: 5.1 [4.7] vs 6.8 [5.2]; mean [SD] APRI score: 1.8 [2.0] vs 2.5 [2.2]), and had a lower mean (SD) platelet count (141.1 [74.5] ×10³/μL vs 121.0 [68.3] ×10³/μL [to convert to ×10⁹ per liter, multiply by 1.0]) at the sampled visit (time t).

Table 1. Characteristics of Controls and Cases Used in Model Building at the Sampled Visit.

Characteristic	Mean (SD)
Characteristic	Samples of patients who did not develop HCC within 3 y (n = 42 245)	Samples of patients who developed HCC within 3 y (n = 10 738)
Age at cirrhosis diagnosis, y	56.9 (6.9)	58.2 (6.6)
Men, No. (%)	41 315 (97.8)	10 633 (99.0)
Race/Ethnicity, No. (%)
White, non-Hispanic	23 681 (56.1)	5996 (55.8)
Black, non-Hispanic	10 805 (25.6)	2626 (24.5)
Hispanic, Asian, Pacific Island, AIAN, or other	4956 (11.7)	1323 (12.35)
Declined to answer or missing	2803 (6.6)	793 (7.4)
Genotype, No. (%)
1	30 702 (72.7)	7497 (69.8)
2	3289 (7.8)	637 (5.9)
3	3256 (7.7)	1212 (11.3)
≥4	356 (0.8)	93 (0.9)
Missing	4642 (11.0)	1299 (12.1)
Achieved SVR at time t, No. (%)	5680 (13.4)	1192 (11.1)
Duration of cirrhosis at time t, y	1.97 (2.61)	2.72 (3.30)
BMI at time t	28.7 (5.6)	28.1 (5.4)
Laboratory test results at time t
AST, U/L	71.7 (47.2)	87.1 (49.5)
ALT, U/L	70.1 (71.0)	74.9 (58.5)
Platelet count, ×10³/μL	141.1 (74.5)	121.0 (68.3)
Bilirubin, mg/dL	1.2 (1.5)	1.5 (1.5)
INR	1.2 (0.3)	1.2 (0.3)
Creatinine, mg/dL	1.1 (1.0)	1.0 (0.7)
FIB-4 score	5.1 (4.7)	6.8 (5.2)
APRI score	1.8 (2.0)	2.5 (2.2)

Open in a new tab

Abbreviations: AIAN, American Indian/Alaskan Native; ALT, alanine aminotransferase; AST, aspartate transaminase; APRI, AST to platelet ratio index; BMI, body mass index (calculated as weight in kilograms divided by height in meters squared); FIB-4, fibrosis-4; INR, international normalized ratio; SVR, sustained virologic response.

SI conversion factors: To convert ALT and AST to microkatals per liter, multiply by 0.0167; bilirubin to millimoles per liter, multiply by 17.104; creatinine to millimoles per liter, multiply by 88.4; and platelet count to ×10⁹ per liter, multiply by 1.0.

Model Performance Among All Samples

The RNN model resulted in significantly higher mean (SD) AUROC (0.759 [0.009]), a measure of discrimination, than the longitudinal LR (0.689 [0.009]) or cross-sectional LR (0.682 [0.007]) models without feature selection (P < .001 for both comparisons) (Table 2 and Figure 2A). The absolute value of the AUROC achieved by the RNN model is considered good. The RNN model achieved significantly higher mean (SD) AUPRC (0.479 [0.018]) than the longitudinal LR (0.361 [0.009]) or cross-sectional LR (0.345 [0.011]) models without feature selection (P <.001 for both comparisons). Also, the RNN model resulted in a significantly lower mean (SD) Brier score (0.136 [0.003]), a measure of overall accuracy, than the longitudinal LR (0.149 [0.003) or cross-sectional LR (0.150 [0.003]) models without feature selection (P <.001 for both comparisons). In comparison with the longitudinal LR model that used specific summary statistics of longitudinal predictors, the RNN model obtained significant improvement by automatically extracting useful features from raw longitudinal predictors.

Table 2. Comparison of the Performance Characteristics of 3 Different Models Predicting the Development of Hepatocellular Carcinoma Within 3 Years in Patients With Hepatitis C Virus–Related Cirrhosis.

Performance characteristic	Mean (SD)			P value compared with RNN model
Performance characteristic	Cross-sectional LR model	Longitudinal LR model	RNN model	P value compared with RNN model
All samples
AUROC	0.682 (0.007)	0.689 (0.009)	0.759 (0.009)	<.001
Brier score	0.150 (0.003)	0.149 (0.003)	0.136 (0.003)	<.001
AUPRC	0.345 (0.011)	0.361 (0.009)	0.479 (0.018)	<.001
Proportion of samples who test positive at 90% sensitivity	0.746 (0.008)	0.736 (0.013)	0.663 (0.012)	<.001
Specificity at 90% sensitivity	0.293 (0.010)	0.305 (0.016)	0.397 (0.014)	<.001
Positive predictive value at 90% sensitivity	0.243 (0.003)	0.246 (0.003)	0.273 (0.006)	<.001
Negative predictive value at 90% sensitivity	0.920 (0.006)	0.923 (0.007)	0.940 (0.003)	<.001
Proportion of samples who test positive at 80% sensitivity	0.601 (0.012)	0.591 (0.017)	0.514 (0.015)	<.001
Specificity at 80% sensitivity	0.449 (0.015)	0.462 (0.021)	0.558 (0.018)	<.001
Positive predictive value at 80% sensitivity	0.268 (0.007)	0.27 (30.009)	0.314 (0.009)	<.001
Negative predictive value at 80% sensitivity	0.898 (0.004)	0.901 (0.005)	0.916 (0.004)	<.001
Samples from patients who achieved SVR
AUROC	0.672 (0.030)	0.705 (0.024)	0.806 (0.025)	<.001
Brier score	0.139 (0.006)	0.136 (0.006)	0.117 (0.007)	<.001
AUPRC	0.333 (0.060)	0.361 (0.050)	0.519 (0.064)	<.001
Proportion of samples who test positive at 90% sensitivity	0.793 (0.041)	0.702 (0.028)	0.571 (0.052)	<.001
Specificity at 90% sensitivity	0.230 (0.050)	0.340 (0.035)	0.499 (0.064)	<.001
Positive predictive value at 90% sensitivity	0.205 (0.022)	0.230 (0.020)	0.285 (0.033)	<.001
Negative predictive value at 90% sensitivity	0.904 (0.013)	0.933 (0.006)	0.954 (0.005)	<.001
Proportion of samples who test positive at 80% sensitivity	0.628 (0.040)	0.559 (0.035)	0.429 (0.039)	<.001
Specificity at 80% sensitivity	0.409 (0.051)	0.492 (0.043)	0.651 (0.047)	<.001
Positive predictive value at 80% sensitivity	0.230 (0.026)	0.257 (0.023)	0.337 (0.036)	<.001
Negative predictive value at 80% sensitivity	0.898 (0.009)	0.914 (0.009)	0.934 (0.007)	<.001

Open in a new tab

Abbreviations: AUPRC, area under the precision-recall curve; AUROC, area under receiver operating characteristic curve; LR, logistic regression; RNN, recurrent neural network; SVR, sustain virologic response.

Figure 2. — We developed 3 different models predicting the development of hepatocellular carcinoma within 3 years under 1 representative splitting (results based on the testing set) in all samples from patients with hepatitis C virus (HCV)–related cirrhosis and samples from patients who achieved sustained virologic response (SVR) with HCV-related cirrhosis.

For the 2 LR models with feature selection, the lasso method only eliminated a small mean number of features and resulted in very similar AUROC, AUPRC, and Brier scores. These scores were inferior to those of the RNN model (eTable 2 in the Supplement).

Prioritizing Patients for HCC Screening Outreach Interventions

We envision that risk stratification models could be used to prioritize the patients with the highest risk for screening outreach interventions. Using the RNN model, we determined that 90% of all HCC diagnoses in the following 3 years occurred in samples with the mean (SD) highest 66% (1.2%) of risk scores, whereas 80% of HCCs occurred in samples with the mean (SD) highest 51% (1.5%) of risk scores. Thus, using the RNN model, we could potentially target the top 51% of samples with the highest HCC risk scores, in which 80% of all HCCs occurred, or the top 66% of samples with the highest HCC risk scores, in which 90% of all HCCs occurred. In contrast, the proportions that would need to be screened to include 80% or 90% of patients who would be diagnosed with HCC were much greater using the longitudinal LR and cross-sectional LR models (Table 2).

Model Calibration

All 3 models had excellent agreement between observed and predicted 3-year risks when the population was divided into 3 risk categories according to thirds of predicted risk for each model (Table 3), implying that all models were well calibrated. However, the RNN model achieved much greater separation between the first and third tertiles (predicted 3-year HCC risk 2% vs 25%) than the longitudinal LR (4% vs 19%) or cross-sectional LR (5% vs 18%) models.

Table 3. Comparison of Predicted and Observed 3-Year HCC Risk in the Study Population Divided Into Tertiles According to Each Model Under 1 Representative Splitting^a.

Tertile	Cross-sectional LR model			Longitudinal LR model			RNN model
	%		No. (%)	%		No. (%)	%		No. (%)
	Observed 3-y HCC risk	Predicted 3-y HCC risk	No. (%)	Observed 3-y HCC risk	Predicted 3-y HCC risk	No. (%)	Observed 3-y HCC risk	Predicted 3-y HCC risk	No. (%)
First, low risk	4	5	17661 (33.3)	4	4	17661 (33.3)	2	2	17661 (33.3)
Second, medium risk	10	9	17661 (33.3)	9	9	17661 (33.3)	7	8	17661 (33.3)
Third, high risk	18	18	17661 (33.3)	18	19	17661 (33.3)	24	25	17661 (33.3)

Open in a new tab

Abbreviations: HCC, hepatocellular carcinoma; LR, logistic regression; RNN, recurrent neural network.

^{^a}

The observed 3-year HCC risk is the proportion of those who developed HCC within 3 years among samples in each group. The predicted 3-year HCC risk is the mean of probabilities that are returned by models for samples in each group.

Model Performance Among Samples From Patients Who Achieved SVR and Men

Because most patients with HCV infection now undergo treatment with direct-acting antivirals and achieve SVR that reduces the risk of HCC, we evaluated our models’ performance characteristics among a subset of samples from patients who achieved SVR during follow-up. Mean (SD) AUROC (0.806 [0.025]), AUPRC (0.519 [0.064]), and Brier score (0.117 [0.007]) of the RNN model (Table 2 and Figure 2B) were all superior in the subset who achieved SVR than in the entire population; these scores continued to be superior to the LR models.

We further evaluated 3 models among samples from male patients, given that 98% of patients with HCV infection patients in the VA system were men. The performance characteristics of the RNN model on samples from male patients were very similar to those on the entire population and continued to be superior to the LR models (eTable 3 in the Supplement).

Discussion

The past decade has seen an explosion in the amount of medical information stored in electronic health records (EHRs). Such EHR data are potentially ideal for deep learning algorithms, but surprisingly few applications of deep learning have been developed that use EHR data to assist with diagnosis or prognosis.⁴³ We demonstrated an application for RNN models that outperformed conventional LR models in the prediction of HCC risk in patients with HCV-related cirrhosis, including those who achieve SVR following antiviral therapy.

Changes in many predictor variables over time can provide crucial prognostic information, but such changes are difficult or impossible to model using conventional regression modeling algorithms. RNNs are powerful methods for processing sequential data and have shown superior performance in many applications, such as machine translation.⁴⁴ The specific structure of RNNs can handle temporal data with varying length and capture long-term dependencies, which enables automatic feature learning needed for prediction from raw temporal data. This is especially important because it does not depend on human-engineered feature extraction and discovers novel patterns using all information within the analysis.

Our RNN models exhibited an AUROC of 0.759 among all samples and 0.806 among samples from patients with SVR, which is considered very good and compares favorably with other HCC risk prediction models.^6,45,46 Equally importantly, our models had great calibration and excellent agreement between observed and predicted HCC risk (Table 3).

We envision 2 areas of clinical implementation of HCC risk prediction models such as the RNN models we developed, aimed at improving HCC surveillance strategies. First, our models can be used to improve screening outreach efforts. Currently, less than 50% of patients with cirrhosis get regular HCC surveillance across most health care systems.^47,48 RNN models could be used to identify the patients with the highest risk, who could then be targeted for interventions to improve their uptake of HCC surveillance. For example, we demonstrated that by targeting the samples with the top 51% of HCC risk scores calculated by our RNN models, we would be including 80% of patients who would develop HCC in the next 3 years, while targeting the top 66% would include 90% of patients who would develop HCC. This is a much more effective strategy than our current first-come, first-served approach to outreach for HCC screening. Identifying and offering screening only to the patients with the highest risk could also be a plausible strategy in health care systems around the world that do not have the capacity to screen all at-risk patients. Assuming that a given maximum number of screening studies can be performed in such health care systems, these studies will lead to early diagnosis of HCC in a higher proportion of patients if they are targeted to patients with the highest HCC risk identified by our RNN models than if they are randomly distributed among all patients with cirrhosis.

Second, we envision that our RNN models could also be used in the future to identify high-risk patients for new surveillance strategies that are more effective than the current strategy of ultrasonography and α-fetoprotein, but are also more expensive (ie, risk-based screening). Many new surveillance strategies are being investigated in phase 2 and 3 studies and some are already available in clinical practice. For example, abbreviated magnetic resonance imaging protocols have been developed specifically for the purposes of HCC screening, which have much greater sensitivity and specificity than ultrasonography.^49,50,51 However, these examinations are substantially more expensive than ultrasonograph examinations and would have to be limited to high-risk patients. Also, multiple novel biomarker panels are being developed that could also be more cost-effective if they were combined with ultrasonography in high-risk patients.

Given that most patients with HCV infection are now expected to undergo treatment and achieve SVR and given that SVR reduces the risk of HCC, it is imperative that HCC risk prediction models incorporate SVR and predict well among patients who achieve SVR. Indeed, the performance of our RNN models was even better among samples from patients who achieved SVR than among the entire population.

Limitations

This study has limitations related to lack of external validation and the computational cost of running the analyses. To reduce computational cost, we only performed optimal search for some of the hyperparameters. Even so, the RNN model outperformed conventional LR models. Health care systems are now investing in the infrastructure to construct some of these complex models. For example, the VHA has collaborated with Google’s DeepMind to develop an RNN model for predicting acute kidney injury using national VHA data.⁴³ All deep learning neural network models, including ours, have limited interpretability due to their black-box nature, which may limit acceptability by clinicians. However, recent innovations allow for interpretable deep learning models by determining the proportion of the prediction attributed to each feature.^52,53

Conclusions

In this study, we demonstrated that RNN models that use raw longitudinal EHR data are superior to conventional LR models in estimating the risk of HCC in patients with HCV-related cirrhosis. RNN models such as ours could have multiple applications in clinical practice, provided they can be incorporated within EHR software systems.

Supplement.

eTable 1. ICD-10 and ICD-9 Codes for Cirrhosis, Decompensated Cirrhosis, and HCC

eTable 2. Feature Selection Frequency Over 10 Splits for the Longitudinal LR With Lasso Penalty

eTable 3. Comparison of the Performance Characteristics of 3 Different Models Predicting the Development of HCC Within 3 years in Male Patients with HCV-Related Cirrhosis

Click here for additional data file.^{(1.1MB, pdf)}

References

1.Morgan RL, Baack B, Smith BD, Yartel A, Pitasi M, Falck-Ytter Y. Eradication of hepatitis C virus infection and the development of hepatocellular carcinoma: a meta-analysis of observational studies. Ann Intern Med. 2013;158(5 Pt 1):329-337. doi: 10.7326/0003-4819-158-5-201303050-00005 [DOI] [PubMed] [Google Scholar]
2.Singal AK, Singh A, Jaganmohan S, et al. Antiviral therapy reduces risk of hepatocellular carcinoma in patients with hepatitis C virus-related cirrhosis. Clin Gastroenterol Hepatol. 2010;8(2):192-199. doi: 10.1016/j.cgh.2009.10.026 [DOI] [PubMed] [Google Scholar]
3.Singer AW, Reddy KR, Telep LE, et al. Direct-acting antiviral treatment for hepatitis C virus infection and risk of incident liver cancer: a retrospective cohort study. Aliment Pharmacol Ther. 2018;47(9):1278-1287. doi: 10.1111/apt.14593 [DOI] [PubMed] [Google Scholar]
4.Ioannou GN, Green PK, Berry K. HCV eradication induced by direct-acting antiviral agents reduces the risk of hepatocellular carcinoma. J Hepatol. 2017;S0168-8278(17)32273-0. doi: 10.1016/j.jhep.2017.08.030 [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Kanwal F, Kramer J, Asch SM, Chayanupatkul M, Cao Y, El-Serag HB. Risk of hepatocellular cancer in HCV patients treated with direct-acting antiviral agents. Gastroenterology. 2017;153(4):996-1005.e1. doi: 10.1053/j.gastro.2017.06.012 [DOI] [PubMed] [Google Scholar]
6.Ioannou GN, Green PK, Beste LA, Mun EJ, Kerr KF, Berry K. Development of models estimating the risk of hepatocellular carcinoma after antiviral treatment for hepatitis C. J Hepatol. 2018;69(5):1088-1098. doi: 10.1016/j.jhep.2018.07.024 [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Shickel B, Tighe PJ, Bihorac A, Rashidi P. Deep EHR: a survey of recent advances in deep learning techniques for electronic health record (EHR) analysis. IEEE J Biomed Health Inform. 2018;22(5):1589-1604. doi: 10.1109/JBHI.2017.2767063 [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Gulshan V, Peng L, Coram M, et al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA. 2016;316(22):2402-2410. doi: 10.1001/jama.2016.17216 [DOI] [PubMed] [Google Scholar]
9.Che Z, Cheng Y, Zhai S, Sun Z, Liu Y. Boosting deep learning risk prediction with generative adversarial networks for electronic health records. arXiv. Preprint posted online September 6, 2017. Accessed July 30, 2020. https://arxiv.org/abs/1709.01648 [Google Scholar]
10.Xiao C, Choi E, Sun J. Opportunities and challenges in developing deep learning models using electronic health records data: a systematic review. J Am Med Inform Assoc. 2018;25(10):1419-1428. doi: 10.1093/jamia/ocy068 [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Moon AM, Green PK, Berry K, Ioannou GN. Transformation of hepatitis C antiviral treatment in a national healthcare system following the introduction of direct antiviral agents. Aliment Pharmacol Ther. 2017;45(9):1201-1212. doi: 10.1111/apt.14021 [DOI] [PMC free article] [PubMed] [Google Scholar]
12.US Department of Veterans Affairs Corporate Data Warehouse. Accessed December 19, 2016. https://www.hsrd.research.va.gov/for_researchers/vinci/cdw.cfm
13.Moon AM, Dominitz JA, Ioannou GN, Lowy E, Beste LA. Use of antibiotics among patients with cirrhosis and upper gastrointestinal bleeding is associated with reduced mortality. Clin Gastroenterol Hepatol. 2016;14(11):1629-1637.e1. doi: 10.1016/j.cgh.2016.05.040 [DOI] [PubMed] [Google Scholar]
14.Lerrigo R, Beste LA, Leipertz SL, et al. Characteristics and outcomes of transjugular intrahepatic portosystemic shunt recipients in the VA Healthcare System. Eur J Gastroenterol Hepatol. 2016;28(6):667-675. doi: 10.1097/MEG.0000000000000604 [DOI] [PubMed] [Google Scholar]
15.Ioannou GN, Beste LA, Chang MF, et al. Effectiveness of sofosbuvir, ledipasvir/sofosbuvir, or paritaprevir/ritonavir/ombitasvir and dasabuvir regimens for treatment of patients with hepatitis C in the Veterans Affairs National Health Care System. Gastroenterology. 2016;151(3):457-471.e5. doi: 10.1053/j.gastro.2016.05.049 [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Ioannou GN, Bryson CL, Weiss NS, Boyko EJ. Associations between lipodystrophy or antiretroviral medications and cirrhosis in patients with HIV infection or HIV/HCV coinfection. Eur J Gastroenterol Hepatol. 2015;27(5):577-584. doi: 10.1097/MEG.0000000000000290 [DOI] [PubMed] [Google Scholar]
17.Beste LA, Leipertz SL, Green PK, Dominitz JA, Ross D, Ioannou GN. Trends in burden of cirrhosis and hepatocellular carcinoma by underlying liver disease in US veterans, 2001-2013. Gastroenterology. 2015;149(6):1471-1482.e5. doi: 10.1053/j.gastro.2015.07.056 [DOI] [PubMed] [Google Scholar]
18.Ioannou GN, Bryson CL, Weiss NS, Miller R, Scott JD, Boyko EJ. The prevalence of cirrhosis and hepatocellular carcinoma in patients with human immunodeficiency virus infection. Hepatology. 2013;57(1):249-257. doi: 10.1002/hep.25800 [DOI] [PubMed] [Google Scholar]
19.Beste LA, Ioannou GN, Larson MS, Chapko M, Dominitz JA. Predictors of early treatment discontinuation among patients with genotype 1 hepatitis C and implications for viral eradication. Clin Gastroenterol Hepatol. 2010;8(11):972-978. doi: 10.1016/j.cgh.2010.07.012 [DOI] [PubMed] [Google Scholar]
20.Ioannou GN, Splan MF, Weiss NS, McDonald GB, Beretta L, Lee SP. Incidence and predictors of hepatocellular carcinoma in patients with cirrhosis. Clin Gastroenterol Hepatol. 2007;5(8):938-945, 945.e1-945.e4. doi: 10.1016/j.cgh.2007.02.039 [DOI] [PubMed] [Google Scholar]
21.Backus LI, Boothroyd DB, Phillips BR, Mole LA. Predictors of response of US veterans to treatment for the hepatitis C virus. Hepatology. 2007;46(1):37-47. doi: 10.1002/hep.21662 [DOI] [PubMed] [Google Scholar]
22.Davila JA, Henderson L, Kramer JR, et al. Utilization of surveillance for hepatocellular carcinoma among hepatitis C virus-infected veterans in the United States. Ann Intern Med. 2011;154(2):85-93. doi: 10.7326/0003-4819-154-2-201101180-00006 [DOI] [PubMed] [Google Scholar]
23.Kanwal F, Hoang T, Kramer JR, et al. Increasing prevalence of HCC and cirrhosis in patients with chronic hepatitis C virus infection. Gastroenterology. 2011;140(4):1182-1188.e1. doi: 10.1053/j.gastro.2010.12.032 [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Kramer JR, Giordano TP, Souchek J, Richardson P, Hwang LY, El-Serag HB. The effect of HIV coinfection on the risk of cirrhosis and hepatocellular carcinoma in U.S. veterans with hepatitis C. Am J Gastroenterol. 2005;100(1):56-63. doi: 10.1111/j.1572-0241.2005.40670.x [DOI] [PubMed] [Google Scholar]
25.Davila JA, Weston A, Smalley W, El-Serag HB. Utilization of screening for hepatocellular carcinoma in the United States. J Clin Gastroenterol. 2007;41(8):777-782. doi: 10.1097/MCG.0b013e3180381560 [DOI] [PubMed] [Google Scholar]
26.Beste LA, Green PK, Berry K, Kogut MJ, Allison SK, Ioannou GN. Effectiveness of hepatitis C antiviral treatment in a USA cohort of veteran patients with hepatocellular carcinoma. J Hepatol. 2017;67(1):32-39. doi: 10.1016/j.jhep.2017.02.027 [DOI] [PMC free article] [PubMed] [Google Scholar]
27.El-Serag HB, Kanwal F, Richardson P, Kramer J. Risk of hepatocellular carcinoma after sustained virological response in Veterans with hepatitis C virus infection. Hepatology. 2016;64(1):130-137. doi: 10.1002/hep.28535 [DOI] [PMC free article] [PubMed] [Google Scholar]
28.White DL, Richardson P, Tayoub N, Davila JA, Kanwal F, El-Serag HB. The updated model: an adjusted serum alpha-fetoprotein-based algorithm for hepatocellular carcinoma detection with hepatitis C virus-related cirrhosis. Gastroenterology. 2015;149(7):1986-1987. doi: 10.1053/j.gastro.2015.10.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
29.El-Serag HB, Kanwal F, Davila JA, Kramer J, Richardson P. A new laboratory-based algorithm to predict development of hepatocellular carcinoma in patients with hepatitis C and cirrhosis. Gastroenterology. 2014;146(5):1249-55.e1. doi: 10.1053/j.gastro.2014.01.045 [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Ioannou GN, Green P, Lowy E, Mun EJ, Berry K. Differences in hepatocellular carcinoma risk, predictors and trends over time according to etiology of cirrhosis. PLoS One. 2018;13(9):e0204412. doi: 10.1371/journal.pone.0204412 [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Yoshida EM, Sulkowski MS, Gane EJ, et al. Concordance of sustained virological response 4, 12, and 24 weeks post-treatment with sofosbuvir-containing regimens for hepatitis C virus. Hepatology. 2015;61(1):41-45. doi: 10.1002/hep.27366 [DOI] [PubMed] [Google Scholar]
32.Tibshirani R. Regression shrinkage and selection via the lasso. J Royal Stat Soc Method. 2011;48(1):267-288. Accessed August 5, 2020. https://statweb.stanford.edu/~tibs/lasso/lasso.pdf [Google Scholar]
33.Cho K, Van Merriënboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, et al. Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv. Preprint posted online June 3, 2014. Accessed July 31, 2020. https://arxiv.org/abs/1406.1078
34.Hornik K, Stinchcombe M, White H. Multilayer feedforward networks are universal approximators. Neural Netw. 1989;2(5):359-366. doi: 10.1016/0893-6080(89)90020-8 [DOI] [Google Scholar]
35.Glorot X, Bordes A, Bengio Y Deep sparse rectifier neural networks. Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics. Published 2011. Accessed July 31, 2020. http://proceedings.mlr.press/v15/glorot11a/glorot11a.pdf
36.Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R.. Dropout: a simple way to prevent neural networks from overfitting. J Machine Learning Res. 2014;15(1):1929-1958. Accessed July 31, 2020. http://jmlr.org/papers/volume15/srivastava14a/srivastava14a.pdf [Google Scholar]
37.Kingma DP, Ba J Adam: A method for stochastic optimization. arXiv. Preprint posted online December 22, 2014. Accessed July 31, 2020. https://arxiv.org/abs/1412.6980
38.Boyd K, Eng KH, Page CD. Area under the precision-recall curve: point estimates and confidence intervals In: Blockeel H, Kersting K, Nijssen S, Železný F, eds. Machine Learning and Knowledge Discovery in Databases. Springer; 2013: 451-466. doi: 10.1007/978-3-642-40994-3_29 [DOI] [Google Scholar]
39.Graf E, Schmoor C, Sauerbrei W, Schumacher M. Assessment and comparison of prognostic classification schemes for survival data. Stat Med. 1999;18(17-18):2529-2545. doi: [DOI] [PubMed] [Google Scholar]
40.Pedregosa F, Varoquaux G, Gramfort A, et al. Scikit-learn: machine learning in Python. J Mach Learn Res. 2011;12:2825-2830. Accessed August 5, 2020. http://www.jmlr.org/papers/volume12/pedregosa11a/pedregosa11a.pdf [Google Scholar]
41.Friedman J, Hastie T, Tibshirani R. Regularization paths for generalized linear models via coordinate descent. J Stat Soft. 2010;33(1):1 Accessed August 5, 2020. https://www.jstatsoft.org/article/view/v033i01 [PMC free article] [PubMed] [Google Scholar]
42.Paszke A, Gross S, Chintala S, et al. Automatic differentiation in pytorch. Accessed August 5, 2020. https://openreview.net/pdf?id=BJJsrmfCZ
43.Tomašev N, Glorot X, Rae JW, et al. A clinically applicable approach to continuous prediction of future acute kidney injury. Nature. 2019;572(7767):116-119. doi: 10.1038/s41586-019-1390-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Wu Y, Schuster M, Chen Z, Le QV, Norouzi M, Macherey W, et al. Google's neural machine translation system: bridging the gap between human and machine translation. arXiv. Preprint posted online October 8, 2016. Accessed July 31, 2020. https://arxiv.org/abs/1609.08144
45.Ioannou GN, Green P, Kerr KF, Berry K. Models estimating risk of hepatocellular carcinoma in patients with alcohol or NAFLD-related cirrhosis for risk stratification. J Hepatol. 2019;71(3):523-533. doi: 10.1016/j.jhep.2019.05.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
46.Sharma SA, Kowgier M, Hansen BE, et al. Toronto HCC risk index: a validated scoring system to predict 10-year risk of HCC in patients with cirrhosis. J Hepatol. 2017;S0168-8278(17)32248-1. doi: 10.1016/j.jhep.2017.07.033 [DOI] [PubMed] [Google Scholar]
47.Davila JA, Morgan RO, Richardson PA, Du XL, McGlynn KA, El-Serag HB. Use of surveillance for hepatocellular carcinoma among patients with cirrhosis in the United States. Hepatology. 2010;52(1):132-141. doi: 10.1002/hep.23615 [DOI] [PMC free article] [PubMed] [Google Scholar]
48.Singal AG, El-Serag HB. Hepatocellular carcinoma from epidemiology to prevention: translating knowledge into practice. Clin Gastroenterol Hepatol. 2015;13(12):2140-2151. doi: 10.1016/j.cgh.2015.08.014 [DOI] [PMC free article] [PubMed] [Google Scholar]
49.Lee JY, Huo EJ, Weinstein S, et al. Evaluation of an abbreviated screening MRI protocol for patients at risk for hepatocellular carcinoma. Abdom Radiol (NY). 2018;43(7):1627-1633. doi: 10.1007/s00261-017-1339-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
50.Tillman BG, Gorman JD, Hru JM, et al. Diagnostic per-lesion performance of a simulated gadoxetate disodium-enhanced abbreviated MRI protocol for hepatocellular carcinoma screening. Clin Radiol. 2018;73(5):485-493. doi: 10.1016/j.crad.2017.11.013 [DOI] [PubMed] [Google Scholar]
51.Marks RM, Ryan A, Heba ER, et al. Diagnostic per-patient accuracy of an abbreviated hepatobiliary phase gadoxetic acid-enhanced MRI for hepatocellular carcinoma surveillance. AJR Am J Roentgenol. 2015;204(3):527-535. doi: 10.2214/AJR.14.12986 [DOI] [PubMed] [Google Scholar]
52.Lundberg SM, Lee S-I A unified approach to interpreting model predictions. Accessed July 31, 2020. https://papers.nips.cc/paper/7062-a-unified-approach-to-interpreting-model-predictions
53.Lundberg SM, Nair B, Vavilala MS, et al. Explainable machine-learning predictions for the prevention of hypoxaemia during surgery. Nat Biomed Eng. 2018;2(10):749-760. doi: 10.1038/s41551-018-0304-0 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement.

eTable 1. ICD-10 and ICD-9 Codes for Cirrhosis, Decompensated Cirrhosis, and HCC

eTable 2. Feature Selection Frequency Over 10 Splits for the Longitudinal LR With Lasso Penalty

eTable 3. Comparison of the Performance Characteristics of 3 Different Models Predicting the Development of HCC Within 3 years in Male Patients with HCV-Related Cirrhosis

Click here for additional data file.^{(1.1MB, pdf)}

[zoi200578r1] 1.Morgan RL, Baack B, Smith BD, Yartel A, Pitasi M, Falck-Ytter Y. Eradication of hepatitis C virus infection and the development of hepatocellular carcinoma: a meta-analysis of observational studies. Ann Intern Med. 2013;158(5 Pt 1):329-337. doi: 10.7326/0003-4819-158-5-201303050-00005 [DOI] [PubMed] [Google Scholar]

[zoi200578r2] 2.Singal AK, Singh A, Jaganmohan S, et al. Antiviral therapy reduces risk of hepatocellular carcinoma in patients with hepatitis C virus-related cirrhosis. Clin Gastroenterol Hepatol. 2010;8(2):192-199. doi: 10.1016/j.cgh.2009.10.026 [DOI] [PubMed] [Google Scholar]

[zoi200578r3] 3.Singer AW, Reddy KR, Telep LE, et al. Direct-acting antiviral treatment for hepatitis C virus infection and risk of incident liver cancer: a retrospective cohort study. Aliment Pharmacol Ther. 2018;47(9):1278-1287. doi: 10.1111/apt.14593 [DOI] [PubMed] [Google Scholar]

[zoi200578r4] 4.Ioannou GN, Green PK, Berry K. HCV eradication induced by direct-acting antiviral agents reduces the risk of hepatocellular carcinoma. J Hepatol. 2017;S0168-8278(17)32273-0. doi: 10.1016/j.jhep.2017.08.030 [DOI] [PMC free article] [PubMed] [Google Scholar]

[zoi200578r5] 5.Kanwal F, Kramer J, Asch SM, Chayanupatkul M, Cao Y, El-Serag HB. Risk of hepatocellular cancer in HCV patients treated with direct-acting antiviral agents. Gastroenterology. 2017;153(4):996-1005.e1. doi: 10.1053/j.gastro.2017.06.012 [DOI] [PubMed] [Google Scholar]

[zoi200578r6] 6.Ioannou GN, Green PK, Beste LA, Mun EJ, Kerr KF, Berry K. Development of models estimating the risk of hepatocellular carcinoma after antiviral treatment for hepatitis C. J Hepatol. 2018;69(5):1088-1098. doi: 10.1016/j.jhep.2018.07.024 [DOI] [PMC free article] [PubMed] [Google Scholar]

[zoi200578r7] 7.Shickel B, Tighe PJ, Bihorac A, Rashidi P. Deep EHR: a survey of recent advances in deep learning techniques for electronic health record (EHR) analysis. IEEE J Biomed Health Inform. 2018;22(5):1589-1604. doi: 10.1109/JBHI.2017.2767063 [DOI] [PMC free article] [PubMed] [Google Scholar]

[zoi200578r8] 8.Gulshan V, Peng L, Coram M, et al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA. 2016;316(22):2402-2410. doi: 10.1001/jama.2016.17216 [DOI] [PubMed] [Google Scholar]

[zoi200578r9] 9.Che Z, Cheng Y, Zhai S, Sun Z, Liu Y. Boosting deep learning risk prediction with generative adversarial networks for electronic health records. arXiv. Preprint posted online September 6, 2017. Accessed July 30, 2020. https://arxiv.org/abs/1709.01648 [Google Scholar]

[zoi200578r10] 10.Xiao C, Choi E, Sun J. Opportunities and challenges in developing deep learning models using electronic health records data: a systematic review. J Am Med Inform Assoc. 2018;25(10):1419-1428. doi: 10.1093/jamia/ocy068 [DOI] [PMC free article] [PubMed] [Google Scholar]

[zoi200578r11] 11.Moon AM, Green PK, Berry K, Ioannou GN. Transformation of hepatitis C antiviral treatment in a national healthcare system following the introduction of direct antiviral agents. Aliment Pharmacol Ther. 2017;45(9):1201-1212. doi: 10.1111/apt.14021 [DOI] [PMC free article] [PubMed] [Google Scholar]

[zoi200578r12] 12.US Department of Veterans Affairs Corporate Data Warehouse. Accessed December 19, 2016. https://www.hsrd.research.va.gov/for_researchers/vinci/cdw.cfm

[zoi200578r13] 13.Moon AM, Dominitz JA, Ioannou GN, Lowy E, Beste LA. Use of antibiotics among patients with cirrhosis and upper gastrointestinal bleeding is associated with reduced mortality. Clin Gastroenterol Hepatol. 2016;14(11):1629-1637.e1. doi: 10.1016/j.cgh.2016.05.040 [DOI] [PubMed] [Google Scholar]

[zoi200578r14] 14.Lerrigo R, Beste LA, Leipertz SL, et al. Characteristics and outcomes of transjugular intrahepatic portosystemic shunt recipients in the VA Healthcare System. Eur J Gastroenterol Hepatol. 2016;28(6):667-675. doi: 10.1097/MEG.0000000000000604 [DOI] [PubMed] [Google Scholar]

[zoi200578r15] 15.Ioannou GN, Beste LA, Chang MF, et al. Effectiveness of sofosbuvir, ledipasvir/sofosbuvir, or paritaprevir/ritonavir/ombitasvir and dasabuvir regimens for treatment of patients with hepatitis C in the Veterans Affairs National Health Care System. Gastroenterology. 2016;151(3):457-471.e5. doi: 10.1053/j.gastro.2016.05.049 [DOI] [PMC free article] [PubMed] [Google Scholar]

[zoi200578r16] 16.Ioannou GN, Bryson CL, Weiss NS, Boyko EJ. Associations between lipodystrophy or antiretroviral medications and cirrhosis in patients with HIV infection or HIV/HCV coinfection. Eur J Gastroenterol Hepatol. 2015;27(5):577-584. doi: 10.1097/MEG.0000000000000290 [DOI] [PubMed] [Google Scholar]

[zoi200578r17] 17.Beste LA, Leipertz SL, Green PK, Dominitz JA, Ross D, Ioannou GN. Trends in burden of cirrhosis and hepatocellular carcinoma by underlying liver disease in US veterans, 2001-2013. Gastroenterology. 2015;149(6):1471-1482.e5. doi: 10.1053/j.gastro.2015.07.056 [DOI] [PubMed] [Google Scholar]

[zoi200578r18] 18.Ioannou GN, Bryson CL, Weiss NS, Miller R, Scott JD, Boyko EJ. The prevalence of cirrhosis and hepatocellular carcinoma in patients with human immunodeficiency virus infection. Hepatology. 2013;57(1):249-257. doi: 10.1002/hep.25800 [DOI] [PubMed] [Google Scholar]

[zoi200578r19] 19.Beste LA, Ioannou GN, Larson MS, Chapko M, Dominitz JA. Predictors of early treatment discontinuation among patients with genotype 1 hepatitis C and implications for viral eradication. Clin Gastroenterol Hepatol. 2010;8(11):972-978. doi: 10.1016/j.cgh.2010.07.012 [DOI] [PubMed] [Google Scholar]

[zoi200578r20] 20.Ioannou GN, Splan MF, Weiss NS, McDonald GB, Beretta L, Lee SP. Incidence and predictors of hepatocellular carcinoma in patients with cirrhosis. Clin Gastroenterol Hepatol. 2007;5(8):938-945, 945.e1-945.e4. doi: 10.1016/j.cgh.2007.02.039 [DOI] [PubMed] [Google Scholar]

[zoi200578r21] 21.Backus LI, Boothroyd DB, Phillips BR, Mole LA. Predictors of response of US veterans to treatment for the hepatitis C virus. Hepatology. 2007;46(1):37-47. doi: 10.1002/hep.21662 [DOI] [PubMed] [Google Scholar]

[zoi200578r22] 22.Davila JA, Henderson L, Kramer JR, et al. Utilization of surveillance for hepatocellular carcinoma among hepatitis C virus-infected veterans in the United States. Ann Intern Med. 2011;154(2):85-93. doi: 10.7326/0003-4819-154-2-201101180-00006 [DOI] [PubMed] [Google Scholar]

[zoi200578r23] 23.Kanwal F, Hoang T, Kramer JR, et al. Increasing prevalence of HCC and cirrhosis in patients with chronic hepatitis C virus infection. Gastroenterology. 2011;140(4):1182-1188.e1. doi: 10.1053/j.gastro.2010.12.032 [DOI] [PMC free article] [PubMed] [Google Scholar]

[zoi200578r24] 24.Kramer JR, Giordano TP, Souchek J, Richardson P, Hwang LY, El-Serag HB. The effect of HIV coinfection on the risk of cirrhosis and hepatocellular carcinoma in U.S. veterans with hepatitis C. Am J Gastroenterol. 2005;100(1):56-63. doi: 10.1111/j.1572-0241.2005.40670.x [DOI] [PubMed] [Google Scholar]

[zoi200578r25] 25.Davila JA, Weston A, Smalley W, El-Serag HB. Utilization of screening for hepatocellular carcinoma in the United States. J Clin Gastroenterol. 2007;41(8):777-782. doi: 10.1097/MCG.0b013e3180381560 [DOI] [PubMed] [Google Scholar]

[zoi200578r26] 26.Beste LA, Green PK, Berry K, Kogut MJ, Allison SK, Ioannou GN. Effectiveness of hepatitis C antiviral treatment in a USA cohort of veteran patients with hepatocellular carcinoma. J Hepatol. 2017;67(1):32-39. doi: 10.1016/j.jhep.2017.02.027 [DOI] [PMC free article] [PubMed] [Google Scholar]

[zoi200578r27] 27.El-Serag HB, Kanwal F, Richardson P, Kramer J. Risk of hepatocellular carcinoma after sustained virological response in Veterans with hepatitis C virus infection. Hepatology. 2016;64(1):130-137. doi: 10.1002/hep.28535 [DOI] [PMC free article] [PubMed] [Google Scholar]

[zoi200578r28] 28.White DL, Richardson P, Tayoub N, Davila JA, Kanwal F, El-Serag HB. The updated model: an adjusted serum alpha-fetoprotein-based algorithm for hepatocellular carcinoma detection with hepatitis C virus-related cirrhosis. Gastroenterology. 2015;149(7):1986-1987. doi: 10.1053/j.gastro.2015.10.004 [DOI] [PMC free article] [PubMed] [Google Scholar]

[zoi200578r29] 29.El-Serag HB, Kanwal F, Davila JA, Kramer J, Richardson P. A new laboratory-based algorithm to predict development of hepatocellular carcinoma in patients with hepatitis C and cirrhosis. Gastroenterology. 2014;146(5):1249-55.e1. doi: 10.1053/j.gastro.2014.01.045 [DOI] [PMC free article] [PubMed] [Google Scholar]

[zoi200578r30] 30.Ioannou GN, Green P, Lowy E, Mun EJ, Berry K. Differences in hepatocellular carcinoma risk, predictors and trends over time according to etiology of cirrhosis. PLoS One. 2018;13(9):e0204412. doi: 10.1371/journal.pone.0204412 [DOI] [PMC free article] [PubMed] [Google Scholar]

[zoi200578r31] 31.Yoshida EM, Sulkowski MS, Gane EJ, et al. Concordance of sustained virological response 4, 12, and 24 weeks post-treatment with sofosbuvir-containing regimens for hepatitis C virus. Hepatology. 2015;61(1):41-45. doi: 10.1002/hep.27366 [DOI] [PubMed] [Google Scholar]

[zoi200578r32] 32.Tibshirani R. Regression shrinkage and selection via the lasso. J Royal Stat Soc Method. 2011;48(1):267-288. Accessed August 5, 2020. https://statweb.stanford.edu/~tibs/lasso/lasso.pdf [Google Scholar]

[zoi200578r33] 33.Cho K, Van Merriënboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, et al. Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv. Preprint posted online June 3, 2014. Accessed July 31, 2020. https://arxiv.org/abs/1406.1078

[zoi200578r34] 34.Hornik K, Stinchcombe M, White H. Multilayer feedforward networks are universal approximators. Neural Netw. 1989;2(5):359-366. doi: 10.1016/0893-6080(89)90020-8 [DOI] [Google Scholar]

[zoi200578r35] 35.Glorot X, Bordes A, Bengio Y Deep sparse rectifier neural networks. Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics. Published 2011. Accessed July 31, 2020. http://proceedings.mlr.press/v15/glorot11a/glorot11a.pdf

[zoi200578r36] 36.Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R.. Dropout: a simple way to prevent neural networks from overfitting. J Machine Learning Res. 2014;15(1):1929-1958. Accessed July 31, 2020. http://jmlr.org/papers/volume15/srivastava14a/srivastava14a.pdf [Google Scholar]

[zoi200578r37] 37.Kingma DP, Ba J Adam: A method for stochastic optimization. arXiv. Preprint posted online December 22, 2014. Accessed July 31, 2020. https://arxiv.org/abs/1412.6980

[zoi200578r38] 38.Boyd K, Eng KH, Page CD. Area under the precision-recall curve: point estimates and confidence intervals In: Blockeel H, Kersting K, Nijssen S, Železný F, eds. Machine Learning and Knowledge Discovery in Databases. Springer; 2013: 451-466. doi: 10.1007/978-3-642-40994-3_29 [DOI] [Google Scholar]

[zoi200578r39] 39.Graf E, Schmoor C, Sauerbrei W, Schumacher M. Assessment and comparison of prognostic classification schemes for survival data. Stat Med. 1999;18(17-18):2529-2545. doi: [DOI] [PubMed] [Google Scholar]

[zoi200578r40] 40.Pedregosa F, Varoquaux G, Gramfort A, et al. Scikit-learn: machine learning in Python. J Mach Learn Res. 2011;12:2825-2830. Accessed August 5, 2020. http://www.jmlr.org/papers/volume12/pedregosa11a/pedregosa11a.pdf [Google Scholar]

[zoi200578r41] 41.Friedman J, Hastie T, Tibshirani R. Regularization paths for generalized linear models via coordinate descent. J Stat Soft. 2010;33(1):1 Accessed August 5, 2020. https://www.jstatsoft.org/article/view/v033i01 [PMC free article] [PubMed] [Google Scholar]

[zoi200578r42] 42.Paszke A, Gross S, Chintala S, et al. Automatic differentiation in pytorch. Accessed August 5, 2020. https://openreview.net/pdf?id=BJJsrmfCZ

[zoi200578r43] 43.Tomašev N, Glorot X, Rae JW, et al. A clinically applicable approach to continuous prediction of future acute kidney injury. Nature. 2019;572(7767):116-119. doi: 10.1038/s41586-019-1390-1 [DOI] [PMC free article] [PubMed] [Google Scholar]

[zoi200578r44] 44.Wu Y, Schuster M, Chen Z, Le QV, Norouzi M, Macherey W, et al. Google's neural machine translation system: bridging the gap between human and machine translation. arXiv. Preprint posted online October 8, 2016. Accessed July 31, 2020. https://arxiv.org/abs/1609.08144

[zoi200578r45] 45.Ioannou GN, Green P, Kerr KF, Berry K. Models estimating risk of hepatocellular carcinoma in patients with alcohol or NAFLD-related cirrhosis for risk stratification. J Hepatol. 2019;71(3):523-533. doi: 10.1016/j.jhep.2019.05.008 [DOI] [PMC free article] [PubMed] [Google Scholar]

[zoi200578r46] 46.Sharma SA, Kowgier M, Hansen BE, et al. Toronto HCC risk index: a validated scoring system to predict 10-year risk of HCC in patients with cirrhosis. J Hepatol. 2017;S0168-8278(17)32248-1. doi: 10.1016/j.jhep.2017.07.033 [DOI] [PubMed] [Google Scholar]

[zoi200578r47] 47.Davila JA, Morgan RO, Richardson PA, Du XL, McGlynn KA, El-Serag HB. Use of surveillance for hepatocellular carcinoma among patients with cirrhosis in the United States. Hepatology. 2010;52(1):132-141. doi: 10.1002/hep.23615 [DOI] [PMC free article] [PubMed] [Google Scholar]

[zoi200578r48] 48.Singal AG, El-Serag HB. Hepatocellular carcinoma from epidemiology to prevention: translating knowledge into practice. Clin Gastroenterol Hepatol. 2015;13(12):2140-2151. doi: 10.1016/j.cgh.2015.08.014 [DOI] [PMC free article] [PubMed] [Google Scholar]

[zoi200578r49] 49.Lee JY, Huo EJ, Weinstein S, et al. Evaluation of an abbreviated screening MRI protocol for patients at risk for hepatocellular carcinoma. Abdom Radiol (NY). 2018;43(7):1627-1633. doi: 10.1007/s00261-017-1339-5 [DOI] [PMC free article] [PubMed] [Google Scholar]

[zoi200578r50] 50.Tillman BG, Gorman JD, Hru JM, et al. Diagnostic per-lesion performance of a simulated gadoxetate disodium-enhanced abbreviated MRI protocol for hepatocellular carcinoma screening. Clin Radiol. 2018;73(5):485-493. doi: 10.1016/j.crad.2017.11.013 [DOI] [PubMed] [Google Scholar]

[zoi200578r51] 51.Marks RM, Ryan A, Heba ER, et al. Diagnostic per-patient accuracy of an abbreviated hepatobiliary phase gadoxetic acid-enhanced MRI for hepatocellular carcinoma surveillance. AJR Am J Roentgenol. 2015;204(3):527-535. doi: 10.2214/AJR.14.12986 [DOI] [PubMed] [Google Scholar]

[zoi200578r52] 52.Lundberg SM, Lee S-I A unified approach to interpreting model predictions. Accessed July 31, 2020. https://papers.nips.cc/paper/7062-a-unified-approach-to-interpreting-model-predictions

[zoi200578r53] 53.Lundberg SM, Nair B, Vavilala MS, et al. Explainable machine-learning predictions for the prevention of hypoxaemia during surgery. Nat Biomed Eng. 2018;2(10):749-760. doi: 10.1038/s41551-018-0304-0 [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Assessment of a Deep Learning Model to Predict Hepatocellular Carcinoma in Patients With Hepatitis C Cirrhosis

George N Ioannou, MD

Weijing Tang, MA

Lauren A Beste, MD, MSc

Monica A Tincopa, MD

Grace L Su, MD

Tony Van, MS

Elliot B Tapper, MD

Amit G Singal, MD

Ji Zhu, PhD

Akbar K Waljee, MD, MSc

Key Points

Question

Findings

Meaning

Abstract

Importance

Objective

Design, Setting, and Participants

Exposures

Main Outcomes and Measures

Results

Conclusions and Relevance

Introduction

Methods

Data Source

Study Population

Diagnosis of HCC

Predictor Variables

Rationale for Cases and Controls Used in Model Building

Figure 1. Schematics of Case and Control Definitions and Models Developed to Predict HCC Development.

Cases

Controls

Model Building and Feature Extraction

Statistical Analysis

Results

Characteristics of the Population Used for Model Building

Table 1. Characteristics of Controls and Cases Used in Model Building at the Sampled Visit.

Model Performance Among All Samples

Table 2. Comparison of the Performance Characteristics of 3 Different Models Predicting the Development of Hepatocellular Carcinoma Within 3 Years in Patients With Hepatitis C Virus–Related Cirrhosis.

Figure 2. Receiver Operating Characteristic Curves for 3 Prediction Models.

Prioritizing Patients for HCC Screening Outreach Interventions

Model Calibration

Table 3. Comparison of Predicted and Observed 3-Year HCC Risk in the Study Population Divided Into Tertiles According to Each Model Under 1 Representative Splittinga.

Model Performance Among Samples From Patients Who Achieved SVR and Men

Discussion

Limitations

Conclusions

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Table 3. Comparison of Predicted and Observed 3-Year HCC Risk in the Study Population Divided Into Tertiles According to Each Model Under 1 Representative Splitting^a.