Abstract
Background.
Inflammatory and metabolic biomarkers have been associated with HCC risk in phase I and II biomarker studies. We developed a robust metabolic biomarker panel predictive of HCC in a longitudinal phase III study.
Methods.
We used data and banked serum from a prospective cohort of 2266 adult patients with cirrhosis who were followed until the development of HCC (n=126). We custom designed a FirePlex immunoassay to measure baseline serum levels of 39 biomarkers and established a set of biomarkers with the highest discriminatory ability for HCC. We performed bootstrapping to evaluate the predictive performance using C-index and time-dependent area under the receiver operating characteristic curve (AUROC). We quantified the incremental predictive value of the biomarker panel when added to previously validated clinical models.
Results.
We identified a 9-biomarker panel (P9) with a C-index of 0.67 (95% CI, 0.66, 0.67), including insulin growth factor-1, interleukin-10, transforming growth factor β1, adipsin, fetuin-A, interleukin-1 β, macrophage stimulating protein α chain, serum amyloid A, and TNF-α. Adding P9 to our clinical model with 10 factors including AFP improved AUROC at 1 and 2 years by 4.8% and 2.7%, respectively. Adding P9 to aMAP score improved AUROC at 1 and 2 years by 14.2% and 7.6%, respectively. Adding AFP L-3 or DCP did not change the predictive ability of the P9 model.
Conclusions.
We identified a panel of 9 serum biomarkers that is independently associated with developing HCC in cirrhosis, and that improved the predictive ability of risk stratification models containing clinical factors.
Keywords: Risk stratification, epidemiology, hepatitis C, nonalcoholic fatty liver disease, alcohol
Background
The relative risk of HCC in patients with cirrhosis is considerably higher than in the general population. However, the absolute risk of HCC as expressed by annual incidence is only 1%−3%, and most patients with cirrhosis do not develop HCC.[1–3] Furthermore, cirrhosis related to alcoholic and metabolic dysfunction associated steatotic liver disease (MASLD) are becoming more common and have lower annual HCC rate than cirrhosis related to hepatitis C or B virus infections.[1, 4] Therefore, predicting the small subgroup of patients with cirrhosis who will develop HCC has become more difficult in contemporary clinical practice, and one that is not addressed by existing biomarkers and risk stratification tools.[5]
Risk stratification of HCC among patients with cirrhosis can allow for a more precise prognostication, targeted prevention and early detection that is different from the one size fits all approach of current practices. For example, instead of HCC surveillance offered to all patients with cirrhosis, surveillance could be intensified or limited to those with high absolute HCC risk, while patients with low risk can be spared from the repetitive, costly, and occasionally harmful routine testing. High risk groups could also be targeted for enrollment in chemoprevention trials facilitating the feasibility and cost-effectiveness of these initiatives.[6]
To address the gap in HCC risk prediction in cirrhosis, we previously developed and validated an HCC risk stratification model, which contains demographic (age, and sex), clinical (liver disease etiology [HCV vs. other etiologies], alanine aminotransferase, platelet count, albumin, alpha-fetoprotein) and lifestyle features (alcohol use, smoking, body mass index) (See Supplementary Materials for more details). This predictive model demonstrated moderate discriminatory ability with area under the receiver operating characteristic curve (AUROC) of 0.75 (95% CI = 0.65–0.85) and 0.77 (95% CI = 0.71–0.83) for predicting HCC at 1-year and 2-year follow-up, respectively.[7] Similarly, the aMAP score (age, male, albumin-bilirubin, and platelets) was reported to achieve an overall C-index of 0.81 (95% CI: 0.79–0.82)[8–9] in patients with chronic hepatitis. These basic models are good foundations for HCC risk prediction in cirrhosis, but their current modest performance may not be sufficient for clinical use. We hypothesize that we can improve the predictive performance of the model by adding objective and reliable variables from multiple domains that reflect distinct aspects of cirrhosis progression to HCC.
Multiple biological pathways are affected in patients with cirrhosis who develop HCC.[10] Several human studies have examined the role of biomarkers of metabolic dysfunction including insulin resistance (e.g., proinsulin, insulin, insulin growth factor [IGF-1], insulin growth factor binding protein 1 [IGF-BP1]) and adipocytokines (e.g., leptin, adiponectin, resistin, adipsin); inflammation including proinflammatory cytokines (e.g., tumor necrosis factor [TNF], latent ttransforming growth factor beta 1 [TGF β1], interleukin-1 beta [IL-1β]), acute phase reactants (e.g., c reactive protein), and anti-inflammatory cytokines (e.g., several other interleukins). Several of these biomarkers were reported to be significantly associated with HCC,[11–16] and therefore can potentially be useful for HCC risk stratification or early detection in patients with cirrhosis. However, most of these studies were phase I or II biomarker studies (cross-sectional or case-control in design), examined one or few biomarkers, and did not adjust for demographic and clinical confounders. A key next step in biomarker validation for HCC is to examine their performance in a phase III biomarker study including a well-defined cirrhosis population that is eligible for surveillance.[17]
In this study, we used samples and data from our longitudinal multicenter prospective cohort of patients with cirrhosis, the Texas HCC Consortium (THCCC) to examine the association of several circulating serum biomarkers that were previously linked with the risk of developing HCC. We also sought to develop a robust metabolic biomarker panel predictive of HCC and test the incremental value of this panel compared to our base HCC prediction model of demographic, clinical, and lifestyle factors and the aMAP score.
Methods
Study Cohort
We used data from the THCCC. Adult patients with cirrhosis were prospectively recruited at eight liver clinics in four cities (Michael E. DeBakey VA Medical Center and Baylor St. Luke’s Medical Center in Houston; University of Texas Southwestern, Parkland Health & Hospital System, and two clinical sites within Baylor Scott & White Research Institute in Dallas/Fort Worth; Doctor’s Hospital at Renaissance in McAllen; and Texas Liver Institute in San Antonio). THCCC recruitment and follow-up began in December 2016 and are still ongoing.[4, 18] Blood samples, collected at each study visit, were allowed to clot and then centrifuged at the collecting site according to a standardized protocol. Serum aliquots were frozen locally at −80°C and batch shipped to the central laboratory for long-term storage. We analyzed samples from 2266 unique subjects recruited between December 2016 and August 2021 who were followed through December 31, 2022. The research was conducted in accordance with both the Declarations of Helsinki and Istanbul. Research was approved by the Baylor College of Medicine IRB, and written consent was given by all patients to conduct this research. More details on recruitment, follow-up, and data/specimen collection are provided in Supplement 1.
We followed the PRoBE (Prospective-specimen collection, retrospective-blinded-evaluation) guidance for conducting a phase III biomarker study,[19] and the TRIPOD guidelines for reporting our results.[20–21] Cirrhosis diagnosis was based on predefined criteria for liver histology, radiology, or elastography, or serum biomarkers.[7] Patients with uncontrolled hepatic decompensation, history of HCC, or non-hepatic cancers were excluded. All participants had a negative liver imaging at baseline. Our primary outcome was incident HCC, defined as tumors occurring at least 1 month after index visit to minimize risk of prevalent HCC but we also conducted a sensitivity analysis excluding patients diagnosed with HCC within the first 6 months of follow up. HCC was defined according to AASLD criteria[22] including histological or radiological diagnosis using characteristic appearance (arterial enhancement and delayed washout of contrast) on triple phase CT or MRI (LI-RAD 5 or 4) that were reviewed in multidisciplinary tumor boards and treated as HCC.
Biomarkers and Custom Multiplex Immunoassays
We reviewed the published literature through 2020 to identify biomarkers of inflammation, oncogeneosis or metabolic dysfunction reported to be significant or near significant predictors of HCC risk in human studies. Serum samples collected from patients with cirrhosis at the time of enrollment in THCCC and before HCC diagnosis were evaluated using the custom Fireplex platform for circulating levels of our analytes of interest (Supplemental Table 1). The FirePlex platform utilizes bio-inert cross linked polyethylene glycol hydrogel particles. These hydrogels are comprised of two regions that encode the identity of the target and one that determines the amount of target present. By quantifying the fluorescent intensity in the two encoding regions, the identity of a target was determined using an integrated FirePlex Analysis Workbench software http://www.abcam.com/FireflyAnalysisSoftware. We used commercial serum to initially screen and identify antibody pairs for each target of interest. We performed bioinformatics of cross-reactivity by screening analyte sequences for high homology against the human proteome. The specificity of the assays was confirmed within sub-panels using commercial proteins. Based on our literature review, we initially identified 41 potential targets (Supplemental Table 1) but excluded chemerin and Agouti-related protein (AgRP) because we were unable to identify antibody pair to confirm the specificity against these analytes. We then used 36 human serum samples (from patients with cirrhosis in THCCC) to evaluate assay performance including intra- and inter-assay coefficient variances, assay sensitivity and dynamic range for each analyte within our multiplex panels. Each sample was evaluated in duplicates and at 1:4, 1:200, and 1:200,000 dilutions. Separately, we assayed AFP L-3 and DCP using microchip capillary electrophoresis and liquid-phase binding assay on an uTASWako i30 Immunoanalyzer System (FujiFilm IVD HCU Lab in Richmond, VA, USA).
Statistical Methods
We used two methods to select biomarker panels that are predictive of HCC occurrence: the stepwise regression, and the least absolute shrinkage and selection operator (Lasso). The stepwise regression selects a panel from individual biomarkers based on their statistical significance with HCC in the model. For this analysis, we selected metabolic panels based on two p-values cutoffs of <0.05 and <0.2. Lasso imposes penalty terms during model fitting to shrink the less contributive biomarkers with smaller coefficients, resulting in a reduced panel with biomarkers having non-zero coefficients; this approach not only enhances prediction accuracy but also mitigates the risk of overfitting. In alignment with the framework proposed by Riley et al,[20] considering a targeted shrinkage of less than 10% as recommended, and working with a set of 36 variables, the sample size of our cohort was adequate to achieve an anticipated Cox-Snell R squared statistic of at least 0.15.
We constructed Cox Proportional Hazard models to calculate the C-index and the AUROC across various evaluation windows (i.e., time dependent AUROC). The C-index allowed us to evaluate the model discriminatory performance for HCC overall and time dependent AUROC to evaluate the model performance for HCC at 1- and 2-years of follow-up. We used the risk Regression package to estimate the time dependent AUROC.[23] The internal validation was conducted by bootstrapping, which provides nearly unbiased estimates of model performance compared to split-sample (e.g., cross validation) methods.[24] Specifically, we constructed 200 bootstrap datasets by resampling the original data with replacement, thus effectively getting bias-corrected C-index and time dependent AUROC to approximate how the panel will perform on future data. The calibration of the model was assessed visually using calibration curves. To estimate the association of the metabolic panels with HCC risk, using the original data, we calculated hazard ratios with their 95% confidence intervals from the Fine-Gray competing risk model that accounted for the competing risks of liver transplantation and death.[25] The discriminatory ability was further examined by comparing the cumulative incidence function (CIF) curves between high-risk and low-risk groups, based on being above or below the median of derived risk scores, respectively.
The incremental predictive value of biomarker panel score was evaluated by comparing predictive performance to our previously validated base model. This model was developed using two prospective cirrhosis cohort studies including THCCC[7] and externally validated in a separate prospective VA cohort. The computation of the risk index by the base model is: 0.0399∗age + 0.5617∗gender(male=1) + 0.9023∗log10 AFP + (−1.2631)∗log10 platelets + (– 0.3357)∗log10 ALT + (–0.5859)∗albumin + 0.0252∗BMI + 0.2816∗smoking(past/current=1) + 0.2446∗alcohol(current heavy=1) + (0.3596)∗alcohol(other=1) + 0.4611∗etiology(HCV active/cured=1). To do this, we used the bootstrap method to assess whether differences in model performance (C index, time dependent AUROC) with and without metabolic panel scores were statistically significant.[26]
Last, we examined the clinical utility by estimating the number of cases that would be correctly reclassified from false positives using the base model to true negatives when adding the biomarker signature to the model.[23]
We performed several secondary analyses to estimate the C index and time dependent AUROC in (1) two separate groups, stratified by underlying cirrhosis etiology (viral hepatitis versus non-viral hepatitis); (2) after excluding patients with active HCV; and (3) adding DCP, AFP L-3 or both to the models containing base model combined with the biomarker panels. We also examined the incremental predictive value of adding biomarker panel score to another previously developed predictive model, the aMAP risk score, using the published equation and parameter estimates (aMAP risk score = [{ (0.06)*age +0.89*sex(Male:1, Female:0) + 0.48*[(log10 bilirubin * 0.66) + albumin*(−0.085)]-0.01*platelets}+7.4]/14.77*100 only, the average of 200 bootstrapping C-index is 0.651 (95% 0.647–0.655)).[8]
Results
We analyzed data from 2266 patients with cirrhosis, of whom 126 developed HCC during follow up of 39.9 months. The HCC annual incidence rate was 16.6 per 1,000 person (95% Ci, 13.8 to 19.8). During follow up, there were 257 deaths (11.3%), 59 (2.6%) liver transplant, and 450 (19.9%) dropped out (diagnosis of non-HCC primary cancer, withdrawal of consent, or a loss to follow-up). The proportion censored by 12/31/2022 was 60.6% (n=1374).
The median duration between the date of serum metabolic biomarker measurement and the HCC date was 24.9 months (IQR 26.6 months). The mean age of the cohort was 59.6 years (standard deviation 10.3), and 37.6% were women. The racial/ethnic distribution was 1158 (51.1%) non-Hispanic White, 700 (30.9%) Hispanic, 360 (15.9%) African American, and 48 (2.1%) belonged to other groups. At the time of enrollment, the underlying risk factors for cirrhosis were alcohol-related liver disease in 394 (17.4%), MASLD in 733 (32.3%), active HCV in 301 (13.3%), cured HCV in 534 (23.6%), HBV in 30 (1.3%) and miscellaneous causes in the rest. Most patients had diabetes (46.7%), overweight (82.9%) or obesity (51.2%). All 39 biomarkers were measured in all patients; however, we excluded free TGF-β1 from the analysis because of missing values in more than 75% of cases.
The panel selected by the stepwise selection regression with a p <0.05 included 3 biomarkers (IGF-1, latent TGF-β1 and IL-10), while the panel selected with a p <0.2 had 9 biomarkers that were inclusive of the previous three (IGF 1, latent TGF β1, macrophage stimulating protein alpha chain [MSPa], TNF-α, adipsin, IL-1β, IL-10, serum amyloid A [SAA] and fetuin-A). The panel selected by the Lasso consisted of 5-biomarker (IGF-1, latent TGF-β1, IL-10, IL-1β, TNF α), which included the 3-biomarker panel described above and were all contained within the 9-biomarker panel identified based on the p <0.2 (Supplemental Table 2). The 9-biomarker panel had the highest C-index of 0.736 for predicting HCC in Cox regression model compared to the 3- and 5-biomarker panels, which were not significantly different from each other (C-index=0.700 and 0.703, respectively; data not shown). Therefore, we evaluated the 3-biomarkers (P3) and 9-biomarker (P9) panels further.
There were significant differences in baseline serum levels of the biomarkers according to underlying etiology of liver disease. Fetuin-A, latent TGF-β1, TNF-α were higher in active HCV, MSPa in cured HCV and IGF-1 and DCP in non-viral hepatitis than the other two groups (Table 1). There were also significant differences in baseline levels of several biomarkers (TGF-α, IGF-1, MSP-α, AFP L-3) between women and men. Supplemental Table 2 shows the baseline serum levels of the 9 biomarkers in P9 as well as DCP and AFP L-3 in patients with cirrhosis who subsequently developed HCC and those who did not as well as the univariate risk estimates, hazard ratios [HR], for the association between baseline levels and future risk of developing HCC. Compared to patients who did not progress, serum levels of IL-10 and TNF-α were higher whereas the rest of the 7 biomarkers were lower in patients who progressed to HCC. Both AFP L-3 and DCP were higher at baseline in those who developed HCC.
Table 1.
Baseline serum levels of P9 biomarkers as well as AFP L-3 and DCP in patients with cirrhosis stratified by underlying etiology and gender.
Active HCV (mean, SD) | HCV with SVR (mean, SD) | Non-viral hep (mean, SD) | P value (ANOVA test) | Men mean, SD) | Women mean, SD) | P value | |
---|---|---|---|---|---|---|---|
adipsin | 2.56E+06 (2.02E+06) | 2.89E+06 (3.04E+06) | 2.99E+06 (2.26E+06) | 0.048 | 2.73E+06 (2.56E+06) | 2.56E+06 (2.22E+06) | 0.096 |
fetuin-A | 1.07E+09 (0.55E+09) | 1.04E+09 (0.53E+09) | 0.95E+09 (0.51E+09) | <0.001 | 1.00E+09 (0.54E+09) | 0.97E+09 (0.50E+09) | 0.165 |
transforming growth factor beta 1 | 3.56E+04 (1.86E+04) | 3.44E+04 (1.74E+04) | 3.08E+04 (1.80E+04) | <0.001 | 3.36E+04 (1.91E+04) | 3.00E+04 (1.59E+04) | <0.001 |
insulin growth factor 1 | 1917.3 (1362.8) | 1900.2 (1152.7) | 2284.6 (1504.9) | <0.001 | 1931.5 (1280.9) | 2500.6 (1567.1) | <0.001 |
interleukin-1 beta | 3.9 (5.0) | 3.6 (1.8) | 3.6 (2.2) | 0.231 | 3.6 (2.9) | 3.6 (2.4) | 0.461 |
interleukin 10 | 21.9 (14.5) | 18.8 (7.8) | 22.4 (49.0) | 0.204 | 20.4 (11.7) | 23.3 (62.6) | 0.180 |
macrophage stimulating protein alpha chain | 7.20E+06 (4.06E+06) | 8.48E+06 (4.33E+06) | 7.11E+06 (3.76E+06) | <0.001 | 7.67E+06 (4.12E+06) | 7.07E+06 (3.71E+06) | 0.004 |
serum amyloid A | 5.11E+05 (5.83E+05) | 6.04E+05 (7.83E+05) | 6.43E+05 (8.51E+05) | 0.033 | 6.04E+05 (8.49E+05) | 6.36E+05 (7.27E+05) | 0.346 |
tumor necrosis factor alpha | 19.6 (9.8) | 16.4 (6.6) | 16.7 (7.6) | <0.001 | 16.8 (7.3) | 17.5 (8.5) | 0.126 |
AFP L-3 | 4.7 (5.0) | 1.6 (3.2) | 2.6 (4.1) | <0.001 | 2.4 (4.0) | 3.0 (4.3) | 0.001 |
DCP | 2.8 (29.0) | 7.3 (49.4) | 7.1 (43.7) | <0.001 | 7.4 (47.2) | 5.2 (36.5) | 0.258 |
Using the internal validation dataset, we evaluated five models for predicting HCC risk in our primary analysis. These were 1) model using P3; 2) model using P9; 3) a model using demographic and clinical variables (base model); (4) model using demographic and clinical variables in the base model and P3; and (5) a model using demographic and clinical variables in the base model and P9. The C-index values of these five models were 0.655, 0.668, 0.715 0.731 and 0.732, respectively. Adding P3 or P9 significantly (both p<0.01) improved the C-index by 2.2% and 2.4%, respectively, over the base model. Adding P3 or P9 also significantly (both p<0.01) improved the 1-year and 2-year AUROCs (p<0.01) over the base model (Table 2 and Figure 1). The improvement in AUROC with addition of P3 and especially P9 to the base model was more prominent for the 1-year HCC risk.
Table 2.
Performance characteristics of predictive models for HCC risk stratification in patients with cirrhosis.
C index (entire study duration) | 1-year AUROC (95% CI) | 2-year AUROC (95% CI) | |
---|---|---|---|
Base model (age, sex, AFP, platelets, ALT, albumin, BMI, smoking, alcohol, liver disease etiology) | 0.715 (0.711, 0.719) | 0.747 (0.741, 0.753) | 0.728 (0.724, 0.732) |
P3 alone | 0.655 (0.651, 0.659) | 0.709 (0.703, 0.715) | 0.664 (0.660, 0.668) |
P9 alone | 0.668 (0.664, 0.672) | 0.730 (0.724, 0.736) | 0.682 (0.678, 0.686) |
Base model +P3 | 0.731 (0.727, 0.735) | 0.776 (0.770, 0.782) | 0.743 (0.739, 0.747) |
Base Model +P9 | 0.732 (0.728, 0.736) | 0.783 (0.777, 0.789) | 0.748 (0.744, 0.752) |
AUROC: Area Under Receiver Operating Curve. P3 3 biomarker panel. P9: nine biomarker panel. CI: confidence intervals. BMI: body mass index
Figure 1.
Area under receiver operating curve for HCC at follow-up of 1 year (A) and 2 years (B) shown for base HCC predictive model, base model plus P3, and base model plus P9.
Both P-9 and P3 were independently predictive of HCC risk. The 9-biomarker panel was associated with HCC risk independent of the base model with an adjusted hazard ratio of 2.06 (95% CI, 1.63–2.61) as compared with 1.75 (95% CI, 1.16–2.64) for the 3-biomarker panel; P<0.05.
We obtained serum levels of AFP L3 and DCP in 1945 (85.83%) and 1943 (85.75%) patients, respectively (both markers were available in 1929 [86.6%] of the cohort). We added each as well as both biomarkers to our base model as well as the model with P9, and found no statistically significant change in C value or AUROC (Table 4) after adding AFP-L3 and/or DCP.
Table 4.
Performance characteristics of aMAP with and without biomarker panels
Model | C-Index (95% CI) | AUROC-1yr (95% CI) | AUROC-2yr (95% CI) |
---|---|---|---|
aMAP risk score | 0.651 (0.647, 0.655) | 0.657 (0.651, 0.663) | 0.674 (0.670, 0.678) |
aMAP risk score + P3 | 0.687 (0.683, 0.691) | 0.722 (0.716, 0.728) | 0.701 (0.697, 0.705) |
aMAP risk score + P9 | 0.693 (0.689, 0.697) | 0.750 (0.744, 0.756) | 0.725 (0.721, 0.729) |
We also tested the predictive value of aMAP with and without P3 and P9. aMAP had a lower predictive value than our base model (0.651 versus 0.715). However, adding the P9 to aMAP increased the C index from 0.651 with aMAP alone to 0.693 with aMAP and P9 (p<0.01); a similar magnitude of increase to that observed in our primary analysis (Table 2 and 4).
The calibration of the models is shown in Figure 2. The base model combined with P9 had better calibration (i.e., the predicted HCC risk has higher similarity to the actual HCC risk) at all ranges of HCC risk than the base model alone or that of the base model combined with P3. The CIF curves of the base model combined with P9 for the low-risk and high-risk groups were compared in Figure 3. The high-risk group consistently exhibited a significantly higher risk of HCC (p<0.01) right from the outset, highlighting the strong discriminatory capability of the model.
Figure 2.
Calibration plots for HCC predictive models (predicted vs actual HCC) at 2 years of follow-up (A) base model with P3 and (B) base model with P9.
Figure 3.
The cumulative incidence function of HCC stratified by the baseline average risk scores derived from a predictive model that combines clinical and demographic variables with the P9 biomarker panel. The groups are stratified as high (above the median score) and low risk (below median score) for HCC.
We examined the number of HCC cases and non-cases that would be correctly or incorrectly identified per 1000 patients in a 2-year screening program under scenarios of desired ≥90% sensitivity or ≥90% specificity (Table 3). Of the 70 HCC cases per 1000 patients in the study cohort, 63 are captured by a cut-point with ≥ 0.90 sensitivity. At this cut-point, there was a specificity of 31.7% (with 636 false positive cases) using the base model. The addition of P9, while maintaining a cut-point with ≥.90 sensitivity, resulted in a specificity of 34.2% with 612 false positives (i.e., correct reclassification of 4% of false positive cases). The base model at the ≥.90 specificity cut-point achieves a sensitivity of 25.4% with 52 false negatives. Incorporating P9 into the model increased the specificity to 29.4% with 49 false negatives, while maintaining ≥.90 specificity. This resulted in a correct reclassification of 6% of false negatives as true positive.
Table 3.
Predictive value of HCC occurring within 2 years of follow up.
Cutoff for sensitivity=0.9 | Specificity (at fixed sensitivity of >=90%) | Cutoff for specificity=0.9 | Sensitivity (at fixed specificity of >=90%) | |
---|---|---|---|---|
Base model | −1.378 | 0.317 | −0.162 | 0.254 |
Base + P3 model | −5.128 | 0.333 | −3.906 | 0.286 |
Base + P9 model | −2.875 | 0.342 | −1.650 | 0.294 |
The stratified analyses by etiology in patients with viral hepatitis (301 active HCV, 534 cured HCV, 30 with HBV) and 1401 patients with non-viral hepatitis showed comparable results in both groups. However, the predictive values of the base model as well as base combined with P3 or P9 models were 3.6% to 4.1% higher in the non-viral hepatitis group compared to the viral hepatitis group (Table 3). The base model performed slightly worse in women than men (c static 0.667 vs. 0.716) but the base model combined with P9 improved the predictive value disproportionately for women; (0.733 vs. 0.728) (Table 5). Lastly, the results of sensitivity analyses excluding 301 patients with active HCV and 12 patients who developed HCC within the first 6 months of follow-up showed no differences from the primary analyses (Supplemental Table 3).
Table 5.
C-index for HCC predictive models. The analyses were conducted among patients with cirrhosis stratified by underlying etiology, and by gender.
HCV or HBV (n=865) | Other etiology (1401) | Male (n=1415) | Female (n=851) | |
---|---|---|---|---|
Base model | 0.705 (0.701, 0.709) | 0.714 (0.710, 0.718) | 0.716 (0.712,0.720) | 0.667 (0.661,0.673) |
P3 model | 0.618 (0.614, 0.622) | 0.674 (0.670, 0.678) | 0.628 (0.624,0.632) | 0.689 (0.683,0.695) |
P9 model | 0.629 (0.625, 0.633) | 0.679 (0.675, 0.683) | 0.638 (0.634,0.642) | 0.691 (0.685,0.697) |
Base + P3 model | 0.710 (0.706, 0.714) | 0.740 (0.736, 0.744) | 0.728 (0.724,0.732) | 0.729 (0.723,0.735) |
Base + P9 model | 0.711 (0.707, 0.715) | 0.743 (0.739, 0.747) | 0.728 (0.724,0.732) | 0.733 (0.727,0.739) |
Discussion
In this longitudinal, we concomitantly evaluated 39 biomarkers that were previously linked to HCC and derived and internally validated a 9-biomarker signature that was predictive of HCC risk among patients with cirrhosis. These biomarkers were predictive of short as well as intermediate-term HCC risk both independent of as well as in addition to our previously developed base HCC prediction model with demographic, lifestyle, and clinical variables. This study is a major step in developing an adaptive multidimensional risk model that contains variables from several domains reflecting distinct aspects of the pathogenetic processes that drive or promote oncogenesis in cirrhosis.
Our approach represents a valuable framework for biomarker evaluation in HCC risk stratification. We followed a structured 2-step approach that entailed (1) using a previously validated base model and (2) measuring the marginal predictive value by adding the new biomarkers to the base model. For the first step, we used two separate models -- a model developed and externally validated by our group as well as previously developed aMAP.[8] It would have been easier and perhaps “more exciting” to present the results from the 9-biomarker panel in isolation; however, it would have been inconsistent with clinical practice where new biomarkers are unlikely to replace simple, routinely available information. Our approach represents a more critical and pragmatic appraisal of the value of new biomarkers by comparing the gain in predictive ability relative to that offered by simple tests. We also demonstrate a robust and almost identical incremental predictive benefit of the P9 biomarker panel over aMAP, similar to the additional value conferred to our base risk predictive model.
Our study highlights that risk prediction and stratification is difficult, and just like other disorders (e.g., predicting cancer in Barrett’s esophagus),[27] one expects relatively small increments in predictive power with the addition of new biomarkers to robust clinical models. For example, addition of AFP L-3 and DCP did not have any effect on the predictive ability of the model. The incremental gain was larger in the subgroup with non-viral hepatitis related cirrhosis, which may have clinical implications, especially if our results are confirmed in other cohorts of patient with non-viral hepatitis related cirrhosis. We expect that further progress in risk stratification may occur in relatively small steps, and likely come from adding biomarkers reflecting different biological domains (e.g., genomics, radiomics, transcriptomics) in progression to HCC.[37] A guiding principle is that variables that require additional cost and testing, need to significantly add to the base model to maintain the eventual applicability and cost effectiveness.
We conducted the largest PRoBE study, phase 3 HCC biomarker testing design, which is ideal to move the findings of many phase 1 or 2 studies further down the validation path.[17] We also conducted comprehensive and efficient testing of available biomarkers including AFP L-3 and DCP. Placing these biomarkers on FirePlex immunoassay has greatly facilitated the efficiency of simultaneous testing. We selected markers based on biological relevance, availability of suggestive evidence from biomarker studies in humans, as well as eventual clinical applicability criteria. Some of the selected biomarkers represented similar pathways or underlying metabolic dysfunction and they were highly correlated. Therefore, we used special statistical techniques that examined both the correlation as well as the predictive value for HCC. We created three different biomarker panels based on 3 modeling approaches (Lasso and stepwise regression using two different cutoffs). The findings were robust as well as biologically plausible: all 3 biomarkers (IGF-1, Latent TGF-β1 and IL-10)[28–31] identified by stepwise regression with a p cut-off of 0.05 and are parts of known pathways of oncogenesis, tumor suppression and inflammation, respectively. The Lasso method contained in the 5-biomarker selected and the latter were all contained in the 9 biomarkers selected from stepwise regression with a p cut-off of 0.20 (IGF-1, Latent TGF-β1, IL-10, MSPa, TNF-α, adipsin,[32–33] IL-1β, SAA and fetuin-A).[11, 29, 34–36] The 9-biomarker panel was associated with the highest discrimination model and calibration and the highest correct reclassification value. Therefore, we believe that the 9-biomarker panel is the most appropriate panel to be used as an independent HCC predictor or to be included in our adaptive modeling approach. Our results were robust in several subgroup and sensitivity analyses.
Our study has some limitations. While this is one of the larger prospective cohort studies of patients with cirrhosis, the number of outcome HCC events is small (n=126). This limitation also prohibited modeling each biomarker individually in the final HCC predictive model. It is possible that with larger sample size the composition or the number of the additional biomarkers will change. The incremental discriminatory value of the biomarker panel was modest, and therefore it is not clear whether this enhanced model is cost-effective across different clinical scenarios and healthcare systems. It is possible that the modest performance could be boosted by incorporating changes over time, and while we have longitudinal samples, we have not performed the assays yet. Last, while we performed internal validation, external validation in other datasets is required.
In summary, in a phase 3 PRoBE study, we have developed a panel of 9 circulating serum biomarkers that is significantly associated with HCC risk among patients with cirrhosis. This biomarker panel was independently predictive of HCC but also led to small albeit significant improvements in the predictive value compared to readily available base models of demographic, clinical and lifestyle variables. Our approach also represents a valuable framework for biomarker evaluation in HCC risk stratification.
Supplementary Material
What is already known on this topic:
Several individual biomarkers of inflammation and metabolic dysfunction have been reported to be significantly associated with HCC in phase I or II biomarker studies.
What this study adds:
This longitudinal phase III biomarker validation study measured the performance of 39 potential HCC biomarkers in a large, well-defined large cohort of patients with cirrhosis. Our study has identified a panel of 9 biomarkers which significantly improves the predictive ability of previously established risk stratification models containing demographic and clinical variables.
How this study might affect research, practice, or policy:
This study is a major step in developing multidimensional risk models that contains variables from several domains reflecting distinct aspects of the pathogenetic processes that drive oncogenesis in cirrhosis.
Grant Support:
This work was supported by the Cancer Prevention & Research Institute of Texas grant (RP150587) and the NCI (NCI P01 CA263025), and in part by Center for Gastrointestinal Development, Infection, and Injury (NIDDK P30 DK 56338). Drs. Kanwal and El-Serag are investigators at the Veterans Administration Center for Innovations in Quality, Effectiveness and Safety (CIN 13–413), Michael E. DeBakey VA Medical Center, Houston, Texas. Patient samples were stored and processed at the Population Sciences Biorepository core at Baylor College of Medicine with funding from the NCI (P30 Cancer Center Support Grant CA125123). Dr. Thrift’s effort was supported in part by the facilities and resources of the Gulf Coast Center for Precision and Environmental Health P30ES030285 (PI: Walker). Dr. Singal and Kanwal’s research is supported by CPRIT RP200554 and RP200633.
Abbreviations:
- HCC
hepatocellular cancer
- NAFLD
nonalcoholic fatty liver disease
- HR
hazard ratio
- CI
confidence interval
- HCV
hepatitis C virus
- HBV
hepatitis B virus
- THCCC
Texas Hepatocellular Carcinoma Consortium Cohort
- BMI
body mass index
Footnotes
Conflicts of Interest: No conflicts of interest to disclose.
References
- 1.Kanwal F, Khaderi S, Singal AG, et al. Risk factors for HCC in contemporary cohorts of patients with cirrhosis. Hepatology. Mar 1 2023;77(3):997–1005. doi: 10.1002/hep.32434 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.McGlynn KA, Petrick JL, El-Serag HB. Epidemiology of Hepatocellular Carcinoma. Hepatology. Jan 2021;73 Suppl 1(Suppl 1):4–13. doi: 10.1002/hep.31288 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Moon AM, Singal AG, Tapper EB. Contemporary Epidemiology of Chronic Liver Disease and Cirrhosis. Clin Gastroenterol Hepatol. Nov 2020;18(12):2650–2666. doi: 10.1016/j.cgh.2019.07.060 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.El-Serag HB, Kanwal F, Feng Z, et al. Risk Factors for Cirrhosis in Contemporary Hepatology Practices-Findings From the Texas Hepatocellular Carcinoma Consortium Cohort. Gastroenterology. Jul 2020;159(1):376–377. doi: 10.1053/j.gastro.2020.03.049 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.McMahon B, Cohen C, Brown RS Jr., et al. Opportunities to address gaps in early detection and improve outcomes of liver cancer. JNCI Cancer Spectr. May 2 2023;7(3)doi: 10.1093/jncics/pkad034 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Harrison AC, Kanwal F, Asrani SK, et al. The Texas collaborative center for hepatocellular cancer: Reducing liver cancer mortality in Texas through coordination, collaboration and advocacy. Front Oncol. 2022;12:953933. doi: 10.3389/fonc.2022.953933 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Kanwal F KS, Singal A, Marrero J, Asrani S, Amos C, Thrift A, Kramer, Yu J, Cao Y, Luster M, Al-Sarraj A, Ning J, El-Serag HB. Risk Stratification Model for Hepatocellular Cancer in Patients With Cirrhosis. Clini Gastroenterol Hepatpol. Apr 30:S1542–3565(23)00319–1. doi: 10.1016/j.cgh.2023.04.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Johnson PJ, Innes H, Hughes DM, Kalyuzhnyy A, Kumada T, Toyoda H. Evaluation of the aMAP score for hepatocellular carcinoma surveillance: a realistic opportunity to risk stratify. Br J Cancer. 2022. Oct;127(7):1263–1269. doi: 10.1038/s41416-022-01851-1. Epub 2022 Jul 7. PMID: 35798825; PMCID: PMC9519948. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Fan R, Papatheodoridis G, Sun J, Innes H, Toyoda H, Xie Q, Mo S, Sypsa V, Guha IN, Kumada T, Niu J, Dalekos G, Yasuda S, Barnes E, Lian J, Suri V, Idilman R, Barclay ST, Dou X, Berg T, Hayes PC, Flaherty JF, Zhou Y, Zhang Z, Buti M, Hutchinson SJ, Guo Y, Calleja JL, Lin L, Zhao L, Chen Y, Janssen HLA, Zhu C, Shi L, Tang X, Gaggar A, Wei L, Jia J, Irving WL, Johnson PJ, Lampertico P, Hou J. aMAP risk score predicts hepatocellular carcinoma development in patients with chronic hepatitis. J Hepatol. 2020. Dec;73(6):1368–1378. doi: 10.1016/j.jhep.2020.07.025. Epub 2020 Jul 21. PMID: 32707225. [DOI] [PubMed] [Google Scholar]
- 10.Hoshida Y, Villanueva A, Kobayashi M, et al. Gene expression in fixed tissues and outcome in hepatocellular carcinoma. N Engl J Med. Nov 6 2008;359(19):1995–2004. doi: 10.1056/NEJMoa0804525 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Aleksandrova K, Boeing H, Nothlings U, et al. Inflammatory and metabolic biomarkers and risk of liver and biliary tract cancer. Hepatology. Sep 2014;60(3):858–71. doi: 10.1002/hep.27016 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Malaguarnera G, Giordano M, Paladina I, Berretta M, Cappellani A, Malaguarnera M. Serum markers of hepatocellular carcinoma. Dig Dis Sci. Oct 2010;55(10):2744–55. doi: 10.1007/s10620-010-1184-7 [DOI] [PubMed] [Google Scholar]
- 13.Yang Q, Sun C, Sheng Y, Chen W, Deng C. The Biomarkers for Predicting Viral HepatitisAssociated Hepatocellular Carcinoma. Turk J Gastroenterol. Jan 2022;33(1):1–7. doi: 10.5152/tjg.2022.19813 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Costantini S, Capone F, Maio P, et al. Cancer biomarker profiling in patients with chronic hepatitis C virus, liver cirrhosis and hepatocellular carcinoma. Oncol Rep. Jun 2013;29(6):2163–8. doi: 10.3892/or.2013.2378 [DOI] [PubMed] [Google Scholar]
- 15.Cabiati M, Gaggini M, De Simone P, Del Ry S. Data mining of key genes expression in hepatocellular carcinoma: novel potential biomarkers of diagnosis prognosis or progression. Clin Exp Metastasis. Aug 2022;39(4):589–602. doi: 10.1007/s10585-022-10164-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Conti F, Dall’Agata M, Gramenzi A, Biselli M. Biomarkers for the early diagnosis of bacterial infection and the surveillance of hepatocellular carcinoma in cirrhosis. Biomark Med. 2015;9(12):1343–51. doi: 10.2217/bmm.15.100 [DOI] [PubMed] [Google Scholar]
- 17.Pepe MS, Etzioni R, Feng Z, et al. Phases of biomarker development for early detection of cancer. J Natl Cancer Inst. Jul 18 2001;93(14):1054–61. doi: 10.1093/jnci/93.14.1054 [DOI] [PubMed] [Google Scholar]
- 18.Feng Z, Marrero JA, Khaderi S, et al. Design of the Texas Hepatocellular Carcinoma Consortium Cohort Study. Am J Gastroenterol. Mar 2019;114(3):530–532. doi: 10.14309/ajg.0000000000000068 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Pepe MS, Feng Z, Janes H, Bossuyt PM, Potter JD. Pivotal evaluation of the accuracy of a biomarker used for classification or prediction: standards for study design. J Natl Cancer Inst. Oct 15 2008;100(20):1432–8. doi: 10.1093/jnci/djn326 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Riley RD, Ensor J, Snell KIE, Harrell FE Jr, Martin GP, Reitsma JB, Moons KGM, Collins G, van Smeden M. Calculating the sample size required for developing a clinical prediction model. BMJ. 2020. Mar 18;368:m441. doi: 10.1136/bmj.m441. PMID: 32188600. [DOI] [PubMed] [Google Scholar]
- 21.Collins GS, Reitsma JB, Altman DG, Moons KG. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMJ. 2015. Jan 7;350:g7594. doi: 10.1136/bmj.g7594. PMID: 25569120. [DOI] [PubMed] [Google Scholar]
- 22.Singal AG, Llovet JM, Yarchoan M, et al. AASLD practice guidance on prevention, diagnosis, and treatment of hepatocellular carcinoma. Hepatology. May 22 2023;doi: 10.1097/HEP.0000000000000466 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Zhang Z, Cortese G, Combescure C, Marshall R, Lee M, Lim HJ, Haller B; written on behalf of AME Big-Data Clinical Trial Collaborative Group. Overview of model validation for survival regression model with competing risks using melanoma study data. Ann Transl Med. 2018. Aug;6(16):325. doi: 10.21037/atm.2018.07.38. PMID: 30364028; PMCID: PMC6186983. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Mondol M, Rahman MS. A comparison of internal validation methods for validating predictive models for binary data with rare events. Journal of Statistical Research. January/08 2018;51:131–144. doi: 10.47302/jsr.2017510203 [DOI] [Google Scholar]
- 25.Austin PC, Fine JP. Practical recommendations for reporting Fine-Gray model analyses for competing risk data. Stat Med. Nov 30 2017;36(27):4391–4400. doi: 10.1002/sim.7501 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Wang L, Luedtke AR, Huang Y. Assessing the incremental value of new biomarkers based on OR rules. Biostatistics. Jul 1 2020;21(3):594–609. doi: 10.1093/biostatistics/kxy070 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Dong J, Buas MF, Gharahkhani P, Kendall BJ, Onstad L, Zhao S, Anderson LA, Wu AH, Ye W, Bird NC, Bernstein L, Chow WH, Gammon MD, Liu G, Caldas C, Pharoah PD, Risch HA, Iyer PG, Reid BJ, Hardie LJ, Lagergren J, Shaheen NJ, Corley DA, Fitzgerald RC; Stomach and Oesophageal Cancer Study (SOCS) consortium; Whiteman DC, Vaughan TL, Thrift AP. Determining Risk of Barrett’s Esophagus and Esophageal Adenocarcinoma Based on Epidemiologic Factors and Genetic Variants. Gastroenterology. 2018. Apr;154(5):1273–1281.e3. doi: 10.1053/j.gastro.2017.12.003. Epub 2017 Dec 13. PMID: 29247777; PMCID: PMC5880715. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Huber Y, Bierling F, Labenz C, et al. Validation of insulin-like growth factor-1 as a prognostic parameter in patients with hepatocellular carcinoma in a European cohort. BMC Cancer. Jul 31 2018;18(1):774. doi: 10.1186/s12885-018-4677-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Wang J, Li YC, Deng M, et al. Serum insulin-like growth factor-1 and its binding protein 3 as prognostic factors for the incidence, progression, and outcome of hepatocellular carcinoma: a systematic review and meta-analysis. Oncotarget. Oct 6 2017;8(46):81098–81108. doi: 10.18632/oncotarget.19186 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Scharf JG, Braulke T. The role of the IGF axis in hepatocarcinogenesis. Horm Metab Res. Nov-Dec 2003;35(11–12):685–93. doi: 10.1055/s-2004-814151 [DOI] [PubMed] [Google Scholar]
- 31.Mazziotti G, Sorvillo F, Morisco F, et al. Serum insulin-like growth factor I evaluation as a useful tool for predicting the risk of developing hepatocellular carcinoma in patients with hepatitis C virus-related cirrhosis: a prospective study. Cancer. Dec 15 2002;95(12):2539–45. doi: 10.1002/cncr.11002 [DOI] [PubMed] [Google Scholar]
- 32.Marra F, Bertolani C. Adipokines in liver diseases. Hepatology. Sep 2009;50(3):957–69. doi: 10.1002/hep.23046 [DOI] [PubMed] [Google Scholar]
- 33.Duan XF, Tang P, Li Q, Yu ZT. Obesity, adipokines and hepatocellular carcinoma. Int J Cancer. Oct 15 2013;133(8):1776–83. doi: 10.1002/ijc.28105 [DOI] [PubMed] [Google Scholar]
- 34.Betesh L, Comunale MA, Wang M, et al. Identification of fucosylated Fetuin-A as a potential biomarker for cholangiocarcinoma. Proteomics Clin Appl. Sep 2017;11(9–10)doi: 10.1002/prca.201600141 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Comunale MA, Wang M, Hafner J, et al. Identification and development of fucosylated glycoproteins as biomarkers of primary hepatocellular carcinoma. J Proteome Res. Feb 2009;8(2):595–602. doi: 10.1021/pr800752c [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Li L, Gu X, Fang M, Ji J, Yi C, Gao C. The diagnostic value of serum fucosylated fetuin A in hepatitis B virus-related liver diseases. Clin Chem Lab Med. Apr 2016;54(4):693–701. doi: 10.1515/cclm-2015-0307 [DOI] [PubMed] [Google Scholar]
- 37.Thrift AP, Kanwal F, Liu Y, et al. Risk stratification for hepatocellular cancer among patients with cirrhosis using a hepatic fat polygenic risk score. PLoS One. 2023;18(2):e0282309. doi: 10.1371/journal.pone.0282309 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.