To the editor
Chronic liver disease (CLD), including cirrhosis and liver cancer, accounts for 9.5 million deaths globally in 2017, one third of which occurred in China1. Non-alcoholic fatty liver disease (NAFLD) alongside its metabolic risk factors has become a major risk factor for CLD2. Based on easily accessible risk factors, the CLivD score, a simple model to predict future risk of CLD, has been developed and validated in European populations3. This model can be used to identify individuals at high risk who could benefit from lifestyle interventions and should be referred for further liver assessment. However, the associations of lifestyle and metabolic risk factors with CLD risk differ in Chinese adults4,5. It is unclear whether the CLivD score could be used in Chinese adults after model recalibration. Therefore, we performed an external validation of the CLivD score for Europeans in the China Kadoorie Biobank (CKB).
The China Kadoorie Biobank (CKB) recruited 512,715 participants aged 30-79 years from 10 areas (5 urban, 5 rural) in China during 2004-20086. All participants completed an interviewer-administered laptop-based questionnaire and physical measurements were recorded by trained technicians. 17 biochemistry traits were measured in a subsample of 17,681 participants7. Follow-up for International Statistical Classification of Diseases and Related Health Problems 10th Revision-coded incident events continued to January 1, 2018, through linkage with morbidity and mortality registries and a nationwide health insurance system. CLD was defined as advanced liver disease or liver-related mortality, including liver cirrhosis, NAFLD, and liver fibrosis (ICD-10 code: K70, K72, and K74 alongside other complications of CLD)3. Discrimination was assessed using Harrell’s C index. Calibration was assessed graphically by comparing the predicted risks and the observed risk. We recalibrated the original CLivD score model by re-estimating the predictor coefficients in CKB (i.e. model refit)8.
Of the 504,009 participants included (excluding participants with a prior history of cancer, cirrhosis or hepatitis), the mean age was 52 years (SD 10.7 years), and 59.2% were women. During 11 years of follow-up, there were 4091 incident cases of CLD. The Harrell’s C index of the CLivD score was 0.65 (0.64-0.66) for modelnon-lab and 0.68 (0.62-0.73) for modellab (Figure). The Harrell’s C index for modelnon-lab in CKB was similar to those reported by the original CLivD score paper, but the Harrell’s C index for modellab in CKB was lower. For modelnon-lab, the Harrell’s C index was slightly lower among men and participants who tested positive for HBsAg and those with diabetes. For modellab, the Harrell’s C index was slightly higher among women and those from rural areas, weekly drinkers, and fatty liver index (FLI) ≥60 (Figure). Both models under-estimated risk at lower levels of observed risk but over-estimated risk among participants among the top decile of observed risk. After recalibration, both models still underestimated the risk (Figure).
Figure. Harrell's C and model calibration curves for the CLivD score model in CKB.
Panel (A): Time since study entry was used as the underlying time scale. The squares represent C-statistics and the vertical lines represent 95% CI. The area of the squares is inversely proportional to the variance of the log C-statistic.
Panel (B): Observed risk was plotted against expected risk of developing CLD over the 10-year period in groups based on quintiles of expected risk and the slope and intercept were estimated. (A) and (B) were for the original CLivD score models, while (C) and (D) were from model refit.
Modelnon-lab had good overall discrimination in this Chinese population. Three prospective studies have externally validated the CLivD score. The first two studies were European cohorts and reported a C-statistic of 0.65 (CCHS) and 0.74 (Whitehall II) for modelnon-lab, separately6. The third study was NHANES, involving diverse race and ethnicities, and showed a C-statistic of 0.637 for modelnon-lab9. The current study reported a C-statistic of 0.649 for modelnon-lab, which was generally consistent with the aforementioned studies. In contrast, modellab performed worse in the Chinese population than previous studies. The C-statistic for modellab was 0.78 in CCHS and 0.733 in NHANES, respectively, and was higher than the C-statistic of 0.68 in CKB. Possible reasons for the worse performance of modellab included the much smaller number of participants with biochemistry data in CKB, their inclusion criteria and the weak associations between predictors and incident CLD.
The CLivD score under-estimated the risk of CLD in CKB across the range of observed risk even after recalibration by refitting the models. Underestimation was a particular issue in HBsAg-positive participants, which is anticipated since progression of liver disease is driven more by viral factors than the factors included in the CLivD score. In contrast, calibration was much better in the HBsAg-negative subjects. The unsatisfying performance of model calibration may reflect the fact that different sets of predictors are needed for Chinese to develop risk prediction models with good calibration as well as good discrimination.
This was the first study to externally validate risk prediction models for CLD in the CKB, a large, geographically diverse population-based cohort study in China. The model including easily obtainable variables based on demographic, lifestyle factors, and medical history was found to have good discrimination, whereas the model additionally including GGT had a lower discrimination. Both models underestimated the risk across the range of observed risk even after model refit. Future studies are warranted to develop risk prediction models for CLD in Chinese adults taking into account of the varying risk factors and aetiologies of CLD.
Statements and declarations
Competing of interest statement
The authors have no relevant financial or non-financial interests to disclose.
Author contributions
Conceptualization, Y.P. and C.K.; methodology, Y.P. and C.K.; software, Y.P., C.K., and F.A.; validation, Y.P., C.K., and F.A.; formal analysis, Y.P. and C.K.; investigation, Y.P.; resources, Y.P., C.K., and F.A.; data curation, Y.P. and C.K.; writing—original draft preparation, Y.P. and C.K.; writing—review and editing, Y.P., C.K., F.A., L.L., and Z.C.; visualization, Y.P. and C.K.; supervision, L.L. and Z.C.; project administration, Y.P., C.K., L.L., and Z.C.; funding acquisition, Y.P., C.K., L.L., and Z.C. All authors have read and agreed to the published version of the manuscript.
Financial support
This work was supported by National Natural Science Foundation of China (82192904, 82192901, 82192900). The CKB baseline survey and the first re-survey were supported by a grant from the Kadoorie Charitable Foundation in Hong Kong. The long-term follow-up is supported by grants from the UK Wellcome Trust (212946/Z/18/Z, 202922/Z/16/Z, 104085/Z/14/Z, 088158/Z/09/Z), grants (2016YFC0900500) from the National Key R&D Program of China, National Natural Science Foundation of China (81390540, 91846303, 81941018), and Chinese Ministry of Science and Technology (2011BAI09B01).
Ethical approval
The study was approved by the Institutional Review Board of Peking University Health Science Center (ID of the approval: IRB00001052-20040). Informed consent was obtained from all participants.
References
- 1.GBD 2017 Causes of Death Collaborators. Global, regional, and national age-sex-specific mortality for 282 causes of death in 195 countries and territories, 1980-2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet. 2018;392(10159):1736–1788. doi: 10.1016/S0140-6736(18)32203-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Xiao J, Wang F, Wong NK, et al. Global liver disease burdens and research trends: Analysis from a Chinese perspective. J Hepatol. 2019;71(1):212–221. doi: 10.1016/j.jhep.2019.03.004. [DOI] [PubMed] [Google Scholar]
- 3.Åberg F, Luukkonen PK, But A, et al. Development and validation of a model to predict incident chronic liver disease in the general population: The CLivD score. J Hepatol. 2022;77(2):302–311. doi: 10.1016/j.jhep.2022.02.021. [DOI] [PubMed] [Google Scholar]
- 4.Pang Y, Kartsonaki C, Turnbull I, et al. Diabetes, Plasma Glucose, and Incidence of Fatty Liver, Cirrhosis, and Liver Cancer: A Prospective Study of 0.5 Million People. Hepatology. 2018;68(4):1308–1318. doi: 10.1002/hep.30083. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Pang Y, Kartsonaki C, Lv J, et al. Observational and Genetic Associations of Body Mass Index and Hepatobiliary Diseases in a Relatively Lean Chinese Population. JAMA Netw Open. 2020;3(10):e2018721. doi: 10.1001/jamanetworkopen.2020.18721. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Chen Z, Chen J, Collins R, Guo Y, Peto R, Wu F, et al. China Kadoorie Biobank of 0.5 million people: survey methods, baseline characteristics and long-term follow-up. Int J Epidemiol. 2011;40(6):1652–66. doi: 10.1093/ije/dyr120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Wang X, Wu Z, Lv J, Yu C, Sun D, Pei P, et al. Life-course adiposity and severe liver disease: a Mendelian randomization analysis. Obesity. 2023 Oct 23; doi: 10.1002/oby.23913. [DOI] [PubMed] [Google Scholar]
- 8.Van Calster B, McLernon DJ, van Smeden M, Wynants L, Steyerberg EW Topic Group ‘Evaluating diagnostic tests and prediction models’ of the STRATOS initiative. Calibration: the Achilles heel of predictive analytics. BMC Med. 2019;17(1):230. doi: 10.1186/s12916-019-1466-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Song J, Jiang ZG. A good step toward low-cost prognostication of liver-related outcome awaits more validation. J Hepatol. 2022;77(3):887–889. doi: 10.1016/j.jhep.2022.04.008. [DOI] [PubMed] [Google Scholar]

