Skip to main content
eBioMedicine logoLink to eBioMedicine
. 2022 Feb 25;77:103890. doi: 10.1016/j.ebiom.2022.103890

Clinical characteristics of neonatal cholestasis in a tertiary hospital and the development of a novel prediction model for mortality

Ho Jung Choi a,1, Inki Kim b,1, Hye-Jin Lee a, Hyun Ju Oh a, Mi Kyoung Ahn a, Woo Im Baek a, Yeong Eun Kim a, Seak Hee Oh a,, Byong Sop Lee a, Jung-Man Namgoong c, Dae Yeon Kim d, Eun Joo Lee d, Jung Ok Shim e, Jae Sung Ko d, Kyung Mo Kim a
PMCID: PMC8889106  PMID: 35220043

Summary

Background

Few studies have described the aetiologies of neonatal cholestasis, and the overall neonatal cholestasis-related mortality (NCM) rate is unclear. We investigated the aetiology and outcome of neonatal cholestasis in a tertiary hospital and developed an NCM prediction model for these patients.

Methods

Patients aged <100 days with serum direct bilirubin (DB) levels of >1.0 mg/dL were retrospectively screened. Diagnostic and laboratory data during the 8-week follow-up period after enrolment between 2005 and 2020 were extracted digitally, and medical charts were reviewed manually by clinicians. Logistic regression was used to derive a prediction model for the 1-year mortality outcome of neonatal cholestasis, and performance evaluation and external validation were conducted for the NCM prediction model.

Findings

We enrolled 4028 neonates with DB of >1.0 mg/dL at least once. Prematurity and birth injury (35.4%), complex heart anomalies (18.6%), liver diseases (11.4%), and gastrointestinal anomalies (9.2%) were the most common aetiologies; 398 (9.9%) patients died before one year of age. The peak value of DB was positively correlated to the 1-year mortality rate. In the multivariate analysis, simple laboratory indices, including platelet, prothrombin time, aspartate aminotransferase, albumin, direct bilirubin, creatinine, and C-reactive protein, were independent predictors of 1-year mortality outcome of complete-case subjects. Using these laboratory indices, a logistic regression-based NCM prediction model was constructed. It showed acceptable performances on discrimination (area under the curve, 0.916), calibration (slope, 1.04) and Brier scoring (0.072). The external validation of the sample (n = 920) from two other centres also revealed similar performance profiles of the NCM model.

Interpretation

Various aetiologies of neonatal cholestasis were identified in a tertiary hospital, resulting in unfavourable outcomes of a large proportion. The NCM prediction model may have the potential to help clinicians to be aware of high-risk neonatal cholestasis.

Keywords: Neonatal cholestasis, Aetiology, Mortality, Prediction model


Research in context.

Evidence before this study

Aetiologies of neonatal cholestasis were described in few studies from liver centres with small numbers of subjects. Risks of mortality in neonatal cholestasis have been assessed by a limited dataset of subgroups such as parenteral nutrition.

Added value of this study

This study of the largest cohort of neonatal cholestasis showed various aetiologies and combined medical conditions in detail. Using simple laboratory indices, a novel prediction model for 1-year mortality was developed and validated based on the TRIPOD guidelines.

Implications of all the available evidence

The results show that the prediction model has a potential for clinical application in identifying high-risk neonatal cholestasis.

Alt-text: Unlabelled box

Introduction

Neonatal cholestasis is defined as reduced bile formation or flow resulting from hepatobiliary dysfunction or obstructive lesions of the bile duct in the neonatal period.1 Neonatal cholestasis is screened by the identification of elevated direct (or conjugated) bilirubin (DB) and bile acids. Neonatal cholestasis occurs in approximately 1 out of every 2500 term neonates.2,3 In primary care, the management of neonatal cholestasis requires caution because its causes are known to be biliary atresia (BA) and monogenic liver disease in approximately 25–40% and 25% of cases, respectively.1, 2, 3, 4, 5 As delayed Kasai operation generally leads to a poor outcome of the native liver, the early detection of BA is crucial.4 Infections, galactosemia, tyrosinemia, panhypopituitarism, bile acid synthesis defect, and biliary sludges (or stones) also require prompt management. So, the correct diagnosis of neonatal cholestasis often requires extensive invasive and non-invasive evaluations.1 Although the aetiologies are not discovered on differential diagnosis or treatments are not available for specific causes, nutritional optimisation and prevention of medical complications from this condition can be beneficial for the appropriate growth or nutritional stabilisation of the patient before liver transplantation. Despite the clinical importance of neonatal cholestasis, currently, a total of fewer than two thousand studies of various study designs are available in the literature on this subject.5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23

There is also little information available on the outcome of neonatal cholestasis in the literature, and the little available information is generally determined and reported by each specific cause and available treatment.5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23 Approximately 50% of patients with BA lose their native livers within 2 years after a Kasai operation.4 Contrarily, idiopathic cholestasis, including that caused by idiopathic neonatal hepatitis, is often transient and benign; only 1% of patients with idiopathic cholestasis lost their native livers before the age of 2 years in a recent report from the Childhood Liver Disease Research Network study.5 Transient cholestasis is also often observed in infants with specific systemic conditions such as sepsis, ischaemic injury from birth asphyxia or congenital heart anomalies, haemolytic disease and defective enteral autonomy with prolonged parenteral nutrition.1, 2, 3, 4, 5, 6 Premature infants cannot often afford oral nutrition and the risk of cholestasis doubles in cases of parenteral nutrition that lasts for more than 30 days.24 Many single-centre studies have revealed a variety of epidemiologic characteristics and clinical outcomes in neonatal cholestasis, of which cohorts contained heterogeneous patient groups based on the kind of centre (liver-oriented centre vs. other specialised centres; referral centre vs. primary centres).5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23

Predicting the outcome of neonatal cholestasis could be useful. By discriminating unfavourable outcomes such as in-hospital mortality among patients with neonatal cholestasis, clinicians are able to identify high-risk neonatal cholestasis cases and to prepare focused and multidisciplinary strategies for them. There are several mortality prediction models for liver diseases in paediatric hepatology, including the Pediatric End-Stage Liver Disease (PELD) score for chronic liver disease, the King's College Hospital criteria (KCHC) score, and the Liver Injury Units (LIU) for acute liver failure.25, 26, 27 The training dataset of these models originated from end-stage liver disease and fulminant hepatitis in paediatric groups with wide age ranges, mostly more than 6 months. In addition, the primary goal of these predictions is to estimate the risk-benefit of liver transplantation in affected infants and children. Currently, there is no prediction model for mortality in neonatal cholestasis using a big-enough dataset of neonatal cholestasis cases.

To make the neonatal cholestasis-related mortality (NCM) prediction model clinically useful, clear and sufficient information should be provided in the essential parts of model development-sample aetiologies, variable selection, model construction, full performance evaluation and external validation.28,29 Notably, not only accuracy in discrimination (i.e. receiver-operating characteristic curve analysis) but also precision in the calibration of the predicted probability must be included in the evaluation of the new model's performance.30 In addition to reliable performance, a prediction model should provide clinicians insights into algorithms used for predicting mortality in neonatal cholestasis. This is because understanding the ‘how and why’ by clinicians, which is referred to as the interpretability of models in statistics, applies to medical decision-making in real-life clinical practice.31

In this study, we investigated the aetiologic characteristics of neonatal cholestasis and its outcome in a tertiary hospital. In addition, using a large dataset of simple laboratory indices in this cohort, we developed an NCM prediction model for 1-year in-hospital mortality in neonates with cholestasis.

Methods

Study population

We conducted a retrospective study on cholestasis in a group of paediatric patients aged less than 100 days in the Seoul Asan Medical Center. Neonatal cholestasis was defined as serum DB >1.0 mg/dL according to a definition of the North American Society for Pediatric Gastroenterology, Hepatology and Nutrition and the European Society for Paediatric Gastroenterology, Hepatology, and Nutrition (NASPGHN/ESPGHN) guideline.1 The inclusion criteria were as follows:1 age <100 days (neonatal period),2 DB >1.0 mg/dL (cholestasis) at least once and3 occurrence of a first DB >1.0 mg/dL between January 2005 and January 2020 (study period). Based on the study criteria, neonates (n = 13,344) who never had DB >1.0 mg/dL were excluded from the analysis. Diagnostic information in the electronic medical records was classified according to the International Classification of Disease (ICD) system.14 All electric medical records and their ICD codes were manually validated in detail by paediatric hepatologists. The diagnosis of BA was evidenced by intraoperative angiography of the bile duct. Other definitions of idiopathic neonatal hepatitis and idiopathic neonatal cholestasis were used according to the criteria of the Childhood Liver Disease Research Network.5 From enrolment to the 8th week of follow-up, simple laboratory indices, such as the white blood cell (WBC, × 10³/µL) and platelet (PLT, × 10³/µL) counts; haemoglobin level (Hb, g/dL); prothrombin time (PT INR, international normalized ratio), alanine aminotransferase level (ALT, g/dL), aspartate aminotransferase (AST, IU/L) level and albumin (ALB, g/dL), total bilirubin (TB, mg/dL), DB, creatinine (Cr, mg/dL), and C-reactive protein (CRP, mg/dL) levels, were analysed.

Literature search

To compare the outcomes of the participants of the present study, the Medline and EMBASE databases were searched to extract outcome data from the prospective or retrospective case series or cohort study with >20 subjects with cholestasis during the neonatal period. The total number of subjects, proportion of BA, liver transplantation rate, mortality rate and risk factors for mortality were reviewed. If information on patients’ poor outcomes was not available in reported cohorts in any given study, the study was excluded. To evaluate the necessity of prediction model, we searched previous models for cholestatic paediatric patients and checked suggested questionnaires (Supplementary Table 1).32

Statistical analysis

The outcome (or dependent) variable for this study was survival up to the age of 1 year [i.e.1 survival group vs.2 mortality group]. Descriptive statistics for the characteristics listed above were provided for 1-year survival and 1-year mortality in subjects included in the model development. Continuous variables were expressed as mean values with standard deviations or median values with interquartile ranges (IQRs) for normally distributed and skewed data, respectively. Differences between variable groups were assessed using the two-sample t-test or the Mann-Whitney U test for continuous variables as appropriate. For categorical variables, the χ2 test or Fisher's exact test (in cases where the number of patients involved was ≤5) was used. Receiver-operating characteristic (ROC) curve analysis with the area under the curve (AUC) was performed to identify optimal cut-off values of continuous variables. For the multivariate analysis, logistic regression (LR) was performed. As alternative analyses, machine learning was also performed to explore underlying data pattern. All statistical calculations were performed using IBM SPSS version 27.0 (SPSS Inc., Armonk, NY), R version 3.6.1 (R Foundation for Statistical Computing, Vienna, Austria), and Python version 3.8.2 (Python Software Foundation, Wilmington), as appropriate. A p-value of <0.05 was considered statistically significant in statistical models.

Variable selection and missing values

The simple laboratory indices within 8 weeks of enrolment (DB >1.0 mg/dL) were selected as predictor candidates. Based on the fact that the paediatric prediction models for chronic liver disease and acute liver failure were adopted to use peak index values during the specific follow-up durations,25, 26, 27 not only the initial but also the peak values of each laboratory index were used as candidate variables evaluated in the modelling process. ROC curve analyses were performed to evaluate the discriminatory ability in univariate analysis of each variable and calculate empirical optimal cut-off values based on the Youden index.

Before conducting multiple imputation or regression imputation for missing values,33,34 missing values were classified into missing completely at random, missing at random, and missing not at random (MNAR). As MNAR is possibly associated with both independent and dependent variables,35 the plausible mechanism of suspected MNAR was reviewed using descriptive statistics. Clinical judgement on subgroups of neonatal cholestasis may affect the selection of a range of serologic tests, such as PT INR. For example, clinicians may not conduct full laboratory evaluation for transient benign neonatal cholestasis20,23 because transient cholestasis without any underlying severe diseases shows favourable survival outcomes.5 As MNAR is still problematic in statistics,35 we conducted both [1] complete-case analysis without imputation and [2] whole-case analysis with imputation.

Model development

The prediction model was developed following the recommended guideline of the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) statement and Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modelling Studies (CHARMS) checklist.28,36 One of the primary aims of this study was to develop a prediction model for 1-year in-hospital mortality (binary dependent variable). Firstly, multivariate LR was performed to develop an NCM prediction model; binary LR with the forward method37 was derived for the model 1-year in-hospital mortality risk and resulted in regression β coefficients and intercept of the appropriate laboratory values (logit [probability] = β0 + β1 × 1 + β2 × 2 + … + βnXn). Multicollinearity among independent variables was examined before LR, and parameters with correlation coefficients of ≥0.85 were removed from the LR analysis. Using logit (probability), the predicted probability of the 1-year mortality event was calculated.

For alternative analyses, decision trees including CART and CHAID modelling,38 Random forests (RFs)39 and extreme gradient boost (XGBoost)40 were also performed (Supplementary Method 1 in detail). For machine learning analysis, overall variable importance and Shapley additive explanations (SHAP) value were provided.41,42 To promote the interpretability of XGBoost, a SHAP plot was also used to show the complex relationship between independent and dependent variables.

Performance evaluation

To evaluate overall performance, the Brier score (value=1Nk=1n[predictedprobabilitykobservedeventk]2), which is a metric concept similar to the mean square error, was used.43 Moreover, Nagelkerke's R2 and the Hosmer-Lemeshow test were used as measures of the overall performance.44 A prediction density plot was used to visualise the discriminatory trends of each model. ROC curve analysis was used to evaluate the discrimination of the models and the c-index, which is equivalent to the AUC in the ROC curve analysis in the binary outcome, was a measure of the discriminatory ability. The calibration curve was also used to test the precision of the models, and the calibration slope and intercept were calculated using Hosmer-Lemeshow contingency tables. The sensitivity, specificity, positive predictive value and negative predictive value were used to evaluate the ‘clinical usefulness’ of models.

Internal and external validation

Both the internal and external validations of the models were performed. Internal validation for the reproducibility of prediction models for the originated dataset was performed through 10-fold cross-validation and bootstrap re-sampling.45,46 Based on the TRIPOD statement,28 the mean difference between the 200 bootstrapping re-samples is defined as the optimism. Estimating the optimism (or bootstrap)-corrected AUC and its confidence intervals (CIs) of the final NCM prediction model was conducted using the internal dataset. The dataset of neonatal cholestasis at two other large tertiary hospitals (Seoul National University Hospital and Korea University Hospital) in South Korea were used for external validation. Using the same modelling frames, performance on the external dataset was quantified using the Brier score, AUC and calibration. In addition, clinical usefulness was determined in the same manner.

Ethical approval

The study was approved by the Institutional Review Board of the Asan Medical Center (#2020-0202), with a waiver of requirement for informed consent due to the nature of this retrospective study. The study was performed in compliance with the Declaration of Helsinki, and other relevant regulations.

Role of funding source

The funder had no role in the study design, data collection, data analyses, interpretation, writing of the report, and the decision of paper submission.

Results

Characteristics of neonatal cholestasis

During the study period, 17,372 neonates aged less than 100 days underwent 128,822 serologic DB tests (Supplementary Figure 1). Among them, 4028 neonates had DB of at least ≥1-fold of the DB threshold of >1.0 mg/dL, defined as neonatal cholestasis in the NASPGHN/ESPGHN guideline,1 with a median of 1.3 mg/dL (IQR: 1.1–1.6 mg/dL). The mean age of these patients at the time of enrolment was 10.2 ± 19.7 days with a median gestational age of 35 weeks (IQR: 32–38 weeks) and median birth weight of 2.36 kg (IQR: 1.5–3.0 kg). There were 2322 (57.6%) male patients. A variety of combined medical conditions were noted in the classification of the ICD code and review of medical charts (2.6 ICD codes per patient). Among them, the primary diagnoses, directly linked to the development of neonatal cholestasis, are listed in Table 1. Birth issues with prematurity and injury (35.4%), congenital heart anomaly with other cardiac issues (18.6%), liver diseases (11.4%), gastrointestinal anomaly and related issues (9.2%), neonatal unconjugated jaundice with DB <20% of TB (7.4%), respiratory anomaly and other issues (5.5%) and genetic/chromosomal disorders (4.3%) were the most common primary diagnoses in the whole-case dataset. Among diseases of hepatic origin, BA and neonatal hepatitis were the most common. Neonatal unconjugated jaundice also had the transient condition of DB >1.0 mg/dL, which is a definition of the study. In addition, a variety of genetic disorders and other specific diagnoses combined with neonatal cholestasis, which were not highlighted in the literature,5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23 are listed in Supplementary Table 2.

Table 1.

Primary diagnosis of neonatal cholestasis.

Primary diagnosis for neonatal cholestasis Dataset
Complete-case n = 2661 (%) Whole-case n = 4028 (%)
Prematurity and birth-related issues 654 (24.6%) 1426 (35.4%)
Congenital heart anomaly and other problems 663 (24.9%) 751 (18.6%)
Liver origin diseases including biliary atresia and hepatitis 416 (15.6%) 461 (11.4%)
Gastrointestinal anomaly and other problems 345 (13%) 369 (9.2%)
Neonatal jaundice in healthy newborns 58 (2.2%) 299 (7.4%)
Respiratory anomaly and related issues 178 (6.7%) 222 (5.5%)
Genetic/chromosomal disorders 141 (5.3%) 174 (4.3%)
Renal anomalies and urinary tract issues 55 (2.1%) 81 (2.0%)
Malignancy and tumour 57 (2.1%) 59 (1.5%)
Brain anomalies and other neurologic issues 32 (1.2%) 48 (1.2%)
Infection 15 (0.6%) 43 (1.1%)
Haematologic disorders 16 (0.6%) 38 (0.9%)
Hypopituitarism/hypothyroidism 9 (0.3%) 27 (0.7%)
Others 22 (0.8%) 30 (0.7%)

Outcomes of neonatal cholestasis

Among the 4028 neonates with cholestasis, 425 (10.6%) patients died during the whole follow-up period and 398 (9.9%) patients died in the first year of their lives (Figure 1a). The median age at the time of death was 55 days (IQR: 23–128 days) with a sex ratio (male/female) of 1.3 (Figure 1b). Causes of 1-year mortality are listed in Supplementary Table 3; complex heart anomalies, prematurity-related condition, birth injury, foetal hydrops and congenital diaphragmatic hernia were the most common causes of mortality. Among survivors, 54 (1.3%) underwent liver transplantation, with BA being the most common indication (n = 46/54, 85%). Notably, transient neonatal cholestasis (or early resolved cholestasis), including idiopathic neonatal hepatitis, idiopathic neonatal cholestasis, and neonatal jaundice with DB >20% of TB, had no mortality-related issues. Among the neonates included in this study, 148 neonates with BA were initially diagnosed or referred to a tertiary hospital without Kasai operation. Among them, 145 (3.6%) patients had Kasai operation, while three patients with no Kasai operation went to liver transplantation later. In BA, one patient died due to sepsis caused by cholangitis and another due to heart failure caused by combined cardiac anomalies. In the literature search,5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23 18 reports (n = 2225) were included (Supplementary Table 4), and the death rate was 10.8% (n = 240), which was similar to that found in the current study, although etiologic proportions were different.

Figure 1.

Fig 1

Outcome of neonatal cholestasis. a. The loss of native liver occurred in 12% of patients. b. Time-lapse from birth to demise among 425 mortality cases.

Variable selection and plausible mechanism of the loss of missing variables

For comparison of simple laboratory indices between the 1-year mortality group and the survival group, the initial dataset at the time of enrolment and peak value dataset during the 8-week follow-up were selected for analysis (Supplementary Table 5). All the laboratory indices showed significant differences between survival and death groups in the univariate analysis. Among them, the peak value dataset had more variables of higher discriminatory ability in the ROC curve analysis. In addition, based on the fact that current liver-oriented prediction models, such as KCHC, PELD and LIU, use peak values of variables during a certain period in their prediction algorithms,25, 26, 27 we used the peak values of simple laboratory indices in the development of a new NCM prediction model. Notably, peak DB value was positively correlated to observed mortality rate (Supplementary Figure 2). In multicollinearity analysis, the Spearman correlation coefficient between AST and ALT was 0.9 (Supplementary Figure 3), and ALT was then removed from the LR analysis. The sample size of this study was appropriate to satisfy the minimum criteria of events per selected variables.47

As the peak variable dataset was selected, the missing data mechanism was also evaluated; a large proportion of missing values in PT INR and CRP were possibly MNAR (Supplementary Table 6). Among the 4028 patients, the proportion of missing PT INR values was 34% (n = 1368). Groups with missing PT INR had only 2.3% (n = 32) mortality compared to the 13.7% (n = 366) mortality of the group with PT INR test values. This absence of PT INR values was also associated with higher values of DB, AST, Cr, and CRP (Supplementary Figure 4), suggesting that PT INR may be a cliché of problematic MNAR impacting both independent and dependent variables. As there is currently no established strategy for problematic MNAR,35 we then performed separate analyses for both [1] complete-case analysis (n = 2661) without missing PT INR values as a main reference dataset and [2] whole-case analysis (n = 4028) with regression imputation as an alternative dataset (Supplementary Figure 5).

Interpretable prediction model development

Multivariate LR was conducted first for complete-case analysis using the forward method to predict 1-year mortality (Supplementary Table 7). The significance of the LR model was assessed by χ2=843.0 (p-value <0.001) and the Hosmer–Lemeshow test revealed insufficient goodness-of-fit of the LR model (p-value <0.05) with a low Nagelkerke's R2 of 0.54. The logit (p) and its predicted probability were calculated using the following equation:

Logit(p)=0.016×lowPLT(103/μL)+0.095×peakPTINR+0.000065×peakAST(IU/L)0.748×lowALB(g/dL)+0.051×peakDB(mg/dL)+0.38×peakCr(mg/dL)+0.027×peakCRP(mg/dL)0.382
Probability(%)=11+eLogit(p)×100

Alternative analysis using machine learning

The odds ratios in this LR analysis cannot reflect the relative importance of continuous variables in the LR model because the scale of each variable is different. Therefore, we performed decision tree analyses for more interpretable models. CART and CHAID were used to visualise classification algorithms to distinguish the death group from the survival group (Figure 2a and Supplementary Figure 6). In both CART and CHAID analyses, initial segregations consisted of subgroups by different values of PLT. These findings suggested that the lowest values of PLT in this dataset are hierarchically higher in the prediction of mortality than variables directly related to hepatic conditions. Based on the fact that multiple illnesses such as sepsis, birth asphyxia, prematurity, liver disease, respiratory distress, and disseminated intravascular coagulation can be combined with thrombocytopenia in neonates,48 the severity of thrombocytopenia was probably reflected by the degree of systemic illness, which may affect survival in patients with neonatal cholestasis. Next, segregations were guided by thresholds of PT INR and AST in the CART model (Figure 2a) and by thresholds of PT INR, AST, ALB and DB in the CHAID model (Supplementary Figure 6). PT INR, AST, ALB and DB are, in general, indicators of liver pathologies.25, 26, 27

Figure 2.

Fig 2

Exploratory analyses on complete-case and whole-case development dataset. a. Example of decision trees to classify 1-year mortality case in whole-case dataset without imputation: CART analysis. b. SHAP plot for XGBoost model; complete-case analysis vs. whole-case analysis with regression imputation. c. Odds ratio plot in LR model. Asterisks indicate no statistical significance in LR.

We also applied flexible machine learning (RF and XGBoost) to explore whether other data patterns between predictors and outcomes were hidden. Complete-case and whole-case datasets were split into the train set (70%) and the test set (30%) in RF and XGBoost (Supplementary Figure 7). Through grid search, hyperparameters were optimised (Supplementary Table 8); otherwise, default settings were applied to both models. The result of variable importance in RF and XGBoost had similar findings of LR and decision tree models (Supplementary Figure 8), with PLT, PT INR, Cr, AST and DB being the most important in RF's complete-case and whole-case analyses. In XGBoost, variable importance, presented as the F score, showed a similar hierarchical order of variables to that shown in RF. In a SHAP summary plot, as a patient's platelet count decreases (approaches blue), the SHAP value impact on 1-year mortality increases (Figure 2b), and as patients’ PT INR, CRP, AST and DB increase, the SHAP value impacts also increase. Overall, machine learning modelling agreed with the findings of LR (Figure 2c).

Performance evaluation

Three measures were used to evaluate the apparent performance of the NCM model and other alternative machine learnings; these were the Brier score, discrimination by the AUC value, and calibration. Table 2 suggests that the overall judgement of the performance of LR-based models is acceptable compared to that of other models (Figure 3a and Supplementary Figs. 9 to 13). The AUCs of the LR-based models were 0.916 for the complete-case analysis and 0.937 for the whole-case analysis with regression imputations. The calibration slopes were 1.04 for complete-case analysis and 1.05 for whole-case analysis. The discriminatory abilities were high in RF and XGBoost for the complete-case and whole-case analyses. Furthermore, the differences between complete-cases and whole-case analyses were minimal within analyses. Overall, LR-based NCM prediction model from complete-case dataset showed acceptable performance in predicting the 1-year mortality of neonatal cholestasis.

Table 2.

Performance and clinical usefulness of neonatal cholestasis-related mortality (NCM) prediction model. The NCM model was derived from complete-case dataset by logistic regression analysis.

Prediction model Dataset Performance
Clinical usefulness
Brier score AUC Calibration slope Threshold Sensitivity Specificity PPV NPV
Logistic regression
Complete-case 0.072 0.916 1.04 16.1% 86.9% 84.6% 47.3% 97.6%
Whole-case 0.052 0.937 1.05 13.7% 86.9% 88.4% 45.1% 98.4%
Alternative analyses
CART* Complete-case 0.071 0.89 1.01 9% 87.7% 78.6% 39.5% 98.6%
Whole-case* 0.052 0.89 1.02 9% 82.4% 87.8% 42.6% 97.8%
CHAID* Complete-case 0.071 0.916 1.06 14% 87.4% 81% 42.4% 97.6%
Whole-case* 0.052 0.936 1.08 7% 94.2% 77.8% 31.7% 99.2%
Random forest⁎⁎ Complete-case 0.067 0.921 1.03 18.9% 84% 87.8% 50.9% 97.3%
Whole-case 0.046 0.954 1.09 14.5% 89% 90.3% 49.8% 98.6%
XGBoost⁎⁎ Complete-case 0.074 0.919 0.82 5.6% 84.9% 85.6% 47.4% 97.4%
Whole-case 0.049 0.951 0.93 11% 86.6% 90.9% 51% 98.4%

Missing values were not imputed

⁎⁎

Values of performances were derived from the test dataset (See Supplementary Figure 7).

AUC; area under the curve, CART; classification and regression tree, CHAID; chi-square automatic interaction detection, ML; machine learning PPV; positive predictive value, NPV; negative predictive value, XGBoost; extreme gradient boost.

Figure 3.

Fig 3

Performance evaluation of LR-based NCM prediction model across development (complete-case) and external validation (complete-case) samples.

Subgroup analyses and comparison of other paediatric models

We performed subgroup analysis according to the major aetiologies, such as [1] prematurity with very low birth weight [2], complex heart anomalies, and [3] gastrointestinal anomalies with related issues. The discriminatory abilities of the NCM model were acceptable with AUCs ranging from 0.89 to 0.93 (Supplementary Figure 14), suggesting that the apparent discriminatory performance of the NCM model were applicable to the subgroups.

In addition, we compared our NCM model with existing scoring models (KCHC and PELD) that have been widely used for acute liver failure and chronic liver disease in the older infant and childhood age groups. The AUCs of the NCM model and PELD score (0.863–0.937) were much higher than those of the KCHC (0.607–0.606) in both complete-case and whole-case analyses (Supplementary Figure 15). Overall, the NCM model showed a significantly higher discriminatory performance compared to the PELD and KCHC models (Supplementary Table 9).

Using another criterion of DB ≥2.0 mg/dL in the Childhood Liver Disease Research Network,5 1608 (40%) patients were identified with a mortality rate of 7.4%. Adding another criterion of cholestasis duration of >2 weeks, persistent cholestasis with DB ≥2.0 mg/dL was observed in 937 (23%) patients with 19.1% mortality. Using the NCM model, we performed subgroup analyses (Supplementary Figure 16). In the performance evaluation of DB ≥2 mg/dL and DB ≥2 mg/dL and duration >2 weeks groups, the AUCs were 0.90 and 0.89, respectively. Other LR analyses using the dataset of the two subgroups were also performed, and all risk variables remained significant, with the exception of albumin (data not shown), indicating that the variables used in the original NCM model and reconstructed models were valid in the subgroups with higher intensity and longer duration of the phenotype. Bootstrap analysis of these LR showed the same variable patterns (data not shown).

Internal and external validations

To validate the NCM model internally in our dataset, the cross-validation and bootstrap re-sampling method were used to estimate the optimism and its 95% CIs. In the 10-fold cross-validation, the average AUCs of complete-case and whole-case analyses were 0.913 (95% CI, 0.898–0.927) and 0.936 (95% CI, 0.924–0.948), respectively (Supplementary Table 10). In bootstrap re-sampling, the stability of AUCs was shown on 150 times re-sampling, and the average values of optimism after 500 times re-sampling were 0.003 and 0.0023 for complete-case and whole-case analyses, respectively. Therefore, the optimism-adjusted AUCs were 0.912 (95% CI: 0.897–0.928) and 0.935 (95% CI: 0.922–0.947) for complete-case and whole-case analyses, respectively. Moreover, optimism-adjusted calibration slopes were 1.03 (95% CI, 0.92–1.14) and 1.04 (95% CI, 0.99–1.07), respectively (Supplementary Figure 17). A criterion of internal validation includes obtaining a calibration slope ≥0.9 and optimism of AUC ≤0.02.49 As a result, no adjustments to the regression coefficients in the NCM prediction model were made.

As external validation is essential in assessing the generalizability of a new model to a plausibly related population,50 we collected the retrospective dataset (n = 2541 with 174 [6.8%] deaths) of neonatal cholestasis with the same inclusion criteria from two of the largest tertiary hospitals in Korea. Due to insufficient manual reviews of medical charts of external samples, spatial transportability was evaluated in the external validation. Among external samples, the complete-case dataset was available in 36.2% (n = 920 with 109 [11.8%] deaths) of all subjects. The NCM prediction model showed acceptable discriminatory performance across the external validation samples (AUC=0.903 and Brier score=0.072) (Figure 3b). Moreover, the level of agreement between the observed and predicted 1-year mortality risk across the external samples was acceptable (calibration slope=0.923, intercept=0.006) compared to the apparent performance. Finally, systemic inspection on the NCM model was performed in accordance with the TRIPOD and CHARM checklists.

Discussion

The present study had two major findings: aetiologic characteristics of neonatal cholestasis in a tertiary hospital and development of a novel neonatal cholestasis-related prediction model for 1-year mortality. First, this is the first report of thousands of subjects with neonatal cholestasis that has never been published in a single centre that describes a variety of aetiologies and their outcomes in detail. A large proportion (54%) of birth issues and heart issues reflected the clinical setting of a tertiary hospital. The variety of liver diseases and genetic disorders also reflected that of a liver transplantation centre, where the largest paediatric transplantation cohort was built in Korea.51 The proportion (3.6%) of BA was small compared to those reported in other studies (Supplementary Table 4). The authors believe that it is because large number of subjects (n = 2420) with subtle neonatal cholestasis (1 mg/dL <DB <2.0 mg/dL during the 8-week follow-up) enrolled based on the NASPGHN/ESPGHN guideline's cut-off value of DB >1.0 mg/dL.1 The cut-off DB value of >1 mg/dL used by Harpavat et al. to screen biliary atresia showed an excellent diagnostic accuracy in their well-established population-based study.52 In fact, if we adopted another criterion of DB ≥2.0 mg/dL in the Childhood Liver Disease Research Network,5 1608 (40%) patients could be enrolled from this dataset. Regardless of the screening value of DB, independent risk factors remained similar in this dataset, and peak DB was positively correlated to the observed mortality rate (Supplementary Figure 2). With regard to mortality in neonatal cholestasis, the literature search showed an overall mortality of 10.8% in these groups (Supplementary Table 4), which was interestingly similar to the value (10.6%) we found in our cohort. However, the interpretation of this coincident finding requires caution because of the different proportions of underlying aetiologies and study inclusions among studies.

Second, a novel prediction model for 1-year mortality named NCM was developed and validated following the recommended steps of the TRIPOD statement.28 This is also the first model to predict the outcome of neonatal cholestasis using large internal and external datasets. Liver-oriented prediction models, such as PELD and KCHC, were also suitable to an extent for further analysis in this cohort (Supplementary Figure 15), with AUCs of 0.678 and 0.841, respectively, as reported in other studies.53 This may indicate that the dataset of our cohort may be appropriate for the analysis of cholestasis. However, the comparison may not be meaningful because the purposes of models, decision rules and datasets of subjects are different.25, 26, 27 As TRIPOD statement recommends a nomogram for rapid estimation of risk, a nomogram for 1-year overall mortality probability was made based on our NCM prediction analyses (Supplementary Figure 18).

The performance of the NCM prediction model was acceptable in terms of Brier scoring, discrimination and calibration. However, the value of Nagelkerke's R2 of 0.5 in this LR model indicates that the model explains 50% of data patterns in the LR-based analysis. This low R2 value may be appropriate because prognostic models generally have R2 values of 0.2–0.3(32) because substantial future uncertainty within 1 year remains at the individual level in the reality of neonatal cholestasis. Notably, the machine learnings RF and XGBoost, known to be efficient classifiers,54 agreed with the findings of the LR-based NCM prediction in our study. Especially, the Shapley plot of XGBoost properly explained how the changes and density of variables affect the mortality outcome in this study.

There is limited knowledge about risk factors for mortality in neonatal cholestasis. To the best of our knowledge, no literature identified risk factors for mortality of neonatal cholestasis, except for a recent study by Santos Silva et al. (Supplementary Table 4). This study identified some predictors of diagnosis and prognosis, which helped build a diagnostic decision algorithm.23 TB, DB and gamma-glutamyl transferase were related to the poor outcomes in a group diagnosed as having neonatal cholestasis. The findings of Silva's report support the concept of our study using simple laboratory indices in predicting mortality. In the NCM model, the clinical meaning of each predictor is difficult to understand in a clinical perspective, despite rigorous analytic exploration in this study. We speculate that the whole samples have various heterogeneous aetiologies for the explanation. Each patient's condition towards mortality cannot be simply explained. Moreover, the endpoint of the outcome is a future event, death within 1 year. Based on our analyses, lower PLT and ALB levels and higher PT INR, AST, DB, TB, CRP and Cr levels were related to higher mortality risk. Several severe conditions may be shown in combinations of predictors; for example, [low PLT + high PT INR + high CRP + high Cr] in severe septic shock, [high PT INR + high AST + high DB + high TB] in acute liver failure, [high PT INR + high DB] in prematurity with parenteral nutrition-associated liver disease, or [high AST + high DB + high Cr] in ischaemic liver disease from heart failure and congenital heart anomalies. A prospective study in a controlled sample setting is needed to fully understand the specific role of each predictor on future death.

Despite the less interpretable role of predictors, the clinical utility of the NCM prediction model is clear. By providing the probability of mortality, the NCM prediction model can help clinicians be aware of high-risk patients. We believe that patients with a high mortality probability should prioritise the therapeutic approach. The prediction model in medical practice generates decision rules, such as decision support on test ordering, surgical decision-making, using treatment threshold (starting, delaying, and intensifying), cost-effectiveness of treatment, and simple prediction of future events (attention).32,55 The decision rule of the NCM prediction model is just a simple prediction for now. However, this NCM prediction model is just about to be listed on the evidence-level I of the prediction model (Supplementary Table 11).56 More clinical utility can be evaluated in another cohort.

There are several limitations in this study. First, the study design is retrospective, and a biased cohort is possible. The large sample size in our study cannot equate to the novelty of the research. Conventionally, requirements to derive reliable prediction mode are [1] at least ‘100 events’ and [2] at least 10 events per variable, and preferably 20 in the case of an event rate <20%. The sample size of the complete-case dataset (n = 2661) in this study barely meets the preferable sample size requirement. The retrospective study design limits well-controlled data collection. However, the inclusion criteria are almost similar to a well-designed prospective study for diagnostic screening of BA among neonatal cholestasis.52 Despite routine diagnostic and genetic screening, alpha-1-antitrypsin deficiency was absent in our dataset considering the conventional findings.56 The prevalence rates of some specific diseases may differ among East Asian subjects.21 Moreover, the evaluation for aetiologies was based on the availability of diagnostic modalities in our centre. Second, the principal outcome, which is the 1-year mortality as a hard endpoint, was partially interrupted by various interventions. For example, patients with BA have a Kasai operation at the time of diagnosis and liver transplantation at approximately 1 year of age if the Kasai operation fails.51 In this context, the nomogram from the NCM prediction model may not be suitable for BA. Surgical techniques and medical care for severe complex anomalies and prematurity have also improved over time. As expected, like a retrospective study, the outcome assessment was not done by the prior protocol, and the study period of outcome assessment was arbitrary. Third, the NCM prediction model provides only the predicted probabilities of mortality risk and the authors believe that the prediction model is inherently away from providing certainty on the individual outcome of the future reality.

Nevertheless, the LR-based NCM prediction model showed internal and external validity, suggesting a possibility of generalisation in a population of children with neonatal cholestasis. Furthermore, this novel NCM model fulfilled all checklists of the TRIPOD and CHARM checklists. Thus, this model strongly suggests the necessity of a prospective study to develop more powerful prediction models. To overcome these limitations, the NCM prediction model needs evidence-level II, assessing the predictive ability when tested prospectively in one setting.55 In this light, the prospective cohort of the Childhood Liver Disease Research Network5 is vital and needs to expand globally.

In conclusion, the present study provided detailed aetiologic characteristics of the largest paediatric cohort of neonatal cholestasis in a tertiary hospital. In addition, an NCM prediction model for 1-year mortality, which is the first in neonatal cholestasis, showed appropriate performance.

Contributors

S.H.O., J.M.N., and I.K. conceived and designed the study. S.H.O., H.J.C., I.K., and H.J.L. verified and analysed the data and wrote the paper. H.J.O., M.K.A., W.I.B., Y.E.K., B.S.L., D.Y.K., E.J.L. and K.M.K. reviewed the manuscript. J.O.S. and J.S.K. provided the external validation samples. B.S.L., D.Y.K., E.J.L. and K.M.K. participated in the management of the patients and analysed the data. All authors reviewed and approved the manuscript.

Declaration of interests

The authors have no conflicts of interest to declare.

Acknowledgments

Data sharing

The main data underlying the findings of the current study are provided within the article and its Supplementary Information File. Extra data are available from the corresponding author upon reasonable request after receiving clearance from the institutional review board of our center.

Acknowledgments

This research was supported by a grant of the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health & Welfare, Republic of Korea (grant number: HR21C0198).

Footnotes

Supplementary material associated with this article can be found in the online version at doi:10.1016/j.ebiom.2022.103890.

Appendix. Supplementary materials

mmc1.docx (3.1MB, docx)
mmc2.docx (481.2KB, docx)

References

  • 1.Fawaz R., Baumann U., Ekong U., et al. Guideline for the evaluation of cholestatic jaundice in infants: joint recommendations of the North American Society for pediatric gastroenterology, hepatology, and nutrition and the European Society for pediatric gastroenterology, hepatology, and nutrition. J Pediatr Gastroenterol Nutr. 2017;64(1):154–168. doi: 10.1097/MPG.0000000000001334. [DOI] [PubMed] [Google Scholar]
  • 2.Dick M.C., Mowat A.P. Hepatitis syndrome in infancy–an epidemiological survey with 10 year follow up. Arch Dis Child. 1985;60(6):512–516. doi: 10.1136/adc.60.6.512. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Hartley J.L., Davenport M., Kelly D.A. Biliary atresia. Lancet. 2009;374(9702):1704–1713. doi: 10.1016/S0140-6736(09)60946-6. (London, England) [DOI] [PubMed] [Google Scholar]
  • 4.Serinet M.O., Wildhaber B.E., Broué P., et al. Impact of age at Kasai operation on its results in late childhood and adolescence: a rational basis for biliary atresia screening. Pediatrics. 2009;123(5):1280–1286. doi: 10.1542/peds.2008-1949. [DOI] [PubMed] [Google Scholar]
  • 5.Hertel P.M., Hawthorne K., Kim S., et al. Presentation and outcomes of infants with idiopathic cholestasis: a multicenter prospective study. J Pediatr Gastroenterol Nutr. 2021;73(4):478–484. doi: 10.1097/MPG.0000000000003248. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Gottesman L.E., Del Vecchio M.T., Aronoff S.C. Etiologies of conjugated hyperbilirubinemia in infancy: a systematic review of 1692 subjects. BMC Pediatr. 2015;15:192. doi: 10.1186/s12887-015-0506-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Mowat A.P., Psacharopoulos H.T., Williams R. Extrahepatic biliary atresia versus neonatal hepatitis. Review of 137 prospectively investigated infants. Arch Dis Child. 1976;51(10):763–770. doi: 10.1136/adc.51.10.763. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Danks D.M., Campbell P.E., Jack I., Rogers J., Smith A.L. Studies of the aetiology of neonatal hepatitis and biliary atresia. Arch Dis Child. 1977;52(5):360–367. doi: 10.1136/adc.52.5.360. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Hitch D.C., Leonard J.C., Pysher T.J., Manion C.V., Smith E.I. Differentiation of cholestatic jaundice in infants. Utility of diethyl-IDA. Am J Surg. 1981;142(6):671–677. doi: 10.1016/0002-9610(81)90309-3. [DOI] [PubMed] [Google Scholar]
  • 10.Spivak W., Sarkar S., Winter D., Glassman M., Donlon E., Tucker K.J. Diagnostic utility of hepatobiliary scintigraphy with 99mTc-DISIDA in neonatal cholestasis. J Pediatr. 1987;110(6):855–861. doi: 10.1016/s0022-3476(87)80396-7. [DOI] [PubMed] [Google Scholar]
  • 11.Motala C., Ireland J.D., Hill I.D., Bowie M.D. Cholestatic disorders of infancy–aetiology and outcome. J Trop Pediatr. 1990;36(5):218–222. doi: 10.1093/tropej/36.5.218. [DOI] [PubMed] [Google Scholar]
  • 12.Yachha S.K., Khanduri A., Kumar M., et al. Neonatal cholestasis syndrome: an appraisal at a tertiary center. Indian Pediatr. 1996;33(9):729–734. [PubMed] [Google Scholar]
  • 13.Fischler B., Papadogiannakis N., Nemeth A. Aetiological factors in neonatal cholestasis. Acta Paediatr. 2001;90(1):88–92. doi: 10.1080/080352501750064932. [DOI] [PubMed] [Google Scholar]
  • 14.Stormon M.O., Dorney S.F., Kamath K.R., O'Loughlin E.V., Gaskin K.J. The changing pattern of diagnosis of infantile cholestasis. J Paediatr Child Health. 2001;37(1):47–50. doi: 10.1046/j.1440-1754.2001.00613.x. [DOI] [PubMed] [Google Scholar]
  • 15.Aanpreung P., Laohapansang M., Ruangtrakool R., Kimhan J. Neonatal cholestasis in Thai infants. J Med Assoc Thai. 2005;88(Suppl 8):S9–15. [PubMed] [Google Scholar]
  • 16.Tiker F., Tarcan A., Kilicdag H., Gurakan B. Early onset conjugated hyperbilirubinemia in newborn infants. Indian J Pediatr. 2006;73(5):409–412. doi: 10.1007/BF02758562. [DOI] [PubMed] [Google Scholar]
  • 17.Humphrey T.M., Stringer M.D. Biliary atresia: US diagnosis. Radiology. 2007;244(3):845–851. doi: 10.1148/radiol.2443061051. [DOI] [PubMed] [Google Scholar]
  • 18.Rafeey M., Golzar A., Javadzadeh A. Cholestatic syndromes of infancy. Pak J Biol Sci. 2008;11(13):1764–1767. doi: 10.3923/pjbs.2008.1764.1767. [DOI] [PubMed] [Google Scholar]
  • 19.Ipek M.S., Aydin M., Zenciroglu A., Gokce S., Okumus N., Gulaldi N.C. Conjugated hyperbilirubinemia in the neonatal intensive care unit. Turk J Gastroenterol. 2013;24(5):406–414. doi: 10.4318/tjg.2013.0553. [DOI] [PubMed] [Google Scholar]
  • 20.Lu F.T., Wu J.F., Hsu H.Y., et al. γ-Glutamyl transpeptidase level as a screening marker among diverse etiologies of infantile intrahepatic cholestasis. J Pediatr Gastroenterol Nutr. 2014;59(6):695–701. doi: 10.1097/MPG.0000000000000538. [DOI] [PubMed] [Google Scholar]
  • 21.Liu P., Guo L., Huang L., et al. Analysis of factors affecting the prognosis of neonatal cholestasis. Int J Clin Exp Med. 2015;8(5):8005–8009. [PMC free article] [PubMed] [Google Scholar]
  • 22.Abuduxikuer K., Chen R., Wang Z.L., Wang J.S. Risk factors associated with mortality in neonatal intrahepatic cholestasis caused by citrin deficiency (NICCD) and clinical implications. BMC Pediatr. 2019;19(1):18. doi: 10.1186/s12887-018-1383-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Santos Silva E., Moreira Silva H., Catarino C., Dias C.C., Santos-Silva A., Lopes A.I. Neonatal cholestasis: development of a diagnostic decision algorithm from multivariate predictive models. Eur J Pediatr. 2021;180(5):1477–1486. doi: 10.1007/s00431-020-03886-z. [DOI] [PubMed] [Google Scholar]
  • 24.Lauriti G., Zani A., Aufieri R., et al. Incidence, prevention, and treatment of parenteral nutrition-associated cholestasis and intestinal failure-associated liver disease in infants and children: a systematic review. JPEN J Parenter Enteral Nutr. 2014;38(1):70–85. doi: 10.1177/0148607113496280. [DOI] [PubMed] [Google Scholar]
  • 25.Freeman R.B., Wiesner R.H., Harper A., et al. The new liver allocation system: moving toward evidence-based transplantation policy. Liver Transpl. 2002;8(9):851–858. doi: 10.1053/jlts.2002.35927. [DOI] [PubMed] [Google Scholar]
  • 26.Jain V., Dhawan A. Prognostic modeling in pediatric acute liver failure. Liver Transpl. 2016;22(10):1418–1430. doi: 10.1002/lt.24501. [DOI] [PubMed] [Google Scholar]
  • 27.Liu E., MacKenzie T., Dobyns E.L., et al. Characterization of acute liver failure and development of a continuous risk of death staging system in children. J Hepatol. 2006;44(1):134–141. doi: 10.1016/j.jhep.2005.06.021. [DOI] [PubMed] [Google Scholar]
  • 28.Moons K.G., Altman D.G., Reitsma J.B., et al. Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): explanation and elaboration. Ann Intern Med. 2015;162(1):W1–73. doi: 10.7326/M14-0698. [DOI] [PubMed] [Google Scholar]
  • 29.Damen J.A., Hooft L., Schuit E., et al. Prediction models for cardiovascular disease risk in the general population: systematic review. BMJ. 2016;353:i2416. doi: 10.1136/bmj.i2416. (Clinical research ed) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Diamond G.A. What price perfection? Calibration and discrimination of clinical prediction models. J Clin Epidemiol. 1992;45(1):85–89. doi: 10.1016/0895-4356(92)90192-p. [DOI] [PubMed] [Google Scholar]
  • 31.Wang F., Kaushal R., Khullar D. Should health care demand interpretable artificial intelligence or accept “black box” medicine? Ann Intern Med. 2020;172(1):59–60. doi: 10.7326/M19-2548. [DOI] [PubMed] [Google Scholar]
  • 32.Steyerberg EW. Clinical Prediction Models (Statistics for Biology and Health). Springer International Publishing.
  • 33.Sterne J.A., White I.R., Carlin J.B., et al. Multiple imputation for missing data in epidemiological and clinical research: potential and pitfalls. BMJ. 2009;338:b2393. doi: 10.1136/bmj.b2393. (Clinical research ed) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Enders C.K. A primer on the use of modern missing-data methods in psychosomatic medicine research. Psychosom Med. 2006;68(3):427–436. doi: 10.1097/01.psy.0000221275.75056.d8. [DOI] [PubMed] [Google Scholar]
  • 35.Pedersen A.B., Mikkelsen E.M., Cronin-Fenton D., et al. Missing data and multiple imputation in clinical epidemiological research. Clin Epidemiol. 2017;9:157–166. doi: 10.2147/CLEP.S129785. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Moons K.G.M., de Groot J.A.H., Bouwmeester W., et al. Critical appraisal and data extraction for systematic reviews of prediction modelling studies: the CHARMS checklist. PLoS Med. 2014;11(10) doi: 10.1371/journal.pmed.1001744. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.In Lee K., Koval J.J. Determination of the best significance level in forward stepwise logistic regression. Commun Stat Simul Comput. 1997;26(2):559–575. [Google Scholar]
  • 38.Breiman L.F.J., Olshen R.A., Stone C.I. Wadsworth; Belmont, CA: 1984. Classification and Regression Trees. [Google Scholar]
  • 39.Breiman L. Random forests. Mach Learn. 2001;45(1):5–32. [Google Scholar]
  • 40.Chen T., Guestrin C. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2016. XGBoost: a scalable tree boosting system; pp. 785–794. [Online]. Available: https://arxiv.org/abs/1603.02754. [Google Scholar]
  • 41.Moore K.W., de Waal Malefyt R., Coffman R.L., O'Garra A. Interleukin-10 and the interleukin-10 receptor. Annu Rev Immunol. 2001;19:683–765. doi: 10.1146/annurev.immunol.19.1.683. [DOI] [PubMed] [Google Scholar]
  • 42.Robinson J.T., Thorvaldsdottir H., Winckler W., et al. Integrative genomics viewer. Nat Biotechnol. 2011;29(1):24–26. doi: 10.1038/nbt.1754. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Hilden J., Habbema J.D., Bjerregaard B. The measurement of performance in probabilistic diagnosis. III. Methods based on continuous functions of the diagnostic probabilities. Methods Inf Med. 1978;17(4):238–246. [PubMed] [Google Scholar]
  • 44.Nagelkerke N.J. A note on a general definition of the coefficient of determination. Biometrika. 1984;78:691–692. [Google Scholar]
  • 45.Efron B. Estimating the error rate of a prediction rule: improvement on cross-validation. J Am Stat Assoc. 1983;78(382):316–331. [Google Scholar]
  • 46.Harrell F.E., Lee K.L., Mark D.B. Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Stat Med. 1996;15(4):361–387. doi: 10.1002/(SICI)1097-0258(19960229)15:4<361::AID-SIM168>3.0.CO;2-4. [DOI] [PubMed] [Google Scholar]
  • 47.Peduzzi P., Concato J., Kemper E., Holford T.R., Feinstein A.R. A simulation study of the number of events per variable in logistic regression analysis. J Clin Epidemiol. 1996;49(12):1373–1379. doi: 10.1016/s0895-4356(96)00236-3. [DOI] [PubMed] [Google Scholar]
  • 48.Roberts I., Murray N.A. Neonatal thrombocytopenia: causes and management. Arch Dis Child Fetal Neonatal Ed. 2003;88(5):F359–FF64. doi: 10.1136/fn.88.5.F359. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Christodoulou E., van Smeden M., Edlinger M., et al. Adaptive sample size determination for the development of clinical prediction models. Diagn Progn Res. 2021;5(1):6. doi: 10.1186/s41512-021-00096-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Steyerberg E.W., Vergouwe Y. Towards better clinical prediction models: seven steps for development and an ABCD for validation. Eur Heart J. 2014;35(29):1925–1931. doi: 10.1093/eurheartj/ehu207. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Oh SH, Jeong IS, Kim DY, et al. Recent improvement in survival outcomes and reappraisal of prognostic factors in pediatric living donor liver transplantation.n/a(n/a) 2022. [DOI] [PubMed]
  • 52.Harpavat S., Garcia-Prats J.A., et al. Diagnostic yield of newborn screening for biliary atresia using direct or conjugated bilirubin measurements. JAMA. 2020;323(12):1141–1150. doi: 10.1001/jama.2020.0837. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Lee EJ, Kim JW, Moon JS, et al. Development of a prognostic score to predict mortality in patients with pediatric acute liver failure. 2020;70(6):777–82. [DOI] [PubMed]
  • 54.Taylor R.A., Pare J.R., Venkatesh A.K., et al. Prediction of in-hospital mortality in emergency department patients with sepsis: a local big data-driven, machine learning approach. Acad Emerg Med. 2016;23(3):269–278. doi: 10.1111/acem.12876. official journal of the Society for Academic Emergency Medicine. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Reilly B.M., Evans A.T. Translating clinical research into clinical practice: impact of using prediction rules to make decisions. Ann Intern Med. 2006;144(3):201–209. doi: 10.7326/0003-4819-144-3-200602070-00009. [DOI] [PubMed] [Google Scholar]
  • 56.Balistreri W.F., Bezerra J.A. Whatever happened to "neonatal hepatitis"? Clin Liver Dis. 2006;10(1):27–53. doi: 10.1016/j.cld.2005.10.008. v. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

mmc1.docx (3.1MB, docx)
mmc2.docx (481.2KB, docx)

Articles from EBioMedicine are provided here courtesy of Elsevier

RESOURCES