Abstract
Background/Aims
The BARD score is a model to detect advanced liver fibrosis in nonalcoholic fatty liver disease (NAFLD) patients. The aims of this study were to identify additional factors and then to build an enhanced version of the BARD score.
Methods
One hundred seven patients with biopsy-proven NAFLD were enrolled retrospectively. Logistic regressions were performed to identify independent risk factors for advanced liver fibrosis (stage 3 or 4). An enhanced model of the BARD score (BARDI score) was built and evaluated with a receiver operating characteristic (ROC) curve.
Results
In multivariate analysis, age (odds ratio [OR], 0.89; p=0.04), aspartate aminotransferase/alanine aminotransferase ratio (OR, 1.73; p<0.01), and international normalized ratio (INR) (OR, 8.85; p<0.01) were independently significant factors. The BARDI score was created by adding the INR to the BARD. The area under the ROC curve of the BARDI score was significantly larger than that of the BARD score (0.881 vs 0.808, p<0.01). A BARDI score of 3 or more showed a positive predictive value (PPV) of 51.0% and a negative predictive value (NPV) of 96.0%.
Conclusions
The BARDI score had an improved PPV over the BARD score and maintained an excellent NPV. Further study is warranted for its external validation and comparison with other models.
Keywords: Non-alcoholic fatty liver disease, Liver cirrhosis, International normalized ratio
INTRODUCTION
Nonalcoholic fatty liver disease (NAFLD) is the most common type of liver disease in the developed countries. About 30% of adults have this disease.1 The spectrum of NAFLD is diverse, ranging from simple steatosis to nonalcoholic steatohepatitis (NASH), which can lead to cirrhosis. The mainstream management strategy for NAFLD is regular follow-up with risk factor modification and early detection of liver fibrosis.2 Therefore, the early detection of liver fibrosis is very important.
The gold standard for detecting liver fibrosis is liver biopsy. However this procedure is both risky and expensive, making it difficult for patients to undergo repeated procedures.3,4 Further, the NAFLD patients stay asymptomatic usually until its final stages, making it difficult for the clinician to decide when to recommend biopsy, which can result in delayed diagnosis and management of liver cirrhosis. Several imaging methods which use ultrasound, computed tomography or magnetic resonance imaging (MRI) have been introduced to detect liver fibrosis early,5-10 but these methods are still expensive and available only in limited places.
Several models using demographic and clinical variables have been developed that can be used to predict liver fibrosis in NAFLD patients. However, some of these were not made specifically for NAFLD patients,11-13 and some models require a liver fibrosis panel which is not broadly used.12,14 The BARD score15 and the NAFLD fibrosis score16 are the models which were initially made specifically for NAFLD and do not require a special test. The main strength of the BARD score over the NAFLD fibrosis score is its simplicity. The components of the BARD score: body mass index (BMI), aspartate aminotransferase (AST)/alanine aminotransferase (ALT) ratio and diabetes, are all widely available in clinical practices. The score can be added up simply, without the use of a calculator. Considering the high prevalence of NAFLD, it is very important that general physicians have a simple method to routinely screen patients for advanced liver fibrosis. The limitation of the BARD score is its high false positivity. If a patient, for example, has diabetes and a BMI of 28 kg/m2 (BARD score, 2), this patient will be classified as having a high risk of advanced liver fibrosis even though the patient has an AST/ALT ratio of less than 0.8 and no liver fibrosis. The positive predictive value (PPV) of a BARD score of 2 or more was 43% for advanced liver fibrosis in its initial development.15 A prospective study has shown the PPV of a BARD score of 2 or more to be only 36.1% for advanced liver fibrosis, which is lower than PPVs of other clinical models.5 This low PPV of BARD score means that a significant number of patients will have false positive results and may end up having unnecessary imaging tests and risky liver biopsies.
In this study, we evaluated the performance of the BARD score in our NAFLD patients and built an enhanced model of the BARD score by finding additional variables in NAFLD patients.
MATERIALS AND METHODS
1. Data sources and elements
All patients with NASH, whose diagnoses were confirmed by liver biopsy between 2002 and 2006 at Cabell Huntington Hospital/Marshall University Medical Center, were enrolled retrospectively. The diagnostic criteria for NASH were used as described in literature previously.17 Any patients who had a history of alcohol abuse in their medical records, serologic evidence of hepatitis virus infection, or history of other liver disease, such as hemochromatosis, alpha-1 antitrypsin deficiency, Wilsons' disease, autoimmune hepatitis, primary biliary cirrhosis, or primary sclerosing cholangitis, were excluded. Also the patients who had an age of less than 18 years old at the time of biopsy were excluded.
Demographics (age, sex, weight, and height), comorbidities (hypertension, diabetes, and hyperlipidemia), and laboratory information (liver profile, lipid profile, serum creatinine, and international normalized ratio [INR]) tested within a month before the biopsy were obtained from the medical records. Hypertension, diabetes, and hyperlipidemia patients were defined as those who were previously diagnosed with these diseases by physicians, or those who took antihypertensive medications, hypoglycemic agents/insulin, or lipid-lowering agents at the time of biopsy. The liver profile, lipid profile and serum creatinine level were measured using ADVIA® 1650 Chemistry System (Siemens Healthcare Diagnostics, Deerfield, IL, USA). Protamine was measured using BCS® System (Siemens Healthcare Diagnostics). The INR was calculated from the protamine level by SoftLab® (SCC Soft Computer, Clearwater, FL, USA). Pathological reports made by two pathologists for clinical purposes were used. The extent of fibrosis was staged as follows: stage 0 was defined as no fibrosis. Enlarged fibrotic portal tracts were classified as stage 1. Periportal or portal-portal septa but intact architecture was classified as stage 2. Fibrosis with architectural distortion without cirrhosis as stage 3, and probable or definite cirrhosis as stage 4.17 We classified stages 3 and 4 fibrosis as advanced fibrosis and stages 0, 1, and 2 as nonadvanced fibrosis. This study was approved by Marshall University Institutional Review Board.
2. Statistical method
The chi-square test and Fisher's exact test were used for the categorical variables as appropriate, and a Wilcoxon rank sum test was used for the continuous variables to compare between advanced and nonadvanced fibrosis groups. A box plot was created to show the relationship between variables and the severity of fibrosis. The variables which were significantly associated with the severity of fibrosis in univariate analyses were selected for the multivariate logistic regression. A Wald chi-square test was used for the statistical significance testing in the logistic regression.
The independent predictors in our multivariate model were incorporated into the BARD score model to make an enhanced model. A receiver operating characteristics (ROC) curve was plotted for both the BARD score model and our enhanced model to evaluate their abilities to identify advanced fibrosis in NAFLD patients. The area under the ROC curve (AUROC) was calculated and a likelihood ratio test was performed to compare the models. As an independent validation data set was not available, a 20-fold cross validation was performed as internal validation.18
A p-value of 0.05 was used as the cutoff of statistical significance. Statistical analyses were conducted using the SAS 9.1 statistical software package (SAS Institute, Cary, NC, USA). SPSS version 13.0 (SPSS Inc., Chicago, IL, USA) was used to create the box plot and ROC curve.
RESULTS
Of the 135 patients who were identified initially, 28 of patients were excluded because of our exclusion criteria: 10 alcoholic hepatitis, nine viral hepatitis, six pediatric patients, two hemochromatosis, and one patient with lack of data. Finally, 107 patients were included in this study. Twenty-two patients had stage 0 fibrosis, 20 patients had stage 1, 31 patients had stage 2, 18 patients had stage 3, and 16 patients had stage 4 fibrosis. As shown in Table 1, the median age was 48.9. Most (79%) of the patients had a BMI of more than 28 kg/m2 and there were more females than males in both of nonadvanced and advanced fibrosis groups. Age, BMI, and sex did not show any significant difference between the two groups. Of the laboratory test results, ALT, AST/ALT ratio, bilirubin, INR, cholesterol, and high density lipoprotein levels were significantly different between the advanced and nonadvanced liver fibrosis groups.
Table 1.
Demographic and Clinical Characteristics of Patients
Data are presented as medians (interquartile ranges) or number (%).
BMI, body mass index; AST, aspartate aminotransferase; ALT, alanine aminotransferase; INR, international normalized ratio; LDL, low density lipoprotein; HDL, high density lipoprotein.
*Comparison between nonadvanced vs advanced disease. Chi-square tests for sex, diabetes, hypertension, and hyperlipidemia. Wilcoxon Rank-Sum tests for other variables.
Of the comorbidities, only hypertension was significantly different between the two fibrosis groups. About 33% of the patients had diabetes. The difference in frequencies of diabetes and hyperlipidemia between the two fibrosis groups didn't reach a level of statistical significance (Table 1).
Age, BMI, AST/ALT ratio, INR, bilirubin, cholesterol, diabetes, and hypertension were included initially in multivariate logistic regression. Age, BMI, and diabetes were included because these had been the important variables in the previous studies. After stepwise selection, age, AST/ALT ratio, and INR were the independently significant factors for advanced fibrosis. Bilirubin, diabetes, and hypertension were not independently significant, even though these variables remained in the final model (Table 2).
Table 2.
Multivariate Logistic Regression Analysis for the Prediction of Advanced Liver Fibrosis
Stepwise selection after including age, body mass index, AST/ALT ratio, INR, bilirubin, cholesterol, diabetes, and hypertension. The AST/ALT ratio and INR were multiplied by 10 to match the interval scale with other variables.
OR, odds ratio; CI, confidence interval; AST, aspartate aminotransferase; ALT, alanine aminotransferase; INR, international normalized ratio.
*Wald chi-square test; †OR for the 0.1 change of AST/ALT ratio; ‡OR for the 0.1 change of INR.
The BARD score was calculated and its ROC curve was created using our study sample. An enhanced model of BARD score, BARDI score, was generated by adding the INR to the components of the BARD score model. To keep the character of the BARD score, the same criteria (BMI ≥28 kg/m2 and AST/ALT ratio ≥0.8) were used in the BARDI score. For the INR, the ROC curve was generated and the maximum point of sensitivity plus specificity was chosen as a cutoff (INR, 1.07). Forced entry multivariate logistic regression was performed again using BMI, AST/ALT ratio, diabetes and INR. The coefficients (β) of variables in this multivariate logistic regression were used for the BARDI scoring system. AST/ALT ≥0.8 (β=3.21) received three credits, diabetes (β=1.22) received one credit, and an INR ≥1.07 (β=2.01) received two credits. A BMI ≥28 kg/m2 (β=0.12) received one credit even with a low β in order to keep the character of the BARD score in the BARDI score. The ROC curve of this BARDI model showed better performance than the BARD model (Fig. 1). The AUROC of the BARDI model was significantly better than the AUROC of the BARD model (0.881 vs 0.808, p<0.01). Comparing a BARD score of 2 or more with a BARDI score of 2 or more, the PPV was 45.8% for the BARD and 45.0% for the BARDI, which were not much different in our subjects. The negative predictive values (NPV) were 100% in both the BARD and BARDI scores. When comparing a BARDI score of 3 or more with a BARD of score 2 or more, the BARDI score showed higher specificity (49.0% vs 34.7%) without losing much sensitivity (96.3% vs 100%). The PPV for a BARDI score of 3 or more was 51.0%, while the PPV for a BARD score of 2 or more was 45.8%. The NPV for a BARDI score of 3 or more was 96.0% and NPV for a BARD score of 2 or more was 100% (Table 3). Twenty fold cross validation of the BARDI model showed 80.39% accuracy with a standard error of 0.75%. We tried to incorporate age, bilirubin, and hypertension respectively into the BARDI model. However, these variables did not increase the AUROC significantly and were not included in the final model (data not shown).
Fig. 1.
Receiver operating characteristic (ROC) curve of BARD and BARDI score for advanced liver fibrosis. Area under ROC curve for BARDI model, 0.881 (95% confidence interval [CI], 0.804 to 0.958). Area under ROC curve for BARD model, 0.808 (95% CI, 0.712 to 0.904). Seventy-six patients were used for this ROC curve after the exclusion of patients with missing values.
*Likelihood ratio test.
Table 3.
Comparison between BARD and BARDI Scores Using Our Study Sample
BARD area under receiver operating characteristic curve 0.808. BARDI area under receiver operating characteristic curve 0.881. BARDI score is the sum of the points below.
If body mass index ≥28 kg/m2, then 1 point. If aspartate aminotransferase/alanine aminotransferase ratio ≥0.8, then 3 points. If diabetes present, then 1 point. If international normalized ratio ≥1.07, then 2 points.
PPV, positive predictive value; NPV, negative predictive value.
DISCUSSION
We developed an enhanced model (BARDI score), by adding the INR, which was known to have correlation with liver fibrosis.13,16 As shown in Table 3, the BARDI score kept the simplicity and strength of the BARD score. All patients with a BARD score of 2 or more should have a BARDI score of 2 or more. A BARDI score of 2 or more performed similarly to a BARD score of 2 or more. When a BARDI score of 3 or more was compared with a BARD score of 2 or more, the BARDI score showed an improved PPV and kept an excellent NPV for detecting advanced liver fibrosis.
Using 2 or more of the BARD score is just the same as having one of either two risk factors (obesity and diabetes) or an AST/ALT ratio ≥0.8 as criteria for advanced fibrosis. If the patient has two risk factors with an AST/ALT ratio less than 0.8, the patient will be classified as high risk for advanced fibrosis. Usually, diabetic patients have a high prevalence of obesity.19 All these obese diabetic patients will be classified into the high risk group for advanced fibrosis, which will result in a lot of false positivity. In our study, we had a skewed population in terms of BMI and most patients received a credit due to obesity (BMI ≥28). Among the patients with both diabetes and obesity in our study (n=27), many patients (n=11, 40.7%) would have a BARD score of 2, even though they did not have advanced liver fibrosis. In contrast, a BARDI score of 3 or more cannot be made from the risk factors alone. Risk factors need to be combined with an INR ≥1.07 to reach a score of 3 or more. This combination is the main reason why the BARDI score can decrease the false positivity (increase PPV) of the BARD score. The role of AST/ALT ratio was kept the same in the BARDI score as in the BARD score. Any patient with an AST/ALT ratio of 0.8 or more should be classified into the possible advanced liver fibrosis group in either scoring system.
Age,11-13,16 platelet count,13 albumin,16 bilirubin,13,16 triglyceride,11 impaired fasting glucose level,16 and hypertension15,16 are known to be associated with advanced liver fibrosis and some of them have been used in other models, as shown in Table 4. In our study, we didn't have enough data for platelet, albumin and were not able to compare BARDI score directly with other models including NAFLD fibrosis score16 or FIB-4 score.13 As our patients did not have liver fibrosis panel, it was also not possible to compare the BARDI score with any models (OELF,12,14 ELF14 models) that used the liver fibrosis panel. Indirect comparison of PPV and NPV was also not possible because the PPV and NPV would be affected by the prevalence of advanced liver fibrosis in each study. However, the indirect comparison of sensitivity and specificity between models showed promising results. For example, an NAFLD fibrosis score showed 82% sensitivity and 77% specificity. A BARDI score of 3 or more showed 96% sensitivity and 49% specificity. When it comes with the BARDI score of 5 or more, the BARDI showed 85% sensitivity and 79.6% specificity. When the AUROC was compared using the previously reported AUROCs of other models, the AUROC 0.881 of the BARDI score was at least comparable with other models (Table 4).
Table 4.
Previously Reported Models for Advanced Liver Fibrosis in NAFLD
NAFLD, nonalcoholic fatty liver disease; AUROC, area under receiver operating characteristic curve; PPV, positive predictive value; NPV, negative predictive value; BMI, body mass index; ALT, alanine aminotransferase; AST, aspartate aminotransferase; P3NP, amino-terminal propeptide of type III collagen; TIMP-1, tissue inhibitor of matrix metalloproteinase 1; HA, hyaluronic acid; IFG, impaired fasting glucose; DM, diabetes mellitus; NA, not available.
*AUROC, PPV, NPV were calculated to detect stage 2 or higher; †AUROC, PPV, NPV data were from the study.
Other imaging studies including ultrasound based transient elastography (FibroScan), MR spectroscopy, diffusion weighted MRI, and MR elastography have been developed.9 It has been shown that these imaging methods have slightly better or comparable results to clinical models. In the case of transient elastography, the AUROC was reported to be 0.9047 and 0.94.5 In particular, Wong et al.5 showed the superiority of transient elastography to other previous clinical models for the diagnosis of advanced liver fibrosis. However, these imaging techniques cost more expenses and limited to only a small number of centers. Considering that NAFLD is a very common disease with a prevalence of about 30%,1 there should be a way that clinicians can evaluate a patient's risk of liver fibrosis in their offices before they refer them to specialized centers. If the assessment can be done using simple usual lab tests, many at risk patients can be detected or excluded before they go for an imaging test. After first screening with a clinical model, a physician still can perform imaging tests, which can give some information before the liver biopsy. An imaging test can differentiate a diffuse lesion from a local lesion and can also provide information about the character of a localized lesion, including possible mass like lesion, infection and so on. Then patient can opt for biopsy if it is indicated through the clinical model and imaging test.
Our study had several limitations. We had only 107 patients to analyze, which made it hard to split our subjects for external validation. Instead, we performed 20-fold cross validation, which showed a good performance of the BARDI score. Another limitation was derived from the restrictions of retrospective study. All information was gathered using the medical records which had been produced for clinical purposes only. The information was not complete enough to compare several previous models using our study subjects. We were able to calculate only the BARD score for our subjects. Also some medical records were not detailed regarding alcohol consumption. We had to exclude any patients who drank unspecified amounts of alcohol from our study. Also, our study was based on patients who had already had liver biopsies, indicating that the studies subjects may be sicker than the general population. Actually, all previous clinical models were made using the patient population who had received or had decided to have liver biopsy as well. The performance of these models should be evaluated prospectively in general population.
In conclusion, the INR was an independently significant factor for advanced liver fibrosis. We developed the BARDI score by adding the INR to the BARD score, resulting in an enhanced model that was still easy to calculate and had better performance than the BARD in predicting advanced liver fibrosis. Using the BARDI score would decrease false positivity with an excellent NPV, and eventually prevent unnecessary imaging tests or risky liver biopsies. Further external validation study and prospective study are needed to confirm this finding.
ACKNOWLEDGEMENTS
The authors thank Dr. W. Ray Kim for his help in the preparation of this manuscript. The authors also thank Drs. Toni Pacioles, Fikadu Tekleyes, and Waseem Shora in data collection.
Footnotes
No potential conflict of interest relevant to this article was reported.
References
- 1.Browning JD, Szczepaniak LS, Dobbins R, et al. Prevalence of hepatic steatosis in an urban population in the United States: impact of ethnicity. Hepatology. 2004;40:1387–1395. doi: 10.1002/hep.20466. [DOI] [PubMed] [Google Scholar]
- 2.Lewis JR, Mohanty SR. Nonalcoholic fatty liver disease: a review and update. Dig Dis Sci. 2010;55:560–578. doi: 10.1007/s10620-009-1081-0. [DOI] [PubMed] [Google Scholar]
- 3.Alba LM, Lindor K. Review article: non-alcoholic fatty liver disease. Aliment Pharmacol Ther. 2003;17:977–986. doi: 10.1046/j.1365-2036.2003.01493.x. [DOI] [PubMed] [Google Scholar]
- 4.Seeff LB, Everson GT, Morgan TR, et al. Complication rate of percutaneous liver biopsies among persons with advanced chronic liver disease in the HALT-C trial. Clin Gastroenterol Hepatol. 2010;8:877–883. doi: 10.1016/j.cgh.2010.03.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Wong VW, Vergniol J, Wong GL, et al. Diagnosis of fibrosis and cirrhosis using liver stiffness measurement in nonalcoholic fatty liver disease. Hepatology. 2010;51:454–462. doi: 10.1002/hep.23312. [DOI] [PubMed] [Google Scholar]
- 6.Yoneda M, Fujita K, Inamori M, Tamano M, Hiriishi H, Nakajima A. Transient elastography in patients with non-alcoholic fatty liver disease (NAFLD) Gut. 2007;56:1330–1331. doi: 10.1136/gut.2007.126417. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Yoneda M, Mawatari H, Fujita K, et al. Noninvasive assessment of liver fibrosis by measurement of stiffness in patients with nonalcoholic fatty liver disease (NAFLD) Dig Liver Dis. 2008;40:371–378. doi: 10.1016/j.dld.2007.10.019. [DOI] [PubMed] [Google Scholar]
- 8.Szczepaniak LS, Nurenberg P, Leonard D, et al. Magnetic resonance spectroscopy to measure hepatic triglyceride content: prevalence of hepatic steatosis in the general population. Am J Physiol Endocrinol Metab. 2005;288:E462–E468. doi: 10.1152/ajpendo.00064.2004. [DOI] [PubMed] [Google Scholar]
- 9.Talwalkar JA, Yin M, Fidler JL, Sanderson SO, Kamath PS, Ehman RL. Magnetic resonance imaging of hepatic fibrosis: emerging clinical applications. Hepatology. 2008;47:332–342. doi: 10.1002/hep.21972. [DOI] [PubMed] [Google Scholar]
- 10.Tsuda N, Okada M, Murakami T. Potential of gadolinium-ethoxybenzyl-diethylenetriamine pentaacetic acid (Gd-EOB-DTPA) for differential diagnosis of nonalcoholic steatohepatitis and fatty liver in rats using magnetic resonance imaging. Invest Radiol. 2007;42:242–247. doi: 10.1097/01.rli.0000258058.44876.a5. [DOI] [PubMed] [Google Scholar]
- 11.Ratziu V, Giral P, Charlotte F, et al. Liver fibrosis in overweight patients. Gastroenterology. 2000;118:1117–1123. doi: 10.1016/s0016-5085(00)70364-7. [DOI] [PubMed] [Google Scholar]
- 12.Rosenberg WM, Voelker M, Thiel R, et al. Serum markers detect the presence of liver fibrosis: a cohort study. Gastroenterology. 2004;127:1704–1713. doi: 10.1053/j.gastro.2004.08.052. [DOI] [PubMed] [Google Scholar]
- 13.Sterling RK, Lissen E, Clumeck N, et al. Development of a simple noninvasive index to predict significant fibrosis in patients with HIV/HCV coinfection. Hepatology. 2006;43:1317–1325. doi: 10.1002/hep.21178. [DOI] [PubMed] [Google Scholar]
- 14.Guha IN, Parkes J, Roderick P, et al. Noninvasive markers of fibrosis in nonalcoholic fatty liver disease: validating the European Liver Fibrosis Panel and exploring simple markers. Hepatology. 2008;47:455–460. doi: 10.1002/hep.21984. [DOI] [PubMed] [Google Scholar]
- 15.Harrison SA, Oliver D, Arnold HL, Gogia S, Neuschwander-Tetri BA. Development and validation of a simple NAFLD clinical scoring system for identifying patients without advanced disease. Gut. 2008;57:1441–1447. doi: 10.1136/gut.2007.146019. [DOI] [PubMed] [Google Scholar]
- 16.Angulo P, Hui JM, Marchesini G, et al. The NAFLD fibrosis score: a noninvasive system that identifies liver fibrosis in patients with NAFLD. Hepatology. 2007;45:846–854. doi: 10.1002/hep.21496. [DOI] [PubMed] [Google Scholar]
- 17.Brunt EM, Janney CG, Di Bisceglie AM, Neuschwander-Tetri BA, Bacon BR. Nonalcoholic steatohepatitis: a proposal for grading and staging the histological lesions. Am J Gastroenterol. 1999;94:2467–2474. doi: 10.1111/j.1572-0241.1999.01377.x. [DOI] [PubMed] [Google Scholar]
- 18.Moore C. SAS macro CVLR (cross-validation for logistic regression) [Internet] Athens: University of Georgia; 2000. [cited 2010 Nov 16]. Available from: http://ctmoore.myweb.uga.edu/prof/CVLRnotes.htm. [Google Scholar]
- 19.Centers for Disease Control and Prevention (CDC) Estimated county-level prevalence of diabetes and obesity: United States, 2007. MMWR Morb Mortal Wkly Rep. 2009;58:1259–1263. [PubMed] [Google Scholar]