Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2009 Nov 1.
Published in final edited form as: Hepatology. 2008 Nov;48(5):1549–1557. doi: 10.1002/hep.22517

Prediction of Clinical Outcomes in Primary Biliary Cirrhosis by Serum Enhanced Liver Fibrosis (ELF) Assay

MJ Mayo, J Parkes, B Adams-Huet, B Combes, AS Mills, RS Markin, R Rubin, D Wheeler, M Contos, AB West, S Saldana, Y Getachew, R Butsch, V Luketic, MG Peters, A Di Biscelglie, N Bass, J Lake, T Boyer, E Martinez, J Boyer, G Garcia-Tsao, D Barnes, WM Rosenberg
PMCID: PMC2597274  NIHMSID: NIHMS65279  PMID: 18846542

Abstract

Background

Primary biliary cirrhosis (PBC) is sometimes diagnosed based upon a positive antimitochondrial antibody in the appropriate clinical setting without a liver biopsy. While a liver biopsy can assess the extent of liver fibrosis and provide prognostic information, serum fibrosis markers avoid biopsy complications, sampling error, and provide results as a continuous variable, which may be more precise than categorical histological stages. The current study was undertaken to evaluate serum fibrosis markers as predictors of clinical progression in a large cohort of PBC patients.

Methods

Serial liver biopsies and serum samples were collected every 2 years in 161 PBC subjects for a median of 7.3 years. Clinical progression was defined as development of one or more of the following events: varices, variceal bleed, ascites, encephalopathy, liver transplant, or liver-related death. Serum HA, TIMP-1, and PIIINP were measured and entered into the previously validated Enhanced Liver Fibrosis (ELF) algorithm. The ability of ELF, histological fibrosis, bilirubin, MELD, and Mayo Risk Score to differentiate between individuals who would experience a clinical event from those who would not was evaluated at different time points.

Results

Event-free survival was significantly lower in those with high baseline ELF. Each 1 point increase in ELF was associated with a 3 fold increase in future complications. The prognostic performance of all tests was similar when performed close to the time of the first event. However, at earlier times in the disease process (4 and 6 years prior to the first event), the prognostic performance of ELF was significantly better than MELD or Mayo R score.

Conclusion

The ELF algorithm is a highly accurate non-invasive measure of PBC disease severity which provides useful long-term prognostic information.

Keywords: Mayo Risk Score, MELD, Biopsy, Metalloproteinase, Hyaluronic acid

Introduction

Some patients with primary biliary cirrhosis (PBC) will have a normal life span without complications, whereas others will progress to complications of cirrhosis, liver failure, and death [1]. Accurately predicting clinical outcomes in patients with primary biliary cirrhosis presents a challenge. Surrogate markers of disease progression are needed for multiple reasons, including providing prognostic information to patients, optimizing referral time for liver transplant, and serving as endpoints in clinical trials [2]. The Model for End stage Liver Disease (MELD) and Mayo Prognostic Model for PBC (Mayo R score) are prognostic models that have been validated in PBC patients as predictors of death. They are widely used to determine appropriate timing of liver transplantation, but these models were not originally developed to detect disease progression at earlier stages of PBC. Liver biopsy has been the only reliable means to assess prognosis in all stages of disease. However, the accuracy of liver biopsy is limited by sampling error, subjective interpretations, limited frequency of biopsies, and categorical staging systems. Non-invasive surrogate markers are appealing because they do not pose the same risks of pain and bleeding as liver biopsy; they can be performed frequently and provide a score that is a continuous variable, potentially capable of tracking progression from mild fibrosis through to end-stage cirrhosis. Serum fibrosis markers could potentially replace liver biopsy as the test of choice for determining the position of a patient along the spectrum of disease severity. However, their prognostic accuracy compared to liver biopsy has not been assessed in PBC.

The correlation between serum fibrosis markers and histology in other chronic liver disease is good, but not perfect. Thus, clinicians have been reluctant to substitute serum fibrosis markers for histological staging. Most marker panels show similar ability to distinguish significant from insignificant fibrosis in cross-sectional studies, with area under the receiver operating characteristic curve (AUROC) of about 0.8. [3-5] However, imperfect correlation due to biopsy sampling error is to be expected, and only longitudinal studies with clinical endpoints can determine the prognostic accuracy of biochemical markers of disease severity. Very few studies have acquired sufficient follow-up data to evaluate clinical outcomes. Amongst 357 patients with alcoholic liver disease, those with elevated serum YKL-40 (growth factor) or PIIINP (N-terminal propeptide of type III procollagen) had shorter survival than patients with normal serum levels of YKL-40 or PIIINP.[6] In a group of 537 treated and untreated patients with hepatitis C, fibrosis biomarkers had a similar, but statistically better ability than biopsy to predict hepatitis-related complications over a 5-year period. [7] This study investigated the hypothesis that serum fibrosis markers would predict clinical progression as well or better than current methods of disease staging in primary biliary cirrhosis. This is the first study to evaluate serum fibrosis markers in a large cohort of patients with primary biliary cirrhosis patients and in particular to examine the ability of serum fibrosis markers to predict clinical outcomes.

Methods

Subjects

One hundred sixty one patients with PBC were prospectively followed between 1993 and 2003 as part of a multi-center US clinical trial (PUMPS) that was designed to investigate whether low dose weekly methotrexate, when added to ursodiol, improved survival or delayed progression of PBC. Methotrexate was not found to affect the course of PBC [8], so patients from both treatment arms were combined for the purpose of the present analysis. Combining treatment arms was also later justified by a subanalysis demonstrating no effect of the treatment arm on the outcomes of this analysis. Only patients with established, but not decompensated PBC were enrolled into the parent trial. Participants were required to have both a positive anti-mitochondrial antibody and either an abnormal alkaline phosphatase or at least stage 1 disease on liver biopsy. Exclusion criteria included a history of variceal bleeding, ascites, or encephalopathy.

Investigations

Subjects received a history, physical, and serum blood tests at entry and every 3 months. At entry and at 2 year intervals, they also underwent percutaneous liver needle core biopsy to assess histological changes, endoscopy to look for new varices, and abdominal ultrasound to look for new ascites. On the same day as these procedures, serum for long term storage was obtained, frozen, and later used for the serum fibrosis marker assay. Thus, all biochemical, serological, and histological data were precisely synchronized. Biopsies were coded with a unique subject identifier and date. They were fixed and stained with hemotoxylin and eosin for assessment of cell morphology and trichrome stain for assessment of fibrosis. Each biannual biopsy was read by 4 independent pathologists and a mean PBC stage of 1-4 (explained in Table 1) was calculated. Pathologists convened on an annual basis to independently read those biopsies that had been collected over the preceding 12 months. Biopsies were read in isolation, i.e. pathologists did not confer and previous biopsies from the same subject were not compared to newly acquired biopsies. Each pathologist read each biopsy only once, except for a single pathologist (S. Mills), who, at the request of the NIH, also read each biopsy using the Ishak scoring system to quantify fibrosis. [9] (Table 1). Routine laboratories (bilirubin, albumin, INR, creatinine) were performed at each center throughout the trial. MELD and Mayo R scores were calculated as described previously. Enhanced Liver Fibrosis (ELF) Test scores, were determined on frozen sera after the close of the study. The ELF assay, which includes serum hyaluronic acid (HA), tissue inhibitor of metalloproteinase 1 (TIMP-1), and procollagen type III N-terminal propeptide (PIIINP), was previously developed and validated as a predictor of histological fibrosis in a cohort of 1921 patients with chronic liver disease, including 53 patients with either PBC or primary sclerosing cholangitis [3]. All assays are heterogeneous immunoassays employing a magnetic particle separation technique developed by Bayer Diagnostics and now property of Siemens Molecular Solutions [3].

Table 1.

Fibrosis Scoring Systems

Ishak Score
0 No fibrosis
1 Fibrous expansion of some portal areas, with or without short septae
2 Fibrous expansion of most portal areas, with or without short septae
3 Fibrous expansion of most portal areas with occasional portal to portal (P-P) and/or portal to central (P-C) bridging
4 Fibrous expansion of portal areas with marked bridging (P-P as well as P-C)
5 Marked bridging (P-P as well as P-C) with occasional nodules (incomplete cirrhosis)
6 Cirrhosis (probable or definite)
PBC Stage
1 No fibrosis. (There had to be some portal inflammation on the initial biopsy in order to be enrolled).
2 Non- bridging fibrosis. (Portal tracts expanded beyond the limiting plate with any degree of fibrosis less than stage 3.)
3 Bridging fibrosis (Two or more portal-portal bridges)
4 Cirrhosis (at least one regenerative nodule surrounded by fibrosis)

Analysis

Each of the potential prognostic tests (ELF, histology, serum bilirubin, Mayo R score, and MELD) was then examined for its ability to separate patients who developed clinical progression from those who did not. Clinical progression was defined as the development of one of the following events: new varices, variceal bleed, ascites, encephalopathy, liver-related death, or liver transplantation.

Two types of analyses were performed to examine the predictive ability of the prognostic tests. In the forward-looking analysis, the ability of each baseline test result (ELF, Ishak score, PBC stage, Bilirubin, MELD, Mayo R score) to predict development of a clinical event during follow-up was analyzed. Each of the baseline test scores were divided into three subgroups (low, intermediate, and high score) and the frequency of progression in each subgroup was determined. A Kaplan-Meier time to event curve was generated for each subgroup. A Cox Proportional Hazard ratio was modeled, using ELF and the mean histological PBC stage of the 4 pathologists as continuous variables.

A rear-looking analysis was also performed to address the possibility that some tests might be better long-range predictors in patients with earlier disease. The rear-looking analysis was designed to account for heterogeneity of disease severity in the study population at the time of entry into the parent trial. This analysis looked retrospectively at tests performed at 0, 2, 4, 6, or 8 years prior to the development of the first clinical event. For example, in subjects who developed new varices, the data measured at the time of the endoscopy is designated ‘0 years prior’ to the event. In subjects who never developed a clinical event, data were used from the last date the individual was known to be completely free of complications. At each of these time points, the ability of each test to predict whether or not the subject would later develop a clinical event was determined. ROC curves were generated to compare the diagnostic performance of each of the tests at each time point in the disease process. The areas under the receiver operating characteristic curves (AUROC)s for each test were compared using a non-parametric approach [10], including both omnibus and pairwise comparisons. To adjust for multiple comparisons, a p-value of <0.01 was considered significant for the pairwise testing.

ELF scores were correlated with biopsy findings using the Spearman Rank Order Correlation test. An ROC curve was generated to examine the ability of ELF score to predict histological fibrosis. Serial assessments of ELF that were made over the eight year follow-up duration were evaluated for change over time by estimating the slopes of the regression line from initial to final ELF determination with random coefficient models. Comparisons of the slopes between the event and no event groups were made with linear contrasts from the random coefficient model. The modeling and fitting was performed in SAS 9.1. (SAS Institute, Cary, NC) using the PROC MIXED procedure.

Results

The median follow-up of the 161 subjects was 7.3 years (range 5 months-10 years). The vast majority of subjects were Caucasian (93%) and female (93%). During follow-up, forty two (26%) of the subjects developed one of the events defined as clinical progression: onset of varices (33), variceal bleed (0), ascites (6), encephalopathy (1), liver-related death (1), or liver transplantation (1). Although most of these subjects had the onset of varices as their first event, this was often followed by one or more of the other events.

The characteristics of the study population are shown in Table 2. The mean scores of all prognostic tests were significantly higher at study entry in the group that progressed to develop clinical events during follow-up.

Table 2.

Characteristics of Study Population at Study Entry (n=161)

No Event (n=119) Event (n=42) p-value

Age 51.1 ± 8.8 53.0 ± 8.4 0.24

ELF 0.17 ± 0.72 1.10 ± 0.98 <0.0001
0.14 1.14

Ishak Score
(0-6)
2.37 ± 1.34 3.14 ± 1.18 0.0008
2.0 3.0

PBC Stage
(1-4)
1.98 ± 0.70 2.50 ± 0.87 0.0009
1.0 1.75

Bilirubin 0.54 ± 0.28 0.81 ± 0.53 0.002
0.50 0.60

MELD 6.30 ± 0.66 7.02 ± 1.55 0.001
6.0 6.0

Mayo Risk 3.68 ± 0.64 4.15 ± 0.60 0.0001
3.66 4.21

Results are expressed as mean ± standard deviation and median

Adequacy of Liver Biopsies

Of the 578 liver biopsies used in this study, the median biopsy length was 1.7 cm (minimum 1 cm, maximum 5 cm). The majority of biopsies (80%) had 6 portal tracts per tissue core. Less than 1% had more than 6 portal tracts per biopsy. Very few (7%) had less than 4 portal tracts, and the remainder (13%) had 4 or 5 portal tracts per biopsy. The biopsies were fragmented in 47% of cases. Pathologists were given the option to interpret a biopsy as “inadequate for staging,” which occurred in one instance. The weighted kappa score for inter-pathologist variability in assessing PBC stage was 0.62.

Forward-looking event-free survival

The chance of developing a clinical event was examined in persons with either low, intermediate, or high scores on each of the prognostic tests obtained at baseline. There was a significant correlation between the baseline score of all prognostic tests and the likelihood of developing a clinical complication over the next 6 years (Table 3). Only 8% of subjects with a low baseline ELF score developed a complication during follow-up, whereas 24% of subjects with an intermediate score and 46% of subjects with high a score developed a clinical complication. In a Cox's Proportional Hazard model adjusting for age and treatment arm (the latter had no effect), each increase in ELF score by 1 point was associated with a 3-fold increase in future complications (Relative risk 2.9, 95% CI 2.0 -4.3). Every stage increase of PBC stage (1-4 point scale) led to a 2-fold increase in future complication rate (RR 2.4, 95% CI 1.6 -3.6). ELF performed as well as either Ishak score or PBC stage.

Table 3.

Clinical Progression According to Baseline Prognostic Test

Test Subgroup Subgroup Value Clinical Event*
Yes No
ELF Low -1.32 to -0.08 4 (8%) 49 (92%)
Intermediate -0.06 to 0.669 13 (24%) 41 (76%)
High 0.67 to 3.23 25 (46%) 29 (54%)
Ishak Score Low 0 - 1 1 (4%) 25 (96%)
Intermediate 2 - 3 25 (25%) 74 (75%)
High 4 - 6 16 (46%) 19 (54%)
PBC Stage Low <2.0 12 (17%) 57 (83%)
Intermediate 2.0 -2.9 12 (20%) 47 (80%)
High 3-4 18 (55%) 15 (45%)
Bilirubin Low ≤ 1.0 32 (22%) 114 (78%)
Intermediate 1.1 - 2.0 8 (62%) 5 (38%)
High > 2.0 2 (100%) 0 (0%)
MELD Low ≤ 6.0 24 (20%) 95 (80%)
Intermediate 7-10 17 (41%) 24 (59%)
High ≥ 11 1 (100%) 0
Mayo Risk Low < 3.5 8 (14%) 48 (86%)
Intermediate 3.5-3.9 9 (21%) 34 (79%)
High ≥ 4.0 25 (40%) 37 (60%)

ELF is categorized by tertiles

*

p< 0.004 for all prognostic tests, event group versus no event group, comparison by Cochran-Armitage Trend test

The event-free survival according to baseline ELF score is depicted in Figure 1. The log rank test comparing the three subgroups showed a highly significant difference (p<0.0001) in event-free survival depending on the baseline ELF score.

Figure 1. Event-Free Survival by Baseline ELF Score.

Figure 1

Kaplan-Meier curve of time to first clinical event by ELF score. Low ELF: -1.32 - -0.08, Intermediate ELF: -0.06- 0.669, High ELF: 0.67-3.23. Event –free survival was significantly lower in patients with higher baseline ELF scores.

Rear-looking prediction of clinical progression

The ability of each test to predict whether or not a subject would later develop a clinical event was also retrospectively determined at increasing time intervals (0, 2, 4, 6, and 8 years) prior to the first event. For each time point, ROC curves were generated for each diagnostic test (Figure 2), and the AUROC was calculated (Table 4) as a measure of the prognostic value of each test. The AUROCs of the various tests were not significantly different from each other at years 0 and 2 (ANOVA, p=0.24 at 0 years and p=0.25 at 2 years). However, at earlier time points in the disease process (4 and 6 years prior to the first complication), there were significant differences (ANOVA p=0.003 at 4 years and p=0.007 at 6 years). The differences in the prognostic tests performed 8 years before the first clinical event were also significant (p<0.001), but the confidence interval of the AUROCs were quite wide at this time point due to the small number of patients in the event group at that time, which made comparisons between tests difficult to interpret.

Figure 2. Performance of Prognostic Tests at Specified Time Points In the Disease Process of PBC.

Figure 2

The ability of each prognostic test to predict whether an individual would develop a clinical complication was examined at different time points before the first event. Near the time of the first event (year 0), the ROC curves overlapped considerably. However, at earlier time points (4 and 6 years before the first clinical event), ELF score and histological fibrosis performed better as prognostic tests. (ANOVA p=0.003 at 4 years and p=0.007 at 6 years).

Table 4.

Performance of Prognostic Tests in PBC: Area Under the Receiver Operating Characteristic Curves with 95% confidence intervals

AREA UNDER ROC (CI)
TEST Years Prior to First Complication
0 2 4 6 8
European Liver Fibrosis Panel 0.748
(0.651-0.845)
0.739
(0.653-0.825)
0.682
(0.578-0.786)
0.684
(0.528-0.841)
0.784
(0.600-0.968)
Ishak Score
(0-6)
0.732
(0.627-0.836)
0.743
(0.647-0.839)
0.721
(0.617-0.824)
0.751
(0.606-0.896)
0.584
(0.423-0.745)
PBC Stage
(1-4)
0.706
(0.583-0.829)
0.675
(0.562-0.788)
0.634
(0.505-0.764)
0.657
(0.471-0.843)
0.305
(0.077-0.534)
Bilirubin
(loge)
0.650
(0.526-0.774)
0.659
(0.541-0.777)
0.617
(0.501-0.732)
0539
(0.368-0.711)
0.559
(0.098-1.0)
MELD 0.642
(0.538-0.746)
0.630
(0.531-0.729)
0.544
(0.439-0.649)
0.592
(0.455-0.730)
0.612
(0.327-0.898)
Mayo PBC Risk Score 0.680
(0.564-0.796)
0.625
(0.518-0.733)
0.501
(0.388-0.614)
0.431
(0.294-0.568)
0.441
(0.119-0.763)

At each time point in the disease process, Ishak staging system and ELF score had highest predictive values (AUROC). In fact, there was no statistically significant difference in the AUROC of ELF and histological staging at any time point. The AUROC of histological staging by the 7-point Ishak system was consistently slightly greater than the AUROC of the 4-point PBC stage. The prognostic performance of ELF (AUROC) was also higher than the other non-invasive tests (bilirubin, MELD, Mayo Risk Score) at each time point, although this difference was not statistically significant at later time points in the disease process (0 and 2 years before the first event). The most strongly significant differences were between ELF and Mayo R score AUROCs at earlier time points in the disease process (4 and 6 years prior to the first event, p=0.004 and 0.009 respectively).

Correlation of ELF with histology

The initial ELF score in each subject correlated well with the extent of disease on liver biopsy, using either the Ishak scoring system (rho=.52, p<0.0001) or the PBC staging system (rho=.60, p<0.0001) However, as expected, there was much overlap in individual ELF scores, particularly in persons with intermediate stages of histological fibrosis (Figure 3). ELF score was an accurate non-invasive marker of histological fibrosis. The AUROC of ELF to predict cirrhosis (Ishak 5-6) score was 0.76 (95% CI 0.63-0.89) and to predict significant fibrosis (Ishak 3-6) was 0.75 (CI 0.67-0.82) (Figure 4)

Figure 3. Distribution of First ELF Score According to Biopsy Score.

Figure 3

The first serum ELF score correlated well with histology. Because the mean PBC stage was derived by averaging the scores of four pathologists, the calculated values ranged from 1.0 to 4.0 in increments of 0.25.

Figure 4. ELF as a Predictor of Histological Fibrosis and Cirrrhosis.

Figure 4

ELF score was an accurate non-invasive marker of histological fibrosis. The AUROC of ELF to predict cirrhosis was 0.76 (95% CI 0.63-0.89) and to predict significant fibrosis was 0.75 (CI 0.67-0.82)

Serial Determinations of ELF

One hundred forty nine subjects had more than one frozen serum sample and thus could be used to evaluate change in serial ELF score determinations over the 8 year follow up period. The ELF score increased by 0.032 per year (95% CI:0.008 to 0.005, p=0.009), and the rate of increase was no different comparing the event (slope 0.046 per year, CI: -.0.002 to 0.094,, n=38) and no event (slope 0.027 per year, 95%CI:0 to 0.054, n=111) groups (p=0.50).

Discussion

This is the first study to examine the utility of a serum fibrosis marker panel in primary biliary cirrhosis, and in particular, to predict clinical outcomes. The relationship between the baseline ELF score and the likelihood of developing a clinical complication over the next 6 years was highly significant, as demonstrated by the event free survival Kaplan-Meier analysis.

Other non-invasive prognostic models, such as the Mayo R Score and MELD, are based upon bilirubin, albumin, prothrombin time, edema, and creatinine. These biochemical and clinical markers become abnormal in the later stages of cirrhosis and hepatic dysfunction, which is why they are excellent predictors of death. However, these markers are not sensitive to disease progression earlier in the course of PBC. This is most likely the explanation for the poor prognostic performance of Mayo R Score, MELD, and bilirubin 6 years prior to the first clinical event, similar to the chance of flipping a coin. (AUROC=0.420-0.564). Even though the parent trial was not originally designed to compare prognostic tests, the availability of concurrent data and long-term follow-up enabled us to make comparisons and determine retrospectively the most accurate prognostic tests.

Clinical jaundice is known to be a late but poor prognostic sign in PBC. The PUMPS trial excluded patients with a history of bilirubin >2.0, so only 2 patients in this cohort had jaundice at study entry. Although both of these patients did develop a clinical event, a large percentage of patients with normal baseline bilirubin also progressed to have a clinical event. In the retrospective analysis, taking the loge of bilirubin improved its prognostic ability, although the AUROC of loge bilirubin (reported in this paper) was still lower than that of ELF or histology.

The key components of the ELF algorithm are hyaluronic acid, tissue inhibitor of metalloproteinase 1, and procollagen type III N-terminal propeptide, which are expressed during early stages of collagen deposition in the liver, and this is most likely the reason that ELF retains its prognostic ability even in early stages of the disease process (AUROC 0.737-0.863 at all time points). It should be noted, however, that none of the prognostic tests, including biopsy and ELF, had AUROCs of over 0.9. Thus, no test is error-free when trying to predict clinical progression and this must be considered when counseling individual patients in the clinical setting.

Liver biopsy has been the reference standard for staging PBC and other chronic liver diseases for over 50 years, and the prognostic information gained from liver biopsy is useful. The three most popular staging systems for PBC are the Ludwig, Popper, and Scheuer systems [11, 12, 13]. This study used the the Ludwig system, which has become favored over the others as it is increasingly recognized that florid duct lesions or ductular proliferation need not be present to define early stages. ELF algorithm was found to be superior to PBC stage at all time points for predicting clinical progression. Of course, a liver biopsy provides additional diagnostic information, but this is not always necessary. A typical middle-aged female with cholestatic liver tests, positive AMA and no other liver disease risk factors does not require a confirmatory liver biopsy [14]. Although not originally developed for PBC, the Ishak system was also used to score each of the biopsies in this study. The performance of the Ishak score was not statistically different from the performance of the ELF score. Thus, we propose that if a liver biopsy is not needed for diagnostic purposes, the ELF score can be utilized for prognostic staging and monitoring during follow-up. In the situation where a liver biopsy is appropriate for diagnostic purposes, the Ishak system should be used for more accurate staging. The Ishak system is based entirely on the extent of fibrosis and provides better prognostic information, but cannot establish the diagnosis of PBC. The diagnosis must still be made by the presence of typical histological features such as non-suppurative destructive cholangitis.

The possibility of replacing staging liver biopsies with simple noninvasive marker algorithms is gaining momentum as evidence is accumulating in other chronic liver diseases that these panels are reasonable surrogate markers of histological liver fibrosis. Several cross-sectional studies have evaluated the ability of marker panels to quantify fibrosis in hepatitis C and non-alcoholic fatty liver disease. [15-19] Consistently, these marker panels are reasonably accurate at distinguishing the absence of fibrosis from advanced cirrhosis, but the accuracy of the test in a single individual, particularly an individual with an intermediate degree of fibrosis, is lower 15, 20-23] These studies have been criticized because the “gold standard” of histology is inaccurate due to the high rate of sampling error known to occur in patients with diffuse parenchymal liver diseases.[24] One study of patients with cirrhosis from different causes demonstrated that three samples taken by the same route in the same patients were concordant for the degree of fibrosis in only 50% of cases. [25] Although increasing the length of the biopsy improves diagnostic accuracy, the rate of discordance in biopsies >25mm is still 25% in hepatitis C [26]. Thus, although “bigger is better,” needle biopsies are by their nature somewhat inaccurate. PBC, in particular, is characterized by patchy involvement of portal areas. Whole section scanning of 50 PBC liver explants demonstrated that only 20% of PBC livers have a consistent histological stage of fibrosis throughout the liver, even when the tissue was obtained at clinically-defined end-stage disease. [27] The correlation between ELF score and histological fibrosis in these patients was strong. However, as expected, there was also significant overlap of individual ELF scores, particularly in persons with intermediate stages of histological fibrosis. The ELF score, therefore, was not a perfect reflection of the histological fibrosis score; yet its excellent performance as a prognostic marker implies that biopsy error may actually account for part of the discord between ELF and histological fibrosis score.

One advantage of the current study was that serial serum samples were available in a very well defined cohort with meticulously determined clinical outcomes over a long time period. Thus, prognostic tests could be evaluated at different time points and also evaluated for change over time. The average ELF score increased over time (by 0.032 per year). This average rate of increase was not different between subjects who experienced an event and those who didn't, which indicates that the rate of PBC fibrosis progression may be relatively steady, regardless of whether the starting point is early or advanced disease. It also suggests that the patients who developed the clinical events started the study at a later time point in their disease process. This data argues against the theory that there may be a rapidly fibrotic and slowly fibrotic form of PBC, at least in the cohort studied here.

In conclusion, the ELF algorithm is a highly accurate non-invasive measure of PBC disease severity which provides accurate prognostic information. The ability of this algorithm to predict clinical outcomes is as good or better than traditional prognostic tests for PBC, particularly in early stages of the disease. It is a simple, non-invasive technique with long-range predictive ability that should be very useful in the clinical assessment of PBC.

Acknowledgments

Support: This work was supported by the following grants: NIH R21 DK70995, NIH DK46602, NIH M01-RR00633, DTI UK-Texas Initiative. Reagents were provided by Bayer Diagnostics/Siemens Molecular Solutions and i-Qur Limited.

References

  • 1.Corpechot C, Carrat F, Bahr A, Chretien Y, Poupon RE, Poupon R. The effect of ursodeoxycholic acid therapy on the natural course of primary biliary cirrhosis. Gastroenterology. 2005;128:297–303. doi: 10.1053/j.gastro.2004.11.009. [DOI] [PubMed] [Google Scholar]
  • 2.McHutchison J, Poynard T, Afdhal N International Fibrosis Group Meeting Participants. Fibrosis as an end point for clinical trials in liver disease: a report of the international fibrosis group. Clin Gastroenterol Hepatol. 2006;4:1214–1220. doi: 10.1016/j.cgh.2006.07.006. [DOI] [PubMed] [Google Scholar]
  • 3.Rosenberg WM, Voelker M, Thiel R, Becka M, Burt A, Schuppan D, Hubscher S, Roskams T, Pinzani M, Arthur MJ European Liver Fibrosis Group. Serum markers detect the presence of liver fibrosis: a cohort study. Gastroenterology. 2004;127(6):1704–13. doi: 10.1053/j.gastro.2004.08.052. [DOI] [PubMed] [Google Scholar]
  • 4.Ratziu V, Massard J, Charlotte F, Messous D, Imbert-Bismut F, Bonyhay L, Tahiri M, Munteanu M, Thabut D, Cadranel JF, Le Bail B, de Ledinghen V, Poynard T LIDO Study Group. CYTOL study group. Diagnostic value of biochemical markers (FibroTest-FibroSURE) for the prediction of liver fibrosis in patients with non-alcoholic fatty liver disease. BMC Gastroenterol. 2006;6:6. doi: 10.1186/1471-230X-6-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Naveau S, Raynard B, Ratziu V, Abella A, Imbert-Bismut F, Messous D, Beuzen F, Capron F, Thabut D, Munteanu M, Chaput JC, Poynard T. Biomarkers for the prediction of liver fibrosis in patients with chronic alcoholic liver disease. Clin Gastroenterol Hepatol. 2005;3:167–74. doi: 10.1016/s1542-3565(04)00625-1. [DOI] [PubMed] [Google Scholar]
  • 6.Nojgaard C, Johansen JS, Christensen E, Skovgaard LT, Price PA, Becker U EMALD Group. Serum levels of YKL-40 and PIIINP as prognostic markers in patients with alcoholic liver disease. J Hepatol. 2003;39:179–86. doi: 10.1016/s0168-8278(03)00184-3. [DOI] [PubMed] [Google Scholar]
  • 7.Ngo Y, Munteanu M, Messous D, Charlotte F, Imbert-Bismut F, Thabut D, Lebray P, Thibault V, Benhamou Y, Moussalli J, Ratziu V, Poynard T. A prospective analysis of the prognostic value of biomarkers (FibroTest) in patients with chronic hepatitis C. Clin Chem. 2006;52:1887–96. doi: 10.1373/clinchem.2006.070961. [DOI] [PubMed] [Google Scholar]
  • 8.Combes B, Emerson SS, Flye NL, Munoz SJ, Luketic VA, Mayo MJ, McCashland TM, Zetterman RK, Peters MG, Di Bisceglie AM, Benner KG, Kowdley KV, Carithers RL, Rosoff L, Garcia-Tsao G, Boyer JL, Boyer TD, Martinez EJ, Bass NM, Lake JR, Barnes DS, Bonacini M, Lindsay KL, Milss AS, Markin RS, Rubin R, West AB, Wheeler DE, Contos MJ, Hoffman AF. Methotrexate (MTZ) plus ursodeoxycholic acid (UDCA) in the treatment of primary biliary cirrhosis. Hepatology. 2005;42:1184–1193. doi: 10.1002/hep.20897. [DOI] [PubMed] [Google Scholar]
  • 9.Ishak K, Baptista A, Bianchi L, Callea F, De Groote J, Gudat F, Denk H, Desmet V, Korb G, MacSween RN, et al. Histological grading and staging of chronic hepatitis. J Hepatol. 1995;22:696–699. doi: 10.1016/0168-8278(95)80226-6. [DOI] [PubMed] [Google Scholar]
  • 10.DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: A nonparametric approach. Biometrics. 1988;44:837–45. [PubMed] [Google Scholar]
  • 11.Ludwig J, Dickson ER, McDonald GS. Staging of chronic nonsuppurative destructive cholangitis (syndrome of primary biliary cirrhosis) Virchow Archiv A Pathol Anat Histol. 1978;379:103. doi: 10.1007/BF00432479. [DOI] [PubMed] [Google Scholar]
  • 12.Popper H, Schaffner F. Nonsuppurative destructive chronic cholangitis and chronic hepatitis. In: Popper H, Schaffner F, editors. Progress in liver diseases. Vol. 3. New York: Grune & StrattonInc; 1970. pp. 336–354. [PubMed] [Google Scholar]
  • 13.Scheuer PJ. Primary biliary cirrhosis. Proceedings of the Royal Society of Medicine. 1967;60:1257. [PMC free article] [PubMed] [Google Scholar]
  • 14.Zein Co, Angulo P, Lindor KD. When is liver biopsy needed in the diagnosis of primary biliary cirrhosis? Clin Gastro & Hep. 2003;1(2):89–9. doi: 10.1053/cgh.2003.50014. [DOI] [PubMed] [Google Scholar]
  • 15.Imbert-Bismut F, Ratzui V, Pieroni L, Charlotte F, Benhamou Y, Poynard T MULTIVIRC group. Biochemical markers of liver fibrosis in patients with hepatitis C virus infection: a prospective study. Lancet. 2001;357:1069–75. doi: 10.1016/S0140-6736(00)04258-6. [DOI] [PubMed] [Google Scholar]
  • 16.Guha IN, Parkes J, Roderick PR, Harris S, Rosenberg WM. Non-invasive markers associated with liver fibrosis in non-alcoholic fatty liver disease. Gut. 200;55(11):1650–60. doi: 10.1136/gut.2006.091454. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Zaman A, Rosen HR, Ingram K, Corless CL, Oh E, Smith K. Assessment of FIBROSpectII to detect hepatic fibrosis in chronic hepatitis C patients. Am J Med. 2007;120(3):280e9–14. doi: 10.1016/j.amjmed.2006.06.044. [DOI] [PubMed] [Google Scholar]
  • 18.Guha IN, Parkes J, Roderick P, Chattopadhyay D, Cross R, Harris S, Kaye P, Burt AD, Ryder SD, Aithal GP, Day CP, Rosenberg WM. Noninvasive markers of fibrosis in nonalcoholic fatty liver disease: Validating the European Liver Fibrosis Panel and exploring simple markers Hepatology. doi: 10.1002/hep.21984. in press. [DOI] [PubMed] [Google Scholar]
  • 19.Angulo P, Hui JM, Marchesini G, Bugianesi E, George J, Farrell GC, Enders F, Saksena S, Burt AD, Bida JP, Lindor K, Sanderson SO, Lenzi M, Adams LA, Kench J, Therneau TM, Day CP. The NAFLD fibrosis score: a noninvasive system that identifies liver fibrosis in patients with NAFLD. Hepatology. 2007;45:846–54. doi: 10.1002/hep.21496. [DOI] [PubMed] [Google Scholar]
  • 20.Poynard R, Imbert-Bismut I, Ratzui V, Chevret S, Jardel C, Moussalli J, Messous D, Degos F GERMED cyt04 groups. Biochemical markers of liver fibrosis inpatients infected by hepatitis C virius: longitudinal validation in a randomized trial. J Virol Hepatitis. 2002;9:128–133. doi: 10.1046/j.1365-2893.2002.00341.x. [DOI] [PubMed] [Google Scholar]
  • 21.Poynard T, McHutchison J, Manns M, Myers RP, Albreacht J. Biochemcial surrogate markers of liver fibrosis and activity in a randomized trial of peginterferon alfa-2b and ribavirin. Hepatology. 2003;38:481–492. doi: 10.1053/jhep.2003.50319. [DOI] [PubMed] [Google Scholar]
  • 22.Myers RP, Ratzui V, Imbert-Bismut F, Charlotte F, Poynard T MULTIVIRC group. Biochemical markers of liver fibrosis :a comparison with historical features in patients with chronic hepatitis c. Am J Gastro. 2002;97:2419–25. doi: 10.1111/j.1572-0241.2002.05997.x. [DOI] [PubMed] [Google Scholar]
  • 23.Myers RP, de Torres M, Imbert-Bismut F, Ratzui V, Charlotte f, Poynard T MULTIVIRC group. Biochemical markers of fibrosis in patient with chronic hepatitis C: a comparison with prothrombin time, platelet count, and age-platelet index. Dig Dis Sci. 2003;48:146–153. doi: 10.1023/a:1021702902681. [DOI] [PubMed] [Google Scholar]
  • 24.Scheuer PJ. Liver Biopsy Size matters in chronic hepatitis: bigger is better. Hepatology. 2003;38:1356–7. doi: 10.1016/j.hep.2003.10.010. [DOI] [PubMed] [Google Scholar]
  • 25.Maharaj B, Maharaj RJ, Leary WP, Coppan RM, Naran AD, Pirie D, Pudfin DJ. Sampling variability and its influence on the diagnostic yield of percutaneous needle biopsy of the liver. Lancet. 1986;8480:523–5. doi: 10.1016/s0140-6736(86)90883-4. [DOI] [PubMed] [Google Scholar]
  • 26.Bedossa P, Dargere D, Paradis V. Sampling variability of liver fibrosis in chronic hepatitis C. Hepatology. 2003;38:1449–57. doi: 10.1016/j.hep.2003.09.022. [DOI] [PubMed] [Google Scholar]
  • 27.Garrido MC, Hubscher SG. Accuracy of staging in primary biliary cirrhosis. Journal of Clinical Pathology. 1996;49:556–9. doi: 10.1136/jcp.49.7.556. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES