Abstract
Background
Acute pancreatitis has a broad clinical spectrum, from mild illness to multiple organ failure and death. Prognostic scores have been developed or adapted to predict disease severity. This study aimed to compare the prognostic scores according to sensitivity and specificity, receiver operating characteristic curves and area under the curve. Statistical correlation with disease severity, length of hospital stay, mortality and complication rates.
Methods
Retrospective analysis of the clinical data of patients admitted to an Internal Medicine ward with the diagnosis of acute pancreatitis over a ten year period. Evaluation of prognostic scores: Ranson, Glasgow-Imrie, Balthazar, APACHE II (admission and at 48 hours) and C-reactive protein (48 hours), was carried out as well as statistical analysis using Microsoft Excel 2007® and SPSS 16®. The confidence interval used was 95%.
Results
Data from 193 clinical files was collected. However, 67 were excluded due to lack of information. According to the Atlanta criteria, 90 cases were deemed as mild and 36 severe. The mortality rate was 6% and the local complication rate was 9.3%. Ranson, Glasgow and APACHE II scores had significant correlation with mortality. Apart from C-reactive protein levels at 48 hours, all scores had significant correlation with disease severity. The scores with best area under the curve correlation were APACHE II (48 hours): 0.892, Ranson: 0.879, and APACHE II (admission): 0.861.
Conclusions
The most accurate prognostic scores in this study were APACHE II (48 hours) and Ranson. APACHE II at admission was a good indicator, impaired only by high false positive ratio.
Keywords: Acute Pancreatitis, Prognostic Scores, APACHE II, Ranson, Glasgow, Balthazar, C-reactive Protein
Introduction
Acute pancreatitis (AP) diagnostic criteria and outcome prediction were the subject of discussions over the years, but the Atlanta Symposium in 1992 set the standards relating diagnostic criteria and disease severity [1]. Predicting severity is an essential step while evaluating a patient with AP as it allows physicians to stratify disease severity and management strategies [2, 3]. Several prognostic scoring systems based of clinical, laboratorial and radiologic evaluations have been created or adapted to predict outcome, some based on local complications and other reflecting systemic manifestations of AP. Ranson’s score [4] is possibly the most used scoring system created specifically for AP. The Acute Physiology And Chronic Health Evaluation II (APACHE II) [5] scoring system was created to evaluate any severe acute illness and has successfully been used to predict AP severity. Unspecific biomarkers, such as C-reactive protein (CRP) have also been studied as outcome predictors, but it has only been useful for predicting complications, namely necrotizing AP [2, 6].
This study aimed to compare specific, unspecific and morphological based prognostic scoring systems regarding disease severity, according to sensitivity and specificity, receiver operating characteristic curve and area under the curve. Correlation with length of hospital stay, mortality and complication rates was determined in order to evaluate which scores were better predictors of these outcomes.
Methods
The authors carried out a retrospective analysis of the clinical data of patients admitted with AP to an Internal Medicine ward over a period of ten years. Demographic features, analytical and radiological findings, disease severity as well as final outcome were evaluated. Between 1997 and 2006, 193 patients with AP were admitted to our ward, but 67 were excluded due to insufficient information on clinical files. Ranson, Glasgow and APACHE II’s (at admission and at 48 hours) prognostic scores were calculated in all patients. CRP at 48 hours was evaluated in all patients. Balthazar’s scores were calculated only when a CT scan was performed (48 patients). Statistical correlation between these scoring systems and disease severity as defined by the Atlanta criteria [1], mortality and complication rates, as well as length of hospital stay was carried out with confidence interval of 95%.
Definitions
The diagnostic criteria for AP were those defined by the 2006 AP Guidelines [2], as the presence of at least two of the following features: 1) characteristic abdominal pain; 2) elevation over 3 times the upper normal limit of serum amylase/lipase; 3) characteristic features on computer tomography (CT) scan. Over the last twenty years much has been learned about this condition and therapeutic strategies and imaging techniques have improved. In many studies the criteria adopted vary, occasionally reflecting local experience or national guidelines [7]. In this study severe AP was diagnosed according strictly to Atlanta criteria [1] (Table 1). Local complications were determined by CT scan, which was performed only when clinical course was unfavorable, when morphologic changes were detected on transabdominal ultrasonography or based on clinical suspicion.
Table 1. Severe Acute Pancreatitis Criteria.
Diagnose of severe AP | Criteria |
---|---|
Early Prognostic Scores | Ranson ≥ 3 |
APACHE II ≥ 8 | |
Organ Failure | Systolic pressure < 90 mmHg |
PaO2 ≤ 60 mmHg | |
Creatinine > 2.0 mg/L after rehydratation | |
Gastrointestinal bleeding > 500 cc/24 h | |
Local Complications (on CT scan) | Necrosis |
Abcess | |
Pseudocyst |
Ranson
Originally Ranson criteria were created for alcohol-induced AP [4] and were revised in 1979 for gallstone-induced AP [8]. Original Ranson score was used by default as alcohol-induced AP was the most prevalent etiology. When gallstones were found, revised Ranson score was used. The cutoff value accepted in the literature is 3 [2, 6, 9-12].
Glasgow-Imrie
Also known as Glasgow score, it includes eight laboratory criteria and age. Like Ranson, this scoring system can only be calculated at 48 hours. The cutoff value used is also 3 [6, 11, 13].
Balthazar
In 2002 Balthazar created a severity stratification method based on necrosis extent and pancreatic morphologic changes [14]. This scoring system has a maximum of ten points, and patients with scores higher than 6 have a higher rate of complications and death [6, 11, 15, 16].
APACHE II
The Acute Physiology and Chronic Health Evaluation II scoring system was created in 1985 to evaluate any severe acute disease in an Intensive Care Unit setting [2, 5, 6, 11, 12]. Although it was not specifically created to evaluate AP severity, it has been successfully used to predict AP outcome. In this context, APACHE II was included by the Atlanta Symposium, with a cutoff value of 8 [1].
C-reactive protein
CRP is an acute phase reactant synthesized by the liver, with peak serum values occurring within the first 72 hours after symptoms onset [6]. Severity stratification by CRP has been used due to its availability and cost. The late serum peak impairs its utility as a biomarker on admission. Nevertheless, CRP serum level over 15 mg/dL at 48 hours is a good indicator of necrotizing pancreatitis [2, 6, 15, 17, 18].
Statistics
Results are expressed as mean ± standard deviation. Statistical analysis includes Student t-test, Fisher exact test, Pearson’s chi-square test, McNemars test, Mann-Whitney test and Odds ratio. P values less than 0.05 were considered statistically significant. Receiver Operating Characteristic (ROC) curves and Area Under the Curve (AUC) analysis were used to compare prognostic scoring systems. Data was gathered using Microsoft Excel 2007® and analyzed by SPSS 16.0®.
Results
Clinical data
Clinical data was collected in 193 cases. In our series there was clear male prevalence, with a 1.7:1 ratio. The mean age was of 52.42 years (± 19.62) in males and 60.31 (± 19.60) in females. Based on age, there were two peak incidences: between 40 and 50 years and between 70 and 80 years. The most common etiology was alcohol consumption (39.3%), followed by gallstones (24.1%). In 31.9% no identifiable cause was found, but in some cases microlithiasis was suspected. Relating age class, gender and etiology, we could find two patient profiles: the middle-aged male patient with a history of alcohol consumption and the elderly woman with gallstones (Fig. 1).
CT scan was carried out in 48 patients, half had pancreatic morphological changes and 17% also had necrosis. Local complications were found in 18 patients, the most common being pancreatic pseudocyst (56.5%) and aseptic necrosis (30.4%). In our series the mortality rate was 5.7% (11 patients), similar to what is reported by most studies [2]. The mean length of hospital stay was of 10.78 days (± 7.93).
Due to unavailable data, disease severity was determined retrospectively in only 126 patients, 29% of which had severe AP. Patients with severe AP were notably older, 69.89 years (± 17.43) than those with mild disease, 52.06 years (± 19.73). All deaths occurred in patients with severe AP.
Prognostic scores
Ranson
The authors found a sensitivity of 91.2% and specificity of 74.4% related to degree of severity, achieving a good discriminatory ability with AUC of 0.879 (0.818-0.940) (Fig. 2). The high negative predictive value (NPV), 95.7%, allows this score to exclude severe AP outcome (Table 2).There was significant correlation between disease severity and Ranson score 3 or above, with odds ratio of 30.131 (8.401-107.857, P < 0.001).
Table 2. Value of Prognostic Scores in the Prediction of Severe Acute Pancreatitis.
Sensitivity (%) | Specificity (%) | PPV (%) | NPV (%) | AUC | |
---|---|---|---|---|---|
Ranson | 91.2 | 74.4 | 57.4 | 95.7 | 0.879 (0.818-0.940) |
Glasgow | 73.5 | 71.1 | 49.0 | 87.7 | 0.805 (0.724-0.886) |
Balthazar | 26.7 | 100.0 | 100.0 | 68.6 | 0.715 (0.528-0.902) |
APACHE II (0 h) | 83.3 | 68.9 | 51.7 | 91.2 | 0.861 (0.785-0.938) |
APACHE II (48 h) | 79.4 | 83.1 | 64.3 | 91.4 | 0.892 (0.832-0.953) |
CRP (48 h) | 44.4 | 70.7 | 41.4 | 73.2 | 0.683 (0.564-0.802) |
PPV: Positive predictive value; NPV: negative predictive value; AUC: Area under the curve.
Regarding mortality, as there were no deaths in the group with Ranson score less than 3, odds ratio was impossible to determine. However, by Fisher’s exact test there was significant evidence that there was increased mortality in the group with Ranson score equal or above 3 (P = 0.001). There was no statistical difference regarding complication rates between the groups (P = 0.171). Concerning length of hospital stay, there was significant difference between the medians by the Mann-Whitney test (P = 0.016) (Table 3).
Table 3. Correlation of Prognostic Scores With Study Endpoints.
Severity | Mortality | Complications | Length of Stay | |
---|---|---|---|---|
Ranson | 30.101 (8.401-107.857)a P < 0.001b |
*** P = 0.001c |
2.045 (0.723-5.786)a P = 0.171b |
8 versus 12 days P = 0.016d |
Glasgow | 6.838 (2.814-16.615)a P < 0.001b |
*** P = 0.001c |
1.323 (0.473-3.698)a P = 0.593b |
8 versus 12 days P = 0.003d |
Balthazar |
*** P = 0.017c |
14.333 (0.707-290.431)a P = 0.161c |
*** P = 0.012c |
17 versus 37 days P < 0.001d |
APACHE II (0 h) | 11.071 (4.140-29.605)a P < 0.001b |
*** P < 0.001c |
1.049 (0.377-2.921)a P = 0.927b |
8 versus 11 days P = 0.029d |
APACHE II (48 h) | 19.029 (7.003-51.701)a P < 0.001b |
*** P < 0.001c |
1.882 (0.668-5.304)a P = 0.227b |
9 versus 13 days P = 0.010d |
CRP (48 h) | 1.929 (0.749-4.972)a P = 0.171b |
2.375 (0.559-10.086)a P = 0.252c |
6.600 (2.090-20.843)a P = 0.001c |
7 versus 13,5 days P < 0.001d |
Odds ratio not calculated; a: Odds Ratio; b: Chi-square test; c: Fisher exact test; d: Mann-Whitney test.
Glasgow-Imrie
In this case series this score was slightly inferior to Ranson, as sensitivity was 74.5% and specificity 71.1% (Table 2). This score also had good discriminatory ability, as the AUC was 0.805 (0.724-0.886) (Fig. 2).
Significant correlation with disease severity was found, with odds ratio of 6.838 (2.814-16.615, P < 0.001). As in the Ranson score analysis, as there were no deaths in one of the groups, odds ratio was impossible to determine. There were differences between groups by Fisher exact test, with increased mortality in the group with Glasgow score equal or above 3 (P = 0.001). There was no statistical difference between groups regarding complication rates (P = 0.593). The median length of stay was significantly superior in those patients with at least 3 Glasgow criteria (P = 0.003) (Table 3).
Balthazar
CT scan was performed only in 48 patients, based on clinical evaluation and suspicion of complications. Therefore our results may be biased by pretest probability and small sample size (only 4 patients in the group with higher score). Using 6 as cutoff value, the specificity was 100% but with very low sensitivity, namely 26.7% (Table 2). With AUC of 0.715 (0.528-0.902), discrimination ability was only passable (Fig. 2).
As there were no severe cases of AP in the group with lower Balthazar score, odds ratio was not determined. However, Fisher’s exact test showed that the group with the higher Balthazar score included more severe cases (P = 0.017). Concerning mortality, there was no significant difference between groups (P = 0.161), but odds ratio lacks precision due to sample size. There was significant correlation with complication rate (Fisher exact test, P = 0.012), but odds ratio was not determinable. There was significant difference between length of stay medians as calculated by the Mann-Whitney test (P < 0.001) (Table 3).
APACHE II
Although not created specifically for risk stratification in AP, the APACHE II score at admission and at 48 hours had a good discriminatory ability. Using 8 as cutoff value, the AUC at admission was 0.861 (0.785-0.938), increasing to 0.892 (0.832-0.953) at 48 hours (Fig. 2). PPV, NPV and specificity also improved over time. Sensitivity decreased slightly at 48 hours (83.3% to 79.4%) (Table 2).
There was significant correlation with disease severity. At admission the odds ratio was 11.071 (4.140-29.605, P < 0.001) and at 48 hours was 19.029 (7.003-51.701, P < 0.001). Regarding mortality, as there were no deaths in patients with APACHE II lower than 8, the odds ratio was not possible to determine. However, Fisher exact test showed significant correlation in both evaluations (P < 0.001). No significant association between APACHE II score and complication rate was found (P = 0.927; P = 0.227). Length of hospital stay correlated with APACHE II score, at admission (P = 0.029) and at 48 hours (P = 0.003) (Table 3).
C-reactive protein
In our study CRP value was not a good marker for disease severity (P = 0.171) or mortality (P = 0.252). However, with the exception of Balthazar score, this was the only score capable of predicting the occurrence of local complications, with odds ratio of 6.600 (2.090-20.843, P = 0.001). Hospital length of stay was also associated with higher CRP values at 48 hours (P < 0.001) (Table 2).
Regarding the prediction of severity, CRP had a low sensitivity (44.4%) and specificity (70.7%) (Table 3) and its discriminatory ability was poor, with AUC: 0.683 (0.564-0.802) (Fig. 2).
Discussion
Prognostic scores were created or adapted in AP to predict disease severity. In this context, APACHE II and Ranson scores were the most accurate among those evaluated. This finding was concordant with several previous studies [2, 6, 18].
APACHE II (48 hours) had the most powerful specificity and sensitivity, but Ranson had the best negative predictive value. The main drawback of Ranson score is the time interval required for its calculation. APACHE II score at admission were slightly less reliable, but proved to be a useful screening score, with very good negative predictive values. The NPV and PPV values obtained are similar to those described by Chatzicostas et al [19], but in the present study no statistical difference was found between Ranson and APACHE II AUC’s values. Different APACHE II cutoff values were used in our study as compared to Chatzicostas’s, thus explaining this disparity. The cutoff value used for APACHE II score was based on American College of Gastroenterology guidelines [2] and the Atlanta Symposium [1]. Naturally, when APACHE II score was repeated in subsequent days, specificity and positive predictive values improved. Therefore, serial evaluation of APACHE II’s score may prove to be valuable regarding disease severity and clinical outcome, with direct consequences on the level of monitoring and management of patients with AP. The Glasgow-Imrie score was inferior to both Ranson and APACHE II, therefore its usefulness is questionable.
The results obtained for Balthazar score was as expected according to evidence described in literature [2, 14]. However, in our analysis Balthazar score results may have been biased by several factors: small sample size, small number of subjects in one of the subgroups and the pretest probability, as CT scan was only performed in selected patients based on clinical evolution. Consequently, the authors cannot present valid conclusions regarding this prognostic score. In order to obtain reliable data, a prospective study should be done, with CT scan performed in all patients, regardless of clinical evaluation and prognostic scores. Gürleyik et al [15], presented a small prospective study with these characteristics and concluded that Balthazar has a better accuracy than APACHE II scores. Very high accuracy in disease severity prediction was shown by Balthazar when this scoring system was presented [14].
Other imaging techniques, as magnetic resonance imaging (MRI) and endoscopic ultrasound (EUS), have improved significantly over the past years and have become increasingly available. Several studies showed potential for the usage of these techniques on the evaluation of AP. CT scan usage is impaired by the usage of radiation and the risk of contrast induced nephrotoxicity. MRI does not have these problems and has proved to be superior in evaluation of mild AP, peripancreatic fat infiltration and pancreatic and biliary ducts assessment. There is significant concordance between MRI and CT scan, as well as with clinical course. However, no correlation was found with APACHE II, therefore MRI has limited role in determining systemic complications [20]. Several MRI protocols can be used in order to increase diagnostic accuracy [21]. Transabdominal ultrasonography is not accurate in many cases due to overlying gas and retroperitoneal location of the pancreas. EUS has better resolution and can detect the presence of microlithiasis, occult pancreatic neoplasms and pancreas divisum, which can cause AP [22, 23]. The presence of peripancreatic edema in EUS is associated with disease severity [23], but more studies are needed to validate the role of EUS in the staging and prognosis of AP. In this study identifiable cause was found in almost one third of the patients, therefore EUS could have been useful.
CRP value at 48 hours was not a useful prognostic score for disease severity or mortality. However, in our sample it was the only score that predicted the occurrence of local complications with statistical significance (Balthazar score results were disregarded due to eventual bias). These findings were expected, as similar results had been previously described by Rau [3].
These stratification scores are adjunctive to each other. The authors suggest that the evaluation strategy of patients with AP should include APACHE II’s at admission, followed by daily reassessments. CRP must be determined at 48 hours as it can predict the occurrence of local complications. Based on APACHE II’s scoring system and CRP value, contrast-enhanced CT scan should be performed in selected patients, 48-72 hours post admission. EUS and MRI can also be used, when there is suspicion of microlithiasis or local complications. However, as their role is not yet perfectly defined, we cannot advise the routine usage of these techniques. Perhaps a revision of the Atlanta criteria is needed in order to reflect the evolution of knowledge and imaging techniques, as well as other prognostic scores that have been created.
Acknowledgments
The authors have no potential conflicts of interest. No funding was granted to support this study.
References
- 1.Bradley EL 3rd. A clinically based classification system for acute pancreatitis. Summary of the International Symposium on Acute Pancreatitis, Atlanta, Ga, September 11 through 13, 1992. Arch Surg. 1993;128(5):586–590. doi: 10.1001/archsurg.1993.01420170122019. [DOI] [PubMed] [Google Scholar]
- 2.Banks PA, Freeman ML. Practice guidelines in acute pancreatitis. Am J Gastroenterol. 2006;101(10):2379–2400. doi: 10.1111/j.1572-0241.2006.00856.x. [DOI] [PubMed] [Google Scholar]
- 3.Rau BM. Predicting severity of acute pancreatitis. Curr Gastroenterol Rep. 2007;9(2):107–115. doi: 10.1007/s11894-007-0004-5. [DOI] [PubMed] [Google Scholar]
- 4.Ranson JH, Rifkind KM, Roses DF, Fink SD, Eng K, Spencer FC. Prognostic signs and the role of operative management in acute pancreatitis. Surg Gynecol Obstet. 1974;139(1):69–81. [PubMed] [Google Scholar]
- 5.Knaus WA, Draper EA, Wagner DP, Zimmerman JE. APACHE II: a severity of disease classification system. Crit Care Med. 1985;13(10):818–829. doi: 10.1097/00003246-198510000-00009. [DOI] [PubMed] [Google Scholar]
- 6.Schutte K, Malfertheiner P. Markers for predicting severity and progression of acute pancreatitis. Best Pract Res Clin Gastroenterol. 2008;22(1):75–90. doi: 10.1016/j.bpg.2007.10.013. [DOI] [PubMed] [Google Scholar]
- 7.Bollen TL, van Santvoort HC, Besselink MG, van Leeuwen MS, Horvath KD, Freeny PC, Gooszen HG. The Atlanta Classification of acute pancreatitis revisited. Br J Surg. 2008;95(1):6–21. doi: 10.1002/bjs.6010. [DOI] [PubMed] [Google Scholar]
- 8.Ranson JH. The timing ofÍndices de Gravidade. Ann Surg. 1979;189(5):654–63. doi: 10.1097/00000658-197905000-00016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Mergener K, Baillie J. Acute pancreatitis. BMJ. 1998;316(7124):44–48. doi: 10.1136/bmj.316.7124.44. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Matos Ricardo. Índices de Gravidade. Revista Portuguesa de Medicina Intensiva. 2000;9(2):159–62. [Google Scholar]
- 11.Carroll JK, Herrick B, Gipson T, Lee SP. Acute pancreatitis: diagnosis, prognosis, and treatment. Am Fam Physician. 2007;75(10):1513–1520. [PubMed] [Google Scholar]
- 12.Whitcomb DC. Clinical practice. Acute pancreatitis. N Engl J Med. 2006;354(20):2142–2150. doi: 10.1056/NEJMcp054958. [DOI] [PubMed] [Google Scholar]
- 13.Blamey SL, Imrie CW, O'Neill J, Gilmour WH, Carter DC. Prognostic factors in acute pancreatitis. Gut. 1984;25(12):1340–1346. doi: 10.1136/gut.25.12.1340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Balthazar EJ. Acute pancreatitis: assessment of severity with clinical and CT evaluation. Radiology. 2002;223(3):603–613. doi: 10.1148/radiol.2233010680. [DOI] [PubMed] [Google Scholar]
- 15.Gurleyik G, Emir S, Kilicoglu G, Arman A, Saglam A. Computed tomography severity index, APACHE II score, and serum CRP concentration for predicting the severity of acute pancreatitis. JOP. 2005;6(6):562–567. [PubMed] [Google Scholar]
- 16.Leung TK, Lee CM, Lin SY, Chen HC, Wang HJ, Shen LK, Chen YY. Balthazar computed tomography severity index is superior to Ranson criteria and APACHE II scoring system in predicting acute pancreatitis outcome. World J Gastroenterol. 2005;11(38):6049–6052. doi: 10.3748/wjg.v11.i38.6049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Sandberg AA, Borgstrom A. Early prediction of severity in acute pancreatitis. Is this possible? JOP. 2002;3(5):116–125. [PubMed] [Google Scholar]
- 18.De Campos T, Cerqueira C, Kuryura L, Parreira JG, Solda S, Perlingeiro JA, Assef JC. et al. Morbimortality indicators in severe acute pancreatitis. JOP. 2008;9(6):690–697. [PubMed] [Google Scholar]
- 19.Chatzicostas C, Roussomoustakaki M, Vlachonikolis IG, Notas G, Mouzas I, Samonakis D, Kouroumalis EA. Comparison of Ranson, APACHE II and APACHE III scoring systems in acute pancreatitis. Pancreas. 2002;25(4):331–335. doi: 10.1097/00006676-200211000-00002. [DOI] [PubMed] [Google Scholar]
- 20.Tang W, Zhang XM, Xiao B, Zeng NL, Pan HS, Feng ZS, Xu XX. Magnetic resonance imaging versus Acute Physiology And Chronic Healthy Evaluation II score in predicting the severity of acute pancreatitis. Eur J Radiol. 2010 doi: 10.1016/j.ejrad.2010.08.020. [DOI] [PubMed] [Google Scholar]
- 21.Xiao B, Zhang XM, Tang W, Zeng NL, Zhai ZH. Magnetic resonance imaging for local complications of acute pancreatitis: a pictorial review. World J Gastroenterol. 2010;16(22):2735–2742. doi: 10.3748/wjg.v16.i22.2735. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Kotwal V, Talukdar R, Levy M, Vege SS. Role of endoscopic ultrasound during hospitalization for acute pancreatitis. World J Gastroenterol. 2010;16(39):4888–4891. doi: 10.3748/wjg.v16.i39.4888. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Sotoudehmanesh R, Hooshyar A, Kolahdoozan S, Zeinali F, Shahraeeni S, Keshtkar AA. Prognostic value of endoscopic ultrasound in acute pancreatitis. Pancreatology. 2010;10(6):702–706. doi: 10.1159/000320695. [DOI] [PubMed] [Google Scholar]