Abstract
Background/Aims
Interobserver variation by experience was documented for the diagnosis of esophagitis using the Los Angeles classification. The aim of this study was to evaluate whether interobserver agreement can be improved by higher levels of endoscopic experience in the diagnosis of erosive esophagitis.
Methods
Endoscopic images of 51 patients with gastroesophageal reflux disease (GERD) symptoms were obtained with conventional endoscopy and optimal band imaging (OBI). Endoscopists were divided into an expert group (16 gastroenterologic endoscopic specialists guaranteed by the Korean Society of Gastrointestinal Endoscopy) and a trainee group (individuals with fellowships, first year of specialty training in gastroenterology). All endoscopists had no or minimal experience with OBI. GERD was diagnosed using the Los Angeles classification with or without OBI.
Results
The mean weighted paired κ statistics for interobserver agreement in grading erosive esophagitis by conventional endoscopy in the expert group was better than that in the trainee group (0.51 vs 0.42, p<0.05). The mean weighted paired k statistics in the expert group and in the trainee group based on conventional endoscopy with OBI did not differ (0.42, 0.42).
Conclusions
Interobserver agreement in the expert group using conventional endoscopy was better than that in the trainee group. Endoscopic experience can improve the interobserver agreement in the grading of esophagitis using the Los Angeles classification.
Keywords: Gastroesophageal reflux, Agreement, Experience
INTRODUCTION
Gastroesophageal reflux disease (GERD) is a condition characterized by heartburn or regurgitation caused by esophageal exposure to acidic gastric content.1 Due to the increasing prevalence of GERD in Asia, the social and medical importance of GERD is increasing.
Several diagnostic tools have been developed for diagnosis of GERD; however, endoscopy is the major tool for assessing esophagitis. Detection of erosion or ulcer at the gastroesophgeal junction by endoscopy can allow for estimation of patients' symptoms and can be used in design of a treatment plan. Several classification systems have been introduced for improvement of the consistency of diagnosis of GERD.2,3 Los Angeles (LA) classification was recently introduced and is used as a mainstay for diagnosis of erosive esophagitis, because of good correlation between esophagitis grade and acid exposure, heartburn severity, and risk of symptom relapse.2,4 However, this system also has several limitations, including low interobserver agreement in trainees and low symptom correlation in endoscopy-negative reflux disease (NERD) or minimal change on the gastroesophageal junction.5,6
The optimal band imaging (OBI) system was developed as a new type of computed virtual chromoendoscopy and was introduced as flexible spectral-imaging color enhancement (FICE) by Fujinon Corporation (Saitama, Japan). This system can enhance the mucosal surface and mucosal microvasculature without use of dye. Several studies have reported that this system was very useful for diagnosis of various GI diseases, such as detection of depressed-type early gastric cancer and differential diagnosis of colonic polyps.7-11 Recently, a narrow band imaging (NBI) system was found to increase the interobserver consistency in diagnosis of GERD.12 Several previous studies reported that the level of experience influenced the interobserver agreement for diagnosis GERD by LA classification. But, a study with many endoscopists to confirm interobserver agreement to diagnose GERD by LA classification is limited.
In this study, we evaluated that interobserver agreement in diagnosis of erosive esophagitis using conventional endoscopy with or without OBI can be improved by the level of convention endoscopic and OBI system experience.
MATERIALS AND METHODS
We retrospectively recruited patients with symptoms suggestive of GERD between January 2009 and December 2009. All subjects underwent conventional endoscopy with the OBI system using video endoscopy (EG-590WR; Fujinon Corp.). The FICE process was performed with R 520 nm, G 500 nm, and B 405 nm. Fifty-one static photographs of the esophagogastric junction were selected according to their resolution. All names or dates were removed from the images.
Twenty-six endoscopists participated in this study for evaluation of interobserver agreement for diagnosis of GERD. We divided 26 endoscopists into two group, expert group and trainee group. Expert group was composed of 16 endoscopists (gastroenterologic endoscopic specialist guaranteed by Korean Society of Gastroinetestinal Endoscopy) with more than 4 years conventional endoscopy experience and no or minimal OBI system experience. Trainee group was composed of 10 endoscopists with minimal conventional and OBI system experience. Sixteen expert endoscopists were gastroenterologist specialists and 10 trainees were in their first year of specialty training in gastroenterology. The definition of minimal experience for OBI is defined when endoscopist has expericence for OBI without experience for diagnosis of GERD by OBI.
To enhance the consistency in diagnosing GERD, introduction for OBI system and examples of GERD by OBI system was introduced to investigators before test. The endoscopic grade of GERD was determined using LA classification, which was presented at a symposium World Congress of Gastroenterology in Los Angeles in 1994. Mucosal breaks were graded as normal, A, B, C, and D according to the extent. The still image of the esophagogastric junction with a ruler was provided in order to minimize interobserver variation on examination of the extent of the mucosal break. The 51 still images were shown to 26 investigators using a video projector for 10 seconds. First, investigators recorded the grade of GERD by only conventional endoscopy image for 51 still images. Second, investigators recorded the grade of GERD by conventional endoscopy image and the OBI system image. Representative images on conventional and OBI endoscopy images are shown in Fig. 1.
Fig. 1.
(A) A still image obtained using conventional endoscopy in a patient without erosive esophagitis. (B) A still image of optimal band imaging in the same patient.
1. Statistical analysis
Our primary interest was to compare interobserver agreement in the interpretation of conventional images with or without the combination of OBI system image. The index we used to assess this agreement was the κ statistic, a measure of agreement beyond chance. A κ-value greater than 0.8 denoted excellent agreement, 0.8 to 0.6 denoted good agreement, 0.6 to 0.4 denoted fair agreement, and less than 0.4 denoted poor agreement. A p-value of less than 0.05 was regarded as statistically significant. The analysis was performed using SPSS version 13.0 (SPSS Inc., Chicago, IL, USA) and MedCalc (Medcalc Software, Mariakerke, Belgium).
RESULTS
Endoscopists experience for OBI and opinion on efficacy of OBI for diagnosis of GERD by LA classification are shown in Table 1. Experience of conventional endoscopy in expert group was larger than that of trainee group. But, most endoscopists in this study had no or minimal experience in use of OBI in grading of esophagitis.
Table 1.
Experience with OBI for the Diagnosis of GERD and Opinions on the Efficacy of OBI in the Diagnosis of GERD according to the Los Angeles Classification
Two subjects declined to provide their opinion on the efficacy of OBI in the diagnosis of GERD.
OBI, optimal band imaging; GERD, gastroesophageal reflux disease.
We obtained 1,326 diagnoses of GERD from 26 endoscopists and 51 patients. Erosive esophagitis was diagnosed in 376 (28.4%) by conventional endoscopy only (Table 2). The numbers for grades A, B, C, and D erosive esophagitis by conventional endoscopy were 220 (16.6%), 102 (7.7%), 39 (2.9%), and 15 (1.1%), respectively. The most frequent diagnosis in conventional endoscopy without OBI or with OBI system was normal (NERD).
Table 2.
Correlation between Diagnoses Obtained Using Conventional Endoscopy versus OBI by 26 Evaluators
OBI, optimal band imaging.
Erosive esophagitis using conventional endoscopy with OBI was increased from 376 (28.4%) to 425 (32.1%) (Table 2). Grade A erosive esophagitis by LA classification showed an increase, from 220 (16.6%) to 291 (21.9%) after using the OBI system. The number of patients diagnosed as normal by conventional endoscopy, but diagnosed as grade A esophagitis using OBI was 128 (43.9%, 128 of 291).
The mean weighted paired κ statistics for interobserver agreement in grading of erosive esophagitis in the expert group and trainee group by conventional endoscopy was 0.51 and 0.42, respectively (p<0.05) (Tables 3 and 4). The proportion of fair agreement and good agreement in the expert group by conventional endoscopy was 48.3% (58 of 120) and 27.5% (33 of 120). The proportion of above fair agreement in the expert group by conventional endoscopy only was 75.8% (91 of 120). The proportion of fair agreement and good agreement in the trainee group by conventional endoscopy only was 53.3% (24 of 45) and 8.9% (4 of 45). The proportion of above fair agreement in the trainee group by conventional endoscopy only was 62.2% (28 of 45).
Table 3.
Weighted Paired κ Statistics for Interobserver Consistency in the Grading of Esophagitis in the Expert Group Based on Conventional Imaging
Table 4.
Weighted Paired κ Statistics for Interobserver Consistency in the Grading of Esophagitis in the Trainee Group Based on Conventional Imaging
The mean weighted paired κ statistics in the expert group and trainee group by conventional endoscopy with OBI was 0.42 and 0.42, respectively (Tables 5 and 6). The proportion of fair agreement and good agreement in the expert group by conventional endoscopy using OBI was 34.1% (41 of 120) and 15% (18 of 120). The proportion of above fair agreement in the expert group by conventional endoscopy only was 49.1% (59 of 120). The proportion of fair agreement and good agreement in the trainee group by conventional endoscopy using OBI was 57.5% (26 of 45) and 6.6% (3 of 45). The proportion of above fair agreement in the trainee group by conventional endoscopy only was 64.4% (29 of 45).
Table 5.
Weighted Paired κ Statistics for Interobserver Consistency in the Grading of Esophagitis in the Expert Group Based on Conventional Imaging with OBI
OBI, optimal band imaging.
Table 6.
Weighted Paired κ Statistics for Interobserver Consistency in the Grading of Esophagitis in the Trainee Group Based on Conventional Imaging with OBI
OBI, optimal band imaging.
DISCUSSION
GERD can be diagnosed by several diagnostic modalities, including traditional endoscopy, pH testing, and questionnaire. So far, the diagnosis of GERD using endoscopy is a mainstay in the clinical field; LA classification system is a popular system for grading the esophagitis. Interobserver variation for diagnosis of GERD using LA classification and virtual chromoendoscopy (NBI) was recently introduced by Lee et al.12 But, no study regarding diagnosis of GERD using traditional endoscopy with OBI has been reported so far.
We conducted this study in order to confirm whether endoscopic experience could influence interobserver consistency for diagnosis of GERD by LA classification and for evaluation of whether OBI could improve interobserver consistency for diagnosis of GERD in each expert and trainee endoscopist group. Interobserver consistency with conventional endoscopy showed fair agreement (0.41 to 0.60) in both groups. However, consistency in the expert group was better than that in the trainee group (0.51 vs 0.42, p<0.05). In addition, the proportion of the above fair agreement with conventional endoscopy in the expert group was better than that in the trainee group (75.8% vs 62.2%). This result is similar to those of several previous studies.3,6
Interobserver consistency for diagnosis of GERD using conventional endoscopy with OBI showed fair agreement in both groups. The consistency was 0.42 in both groups. The proportion of the above fair agreement in the expert group and the trainee group was 49.1% and 64.4%, respectively.
We think that OBI cannot improve the consistency of GERD diagnosis because of the following problems. First, most observers have no or minimal experience in use of OBI system for diagnosing GERD. The proportion of those who have experience in use of OBI system was 12.5% and 40% in the expert group and the trainee group, respectively. The lack of experience in use of OBI system in the expert group may influence the reduction of interobserver consistency after adding OBI images.
A previous study on experience in performance of virtual chromoendoscopy concluded that diagnostic accuracy could improve for less-experienced endoscopists after participation in an expanded training program.13 Thus, study to determine whether a training program for OBI can improve diagnostic accuracy and interobserver consistency is needed.
Secondly, we used a still image instead of video image in grading of esophagitis by LA classification. In addition, time duration of the slide was only 10 seconds. This method may influence the negative effect for interobserver consistency.
In conclusion, experience in diagnosis of GERD by endoscopy improves interobserver agreement in grading of esophagitis by LA classification.
Footnotes
No potential conflict of interest relevant to this article was reported.
References
- 1.Vakil N, van Zanten SV, Kahrilas P, Dent J, Jones R Global Consensus Group. The Montreal definition and classification of gastroesophageal reflux disease: a global evidence-based consensus. Am J Gastroenterol. 2006;101:1900–1920. doi: 10.1111/j.1572-0241.2006.00630.x. [DOI] [PubMed] [Google Scholar]
- 2.Lundell LR, Dent J, Bennett JR, et al. Endoscopic assessment of oesophagitis: clinical and functional correlates and further validation of the Los Angeles classification. Gut. 1999;45:172–180. doi: 10.1136/gut.45.2.172. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Rath HC, Timmer A, Kunkel C, et al. Comparison of interobserver agreement for different scoring systems for reflux esophagitis: impact of level of experience. Gastrointest Endosc. 2004;60:44–49. doi: 10.1016/s0016-5107(04)01289-1. [DOI] [PubMed] [Google Scholar]
- 4.Armstrong D, Bennett JR, Blum AL, et al. The endoscopic assessment of esophagitis: a progress report on observer agreement. Gastroenterology. 1996;111:85–92. doi: 10.1053/gast.1996.v111.pm8698230. [DOI] [PubMed] [Google Scholar]
- 5.Kusano M, Ino K, Yamada T, et al. Interobserver and intraobserver variation in endoscopic assessment of GERD using the "Los Angeles" classification. Gastrointest Endosc. 1999;49:700–704. doi: 10.1016/s0016-5107(99)70285-3. [DOI] [PubMed] [Google Scholar]
- 6.Pandolfino JE, Vakil NB, Kahrilas PJ. Comparison of inter- and intraobserver consistency for grading of esophagitis by expert and trainee endoscopists. Gastrointest Endosc. 2002;56:639–643. doi: 10.1067/mge.2002.129220. [DOI] [PubMed] [Google Scholar]
- 7.Mouri R, Yoshida S, Tanaka S, Oka S, Yoshihara M, Chayama K. Evaluation and validation of computed virtual chromoendoscopy in early gastric cancer. Gastrointest Endosc. 2009;69:1052–1058. doi: 10.1016/j.gie.2008.08.032. [DOI] [PubMed] [Google Scholar]
- 8.Togashi K, Osawa H, Koinuma K, et al. A comparison of conventional endoscopy, chromoendoscopy, and the optimal-band imaging system for the differentiation of neoplastic and non-neoplastic colonic polyps. Gastrointest Endosc. 2009;69(3 Pt 2):734–741. doi: 10.1016/j.gie.2008.10.063. [DOI] [PubMed] [Google Scholar]
- 9.Osawa H, Yoshizawa M, Yamamoto H, et al. Optimal band imaging system can facilitate detection of changes in depressed-type early gastric cancer. Gastrointest Endosc. 2008;67:226–234. doi: 10.1016/j.gie.2007.06.067. [DOI] [PubMed] [Google Scholar]
- 10.Cha JM, Lee JI, Joo KR, Jung SW, Shin HP. A prospective randomized study on computed virtual chromoendoscopy versus conventional colonoscopy for the detection of small colorectal adenomas. Dig Dis Sci. 2010;55:2357–2364. doi: 10.1007/s10620-009-1003-1. [DOI] [PubMed] [Google Scholar]
- 11.Pohl J, Lotterer E, Balzer C, et al. Computed virtual chromoendoscopy versus standard colonoscopy with targeted indigocarmine chromoscopy: a randomised multicentre trial. Gut. 2009;58:73–78. doi: 10.1136/gut.2008.153601. [DOI] [PubMed] [Google Scholar]
- 12.Lee YC, Lin JT, Chiu HM, et al. Intraobserver and interobserver consistency for grading esophagitis with narrow-band imaging. Gastrointest Endosc. 2007;66:230–236. doi: 10.1016/j.gie.2006.10.056. [DOI] [PubMed] [Google Scholar]
- 13.Higashi R, Uraoka T, Kato J, et al. Diagnostic accuracy of narrow-band imaging and pit pattern analysis significantly improved for less-experienced endoscopists after an expanded training program. Gastrointest Endosc. 2010;72:127–135. doi: 10.1016/j.gie.2010.01.054. [DOI] [PubMed] [Google Scholar]