Abstract
AIM: To evaluate the three-dimensional (3-D) representation performance of 4 publicly available Shape-from-Shading (SfS) algorithms in small-bowel capsule endoscopy (SBCE).
METHODS: SfS techniques recover the shape of objects using the gradual variation of shading. There are 4 publicly available SfS algorithms. To the best of our knowledge, no comparative study with images obtained during clinical SBCE has been performed to date. Three experienced reviewers were asked to evaluate 54 two-dimensional (2-D) images (categories: protrusion/inflammation/vascular) transformed to 3-D by the aforementioned SfS 3-D algorithms. The best algorithm was selected and inter-rater agreement was calculated.
RESULTS: Four publicly available SfS algorithms were compared. Tsai’s SfS algorithm outperformed the rest (selected as best performing in 45/54 SBCE images), followed by Ciuti’s algorithm (best performing in 7/54 images) and Torreão’s (in 1/54 images). In 26/54 images; Tsai’s algorithm was unanimously selected as the best performing 3-D representation SfS software. Tsai’s 3-D algorithm superiority was independent of lesion category (protrusion/inflammatory/vascular; P = 0.678) and/or CE system used to obtain the 2-D images (MiroCam®/PillCam®; P = 0.558). Lastly, the inter-observer agreement was good (kappa = 0.55).
CONCLUSION: 3-D representation software offers a plausible alternative for 3-D representation of conventional capsule endoscopy images (until optics technology matures enough to allow hardware enabled-“real” 3-D reconstruction of the gastrointestinal tract).
Keywords: Capsule endoscopy, Small-bowel, Three-dimensional, Software, Algorithm, Reconstruction, Technology, Advance
Core tip: Accurate three-dimensional (3-D) reconstruction of the gastrointestinal tract requires the use of stereo-cameras that can simulate human binocular vision. In the absence of such technology in capsule endoscopy, we rely on software approaches [such as the Shape-from-Shading (SfS) algorithms] to obtain 3-D representation of digestive tract structures. In the present study, we evaluated the use of 4 publically available SfS in capsule endoscopy. 3 experienced/experts reviewers concluded that Tsai’s approach is the best of the four available algorithms.
INTRODUCTION
Capsule endoscopy (CE) has changed our diagnostic approach for small-bowel diseases[1,2]. Although more accurate and of higher diagnostic yield than other modalities[3,4], there are still occasions where pathology is either missed or misinterpreted[5-7]. Furthermore, reports have shown that three-dimensional (3-D) reconstruction can facilitate diagnosis by enhancing textural features of mucosal structures or intestinal abnormalities[8,9]. However, accurate 3-D reconstruction of the gastrointestinal (GI) tract requires the use of stereoscopic cameras that can simulate human binocular vision[10,11]. With the current level of technological investment in CE though i.e., camera size, packaging constraints and power consumption, accurate 3-D imaging of the intestinal lumen in small-bowel capsule endoscopy (SBCE) is still unfeasible[9,12].
Therefore, software approaches that offer 3-D representation of conventional monocular two-dimensional (2-D) CE frames have been developed[13] and proposed for use in CE[14]. Such approaches e.g., Shape-from-Shading (SfS) algorithms, are members of a family of shape recovery algorithms called shape-from-X techniques (Figure 1)[13]. Given a single 2-D image, these algorithms recover the shape of objects using the gradual variation of shading[13]. Essentially, surface ‘‘reconstruction’’ with SfS is achieved through a mathematical representation that is inverted in order to recover dense surface distance and normal information by the gradual variation of shading[13]. We were able to retrieve 4 publicly available SfS algorithms[15-18]. To the best of our knowledge, no comparative study with images obtained during clinical SBCE has been performed to date[19]. We aimed to evaluate the 3-D representation performance of 4 publicly available SfS algorithms by comparing them with their equivalent 2-D images of small-bowel structures/lesions obtained during SBCE, in order to identify the algorithm more helpful in facilitating identification and distinction between lesion and surrounding mucosa.
MATERIALS AND METHODS
Between January 2011 and January 2012, 262 SBCE procedures were performed at the Royal Infirmary of Edinburgh (tertiary referral centre for CE for the southeast of Scotland, United Kingdom) in 249 patients (mean age: 52.6 ± 12.1 years), as already described elsewhere[9]. Out of them, 140 were performed with PillCam®SB2 (Given®Imaging Ltd., Yokneam, Israel) and 122 with MiroCam® (IntroMedic®Co, Seoul, South Korea). A total of 54 were selected images (27 obtained with MiroCam® and 27 with PillCam®SB) on the basis of the overall quality i.e., brightness, absence of air bubbles, debris, or opaque luminal fluid and clarity of findings (lesions or structures). Thereafter, images were classified in the following image groups: (1) vascular lesions i.e., angioectasias (n = 16); (2) inflammatory lesions i.e., ulcers, erosions, aphthae, cobblestone, fold and/or villous oedema (n = 18); and (3) protruding lesions/structures i.e., polyp/mass, nodular lymphoid hyperplasia, cluster of focal lymphangiectasia, chylous cysts, and ampulla of Vater, (n = 20).
3-D image representation software
All selected images were reconstructed in 3-D by means of all 4 SfS algorithms. Three reviewers (Rondonotti E, Mandelli G, Koulaouzidis A) with extensive CE experience and blinded to each other participated in this study. In order to facilitate the evaluation process, a Mathworks© Matlab program with a graphic user interface (GUI) was developed (Figure 2; a video presenting the evaluation process is provided as supplementary material via this link: https://dl.dropboxusercontent.com/u/7591304/EvaluationVideo.mov). The program consisted of two windows in which the conventional 2-D SBCE image (Figure 2, single frame at the right side/window of the GUI screen) and its corresponding 3-D represented images (four, one for each of the 4 SfS under evaluation) are presented to the reviewer (Figure 2, left side/window of the GUI screen).
The 3-D SfS representations appeared in random order. The reviewers had the ability and freedom to rotate and zoom in each of the 3-D represented images. At the bottom of the GUI screen, a single “task request”: “Choose the 3-D representation you consider most helpful in distinguishing the finding (seen in 2-D) from the surrounding mucosa” appeared. This prompted reviewers to choose one among the four 3-D ‘reconstructed’ images, each generated by a different 3-D algorithm. After selecting the best SfS representation, the reviewer had to click “next” to proceed to the next case. This process was repeated until the program reached the last case after which each separate evaluation was concluded.
Outcome measures
Reviewers were asked to evaluate 54 images. The following subgroup analyses were performed: (1) evaluation of 3-D representation according to the type of finding (vascular vs inflammatory vs protruding); and (2) evaluation according to the system generating the 2-D image (PillCam® vs Mirocam®). Furthermore, inter-observer agreement was calculated.
Ethics consideration
This study was conducted in accordance with United Kingdom research ethics guidelines. After review by the local ethics committee further specific ethical review and approval were not required, as the study was considered an evaluation of previously collected endoscopy images, using data already obtained as part of regular clinical care[20].
Statistical analysis
For numerical variables, values are presented as mean ± SD. Where necessary, the Fisher exact test was calculated. A two-tailed P value < 0.05 was considered statistically significant. Inter-observer agreement was calculated using an online kappa calculator (available from http://justusrandolph.net/kappa/) which provides the calculation of Randolph’s free-marginal multirater kappa [21], applicable when raters are not forced to assign a certain number of cases to each category. Values of kappa can range from -1.0 to 1.0, with -1.0 indicating perfect disagreement below chance, 0.0 indicating agreement equal to chance, and 1.0 indicating perfect agreement above chance. More specifically, the inte is classified per kappa as poor < 0.20, fair 0.2-0.40, good 0.41-0.60, very good 0.61-0.80 and, excellent 0.81-1.00[22]. All other statistical analyses were performed using a statistical package, StatsDirect, StatsDirect Ltd, Altrincham, Cheshire, United Kingdom.
RESULTS
Of the 4 SfS algorithms, Tsai’s 3-D algorithm outperformed the rest (selected as best in 45/54 images), followed by Ciuti’s (best performing SfS in 7/54 images) and Torreão’s (in 1/54 images); there was a single image for which each reviewer selected (as best performing) a different 3-D representation algorithm. Of note, not once was Barron’s 3-D algorithm selected as best performing (Table 1, Figure 3).
Table 1.
SfS method |
Vascular |
Inflammatory |
Protrusion |
|||
PillCam® | MiroCam® | PillCam® | MiroCam® | PillCam® | MiroCam® | |
Tsai | 7 | 7 | 7 | 6 | 8 | 10 |
Ciuti | 1 | 0 | 1 | 0 | 1 | 4 |
Torreão | 0 | 0 | 1 | 0 | 0 | 0 |
Barron | 0 | 0 | 0 | 0 | 0 | 0 |
None selected | 0 | 1 | 0 | 0 | 0 | 0 |
In 26/54 images, Tsai’s algorithm was unanimously selected as the best performing 3-D representation SfS software. Tsai’s 3-D algorithm superiority was independent of lesion category (protrusion/inflammatory/vascular; P = 0.678) and/or CE system used to obtain the 2-D images (MiroCam®/PillCam®; P = 0.558). Lastly, the inter-observer agreement was good (kappa = 0.55).
DISCUSSION
In the present study, we compared the performance of 4 publicly available 3-D ‘‘reconstruction’’ algorithms[15-18] (SfS software) using 54 conventional 2-D CE images. The evaluation criterion was subjective i.e., perceived visualisation improvement (3-D representations offered over the corresponding conventional 2-D images) by 3 experienced CE reviewers. Based on this evaluation, Tsai’s algorithm is the 3-D representation model recommended for use in CE. This outcome directly supports Tsai’s SfS model theoretical advantages: (1) able to produce good results for round surfaces, which are the case for most digestive tract shapes; and (2) it behaves quite well with bright surfaces[13].
Depth information is an important aspect of human vision; it helps human brain to analyse and comprehend the surrounding environment. Images captured with conventional (non-stereoscopic) cameras ‘‘discard’’ the 3rd dimension (depth) as conventional cameras can only save 2 dimensions (height and width). Therefore depth information is lost; and moreover, most imaging algorithms perform less efficiently.
To date, engineers have not been able to equip capsule endoscopes with stereoscopic cameras for the following reasons: (1) packaging/space limitations; (2) low depth resolution of stereoscopic or time-of-flight cameras[22-24]; and (3) power consumption issues. However, it is almost certain that in the foreseeable future these hardware-related limitations will be overcome[11] and eventually 3-D CE will be a commodity. Nevertheless, until hardware changes are widely implemented, several efforts have been made to convert 2-D images into 3-D images (3-D representation or “reconstruction”) through software and dedicated algorithms. There are software algorithms that offer a fair trade-off between 2-D images and hardware-enabled 3-D images. These algorithms are part of a family of shape recovery algorithms called Shape-from-X techniques[13]. Basically a SfS algorithm recovers the shape of objects, given a single monocular image, using the gradual variation of shading[8,13].
SfS algorithms can be divided into four groups: (1) minimization approaches[16-18]; (2) propagation approaches; (3) local approaches; and (4) linear approaches[15]. It is important to remember that each of the 4 SfS algorithms evaluated herein utilizes a different approach to recover the shape from a conventional 2-D image.
More specifically, Tsai et al[15] described an repetitive update of the depth using a linear approximation of the reflectance function. Ciuti et al[16] used a camera model with perspective projection and a light source close to the surface and away from the optical centre to measure depth. Torreão et al[17] applied a linear-nonlinear biological model that mimics neuronal responses to estimate shape. Finally, Barron et al[18] proposed a unified model for recovering shape, reflectance and optional illumination while using local smoothness, global scarcity or entropy, and the absolute colour of each pixel. Although Tsai’s[14,15] method is very straightforward and to an extent simplistic, it provides satisfying results. Cuiti’s et al[16] algorithm, on the other hand, uses a more advanced model (incorporating a camera model with perspective projection) that makes things in the background appear further back than in Tsai’s model (Figure 4).
Since for a given 2-D image, light source and surface shape are not known, these algorithms try to model how the 2-D image was created from the 3-D environment to finally produce an approximation this 3-D depth. The above modelling has a significant impact on the resulting 3-D representation. During SfS process additional constraints need to be applied on the surface shape parameters or the light conditions to find the surface characteristics.
In conclusion, we showed previously that 3-D representation software offers a plausible alternative for 3-D representation of conventional CE images (until optics technology matures enough to allow a hardware enabled-“real” 3-D reconstruction of the GI tract)[9]. In the present study we compared 4 publicly available SfS methods. 3-D reconstruction is attracting interest in capsule endoscopy[8,9,14,25-28], especially as newly developed and/or under development CE become available, with greater potential (due to imager and optics) for 3-D software[20].
COMMENTS
Background
Over the past decade, conventional endoscope technology has advanced with the use of three-dimensional (3-D) cameras offering increased diagnostic and interventional capabilities. Unfortunately, due to hardware limitations, 3-D small-bowel capsule endoscopy (SBCE) is still an open technological challenge. It is aspired that 3-D SBCE will be able to offer similar benefits to conventional 3-D endoscopy. Therefore, information technology engineers suggested the use of software techniques (Shape-from-Shading, SfS) methods that simulate 3-D reconstruction i.e., 3-D representation in SBCE images. To date, various SfS approaches have been proposed; each aims to retrieve depth information from 2-D images (shape recovery) through different mathematical transformations, hence offering different shape approximations.
Research frontiers
The authors aimed to evaluate the 3-D representation performance of 4 publicly available SfS algorithms by comparing them with their equivalent 2-D images of small-bowel structures/lesions obtained during SBCE, in order to identify the algorithm more helpful in facilitating identification and distinction between the lesion and the surrounding mucosa.
Innovations and breakthroughs
This study, in conjunction with further similar work in the field, is useful in the assessing the potential validity of integrating 3-D representation in capsule endoscopy reviewing software.
Applications
Software-enabled 3-D representation is a promising approach that enables 3-D imaging at no additional cost. The authors have shown that SfS application leads to improved visualisation in SBCE and is it likely to be of use in certain clinical scenarios, like the ‘mass or bulge’ question.
Peer review
An interesting paper dealing with software and capsule endoscopy.
Footnotes
P- Reviewer: Figueiredo P S- Editor: Zhai HH L- Editor: A E- Editor: Wu HL
References
- 1.Iddan G, Meron G, Glukhovsky A, Swain P. Wireless capsule endoscopy. Nature. 2000;405:417. doi: 10.1038/35013140. [DOI] [PubMed] [Google Scholar]
- 2.Eliakim R. Video capsule endoscopy of the small bowel. Curr Opin Gastroenterol. 2013;29:133–139. doi: 10.1097/MOG.0b013e32835bdc03. [DOI] [PubMed] [Google Scholar]
- 3.Mata A, Llach J, Bordas JM. Wireless capsule endoscopy. World J Gastroenterol. 2008;14:1969–1971. doi: 10.3748/wjg.14.1969. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Koulaouzidis A, Rondonotti E, Karargyris A. Small-bowel capsule endoscopy: a ten-point contemporary review. World J Gastroenterol. 2013;19:3726–3746. doi: 10.3748/wjg.v19.i24.3726. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Mavrogenis G, Coumaros D, Renard C, Bellocq JP, Defta D, Charneau D, Leroy J. Jejunal gastrointestinal stromal tumor missed by three capsule endoscopies. Endoscopy. 2011;43:735–736,author reply 737. doi: 10.1055/s-0030-1256573. [DOI] [PubMed] [Google Scholar]
- 6.Hakim FA, Alexander JA, Huprich JE, Grover M, Enders FT. CT-enterography may identify small bowel tumors not detected by capsule endoscopy: eight years experience at Mayo Clinic Rochester. Dig Dis Sci. 2011;56:2914–2919. doi: 10.1007/s10620-011-1773-0. [DOI] [PubMed] [Google Scholar]
- 7.Triantafyllou K, Papanikolaou IS, Papaxoinis K, Ladas SD. Two cameras detect more lesions in the small-bowel than one. World J Gastroenterol. 2011;17:1462–1467. doi: 10.3748/wjg.v17.i11.1462. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Koulaouzidis A, Karargyris A. Three-dimensional image reconstruction in capsule endoscopy. World J Gastroenterol. 2012;18:4086–4090. doi: 10.3748/wjg.v18.i31.4086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Koulaouzidis A, Karargyris A, Rondonotti E, Noble CL, Douglas S, Alexandridis E, Zahid AM, Bathgate AJ, Trimble KC, Plevris JN. Three-dimensional representation software as image enhancement tool in small-bowel capsule endoscopy: A feasibility study. Dig Liver Dis. 2013;45:909–914. doi: 10.1016/j.dld.2013.05.013. [DOI] [PubMed] [Google Scholar]
- 10. Available from: http: //en.wikipedia.org/wiki/Stereo_camera.
- 11.Kolar A, Romain O, Ayoub J, Viateur S, Granado B. Prototype of video endoscopic capsule with 3-d imaging capabilities. IEEE Trans Biomed Circuits Syst. 2010;4:239–249. doi: 10.1109/TBCAS.2010.2049265. [DOI] [PubMed] [Google Scholar]
- 12.Fisher LR, Hasler WL. New vision in video capsule endoscopy: current status and future directions. Nat Rev Gastroenterol Hepatol. 2012;9:392–405. doi: 10.1038/nrgastro.2012.88. [DOI] [PubMed] [Google Scholar]
- 13.Zhang R, Tsai PS, Cryer JE, Shah M. Shape from Shading: A Survey. IEEE Trans Pattern Anal Mach Intell. 1999;21:690–706. [Google Scholar]
- 14.Karargyris A, Karargyris O, Bourbakis N. 3D Representation of the Digestive Tract Surface in Wireless Capsule Endoscopy Videos; BIBE; May 31-Jun 3; Philadelphia, United States. New Jersey: IEEE International Conference on Bioinformatics and BioEngineering. In: Proceedings of the 2010 IEEE xplore; 2010. pp. 279–280. [Google Scholar]
- 15.Tsai PS, Shah M. Shape from shading using linear approximation. Comput Vis Image. 1994;12:487–498. [Google Scholar]
- 16.Ciuti G, Visentini-Scarzanella M, Dore A, Menciassi A, Dario P, Guang-Zhong Y. Intra-operative monocular 3D reconstruction for image-guided navigation in active locomotion capsule endoscopy. Biomedical Robotics and Biomechatronics (BioRob) 2012;4th IEEE RAS & EMBS International Conference June 2012:768–774, 24-27. [Google Scholar]
- 17.Torreão JRA, Fernandes JL. Linear-nonlinear neuronal model for shape from shading. Pattern Recognit Lett. 2011;32:1223–1239. [Google Scholar]
- 18.Barron JT, Malik J. Color constancy, intrinsic images, and shape estimation. Computer Vision–ECCV 2012. Berlin Heidelberg: Springer; 2012. pp. 57–70. [Google Scholar]
- 19.Koulaouzidis A, Karargyris A. Application of 3D-Representation Algorithms in Small-Bowel Capsule Endoscopy. Glob J Gastroenterol Hepatol. 2013;1:2–3. [Google Scholar]
- 20.Warrens MJ. Inequalities between multi-rater kappas. Adv Data Anal Classif. 2010;4:271–286. [Google Scholar]
- 21. Available from: http://justusrandolph.net/kappa/
- 22.Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977;33:159–174. [PubMed] [Google Scholar]
- 23.Blais F. Review of 20 years of range sensor development. J Electron Imaging. 2004;13:231–243. [Google Scholar]
- 24.Blais F Wikipedia. Stereo camera. Available from: http: //en.wikipedia.org/wiki/Stereo_camera.
- 25.Ciaccio EJ, Tennyson CA, Bhagat G, Lewis SK, Green PH. Use of shape-from-shading to estimate three-dimensional architecture in the small intestinal lumen of celiac and control patients. Comput Methods Programs Biomed. 2013;111:676–684. doi: 10.1016/j.cmpb.2013.06.002. [DOI] [PubMed] [Google Scholar]
- 26.Ciaccio EJ, Tennyson CA, Bhagat G, Lewis SK, Green PH. Implementation of a polling protocol for predicting celiac disease in videocapsule analysis. World J Gastrointest Endosc. 2013;5:313–322. doi: 10.4253/wjge.v5.i7.313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Prasath VBS, Figueiredo IN, Figueiredo PN, Palaniappan K. Mucosal region detection and 3D reconstruction in wireless capsule endoscopy videos using active contours. Proceedings of the Engineering in Medicine and Biology Society (EMBC) 2012;IEEE Annual International Conference; 2012 Aug 28-Sep 1; San Diego, CA, USA. Engineering in Medicine and Biology Society (EMBC), 2012:4014–4017. doi: 10.1109/EMBC.2012.6346847. [DOI] [PubMed] [Google Scholar]
- 28.Fu Y, Zhang DW, Liu H, Meng MQH. 3-D shape recovery of luminal wall from WCE image. Proceedings of the Automation and Logistics (ICAL) 2012;IEEE International Conference; 2012 Aug 15-17; Zhengzhou, China. Automation and Logistics (ICAL), 2012:300–303. [Google Scholar]