Abstract
Colonoscopy is an effective screening procedure in colorectal cancer prevention programs; however, colonoscopy practice can vary in terms of lesion detection, classification, and removal. Artificial intelligence (AI)-assisted decision support systems for endoscopy is an area of rapid research and development. The systems promise improved detection, classification, screening, and surveillance for colorectal polyps and cancer. Several recently developed applications for AI-assisted colonoscopy have shown promising results for the detection and classification of colorectal polyps and adenomas. However, their value for real-time application in clinical practice has yet to be determined owing to limitations in the design, validation, and testing of AI models under real-life clinical conditions. Despite these current limitations, ambitious attempts to expand the technology further by developing more complex systems capable of assisting and supporting the endoscopist throughout the entire colonoscopy examination, including polypectomy procedures, are at the concept stage. However, further work is required to address the barriers and challenges of AI integration into broader colonoscopy practice, to navigate the approval process from regulatory organizations and societies, and to support physicians and patients on their journey to accepting the technology by providing strong evidence of its accuracy and safety. This article takes a closer look at the current state of AI integration into the field of colonoscopy and offers suggestions for future research.
Keywords: Colonoscopy, Adenoma, Artificial intelligence, Computational intelligence, Endoscopy, Surveillance
Core Tip: Artificial intelligence (AI)-assisted decision support systems for endoscopy have shown promising results for the detection and classification of colorectal lesions. However, their integration into clinical practice is currently limited by the lack of design, validation, and testing under real-life clinical conditions. Further work is required to address the challenges of AI integration, to navigate the regulatory approval process, and to support physicians and patients on their journey to accepting the technology by providing strong evidence of accuracy and safety. This article describes the current state of AI integration into colonoscopy practice and offers suggestions for future research.
INTRODUCTION
Colorectal cancer (CRC) is the fourth most commonly diagnosed and the third most fatal cancer worldwide in 2018[1]. The prevalence costs of cancer care were estimated to be $14.1 billion for CRC in the United States in 2010[2]. Over the past decade, CRC incidence and mortality have declined as a result of the increase in CRC screening and prevention examinations[3]. Colonoscopy is a screening tool with high sensitivity for the detection of precancerous and cancerous lesions, and may contribute to an approximately 80%, and up to 60% reduction in CRC incidence and mortality, respectively[4-8]. Colonoscopy prevents CRC by breaking the adenoma-carcinoma sequence through detection and removal of premalignant colorectal polyps[3]. Furthermore, it is a cost-effective procedure that often allows surgery to be avoided in patients with adenomas or CRCs that do not invade deeper than the superficial submucosa[9]. However, the quality of colonoscopy procedures depends on the experience of the endoscopists and the techniques and technology used[10]. A suboptimal colonoscopy examination can result in interval cancers, which are CRCs that occur after a colonoscopy and before the next surveillance examination, and are usually due to non-detection and/or incomplete resection of premalignant polyps. Recent research has shown that CRC precursor lesions are incompletely resected in about 14% of colonoscopy procedures[11]. Quality indicators have been established to describe and measure the quality of colonoscopy examinations[12], and the use of pre- and intraprocedural quality metrics has been shown to result in both an increase in colonoscopy quality and standardization of procedures[12,13]. One of the most recognized quality metrics is the adenoma detection rate (ADR), which is the proportion of an endoscopist’s patients undergoing screening colonoscopy who have at least one adenoma detected; every 1% increase in the ADR has been shown to result in a 3% decrease in the risk of post-colonoscopy CRC[10].
Over 90% of colorectal polyps are diminutive (≤ 5 mm) or small (≤ 10 mm), and most of these polyps are non-neoplastic[10]. Recent advances in image-enhanced endoscopy [IEE; e.g., blue-light imaging, narrow-band imaging (NBI), and i-Scan] have resulted in enhanced visualization of the polyp surface pattern. IEE can be employed for the optical classification of colorectal polyps during colonoscopy, obviating the need for pathology[14,15]. The American Society for Gastrointestinal Endoscopy (ASGE) Technology Committee, in its Preservation and Incorporation of Valuable endoscopic Innovations (PIVI) statement, has recommended the optical evaluation of diminutive polyps, adopting a “resect and discard” strategy for all diminutive colorectal polyps, and a “diagnosis and leave” strategy for diminutive rectosigmoid polyps, if the endoscopist can reach the recommended threshold of ≥ 90% agreement with histopathology results for surveillance interval assignment and ≥ 90% negative predictive value (NPV) for diagnosis of adenomatous histology, respectively[14,15]. Optical diagnosis can distinguish between neoplastic and non-neoplastic polyps and therefore deliver clinical and cost benefits by reducing the number of unnecessary histopathology examinations and providing immediate surveillance interval recommendations to patients. However, despite the demonstrated high accuracy of optical diagnosis for diminutive polyps, endoscopists have been reluctant to support its broad implementation because of concerns about incorrect diagnoses, assignment of inappropriate surveillance intervals, and related medicolegal issues[16].
To address the shortcomings in current colonoscopy practice, research has been directed at standardizing colonoscopy procedures among endoscopists through the integration of artificial intelligence (AI) into colonoscopy practice. AI could provide real-time support to physicians by automatically recognizing specific polyp patterns in colonoscopy images and/or videos, as well as suggesting the most probable histology and providing a confidence level for the predicted histology. The use of such technology would help to mitigate the effects of endoscopist experience in optical diagnosis. Computer-assisted, or most recently, AI-assisted colonoscopy diagnostic systems (CAD) for detection (CADe) and classification (CADx) of colorectal polyps are currently the two main areas of research and implementation of AI in clinical practice. AI-assisted colonoscopy improves ADR and allows for reliable, operator-independent pathology prediction of colorectal polyps. However, there is still a substantial communication gap between computer and medical fields, with scientists in these two disciplines divided in terms of background knowledge, available resources, research typology, and awareness of unmet needs in clinical practice. In this review, we summarize the most important aspects of the application of CADe and CADx in routine colonoscopy practice.
DEVELOPMENT OF COMPUTER-ASSISTED DIAGNOSTIC SYSTEMS
Pairing colonoscopy devices with image-enhanced technology (i.e., white-light endoscopy and chromoendoscopy) has improved the quality of care to patients by increasing the precision of colonoscopy procedures[4]. Recently, research efforts have focused on integrating computational power and previously collected data to enhance the simultaneous detection and classification of colonoscopy images or videos and support endoscopists in their decisions about the presence and/or histology of a polyp.
Machine learning is a subset of AI that allows mathematical methods to develop an algorithm based on given data (e.g., polyp images or videos) to predict the same pattern or a specific task in unseen or unknown data[17]. The final output of these systems (e.g., detection or classification of polyps) is based on pre-defined features or extraction of the most relevant image features (e.g., polyps), which may help in the specification, detection, or classification of a new image. In conventional machine learning (i.e., handcrafted models), a researcher manually introduces the clinically relevant polyp features to the machine learning algorithm. In contrast, in the most advanced machine learning method, which is called deep learning, polyp features, clinically relevant or not, are automatically extracted by the algorithm without prior introduction by a researcher. As a result, the output is based on the capture and summary of complex polyp characteristics, either for detection (i.e., discrimination of polyp from background mucosa) or prediction of histopathology (i.e., neoplastic or non-neoplastic)[17]. Deep learning employs deep neural networks (DNNs), which imitates the complex interconnected neural network in the human brain. These artificial neurons are positioned in several detections and pooling layers, taking weighted data (from the precedent layer), processing it, and passing the output (processed data) to the next layer. Each layer performs as a “step of abstraction[17]”, which forms a hierarchy of common features that grow in complexity throughout the layers (i.e., edge- > basic shape- > object- > class prediction). In other words, each layer would extract useful and relevant features from a given data that would facilitate the classification of the images. When data are presented, the DNN performs the repetitive iterations of a previously chosen model (i.e., support vector machines, random forests, or neural networks) throughout the deeper layers, so-called hierarchical feature learning[17]. For computer-assisted colonoscopy, the development of the AI model is primarily based on supervised data, where data are retrospectively labeled by one or a group of expert endoscopists. For example, in CADx, colonoscopy images or videos will be labeled as neoplastic or non-neoplastic based on the reference standard of pathology results (Figure 1), which would have been reviewed and finalized following consensus by several pathologists. In CADe, however, polyp images or videos will be reviewed by experienced endoscopists, and polyp borders will be delineated based on consensus by endoscopists. Ultimately, the output of the AI algorithm will identify the presence of a polyp, or be able to discriminate between a neoplastic and non-neoplastic polyp (Figure 2)[17]. However, there are some shortcomings and barriers to the development and implementation of CAD systems in real-time endoscopy practice, as discussed below.
Datasets
The data used to develop a CAD system will be divided into three or more datasets: One training dataset to build the AI model, one validation dataset to check the generalizability of the model, and at least one test dataset from another source of data to test the performance of the model[17]. Commonly, training and validation data are derived from the same source (i.e., colonoscopies performed at a single center); however, it is crucial to avoid overlap of data; otherwise, evaluation of the model hyperparameters would be flawed and would lead to “model overfitting.” Model overfitting is an error in modeling that occurs when the model is too tightly fitted to the training data and random fluctuations in the training data are learned as concepts by the model. The problem is that the fitted model does not generalize to new data due to its low bias and high variance. Overfitting can be avoided by tight monitoring of the model during the training by constantly evaluating the model performance in the training and validation data[17].
Researchers should use large and heterogeneous data, including normal and abnormal colonoscopies. A sufficient number of colonoscopy images or video frames would ensure a robust evaluation of model performance. Data should ideally be collected from multiple centers and diverse patients in terms of race, age, sex, and medical issues.
A lack of ground truth data or reliable annotated “big data” for generating effective and high-performance AI models could limit the broad application of CAD systems in clinical settings[18]. This is a challenging goal to achieve as it requires millions of colonoscopy images and videos to be annotated by multiple highly experienced experts to ensure a consensus on ambiguous images. Annotation and data labeling by experts should follow a uniform and standardized protocol, otherwise, the generalizability and performance evaluation of the model will be unreliable.
Gold standard comparison
The absence of a “gold standard” for diagnosing polyp histology would affect the accuracy of CAD performance. Although pathology results are currently regarded as the reference standard, the interobserver agreement among pathologists is not 100%; polyp histology determined by one pathologist might be different from that of another pathologist when reassessing the same specimen slides[19-22]. Therefore, the pathology data used for AI models must be re-evaluated by several pathologists prior to inclusion to ensure agreement on polyp pathology.
Technical transparency
The application of CAD in routine practice is a product of an interdisciplinary collaboration between medical and AI researchers. A recent review demonstrated that researchers failed to report the AI model characteristics effectively[23]. Researchers should ensure that they clearly define and report the AI model architecture or hyperparameters, including the number of deep layers and learning rate. The definition and testing of hyperparameters are crucial to the validation process owing to their direct effect on the model’s performance; optimal model generalizability in the validation step implies the correct choice of hyperparameters. Researchers should briefly explain the source of data, the process of data selection, and the number of patients, including images/videos frames, normal colonoscopies (i.e., without polyp identification), colonoscopy centers, and participating endoscopists together with their level of expertise[17].
Furthermore, researchers should adopt appropriate techniques to prevent model overfitting. Data leakage may occur when the testing dataset results are used to tune the model parameters instead of using the results derived from the validation dataset. Therefore, the model may over-fit toward the unseen data, risking a biased estimate of model performance. The stringent use of high-quality still images instead of videos that contain large variability in colonoscopy images may increase the risk of overfitting.
Computer-assisted polyp detection system
In the context of CAD, although the shift from separate engineering and medical disciplines to combined medical and engineering research has gained momentum over the last decade, pilot studies established the idea of CADe as early as 2003[24,25]. The primary hand-crafted AI models used the pre-described polyp features (e.g., color and/or texture-based features) and annotated colonoscopy videos for the detection of colorectal polyps[25-29]. Other studies used the same idea and developed several AI models that resulted in up to 90% sensitivity[30-32]. However, these studies used small and homogeneous datasets to develop and validate the AI models, raising doubts over the model’s optimal performance. The hand-crafted features used to build the model led to suboptimal performance, probably because of impaired feature recognition and description, and a high level of false-positive detection owing to the presence of colonic folds, blood vessels, and feces in the lateral view.
After the invention of DNNs, important polyp features could be automatically recognized. Subsequently, the accuracy and sensitivity of models improved, signaling the great potential for CADe application. Recently, Yamada et al[33] developed a CADe system using a supervised DNN, and validated the system using a dataset of 705 still images of 752 lesions and 4135 still images of noncancerous tissue. This system performed well, with a sensitivity and specificity of 97.3% and 99.0%, respectively, and an area under the curve (AUC) of 0.975 in the validation set. Misawa et al[34] developed a model based on 546 short colonoscopy videos, comprising 155 polyp-positive and 391 polyp-negative videos. Two experts retrospectively annotated videos for polyp presentation to provide a gold standard for comparison. The model presented sensitivity, specificity, and accuracy of 90.0%, 63.3%, and 76.5%, respectively. The polyp detection rate and false-positive detection rate were 95% and 60%, respectively. Other significant research used a large dataset for training an AI model, which comprised 8641 annotated images from over 2000 colonoscopies[35]. The model generated excellent detection capability, with an AUC of 99% and an accuracy of 96.4%. The performance of this model was also superior to that of experts. The authors tested model performance in 20 colonoscopy videos with a total duration of 5 h, during which colonoscopists removed 28 polyps. After reviewing the videos by four independent experts, eight additional polyps were identified (36 polyps) without the use of AI assistance and 17 additional polyps were detected with AI assistance (total 45 polyps). The model had a false-positive rate of 7%.
Research with a prospective design and focusing on the evaluation of the real-time performance of CADe is scarce. Wang et al[36] conducted a prospective non-blinded clinical trial, which aimed to measure ADR with and without the application of CADe. Using 522 and 536 colonoscopies in the control and intervention arms, respectively, the authors found a statistically significant increase in ADR (29.1% vs 20.3%) and an increased number of adenomas per patient (0.53 vs 0.31) when CADe was used. The false-positive rate was 7.5% per colonoscopy, and there was no significant difference in the procedure time. CADe could detect a higher number of diminutive adenomas and hyperplastic polyps, which represent a higher risk of unnecessary polypectomies, pathology examinations, and longer procedure times. To date, the generalizability of this system has not been tested in Western clinical settings.
In contrast to the results of the latter study, Klare et al[37] prospectively evaluated endoscopist performance using CADe assistance during the real-time colonoscopy procedures of 55 patients. However, the endoscopists only observed the regular monitor, and an independent investigator observed the monitor dedicated to representing the real-time outputs of the CADe system in a separate room, which was blinded from the endoscopists’ sight. Therefore, the endoscopists were blinded to the real-time CADe outputs. This system did not increase the precision of polyp detection in real-time practice: In per-patient analysis, the application of CADe resulted in endoscopists achieving a lower ADR (29.1% vs 30.9%); in per-polyp analysis, CADe could only detect 55 out of 73 polyps previously detected by endoscopists. Tables 1 and 2 shows the summary of the recent studies evaluating a CADe system.
Table 1.
Ref.
|
Year
|
Study design
|
Study aim
|
CADe system
|
Image modality
|
Number of patients in the CADe group
|
Number of patients in the control group
|
Number of polyps (CADe vs control group)
|
Adenoma detection rate (%) (CADe vs control group)
|
Polyp detection rate (%) (CADe vs control group)
|
Number of false-positive rate (%) (CADe vs control group)
|
Withdrawal time (CADe vs control group), min ± SD; minute
|
Wang et al[36] | 2019 | Non-blinded prospective randomised controlled study | To investigate whether a high-performance real-time CADe system can increase polyp and adenoma detection rates in the real clinical setting | The real-time automatic polyp detection system (Shanghai Wision AI Co., Ltd.) based on artificial neural network-SegNet architecture | Real-time Video stream | 522 | 536 | 767 (498 vs 269) | 29.1 vs 20.3; P < 0.001; 95%CI = 1.21-2.135 | 45.0 vs 29.1; P < 0.001; 95%CI = 1.532-2.544 | 39 vs 0 | 6.18 ± 1.38 vs 6.07 ± 1.11; P = 0.15 |
Wang et al[74] | 2020 | Double-blind Prospective randomised trial | To assess the effectiveness of a CADe system for improving detection of colon adenomas andpolyps; to analyse the characteristics ofpolyps missed by endoscopists | The real-time automatic polyp detection system (Shanghai Wision AI Co., Ltd.) based on artificial neural network-SegNet architecture | Real-time Video stream | 484 | 478 | 809 (501 vs 308) | 34.0 vs 28.0; P = 0.030; OR = 1.36, 95%CI = 1.03–1.79 | 52.0 vs 37.0; P < 0.0001; OR = 1.86, 95%CI = 1.44–2.41 | 48 in CADe group (control group not reported) | 6.48 ± 1.32 vs 6.37 ± 1.09; P = 0.14 |
Su et al[75] | 2020 | Single-blind Prospective randomised trial | To develop an automatic quality control system; to investigate whether the system could increase the detection of polyps and adenomas in real clinical practice | Five deep learning convolutional neural networks (DCNNs) based on AlexNet, ZFNet, and YOLO V2 | Real-time Video stream | 308 | 315 | 273 (177 vs 96) | 28.9 vs 16.5; P < 0.001; OR = 2.055, 95%CI = 1.397-3.024 | 38.3 vs 25.4; P = 0.00; OR = 1.824, 95%CI = 1.296-2.569 | 62 in CADe system (control group not reported) | 7.03 ± 1.01 vs 5.6 ± 1.26; P < 0.001 |
Gong et al[76] | 2020 | Single-blind Prospective randomised trial | To evaluate whether the CADe system could improve polyp yield during colonoscopy | ENDOANGEL based on the deep neural networks and perceptual hash algorithms | Real-time video stream | 355 | 349 | 302 (178 vs 124) | 16 vs 8; P = 0.001; OR = 2.30, 95%CI = 1.40-3·77 | 47 vs 34; P = 0.0016; OR = 1.69, 95%CI = 1.22-2.34 | For endoscope being inside = 0.8; For identification of the caecum = 2; for prediction of slipping = 0 | 6.38 ± 2·48 vs 4.76 ± 254; P < 0.0001 |
Liu et al[77] | 2020 | Double-blind Prospective randomised trial | To study the impact of CADe system on the detection rateof polyps and adenomas in colonoscopy | The convolutional threedimensional (3D) neural network | Real-time video stream | 508 | 518 | 734 (486 vs 248) | 39.1 vs 23.9; P < 0.001; OR = 1.637, 95%CI = 1.201‑2.220 | 43.7 vs 27.8; P < 0.001; OR = 1.57, 95%CI = 1.586‑2.483 | 36 in CADe system (control group not reported) | 6.82 ± 1.78 vs 6.74 ± 1.62; P < 0.001 |
Luo et al[78] | 2021 | Non-blinded Prospective randomised trial | To explore whether CADe could improve the polyp detection rate in the actual clinical environment | A CNN algorithm based on a YOLO network architecture | Real-time Video stream | 150 | 150 | 185 (105 vs 80) | 38.7 vs 34.0; P < 0.001 | - | 52 in CADe system (control group not reported) | 6.22 ± 0.55 vs 6.17 ± 0.52; P = 0.102 |
Repici et al[79] | 2020 | Singles-blind Prospective randomised trial | To assess the safety and efficacy of a CADe system for the detection of colorectal neoplasia | The CNN (GI-Genius; Medtronic) | Real-time Video stream | 341 | 344 | 596 (353 vs 243) | 54.8 vs 40.4; P < 0.001; RR = 1.30, 95%CI = 1.14-1.45 | 279/341 (82) 214/344 (62) | - | 417 ± 101 seconds for the CADe group vs 435 ± 149 for controls; P = 0.1 |
Wang1 et al[80] | 2020 | Singles-blind Prospective randomised trial | To investigate the impact of CADe on adenoma miss and detection rate | The artificial neural network (EndoScreener, Shanghai Wision AI Co,Ltd, Shanghai, Chin) | Real-time Video stream | 184 (CADe-routine group)2 | 185 (Routine-CADe group)3 | 529 (244 vs 285) | 42.39 vs 35.68; P = 0.186; OR = 1.327, 95%CI = 0.872–2.018 | 63.59 vs 55.14; P = 0.09; OR = 1.421, 95%CI = 0.936–2.157 | 67 in CADe system (control group not reported) | 6.55 (5.34–7.77) vs 6.51 (5.45–7.57); P = 0.7454 |
The total adenoma miss rate by computer-assisted detection system (CADe) [colonoscopy = 13.89%, 95% confidence interval (CI) = 8.24%–19.54%]; by routine colonoscopy = 40.00%, 95%CI=31.23%–48.77%, P < 0.0001. The total polyp miss rate by CADe colonoscopy = 12.98%, 95%CI = 9.08%–16.88%; by routine colonoscopy = 45.90%, 95%CI = 39.65%–52.15%, P < 0.0001). Visible adenoma miss rate: Routine-CADe group = 24.21% vs CADe-routine group = 1.59%, P < 0.001; Visible polyp miss rate: Routine-CADe group = 30.89% vs CADe-routine group = 2.36%; P < 0.001.
It means that the colonoscopy was performed by the CADe system and then the conventional method.
It means that the colonoscopy was performed by the conventional method and then the CADe system.
Median (interquartile range).
CADe: Computer-assisted detection system; CNN: Convolutional neural network; DCNN: Deep learning convolutional neural network; SD: Standard deviation; OR: Odds ratio; RR: Relative risk; CI: Confidence interval.
Table 2.
Ref.
|
Year
|
Study design
|
System
|
Image modality
|
Number of patients/colonoscopies used for training/test datasets (total)
|
Number of colonoscopy/polyp images/videos used for training/test datasets
|
Diagnostic properties
|
Park and Sargent[81] | 2016 | Retrospective | CADe based on DCNN using a conditional random field model | Still images | 35 (colonoscopy videos) | 562/562 (colonoscopy still images) | Sensitivity = 86%; specificity = 85%; AUC = 0.8585 |
Fernández-Esparrach et al[73] | 2016 | Retrospective | CADe based on energy map | Still images | NA/24 colonoscopy videos containing 31 different polyps | NA/Experiment A: 612 polyp images from all 24 videos. Experiment B: 47886 frames from the 24 videos | Experiment A: accuracy = small vs all polyps = 77.5%, 95%CI = 71.5%–82.6% vs 66.2%, 95%CI = 61.4%–70.7%; P < 0.01. Experiment B: The AUC = high quality frames vs all Frames = 0.79, 95%CI = 0.70–0.87 vs 0.75, 95%CI = 0.66–0.83 |
Yu et al[82] | 2017 | Retrospective | CADe based on three-dimensional (3-D) deep learning integration framework by leveraging the 3-D fully CNN (3D-FCN) | Videos | 20/18 (colonoscopy videos) | 3799 frames with polyps in total | Sensitivity = 71%; PPV = 88%; precision = 88.1% |
Billah et al[83] | 2017 | Retrospective | CADe based on CNN and color wavelet features using a linear support vector machine | Still images | 100 (colonoscopy videos for combined training and test datasets) | 14000 still images (combined for training and test datasets) | Accuracy = 98.65%; sensitivity = 98.79%; specificity = 98.52% |
Zhang et al[84] | 2017 | Retrospective | CADe based on DCNN | Still images | NA | 2262/150 random, 30 NBI (colonoscopy still images) | Accuracy = 85.9%; sensitivity = 98%; PPV = 99%; precision = 87.3%; recall rate = 87.6%; AUC = 1.0 |
Wang et al[85] | 2018 | Retrospective | CADe based on DNN | Still images | 1290/1138 (2428) patients | 27113/5545 (colonoscopy images) | Sensitivity = 94.38%, 95%CI = 93.80%-94.96% in images with polyp; AUC = 0.984 |
Misawa et al[34] | 2018 | Retrospective | CADe based on CNN | Videos | 59/14 (73) | 411/135 (colonoscopy videos containing 150 polyps) | Per-polyp sensitivity = 94%; per-frame sensitivity = 90%; specificity = 63.3%; accuracy = 76.5%; false positive rate = 60%; AUC = 0.87 |
Yamada et al[33] | 2019 | Retrospective | CADe based on DNN | Videos | NA/77 (number of videos) | 13983/4840 (colonoscopy videos) | Sensitivity = 97.3%, 95%CI = 95.9%–98.4%; specificity = 99.0%, 95%CI = 98.6%–99.2%; AUC = 0.975, 95%CI = 0.964–0.986) |
Urban et al[35] | 2018 | Retrospective | CADe based on deep learning CNN | Videos | Several training and validation sets: (1) Cross-validation on the 8641 images; (2) Training on the 8641 images and testing on the 9 videos, 11 videos, and independent dataset; and (3) Training on the 8641 images and 9 videos and testing on the 11 videos and independent dataset | Sensitivity = 96.9%; specificity: 95%; AUC = 0.991; accuracy = 96.4%; false positive rate = 7% | |
Klare et al[37] | 2019 | Prospective | Automated polyp detection software (“KoloPol,” Fraunhofer IIS, Erlangen, Germany) based on CNN | Live colonoscopy videos | NA | NA/55 (colonoscopy videos) | Per-polyp sensitivity = 75.3%, 95%CI = 62.3%-84.9%; PDR = 50.9%, 95%CI = 37.1%-64.4%; ADR = 29.1%, 95%CI = 17.6%-42.9% |
Ozawa et al[86] | 2020 | Retrospective | CADe based on DCNN | Still images | 12895 patients | 16418/7077 | Sensitivity = 92%; PPV = 86%; accuracy = 83%; identified adenomas = 97% |
CADe: Computer-assisted detection system; CNN: Convolutional neural network; DCNN: Deep learning convolutional neural network; AUC: Area Under the Receiver Operating Characteristic curve; PPV: Positive predictive value; NPV: Negative predictive value; PDR: Polyp detection rate; ADR: Adenoma detection rate; CI: Confidence interval.
Computer-assisted polyp classification system
Computer-assisted diagnosis of the histopathology of colorectal polyps has become an area of significant research interest because of its potential to prevent the resection of low-risk polyps and reduce the number of unnecessary histopathology examinations. Many studies have successfully developed and validated CADx models, the use of which would allow the “diagnosis and leave strategy” to be implemented. In a prospective pilot study, in which the data from 128 patients undergoing colonoscopy using NBI were used to test a CADx system (209 polyps detected and removed), three polyp features were used to build the AI model: Mean vessel length, vessel circumference, and mean brightness within detected blood vessels[38]. The results showed that the endoscopists’ ability to predict polyp histology was superior to that of CADx, which had a sensitivity of 90% and specificity of 70.2% in differentiating neoplastic from non-neoplastic images compared with histopathology as the gold standard. The system's diagnostic performance was compared with that of endoscopists, who were blinded to the histopathology reference standard. Endoscopists accurately predicted polyp histology with a sensitivity of 93.8% and specificity of 85.7% when there was interobserver agreement. In cases of disagreement between endoscopists, the suggested safe prediction of polyp histology (i.e., classification as neoplastic) produced a sensitivity of 96.9% and specificity of 71.4%. Overall, CADx could predict polyp histology with an approximate sensitivity and specificity of 90% and 70%, respectively; however, the overall correct classification rate was moderate (85.3%). Notably, this AI algorithm was not fully automated; thus, its real-time performance in a clinical setting remains to be determined. Another limitation of this study was the use of data from NBI colonoscopies. Although NBI may assist polyp classification, its use may cast doubt on the generalizability of the model, especially in clinical settings where NBI is not available.
The real-time evaluation of CADx is important if the technology is to be integrated into clinical practice. Some studies have used the real-time decision outputs from support vector machines for building CADx algorithms, with promising results[39-43]. Moreover, Chen et al[44] demonstrated that an AI model could accurately predict the histopathology of 284 diminutive polyps, comprising 96 hyperplastic and 188 neoplastic polyps diagnosed using NBI, with 96.3% sensitivity, 78.1% specificity, 91.5% NPV, and 89.6% PPV. This study and the study by Byrne et al[45] that used the combination of CADe and CADx systems (described below), are remarkable in that they achieved the threshold NPV of ≥ 90% recommended by the ASGE PIVI statement, favoring the implementation of the “diagnose and leave” strategy for diminutive rectosigmoid polyps[46]. However, the results of the former study need to be confirmed in a prospective study, ideally in a controlled trial, where the probability of selection bias is less, and the AI model can be compared with a conventional setting (without using AI).
More prospective studies assessing CADx are required to support the integration into clinical practice. The existing prospective studies resulted in a high and favorable diagnostic performance, which provided strong evidence to support the real-time application of CADx[47,48]. In contrast, the AI models developed and tested in a prospective trial by Kuiper et al[49] did not show sufficient power for differentiating adenomatous from non-adenomatous lesions. Another CADx model in a prospective study by Rath et al[50] could only produce moderate accuracy, sensitivity, and specificity (84.7%, 81.8%, and 85.2%, respectively), although the NPV was relatively high at 96.1%. This model would therefore allow diminutive rectosigmoid polyps to be diagnosed and left in situ without resection. The authors suggested that the low prevalence of neoplastic polyps could explain the model's moderate diagnostic performance compared with hyperplastic polyps in their dataset, which might proportionately result in an overestimation of the NPV, and an underestimation of the accuracy and PPV of the model. Table 3 shows the summary of the recent studies evaluating a CADe system.
Table 3.
Ref.
|
Year
|
Study design
|
Study aim
|
System
|
Number of patients/colonoscopies used for training/test datasets (total)
|
Number of colonoscopy/polyp images/videos used in training/test datasets
|
Diagnostic properties
|
Tischendorf et al[38] | 2010 | Prospective pilot | Distinguishing adenomas from non-adenomas | CADx based on SVMs | NA/128; Colonoscopy videos | NA/209 polyps containing 160 neoplastic and 49 non-neoplastic polyps in the test dataset | CADx: Sensitivity = 90%, specificity = 70%, correct classification rate = 85.3%. Consensus decision between the human. Observers: Sensitivity = 93.8%, specificity = 85.7%, correct classification rate = 91.9%. “Safe” decision, when there was interobserver discrepancy: Sensitivity = 96.9%, specificity = 71.4%, correct classification rate = 90.9% |
Aihara et al[47] | 2013 | Prospective | Distinguishing neoplastic from non-neoplastic lesion | CADx based on numerical color analysis of autofluorescence endoscopy as an Adobe AIRapplication | NA/32 patients in the test dataset | NA/102 lesions containing 75 neoplastic lesions in the test dataset | Sensitivity = 94.2%; specificity = 88.8%; PPV = 95.6%; NPV = 85.2% |
Mori et al[87] | 2015 | Retrospective pilot | Distinguishing small (≤ 10 mm) neoplastic from non-neoplastic lesion | CADx (EC-CAD) based on CNN | NA/152 patients in the test dataset | NA/176 small polyps in the test dataset containing 137 neoplastic and 39 non-neoplastic polyps for the test dataset | Accuracy = 89.2%, 95%CI = 83.7%-93.4%; Sensitivity = 92.0%, 95%CI = 86.1%-95.9%; specificity of 79.5%, 95%CI = 63.5%-90.7% |
Kuiper et al[49] | 2015 | Retrospective | Distinguishing small (≤ 9 mm) neoplastic from non-neoplastic lesion | CADx (WavSTAT) based on CNN | NA/87 patients in the test dataset | NA/207 small lesions in the test dataset | Accuracy = 74.4%, 95%CI = 68.1%–79.9%; sensitivity = 85.3%, 95%CI = 0.78–0.90; specificity = 58.8%, 95%CI = 0.48–0.69; PPV = 74.8%, 95%CI = 0.67–0.81; NPV = 73.5%; accuracy of on-site recommended surveillance interval = 73.7% |
Misawa et al[34] | 2018 | Retrospective | Distinguishing neoplastic from non-neoplastic lesion categorized | CADx based on SVMs | NA | 979 images containing 381 non-neoplasms and 598 neoplasms in the training dataset/100 images containing 50 non-neoplasms and 50 neoplasms in the test dataset | Accuracy = 90.0%, 95%CI = 82.4–95.1; sensitivity = 84.5%, 95%CI = 72.6–92.7; specificity = 97.6%, 95%CI = 87.4–99.9; PPV = 98.0%, 95%CI = 89.4–99.9; NPV = 82.0%, 95%CI = 68.6–91.4 |
Byrne et al[51] | 2018 | Retrospective | Distinguishing neoplastic from non-neoplastic lesions | CADx + CADe based on an improved DCNN model using NBI | NA | NA/21804 unseen frames in the test dataset | Accuracy = 99.94%; sensitivity = 95.95%; specificity = 91.66%; NPV = 93.6%; prediction of polyp videos = 97.6% |
Mori et al[48] | 2018 | Prospective | Distinguishing diminutive (≤ 5 mm) neoplastic from non-neoplastic lesions | CADx based on SVMs used with NBI and endocytoscope | NA/791 patients in the test dataset | 61925/466 polyps from 325 patients in the test dataset | CADx-NBI: Sensitivity = 92.7%, 95%CI = 89.1–95.4; specificity = 89.8%, 95%CI = 84.4–93.9; PPV = 93.7%, 95%CI = 90.2–96.2; NPV = 88.3%, 95%CI = 82.7–92.6. CADx-endocytoscope: Sensitivity = 91.3%, 95%CI = 87.5–94.3; specificity = 88.7%, 95%CI = 83.1–93.0; PPV = 92.9%, 95%CI = 89.3–95.6; NPV = 86.3%, 95%CI = 80.4–90.9 |
Byrne et al[45] | 2019 | Retrospective | Distinguishing diminutive (≤ 5 mm) neoplastic from non-neoplastic lesions | CADx based on DCNN | Training dataset: 60089 frames from 223 polyp videos (29% NICE type 1, 53% NICE type 2 and 18% of normal mucosa with no polyp)/validation dataset: 40 videos (NICE type 1, NICE type 2 and two videos of normal mucosa)/test dataset: 125 consecutively identified diminutive polyps, comprising 51 hyperplastic polyps and 74 adenomas | Accuracy = 94%, 95%CI = 86%-97%; sensitivity = 98%, 95%CI = 92%-100%; Specificity = 83%, 95%CI = 67%-93%; NPV = 97%; PPV = 90% | |
Song et al[88] | 2020 | Retrospective | Distinguishing adenomas from SPs | CADx based on DCNN | NA | 12480 image patches of 624 polyps/two test datasets of 545 polyp | Agreement between the true polyp histology CADx = 0.614–0.642; accuracy = 81.3%–82.4%; sensitivity = 82.1%; specificity = 93.7%; PPV = 78%; NPV = 95%; the AUC = 0.93–0.95, 0.86–0.89, and 0.89–0.91 for serrated polyps, benign adenoma/mucosal or superficial submucosal cancer, and deep submucosal cancer, respectively |
Kudo et al[89] | 2020 | Retrospective | Distinguishing small (≤ 10 mm) neoplastic from non-neoplastic lesions | The EndoBRAIN system (CADx + CADe based on DCNN) | NA/89 patients test set | 69,142 images taken at 520-fold magnification and 2,000 polyps/100 lesions (≤ 10 mm) in the test dataset | CADe: Accuracy = 98%, 95%CI = 97.3%–98.6%; sensitivity = 96.9%, 95%CI = 95.8%–97.8%; specificity = 100%, 95%CI = 99.6%–100%; PPV = 100%, 95%CI = 99.8%–100%; NPV = 94.6%, 95%CI = 92.7%–96.1%; CADx: Accuracy = 96%, 95%CI = 95.1%–96.8%; sensitivity = 96.9%, 95%CI = 95.8%–97.8%; specificity = 94.3%, 95%CI = 92.3%–95.9%; PPV = 96.9%, 95%CI = 95.8%–97.8%; NPV = 94.3%, 95%CI = 92.3%–95.9% |
CADe: Computer-assisted detection system; CADx: Computer-assisted diagnosis system; CNN: Convolutional neural network; DCNN: Deep learning convolutional neural network; AUC: Area Under the Receiver Operating Characteristic curve; PPV: Positive predictive value; NPV: Negative predictive value; SVM: Support vector machine; SP: Serrated polyps; CI: Confidence interval.
Combined CADe and CADx models
The ideal CAD system would support the simultaneous detection and classification of polyps to optimize colonoscopy outcomes and achieve the best level of CRC prevention. A recent study evaluated the real-time application of CADx in combination with CADe[45]. The validated model was tested on a series of 125 diminutive polyps, comprising 51 hyperplastic polyps and 74 adenomas. The combined model could not detect histopathology in 15% of polyps. For the remaining 106 polyps histologically predicted with high confidence, the AI model demonstrated an accuracy of 94%, sensitivity of 98%, specificity of 83%, NPV of 97%, and positive predictive value (PPV) of 90%. In a significant study, Byrne et al[51] developed a new platform using three distinct AI CADe and CADx algorithms to provide endoscopists with a full workflow from detection to classification: An NBI light detector, a polyp detector, and an optical biopsy. The NBI light detector runs throughout the colonoscopy procedure to ensure the detection of all colorectal polyps with white light imaging, and the optical biopsy provides an accurate polyp classification using NBI light. The NBI light model resulted in an excellent accuracy of 99.94% when tested in 21804 unseen colonoscopy video frames. However, the detection mode using white light resulted in a sensitivity of only 79%. The optical biopsy model could accurately classify 97.6% of polyps, which was significantly higher than a previous CADx model tested by the same research team[45], and had a sensitivity of 95.95%, specificity of 91.66%, and NPV of 93.6% for polyp classification.
QUALITY ASSESSMENT OF COLONOSCOPY BY COMPUTER
Few studies have evaluated an AI-assisted system for the ability to accurately and automatically assess the quality of a colonoscopy procedure, including the identification of critical anatomical landmarks, especially when the endoscopic field is blurry[52,53]. Filip et al[53] developed a “Colometer” system that could rate colonoscopy quality based on the percentage of the withdrawal time with adequate visualization. This system could detect the factors associated with optimal real-time visualization of the mucosa, including image clarity, withdrawal velocity, and level of bowel cleanliness. A dataset of expert-annotated images and videos was used to train the AI model. The authors compared the quality rated by this system with that of three independent experts. There was a strong correlation between AI and expert quality ratings (ρ coefficient 0.65, P = 0.01). In another study, a system comprising two AI algorithms was designed to automatically detect the appendiceal orifice on a colon image or video[54]. The first algorithm was developed to detect the appendiceal orifice on endoscopic images based on the local shape, lighting, and intensity differences from a normal edge direction. The second algorithm was designed to detect the appendiceal orifice in the colonoscopy videos using a frame intensity histogram. The system could detect the orifice in images with an average sensitivity and specificity of 96.86% and 90.47%, respectively, and correctly classified 21 out of 23 colonoscopy videos (accuracy 91.30%).
RECOMMENDATIONS FOR FUTURE RESEARCH
Despite potential benefits of AI in colonoscopy, regulatory approval and standardization of AI models are difficult goals to achieve for a number of reasons described below.
Polyp morphology
Datasets might underrepresent particular polyp morphologies that are not common findings during colonoscopy. For example, non-polypoid lesions with Paris classification of flat and/or depressed morphology are more likely to harbor advanced histology or malignancy but are not a common finding during colonoscopy[55]. The endoscopic detection of non-polypoid lesions is problematic because of their surface pattern resemblance to normal mucosa[56]. Moreover, serrated polyps comprise about 30% of colon polyps, with sessile serrated polyp/adenoma (SSA/P) prevalence being less than 10%[57]. It has been proven that SSA/Ps can be responsible for CRC through a serrated (hyperplastic-SSP/A-serrated-CRC) sequence[58]. However, SSA/Ps can hardly be distinguished from normal mucosa or hyperplastic polyps by features of crypt distortion. Research has shown that previously diagnosed hyperplastic polyps might be reclassified as SSAs after pathological reassessment[19-22], particularly for larger (> 5 mm) or right-sided polyps, and co-existing adenomas containing advanced histology[19,21,59]. A recent meta-analysis showed that pathological reassessment of resected polyps led to a significant change in diagnosis from hyperplastic to SSA for polyps in the right colon and polyps ≥ 5 mm (odds ratio 4.401 and 8.336, respectively)[59]. Moreover, there is poor agreement among pathologists in the determination of high-risk polyp features owing to the various approaches used for preparing biopsy specimens or level of expertise[19,60]. Therefore, the development of an AI platform capable of detecting and distinguishing subtle adenomatous features from normal mucosa with a high level of accuracy would be a valuable clinical tool.
Metadata
Most studies have failed to assess the performance and accuracy of AI models according to polyp size, polyp location, bowel preparation score, or withdrawal time[18]. Patients’ information including demographic and clinical characteristics (e.g., colonoscopy indication, disease status), procedure-related quality characteristics (i.e., bowel preparation level, withdrawal time), procedure time and room, endoscopists fatigue (i.e., the procedure performed in the morning or afternoon) are the important factors that are linked with the long-term non-endoscopic outcome of interest. In other words, the detection and classification of colorectal polyps are the intermediate outcomes of the colonoscopy, but the prevention of interval cancer during the surveillance period, or the evaluation of the effectiveness of medical therapy and the need for surgical treatment in patients with inflammatory bowel diseases are the ultimate goals of the colonoscopy depending on the primary indication of the procedure. As mentioned in Kudo et al[61], metadata is a critical component in establishing optimal AI platforms that can perform well in real-world practice with suboptimal conditions. For example, SSA/Ps are mainly located in the right colon, where endoscopic access and complete inspection of the mucosa are challenging[58]. Collecting a high number of colonoscopy videos with a high number of SSA/P polyps and cross-linking with patient's data would increase the accuracy and effectiveness of the colonoscopy. Future AI models must incorporate the information of the polyp size and location as well as the clinical, pre-procedural, and polyp morphological characteristics rather than focusing on the polyp images and videos alone.
Prospective real-time studies
The robustness of AI platforms has not been widely estimated in real-time clinical settings through prospective studies. Most studies have been retrospective in design and subject to selection bias. Therefore, the comparison of accuracy between model and endoscopists may falsely deviate in favor of CAD. For example, in CADe, the researcher might exclude unclear colonoscopy or polyp images/videos; a fuzzy or blurred endoscopic view may occur when water or blood obscures the field, or when feces cover the bowel surface preventing a complete examination. There should also be a mixture of polyp-positive and polyp-negative images from abnormal and normal colonoscopies in all training, validation, and test datasets. The development of AI models must be rigorously based on a training dataset that is preferably gathered during real-time colonoscopies. Data should be collected prospectively by both experienced and novice endoscopists to represent the actual state of practice when assessing the model. The elimination of selection bias is most relevant to CADe systems and less so to CADx systems. Studies should be based in several centers to ensure the reproducibility of the results at the testing level. Testing CAD systems in non-academic settings will demonstrate whether the model represents actual real-world practice, where more polyps are missed and/or there is no access to advanced technologies such as NBI. In addition, real-time and multicenter studies may help to clarify the place of AI in the diagnostic process. Prospective studies would provide robust evidence to support the application of CAD and enhance endoscopists’ trust in optical polyp classification[62]. Nevertheless, CAD is still an operator-dependent technology as it is the experienced endoscopists who must provide the annotated datasets for the development of the system, and the accuracy of the AI output relies on the endoscopist presenting a clear endoscopic field to the system. Certain challenges such as prolonged procedure times, high positivity rate, and inability to predict the histology in the presence of feces or blood in the visual field should be mitigated to prevent suboptimal diagnosis. Physicians should continue to follow the recommended procedural measures, including sufficient bowel preparation and photo documentation, to avoid legal and insurance issues.
Researchers should prioritize prospective controlled trials to allow a precise comparison between the settings that use and do not use AI platforms, otherwise, the real benefits of the AI system cannot be determined. Crossover studies, where patients act as their own controls and undergo colonoscopy both with and without AI support would be useful as fewer patients would be needed. In practice, the endoscopist would first detect and classify a polyp before using the AI support system to ensure the accuracy of their classification. This process should be performed in a time-efficient manner as the benefit of AI assistance would be irrelevant if the procedure was significantly prolonged.
Standardization of endpoints
All research evaluating the diagnostic accuracy of CAD systems should use standardized research endpoints derived from the latest guidelines. Similarly to other diagnostic evaluation studies, sensitivity, specificity, PPV, NPV, and AUC must be reported, as well as confusion matrices and mean average precision for multiclass classifications and intersection over union (IoU), or the DICE coefficient for segmentation (i.e., delineation) in particular situations[63,64]. The use of such a comprehensive set of metrics would provide convincing evidence, reassuring physicians about the reliability of AI tools. For example, ADR must be reported for all research related to the evaluation of CADe systems, as such systems aim to achieve complete detection of all colorectal lesions. Similarly, the NPV of CADx systems must be reported to confirm the ability of CADx to achieve the recommended NPV benchmark of ≥ 90% according to the PIVI statement[46]. In addition, for surveillance interval assignment, the agreement between AI-based assignment and that of the histopathology reference standard must reach the ≥ 90% threshold recommended by the PIVI statement[46].
Transparency of AI analyses
We should avoid the black-box phenomenon when the decision-making process of the model by the convolutional neural network cannot be deconvoluted due to the complexity of the process[65,66]. An important aspect of the wide application of AI platforms is the trust that physicians and responsible regulatory officials place in the AI analyses. Research should move toward facilitating extreme transparency in the generation and validation of AI models to avoid hesitancy about their public implementation.
Safety and cost-effectiveness
Finally, as well as CADe and CADx systems, a computer-based support system that aids endoscopists in selecting the most appropriate polypectomy procedure is necessary. Current practice involves the use of forceps to remove diminutive polyps, especially for the resection of polyps up to 2 mm[67]; however, the rate of incomplete resection is lower for the removal of polyps ≥ 3 mm when a snare is used[68]. In addition to providing a suggestion for an appropriate polypectomy device, AI can also help to estimate polyp size, delineate the extent of the lesion and a safe polypectomy margin, and identify post-resection lesion remnants that indicate an incomplete resection and the need for further tissue removal at colonoscopy follow-up. The goal of this system is to provide a complete polypectomy that will reduce the risk of interval cancer, as about 30% of all interval cancers are thought to be caused by incomplete resection of CRC precursors[11,69,70].
In addition to addressing the challenges associated with the development of reliable AI models that can be confidently employed in routine practice with high efficacy, research is needed to assess the cost-effectiveness of these systems related to the reduction in the number of patients diagnosed with interval cancer, reduction in the number of unnecessary pathology evaluations for low-confidence predictions of polyp histology by optical diagnosis, and facilitation of efficient physician-patient communication concerning future clinical arrangements.
Adapting the newly developed AI-based techniques in routine practice and enhancing endoscopists’ trust in the new devices is only possible by a symbiotic relationship between academia and industry. It would facilitate obtaining regulatory approval from health authorities regarding research involving human subjects, constructing large “ground truth” data for developing AI models, and transporting knowledge and technology to ultimately access the market[71]. Several manufacturers have obtained the regulatory approvals to launch and commercialize their AI-based colonoscopy devices around the world (Table 4); however, many of them have not provided a detailed report of their devices’ performance. Further research should try to compare the performance of different AI-based systems in real-time settings by conducting prospective controlled trials with multiple intervention arms sing different commercially available AI-based colonoscopy systems. Due to the time- and cost-consuming nature of these studies, an alternative method for accelerating research is to test the “benchmarks” using the publicly available datasets such as the ASU-Mayo colonoscopy video database[29], the CVC-ClinicDB database[28], the Kvasir dataset[72], and the ETIS-Larib Polyp database. Nonetheless, these datasets contain a limited number of colonoscopy videos and images and may not reflect the true performance of an AI-based system.
Table 4.
Computer assissted system
|
Product
|
Manufacturer
|
Year of regulatory approval
|
Place of regulatory approval
|
CADx | EndoBRAIN | Cybernet System Corp./Olympus Corp. | 2018 | Japan |
CADe | GI Genius | Medtronic Corp. | 2019 in Europe; 2021 in United States | Europe/United States |
CADe | ENDO-AID | Olympus Corp. | 2020 | Europe |
CADe/CADx | CAD EYE | Fujifilm Corp. | 2020 | Europe/Japan |
CADe | DISCOVERY | Pentax Corp. | 2020 | Europe |
CADe | EndoBRAIN-EYE | Cybernet System Corp./Olympus Corp. | 2020 | Japan |
CADe | EndoAngel | Wuhan EndoAngel Medical Technology Company | 2020 | China |
CADe | EndoScreener | WISION A.I. | 2020 | China |
CADx | EndoBRAIN-PLUS | Cybernet System Corp./Olympus Corp. | 2020 | Japan |
CADx | EndoBRAIN-UC | Cybernet System Corp./Olympus Corp. | 2020 | Japan |
CADe | WISE VISION | NEC Corp. | 2021 | Europe/Japan |
CADe | ME-APDS | Magentiq Eye | 2021 | Europe |
CADe | CADDIE | Odin Vision | 2021 | Europe |
CADe: Computer-assisted detection system; CADx: Computer-assisted diagnosis system.
CONCLUSION
AI research is a rapidly evolving discipline that promises to enhance physicians’ performance. AI models have demonstrated the ability to compete with and outperform endoscopists, suggesting that all endoscopists would benefit from becoming familiar with CAD technology and comfortable with the integration of AI-assisted devices in colonoscopy practice. The decision support systems are being offered as reliable tools for the detection and classification of colorectal polyps, with the primary aim of outperforming endoscopists by detecting all CRC precursors; however, the new era of AI platforms has seen attempts to establish considerably more complex systems, in which the detection and classification of polyps are supported. Despite the recent achievements in designing and validating such systems, the current lack of AI-assisted systems that support endoscopists in monitoring colonoscopy quality, and that automatically annotate colonoscopy videos, suggest appropriate polypectomy devices, and indicate the completeness of polypectomy, limits the role of AI in colonoscopy practice. Through the integration of the most recent advances in computer science into colonoscopy practice, it appears possible to improve the quality of diagnosis, treatment, and screening in patients. However, AI platforms are still in their infancy in terms of clinical establishment and require much more exploration and innovation. They must be trusted by all physicians, regulatory organizations responsible for approval for clinical use, and patients. The AI-assisted colonoscopy is highly dependent on the endoscopist, who must attempt to present the clearest possible image or video to the AI model for analysis, and then take account of other concurrent patient factors such as the family history of CRC or the results of previous colonoscopies. The human qualities of respect and empathy must be apparent when communicating with patients to overcome any mistrust or reservations patients may have toward the new technology. Therefore, at the current stage of AI development, AI models can only “serve as a second observer, or a concurrent observer, but not an independent decision-maker”[73].
Footnotes
Conflict-of-interest statement: Mahsa Taghiakbari has no conflicts of interest relevant to this paper to disclose. Yuichi Mori is a consultant for the Olympus corporation and receives a speaking fee from the same corporation. Yuichi Mori has an ownership interest in Cybernet System Corporation. Daniel von Renteln is supported by the “Fonds de Recherche du Québec Santé” career development award and has received research funding from ERBE, Ventage, Pendopharm, and Pentax; he is also a consultant for Boston Scientific and Pendopharm.
Provenance and peer review: Invited article; Externally peer reviewed.
Peer-review model: Single blind
Peer-review started: March 19, 2021
First decision: August 9, 2021
Article in press: December 7, 2021
Specialty type: Gastroenterology and hepatology
Country/Territory of origin: Canada
Peer-review report’s scientific quality classification
Grade A (Excellent): A
Grade B (Very good): 0
Grade C (Good): 0
Grade D (Fair): 0
Grade E (Poor): 0
P-Reviewer: Shaikh DH S-Editor: Fan JR L-Editor: A P-Editor: Fan JR
Contributor Information
Mahsa Taghiakbari, Department of Gastroenterology, CRCHUM, Montreal H2X 0A9, Quebec, Canada.
Yuichi Mori, Clinical Effectiveness Research Group, University of Oslo, Oslo 0450, Norway; Digestive Disease Center, Showa University Northern Yokohama Hospital, Yokohama 224-8503, Japan.
Daniel von Renteln, Department of Gastroenterology, CRCHUM, Montreal H2X 0A9, Quebec, Canada. danielrenteln@gmail.com.
References
- 1.Rawla P, Sunkara T, Barsouk A. Epidemiology of colorectal cancer: incidence, mortality, survival, and risk factors. Prz Gastroenterol. 2019;14:89–103. doi: 10.5114/pg.2018.81072. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Yabroff KR, Lund J, Kepka D, Mariotto A. Economic burden of cancer in the United States: estimates, projections, and future research. Cancer Epidemiol Biomarkers Prev. 2011;20:2006–2014. doi: 10.1158/1055-9965.EPI-11-0650. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Zauber AG, Winawer SJ, O'Brien MJ, Lansdorp-Vogelaar I, van Ballegooijen M, Hankey BF, Shi W, Bond JH, Schapiro M, Panish JF, Stewart ET, Waye JD. Colonoscopic polypectomy and long-term prevention of colorectal-cancer deaths. N Engl J Med. 2012;366:687–696. doi: 10.1056/NEJMoa1100370. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Rex DK, Boland CR, Dominitz JA, Giardiello FM, Johnson DA, Kaltenbach T, Levin TR, Lieberman D, Robertson DJ. Colorectal Cancer Screening: Recommendations for Physicians and Patients From the U.S. Multi-Society Task Force on Colorectal Cancer. Gastroenterology. 2017;153:307–323. doi: 10.1053/j.gastro.2017.05.013. [DOI] [PubMed] [Google Scholar]
- 5.Brenner H, Chang-Claude J, Seiler CM, Rickert A, Hoffmeister M. Protection from colorectal cancer after colonoscopy: a population-based, case-control study. Ann Intern Med. 2011;154:22–30. doi: 10.7326/0003-4819-154-1-201101040-00004. [DOI] [PubMed] [Google Scholar]
- 6.Brenner H, Chang-Claude J, Jansen L, Knebel P, Stock C, Hoffmeister M. Reduced risk of colorectal cancer up to 10 years after screening, surveillance, or diagnostic colonoscopy. Gastroenterology. 2014;146:709–717. doi: 10.1053/j.gastro.2013.09.001. [DOI] [PubMed] [Google Scholar]
- 7.Baxter NN, Goldwasser MA, Paszat LF, Saskin R, Urbach DR, Rabeneck L. Association of colonoscopy and death from colorectal cancer. Ann Intern Med. 2009;150:1–8. doi: 10.7326/0003-4819-150-1-200901060-00306. [DOI] [PubMed] [Google Scholar]
- 8.Doubeni CA, Corley DA, Quinn VP, Jensen CD, Zauber AG, Goodman M, Johnson JR, Mehta SJ, Becerra TA, Zhao WK, Schottinger J, Doria-Rose VP, Levin TR, Weiss NS, Fletcher RH. Effectiveness of screening colonoscopy in reducing the risk of death from right and left colon cancer: a large community-based study. Gut. 2018;67:291–298. doi: 10.1136/gutjnl-2016-312712. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Bartel MJ, Brahmbhatt BS, Wallace MB. Management of colorectal T1 carcinoma treated by endoscopic resection from the Western perspective. Dig Endosc. 2016;28:330–341. doi: 10.1111/den.12598. [DOI] [PubMed] [Google Scholar]
- 10.Corley DA, Jensen CD, Marks AR, Zhao WK, Lee JK, Doubeni CA, Zauber AG, de Boer J, Fireman BH, Schottinger JE, Quinn VP, Ghai NR, Levin TR, Quesenberry CP. Adenoma detection rate and risk of colorectal cancer and death. N Engl J Med. 2014;370:1298–1306. doi: 10.1056/NEJMoa1309086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Kim KO, Huh KC, Hong SP, Kim WH, Yoon H, Kim SW, Kim YS, Park JH, Lee J, Lee BJ, Park YS. Frequency and Characteristics of Interval Colorectal Cancer in Actual Clinical Practice: A KASID Multicenter Study. Gut Liver. 2018;12:537–543. doi: 10.5009/gnl17485. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Anderson JC, Butterly LF. Colonoscopy: quality indicators. Clin Transl Gastroenterol. 2015;6:e77. doi: 10.1038/ctg.2015.5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Quirke P, Risio M, Lambert R, von Karsa L, Vieth M. Quality assurance in pathology in colorectal cancer screening and diagnosis—European recommendations. Virchows Arch. 2011;458:1–19. doi: 10.1007/s00428-010-0977-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Rex DK. Can we do resect and discard with artificial intelligence-assisted colon polyp “optical biopsy? Tech Innova Gastrointes Endosc. 2020;22:52–55. [Google Scholar]
- 15.ASGE Technology Committee, Abu Dayyeh BK, Thosani N, Konda V, Wallace MB, Rex DK, Chauhan SS, Hwang JH, Komanduri S, Manfredi M, Maple JT, Murad FM, Siddiqui UD, Banerjee S. ASGE Technology Committee systematic review and meta-analysis assessing the ASGE PIVI thresholds for adopting real-time endoscopic assessment of the histology of diminutive colorectal polyps. Gastrointest Endosc. 2015;81:502.e1–502.e16. doi: 10.1016/j.gie.2014.12.022. [DOI] [PubMed] [Google Scholar]
- 16.Willems P, Djinbachian R, Ditisheim S, Orkut S, Pohl H, Barkun A, Bouin M, Faulques B, von Renteln D. Uptake and barriers for implementation of the resect and discard strategy: an international survey. Endosc Int Open. 2020;8:E684–E692. doi: 10.1055/a-1132-5371. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.van der Sommen F, de Groof J, Struyvenberg M, van der Putten J, Boers T, Fockens K, Schoon EJ, Curvers W, de With P, Mori Y, Byrne M, Bergman JJGHM. Machine learning in GI endoscopy: practical guidance in how to interpret a novel field. Gut. 2020;69:2035–2045. doi: 10.1136/gutjnl-2019-320466. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Ahmad OF, Soares AS, Mazomenos E, Brandao P, Vega R, Seward E, Stoyanov D, Chand M, Lovat LB. Artificial intelligence and computer-aided diagnosis in colonoscopy: current evidence and future directions. Lancet Gastroenterol Hepatol. 2019;4:71–80. doi: 10.1016/S2468-1253(18)30282-6. [DOI] [PubMed] [Google Scholar]
- 19.Schachschal G, Sehner S, Choschzick M, Aust D, Brandl L, Vieth M, Wegscheider K, Baretton GB, Kirchner T, Sauter G, Rösch T. Impact of reassessment of colonic hyperplastic polyps by expert GI pathologists. Int J Colorectal Dis. 2016;31:675–683. doi: 10.1007/s00384-016-2523-8. [DOI] [PubMed] [Google Scholar]
- 20.Vleugels JLA, Hazewinkel Y, Dijkgraaf MGW, Koens L, Fockens P, Dekker E, group Ds. Optical diagnosis expanded to small polyps: post-hoc analysis of diagnostic performance in a prospective multicenter study. Endoscopy. 2019;51:244–252. doi: 10.1055/a-0759-1605. [DOI] [PubMed] [Google Scholar]
- 21.Anderson JC, Lisovsky M, Greene MA, Hagen C, Srivastava A. Factors Associated With Classification of Hyperplastic Polyps as Sessile Serrated Adenomas/Polyps on Morphologic Review. J Clin Gastroenterol. 2018;52:524–529. doi: 10.1097/MCG.0000000000000840. [DOI] [PubMed] [Google Scholar]
- 22.Baldin RKS, Anselmi Júnior RA, Azevedo M, Sebastião APM, Montemor M, Tullio LF, Soares LFdP, Noronha Ld. Interobserver variability in histological diagnosis of serrated colorectal polyps. J Coloprocto (Rio de Janeiro) 2015;35:193–197. [Google Scholar]
- 23.Lipton ZC, Steinhardt J. Troubling trends in machine learning scholarship. 2018 Preprint. Available from: 10.1145/3316774.
- 24.Maroulis DE, Iakovidis DK, Karkanis SA, Karras DA. CoLD: a versatile detection system for colorectal lesions in endoscopy video-frames. Comput Methods Programs Biomed. 2003;70:151–166. doi: 10.1016/s0169-2607(02)00007-x. [DOI] [PubMed] [Google Scholar]
- 25.Karkanis SA, Iakovidis DK, Maroulis DE, Karras DA, Tzivras M. Computer-aided tumor detection in endoscopic video using color wavelet features. IEEE Trans Inf Technol Biomed. 2003;7:141–152. doi: 10.1109/titb.2003.813794. [DOI] [PubMed] [Google Scholar]
- 26.Hwang S, Oh J, Tavanapong W, Wong J, Groen PCd. Polyp Detection in Colonoscopy Video using Elliptical Shape Feature. 2007 IEEE International Conference on Image Processing; 2007: 465–468. [Google Scholar]
- 27.Park SY, Sargent D, Spofford I, Vosburgh KG, Y AR. A colon video analysis framework for polyp detection. IEEE Trans Biomed Eng. 2012;59:1408–1418. doi: 10.1109/TBME.2012.2188397. [DOI] [PubMed] [Google Scholar]
- 28.Bernal J, Sánchez FJ, Fernández-Esparrach G, Gil D, Rodríguez C, Vilariño F. WM-DOVA maps for accurate polyp highlighting in colonoscopy: Validation vs. saliency maps from physicians. Comput Med Imaging Graph. 2015;43:99–111. doi: 10.1016/j.compmedimag.2015.02.007. [DOI] [PubMed] [Google Scholar]
- 29.Tajbakhsh N, Gurudu SR, Liang JJ. Automated polyp detection in colonoscopy videos using shape and context information. IEEE Trans Med Imag. 2015;35:630–644. doi: 10.1109/TMI.2015.2487997. [DOI] [PubMed] [Google Scholar]
- 30.Wang Y, Tavanapong W, Wong J, Oh JH, de Groen PC. Polyp-Alert: near real-time feedback during colonoscopy. Comput Meth Prog Bio . 2015;120:164–179. doi: 10.1016/j.cmpb.2015.04.002. [DOI] [PubMed] [Google Scholar]
- 31.Angermann Q, Bernal J, J , Sánchez-Montes C, Hammami M, Fernández-Esparrach G, Dray X, Romain O, Sanchez J, Histace A. Real-Time Polyp Detection in Colonoscopy Videos. Int J Computer Rad Sur. 2017:06–20;. [Google Scholar]
- 32.k G, c R. Automatic Colorectal Polyp Detection in Colonoscopy Video Frames. Asian Pac J Cancer Prev. 2016;17:4869–4873. doi: 10.22034/APJCP.2016.17.11.4869. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Yamada M, Saito Y, Imaoka H, Saiko M, Yamada S, Kondo H, Takamaru H, Sakamoto T, Sese J, Kuchiba A, Shibata T, Hamamoto R. Development of a real-time endoscopic image diagnosis support system using deep learning technology in colonoscopy. Sci Rep. 2019;9:14465. doi: 10.1038/s41598-019-50567-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Misawa M, Kudo SE, Mori Y, Cho T, Kataoka S, Yamauchi A, Ogawa Y, Maeda Y, Takeda K, Ichimasa K, Nakamura H, Yagawa Y, Toyoshima N, Ogata N, Kudo T, Hisayuki T, Hayashi T, Wakamura K, Baba T, Ishida F, Itoh H, Roth H, Oda M, Mori K. Artificial Intelligence-Assisted Polyp Detection for Colonoscopy: Initial Experience. Gastroenterology. 2018;154:2027–2029.e3. doi: 10.1053/j.gastro.2018.04.003. [DOI] [PubMed] [Google Scholar]
- 35.Urban G, Tripathi P, Alkayali T, Mittal M, Jalali F, Karnes W, Baldi P. Deep Learning Localizes and Identifies Polyps in Real Time With 96% Accuracy in Screening Colonoscopy. Gastroenterology. 2018;155:1069–1078.e8. doi: 10.1053/j.gastro.2018.06.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Wang P, Berzin TM, Glissen Brown JR, Bharadwaj S, Becq A, Xiao X, Liu P, Li L, Song Y, Zhang D, Li Y, Xu G, Tu M, Liu X. Real-time automatic detection system increases colonoscopic polyp and adenoma detection rates: a prospective randomised controlled study. Gut. 2019;68:1813–1819. doi: 10.1136/gutjnl-2018-317500. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Klare P, Sander C, Prinzen M, Haller B, Nowack S, Abdelhafez M, Poszler A, Brown H, Wilhelm D, Schmid RM, von Delius S, Wittenberg T. Automated polyp detection in the colorectum: a prospective study (with videos) Gastrointest Endosc. 2019;89:576–582.e1. doi: 10.1016/j.gie.2018.09.042. [DOI] [PubMed] [Google Scholar]
- 38.Tischendorf JJ, Gross S, Winograd R, Hecker H, Auer R, Behrens A, Trautwein C, Aach T, Stehle T. Computer-aided classification of colorectal polyps based on vascular patterns: a pilot study. Endoscopy. 2010;42:203–207. doi: 10.1055/s-0029-1243861. [DOI] [PubMed] [Google Scholar]
- 39.Tamaki T, Yoshimuta J, Kawakami M, Raytchev B, Kaneda K, Yoshida S, Takemura Y, Onji K, Miyaki R, Tanaka S. Computer-aided colorectal tumor classification in NBI endoscopy using local features. Med Image Anal. 2013;17:78–100. doi: 10.1016/j.media.2012.08.003. [DOI] [PubMed] [Google Scholar]
- 40.Hirakawa T, Tamaki T, Raytchev B, Kaneda K, Koide T, Kominami Y, Yoshida S, Tanaka S. SVM-MRF segmentation of colorectal NBI endoscopic images. Annu Int Conf IEEE Eng Med Biol Soc. 2014;2014:4739–4742. doi: 10.1109/EMBC.2014.6944683. [DOI] [PubMed] [Google Scholar]
- 41.Häfner M, Tamaki T, Tanaka S, Uhl A, Wimmer G, Yoshida S. Local fractal dimension based approaches for colonic polyp classification. Med Image Anal. 2015;26:92–107. doi: 10.1016/j.media.2015.08.007. [DOI] [PubMed] [Google Scholar]
- 42.Wimmer G, Tamaki T, Tischendorf JJ, Häfner M, Yoshida S, Tanaka S, Uhl A. Directional wavelet based features for colonic polyp classification. Med Image Anal. 2016;31:16–36. doi: 10.1016/j.media.2016.02.001. [DOI] [PubMed] [Google Scholar]
- 43.Okamoto T, Koide T, Sugi K, Shimizu T, Anh-Tuan H, Tamaki T, Raytchev B, Kaneda K, Kominami Y, Yoshida S, Mieno H, Tanaka S. Image segmentation of pyramid style identifier based on Support Vector Machine for colorectal endoscopic images. Annu Int Conf IEEE Eng Med Biol Soc. 2015;2015:2997–3000. doi: 10.1109/EMBC.2015.7319022. [DOI] [PubMed] [Google Scholar]
- 44.Chen PJ, Lin MC, Lai MJ, Lin JC, Lu HH, Tseng VS. Accurate Classification of Diminutive Colorectal Polyps Using Computer-Aided Analysis. Gastroenterology. 2018;154:568–575. doi: 10.1053/j.gastro.2017.10.010. [DOI] [PubMed] [Google Scholar]
- 45.Byrne MF, Chapados N, Soudan F, Oertel C, Linares Pérez M, Kelly R, Iqbal N, Chandelier F, Rex DK. Real-time differentiation of adenomatous and hyperplastic diminutive colorectal polyps during analysis of unaltered videos of standard colonoscopy using a deep learning model. Gut. 2019;68:94–100. doi: 10.1136/gutjnl-2017-314547. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Rex DK, Kahi C, O'Brien M, Levin TR, Pohl H, Rastogi A, Burgart L, Imperiale T, Ladabaum U, Cohen J, Lieberman DA. The American Society for Gastrointestinal Endoscopy PIVI (Preservation and Incorporation of Valuable Endoscopic Innovations) on real-time endoscopic assessment of the histology of diminutive colorectal polyps. Gastrointest Endosc. 2011;73:419–422. doi: 10.1016/j.gie.2011.01.023. [DOI] [PubMed] [Google Scholar]
- 47.Aihara H, Saito S, Inomata H, Ide D, Tamai N, Ohya TR, Kato T, Amitani S, Tajiri H. Computer-aided diagnosis of neoplastic colorectal lesions using 'real-time' numerical color analysis during autofluorescence endoscopy. Eur J Gastroenterol Hepatol. 2013;25:488–494. doi: 10.1097/MEG.0b013e32835c6d9a. [DOI] [PubMed] [Google Scholar]
- 48.Mori Y, Kudo SE, Misawa M, Saito Y, Ikematsu H, Hotta K, Ohtsuka K, Urushibara F, Kataoka S, Ogawa Y, Maeda Y, Takeda K, Nakamura H, Ichimasa K, Kudo T, Hayashi T, Wakamura K, Ishida F, Inoue H, Itoh H, Oda M, Mori K. Real-Time Use of Artificial Intelligence in Identification of Diminutive Polyps During Colonoscopy: A Prospective Study. Ann Intern Med. 2018;169:357–366. doi: 10.7326/M18-0249. [DOI] [PubMed] [Google Scholar]
- 49.Kuiper T, Alderlieste YA, Tytgat KM, Vlug MS, Nabuurs JA, Bastiaansen BA, Löwenberg M, Fockens P, Dekker E. Automatic optical diagnosis of small colorectal lesions by laser-induced autofluorescence. Endoscopy. 2015;47:56–62. doi: 10.1055/s-0034-1378112. [DOI] [PubMed] [Google Scholar]
- 50.Rath T, Tontini GE, Vieth M, Nägel A, Neurath MF, Neumann H. In vivo real-time assessment of colorectal polyp histology using an optical biopsy forceps system based on laser-induced fluorescence spectroscopy. Endoscopy. 2016;48:557–562. doi: 10.1055/s-0042-102251. [DOI] [PubMed] [Google Scholar]
- 51.Byrne MF, Soudan F, Henkel M, Oertel C, Chapados N, Echagüe FJ, Ghalehjegh SH, Guizard N, Giguère S, MacPhail MEJGE. Mo1679 real-time artificial intelligence “full colonoscopy workflow” for automatic detection followed by optical biopsy of colorectal polyps. Gas Endoscopy . 2018;87:AB475. [Google Scholar]
- 52.Stanek SR, Tavanapong W, Wong J, Oh J, Nawarathna RD, Muthukudage J, de Groen PC. SAPPHIRE: a toolkit for building efficient stream programs for medical video analysis. Comput Methods Programs Biomed. 2013;112:407–421. doi: 10.1016/j.cmpb.2013.07.028. [DOI] [PubMed] [Google Scholar]
- 53.Filip D, Gao X, Angulo-Rodríguez L, Mintchev MP, Devlin SM, Rostom A, Rosen W, Andrews CN. Colometer: a real-time quality feedback system for screening colonoscopy. World J Gastroenterol. 2012;18:4270–4277. doi: 10.3748/wjg.v18.i32.4270. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Wang Y, Tavanapong W, Wong JS, Oh J, de Groen PC. Detection of quality visualization of appendiceal orifices using local edge cross-section profile features and near pause detection. IEEE Trans Biomed Eng. 2010;57:685–695. doi: 10.1109/TBME.2009.2034466. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Tsuda S, Veress B, Toth E, Fork FT. Flat and depressed colorectal tumours in a southern Swedish population: a prospective chromoendoscopic and histopathological study. Gut . 2002;51:550–555. doi: 10.1136/gut.51.4.550. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Fidler JL, Johnson CD, MacCarty RL, Welch TJ, Hara AK, Harmsen WS. Detection of flat lesions in the colon with CT colonography. Abdom Imaging. 2002;27:292–300. doi: 10.1007/s00261-001-0171-z. [DOI] [PubMed] [Google Scholar]
- 57.JE IJ, de Wit K, van der Vlugt M, Bastiaansen BA, Fockens P, Dekker E. Prevalence, distribution and risk of sessile serrated adenomas/polyps at a center with a high adenoma detection rate and experienced pathologists. Endoscopy . 2016;48:740–746. doi: 10.1055/s-0042-105436. [DOI] [PubMed] [Google Scholar]
- 58.Rex DK, Ahnen DJ, Baron JA, Batts KP, Burke CA, Burt RW, Goldblum JR, Guillem JG, Kahi CJ, Kalady MF, O'Brien MJ, Odze RD, Ogino S, Parry S, Snover DC, Torlakovic EE, Wise PE, Young J, Church J. Serrated lesions of the colorectum: review and recommendations from an expert panel. Am J Gastroenterol . 2012;107:1315–1329. doi: 10.1038/ajg.2012.161. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Niv Y. Changing pathological diagnosis from hyperplastic polyp to sessile serrated adenoma: systematic review and meta-analysis. Eur J Gastroenterol Hepatol. 2017;29:1327–1331. doi: 10.1097/MEG.0000000000000994. [DOI] [PubMed] [Google Scholar]
- 60.West NP, Langman G, Haboubi N, Carey F, Henry J, Morgan M, Sheahan K. Significant polyps and early colorectal cancer: the importance of high-quality standardized histopathology. Colorectal Dis. 2019;21 Suppl 1:53–56. doi: 10.1111/codi.14506. [DOI] [PubMed] [Google Scholar]
- 61.Kudo SE, Mori Y, Misawa M, Takeda K, Kudo T, Itoh H, Oda M, Mori K. Artificial intelligence and colonoscopy: Current status and future perspectives. Dig Endosc. 2019;31:363–371. doi: 10.1111/den.13340. [DOI] [PubMed] [Google Scholar]
- 62.East JE, Rees CJ. Making optical biopsy a clinical reality in colonoscopy. Lancet Gastroenterol Hepatol. 2018;3:10–12. doi: 10.1016/S2468-1253(17)30366-7. [DOI] [PubMed] [Google Scholar]
- 63.Bossuyt PM, Reitsma JB, Bruns DE, Gatsonis CA, Glasziou PP, Irwig L, Lijmer JG, Moher D, Rennie D, de Vet HC, Kressel HY, Rifai N, Golub RM, Altman DG, Hooft L, Korevaar DA, Cohen JF Group S. STARD 2015: an updated list of essential items for reporting diagnostic accuracy studies. BMJ. 2015;351:h5527. doi: 10.1136/bmj.h5527. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Dice LR. Measures of the Amount of Ecologic Association Between Species. Ecology . 1945;26:297–302. [Google Scholar]
- 65.Montavon G, Samek W, Müller K-R. Methods for interpreting and understanding deep neural networks. Dig Sign Proce. 2018;73:1–15. [Google Scholar]
- 66.Japkowicz N, Shah M. Evaluating Learning Algorithms: A Classification Perspective: Cambridge University Press; 2011. [Google Scholar]
- 67.Fyock CJ, Draganov PV. Colonoscopic polypectomy and associated techniques. World J Gastroenterol. 2010;16:3630–3637. doi: 10.3748/wjg.v16.i29.3630. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Shaukat A, Kaltenbach T, Dominitz JA, Robertson DJ, Anderson JC, Cruise M, Burke CA, Gupta S, Lieberman D, Syngal S, Rex DK. Endoscopic Recognition and Management Strategies for Malignant Colorectal Polyps: Recommendations of the US Multi-Society Task Force on Colorectal Cancer. Gastroenterology. 2020;159:1916–1934.e2. doi: 10.1053/j.gastro.2020.08.050. [DOI] [PubMed] [Google Scholar]
- 69.Samadder NJ, Curtin K, Tuohy TM, Pappas L, Boucher K, Provenzale D, Rowe KG, Mineau GP, Smith K, Pimentel R, Kirchhoff AC, Burt RW. Characteristics of missed or interval colorectal cancer and patient survival: a population-based study. Gastroenterology . 2014;146:950–960. doi: 10.1053/j.gastro.2014.01.013. [DOI] [PubMed] [Google Scholar]
- 70.Farrar WD, Sawhney MS, Nelson DB, Lederle FA, Bond JH. Colorectal cancers found after a complete colonoscopy. Clin Gastroenterol Hepatol. 2006;4:1259–1264. doi: 10.1016/j.cgh.2006.07.012. [DOI] [PubMed] [Google Scholar]
- 71.Mori Y, Neumann H, Misawa M, Kudo S-e, Bretthauer M. Artificial intelligence in colonoscopy - Now on the market. What's next? J Gastroenterol Hepatol . 2021;36:7–11. doi: 10.1111/jgh.15339. [DOI] [PubMed] [Google Scholar]
- 72.Pogorelov K, Randel KR, Griwodz C, Eskeland SL, de Lange T, Johansen D, Spampinato C, Dang-Nguyen DT, Lux M, Schmidt PT. Kvasir: A multi-class image dataset for computer aided gastrointestinal disease detection. Proceedings of the 8th ACM on Multimedia Systems Conference; 2017: 164-169. [Google Scholar]
- 73.Fernández-Esparrach G, Bernal J, López-Cerón M, Córdova H, Sánchez-Montes C, Rodríguez de Miguel C, Sánchez FJ. Exploring the clinical potential of an automatic colonic polyp detection method based on the creation of energy maps. Endoscopy. 2016;48:837–842. doi: 10.1055/s-0042-108434. [DOI] [PubMed] [Google Scholar]
- 74.Wang P, Liu X, Berzin TM, Glissen Brown JR, Liu P, Zhou C, Lei L, Li L, Guo Z, Lei S, Xiong F, Wang H, Song Y, Pan Y, Zhou G. Effect of a deep-learning computer-aided detection system on adenoma detection during colonoscopy (CADe-DB trial): a double-blind randomised study. Lancet Gastroenterol Hepatol. 2020;5:343–351. doi: 10.1016/S2468-1253(19)30411-X. [DOI] [PubMed] [Google Scholar]
- 75.Su JR, Li Z, Shao XJ, Ji CR, Ji R, Zhou RC, Li GC, Liu GQ, He YS, Zuo XL, Li YQ. Impact of a real-time automatic quality control system on colorectal polyp and adenoma detection: a prospective randomized controlled study (with videos) Gastrointest Endosc. 2020;91:415–424.e4. doi: 10.1016/j.gie.2019.08.026. [DOI] [PubMed] [Google Scholar]
- 76.Gong D, Wu L, Zhang J, Mu G, Shen L, Liu J, Wang Z, Zhou W, An P, Huang X, Jiang X, Li Y, Wan X, Hu S, Chen Y, Hu X, Xu Y, Zhu X, Li S, Yao L, He X, Chen D, Huang L, Wei X, Wang X, Yu H. Detection of colorectal adenomas with a real-time computer-aided system (ENDOANGEL): a randomised controlled study. Lancet Gastroenterol Hepatol. 2020;5:352–361. doi: 10.1016/S2468-1253(19)30413-3. [DOI] [PubMed] [Google Scholar]
- 77.Liu WN, Zhang YY, Bian XQ, Wang LJ, Yang Q, Zhang XD, Huang J. Study on detection rate of polyps and adenomas in artificial-intelligence-aided colonoscopy. Saudi J Gastroenterol. 2020;26:13–19. doi: 10.4103/sjg.SJG_377_19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Luo Y, Zhang Y, Liu M, Lai Y, Liu P, Wang Z, Xing T, Huang Y, Li Y, Li A, Wang Y, Luo X, Liu S, Han Z. Artificial Intelligence-Assisted Colonoscopy for Detection of Colon Polyps: a Prospective, Randomized Cohort Study. J Gastrointest Surg. 2021;25:2011–2018. doi: 10.1007/s11605-020-04802-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Repici A, Badalamenti M, Maselli R, Correale L, Radaelli F, Rondonotti E, Ferrara E, Spadaccini M, Alkandari A, Fugazza A, Anderloni A, Galtieri PA, Pellegatta G, Carrara S, Di Leo M, Craviotto V, Lamonaca L, Lorenzetti R, Andrealli A, Antonelli G, Wallace M, Sharma P, Rosch T, Hassan C. Efficacy of Real-Time Computer-Aided Detection of Colorectal Neoplasia in a Randomized Trial. Gastroenterology. 2020;159:512–520.e7. doi: 10.1053/j.gastro.2020.04.062. [DOI] [PubMed] [Google Scholar]
- 80.Wang P, Liu P, Glissen Brown JR, Berzin TM, Zhou G, Lei S, Liu X, Li L, Xiao X. Lower Adenoma Miss Rate of Computer-Aided Detection-Assisted Colonoscopy vs Routine White-Light Colonoscopy in a Prospective Tandem Study. Gastroenterology. 2020;159:1252–1261.e5. doi: 10.1053/j.gastro.2020.06.023. [DOI] [PubMed] [Google Scholar]
- 81.Park SY, Sargent D. Colonoscopic polyp detection using convolutional neural networks. SPIE Med Imag. 2016 [Google Scholar]
- 82.Yu LQ, Chen H, Dou Q, Qin J, Heng PA. Integrating Online and Offline Three-Dimensional Deep Learning for Automated Polyp Detection in Colonoscopy Videos. IEEE J Biomed Health Inform. 2017;21:65–75. doi: 10.1109/JBHI.2016.2637004. [DOI] [PubMed] [Google Scholar]
- 83.Billah M, Waheed S, Rahman MM. An Automatic Gastrointestinal Polyp Detection System in Video Endoscopy Using Fusion of Color Wavelet and Convolutional Neural Network Features. Int J Biomed Imaging. 2017;2017:9545920. doi: 10.1155/2017/9545920. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Zhang R, Zheng Y, Mak TW, Yu R, Wong SH, Lau JY, Poon CC. Automatic Detection and Classification of Colorectal Polyps by Transferring Low-Level CNN Features From Nonmedical Domain. IEEE J Biomed Health Inform. 2017;21:41–47. doi: 10.1109/JBHI.2016.2635662. [DOI] [PubMed] [Google Scholar]
- 85.Wang P, Xiao X, Glissen Brown JR, Berzin TM, Tu M, Xiong F, Hu X, Liu P, Song Y, Zhang D, Yang X, Li L, He J, Yi X, Liu J, Liu X. Development and validation of a deep-learning algorithm for the detection of polyps during colonoscopy. Nat Biomed Eng. 2018;2:741–748. doi: 10.1038/s41551-018-0301-3. [DOI] [PubMed] [Google Scholar]
- 86.Ozawa T, Ishihara S, Fujishiro M, Kumagai Y, Shichijo S, Tada T. Automated endoscopic detection and classification of colorectal polyps using convolutional neural networks. Therap Adv Gastroenterol. 2020;13:1756284820910659. doi: 10.1177/1756284820910659. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Mori Y, Kudo SE, Wakamura K, Misawa M, Ogawa Y, Kutsukawa M, Kudo T, Hayashi T, Miyachi H, Ishida F, Inoue H. Novel computer-aided diagnostic system for colorectal lesions by using endocytoscopy (with videos) Gastrointest Endosc. 2015;81:621–629. doi: 10.1016/j.gie.2014.09.008. [DOI] [PubMed] [Google Scholar]
- 88.Song EM, Park B, Ha CA, Hwang SW, Park SH, Yang DH, Ye BD, Myung SJ, Yang SK, Kim N, Byeon JS. Endoscopic diagnosis and treatment planning for colorectal polyps using a deep-learning model. Sci Rep. 2020;10:30. doi: 10.1038/s41598-019-56697-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Kudo SE, Misawa M, Mori Y, Hotta K, Ohtsuka K, Ikematsu H, Saito Y, Takeda K, Nakamura H, Ichimasa K, Ishigaki T, Toyoshima N, Kudo T, Hayashi T, Wakamura K, Baba T, Ishida F, Inoue H, Itoh H, Oda M, Mori K. Artificial Intelligence-assisted System Improves Endoscopic Identification of Colorectal Neoplasms. Clin Gastroenterol Hepatol. 2020;18:1874–1881.e2. doi: 10.1016/j.cgh.2019.09.009. [DOI] [PubMed] [Google Scholar]