Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2025 Apr 18.
Published in final edited form as: Gastrointest Endosc Clin N Am. 2020 Apr 11;30(3):585–595. doi: 10.1016/j.giec.2020.02.010

How Artificial Intelligence Will Impact Colonoscopy and Colorectal Screening

Dennis L Shung a, Michael F Byrne b,*
PMCID: PMC12007662  NIHMSID: NIHMS2060882  PMID: 32439090

INTRODUCTION

Colorectal cancer is a major cause of cancer-related death, and colonoscopy with complete resection of neoplastic lesions has been suggested to reduce both incidence and mortality of colorectal cancer.1,2 Quality of colonoscopies in detecting malignancies varies, and outpatient colonoscopies in surgery centers cost the most for all procedures performed in ambulatory surgery centers, an estimated $3.2 billion in the United States in 2018.3 In the American Board of Internal Medicine’s “Choosing Wisely” campaign, one of the areas of endoscopic overuse is surveillance colonoscopy in individuals with low-risk polyps.4 Payers are motivated to decrease unnecessary costs associated with colonoscopy.

Artificial intelligence (AI) is poised to transform clinical practice through machine learning, a method that overlaps computer science and statistics to create software programs that directly learn from patterns found in data input.5

AI adds value to colorectal cancer screening and surveillance by enhancing the visual abilities of the endoscopist to find precancerous polyps and classify polyps that do not need to be removed. Currently, the state of AI in colonoscopy consists of software running supervised machine learning algorithms to specifically find polyps or to classify them. In the future, we anticipate a fully integrated system that will provide decision support for assessing depth of invasion, and possibly even integrating genotypic and other data to provide personalized risk assessments for colon cancer.

ARTIFICIAL INTELLIGENCE PROMOTES VALUE IN COLORECTAL CANCER SCREENING AND SURVEILLANCE

Value in health care has been defined as improvement in quality or decrease in cost.6 Colonoscopy as a preventive measure to completely remove neoplastic lesions holds value for public health in preventing colorectal cancer.1 However, this is dependent on the quality of the procedure in finding all precancerous lesions, which can be reflected with adenoma detection rates (ADRs).7 One study has suggested that increase of 1% in ADR leads to a 3% decreased risk of interval colorectal cancer.7 AI in colonoscopy can potentially improve the quality of the procedure through computer-aided detection (CADe) to increase the adenoma detection rate.

Costs can be managed by expert judgment to avoid removing low-risk polyps unnecessarily, proposed in the Preservation and Incorporation of Valuable endoscopic Innovations (PIVI) by guidelines from the American Society for Gastrointestinal Endoscopy for optic biopsy of diminutive polyps.8 AI can decrease the costs related to unnecessary polyp removal through computer-aided diagnosis (CADx), as implemented by either a resect-and-discard or diagnose-and-leave strategy.

The technology that has advanced CADe into real-world implementation is deep learning through neural networks, a branch of machine learning that discovers features that are meaningful through large amounts of data. This contrasts with earlier methods, when algorithms were built by a human selecting features thought to be meaningful (recently summarized in a review by Ahmad and colleagues9).

For deep learning-driven CADe alone, there are 3 recent prospective validation studies.1012 (Table 1).

Table 1.

Deep learning computer-aided detection with validation on colonoscopy videos with histologically confirmed polyps

Study Methods Training Dataset Validation Dataset Sensitivity (per Image Frame), % Specificity, % Area Under the Receiver Operator Curve
Wang et al,10 2018 Convoluted Neural Networks 5545 images 138 videos 91.6
Misawa et al,11 2018 Convoluted Neural Networks 73 colonoscopy videos divided into short videos: training (411 videos) Testing (135 videos) 90.0 63.3 0.87
Urban et al,12 2018 Convoluted Neural Networks 8641 images 9 videos 93.0 93.0

For CADx alone, machine learning models have been used in 6 prospective validation studies, which test different modalities: magnifying narrow band imaging (NBI),13,14 endocytoscopy,15 laser-induced fluorescence spectroscopy,16,17 and autofluorescence endoscopy.18 (Table 2).

Table 2.

Machine learning computer-aided diagnosis with validation on captured images of polyps

Study Setting Methods (with # Features) Training Dataset Validation Dataset, no. Polyps Sensitivity, % Specificity, %
Chen et al,14 2018 Magnifying NBI Deep Neural Network 2157 polyps 284 96.3 78.1
Kominami et al,13 2016 Magnifying NBI Support Vector Machines (SVM) (128 features) 2247 polyps at maximal magnifying NBI 118 95.9 93.3
Mori et al,15 2017 Endocytoscopy SVM (312 features) 61,925 endocytoscopic images 466 91.3 – 93.8 88.7–91.0
Rath etal,16 2016 Laser-induced fluorescence spectroscopy WavStat4 – Linear Discriminant Analysis Not reported 137 81.8 85.2
Kuiper et al,17 2015 Laser-induced fluorescence spectroscopy WavStat4 – Linear Discriminant Analysis Not reported 207 85.3 58.8
Aihara et al,18 2013 Autofluorescence endoscopy Unknown 103 polyps 102 94.2 88.9

Both CADe and CADx have been integrated for fully automated polyp detection and immediate characterization in real time in a recent study trained a convolutional neural network using a training set of 223 polyp videos (60,089 frames) and validated the algorithm on 40 videos (106 polyps) with sensitivity 98% and specificity of 83% for identifying adenomas.19

An obvious limitation of the technology is that the algorithm cannot detect polyps that are not in the visual field of the colonoscope and performance may vary based on quality of inspection. Furthermore, other factors such as the speed of withdrawal and bowel preparation can also affect algorithm performance. This of course is not unique to AI, but applies to other devices such as forms of virtual chromoendoscopy.

Finally, a group recently presented (in abstract form) a system that decreases documentation burden by automating the input of quality measures based on image recognition (time of insertion, cecal intubation time, withdrawal time, tool use and intervention via Current Procedural Terminology codes and preparation quality via Boston Bowel Preparation Scale).20 They deployed an approach with 7 separate convoluted neural networks with live feedback during the procedure.

INTEGRATION INTO CLINICAL PROCESSES DIRECTLY AFFECTS ADOPTION OF MACHINE LEARNING PRODUCTS

Clinical integration is key, particularly in the fast-paced world of colonoscopy, and special considerations must be taken to ensure that the product enhances clinical care. As a baseline, machine learning products should provide a fully automate diagnostic workflow during colonoscopy that detects and characterizes polyps in real time. A recent model that does both was tested on 215 colonoscopy videos in which the model detected and tracked individual polyps while also providing an estimate for the histology via optical biopsy.21 In addition, the product should not unnecessarily prolong procedures, overwhelm practitioners with false positives, or become a distraction that adversely affects the ability of practitioners to perform the procedures well. These models function as clinical decision support, and must consider how explainable the model is to providers, the usability of the product during clinical care, how the information is delivered to the providers and how the product increases or decreases the time necessary to provide clinical care.22 The ultimate goal is to enhance the providers’ performance; for colonoscopy, improving efficiency for endoscopists is a priority for quality care.23

Most proposed measures for efficiency include an element of time. This is particularly relevant in high-volume endoscopy centers, where the number of cases that can be performed directly affects the revenue of the center. Algorithms used must therefore seek to decrease the amount of time it takes for colonoscopies to be performed with the same level of quality (eg, with no decrease in the adenoma detection rate).

Successful integration also requires adequate study of the human factors and perception of the product, including robust educational programming and training, ongoing user feedback regarding confidence in the tool, and logistical issues regarding its use. For example, training should not only teach providers how to use the product but also educate providers on how to adjudicate recommendations that may not agree with their clinical experience or impression. An ongoing challenge specific to deep learning is the “black box” nature of the deep learning algorithms, which limit the ability to explain how the algorithm came up with specific recommendations.

MACHINE LEARNING PRODUCTS REQUIRE RIGOROUS VALIDATION

When working with deep learning or other machine learning products, it is essential to ensure that the models are appropriately tested, or validated.24 Because the model learns from the available data, the data used to train the models should never be used to evaluate the model performance. Validation of the model using new data allows for the best measure of the model’s true performance, or generalizability. Ideally this would proceed in a well-designed, adequately powered prospective randomized controlled trial with real-time use of the machine learning models during colonoscopy versus colonoscopy without AI.2527 An alternative strategy proposed is a random tandem randomized controlled trial, which would theoretically be better because each patient would serve as his or her own control group.

The only randomized controlled trial to date for CADe was performed in China, an open, nonblinded trial of diagnostic colonoscopy with or without assistance of a real-time polyp detection system running a convoluted neural network in real time. In the trial, 1058 patients were included with 536 controls and 522 with the AI intervention. The intervention arm demonstrated significantly improved adenoma detection rate (29.1% vs 20.3%, P<.001) and significantly higher withdrawal time (6.9 vs 6.4 minutes, P<.001) that became equivalent after excluding biopsy time (6.18 vs 6.07 minutes, P = .15). They reported a total of 39 false alarms in the intervention arm, which averaged to 0.075 false alarms per colonoscopy.

Despite carefully designed experiments to test model performance, in real-world application deep learning models may be limited by model bias and model interoperability stemming from the limitations of the training data.28 For example, if the training data for a polyp detection model were predominantly in East Asian patients with a specific brand of endoscope, the model may not be perform well in a white-predominant population with a different brand.

In addition, the issues of safety, reliability, and demonstrated improvement from the standard of care are key issues that go beyond validation on new datasets. To reassure providers that the tool is both safe and reliable, parallel processes should be in place to identify and deal with errors.22 This emphasizes the importance of continuous maintenance and monitoring, which has been addressed by the Food and Drug Administration (FDA) in a recent whitepaper detailing the regulatory framework for updating software as medical devices.

MACHINE LEARNING PRODUCTS FACE REGULATORY, LEGAL, AND ETHICAL CHALLENGES

Regulatory rules vary from country to country, but are required before implementation of these algorithms for clinical care. In the United States, the FDA has drafted a framework for streamlined review under the term software as medical devices (SaMD).2931 This is defined by the International Medical Device Regulators Forum as “software intended to be used for one or more medical purposes that perform these purposes without being part of a hardware medical device.” The proposed risk-based regulatory framework for SaMD outlines categories of regulation based on severity of health care condition or situation (critical, serious, and nonserious) and significance of information provided by the software to the health care decision (for treatment or diagnosis, to drive clinical management, or to inform clinical management). This framework is adopted from the International Medical Device Regulators Forum and shared by the European Union Medical Device Regulation and Canadian Draft Guidance Document for Software as a Medical Device.32 The 3 risk classes are as follows: Class I (low risk), Class II (divided into IIA and IIB), and Class III (highest risk) with exemptions for classes I and II from 510(k) requirements either by specific FDA exemption or if they were legally marketed before the enactment of Medical Device Amendments on May 28, 1976. Classes I and II require a 510(k), which is a premarket submission that demonstrates that the device is at least as safe and effective as an existing legally marketed device. For Class III, a premarket approval application to demonstrate safety and effectiveness, which includes clinical trial data with study protocols, safety and effectiveness data, adverse reactions and complications, device failures and replacements, patient information, complaints, all subject data, and all statistical analyses (Table 3).

Table 3.

Software as medical device (SaMD) category risk-based classification

State of Health Care Situation or Condition Significance of Information Provided by SaMD to Health Care Decision
Treat or Diagnose Drive Clinical/Patient Management Inform Clinical/Patient Management
Critical III III I or IIb
Serious II or IIIa II or IIIa I or IIb
Nonserious I or IIb I or IIb I or IIb
a

Class III if an erroneous result could lead to immediate danger.

b

Class II if the software is intended to image or monitor a physiologic process or condition. Class I under Rule 12.

Furthermore, manufacturers must manage patient risks throughout the entire product lifecycle, mainly to ensure safety and effectiveness. The updated strategy includes SaMD Prespecifications, or “what” the algorithm is intended to learn, and an algorithm change protocol, or “how” the algorithm will learn while maintaining safety and efficacy.30 An example for colonoscopy is as follows:

1. Polyp Detection and Classification SaMD for Adenomatous Polyps

Description of SaMD: An AI/ML application intended for average risk patients presenting for screening and surveillance colonoscopy. The images from the colonoscopy video are processed and analyzed to detect polyps in the visual field throughout the colon. When an image is captured, it will be classified to be adenomatous or nonadenomatous with a percentage risk generated to indicate the probability of being adenomatous. This SaMD application will improve polyp detection and help guide decisions for low-risk polyps to be resected and discarded or left in.

SaMD Prespecifications:

  • Modify the algorithm to ensure consistent performance across different levels of bowel preparation

  • Reduce false alarm rates while maintaining sensitivity to polyps

Algorithm Change Protocol:

For these modifications, the algorithm change protocol includes detailed methods for database generation, reference standard labeling, and comparative analysis along the performance requirements, including sensitivity and specificity, and statistical analysis plan. The manufacturer follows good machine learning practices.

In Japan, this has been studied by the Pharmaceuticals and Medical Devices Agency.33 Of note, Japan’s regulatory agency has recently approved the first AI-assisted endocytoscopy product was approved without a prospective randomized controlled trial.

Legal

The framework for legal responsibilities has not been established, and in a sensitive field such as medicine, the possibility of error in health care decisions made in part with the help of algorithms may expose providers to unanticipated liability (eg, for misdiagnosis of a low-risk polyp that becomes interval cancer).24 Modifications to malpractice insurance, informed consent for the use of AI, and contingency plans should be in place to mitigate risk. Unfortunately, due to the absence of real-world use of AI in health care, there is limited precedent to draw on to provide guidance for patients, providers, or health care systems.

Ethics and Data Security

Respect for autonomy

AI applications in colonoscopy require a large volume of images and videos to develop, validate, and maintain performance. Because of the insatiable appetite for data, patient privacy and data security are key issues that should be thoughtfully addressed.

Patient privacy has traditionally been maintained through the system of individual patient consents, which has been argued to be prohibitively restrictive for purposes of AI research. A proposed solution is a “broad consent” policy, in which patients allow multiple secondary uses of their fully anonymized health data.34

Data security covers both maintenance of secure databases and also ensuring secure sharing and transfer across institutions and organizations. Data breaches could undermine patient privacy and the patient-physician relationship. Secure cloud computing solutions have been proposed and developed by commercial entities and electronic health record companies, but data harmonization and interoperability across systems remains a particular challenge.

Nonmaleficence

Deployment of AI applications in colonoscopy could result in mistakes that lead to interval cancers or overutilization by performing unnecessary biopsies. High-risk polyps could be wrongly classified as benign, and polyp detection could either miss a cancerous lesion or result in additional unnecessary biopsies of benign lesions. Furthermore, the phenomenon of “automation bias,” in which providers trust AI decisions even if they are incorrect, may lead to overreliance on the technology and deskilling of the endoscopists.35 Proper education and training of providers on the limitations of the AI technology is critical before widespread deployment. Accountability also should be considered in the event that a mistake is made when using the AI application, possibly titrated to the degree of autonomy accorded to the application.

Beneficence

To ensure that AI applications in colonoscopy act for the best interest of the patient, its recommendations should be taken into clinical context. One suggestion has been a new workflow that may incorporate the use of multidisciplinary meetings to review lesions characterized by the AI system and discuss the best approach for treatment.34

Justice

AI applications in colonoscopy may concentrate the benefits in wealthier systems and have data bias that do not include patients from low-resource settings. The expense of computational infrastructure to store data and deploy AI applications for colonoscopy may lead to a disparity in treatment between wealthy and poorer health systems. In addition, patients in low-resource settings may not have their data incorporated into the AI applications, leading to underrepresentation that may lead to poorer performance even if deployed for their care. To promote widespread uptake, funding of prospective trials should include earmarked funds to provide a basic level of infrastructure for every center. Furthermore, careful validation of the AI applications in multiple settings is important to ensure that the performance is consistent for all patients.

DATA LABELING IS A KEY LIMITATION TO DEVELOPING AND MAINTAINING MACHINE LEARNING PRODUCTS

The greatest limitation for the current models being developed is the need for large volumes of labeled datasets, commonly known as “ground truth.” In the case of colonoscopy, the ground truth is either human annotation that the polyp is present, or pathologic diagnosis of a polyp adjudicated by multiple graders. In particular, deep learning algorithms for imaging require an enormous quantity of data to capture the variety of real-world images.36 These include both static images (still pictures) and video recordings of colonoscopies, considering positive images, and quality of labeling. The labeling quality is considered to be the most important aspect, simply because models area data dependent and operate under a “garbage in, garbage out” framework and require expert endoscopist evaluation.24 Current models used in colonoscopy are “supervised” machine learning models, which requires that all data used to train the models have had “ground truth” or labels attached to each of the datapoints. However, the expense and effort required to label images and video frames is prohibitive, particularly due to the need for specialized expertise.

FUTURE DIRECTIONS

Enhancements or additional algorithms could include automatic polyp sizing, evaluation of polyp borders that could be used to gauge the depth of invasion for large polyps, and clinical decision support to guide method of tumor removal.

Endoscopic tissue resection in particular is an emerging area in which the depth of tumor invasion in the esophagus, stomach and colon may help guide management decisions. Several deep learning–based CADx tools have been developed to detect very low risk lesions for esophageal squamous cell, early gastric, and colorectal cancer with similar criteria that have very low risk for lymph node spread.37 In particular, computer-aided diagnosis platforms can help determine whether squamous cell carcinoma of the esophagus is classified as confined to the submucosa (SM1). SM1 lesions have very low risk for spread to lymph nodes and in the absence of high-risk features should undergo curative endoscopic resection.38 Further work could provide real-time guidance that determines the extent of intervention needed (for example, endoscopic mucosal resection vs endoscopic submucosal dissection vs surgical intervention).

AI also has the potential to adjudicate when endoscopic assessment of diminutive colorectal polyps do not agree with pathologic diagnosis. A recent study provides preliminary evidence that an AI clinical decision support solution; for discordant diagnoses, the clinical decision support services provided additional analysis to suggest that endoscopic diagnoses were correct (90.3% of lesions).39 The role for AI in clinical decision support to adjudicate discordant endoscopic and pathologic diagnosis could help future planning for surveillance colonoscopies.

The potential for misclassification rate for deep learning models in CADe exists with a limited understanding of the neural network architecture. Increased interpretability of neural architectures could be achieved through unsupervised machine learning techniques that examine the presence of important or redundant images. By understanding how the neural network classifies polyps, a misclassified polyp image could be compared to see if there is a deficiency in the dataset or architecture that may have accounted for the error. Another limitation to deep learning–based model is the bottleneck of labeled images. Emerging work in unsupervised machine learning techniques to create autoencoders that could be used to generate synthetic datasets for training deep learning models may augment existing datasets to improve model performance.

In the future, imaging data could be integrated in context with other biological and electronic health record data to create a more nuanced and personalized risk assessment. For patients at particular risk (eg, patients with inflammatory bowel disease), the integration of other clinical risk scores could be used to modify the threshold accordingly for polyp detection.

KEY POINTS.

  • Artificial intelligence can improve colonoscopy quality by detecting adenomatous polyps and decrease associated costs by classifying low-risk polyps that do not need removal.

  • An integrated system with artificial intelligence can improve the provider experience by decreasing documentation burden through automating data entry for quality measures.

  • Important limitations of the technology must be recognized, including the dataset features for algorithm development and integration into endoscopist workflow.

  • Future directions include automatic assessment of polyp size and borders, histologic staging and personalized risk stratification.

DISCLOSURE

D.L. Shung: Grant Support through the National Institutes of Health T32 DK007017; M.F. Byrne: CEO and shareholder, Satisfai Health; founder of AI4GI joint venture. Co-development agreement between Olympus America and AI4GI in artificial intelligence and colorectal polyps.

REFERENCES

  • 1.Zauber AG, Winawer SJ, O’Brien MJ, et al. Colonoscopic polypectomy and long-term prevention of colorectal-cancer deaths. N Engl J Med 2012;366(8):687–96. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Winawer SJ, Zauber AG, Ho MN, et al. Prevention of colorectal cancer by colonoscopic polypectomy. The National Polyp Study Workgroup. N Engl J Med 1993;329(27):1977–81. [DOI] [PubMed] [Google Scholar]
  • 3.Ambulatory surgery centers database. 2018. Available at: https://www.definitivehc.com/product/our-databases/ambulatory-surgery-centers. Accessed: November 20, 2019.
  • 4.Naik AD, Hinojosa-Lindsey M, Arney J, et al. Choosing Wisely and the perceived drivers of endoscopy use. Clin Gastroenterol Hepatol 2013;11(7):753–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Abu-Mostafa YS. Learning from data: a short course 2012. Available at: http://amlbook.com/. Accesses November 20, 2019.
  • 6.Porter ME, Lee TH. From volume to value in health care: the work begins. JAMA 2016;316(10):1047–8. [DOI] [PubMed] [Google Scholar]
  • 7.Corley DA, Jensen CD, Marks AR, et al. Adenoma detection rate and risk of colorectal cancer and death. N Engl J Med 2014;370(14):1298–306. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Rex DK, Kahi C, O’Brien M, et al. The American Society for Gastrointestinal Endoscopy PIVI (Preservation and Incorporation of Valuable Endoscopic Innovations) on real-time endoscopic assessment of the histology of diminutive colorectal polyps. Gastrointest Endosc 2011;73(3):419–22. [DOI] [PubMed] [Google Scholar]
  • 9.Ahmad OF, Soares AS, Mazomenos E, et al. Artificial intelligence and computer-aided diagnosis in colonoscopy: current evidence and future directions. Lancet Gastroenterol Hepatol 2019;4(1):71–80. [DOI] [PubMed] [Google Scholar]
  • 10.Wang P, Xiao X, Glissen Brown JR, et al. Development and validation of a deep-learning algorithm for the detection of polyps during colonoscopy. Nat Biomed Eng 2018;2(10):741–8. [DOI] [PubMed] [Google Scholar]
  • 11.Misawa M, Kudo SE, Mori Y, et al. Artificial intelligence-assisted polyp detection for colonoscopy: initial experience. Gastroenterology 2018;154(8):2027–9.e3. [DOI] [PubMed] [Google Scholar]
  • 12.Urban G, Tripathi P, Alkayali T, et al. Deep learning localizes and identifies polyps in real time with 96% accuracy in screening colonoscopy. Gastroenterology 2018;155(4):1069–78.e8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Kominami Y, Yoshida S, Tanaka S, et al. Computer-aided diagnosis of colorectal polyp histology by using a real-time image recognition system and narrow-band imaging magnifying colonoscopy. Gastrointest Endosc 2016;83(3):643–9. [DOI] [PubMed] [Google Scholar]
  • 14.Chen PJ, Lin MC, Lai MJ, et al. Accurate classification of diminutive colorectal polyps using computer-aided analysis. Gastroenterology 2018;154(3):568–75. [DOI] [PubMed] [Google Scholar]
  • 15.Mori Y, Kudo SE, Berzin TM, et al. Computer-aided diagnosis for colonoscopy. Endoscopy 2017;49(8):813–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Rath T, Tontini GE, Vieth M, et al. In vivo real-time assessment of colorectal polyp histology using an optical biopsy forceps system based on laser-induced fluorescence spectroscopy. Endoscopy 2016;48(6):557–62. [DOI] [PubMed] [Google Scholar]
  • 17.Kuiper T, Alderlieste YA, Tytgat KM, et al. Automatic optical diagnosis of small colorectal lesions by laser-induced autofluorescence. Endoscopy 2015;47(1):56–62. [DOI] [PubMed] [Google Scholar]
  • 18.Aihara H, Saito S, Inomata H, et al. Computer-aided diagnosis of neoplastic colorectal lesions using ‘real-time’ numerical color analysis during autofluorescence endoscopy. Eur J Gastroenterol Hepatol 2013;25(4):488–94. [DOI] [PubMed] [Google Scholar]
  • 19.Byrne MF, Chapados N, Soudan F, et al. Real-time differentiation of adenomatous and hyperplastic diminutive colorectal polyps during analysis of unaltered videos of standard colonoscopy using a deep learning model. Gut 2019;68(1):94–100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Karnes W, Requa J, Dao T, et al. Automated documentation of multiple colonoscopy quality measures in real-time with convolutional neural networks: 2761. Am J Gastroenterol 2018;113:S1532. [Google Scholar]
  • 21.Guizard N, Ghalehjegh SH, Henkel M, et al. 256 – Artificial intelligence for real-time multiple polyp detection with identification, tracking, and optical biopsy during colonoscopy. Gastroenterology 2019;156(6). S-48–S-49. [Google Scholar]
  • 22.Shortliffe EH, Sepulveda MJ. Clinical decision support in the era of artificial intelligence. JAMA 2018;320(21):2199–200. [DOI] [PubMed] [Google Scholar]
  • 23.Gellad ZF, Thompson CP, Taheri J. Endoscopy unit efficiency: quality redefined. Clin Gastroenterol Hepatol 2013;11(9):1046–9.e1. [DOI] [PubMed] [Google Scholar]
  • 24.Vinsard DG, Mori Y, Misawa M, et al. Quality assurance of computer-aided detection and diagnosis in colonoscopy. Gastrointest Endosc 2019;90(1):55–63. [DOI] [PubMed] [Google Scholar]
  • 25.Steyerberg EW, Vickers AJ, Cook NR, et al. Assessing the performance of prediction models: a framework for traditional and novel measures. Epidemiology 2010;21(1):128–38. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Justice AC, Covinsky KE, Berlin JA. Assessing the generalizability of prognostic information. Ann Intern Med 1999;130(6):515–24. [DOI] [PubMed] [Google Scholar]
  • 27.Leggett CL, Wang KK. Computer-aided diagnosis in GI endoscopy: looking into the future. Gastrointest Endosc 2016;84(5):842–4. [DOI] [PubMed] [Google Scholar]
  • 28.Wang F, Casalino LP, Khullar D. Deep learning in medicine-promise, progress, and challenges. JAMA Intern Med 2019;179(3):293–4. [DOI] [PubMed] [Google Scholar]
  • 29.Food and Drug Administration. Proposed Regulatory Framework for Modifications to Artificial Intelligence/Machine Learning (AI/ML)-Based Software as a Medical Device (SaMD). Available at: https://www.fda.gov/medical-devices/software-medical-device-samd/artificial-intelligence-and-machine-learning-software-medical-device. Accessed: October 27, 2019.
  • 30.Food and Drug Administration. Proposed Regulatory Framework for Modifications to Artificial Intelligence/Machine Learning (AI/ML)-Based Software as a Medical Device (SaMD). Available at: https://www.fda.gov/medical-devices/software-medical-device-samd/artificial-intelligence-and-machine-learning-software-medical-device. Accessed: October 27, 2019.
  • 31.Food and Drug Administration. Clinical Decision Support Software Draft Guidance for Industry and Food and Drug Administration Staff. Available at: https://www.fda.gov/regulatory-information/search-fda-guidance-documents/clinical-decision-support-software. Accessed: November 10, 2019.
  • 32.Canada H Draft Guidance Document - Software as a Medical Device (SaMD). In: Canada H, ed2019.
  • 33.Chinzei K, Shimizu A, Mori K, et al. Regulatory science on AI-based medical devices and systems. Advanced Biomedical Engineering 2018;7:118–23. [Google Scholar]
  • 34.Ahmad OF, Stoyanov D, Lovat LB. Barriers and pitfalls for artificial intelligence in gastroenterology: ethical and regulatory issues. Tech Gastrointest Endosc 2019;150636. [Google Scholar]
  • 35.Lyell D, Coiera E. Automation bias and verification complexity: a systematic review. J Am Med Inform Assoc 2016;24(2):423–31. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Beam AL, Kohane IS. Big data and machine learning in health care. JAMA 2018;319(13):1317–8. [DOI] [PubMed] [Google Scholar]
  • 37.Shahidi N, Bourke MJ. Can artificial intelligence accurately diagnose endoscopically curable gastrointestinal cancers? Tech Gastrointest Endosc 2019;150639. [Google Scholar]
  • 38.Pimentel-Nunes P, Dinis-Ribeiro M, Ponchon T, et al. Endoscopic submucosal dissection: European Society of Gastrointestinal Endoscopy (ESGE) guideline. Endoscopy 2015;47(9):829–54. [DOI] [PubMed] [Google Scholar]
  • 39.Shahidi NC, Rex DK, Kaltenbach T, et al. Use of endoscopic impression, artificial intelligence, and pathologist interpretation to resolve discrepancies from endoscopy and pathology analyses of diminutive colorectal polyps. Gastroenterology 2020;158(3):783–5.e1. [DOI] [PubMed] [Google Scholar]

RESOURCES