Dexterous Identification of Carcinoma through ColoRectalCADx with Dichotomous Fusion CNN and UNet Semantic Segmentation

Akella S Narasimha Raju; Kayalvizhi Jayavel; Thulasi Rajalakshmi

doi:10.1155/2022/4325412

. 2022 Oct 10;2022:4325412. doi: 10.1155/2022/4325412

Dexterous Identification of Carcinoma through ColoRectalCADx with Dichotomous Fusion CNN and UNet Semantic Segmentation

Akella S Narasimha Raju ^1,^✉, Kayalvizhi Jayavel ¹, Thulasi Rajalakshmi ²

PMCID: PMC9576362 PMID: 36262620

Abstract

Human colorectal disorders in the digestive tract are recognized by reference colonoscopy. The current system recognizes cancer through a three-stage system that utilizes two sets of colonoscopy data. However, identifying polyps by visualization has not been addressed. The proposed system is a five-stage system called ColoRectalCADx, which provides three publicly accessible datasets as input data for cancer detection. The three main datasets are CVC Clinic DB, Kvasir2, and Hyper Kvasir. After the image preprocessing stages, system experiments were performed with the seven prominent convolutional neural networks (CNNs) (end-to-end) and nine fusion CNN models to extract the spatial features. Afterwards, the end-to-end CNN and fusion features are executed. These features are derived from Discrete Wavelet Transform (DWT) and Vector Support Machine (SVM) classification, that was used to retrieve time and spatial frequency features. Experimentally, the results were obtained for five stages. For each of the three datasets, from stage 1 to stage 3, end-to-end CNN, DenseNet-201 obtained the best testing accuracy (98%, 87%, 84%), ((98%, 97%), (87%, 87%), (84%, 84%)), ((99.03%, 99%), (88.45%, 88%), (83.61%, 84%)). For each of the three datasets, from stage 2, CNN DaRD-22 fusion obtained the optimal test accuracy ((93%, 97%) (82%, 84%), (69%, 57%)). And for stage 4, ADaRDEV²-22 fusion achieved the best test accuracy ((95.73%, 94%), (81.20%, 81%), (72.56%, 58%)). For the input image segmentation datasets CVC Clinc-Seg, KvasirSeg, and Hyper Kvasir, malignant polyps were identified with the UNet CNN model. Here, the loss score datasets (CVC clinic DB was 0.7842, Kvasir2 was 0.6977, and Hyper Kvasir was 0.6910) were obtained.

1. Introduction

Health is of utmost importance for mankind. Good health practices are essential for the survival of the human race. However, owing to environmental pollution and personal habits of human beings, their health is adversely affected [1]. According to statistics from various Indian health organizations, 22% of patients seek medical treatment at least thrice a year for related health problems. Carcinomas are ranked as the most important health disorder and a hazardous evil [2].

The carcinoma affects the inner cell of the human body and develops exponentially, damaging the entire affected organ. Progressive growth damage to the human body can lead to life-threatening situations. There are many types of carcinomas that affect organs [3]. Cancers that are hazardous to human organs include breast cancer, prostate cancer, cancer of Basel cells, skin cancer (melanoma), lung cancer, leukemia, lymphoma, and colon cancer. This dangerous affliction invades organ tissues and causes potentially fatal conditions [4]. Furthermore, blood cells are ineffective in defending against this malady and cannot protect organs from damage.

Colorectal carcinoma is the most hazardous and irritable disease of the gastrointestinal tract. This condition can have devastating effects on a person's daily routine. This disease affects food digestion and creates severe gastric problems, which then create critical bowel problems. This cancer is caused by human eating and consumption patterns. The disease is associated with the brain as too many psychological thoughts can also affect the human colon. Seven lakh people are diagnosed annually with colorectal cancer, and the morality rate is approximately 525,000 worldwide [5]. Those suffering from severe problems, such as gastric issues, aged 50 years and over must get themselves tested via colonoscopy screening technology.

Colonoscopy is the most advanced screening technology and is 95% acceptable. This test method, with a number of video graphs and frames captured, examines the entire large intestine, which is approximately five feet long and three inches wide. In the videos and images captured using this technique, every inch is scanned for disease diagnosis [6]. This screening procedure is performed by gastroenterologists, and videos are obtained and photographic images are analyzed by radiologists. The average acquisition time is between 30 minutes and one hour. Early detection and rapid diagnosis of these medical conditions provide the appropriate solutions for treatment [7–10].

The videos and the photographic images obtained thus are presented as datasets. Researchers conducting colorectal cancer research organize data and store them on a website for public access. The computer-aided diagnosis system (CADx) [11, 12] diagnoses health problems using computers with technologies such as artificial intelligence (AI). Deep learning (DL) is a subset of AI technology that is key in CADx systems, with which every image is skillfully considered and the image characteristics are extracted for further experimentation. Publicly available colonoscopy datasets such as CVC Clinic DB, Kvasir2, and Hyper Kvasir provide inputs for CADx [13, 14]. CADx works with the appropriate deep learning technologies (DLTs). The system referenced in this article is ColoRectalCADx.

The ColoRectalCADx system works entirely with DL that can recognize carcinoma using an innovative architecture. For medical colonoscopy motion images, we use CNN as a suitable network [15]. CNN works well as a classifier and feature extractor. This article demonstrates how the key elements of the system elaborately interact with CNN [16, 17].

The main objective of this research study is as follows:

Design the colorectal cancer recognition system through a five-stage system.
The early stage of detection is the classification of images through a colonoscopy using seven different CNNs.
The next phase of the system presents nine fusion models of the CNN and obtains the most accurate model for cancer classification using three datasets.
In addition, end-to-end CNN and fusion models represent transfer learning with discrete transform wavelet (DWT) and support vector machine classification (SVM). This classification determines the most appropriate model for cancer recognition.
To accurately identify a cancer polyp for malignant recognition, a visualization technique such as semantic segmentation using the UNet CNN model is used in a later stage.

1.1. Organization of the Study

The rest of the study is organized as follows. Section 2 provides an overall literature review and Section 3 provides the materials and explains the methods. Section 4 discusses the results and Section 5 concludes the study.

2. Related Work

Previously, colorectal cancer was identified using different approaches. In these approaches, an architecture had to be developed to build a system. This system considered various elements of the structure. For every structure, the documentation related to the concept must be examined. Different articles are reviewed on the different blocks with their merits and demerits. The optimal approach is coming up with an appropriate article to identify colorectal cancer. Table 1 represents the literature related to the addressed concept.

Table 1.

Literature survey.

Author	Year	Advantages	Limitations
Souaidi and Ansari [18]	2022	(i) To detect abnormalities in the polyp region of WCE and colonoscopy localization and visualization proposal.	(i) There is no discussion of the numerous CNN models.
		(ii) Here a detector for deep polyps, such as MP-FSSD, is suggested.	(ii) Only CNN models like VGG-16 are used.
		(iii) In this polyp detection work, VGG-16 backbones are used.	(iii) Only WCE and CVC clinic DB dataset are reviewed for polyp recognition.
Nisha and Palanisamy [19]	2022	(i) We automatically detect colorectal polyps with image enhancement.	(i) This method is only effective for a limited number of CNN models.
		(ii) The proposed work is the dual CNN path for classifying polyps and nonpolyps' patches in colonoscopy images.	(ii) This method is not discussed except with a CNN model.
		(iii) To enhance the image, the dual-path CNN and sigmoid classifier is used to efficiently detect polyps.	(iii) Only two sets of colonoscopy image data were proposed, such as CVC clinic DB and ETIS-Larib datasets.
		(iv) The proposed method is promising, and detects with accuracy of 99.60% and 90.81% with CVC clinic DB and ETIS-Larib datasets, respectively.	(iv) Images in the datasets are enhanced owing to which the accuracy of polyp detection will reduce.
			(v) The number of images is increased or live image datasets are used, and the suggested method for its operation is not addressed.
Guo et al. [20]	2022	(i) The two major challenges for the segmentation of colonoscopy image polyps are blurred boundaries and a close resemblance between the polyps and surrounding tissue.	(i) Here, five datasets are tested with a new UnX methodology, so it takes a long time to obtain the results.
		(ii) This system proposed a new transformer-based encounter network known as the uncertainty eXploration (UnX).	(ii) The precision levels of the results are good, but obtaining results is time-consuming.
		(iii) With this method, the system identified the uncertainty areas of polyps.	(iii) The complexity of the system is increased while comprehension of the system is much more tedious to a layman.
		(iv) This removes the uncertain elements of the images and emphatically recognizes the level of precision of malignant polyps.	(iv) There are inconsistent color distributions in the colonoscopy image system that displays poor results.
Yeung et al. [21]	2021	(i) The concept here is the segmentation of polyps and the identification of malignant polyps.	(i) With the five datasets, each image segmentation entails considerable time to obtain the results.
		(ii) The proposed method is CNN based on double attention for segmenting polyps using Focus-UNet.	(ii) Visualization quality may be good for certain datasets.
		(iii) This system combines efficient attention based on the spatial channel into a single focus gate selective deep learning of polyp characteristics.	(iii) The proposed focus-UNet system should have been upgraded to a lightweight design.
		(iv) Here for experimentation with the proposed methodology, inputs are provided using five colonoscopy datasets.	(iv) It is a complicated system.
		(v) The obtained results, such as the dice similarity coefficient, are 0.941 and 0910.
Attallah and Sharkas [22]	2021	(i) Proposed a system called Gastro-CADx to classify several gastrointestinal diseases using deep learning approaches.	(i) Two datasets named dataset I and II, which are Kvasir and Hyper Kvasir, are used to assess the performance of Gastro-CADx.
		(ii) There are three phases to this system. These four different CNNs are used as feature extractors to extract spatial functionality.	(ii) However, this system has not been used on the numerous datasets.
		(iii) The properties extracted in the first stage are applied to the discrete wave transform (DWT) and the discrete cosine transform (DCT), which are used to extract temporal-frequency and spatial-frequency features.	(iii) The system is not even under discussion for the semantic segmentation concept for locating and identifying malignant polyps.
Jha et al. [23]	2021	(i) The design is the detection, localization, and segmentation of polyps in real-time.	(i) The system uses more than just a single dataset for experimentation and recognizing malignant polyps.
		(ii) This work calls for deep learning in technology.	(ii) The system provides moderate results (not highly accurate).
		(iii) The proposed solution to retrieve polyps from colonoscopy images developed ColonSegNet, which is a decoder-encoder architecture.	(iii) Architecture is complex to comprehend for laymen.
		(iv) detection, location, and segmentation are evaluated using standard computer vision measures.
		(v) The system has a high processing rate of 182.38 frames per second.
Ahraf et al. [24]	2020	(i) Suggested automated classification as a new technique for illustrating medical images using deep learning technology.	(i) Vast data of colonoscopy images are classified with different convolutional neural networks and the results are achieved differently.
		(ii) It helps to categorize the diverse medical images of several organs of the body.	(ii) The notions of interest are not addressed here and this has to be comprehensively addressed.
		(iii) It contains a summary of data and other health image classification applications, which support radiologists' efforts to improve diagnosis.
Poudel et al. [25]	2020	(i) Provides a good architecture for classifying endoscopic images using an expanded efficient convolutional neural network.	(i) However, colorectal disorders are classified using convolutional neural networks.
		(ii) Proposed an architecture to classify endoscopic images using an effective convolutional neural network (CNN).	(ii) However, algorithms integrated with the various algorithms are compared with certain parameters.
		(iii) This is a highly accessible domain of assessing deeper layers by accumulating and reducing the expansion factor of spatial elements.	(iii) The results obtained are regarded as the most accurate and best algorithm for the identification of colorectal cancer (CRC).
		(iv) The investigator compares and evaluates the methodology using a variety of parameters.
Zhou and Gao [26]	2019	(i) Here we discuss how CNN technologies enable intelligent recognition of medical motion images.	(i) However, there are no discussions on obtaining colorectal medical images from the colonoscopy screening images.
		(ii) Now large-scale intelligent recognition of medical motion images is assisted by CNN algorithms.	(ii) There is no explanation of the procedure to retrieve and categorize and then convert to results based on their image characteristics.
		(iii) Here, the features of the dense trajectory are initially learned followed by the features of depth, and the dense path functions are merged into the DL methods.	(iii) The techniques involved are time-consuming and require extensive computer statistics.
		(iv) Finally, extreme learning is functional in CNN where the descriptions of the bottom layer to the top layer are determined for medical image recognition
Yang et al. [27]	2019	(i) Proposed a health-based device for categorizing and segmenting CT images for lung disease and hemorrhagic stroke, termed HTSCS for Health Images.	(i) This technique provides an advanced method of categorization and segmentation using art.
Yang et al. [27]	2019	(ii) Internet Health of-Things (IHoT) uses transferable model learning, based on deep learning concepts with traditional methodologies for the best precision for medical image classification and segmentation	(ii) This Internet of medical Things has worked with various IoT devices with the connection of computed tomography devices.

Fusion model	Suggested name
AlexNet + DarkNet-19 + ResNet-50v2 + DenseNet-201 + Efficientnet-B7 + VGG-16 + VGG-19	ADaRDEV²-22
ResNet-50v2 + DensNet-201 + EfficientNet-B7 + VGG-16 + VGG19	RDEV²-22
AlexNet + DarkNet-19 + DenseNet-201 + ResNet-50V2	ADaDR-22
AlexNet + DarkNet-19 + ResNet-50V2	ADaR-22
DarkNet-19 + ResNet-50V2 + DenseNet-201	DaRD-22
AlexNet + DarkNet-19	ADa-22
ResNet-50V2 + DenseNet-20	RD-22
AlexNet + DenseNet-201	AD-22
DarkNet-19 + ResNet-50V2	DaR-22

CNN architecture models		Introduced year	Total params	Trainable params	Nontrainable params	Layers
AlexNet [47, 48]		2012	2,81,02,775	2,80,81,639	21,136	23
DarkNet-19 [49]		2017	1,60,45,847	1,60,32,983	12,864	19
ResNet-50v2 [50, 51]		2016	2,59,33,975	23,69,175	2,35,64,800	50
DenseNet-201 [52, 53]		2018	1,94,29,463	11,07,479	1,83,21,984	201
Efficientnet-B7 [54]		2019	6,55,73,799	14,76,119	6,40,97,680	813
VGG-16 [55]		2014	1,53,14,391	5,99,703	1,47,14,688	16
VGG-19 [56]		2014	2,06,24,087	5,99,703	2,00,24,384	13
Proposed fusion models	ADaRDEV²-22	2022	19,10,28,063	7,02,94,911	12,07,33,152
	RDEV²-22		14,68,78,383	2,61,79,231	12,06,99,152
	ADaDR-22		8,94,87,664	4,75,66,880	4,19,20,784
	ADaR-22		2,59,33,000	23,68,200	2,35,64,800
	DaRD-22		6,13,99,840	1,95,00,192	4,18,99,648
	ADa-22		4,41,26,048	4,40,92,048	34,000
	RD-22		4,53,61,624	34,74,840	4,18,86,784
	AD-22		4,75,16,384	2,91,73,264	1,83,43,120
	DaR-22		4,19,67,694	1,83,90,030	2,35,77,664

System	Precision tower T5810
Company	Dell
Processor	Intel® Xeon® CPU core i7 E5-2630
Speed	2.20 GHz
RAM	32 GB
GPU	GPU NVIDIA Xp.
Software environment	Google Colab Pro with python 3.7.12
Software Python packages	Keras and TensorFlow 2.7.0

Datasets	Training set	Validation set	Test sets	Total images
CVC-clinic DB [32]	900	102	516	1518
Kvasir2 [33, 34].	5120	480	2400	8000
Hyper Kvasir Labeled [35, 36].	7470	634	2577	10681

Dataset	Epochs	Batch sizes	Learning rate	Optimizer	Momentum	Dropout
CVC clinic DB	10	16	0.0001	sgd	0.9	0.5
Kvasir2	10	64	0.0001	sgd	0.9	0.5
Hyper Kvasir labeled	10	64	0.0001	sgd	0.9	0.5

End-to-end CNNs	Accuracy in %
AlexNet	73.00
DarkNet-19	68.00
ResNet-50v2	89.00
DenseNet-201	98.00
Efficientnet-B7	91.00
VGG-16	83.00
VGG-19	86.00

End-to-end CNNs	Accuracy in %
AlexNet	74.00
DarkNet-19	32.00
ResNet-50v2	83.00
DenseNet-201	87.00
Efficientnet-B7	77.00
VGG-16	74.00
VGG-19	67.00

End-to-end CNNs	Accuracy in %
AlexNet	71.00
DarkNet-19	43.00
ResNet-50v2	78.00
DenseNet-201	84.00
Efficientnet-B7	75.00
VGG-16	75.00
VGG-19	65.00

	Accuracy in % (training)	Accuracy in % (testing)	SVM in % (training)	SVM in % (testing)	AUC in %
End-to-end CNNs
AlexNet	74.64	73.00	57.83	59.00	73.40
DarkNet-19	81.61	68.00	78.65	78.00	68.38
ResNet-50v2	88.61	89.00	92.53	89.00	89.18
DenseNet-201	97.78	98.00	95.64	97.00	98.06
Efficientnet-B7	83.36	91.00	73.40	84.00	90.52
VGG-16	82.38	83.00	85.59	88.00	82.73
VGG-19	80.87	86.00	80.60	84.00	85.88

Fusion CNNs
ADaRDEV²-22	94.9	95.0	95.0	97.0	94.56
RDEV²-22	90.1	95.0	69.2	95.0	94.97
ADaDR-22	92.5	93.0	95.3	97.0	93.22
ADaR-22	77.1	79.0	79.9	79.0	78.64
DaRD-22	88.8	92.0	89.4	92.0	91.65
ADa-22	77.4	82.0	75.9	64.0	82.18
RD-22	96.1	97.0	94.2	96.0	96.51
AD-22	50.1	50.0	68.7	54.0	50.00
DaR-22	81.9	85.0	86.2	78.0	85.46

Classes	High performed CNN models
	DenseNet-201	ADaDR-22	Support
	Precision	Precision	Support
Nonpolyps	0.97	0.91	257
Polyps	1	0.96	259

	Accuracy in % (training)	Accuracy in % (testing)	SVM in % (training)	SVM in % (testing)	AUC in %
End-to-end CNNs
AlexNet	71.54	74.00	36.02	31.00	96.89
DarkNet-19	73.00	32.00	77.11	43.00	87.58
ResNet-50v2	67.95	83.00	62.52	84.00	98.03
DenseNet-201	82.20	87.00	78.89	87.00	98.95
Efficientnet-B7	62.14	77.00	54.16	68.00	97.16
VGG-16	54.29	74.00	69.66	77.00	96.79
VGG-19	51.04	67.00	62.52	72.00	95.68

Fusion CNNs
ADaRDEV²-22	82.04	83.00	80.46	85.00	98.52
RDEV²-22	68.04	76.00	74.75	81.00	97.32
ADaDR-22	80.79	74.00	76.84	80.00	97.31
ADaR-22	71.54	68.00	69.29	75.00	93.82
DaRD-22	81.64	82.00	78.39	84.00	97.91
ADa-22	57.79	60.00	53.92	48.00	93.52
RD-22	64.95	66.00	67.70	75.00	94.75
AD-22	66.23	50.00	63.75	64.00	87.21
DaR-22	62.54	70.00	65.11	60.00	94.95

Classes	High performed CNN model
	DenseNet-201	DaRD-22	Support
	Precision	Precision	Support
Dyed-lifted-polyps	0.78	0.81	300
Dyed-resection-margins	0.87	0.82	300
Esophagitis	0.79	0.73	300
Normal-cecum	0.97	0.92	300
Normal-pylorus	0.91	0.95	300
Normal-z-line	0.78	0.72	300
Polyps	0.9	0.76	300
Ulcerative-colitis	0.93	0.92	300

	Accuracy in % (training)	Accuracy in % (testing)	SVM in % (training)	SVM in % (testing)	AUC in %
End-to-end CNNs
AlexNet	71.74	71.00	10.75	11.00	96.70
DarkNet-19	75.56	43.00	77.93	60.00	87.05
ResNet-50v2	61.30	78.00	71.06	78.00	94.81
DenseNet-201	77.12	84.00	75.94	84.00	94.48
Efficientnet-B7	58.29	75.00	53.76	70.00	94.05
VGG-16	54.48	68.00	70.41	75.00	93.73
VGG-19	49.97	65.00	63.29	70.00	91.60

Fusion CNNs
ADaRDEV²-22	69.2	68.0	67.8	63.0	83.0
RDEV²-22	56.8	60.0	64.8	69.0	80.8
ADaDR-22	55.6	51.0	59.1	62.0	80.1
ADaR-22	61.8	55.0	62.5	63.0	80.2
DaRD-22	69.4	69.0	64.6	57.0	80.7
ADa-22	57.5	36.0	48.3	36.0	75.5
RD-22	54.6	57.0	56.1	61.0	78.2
AD-22	63.8	62.0	62.5	65.0	77.4
DaR-22	62.7	60.0	57.5	54.0	82.1

Classes	High performed CNN models
	DenseNet-201	DaRD-22	Support
	Precision	Precision	Support
Barretts	0	0	13
Barretts-short-segment	0	0	16
Bbps-0-1	0.93	0.87	194
Bbps-2-3	0.96	0.87	345
Cecum	0.9	0.51	303
Dyed-lifted-polyps	0.81	0.21	301
Dyed-resection-margins	0.81	0.48	297
Esophagitis-a	0.46	0	121
Esophagitis-b-d	0.63	0	78
Hemorrhoids	0	0	6
Ileum	0	0	3
Impacted-stool	0.85	0	40
Polyps	0.82	0.93	309
Pylorus	0.94	0.91	300
Retroflex-rectum	0.89	0.86	117
Retroflex-stomach	0.99	0.99	230
Ulcerative-colitis-grade-0-1	0	0	11
Ulcerative-colitis-grade-1	0.5	0	61
Ulcerative-colitis-grade-1-2	0	0	4
Ulcerative-colitis-grade-2	0.54	0.69	133
Ulcerative-colitis-grade-2-3	0	0	9
Ulcerative-colitis-grade-3	1	0	40
z-line	0.69	0.53	280

DWT
End-to-end CNN	Accuracy in % (training)	Accuracy in % (testing)	SVM in % (training)	SVM in % (testing)	AUC in %
AlexNet	77.67	52.13	70.64	68.00	51.94
DarkNet-19	85.23	86.04	83.27	77.00	86.05
ResNet-50v2	90.39	96.13	90.48	95.00	96.32
DenseNet-201	97.33	99.03	95.37	99.00	99.03
Efficientnet-B7	84.34	90.69	71.80	71.00	90.68
VGG-16	80.69	82.36	84.88	86.00	82.33
VGG-19	78.65	86.43	80.78	88.00	86.44

DWT
Fusion CNNs	Accuracy in % (training)	Accuracy in % (testing)	SVM in % (training)	SVM in % (testing)	AUC in %
ADaRDEV²-22	94.84	95.73	92.62	94.00	95.73
RDEV²-22	97.24	98.00	93.42	95.00	98.06
ADaDR-22	93.77	95.54	91.99	93.00	95.54
ADaR-22	84.16	89.53	89.32	93.00	89.52
DaRD-22	95.46	96.70	93.86	96.00	96.70
ADa-22	83.10	88.37	70.37	67.00	88.38
RD-22	97.24	98.44	94.48	96.00	98.45
AD-22	90.57	95.93	92.88	96.00	95.92
DaR-22	88.43	93.60	91.28	93.00	93.59

Dataset	Author	Method	Accuracy in %
CVC clinic DB	Attallah and Sharkas [22]	GastroCADx	—
	Liew et al. [71]	Ensemble classifier (ResNet50+Adaboost)	97.91
	Sharma et al. [72]	Ensemble classifier	98.3
	Nisha and Palanisamy [19]	DP-CNN	99.60
	Souaidi and Ansari [18]	MP-FSSD	91.56
	Ours	ColoRectalCADx (proposed)	99.00

Kvasir2	Attallah and Sharkas [22]	GastroCADx	97.3
	Sharma et al. (2022) [72]	Ensemble classifier	97
	Ours	ColoRectalCADx (proposed)	88.00

Hyper Kvasir	Attallah and Sharkas [22]	GastroCADx	99.7
Hyper Kvasir	Ours	ColoRectalCADx (proposed)	84.00

Dataset	Epochs	Learning rate	Batch size	Train loss	Test loss	Total time taken for model (s)
CVC clinic DB	40	0.001	64	0.2998	0.7842	862.61
Kvasir2	40	0.001	64	0.412	0.6977	1477.32
Hyper Kvasir	40	0.001	64	0.4005	0.691	1374.04

PERMALINK

Dexterous Identification of Carcinoma through ColoRectalCADx with Dichotomous Fusion CNN and UNet Semantic Segmentation

Akella S Narasimha Raju

Kayalvizhi Jayavel

Thulasi Rajalakshmi

Abstract

1. Introduction

1.1. Organization of the Study

2. Related Work

Table 1.

3. Materials and Methods

Figure 1.

Figure 2.

3.1. Colonoscopy

3.2. Dataset

Figure 3.

Figure 4.

Figure 5.

3.3. Convolutional Neural Networks

Figure 6.

Table 2.

Table 3.

3.4. Discrete Wavelet Transform (DWT)

Figure 7.

3.5. Support Vector Machines (SVMs)

3.6. Semantic Segmentation

Figure 8.

Table 4.

Table 5.

Table 6.

4. Results

4.1. Stage 1: Experimentation of End-to-End CNNs

Table 7.

Table 8.

Table 9.

Figure 9.

Figure 10.

Figure 11.

4.2. Stage 2: Experimentation of End-to-End CNNs and Fusion of CNNs with SVM

Table 10.

Table 11.

Table 12.

Table 13.

Table 14.

Table 15.

Figure 12.

Figure 13.

Figure 14.

Figure 15.

Figure 16.

Figure 17.

4.3. Stage 3: Experimentation of End-to-End CNN + DWT + SVM

Table 16.

Table 17.

Table 18.

Figure 18.

Figure 19.

Figure 20.

4.4. Stage 4: Experimentation of Fusion CNNs + DWT + SVM

Table 19.

Table 20.

Table 21.

Figure 21.

Figure 22.

Figure 23.

Table 22.

Figure 24.

Figure 25.

4.5. Stage 5: Semantic Segmentation Using UNet

Table 23.

Figure 26.

Figure 27.

5. Conclusion

Data Availability

Conflicts of Interest

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK