Hybridization of Deep Learning Pre-Trained Models with Machine Learning Classifiers and Fuzzy Min–Max Neural Network for Cervical Cancer Diagnosis

Madhura Kalbhor; Swati Shinde; Daniela Elena Popescu; D Jude Hemanth

doi:10.3390/diagnostics13071363

. 2023 Apr 6;13(7):1363. doi: 10.3390/diagnostics13071363

Hybridization of Deep Learning Pre-Trained Models with Machine Learning Classifiers and Fuzzy Min–Max Neural Network for Cervical Cancer Diagnosis

Madhura Kalbhor ^1,^*, Swati Shinde ^1,^*, Daniela Elena Popescu ², D Jude Hemanth ³

Editor: Zoard T Krasznai

PMCID: PMC10093705 PMID: 37046581

Abstract

Medical image analysis and classification is an important application of computer vision wherein disease prediction based on an input image is provided to assist healthcare professionals. There are many deep learning architectures that accept the different medical image modalities and provide the decisions about the diagnosis of various cancers, including breast cancer, cervical cancer, etc. The Pap-smear test is the commonly used diagnostic procedure for early identification of cervical cancer, but it has a high rate of false-positive results due to human error. Therefore, computer-aided diagnostic systems based on deep learning need to be further researched to classify the pap-smear images accurately. A fuzzy min–max neural network is a neuro fuzzy architecture that has many advantages, such as training with a minimum number of passes, handling overlapping class classification, supporting online training and adaptation, etc. This paper has proposed a novel hybrid technique that combines the deep learning architectures with machine learning classifiers and fuzzy min–max neural network for feature extraction and Pap-smear image classification, respectively. The deep learning pretrained models used are Alexnet, ResNet-18, ResNet-50, and GoogleNet. Benchmark datasets used for the experimentation are Herlev and Sipakmed. The highest classification accuracy of 95.33% is obtained using Resnet-50 fine-tuned architecture followed by Alexnet on Sipakmed dataset. In addition to the improved accuracies, the proposed model has utilized the advantages of fuzzy min–max neural network classifiers mentioned in the literature.

Keywords: convolutional neural networks, machine learning, fuzzy min–max neural network (FMMN), cytology image classification, pre-trained models, transfer learning

1. Introduction

Cervical cancer is a type of cancer that develops in the cells of the cervix, which is the lower part of the uterus that connects to the vagina. Cervical cancer is usually caused by a human papillomavirus (HPV) infection, which is a sexually transmitted infection. HPV is a very common virus that can cause abnormal changes in the cells of the cervix, which can eventually lead to cancer if left untreated [1].

Cervical carcinoma is the most prevalent cancer diagnosed in 23 countries and the primary cause of mortality in 36 nations [1,2]. Furthermore, 85 percent of cervical cancers were encountered in the late stages. It is the fourth most frequent cancer in women as well as the leading cause of death, with an approximate 604,000 reported incidents and 342,000 deaths worldwide in 2020 [1]. Figure 1 depicts the mortality age-standardized rates and region-specific incidence for cervical cancer in 2020. The (W) world age standardized incidence rate is shown in descending order, and the highest national age-standardized incidence and mortality rates are overlaid. In such areas, it is critical to ensure that resource-intensive vaccination and screening programs are carried out to improve the situation [2].

Mortality Age-Standardized Rates and Region-Specific Incidence for Cervical Cancer in 2020. Reprinted with permission from Ref. [1]. Copyright 2020 IARC/WHO.

Pap smear, liquid based cytology, and colposcopy are the main screening methods for cervical cancer diagnosis. In a Pap-smear test, cell samples are collected from the transformation zone of the cervix, and for abnormalities, it is examined under the microscope. The colposcopy examination deals with examining abnormalities in the cervix with the help of the colposcope; it is a direct visual examination done by gynecologists [3]. Regular screening of women over 30 years of age is advisable for early detection and treatment.

The human-based smear analysis is difficult, laborious, time consuming, costly, and prone to errors since each smear slide consists of approximately 3 million cells with varying overlapping and orientation, necessitating the development of a computerized system capable of analyzing the Pap smear effectively and efficiently [4]. Extensive research has been conducted to assist pathologists in tracking cervical cancer with the development of computer-aided diagnostic (CAD) systems. This type of system consists of different steps, including image preprocessing, segmentation, feature extraction, feature selection, and classification. To enhance the image quality, filtering-based preprocessing is carried out. Much work is carried out to segment the nucleus and cytoplasm using different image-processing techniques [5]. The images are used to extract texture, morphological, and color metric features. The feature selection techniques are applied for the identification of the most discriminant features, and then, classifiers are designed to classify the cervical cytology cell images [6].

The above mentioned workflow necessitates multiple steps for processing the data. The handcrafted features lack the guarantee superior classification performance, highlighting the inadequacy of automatic learning. Deep learning methods have demonstrated success in a variety of applications over the last decade, including object recognition, natural language processing, signal processing, image classification, segmentation, and so on [7,8,9,10]. The deep network architecture has the ability to learn features automatically based on the spatial relationships among the pixels. The multiple layers with simple nonlinear activation functions are used to transform input data from abstract to specific at multiple levels of feature representation.

The network can learn such hierarchical feature representations from a large scale of training data in an unsupervised or supervised manner. In many practical applications, such learned hierarchical features have outperformed handcrafted designs [11].

Lotfi A. Zadeh [12] proposed a fuzzy logic data analysis approach and an engineering approach. Fuzzy set theory is the basis for fuzzy logic which deals with reasoning that is approximate rather than precise in classical two-valued logic. As a result, it is a technique for formalizing the human capacity for imprecise reasoning. Such reasoning exemplifies the human ability to reason roughly and make decisions in the face of uncertainty [12]. Fuzzy set theory is considered a good framework for classification problems because of the inherent fuzziness in the cluster. FMMN has been used in many applications, including fault detection, lung cancer detection, breast cancer detection, medical data analysis, etc. [13,14,15].

This paper presents a hybrid method for the classification of cytology Pap-smear images into abnormal and normal. The machine learning classifiers and fuzzy min–max neural network are trained for two-class problems using the features to extract by fine tuning the deep learning pre-trained models. The following are the main contributions of the proposed work.

(1) Presents a novel and hybrid approach by leveraging the strengths of pre-trained deep learning models with machine learning classifiers and fuzzy min–max neural networks.

(2) Fine tunes the pretrained CNN architectures, including Alexnet, ResNet-18, ResNet-50, and GoogleNet, to overcome the dataset limitations.

(3) Extracts the learned and specific features from Pap-smear images, which are proven to be more effective than handcrafted features and classify by using different machine learning classifiers and enhancing the classification performance using fuzzy min-max neural network.

(4) Provides improved accuracy with the advantages of different properties of the fuzzy min–max neural network classifier given by Simpson [16].

2. Literature Review

To classify the cervical cytology images, various deep learning and machine learning-based techniques are used, for example, researchers in [17,18] make use of local binary pattern, texture, histogram features, local binary pattern, and grey level features. The features are then given as input to a hybrid classifier system that combines SVM and a neuro-fuzzy for classification of the cervical images [19].

Jyothi Priyankaa et al. (2021) [20] consider Pap smear test images for cancerous cell prediction combined with deep learning techniques for more efficient results. The ResNet50 pre-trained model of convolutional neural networks (CNNs) for the prediction of cancerous cells produces accurate results. Except for the final layer, which is trained according to the requirements, all the layers in the proposed work are considered as they are. This methodology correctly classifies all classes with 74.04 percent accuracy.

Deep transfer learning was used by Anurag Tripathi et al. (2021) [21] to aid in the diagnosis of cervical cancer. They used the SIPAKMED dataset for this purpose. Dyskeratotic, koilocytotic, metaplastic, parabasal, and superficial intermediate were the five classes used. The testing accuracy of ResNet50 is 93.87 percent. The ResNet-152 model achieved an accuracy of 94.89 percent. VGG-16 performed best with parabasal cells, achieving the lowest accuracy of all four models at 92.85 percent. The testing accuracy of VGG-19 was slightly higher than that of VGG-16, which was 94.38 percent.

Wafa Mousser et al. (2019) [22] used deep neural networks and optimized MLP classifiers for the classification of Herlev Pap-smear images. Feature extraction is done using deep neural networks and classification using optimized MLP classifiers. The ability of feature extraction from four different pre-trained models to classify Pap-smear images was investigated. The comparisons concluded that ResNet50 outperforms the VGGs and the InceptionV3 by 15% in Pap-smear image classification.

Kurnianingsih et al. (2019) [23] applied mask R-CNN to the whole slide cell image, outperforming the previous segmentation method in precision, recall, and ZSI. For classification, a VGG-like net is used on whole segmented cells. Results shown for binary classification problem had 98.1% accuracy and for the seven-class problem accuracy of 95.9% is obtained.

Sornapudi et al. (2019) [24] proposed a method for automatically classifying cervical cell images by generating labelled patch data, fine-tuning convolutional neural networks for the extraction of deep hierarchical features and the novel graph-based cell detection approach for cellular level evaluation. The results demonstrated that the proposed pipeline could classify images of single cells as well as overlapping cells. The VGG-19 model performed accurately at classifying cervical cytology patch data, with a precision-recall curve of 95%.

The deep learning approach reviewed in Swati Shinde et al. (2022) [25] can directly process raw images and offers automated learning of features based on specific objective functions, such as detection, segmentation, and classification. Different existing pre-trained models, such as ResNet-50, ResNet-152, and VGG are used in the literature for the classification of Pap-smear images for the diagnosis of cervical cancer. Table 1 shows the summarization of the different papers studied and analyzed.

Table 1.

Summarization of Prevailing Research Work.

Paper	Data Set	Pre-Processing	Feature Extraction/ Classification	Results
[20]	Herlev University Hospital	Resize, Color to Grey, Expansion of dimensions	RESNET-50	Accuracy 74.04%
[21]	SIPAKMED	Resize 244 × 244	RESNET-50, RESNET-152, VGG-16, VGG-19	Highest 94.89% accuracy was obtained with ResNet-152
[22]	Herlev University Hospital	Data Augmentation	VGG16. InceptionV3 VGG19, ResNet50 Classification—MLP classifier	ResNet-50 89%
[23]	Herlev University Hospital	Data Augmentation Segmentation—Mask R-CNN	VGGNet	Mask R-CNN segmentation produces the best average performance, i.e., 0.92 ± 0.06 precision, 0.91 ± 0.05 recall and 0.91 ± 0.04 ZSI and 0.83 ± 0.10 Binary classification problem 98.1% accuracy Seven-class problem high accuracy of 95.9%
[24]	Herlev University Hospital	Subtraction of blue color space from red color space, skeletonizing and refining boundaries	VGG-19, ResNet-50, DenseNet-120, and Inception_v3	VGG-19—88% Accuracy
[25]	Herlev University Hospital, SIPAKMED, LBC	Data Augmentation	XceptionNet, VGGNet, ResNet50 and Ensemble of classifiers	Accuracy 97%, 99%, and 100%
[26]	Herlev University Hospital	Resize 256 × 256	DCT and Haar transform	Highest 81.11% accuracy was obtained with DCT

Cell Category		Number of Cells
Normal squamous	Normal	74
Intermediate squamous		70
Columnar		98
Mild dysplasia	Abnormal	182
Moderate dysplasia		146
Severe dysplasia		197
Carcinoma in situ		150
Total		917

Cell Category		Number of Cells
Superficial	Normal	831
Parabasal	Normal	787
Koilocytotic	Abnormal	825
Dyskeratotic	Abnormal	813
Metaplastic	Benign	793
Total	Benign	4049

Assessments	Formula
Accuracy	$\frac{TP + TN}{TP + TN + FP + FN}$
Sensitivity/Recall	$\frac{TP}{TP + FN}$
Specificity	$\frac{TN}{TN + FP}$
Precision	$\frac{TP}{TP + FP}$
F1 Score	$2 \times \frac{Pricision \times Recall}{Precision + Recall}$

	AlexNet
Dataset	Classifier	Bayes Net	Navie Bayes	Random Forest	Random Tree	Decision Table	Part	Simple Logistic
Herlev	Testing Accuracy (%)	83.33	82.24	87.68	81.8	88.04	86.59	88.6
Sipakmed	Testing Accuracy (%)	91. 2	91.6	91.2	90.70	93.23	89.5	95.14

GoogleNet
Dataset	Classifier	BayeNet	Navie Bayes	Random Forest	Random Tree	Decision Table	Part	Simple Logistic
Herlev	Testing Accuracy (%)	83.70	82.97	86.96	81.88	84.06	86.59	87.32
Sipakmed	Testing Accuracy (%)	87.37	85.24	90.24	83.11	87.62	89.75	92.21

		Theta	0	0.1	0.2	0.3	0.4	0.5	0.6	0.7	0.8	0.9	1
Alexnet	Herlev Dataset	Accuracy	87.32	84.06	84.06	90.22	82.97	84.78	85.14	88.04	84.78	39.86	34.78
		Sensitivity	0.90	0.94	0.86	0.95	0.85	0.90	0.91	0.97	0.91	0.19	0.11
		Specificity	0.81	0.58	0.78	0.77	0.77	0.70	0.70	0.64	0.68	0.99	1.00
		Precision	0.93	0.86	0.92	0.92	0.91	0.89	0.89	0.88	0.89	0.97	1.00
		F1 Score	0.91	0.90	0.89	0.93	0.88	0.90	0.90	0.92	0.90	0.31	0.20
	Sipakmed Dataset	Accuracy	92.62	93.20	95.08	95.00	93.93	95.33	94.92	93.69	90.82	80.66	80.00
		Sensitivity	0.95	0.93	0.94	0.94	0.93	0.95	0.95	0.94	0.95	0.99	0.99
		Specificity	0.90	0.93	0.96	0.97	0.95	0.96	0.95	0.93	0.85	0.54	0.52
		Precision	0.93	0.95	0.97	0.98	0.97	0.97	0.97	0.95	0.90	0.76	0.76
		F1 Score	0.94	0.94	0.96	0.96	0.95	0.96	0.96	0.95	0.93	0.86	0.86

		Theta	0	0.1	0.2	0.3	0.4	0.5	0.6	0.7	0.8	0.9	1
Googlenet	Herlev Dataset	Accuracy	82.25	86.23	83.70	84.78	86.96	88.41	89.49	88.04	86.96	82.25	82.25
		Sensitivity	0.87	0.93	0.89	0.89	0.92	0.98	0.97	0.97	0.95	0.87	0.87
		Specificity	0.68	0.67	0.70	0.74	0.74	0.62	0.70	0.63	0.64	0.70	0.70
		Precision	0.89	0.89	0.89	0.90	0.91	0.88	0.90	0.88	0.88	0.89	0.89
		F1 Score	0.88	0.91	0.89	0.90	0.91	0.93	0.93	0.92	0.91	0.88	0.88
	Sipakmed Dataset	Accuracy	89.34	90.66	90.66	92.13	91.15	91.80	91.15	88.52	85.16	83.03	82.79
		Sensitivity	0.91	0.91	0.92	0.91	0.89	0.91	0.90	0.86	0.86	0.96	0.93
		Specificity	0.86	0.90	0.89	0.94	0.94	0.92	0.93	0.92	0.84	0.64	0.68
		Precision	0.91	0.93	0.93	0.96	0.96	0.95	0.95	0.94	0.89	0.80	0.81
		F1 Score	0.91	0.92	0.92	0.93	0.92	0.93	0.92	0.90	0.87	0.87	0.87

		Theta	0	0.1	0.2	0.3	0.4	0.5	0.6	0.7	0.8	0.9	1
ResNet-18	Herlev	Accuracy	88.77	75.00	89.49	89.13	91.30	91.67	88.04	86.96	86.23	86.96	86.96
		Sensitivity	0.92	0.92	0.91	0.91	0.97	0.99	0.97	0.94	0.94	0.95	0.95
		Specificity	0.81	0.27	0.86	0.85	0.75	0.73	0.64	0.67	0.64	0.66	0.66
		Precision	0.93	0.78	0.95	0.94	0.92	0.91	0.88	0.89	0.88	0.88	0.88
		F1 Score	0.92	0.84	0.93	0.92	0.94	0.95	0.92	0.91	0.91	0.91	0.91
	Sipakmed	Accuracy	91.48	90.82	91.31	92.79	92.87	93.77	90.90	86.80	81.72	77.21	72.46
		Sensitivity	0.93	0.92	0.92	0.92	0.93	0.93	0.93	0.92	0.91	0.93	0.96
		Specificity	0.89	0.88	0.90	0.94	0.93	0.95	0.87	0.79	0.67	0.53	0.36
		Precision	0.93	0.92	0.93	0.96	0.95	0.96	0.92	0.87	0.81	0.75	0.70
		F1 Score	0.93	0.92	0.93	0.94	0.94	0.95	0.93	0.89	0.86	0.83	0.81

		Theta	0	0.1	0.2	0.3	0.4	0.5	0.6	0.7	0.8	0.9	1
ResNet50	Herlev	Accuracy	88.77	86.23	87.32	88.04	87.32	87.32	85.87	87.32	86.96	82.25	81.88
		Sensitivity	0.91	0.93	0.91	0.90	0.90	0.93	0.89	0.93	0.91	0.83	0.85
		Specificity	0.84	0.68	0.78	0.82	0.79	0.73	0.77	0.73	0.77	0.81	0.73
		Precision	0.94	0.89	0.92	0.93	0.92	0.90	0.91	0.90	0.92	0.92	0.90
		F1 Score	0.92	0.91	0.91	0.92	0.91	0.91	0.90	0.91	0.91	0.87	0.87
	Sipakmed	Accuracy	92.05	92.62	92.70	94.18	95.25	95.33	94.18	89.10	84.02	80.82	72.70
		Sensitivity	0.93	0.93	0.94	0.95	0.94	0.95	0.94	0.85	0.82	0.95	0.99
		Specificity	0.90	0.92	0.91	0.93	0.97	0.96	0.95	0.96	0.87	0.60	0.32
		Precision	0.93	0.95	0.94	0.95	0.98	0.97	0.96	0.97	0.91	0.78	0.69
		F1 Score	0.93	0.94	0.94	0.95	0.96	0.96	0.95	0.90	0.86	0.86	0.81

	AlexNet	GoogleNet	ResNet18	ResNet50
Herlev	90.22 (FMMN)	89.49 (FMMN)	91.67 (FMMN)	92.03 (Simple logistic)
Sipakmed	95.32 (FMMN)	92.21 (Simple logistic)	93.85 (Simple logistic)	95.33 (FMMN)

Approach	Accuracy
Deep Learning (Resnet-50) [20]	74.04%
ResNet-152 [21]	94.89%
ResNet-50 [22]	89%
VGG-19 [24]	88%
Proposed Model [Hybrid CNN] ResNet50	95.33%

PERMALINK

Hybridization of Deep Learning Pre-Trained Models with Machine Learning Classifiers and Fuzzy Min–Max Neural Network for Cervical Cancer Diagnosis

Madhura Kalbhor

Swati Shinde

Daniela Elena Popescu

D Jude Hemanth

Roles

Abstract

1. Introduction

Figure 1.

2. Literature Review

Table 1.

3. Proposed Methodology

Figure 2.

3.1. Module 1

3.1.1. Feature Extraction Using Pre-Trained Models

Table 2.

3.1.2. Min–Max Normalization

3.2. Module 2

3.2.1. Machine Learning Classifiers

3.2.2. Fuzzy Min–Max Neural Network

Figure 3.

Expansion

Overlap Test

Contraction

3.3. Algorithm 1

4. Experimentation Environment

4.1. Herlev Dataset

Table 3.

Figure 4.

4.2. Sipakmed

Figure 5.

Table 4.

4.3. Performance Measures

Table 5.

5. Experiments and Results

Table 6.

Table 7.

Table 8.

Table 9.

Table 10.

Table 11.

Table 12.

Table 13.

Performance Analysis

Figure 6.

Table 14.

Figure 7.

Table 15.

6. Conclusions

Author Contributions

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Funding Statement

Footnotes

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases