Deep learning model for fully automated breast cancer detection system from thermograms

Esraa A Mohamed; Essam A Rashed; Tarek Gaber; Omar Karam

doi:10.1371/journal.pone.0262349

. 2022 Jan 14;17(1):e0262349. doi: 10.1371/journal.pone.0262349

Deep learning model for fully automated breast cancer detection system from thermograms

Esraa A Mohamed ¹, Essam A Rashed ^1,², Tarek Gaber ^3,^4,^*, Omar Karam ⁵

Editor: Robertas Damaševičius⁶

PMCID: PMC8759675 PMID: 35030211

Abstract

Breast cancer is one of the most common diseases among women worldwide. It is considered one of the leading causes of death among women. Therefore, early detection is necessary to save lives. Thermography imaging is an effective diagnostic technique which is used for breast cancer detection with the help of infrared technology. In this paper, we propose a fully automatic breast cancer detection system. First, U-Net network is used to automatically extract and isolate the breast area from the rest of the body which behaves as noise during the breast cancer detection model. Second, we propose a two-class deep learning model, which is trained from scratch for the classification of normal and abnormal breast tissues from thermal images. Also, it is used to extract more characteristics from the dataset that is helpful in training the network and improve the efficiency of the classification process. The proposed system is evaluated using real data (A benchmark, database (DMR-IR)) and achieved accuracy = 99.33%, sensitivity = 100% and specificity = 98.67%. The proposed system is expected to be a helpful tool for physicians in clinical use.

1. Introduction

Breast cancer is one of the most commonly diagnosed malignancies in women around the world [1]. In 2018, breast cancer reached approximately 15% of registered cases of cancer-linked death among women [2, 3]. Breast abnormalities can be detected by self-examination, physicians, or imaging techniques. The only way to assure whether there is cancer or not is biopsy [4]. There are several breast imaging techniques (for examples ultrasound, mammography… etc), which are currently being used for early detection of breast cancer [5]. The leading and most popular screening modality is the mammography due to the relatively high-accuracy, low-cost, and high detectability [6, 7]. Mammograms can provide an effective imaging tool for high accuracy for breast cancer detection and classification. However, its performance is known to be weak in some cases especially for patients with dense breast tissues [8]. Moreover, it may lead to sever side-effects related to ionized radiation for young age ladies [9]. Moreover, it is known that observing small size lesion less than 2 mm is difficult using mammograms [10]. These limitations lead to a high interest in thermography, which is an emerging technology in breast cancer screening. Thermography is a radiation-free, low-cost, non-inclusive, and non-invasive technique [11]. Therefore, it can be used to detect early-stage breast cancer in young women and individuals with dense breasts.

The main idea of thermography is that all living bodies emit infrared (IR) above absolute zero [1, 8]. A thermal infrared camera converts IR radiation into electrical signals, which are shown as a thermogram, in the breast thermography modality [12]. Therefore, potential abnormalities are emphasized and separated from normal tissue as it has a different temperature scale [13, 14]. Breast thermography has several advantages over mammography, including its ability to work with dense breast tissues, effectiveness across all age groups, and ease of use for male patients [5]. Thermography is known for being safe (non-ionized radiation), quick, and leads to early detection of breast cancer [5]. Fig 1 presents the procedure for breast thermography.

Breast area segmentation is a technique for separating the breast region from other parts of the body in thermal images, is an important step in any breast cancer detection system [15]. As much as possible, the extracted region must include all breast tissues, ducts, lobules and lymph nodes. Breast segmentation process ranges from a totally manual to a fully automatic. Because of the unique properties of each breast, which make them amorphous, and the lack of clear boundaries in this type of images, most scientific researches prefer to extract the breast region by using manual or semi-automatic extraction process.

During the last decades, scientific researches were focused on machine learning methods concerned with the diagnosis of breast cancer using thermography; some researchers concentrate their work on determining the size and location of tumors; but others have been concentrated on characteristics such as acquisition protocols and breast quadrants. Deep learning is one of machine learning methods, which uses multilayer convolutional neural networks (CNN) [16]. Deep learning has the ability to automatically extract features from a training dataset [7]. In recent years, scientists have achieved promising results with CNNs for the diagnosis of breast cancer. In the past, the usage of CNNs for the diagnosis of breast cancer with thermal images was not widely used, maybe because of the efficiency of CNNs in comparison with texture or statistical features, or because of the high of computational load [17]. In recent years, CNNs were considered as one of the leading methods for pattern recognition.

The thermal image contains incorporates superfluous areas as neck, shoulder, chess and other parts of the body which behaves as noise during the training in CNN models. However, thermography images are difficult to process due to low-resolution in image spatial domain, it is necessary to extract the breast area from the thermal images which considered as a critical task as the results of the classification process are highly depended on segmentation results.

As previously mentioned, breast cancer is considered one of the leading causes of death among women. Therefore, early detection is necessary to save lives. Thermography imaging is an effective diagnostic technique which is used for breast cancer detection with the help of infrared technology, but it is dependent on the radiologist’s ability to interpret the thermogram. To the best of knowledge, the prior work has some limitations such as: (1) the limitation of the dataset, (2) some researches of the related work did not consider segmentation of the breast area before classification or extract the breast area manually, (3) some segmentation models removed parts of the breast, and (4) some researches evaluate their model by calculating the accuracy metric only. However, if the dataset is unbalanced, model’s high accuracy rate does not guarantee its ability to discriminate distinct classes equally [18]. Therefore, a fully automatic breast cancer detection system from thermograms is needed to diagnose the disease.

In this study, we propose a fully automatic breast cancer detection system. First, U-Net network is utilized to automatically extract and isolate the breast area from the rest of the body in thermograms. Second, we propose a deep learning model, which is trained for the classification of abnormal breast tissues using thermal images. The proposed method consists of three main phases, resizing, breast area segmentation and deep learning model for classification. In resizing phase, the thermal images are resized to a smaller size to accelerate computation. In breast area segmentation phase, the breast region is extracted automatically by using U-Net network. In deep learning model for classification phase, we proposed a deep learning model based two-class CNN, which is trained from scratch and used for the classification of normal and abnormal breast tissue.

The main contribution of this paper is as following:

Extracting and isolating the breast area automatically from other parts of thermal images by using CNN (U-Net).
Proposing a deep learning model for the classification of normal and abnormal breast tissues from thermograms
Evaluating the performance of the proposed model using accuracy, sensitivity and specificity.
Comparing the proposed model with state-of art methods.

The structure of the paper is as follows. Section 2 explains the literature review and Section 3 explains the proposed method. Section 4 contains the experimental results. Finally, the paper is discussed in section 5 and concluded in section 6.

2. Literature review

The majority of efforts related to the diagnosis of breast cancer from thermogram use the web available DMR-IR database [19]. In this section, we will present a review of some studies using the DMR-IR database.

Any Computer-Aided Detection (CAD) system for breast cancer detection can be separated into principally three phases: segmentation process, feature extraction and classification. The thermal image contains incorporates superfluous areas as neck, shoulder, chess and other parts of the body, but during the training in CNN models or during the feature’s identification process, this data acts as noise. Therefore, several authors have focused their research on decrease as much non-relevant information as possible and extracting region of interest (ROI) instead of identifying patterns in thermograms. Mahmoudzadeh et al. [20] used extended hidden Markov models (EHMM), BayesNet and Random Forest for the optimization of breast segmentation techniques. But, the proposed method can be used only as a first stage in automatic or semi-automatic system. Also, the of the algorithm need to be improved in case of online application. Ali et al. [1] proposed an automatic segmentation method for ROI extraction from breast thermograms based on the normal and abnormal breasts based on statistical and texture features extracted from ROI. But, the presented method has a limitation of dataset. Also, by this method some lower parts of the breast will be removed. Gaber et al. [8] Proposed an enhanced segmentation method based on both Neutrosophic sets (NS) and optimized Fast Fuzzy c-mean (F-FCM) algorithm. Then, they used different kernel functions of Support Vector Machine (SVM) to detect normal and abnormal breast. They obtained accuracy = 92.06%, recall = 96.55% and precision = 87.50%. But, the proposed segmentation method implemented on a limited number of dataset.

The main process in CAD system is Feature Extraction. This aims to extract certain features from a breast thermogram, analyze and compare these features to obtain significant results. This process will reduce the complexity of classification process. Araujo et al. [21] presented a symbolic data analysis on 50 patients’ thermograms and obtained the interval data in the symbolic data analysis and statistical analysis. They proposed three-stage feature extraction method. In the first stage, maximum and minimum temperature value from thermal images processed by morphological operations are extracted. In the second stage, interval features are extracted and continuous features are produced. In the third stage, Fisher’s criterion is used to transform the continuous features to new feature space which produce the input data to the classification process. They used a leave-one-out cross validation method during the training process. They reached sensitivity = 85.7% and specificity = 86.5%. De Santana et al [22] study the performance of several classification techniques and group the thermal images into one of the following groups: benign lesion, malignant lesion and cyst with the use of Haralick and Zernike descriptors for attributes extraction. They use Artificial neural networks (ANN), Multi-layer perceptron (MLP), Extreme learning machines(ELM), decision trees (DT) and Bayesian classifiers to perform the classification. They achieved accuracy of 76.01% by using MLP as classifier with 10-fold cross validation. Milosevic et al. [23] extracted 20 Gray Level Co-occurrence Matrices (GLCM) features from 40 thermal images. They used Support Vector Machine (SVM), K-Nearest Neighbor (kNN) and Naïve Bayes (NN) as Classifiers. Also, they used K-fold cross validation method with K = 5 and achieved accuracy = 92.5% by using kNN classifier and Sensitivity = 85.7% by using SVM and Naïve Bayes as classifier. The proposed system extracted the breast area manual and results can’t be generalized due to limitations of the dataset. Dey et al. [24] extract 112 features by using texture features and entropy features. They used DT, KNN, SVM1 and SVM-RBF (SVM2) as classifiers. The proposed system attained an overall accuracy>89%. But, the breast area is extracted manually and a limited number of dataset is used to evaluate the proposed system Francis et al. [25] presented a curvelet transform-based feature extraction approach to detect breast abnormality from thermal images. The curvelet transform enhances the accuracy of the classification process by representing edges and distinctiveness in curves in an image. They obtained accuracy = 90.91%, Sensitivity = 81.82% and Specificity = 100% by using SVM as classifier. Pramanik et al. [26] calculated discrete wavelet transform to determine the initial feature point image of each thermal image. They used Principal Component Analysis (PCA) to reduce feature matrix dimension. Also, they used a feed-forward Perceptron on 306 thermal images and achieved accuracy = 90.48%, sensitivity = 87.6% and specificity = 89.73%. Rajinikanth et al. [27] proposed an automated breast cancer detection system from thermal images. They used two feature extraction pipelines (1) saliency enhancement, morphological segmentation and GLCM feature mining and, (2) Local-Binary-Pattern (LBP) enhancement and feature extraction. Then, serial feature integrations are implemented and Marine-Predators-Algorithm (MPA) is used to choose the optimized features. The optimized features are used to evaluate the performance of different SVM classifiers. They achieved accuracy of 93.5 by using SVM cubic (SVM-C) and SVM Coarse Gaussian (SVM-CG).

Deep learning approaches have recently been developed to extract characteristics and improve the efficiency of medical image analysis. Deep learning is one of machine learning methods, which uses multilayer convolutional neural networks (CNN). Unlike other feature extraction techniques, the CNN is able to extract the features of the images from the dataset directly. This type of feature extraction is used to extract features from different parts of the image using convolution. Mambou et al. [28] proposed a deep neural network (DNN) model depending on a pre- trained Inception V3 model [29] for the classification of sick breast and healthy breast. They involved SVM classifier in the case of an uncertainty in the output of the DNN; additionally, they catch attention to the breast’s physical model camera sensitivity. Gomaz et al. [17] study the impact of data preprocessing, data augmentation and the size of database versus a proposed set of CNN models. Also, they used a tree Parzen estimator to develop a CNN hyper-parameters fine- tuning optimization model. They achieved an accuracy of 92% and F1-score of 92%. Cabıoğlu and Oğul [30] designed various CNNs by using transfer learning technique. They achieved an accuracy of 94.3%, a precision of 94.7% and a recall of 93.3%. But, they didn’t use a segmentation method to extract the breast area from other parts of the thermal images. Barbosa et al. [31] used deep-wavelet neural networks(DWNN) as a feature extraction technique. They found that when features’ number increases, by adding additional levels in the DWNN, better performance can be achieved in solving the classification problem. They obtained 95% of sensitivity and 79% of specificity. Based on bio-data, image analysis, and image statistics. Ekici and Jawzal [32] suggested a new technique for feature extraction. To classify the breast images as normal or suspicious, they used a CNN optimized by the Bayes algorithm. They achieved accuracy around 99%.

From the discussed related work above, it could be remarked that the prior work has some limitations such as:

some related work used a small number of the dataset as in [8, 23].
some related work did not consider segmentation of the breast area before classification such as in [30] or extract the breast area manually such as in [23, 24].
some segmentation models such as in [1] removed parts of the breast.
some work has been evaluated by only calculating the accuracy metric only such as in [32]. However, the high accuracy rate of a model does not ensure its ability to distinguish different classes equally if the dataset is unbalanced [39].

Therefore, a fully automated breast cancer detection system from thermograms is needed and should be evaluated by not only the accuracy but also the most related metrics such as sensitivity and specificity.

3. Proposed method

To automate and improve the accuracy of thermography systems, we designed a deep learning-based system which integrates U-Net network and a proposed deep learning model. The proposed system is a combination two important methods: U-Net network and a two-class CNN-based deep learning model. First, U-Net is a convolutional network architecture which proved very strong in biomedical segmentation and very fast compared with other methods [33]. U-Net is used in our system to automatically extract and isolate the breast area from other parts of the body which act as noise in the detection system. Second, the two-class CNN-based deep learning model is trained from scratch to extract more characteristics from the dataset that is helpful in training the network and improve the efficiency of the classification process. The novelty of the proposed system lays in using U-Net network for automating the segmentation process and building a deep learning model which use the output of U-Net to classify the given thermogram. The combination between U-Net and our proposed deep learning model proved to be effective as it achieved accuracy = 99.33%, sensitivity = 100% and specificity = 98.67%.

The proposed method is divided into three phases, image resizing, breast area segmentation and deep learning model for classification. Fig 2 summarized the proposed method in a flowchart.

3.1 Image resizing

The thermal images are of size 680 × 480 pixels and its computation time will be high due to the limitation of the PC capabilities used in this study. So, the thermal images are resized to a smaller size of 228 × 228 pixels for faster computation.

3.2 Breast area segmentation using deep learning (CNN)

The thermal image contains unnecessary areas as neck, shoulder, chess and other parts of the body which acts as noise during the training in CNN models. The aim of this phase is removing unwanted regions and using the areas destined to have cancer as the input of the CNN model for training and testing.

Because finding a large training dataset in medical problems is challenging, Ronneberger et al. proposed the U-NET network structure [33], a convolutional network architecture for biomedical segmentation that has a good influence on smaller training datasets [34]. So, we use U-net network for breast area segmentation from thermal images. U-net consists of a contracting path (left side) and an expansive path (right side). The contracting path consists of two 3x3 convolutions (unpadded convolutions) that are applied repeatedly, each followed by a rectified linear unit (ReLU) and a 2x2 max-pooling operation with stride 2 for downsampling. The number of feature channels is doubled with each downsampling step. In the expansive path, an upsampling of the feature map is followed by a 2x2 convolution that halves the number of feature channels, a concatenation with the proportionally cropped feature map from the contracting path, and two 3x3 convolutions, each followed by a ReLU. Also, U-net has some advantages such as it has a simple structure, less parameters, needs very few images for training (approximately 30 per application) and the training time is relatively short compared with other networks [35]. Example of U-net architecture is shown in Fig 3. The initial input dimension in UNet is depicted as 572x572. But, we define the network, so we can change the input dimension in the input layer to the desired one.

In U-Net, to decrease the resolution of the input image, an initial set of convolutional layers are combined with max-pooling layers. Then, in sequence, a number of convolutional layers paired with upsampling operators are applied in order to increase the resolution of the input image. When these two pathways are combined, a U-shaped graph is created, which can be used to perform image segmentation. Fig 4 shows example of using U-net network for breast area segmentation.

3.3 Deep learning model for classification

Convolution layer, Rectified Linear Activation Function (RELU) layer, max pooling layer, fully connected layer, and dropout layer are the five parts of a CNN model. The most significant part of CNN is the convolution layer. It consists of trainable filters and updates its parameters at each iteration. RELU layer is the most preferred layer in CNN architectures as it speeds up the training process. Max pooling layer is used to reduce parameter size and control overfitting. Neurons in fully connected layer are a regular neural network. Dropout layer is used to prevent overfitting.

A two-class CNN-based deep learning model, which is trained from scratch and used to classify normal and abnormal breast tissue, is proposed. The network has nine layers, as illustrated in Fig 5, with the first six being convolutional layers and the remaining three being fully connected layers. In the proposed model, the first layer filters the input image, of size 228 × 228 pixels, with 64 kernels of size 7 × 7 with a stride of 6 pixels. Kernels of the first layer are with depth = 3, which define the number of color channels of the input thermogram image. After applying max-pooling, which is used to enhance the robustness and reduce the computation, the output of the first layer is used as input for the second layer, filtering it with 128 kernels of size 3 × 3 × 64. Without pooling layers, the third, fourth, and fifth levels are connected to each other. The third layer consists of 256 kernels with a size of 3 × 3 × 128. The fourth layer has 256 kernels of size 3 × 3 × 256 and the fifth layer consists of 256 kernels with a size of 3 × 3 × 256. The sixth layer is connected to the fifth layer with max-pooling layer and has 256 kernels of size 3 × 3 × 256. On top of the convolutional layers, two fully-connected layers of 1024 neurons are connected to each other. The number of neurons in the third fully-connected layer equals the number of classes. The output of the convolutional and the max-pooling layers is calculated by Eqs (1) and (2), respectively.

O_{c o n v} = \frac{I - K + 2 P}{S} + 1

(1)

O_{p o o l i n g} = \frac{I - P_{S}}{S} + 1

(2)

where O_conv is the size of the output of the convolutional layer, I is the size of the input layer, K is the size of kernels used in the convolutional layer, P is padding, S is the stride of the convolution operation, O_pooling is the size of the output of the max-pooling layer and P_s is the pool size. The CNN model was implemented using the Matlab 2019b platform running on a laptop computer system with the following specifications: Intel (R) Core (TM) i7-2670 CPU@2.20GHZ with 8 GB RAM.

4. Experimental results

To evaluate the proposed method, a benchmark database (DMR-IR) [19] was used. This database is created by collecting the IR images from the Hospital of UFF University and published publicly with the approval of the ethics committee where consent should be signed by any patient. This study used a set of 1000 frontal thermogram images, captured using a FLIR SC-620 IR camera with a resolution of 640 × 480 pixels from this database (including 500 normal and 500 abnormal subjects). These images contain breasts in various shapes and sizes (see Fig 6). The dataset is split for segmentation and classification into training, validation and testing sets with the ratio 70:15:15, randomly. The dataset description is included in Table 1.

Fig 6 — Different cases of breast (a) small breast (b) large breast (c) asymmetric breast.

Table 1. Dataset description.

Dataset categories	Dimension	Training	Validation	Testing	Total
Normal	640x480	350	75	75	500
Abnormal	640x480	350	75	75	500

Open in a new tab

4.1 Breast area segmentation using deep learning (CNN)

During the training of breast segmentation phase with U-Net network, Adaptive Moment Estimation (ADAM) method was used as optimized algorithm with number of epochs = 30. Also, the training process was started with initial learning rate = 1.0e−3. The learning rate used a piecewise schedule and dropped by a factor of 0.3 every 10 epochs to allow the network to train quickly with a higher initial learning rate. The network trained with an 8-batch size to save memory. Fig 7 shows examples of breast area segmentation results.

Fig 7 — Breast area segmentation resuls (a) thermal image (b) ground truth (c) output.

4.2 Evaluation of the deep learning model

Classification Metrics evaluate the performance of the model and measure how good or bad the classification is.

Accuracy: Represents how many instances are completely classified correctly. It is calculated by dividing the total number of predictions by the number of right predictions. It is calculated by Eq (3)

A c c u r a c y = \frac{T_{P} + T_{N}}{T_{P} + T_{N} + F_{P} + F_{N}}

(3)

Sensitivity: Is calculated based on how many patients have the disease are correctly estimated. It is calculated by Eq (4)

S e n s i t i v i t y = \frac{T_{P}}{T_{P} + F_{N}}

(4)

Specificity: Is calculated based on how many patients do not have the disease are predicted right. It is calculated by Eq (5)

S p e c i f i c i t y = \frac{T_{N}}{T_{N} + F_{P}}

(5)

True Positive (TP) refers to a positive-class sample that has been successfully classified by a model.
False Positive (FP) refers to a sample that should have been classed as negative but was instead classified as positive.
True Negative (TN) refers to a negative-class sample that has been successfully classified by a model.
False Negative (FN) refers to a sample that should have been classed as positive but was instead classified as negative.

The accuracy metric indicates how many of the model’s predictions were right. However, if the dataset is unbalanced, a model’s high accuracy rate does not guarantee its ability to differentiate distinct classes equally. In specifically, in the classification of medical images, it is necessary to develop a model with the ability be applied to all classes. In cases, sensitivity and specificity should be used to provide information about the performance of the model. Sensitivity measures [3] the percentage of patient have the disease that the proposed model correctly predicted. Specificity measures the percentage of patient do not have the disease and correctly estimated by the proposed model. These two evaluation metrics measure the ability of the model to decrease FN and FP predictions.

In the training process, we use Adaptive Moment Estimation (ADAM)method as solver with batch size of 60 and number of epochs = 30. Also, the training process was started with initial learning rate = 2.0e−3. According to the training parameters, we achieve accuracy = 99.33%, sensitivity = 100% and specificity = 98.67%. The training progress and the confusion matrix of the proposed model are shown in Figs 8 and 9, respectively.

4.3 Impact of changing the training options on the classification process

We further study the impact of the training options on the classification accuracy, sensitivity and specificity. In Table 2, we show the influence of three different solvers, Stochastic Gradient Descent with Momentum(SGDM) [36], Adaptive Moment Estimation (ADAM) [37] and Root Mean Square propagation(RMSprop) [38]. In this table, the training process was started with initial learn rate = 2.0e−3, batch size was = 60 and number of epochs was = 30. The impact of starting the training process with different number of epochs is shown in Table 3. In Table 3, ADAM was used as solver with batch size of 60 and the initial learn rate of the training process was 2.0e−3. In Table 4, we show the impact of using different batch size, in the training process, on the classification accuracy, sensitivity and specificity. In this table, ADAM was used as solver with number of epochs = 30 and the initial learn rate was 2.0e−3. In Table 5, we show the impact of starting the training process with 32 different initial learning rates on the classification accuracy, sensitivity and specificity. In this table, ADAM was used as solver with batch size = 60 and number of epochs = 30

Table 2. Comparison between solvers (initial learn rate = 2.0e−3, number of epochs = 30 and batch size = 60).

Solver	Accuracy (%)	Sensitivity (%)	Specificity (%)
ADAM	99.33	100	98.67
SGDM	84.17	100	68.33
RMSprop	50	100	0.0

Open in a new tab

Table 3. The impact of using different number of epochs on the classification accuracy, sensitivity and specificity (solver = ADAM, initial learn rate = 2.0e−3, batch size = 60).

Number of Epochs	Accuracy (%)	Sensitivity (%)	Specificity (%)
10	88.67	94.67	82.67
20	97.33	100	94.67
30	99.33	100	98.67
40	100	100	100
50	100	100	100

Open in a new tab

Table 4. Impact of using different batch size on the classification accuracy, sensitivity and specificity (solver = ADAM, initial learn rate = 2.0e−3, number of epochs = 30).

Batch Size	Accuracy (%)	Sensitivity (%)	Specificity(%)
10	50	100	0.0
20	100	100	100
30	100	100	100
40	100	100	100
50	100	100	100
60	99.33	100	98.67
70	98.67	100	97.33
80	95.83	100	91.67
90	91.33	100	82.66
100	83.33	100	66.67

Open in a new tab

Table 5. Impact of starting the training process with different initial learn rate on the classification accuracy, sensitivity and specificity (solver = ADAM, batch size = 60 and number of epochs = 30).

Initial learning rate	Accuracy(%)	Sensitivity(%)	specificity(%)
9.0 e ⁻⁰¹	50	100	0.0
8.0 e ⁻⁰¹	50	100	0.0
7.0 e ⁻⁰¹	50	100	0.0
6.0 e ⁻⁰¹	50	100	0.0
5.0 e ⁻⁰¹	50	100	0.0
4.0 e ⁻⁰¹	50	100	0.0
3.0 e ⁻⁰¹	50	100	0.0
2.0 e ⁻⁰¹	50	100	0.0
1.0 e ⁻⁰¹	50	100	0.0
9.0 e ⁻⁰²	50	100	0.0
8.0 e ⁻⁰²	50	100	0.0
7.0 e ⁻⁰²	50	100	0.0
6.0 e ⁻⁰²	50	100	0.0
5.0 e ⁻⁰²	50	100	0.0
4.0 e ⁻⁰²	50	100	0.0
3.0 e ⁻⁰²	50	100	0.0
2.0 e ⁻⁰²	50	100	0.0
1.0 e ⁻⁰²	50	100	0.0
9.0 e ⁻⁰³	50	0.0	100
8.0 e ⁻⁰³	50	0.0	100
7.0 e ⁻⁰³	83.33	100	66.67
6.0 e ⁻⁰³	41.07	10.0	73.33
5.0 e ⁻⁰³	51.67	5.0	98.33
4.0 e ⁻⁰³	56.67	13.33	100
3.0 e ⁻⁰³	83.33	100	66.67
2.0 e ⁻⁰³	99.33	100	98.67
1.0 e ⁻⁰³	99.33	100	98.67
9.0 e ⁻⁰⁴	100	100	100
8.0 e ⁻⁰⁴	100	100	100
7.0 e ⁻⁰⁴	100	100	100
6.0 e ⁻⁰⁴	100	100	100
5.0 e ⁻⁰⁴	100	100	100

Open in a new tab

4.4 Performance of pretrained CNN models on the dataset

The performance of different pretrained CNN models such as ResNet18, GoogleNet [22], VGG16 and AlexNet is performed on the same Dataset [39]. A comparison between the evaluation metrics of pretrained CNN models and the proposed model is shown in Table 6. According to results in Table 6, we note that the performance of the proposed model is better than the performance of other CNN models on this dataset except VGG16Net.

Table 6. Comparison between the performance metrics of different CNN models and the proposed model.

CNN model	Accuracy	Sensitivity	Specificity
ResNet18	93.3	88.0	98.7
GoogleNet	79.33	84.00	74.67
AlexNet	50.0	0.0	100
VGG16	100	100	100
Proposed CNN	99.33	100	98.67

Open in a new tab

4.5 Impact of training/testing data size

The impact of dataset size is measured on the performance of the proposed model. Fig 6 plots the performance of three evaluation metrics on different dataset size. In this part of the experiment, ADAM was used as solver, number of epochs = 30, batch size = 60 and validation data = 15%. Fig 10 shows that the dataset size is critical on the classification process.

4.6 Performance of machine learning classifier on the dataset

We further study the performance of machine learning classifier such as SVM, KNN and Decision Tree on the classification process on the same dataset. First, we extract texture features by using Gray Level Co-occurrence Matrices (GLCM) [40] and Histogram of Oriented Gradients (HOG) [41]. Then, we distinguish between normal and abnormal breast tissue by using the classifiers. A comparison between the performance of machine learning classifiers with GLCM and HOG features extraction methods and the proposed model are shown in Tables 7 and 8.

Table 7. Comparison between the performance metrics of different machine learning classifier with texture features and the proposed model.

Classifier	Accuracy(%)	Sensitivity(%)	specificity(%)
SVM	89.33	86.67	92.0
KNN	53.33	64.0	42.67
Decision Tree	82.67	96.00	96.0
Proposed method	99.33	100	98.67

Open in a new tab

Table 8. Comparison between the performance metrics of different machine learning classifier with HOG features and the proposed model.

Classifier	Accuracy(%)	Sensitivity(%)	specificity(%)
SVM	78.28	73.33	86.67
KNN	20.0	20.0	20.0
Decision Tree	40.0	58.67	21.33
Proposed method	99.33	100	98.67

Open in a new tab

4.7 Statistical analysis

To analyze the evaluation of the proposed system statistically, we perform the analysis of variance (ANOVA) test [18, 42], where the proposed system is compared with ResNet18, GoogleNet and VGG16 networks. The result of the ANOVA test is shown in Table 9. To reject the null hypothesis, the p-value in the ANOVA test should be less than 0.05. According to Table 9, the p−value is less than 0.05. so, the null hypothesis was rejected by the results of ANOVA test.

Table 9. Results of the ANOVA test of the proposed model and CNN models.

Model	P-value
ResNet18	0.0423
GoogleNet	0.0173
VGG16	0.0023

Open in a new tab

5. Discussion

In this study, we propose a fully automatic breast cancer detection system. The proposed system uses U-Net network to extract the breast area from thermal images and propose a deep learning model, which is trained for the classification of abnormal breast tissues using thermal images. The proposed system consists of three main phases, resizing, breast area segmentation and deep learning model for classification. In resizing phase, the thermal images are resized to a smaller size to accelerate computation. In breast area segmentation phase, the breast region is extracted automatically by using U-Net network. In deep learning model for classification phase, we proposed a two-class CNN-based deep learning model, which is trained from scratch and used for the classification of normal and abnormal breast detection.

The experimental results obtained show an overview of our contribution in (1) extracting the breast area from the thermal images automatically (2) studying the impact of the training options on the classification accuracy, sensitivity and specificity. (3) comparing between the performance of pretrained CNN models such as ResNet18, GoogleNet, AlexNet, VGG16Net and the proposed model. (4) comparing between machine learning classifier such as SVM, KNN and Decision Tree and the proposed model. In Table 2, we study the influence of three different solvers, SGDM, ADAM and RMSprop on the classification process. From Table 2, we can note that ADAM has the best behavior. In Table 3, we study the impact of starting the training process with different number of epochs. From Table 3, we can obtain that when the number of epochs is increased, accuracy, sensitivity and specificity values are increased. We further study the impact of using different batch size, in the training process, on the classification process in Table 4. From this table, we can obtain that when the batch size = 10 the behavior of the classification process is very bad and when the batch size value is from 20 to 50, accuracy, sensitivity and specificity have constant values. Also, when the batch size is greater than 50, the accuracy and specificity values are decreased but the sensitivity has a constant value. In Table 5, we show the impact of starting the training process with different initial learning rates on the classification process. From Table 5, it is obtained that when the learning rate is greater than 3.0 e⁻⁰³, the behavior of the classification process becomes very bad except at learning rate = 7.0 e⁻⁰³. In Table 6, we perform a comparison between the performance of pretrained CNN models such as ResNet18, GoogleNet, AlexNet, VGG16Net and the proposed model. From Table 6, we can note that the performance of the proposed model is better than the performance of other CNN models on this dataset except VGG16Net. In Fig 6, the impact of dataset size is measured on the performance of the proposed model. From Fig 6, we can note that when the training data size is increased and the testing data size is decreased the accuracy values of the proposed model is increased except when the testing data size = 10%. In addition, we compare between the performance of machine learning classifiers with GLCM and HOG features extraction methods and the proposed model in Tables 7 and 8 and we can note that our proposed model has the best result of the performance metrics. From Table 8, we can note that KNN and Decision Tree classifiers have a very bad results on this dataset with HOG features.

To further evaluate our proposed system, as shown in Table 10, a comparison between the proposed system and other studies based on breast area segmentation and breast cancer detection is performed. From this table, we can note that the dataset used by our proposed system is large compared with the dataset of some related work. Also, our system extracts the breast area automatically by using U-Net network, but some related work doesn’t used segmentation method and other extract it manually. In addition, the evaluation metric of our proposed system is better than related work. So, the proposed system outperformed other models. Furthermore, Statistical analysis by ANOVA test indicates the viability of the proposed system. In addition, the proposed system is domain-independent, so it has the ability to be applied to various computer vision tasks.

Table 10. Comparison with other studies on breast cancer detection (n = normal, ab = abnormal, Ea = Early, Ac = Acute).

Ref.	Segmentation method	#patients / Thermograms	Classification Method	Results
[8]	an enhanced segmentation method based on both Neutrosophic sets (NS) and optimized Fast Fuzzy c-mean (F-FCM) algorithm.	63 thermograms (29 N / 34 AB)	SVM Classifier	Accuracy = 92.06%
				Precision = 87.5%
				Recall = 96.55%
[23]	Manual	40 thermograms (26 N / 14 AB)	SVM, Naïve Bayes and KNN classifier	Accuracy = 92.5% and Sensitivity = 78.6% with KNN
				Accuracy = 85% and Sensitivity = 85.7% with SVM
				Accuracy = 80% and Sensitivity = 85.7% with Naïve Bayes
[24]	Manual	68 thermograms (26 Ea / 42 Ac)	DT, KNN, SVM and SVM-RBF	Accuracy = 95.59%,
[24]	Manual	68 thermograms (26 Ea / 42 Ac)	DT, KNN, SVM and SVM-RBF	Sensitivity = 96% and Specificity = 95.35% with SVM-RBF
[25]	Canny edge detection methods followed by gradient operators and Hough transform for boundary detection	Thermograms of 22 women (11 N / 11AB)	SVM Classifier	Accuracy = 90.91%,
				Sensitivity = 81.82%
				Specificity = 100%
[26]	Otsu’s threshold to remove background followed by a reconstruction technique.	306 thermograms (183 N / 123 AB)	Feed-forward artificial neural network with gradient decent	Accuracy = 90.48%,
				Sensitivity = 87.6%,
				Specificity = 89.73%
[27]	Manual	600 thermograms (300 N / 300 AB)	SVM-C	Accuracy = 93.5%,
				Sensitivity = 93%,
				Specificity = 94%
[30]	Not defined	282 thermograms (147 N / 135 AB)	CNN using transfer learning	Accuracy = 94.3%
				Precision = 94.7%
				Recall = 93.3%
[32]	Projection profile analysis	140 patients (98 N / 32 AB)	Convolutional Neural Networks optimized by Bayes algorithm	Accuracy = 98.95%
Proposed method	U-Net network	1000 thermograms (500 N / 500 AB)	Two-class CNN-based deep learning model	Accuracy = 99.33%,
				Sensitivity = 100%,
				Specificity = 98.67%

Open in a new tab

It is worth mention that the study has some limitations: the computation time of the segmentation process is high due to the limitation of the PC capabilities used in this study as well as the proposed deep learning model for classification has a bad behavior when the learning rate is greater than 3.0 e−03

6. Conclusion

Breast cancer is one of the most commonly diagnosed malignancies in women around the world. Several researches have worked on breast cancer segmentation and classification using variety of imaging techniques. Thermography imaging is an effective diagnostic approach which is used for breast cancer detection with the help of infrared technology. In this paper, we propose a fully automatic breast cancer detection system. The proposed method is divided on three main stages. First, the thermal images are resized to a smaller size to accelerate computation. Second, the breast region is extracted automatically by using U-Net network. Third, a deep learning model based two-class CNN is proposed and trained from scratch for the classification of normal and abnormal breast tissue.

Based on the experimental results, the proposed model achieved accuracy = 99.33%, sensitivity = 100% and specificity = 98.67%. In Table 10, a comparison between the proposed system and other studies based on breast area segmentation and breast cancer detection is performed. Furthermore, Statistical analysis by ANOVA test indicates the viability of the proposed system. In addition, the proposed system is domain-independent, so it has the ability to be applied to various computer vision tasks. In future study, we will investigate deep learning models which can highlight and label defect region using thermal images.

List of abbreviations

Table 11 presents the definition of the abbreviations used in this paper.

Table 11. Table of abbreviation.

Abbreviation	Definition
CNN	Convolutional Neural Networks
CAD	Computer-Aided Detection
ROI	Region of Interest
EHMM	Extended Hidden Markov Models
NS	Neutrosophic Sets
F-FCM	Fast Fuzzy C-Mean
SVM	Support Vector Machine
KNN	K-Nearest Neighbor
NN	Naïve Bayes
DT	Decision Tree
PCA	Principal Component Analysis
DNN	Deep Neural Network
DWNN	Deep-Wavelet Neural Networks
RELU	Rectified Linear Activation Function
ADAM	Adaptive Moment Estimation
SGDM	Stochastic Gradient Descent with Momentum
RMSprop	Root Mean Square propagation
T _P	True Positive
T _N	True Negative
F _P	False Positive
F _N	False Negative
GLCM	Gray Level Co-occurrence Matrices
HOG	Histogram of Oriented Gradients
n	normal
ab	abnormal

Open in a new tab

Supporting information

S1 Table. Evaluation metrics of the proposed system over different dataset sizes.

(XLSX)

Click here for additional data file.^{(14.5KB, xlsx)}

Acknowledgments

The authors would like to thank the Department of Computer Science and the Hospital of the Federal University Fluminense, Niterói, Brazil, for providing DMR-IR benchmark database which is accessible through an online user-friendly interface (http://visual.ic.uff.br/dmi) and used for experiments.

Data Availability

All relevant data are within the paper and its Supporting Information files.

Funding Statement

The author(s) received no specific funding for this work.

References

1.Ali M, Sayed G, Gaber T, Hassanien A, Snasel V and Silva L. Detection of breast abnormalities of thermograms based on a new segmentation method. In: Proceedings of the Federated Conference on Computer Science and Information Systems; Łódź, Poland. 2015.
2.Globocan 2018. All Cancer. International Agency for Research on Cancer WHO (2019), [Online]. Available: http://gco.iarc.fr/today/data/factsheets/cancers/39-All-cancers-fact-sheet.pdf. [Accessed 03 March 2019].
3.Tahoun M, Almazroi AA, Alqami MA, Gaber T, Mahmoud EE, Eltoukhy MM. A grey wolf-based method for mammographic mass classification. Applied Sciences. 2020. Jan;10(23):8422. [Google Scholar]
4.Augusto A, Figueiredo A, Do Nascimento J, Malheiros F, Da Silva Ignacio L, Fernandes H, et al. Breast tumor localization using skin surface temperatures from a 2d anatomic model without knowledge of the thermophysical properties. COMPUT METH PROG BIO. 2019; 172: 65–77. [DOI] [PubMed] [Google Scholar]
5.Hossam A, Harb H and Kader H. Automatic image segmentation method for breast cancer analysis using thermography. JES. 2018; 46: 12–32. [Google Scholar]
6.Dongola N. Mammography in Breast Cancer. MedScape. 2018. [Google Scholar]
7.Gaber T and Hassanien, AE. Digital mammography image analysis system based on mathematical morphology. In: IEEE computer society 7th Iternational Conference On Intelligent Engineering Systems INES; Assiut-Luxor, Egypt. 2003.
8.Gaber T, Ismail G, Anter A, Soliman M., Ali N, Hassanien A, et al. Thermogram breast cancer prediction approach based on neutrosophic sets and fuzzy c-means algorithm. In: 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC); Milano, Italy. 2015. [DOI] [PubMed]
9.Yao X, Wei W, Li J, Wang L, Xu Z, Wan Y, et al. A comparison of mammography, ultrasonography, and far infrared thermography with pathological results in screening and early diagnosis of breast cancer Asian Biomed. 2014; 8: 11–19. [Google Scholar]
10.Loberg M, Lousda M, Bretthauer M and Kalager M. Benefits and harms of mammography screening. Breast Cancer Res. 2015; 17 63. doi: 10.1186/s13058-015-0525-z [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Zuluaga-Gomez J, Zerhouni N, Masry Z, Devalland C and Varnier C. A survey of breast cancer screening techniques: thermography and electrical impedance tomography. J. Med. Eng. Technol. 2019; 43: 305–22. doi: 10.1080/03091902.2019.1664672 [DOI] [PubMed] [Google Scholar]
12.Fernandes S, Kadry S and Rajinikanth V. A hybrid framework to evaluate breast abnormality using infrared thermal images. IEEE Consumer Electronics Magazine. 2019; 8: 31–36. [Google Scholar]
13.Walker D and Kaczor T. Breast thermography: History, theory, and use. Natural Medicine Journal. 2012; 4. [Google Scholar]
14.Hankare P, Shah K, Nair D and Nair D. Breast cancer detection using thermography. IRJET 3. 2016; 1061–64. [Google Scholar]
15.AlFayez F, El-Soud MW, Gaber T. Thermogram Breast Cancer Detection: a comparative study of two machine learning techniques. Applied Sciences. 2020. Jan;10(2):551 [Google Scholar]
16.Wehle H. Machine learning, deep learning, and AI: What’s the difference? [Online]. 2017. Available: https://www.researchgate.net/publication/31890021_Machine_Learning_Deep_Learning_and_AI_What’s_the_Difference(2007). [Accessed July 2017].
17.Zuluaga-Gomez J, Masry Z, Benaggoune K, Merag S and Zerhouni N. A CNN-based methodology for breast cancer diagnosis using thermal images. arXiv:1910.13757v1. 2019. [Google Scholar]
18.Kundu R, Das R, Geem Z and Han G-T. Pneumonia detection in chest X-ray images using an ensemble of deep learning models. Plos One. 2021; 16 (9): e0256630. doi: 10.1371/journal.pone.0256630 [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Silva L, Saade D, Sequeiros G, Silva A, Paiva A, Bravo R, et al. A new database for breast research with infrared image. J. Med. Imaging & Health Infor. 2014; 4: 92–100. [Google Scholar]
20.Mahmoudzadeh E, Montazeri M, Zekri M and Sadri S. Extended hidden markov model for optimized segmentation of breast thermography images. Infrared Phys. Technol. 2015; 72: 19–28. [Google Scholar]
21.Arau´jo M, Lima R and Souza R. Interval symbolic feature extraction for thermography breast cancer detection. Expert Syst. Appl. 2014; 41: 6728–37. [Google Scholar]
22.De Santana M, Pereira J, Da Silva F, De Lima N, De Sousa F, De Arruda G, et al. Breast cancer diagnosis based on mammary thermography and extreme learning machines. Res. Biomed. Eng. 2018; 34: 45–53. [Google Scholar]
23.Milosevic M, Jankovic D and Peulic A. Thermography based breast cancer detection using texture features and minimum variance quantization. Excli J. 2014; 13: 1204–15. [PMC free article] [PubMed] [Google Scholar]
24.Dey N, Rajinikanth V and Hassanien A. E. An Examination System to Classify the Breast Thermal Images into Early/Acute DCIS Class. In: Proceedings of International Conference on Data Science and Applications; Singapore. 2021.
25.Francis S, Sasikala M and Saranya S. Detection of breast abnormality from thermograms using curvelet transform based feature extraction J. Med. Syst. 2014; 38 (23). doi: 10.1007/s10916-014-0023-3 [DOI] [PubMed] [Google Scholar]
26.Pramanik S, Bhattacharjee D and Nasipuri M. Wavelet based thermogram analysis for breast cancer detection. In: Proceedings of the 2015 International Symposium on Advanced Computing and Communication, ISACC 2015; Silchar, India. 2016.
27.Rajinikanth V, Kadry S, Taniar D, Damaševičius R and Rauf H T. Breast-Cancer Detection using Thermal Images with Marine-Predators-Algorithm Selected Features. In: 2021 Seventh International conference on Bio Signals, Images, and Instrumentation (ICBSII); Chennai, India. 2021.
28.Mambou S, Maresova P, Krejcar O, Selamat A and Kuca K. Breast cancer detection using infrared thermal imaging and a deep learning model. Sensors. 2018; 18: 2799. doi: 10.3390/s18092799 [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, et al. Going deeper with convolutions. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR);Boston, USA. 2015.
30.Cabıoğlu Ç and Oğul H. Computer-aided breast cancer diagnosis from thermal images using transfer learning. In: 8th International Work-Conference on Bioinformatics and Biomedical Engineering (IWBBIO 2020). 2020.
31.de Freitas Barbosa V, de Santana M, Andrade M, de Lima R and dos Santos W. Deep-wavelet neural networks for breast cancer early diagnosis using mammary termographies. Deep Learning for Data Analytics: Foundations, Biomedical Applications and Challenges. Elsevier. 2020; 99–124. [Google Scholar]
32.Ekici S and Jawzal H. Breast cancer diagnosis using thermography and convolutional neural networks. J. Med. Hypotheses. 2020; 137: 109542. doi: 10.1016/j.mehy.2019.109542 [DOI] [PubMed] [Google Scholar]
33.Ronnebergerx O, Fischer P and Brox T. U-Net: Convolutional Networks for Biomedical Image Segmentation. In: Medical Image Computing and Computer-Assisted Intervention (MICCAI), Springer. 2015; 9351: 234–241.
34.Irfan R, Almazroi A, Rauf H, Damaševiˇcius R, Nasr E and Abdelgawad A. Dilated Semantic Segmentation for Breast UltrasonicLesion Detection Using Parallel Feature Fusion. Diagnostics.2021; 11: 1212. doi: 10.3390/diagnostics11071212 [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Tong G, Li Y, Chen H, Zhang Q and Jiang H. Improved U-NET network for pulmonary nodules segmentation. u81rgvsOptik— International Journal for Light and Electron Optics. 2018; 174: 460–469. [Google Scholar]
36.Schaul T, Zhang S and LeCun Y. No more pesky learning rates. arXiv:1206.1106v2. 2012. [Google Scholar]
37.Kingma D and Ba J. Adam: A method for stochastic optimization. arXiv:1412.6980. 2015. [Google Scholar]
38.Ruder S. An overview of gradient descent optimization algorithms. arXiv:1609.04747v2. 2017. [Google Scholar]
39.Zejmo M, Kowal M, Korbicz J and Monczak R. Classification of breast cancer cytological specimen using convolutional neural network. J. Phys. Conf. Ser. 2016; 783: 012060. [Google Scholar]
40.Rajinikanth V, Kadry S, Damaševičius R, Taniar D and Rauf H T. Machine-Learning-Scheme to Detect Choroidal-Neovascularization in Retinal OCT Image. In: 2021 Seventh International conference on Bio Signals, Images, and Instrumentation (ICBSII); Chennai, India. 2021.
41.Abdel-Nasser M, Moreno A and Puig D. Breast Cancer Detection in Thermal Infrared Images Using Representation Learning and Texture Analysis Methods. Electronics. 2019; 8: 1. [Google Scholar]
42.Cuevas A, Febrero M, and Fraiman R. An anova test for functional data. Computational Statistics & Data, 2004; 47: 111–122. [Google Scholar]

PLoS One. doi: 10.1371/journal.pone.0262349.r001

Decision Letter 0

Robertas Damaševičius

29 Sep 2021

PONE-D-21-28729Deep Learning Model for Fully Automated Detection of Abnormal Breast Tissues from ThermogramsPLOS ONE

Dear Dr. A. Mohamed,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Specifically, we authors need to improve the description of their methodology and the presentation of the results, which must include statistical analysis.

Please submit your revised manuscript by Nov 13 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Robertas Damaševičius

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. We suggest you thoroughly copyedit your manuscript for language usage, spelling, and grammar. If you do not know anyone who can help you do this, you may wish to consider employing a professional scientific editing service.

Whilst you may use any professional scientific editing service of your choice, PLOS has partnered with both American Journal Experts (AJE) and Editage to provide discounted services to PLOS authors. Both organizations have experience helping authors meet PLOS guidelines and can provide language editing, translation, manuscript formatting, and figure formatting to ensure your manuscript meets our submission guidelines. To take advantage of our partnership with AJE, visit the AJE website (http://learn.aje.com/plos/) for a 15% discount off AJE services. To take advantage of our partnership with Editage, visit the Editage website (www.editage.com) and enter referral code PLOSEDIT for a 15% discount off Editage services. If the PLOS editorial team finds any language issues in text that either AJE or Editage has edited, the service provider will re-edit the text for free.

Upon resubmission, please provide the following:

The name of the colleague or the details of the professional service that edited your manuscript

A copy of your manuscript showing your changes by either highlighting them or using track changes (uploaded as a *supporting information* file)

A clean copy of the edited manuscript (uploaded as the new *manuscript* file)”

3. We note that Figure 1 in your submission contain copyrighted images. All PLOS content is published under the Creative Commons Attribution License (CC BY 4.0), which means that the manuscript, images, and Supporting Information files will be freely available online, and any third party is permitted to access, download, copy, distribute, and use these materials in any way, even commercially, with proper attribution. For more information, see our copyright guidelines: http://journals.plos.org/plosone/s/licenses-and-copyright.

We require you to either (1) present written permission from the copyright holder to publish these figures specifically under the CC BY 4.0 license, or (2) remove the figures from your submission:

a. You may seek permission from the original copyright holder of Figure 1 to publish the content specifically under the CC BY 4.0 license.

We recommend that you contact the original copyright holder with the Content Permission Form (http://journals.plos.org/plosone/s/file?id=7c09/content-permission-form.pdf) and the following text:

“I request permission for the open-access journal PLOS ONE to publish XXX under the Creative Commons Attribution License (CCAL) CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). Please be aware that this license allows unrestricted use and distribution, even commercially, by third parties. Please reply and provide explicit written permission to publish XXX under a CC BY license and complete the attached form.”

Please upload the completed Content Permission Form or other proof of granted permissions as an "Other" file with your submission.

In the figure caption of the copyrighted figure, please include the following text: “Reprinted from [ref] under a CC BY license, with permission from [name of publisher], original copyright [original copyright year].”

b. If you are unable to obtain permission from the original copyright holder to publish these figures under the CC BY 4.0 license or if the copyright holder’s requirements are incompatible with the CC BY 4.0 license, please either i) remove the figure or ii) supply a replacement figure that complies with the CC BY 4.0 license. Please check copyright information on all replacement figures and update the figure caption with source information. If applicable, please specify in the figure caption text when a figure is similar but not identical to the original image and is therefore for illustrative purposes only.

Additional Editor Comments:

Revise the article following the reviewer comments.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: No

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: No

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The manuscript sounds technically poor, I have following concerns should be addressed before any decision. The paper currently need revision.

1. The existing literature should be classified and systematically reviewed, instead of being independently introduced one-by-one.

2. The abstract is too general and not prepared objectively. It should briefly highlight the paper's novelty as what is the main problem, how has it been resolved and where the novelty lies?

3. The 'conclusions' are a key component of the paper. It should complement the 'abstract' and normally used by experts to value the paper's engineering content. In general, it should sum up the most important outcomes of the paper. It should simply provide critical facts and figures achieved in this paper for supporting the claims.

4. For better readability, the authors may expand the abbreviations at every first occurrence.

5. The author should provide only relevant information related to this paper and reserve more space for the proposed framework.

6. However, the author should compare the proposed algorithm with other recent works or provide a discussion. Otherwise, it's hard for the reader to identify the novelty and contribution of this work.

7. The descriptions given in this proposed scheme are not sufficient that this manuscript only adopted a variety of existing methods to complete the experiment where there are no strong hypothesis and methodical theoretical arguments. Therefore, the reviewer considers that this paper needs more works.

8. Key contribution and novelty has not been detailed in manuscript. Please include it in the introduction section

9. What are the limitations of the related works

10. Are there any limitations of this carried out study?

11. How to select and optimize the user-defined parameters in the proposed model?

12. There are quite a few abbreviations are used in the manuscript. It is suggested to use a table to host all the frequently used abbreviations with their descriptions to improve the readability

13. Explain the evaluation metrics and justify why those evaluation metrics are used?

14. Some sentences are too long to follow, it is suggested that to break them down into short but meaningful ones to make the manuscript readable.

15. The title is pretty deceptive and does not address the problem completely.

16. Every time a method/formula is used for something, it needs to be justified by either (a) prior work showing the superiority of this method, or (b) by your experiments showing its advantage over prior work methods - comparison is needed, or (c) formal proof of optimality. Please consider more prior works.

17. The data is not described. Proper data description should contain the number of data items, number of parameters, distribution analysis of parameters, and of the target parameter itself for classification.

18. The related works section is very short and no benefits from it. I suggest increasing the number of studies and add a new discussion there to show the advantage. Following studies can be considered

a. Breast-Cancer Detection using Thermal Images with Marine-Predators-Algorithm Selected Features.

b. Dilated Semantic Segmentation for Breast Ultrasonic Lesion Detection Using Parallel Feature Fusion.

c. Machine-Learning-Scheme to Detect Choroidal-Neovascularization in Retinal OCT Image

19. Use Anova test to record the significant difference between performance of the proposed and existing methods.

Reviewer #2: Dear Authors,

This work employs U-Net model to extract and evaluate the abnormal section from the Breast Thermal Images.

I request you to consider the following suggestions:

1. The image size is fixed as 228x228, but, the initial input dimension in UNet is depicted as 572x572. Please check and correct.

2. The quality of the image is to be improved in every image cases.

3. What is the need for extracting the breast section, please justify. Further, include, how the CNN scheme is trained to segment the image (a clear discription is needed).

4. The image dimension, such as test image, Ground truth and segmented image is not looks like 228x228. Further in this database, the Ground Truth is not available. Please discuss, how the GT is generated (Hope, the test images are from :http://visual.ic.uff.br/dmi/).

5. Please improve the reference section by considering few more related works.

(Please refer, the following research works:

a. Breast-Cancer Detection using Thermal Images with Marine-Predators-Algorithm Selected Features

b. An Examination System to Classify the Breast Thermal Images into Early/Acute DCIS Class

c. A hybrid framework to evaluate breast abnormality using infrared thermal images

6. I request you to compare the outcome of the proposed scheme with another results existing in the literature.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2022 Jan 14;17(1):e0262349. doi: 10.1371/journal.pone.0262349.r002

Author response to Decision Letter 0

12 Nov 2021

Original Manuscript ID: PONE-D-21-28729

Original Article Title: “Deep Learning Model for Fully Automated Detection of Abnormal Breast Tissues from Thermograms “

Dear Editor,

Thank you for allowing a resubmission of a revised version of the manuscript, with an opportunity to address the reviewers’ comments.

We are uploading (a) our point-by-point response to the comments (below) (response to reviewers), (b) an updated manuscript with yellow highlighting indicating changes, and (c) a clean updated manuscript without highlights

Best regards,

Authors,

Reviewer#1, Concern # 1: The existing literature should be classified and systematically reviewed, instead of being independently introduced one-by-one.

The literature is classified systematically as recommended

Reviewer#1, Concern # 2: The abstract is too general and not prepared objectively. It should briefly highlight the paper's novelty as what is the main problem, how has it been resolved and where the novelty lies?

The abstract is edited as recommended.

Reviewer#1, Concern # 3: The 'conclusions' are a key component of the paper. It should complement the 'abstract' and normally used by experts to value the paper's engineering content. In general, it should sum up the most important outcomes of the paper. It should simply provide critical facts and figures achieved in this paper for supporting the claims.

A conclusion section is added to the manuscript with your instructions.

Reviewer#1, Concern # 4: For better readability, the authors may expand the abbreviations at every first occurrence.

The manuscript is edited with abbreviation definition at every first occurrence.

Reviewer#1, Concern # 5: The author should provide only relevant information related to this paper and reserve more space for the proposed framework.

Information that is not related to this paper were removed. See the update in the Revised Manuscript with Track Changes page 16.

Reviewer#1, Concern # 6: However, the author should compare the proposed algorithm with other recent works or provide a discussion. Otherwise, it's hard for the reader to identify the novelty and contribution of this work.

A comparison between the proposed algorithm and other works is already exist in table 10. Also, we update this table with recent works. See the update in the Revised Manuscript with Track Changes page 18-19. From this table, we can note that the dataset used by our proposed system is large compared with the dataset of some related work. Also, our system extracts the breast area automatically by using U-Net network, but some related work doesn't used segmentation method and other extract it manually. In addition, the evaluation metric of our proposed system is better than related work. Furthermore, Statistical analysis by ANOVA test (see table 9) indicates the viability of the proposed system. In addition, the proposed system is domain-independent, so it has the ability to be applied to various computer vision tasks.

Reviewer#1, Concern # 7: The descriptions given in this proposed scheme are not sufficient that this manuscript only adopted a variety of existing methods to complete the experiment where there are no strong hypothesis and methodical theoretical arguments. Therefore, the reviewer considers that this paper needs more works.

To automate and improve the accuracy of thermography systems, we designed a deep learning-based system which integrates U-Net network and a proposed deep learning model. The proposed system is a combination two important methods: U-Net network and a two-class CNN-based deep learning model. First, U-Net is a convolutional network architecture which proved very strong in biomedical segmentation and very fast compared with other methods [29]. U-Net is used in our system to automatically extract and isolate the breast area from other parts of the body which act as noise in the detection system. Second, the two-class CNN-based deep learning model is trained from scratch to extract more characteristics from the dataset that is helpful in training the network and improve the efficiency of the classification process. The novelty of the proposed system lays in using U-Net network for automating the segmentation process and building a deep learning model which use the output of U-Net to classify the given thermogram. The combination between U-Net and our proposed deep learning model proved to be effective as it achieved accuracy= 99.33%, sensitivity=100% and specificity=98.67%

Reviewer#1, Concern # 8: Key contribution and novelty has not been detailed in manuscript. Please include it in the introduction section.

We add the main contribution of this paper in the introduction section. To quickly and easily check this update, it is in page 4 of Revised Manuscript with Track Changes, it is also copied below:

The main contribution of this paper is as following:

1-Extracting and isolating the breast area automatically from other parts of thermal images by using CNN (U-Net).

2- Proposing a deep learning model for the classification of normal and abnormal breast tissues from thermograms

3-Evaluating the performance of the proposed model using accuracy, sensitivity and specificity.

4-Comparing the proposed model with state-of art methods.

Reviewer#1, Concern # 9: What are the limitations of the related works?

Limitations of the related work are added in the literature review section.

Reviewer#1, Concern # 10: Are there any limitations of this carried out study?

The computation time of the segmentation process is high due to the limitation of the PC capabilities used in this study.

The proposed deep learning model for classification has a bad behavior when the learning rate is greater than 3.0 e−03

Reviewer#1, Concern # 11: How to select and optimize the user-defined parameters in the proposed model?

There is no general rule for it. Selecting the optimum number of the user-defined parameters is dependent on the performance of the model and the result of the evaluation metrics.

Reviewer#1, Concern # 12: There are quite a few abbreviations are used in the manuscript. It is suggested to use a table to host all the frequently used abbreviations with their descriptions to improve the readability

Thank you for your comment. We add a table of abbreviation to the manuscript. To quickly and easily check this table, it is copied below:

Abbreviation Definition

CNN Convolutional Neural Networks

CAD Computer-Aided Detection

ROI Region of Interest

EHMM Extended Hidden Markov Models

NS Neutrosophic Sets

F-FCM Fast Fuzzy C-Mean

SVM Support Vector Machine

KNN K-Nearest Neighbor

NN Naïve Bayes

PCA Principal Component Analysis

DNN Deep Neural Network

DWNN Deep-Wavelet Neural Networks

RELU Rectified Linear Activation Function

ADAM Adaptive Moment Estimation

SGDM Stochastic Gradient Descent with Momentum

RMSprop Root Mean Square propagation

TP True Positive

TN True Negative

FP False Positive

FN False Negative

GLCM Gray Level Co-occurrence Matrices

HOG Histogram of Oriented Gradients

n normal

ab abnormal

Reviewer#1, Concern # 13: Explain the evaluation metrics and justify why those evaluation metrics are used?

We edit evaluation the deep learning model section as following. To quickly and easily check this update in the Revised Manuscript with Track Changes page 11. Also, it is copied below:

Classification Metrics evaluate the performance of the model and measure how good or bad the classification is.

Accuracy: represents how many instances are completely classified correctly. It is calculated by dividing the total number of predictions by the number of right predictions. It is calculated by Eq. (3)

Accuracy=(T_P+T_N)/(T_P+T_N+F_P+F_N ) (3)

Sensitivity: is calculated based on how many patients have the disease are correctly estimated. It is calculated by Eq. (4)

Sensitivity=T_P/(T_P+F_N ) (4)

Specificity: is calculated based on how many patients do not have the disease are predicted right. It is calculated by Eq. (5)

Specificity=T_N/(T_N+F_P ) (5)

•True Positive (TP) refers to a positive-class sample that has been successfully classified by a model.

• False Positive (FP) refers to a sample that should have been classed as negative but was instead classified as positive.

• True Negative (TN) refers to a negative-class sample that has been successfully classified by a model.

• False Negative (FN) refers to a sample that should have been classed as positive but was instead classified as negative.

The accuracy metric indicates how many of the model's predictions were right. However, if the dataset is unbalanced, a model's high accuracy rate does not guarantee its ability to differentiate distinct classes equally. In specifically, in the classification of medical images, it is necessary to develop a model with the ability be applied to all classes. In cases, sensitivity and specificity should be used to provide information about the performance of the model. Sensitivity measures the percentage of patient have the disease that the proposed model correctly predicted. Specificity measures the percentage of patient do not have the disease and correctly estimated by the proposed model. These two evaluation metrics measure the ability of the model to decrease FN and FP predictions.

Reviewer#1, Concern # 14: Some sentences are too long to follow, it is suggested that to break them down into short but meaningful ones to make the manuscript readable

We try to break some sentences down into short as much as we can.

Reviewer#1, Concern # 15: The title is pretty deceptive and does not address the problem completely.

The title of the article is changed into " Deep Learning Model for Fully Automated Breast Cancer Detection System from Thermograms"

Reviewer#1, Concern # 16: Every time a method/formula is used for something, it needs to be justified by either (a) prior work showing the superiority of this method, or (b) by your experiments showing its advantage over prior work methods - comparison is needed, or (c) formal proof of optimality. Please consider more prior works.

The limitations of the prior work are added to the literature review section. To quickly and easily check update in the Revised Manuscript with Track Changes page 7. Also, it is copied below:

From the discussed related work above, it could be remarked that the prior work has some limitations such as:

(1) some related work used a small number of the dataset as in [6, 20].

(2) some related work did not consider segmentation of the breast area before classification such as in [27] or extract the breast area manually such as in [20, 21].

(3) some segmentation models such as in [1] removed parts of the breast.

(4) some work has been evaluated by only calculating the accuracy metric only such as in [29]. However, the high accuracy rate of a model does not ensure its ability to distinguish different classes equally if the dataset is unbalanced [39].

Also, more recent prior works are added to the literature review section and compared with the proposed method in table 10.

Reviewer#1, Concern # 17: The data is not described. Proper data description should contain the number of data items, number of parameters, distribution analysis of parameters, and of the target parameter itself for classification.

A data description table is added to the manuscript as table1. To quickly and easily check this table, it is copied below:

Dataset categories Dimension Training Validation Testing Total

Normal 640x480 350 75 75 500

Abnormal 640x480 350 75 75 500

Reviewer#1, Concern # 18: The related works section is very short and no benefits from it. I suggest increasing the number of studies and add a new discussion there to show the advantage. Following studies can be considered

a. Breast-Cancer Detection using Thermal Images with Marine-Predators-Algorithm Selected Features.

b. Dilated Semantic Segmentation for Breast Ultrasonic Lesion Detection Using Parallel Feature Fusion.

c. Machine-Learning-Scheme to Detect Choroidal-Neovascularization in Retinal OCT Image

The recommended studies are cited in the manuscript as following:

a�23- Rajinikanth V, Kadry S, Taniar D, Damaševičius R and Rauf H T. Breast-Cancer Detection using Thermal Images with Marine-Predators-Algorithm Selected Features. In: 2021 Seventh International conference on Bio Signals, Images, and Instrumentation (ICBSII); Chennai, India. 2021.

b�30- Irfan R, Almazroi A, Rauf H, Damaševiˇcius R, Nasr E and Abdelgawad A. Dilated Semantic Segmentation for Breast UltrasonicLesion Detection Using Parallel Feature Fusion. Diagnostics.2021; 11: 1212.

c� 36- Rajinikanth V, Kadry S, Damaševičius R, Taniar D and Rauf H T. Machine-Learning-Scheme to Detect Choroidal-Neovascularization in Retinal OCT Image. In: 2021 Seventh International conference on Bio Signals, Images, and Instrumentation (ICBSII); Chennai, India. 2021.

Reviewer#1, Concern #19: Use Anova test to record the significant difference between performance of the proposed and existing methods.

We add statistical analysis section to the manuscript. To quickly and easily check this section in the Revised Manuscript with Track Changes page 16. Also, it is copied below:

4.7 Statistical Analysis

To analyze the evaluation of the proposed system statistically, we perform the analysis of variance (ANOVA) test, where the proposed system is compared with ResNet18, GoogleNet and VGG16 networks. The result of the ANOVA test is shown in table 9. To reject the null hypothesis, the p-value in the ANOVA test should be less than 0.05. According to Table 9, the p−value is less than 0.05. So, the null hypothesis was rejected by the results of ANOVA test.

Results of the ANOVA test of the proposed model and CNN models

Model P-value

ResNet18 0.0423

GoogleNet 0.0173

VGG16 0.0023

Reviewer#2, Concern # 1: The image size is fixed as 228x228, but, the initial input dimension in UNet is depicted as 572x572. Please check and correct.

Thank you for comment. But, if you define the network, then you can change the input dimension in the input layer to your desired one.

Reviewer#2, Concern # 2: The quality of the image is to be improved in every image cases.

Thank you for your comment. The quality of the image is improved

Reviewer#2, Concern # 3: What is the need for extracting the breast section, please justify. Further, include, how the CNN scheme is trained to segment the image (a clear discription is needed).

We add the answer of this question in Breast area segmentation using deep learning (CNN) section. To quickly and easily check this section in the Revised Manuscript with Track Changes page 8. Also, it is copied below:

The thermal image contains unnecessary areas as neck, shoulder, chess and other parts of the body which acts as noise during the training in CNN models. The aim of extracting the breast section is removing unwanted regions and using the areas destined to have cancer as the input of the CNN model for training and testing.

In U-Net, the initial series of convolutional layers are combined with max pooling layers to decrease the resolution of the input image. Then, these layers are followed by a series of convolutional layers combined with upsampling operators in order, so the resolution of the input image is increased. Combining these two paths produces a U-shaped graph that can used to perform image segmentation. For breast area segmentation with U-Net, Adaptive Moment Estimation (ADAM) method was used as optimized algorithm with number of epochs = 30. In addition, the training process was started with initial learning rate = 1.0e−3. The learning rate used a piecewise schedule and dropped by a factor of 0.3 every 10 epochs to allow the network to train quickly with a higher initial learning rate. The network trained with an 8-batch size to save memory.

Reviewer#2, Concern # 4: The image dimension, such as test image, Ground truth and segmented image is not looks like 228x228. Further in this database, the Ground Truth is not available. Please discuss, how the GT is generated (Hope, the test images are from:http://visual.ic.uff.br/dmi/).

Thank you for your comment. All images are resized to 228x228. Ground Truth images is not available for general in the site, we get it by personal communication. The test images are from the site.

Reviewer#2, Concern # 5 Please improve the reference section by considering few more related works.(Please refer, the following research works:

a. Breast-Cancer Detection using Thermal Images with Marine-Predators-Algorithm Selected Features

b. An Examination System to Classify the Breast Thermal Images into Early/Acute DCIS Class

c. A hybrid framework to evaluate breast abnormality using infrared thermal images

The recommended studies are cited in the manuscript as following:

a->23- Rajinikanth V, Kadry S, Taniar D, Damaševičius R and Rauf H T. Breast-Cancer Detection using Thermal Images with Marine-Predators-Algorithm Selected Features. In: 2021 Seventh International conference on Bio Signals, Images, and Instrumentation (ICBSII); Chennai, India. 2021.

b->20- Dey N, Rajinikanth V and Hassanien A. E. An Examination System to Classify the Breast Thermal Images into Early/Acute DCIS Class. In: Proceedings of International Conference on Data Science and Applications; Singapore. 2021.

c->10- Fernandes S, Kadry S and Rajinikanth V. A hybrid framework to evaluate breast abnormality using infrared thermal images. IEEE Consumer Electronics Magazine. 2019; 8: 31-36.

Reviewer#2, Concern # 6 I request you to compare the outcome of the proposed scheme with another results existing in the literature.

Thank you for your comment. This comparison is already existing in Table 10. Also, we compare the proposed scheme with the related work considered in comment 5 in the same table.

Attachment

Submitted filename: Response to Reviewers.docx

Click here for additional data file.^{(34.2KB, docx)}

PLoS One. doi: 10.1371/journal.pone.0262349.r003

Decision Letter 1

Robertas Damaševičius

23 Nov 2021

PONE-D-21-28729R1Deep Learning Model for Fully Automated Breast Cancer Detection System from ThermogramsPLOS ONE

Dear Dr. A. Mohamed,

Specifically, you need to address these concerns in the revised version:

- Improve the quality of all figures before submitting final version.

- Improve Fig. 10. The data is not discernable. Range Y-axis to [80,100]. Add the units of measurement (percentages).

- Include the data availability statement and the acknowledgement for ". " ext-link-type="uri" xlink:type="simple">http://visual.ic.uff.br/dmi/".

Please submit your revised manuscript by Jan 07 2022 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

We look forward to receiving your revised manuscript.

Kind regards,

Robertas Damaševičius

Academic Editor

PLOS ONE

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

Additional Editor Comments:

Improve the quality of all figures before submitting final version.

Improve Fig. 10. The data is not discernable. Range Y-axis to [80,100]. Add the units of measurement (percentages).

Include the data availability statement and the acknowledgement for "" ext-link-type="uri" xlink:type="simple">http://visual.ic.uff.br/dmi/".

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: N/A

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Reviewer #1: The paper is well revised an can be accepted. However, please improve the quality of all figures before submitting final version.

Reviewer #2: Dear Authors,

The revised version of the paper is good.

If possible, include the data availability of include the acknowledgement for "" ext-link-type="uri" xlink:type="simple">http://visual.ic.uff.br/dmi/".

Thank you.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

PLoS One. 2022 Jan 14;17(1):e0262349. doi: 10.1371/journal.pone.0262349.r004

Author response to Decision Letter 1

21 Dec 2021

Original Manuscript ID: PONE-D-21-28729R1

Original Article Title: “Deep Learning Model for Fully Automated Breast Cancer Detection System from Thermograms “

To: PLOS ONE

Re: Response to reviewers

Dear Editor,

Thank you for allowing a resubmission of a revised version of the manuscript, with an opportunity to address the reviewers’ comments.

Best regards,

Authors,

Improve the quality of all figures before submitting final version.

Thank you for your comment. The quality of figures is improved

Improve Fig. 10. The data is not discernable. Range Y-axis to [80,100]. Add the units of measurement (percentages).

Thank you for your comment. Figure 10 is improved as per your instruction.

Include the data availability statement and the acknowledgement for "http://visual.ic.uff.br/dmi/".

Thank you for your comment. The acknowledgement for (http://visual.ic.uff.br/dmi) is added to the paper. To quickly and easily check this, it is in page 20 of Revised Manuscript with Track Changes, it is also copied below:

Acknowledgement

The authors would like to thank the Department of Computer Science and the Hospital of the Federal University Fluminense, Niterói, Brazil, for providing DMR-IR benchmark database which is accessible through an online user-friendly interface (http://visual.ic.uff.br/dmi) and used for experiments.

Attachment

Submitted filename: Response to Reviewers.docx

Click here for additional data file.^{(20.6KB, docx)}

PLoS One. doi: 10.1371/journal.pone.0262349.r005

Decision Letter 2

Robertas Damaševičius

22 Dec 2021

Deep Learning Model for Fully Automated Breast Cancer Detection System from Thermograms

PONE-D-21-28729R2

Dear Dr. Gaber,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Robertas Damaševičius

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

PLoS One. doi: 10.1371/journal.pone.0262349.r006

Acceptance letter

Robertas Damaševičius

27 Dec 2021

PONE-D-21-28729R2

Deep Learning Model for Fully Automated Breast Cancer Detection System from Thermograms

Dear Dr. Gaber:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Professor Robertas Damaševičius

Academic Editor

PLOS ONE

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Table. Evaluation metrics of the proposed system over different dataset sizes.

(XLSX)

Click here for additional data file.^{(14.5KB, xlsx)}

Attachment

Submitted filename: Response to Reviewers.docx

Click here for additional data file.^{(34.2KB, docx)}

Attachment

Submitted filename: Response to Reviewers.docx

Click here for additional data file.^{(20.6KB, docx)}

Data Availability Statement

All relevant data are within the paper and its Supporting Information files.

[pone.0262349.ref001] 1.Ali M, Sayed G, Gaber T, Hassanien A, Snasel V and Silva L. Detection of breast abnormalities of thermograms based on a new segmentation method. In: Proceedings of the Federated Conference on Computer Science and Information Systems; Łódź, Poland. 2015.

[pone.0262349.ref002] 2.Globocan 2018. All Cancer. International Agency for Research on Cancer WHO (2019), [Online]. Available: http://gco.iarc.fr/today/data/factsheets/cancers/39-All-cancers-fact-sheet.pdf. [Accessed 03 March 2019].

[pone.0262349.ref003] 3.Tahoun M, Almazroi AA, Alqami MA, Gaber T, Mahmoud EE, Eltoukhy MM. A grey wolf-based method for mammographic mass classification. Applied Sciences. 2020. Jan;10(23):8422. [Google Scholar]

[pone.0262349.ref004] 4.Augusto A, Figueiredo A, Do Nascimento J, Malheiros F, Da Silva Ignacio L, Fernandes H, et al. Breast tumor localization using skin surface temperatures from a 2d anatomic model without knowledge of the thermophysical properties. COMPUT METH PROG BIO. 2019; 172: 65–77. [DOI] [PubMed] [Google Scholar]

[pone.0262349.ref005] 5.Hossam A, Harb H and Kader H. Automatic image segmentation method for breast cancer analysis using thermography. JES. 2018; 46: 12–32. [Google Scholar]

[pone.0262349.ref006] 6.Dongola N. Mammography in Breast Cancer. MedScape. 2018. [Google Scholar]

[pone.0262349.ref007] 7.Gaber T and Hassanien, AE. Digital mammography image analysis system based on mathematical morphology. In: IEEE computer society 7th Iternational Conference On Intelligent Engineering Systems INES; Assiut-Luxor, Egypt. 2003.

[pone.0262349.ref008] 8.Gaber T, Ismail G, Anter A, Soliman M., Ali N, Hassanien A, et al. Thermogram breast cancer prediction approach based on neutrosophic sets and fuzzy c-means algorithm. In: 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC); Milano, Italy. 2015. [DOI] [PubMed]

[pone.0262349.ref009] 9.Yao X, Wei W, Li J, Wang L, Xu Z, Wan Y, et al. A comparison of mammography, ultrasonography, and far infrared thermography with pathological results in screening and early diagnosis of breast cancer Asian Biomed. 2014; 8: 11–19. [Google Scholar]

[pone.0262349.ref010] 10.Loberg M, Lousda M, Bretthauer M and Kalager M. Benefits and harms of mammography screening. Breast Cancer Res. 2015; 17 63. doi: 10.1186/s13058-015-0525-z [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0262349.ref011] 11.Zuluaga-Gomez J, Zerhouni N, Masry Z, Devalland C and Varnier C. A survey of breast cancer screening techniques: thermography and electrical impedance tomography. J. Med. Eng. Technol. 2019; 43: 305–22. doi: 10.1080/03091902.2019.1664672 [DOI] [PubMed] [Google Scholar]

[pone.0262349.ref012] 12.Fernandes S, Kadry S and Rajinikanth V. A hybrid framework to evaluate breast abnormality using infrared thermal images. IEEE Consumer Electronics Magazine. 2019; 8: 31–36. [Google Scholar]

[pone.0262349.ref013] 13.Walker D and Kaczor T. Breast thermography: History, theory, and use. Natural Medicine Journal. 2012; 4. [Google Scholar]

[pone.0262349.ref014] 14.Hankare P, Shah K, Nair D and Nair D. Breast cancer detection using thermography. IRJET 3. 2016; 1061–64. [Google Scholar]

[pone.0262349.ref015] 15.AlFayez F, El-Soud MW, Gaber T. Thermogram Breast Cancer Detection: a comparative study of two machine learning techniques. Applied Sciences. 2020. Jan;10(2):551 [Google Scholar]

[pone.0262349.ref016] 16.Wehle H. Machine learning, deep learning, and AI: What’s the difference? [Online]. 2017. Available: https://www.researchgate.net/publication/31890021_Machine_Learning_Deep_Learning_and_AI_What’s_the_Difference(2007). [Accessed July 2017].

[pone.0262349.ref017] 17.Zuluaga-Gomez J, Masry Z, Benaggoune K, Merag S and Zerhouni N. A CNN-based methodology for breast cancer diagnosis using thermal images. arXiv:1910.13757v1. 2019. [Google Scholar]

[pone.0262349.ref018] 18.Kundu R, Das R, Geem Z and Han G-T. Pneumonia detection in chest X-ray images using an ensemble of deep learning models. Plos One. 2021; 16 (9): e0256630. doi: 10.1371/journal.pone.0256630 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0262349.ref019] 19.Silva L, Saade D, Sequeiros G, Silva A, Paiva A, Bravo R, et al. A new database for breast research with infrared image. J. Med. Imaging & Health Infor. 2014; 4: 92–100. [Google Scholar]

[pone.0262349.ref020] 20.Mahmoudzadeh E, Montazeri M, Zekri M and Sadri S. Extended hidden markov model for optimized segmentation of breast thermography images. Infrared Phys. Technol. 2015; 72: 19–28. [Google Scholar]

[pone.0262349.ref021] 21.Arau´jo M, Lima R and Souza R. Interval symbolic feature extraction for thermography breast cancer detection. Expert Syst. Appl. 2014; 41: 6728–37. [Google Scholar]

[pone.0262349.ref022] 22.De Santana M, Pereira J, Da Silva F, De Lima N, De Sousa F, De Arruda G, et al. Breast cancer diagnosis based on mammary thermography and extreme learning machines. Res. Biomed. Eng. 2018; 34: 45–53. [Google Scholar]

[pone.0262349.ref023] 23.Milosevic M, Jankovic D and Peulic A. Thermography based breast cancer detection using texture features and minimum variance quantization. Excli J. 2014; 13: 1204–15. [PMC free article] [PubMed] [Google Scholar]

[pone.0262349.ref024] 24.Dey N, Rajinikanth V and Hassanien A. E. An Examination System to Classify the Breast Thermal Images into Early/Acute DCIS Class. In: Proceedings of International Conference on Data Science and Applications; Singapore. 2021.

[pone.0262349.ref025] 25.Francis S, Sasikala M and Saranya S. Detection of breast abnormality from thermograms using curvelet transform based feature extraction J. Med. Syst. 2014; 38 (23). doi: 10.1007/s10916-014-0023-3 [DOI] [PubMed] [Google Scholar]

[pone.0262349.ref026] 26.Pramanik S, Bhattacharjee D and Nasipuri M. Wavelet based thermogram analysis for breast cancer detection. In: Proceedings of the 2015 International Symposium on Advanced Computing and Communication, ISACC 2015; Silchar, India. 2016.

[pone.0262349.ref027] 27.Rajinikanth V, Kadry S, Taniar D, Damaševičius R and Rauf H T. Breast-Cancer Detection using Thermal Images with Marine-Predators-Algorithm Selected Features. In: 2021 Seventh International conference on Bio Signals, Images, and Instrumentation (ICBSII); Chennai, India. 2021.

[pone.0262349.ref028] 28.Mambou S, Maresova P, Krejcar O, Selamat A and Kuca K. Breast cancer detection using infrared thermal imaging and a deep learning model. Sensors. 2018; 18: 2799. doi: 10.3390/s18092799 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0262349.ref029] 29.Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, et al. Going deeper with convolutions. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR);Boston, USA. 2015.

[pone.0262349.ref030] 30.Cabıoğlu Ç and Oğul H. Computer-aided breast cancer diagnosis from thermal images using transfer learning. In: 8th International Work-Conference on Bioinformatics and Biomedical Engineering (IWBBIO 2020). 2020.

[pone.0262349.ref031] 31.de Freitas Barbosa V, de Santana M, Andrade M, de Lima R and dos Santos W. Deep-wavelet neural networks for breast cancer early diagnosis using mammary termographies. Deep Learning for Data Analytics: Foundations, Biomedical Applications and Challenges. Elsevier. 2020; 99–124. [Google Scholar]

[pone.0262349.ref032] 32.Ekici S and Jawzal H. Breast cancer diagnosis using thermography and convolutional neural networks. J. Med. Hypotheses. 2020; 137: 109542. doi: 10.1016/j.mehy.2019.109542 [DOI] [PubMed] [Google Scholar]

[pone.0262349.ref033] 33.Ronnebergerx O, Fischer P and Brox T. U-Net: Convolutional Networks for Biomedical Image Segmentation. In: Medical Image Computing and Computer-Assisted Intervention (MICCAI), Springer. 2015; 9351: 234–241.

[pone.0262349.ref034] 34.Irfan R, Almazroi A, Rauf H, Damaševiˇcius R, Nasr E and Abdelgawad A. Dilated Semantic Segmentation for Breast UltrasonicLesion Detection Using Parallel Feature Fusion. Diagnostics.2021; 11: 1212. doi: 10.3390/diagnostics11071212 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0262349.ref035] 35.Tong G, Li Y, Chen H, Zhang Q and Jiang H. Improved U-NET network for pulmonary nodules segmentation. u81rgvsOptik— International Journal for Light and Electron Optics. 2018; 174: 460–469. [Google Scholar]

[pone.0262349.ref036] 36.Schaul T, Zhang S and LeCun Y. No more pesky learning rates. arXiv:1206.1106v2. 2012. [Google Scholar]

[pone.0262349.ref037] 37.Kingma D and Ba J. Adam: A method for stochastic optimization. arXiv:1412.6980. 2015. [Google Scholar]

[pone.0262349.ref038] 38.Ruder S. An overview of gradient descent optimization algorithms. arXiv:1609.04747v2. 2017. [Google Scholar]

[pone.0262349.ref039] 39.Zejmo M, Kowal M, Korbicz J and Monczak R. Classification of breast cancer cytological specimen using convolutional neural network. J. Phys. Conf. Ser. 2016; 783: 012060. [Google Scholar]

[pone.0262349.ref040] 40.Rajinikanth V, Kadry S, Damaševičius R, Taniar D and Rauf H T. Machine-Learning-Scheme to Detect Choroidal-Neovascularization in Retinal OCT Image. In: 2021 Seventh International conference on Bio Signals, Images, and Instrumentation (ICBSII); Chennai, India. 2021.

[pone.0262349.ref041] 41.Abdel-Nasser M, Moreno A and Puig D. Breast Cancer Detection in Thermal Infrared Images Using Representation Learning and Texture Analysis Methods. Electronics. 2019; 8: 1. [Google Scholar]

[pone.0262349.ref042] 42.Cuevas A, Febrero M, and Fraiman R. An anova test for functional data. Computational Statistics & Data, 2004; 47: 111–122. [Google Scholar]

PERMALINK

Deep learning model for fully automated breast cancer detection system from thermograms

Esraa A Mohamed

Essam A Rashed

Tarek Gaber

Omar Karam

Roles

Abstract

1. Introduction

Fig 1. Breast thermography procedure (thermal image is aquired at room temperature = 22°C).

2. Literature review

3. Proposed method

Fig 2. Flowchart of the proposed method.

3.1 Image resizing

3.2 Breast area segmentation using deep learning (CNN)

Fig 3. Example of U-Net architecture [33].

Fig 4. Example of breast area segmentation with U-Net.

3.3 Deep learning model for classification

Fig 5. Architecture of the proposed deep learning model.

4. Experimental results

Fig 6.

Table 1. Dataset description.

4.1 Breast area segmentation using deep learning (CNN)

Fig 7.

4.2 Evaluation of the deep learning model

Fig 8. The training progress of the proposed deep learning model.

Fig 9. The confusion matrix of the proposed model.

4.3 Impact of changing the training options on the classification process

Table 2. Comparison between solvers (initial learn rate = 2.0e−3, number of epochs = 30 and batch size = 60).

Table 3. The impact of using different number of epochs on the classification accuracy, sensitivity and specificity (solver = ADAM, initial learn rate = 2.0e−3, batch size = 60).

Table 4. Impact of using different batch size on the classification accuracy, sensitivity and specificity (solver = ADAM, initial learn rate = 2.0e−3, number of epochs = 30).

Table 5. Impact of starting the training process with different initial learn rate on the classification accuracy, sensitivity and specificity (solver = ADAM, batch size = 60 and number of epochs = 30).

4.4 Performance of pretrained CNN models on the dataset

Table 6. Comparison between the performance metrics of different CNN models and the proposed model.

4.5 Impact of training/testing data size

Fig 10. Evaluation metrics over different dataset size.

4.6 Performance of machine learning classifier on the dataset

Table 7. Comparison between the performance metrics of different machine learning classifier with texture features and the proposed model.

Table 8. Comparison between the performance metrics of different machine learning classifier with HOG features and the proposed model.

4.7 Statistical analysis

Table 9. Results of the ANOVA test of the proposed model and CNN models.

5. Discussion

Table 10. Comparison with other studies on breast cancer detection (n = normal, ab = abnormal, Ea = Early, Ac = Acute).

6. Conclusion

List of abbreviations

Table 11. Table of abbreviation.

Supporting information

Acknowledgments

Data Availability

Funding Statement

References

Decision Letter 0

Robertas Damaševičius

Roles

Author response to Decision Letter 0

Decision Letter 1

Robertas Damaševičius

Roles

Author response to Decision Letter 1

Decision Letter 2

Robertas Damaševičius

Roles

Acceptance letter

Robertas Damaševičius

Roles

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases