Abstract
Melanoma is defined as a disease that has been incurable in advanced stages, which shows the vital importance of timely diagnosis and treatment. To diagnose this type of cancer early, various methods and equipment have been used, almost all of which required a visit to the doctor and were not available to the public. In this study, an automated and accurate process to differentiate between benign skin pigmented lesions and malignant melanoma is presented, so that it can be used by the general public, and it does not require special equipment and special conditions in imaging. In this study, after preprocessing of the input images, the region of interest is segmented based on the Otsu method. Then, a new feature extraction is implemented on the segmented image to mine the beneficial characteristics. The process is then finalized by using an optimized Deep Believe Network (DBN) for categorization into 2 classes of normal and melanoma cases. The optimization process in DBN has been performed by a developed version of the newly introduced Thermal Exchange Optimization (dTEO) algorithm to obtain higher efficacy in different terms. To show the method's superiority, its performance is compared with 7 different techniques from the literature.
1. Introduction
The skin cancer has been known as one of the deadliest types of cancers which has recently witnessed an exponential growth worldwide. With all these interpretations, it can be cured if it is detected in the early stages, and, in most cases, a simple biopsy can prevent cancer from growing [1]. This cancer has been growing dramatically during the recent years and the importance of the initial therapy has been more to be seen. The melanoma skin cancer has been ranked as the 3rd most common malignant cancer among the skin cancer cases [2]. This cancer alters the skin color as a result of the irregular effects of the cells [3]. Notwithstanding this danger, it is known as a curable cancer if it is detected in the early stages. With all these interpretations, early detection of the melanoma from the other types of the benign skin moles is a thought-provoking mission. Any variations in the size, color, or the shape of a skin mole can be considered as initial signs for this cancer.
Based on the statistical information for 2019, skin cancer with 15000 cases is ranked as the fourth most common cancer, and with 1900 deaths cases are ranked as one of the 10th common cancers in the world [4]. The most common method for melanoma detection is by the physicians and their sampling and testing. However, in some cases, even experienced specialists make some mistakes in melanoma detection. Furthermore, laboratory sampling and testing is a time-consuming and expensive experiment that bothers the patients. Therefore, designing a (computer) system that can detect malignant lesions will be very useful. According to researches, the effectiveness of the computer-aided system is more accurate than that of a specialist doctor. Recently, many research works are accomplished for automatic early detection of skin cancers. Also, the high capability of the medical imaging assists the medical doctors to lessen the diagnosis difficulty of this work and to enhance the speed of the initial diagnosis of the disease.
The application of Artificial Neural Networks (ANNs) in different parts of image processing and machine vision has been increasing. The ANNs are imitated from the human brain actions and are used in a variety of applications, from medicine to economics and engineering [5].
The ANN is made up of lots of highly unified processing components called neurons which can solve problems with their interconnectedness. Artificial Neural Networks, like humans, learn by example to solve different kinds of problems from pattern recognition to information classification throughout a learning procedure.
Deep learning is a sort of ANNs which uses mathematical techniques to give a structure like the human brain. On the other hand, advances in technology have led to algorithms for optimizing normal neural networks so that we can count the number of neural layers. Neural networks are increased from multiple layers to thousands of layers and thousands of neurons in each layer, which until a few years ago could not have created such a structure [6]. This type of neural network is called a deep learning network. Several methodologies of using different kinds of ANNs and other machine learning techniques were introduced for skin cancer diagnosis [7, 8]. For instance, Li et al. [9] proposed a new data synthesis methodology to combine individual skin lesions images and heavily expand them to create noteworthy amounts of data. The study worked on a convolutional neural network (CNN) to provide superior efficiency to the old-style detection and tracking methods. Moreover, the system was trained by humans with simple criteria.
Esteva et al. [10] presented a supervised technique for skin lesions using deep learning. The method's efficiency was verified by twenty-one medical images with two critical binary classifications: malignant and benign groups. Two cases were analyzed here. The first one recognizes the most common cancers, and the second case signifies the deadliest skin cancer identification. The final results indicated the high performance of the proposed method.
Jafari et al. [11] proposed a deep learning-based procedure to provide a precise skin cancer area segmentation. After image denoising and preprocessing, a CNN was utilized for segmentation. They combined local and global structure information and outputted a label for the pixels to create a segmentation mask that indicates the lesion area. This mask was then improved with some postprocessing operators. The empirical results indicated the superiority of the presented method compared to the other techniques.
Mohamed et al. [12] proposed a learning technique to categorize skin lesions to determine melanoma. The method was based on applying a CNN with multiple outlines. The results of the suggested technique were performed by 14 layers into the International Skin Imaging Collaboration (ISIC) dataset. Final results indicated successful ratio for the method in the segmentation of the skin cancer area.
Xu et al. [2] proposed an automatic diagnosis method for melanoma. The presented technique was based on an optimized image segmentation methodology along with a CNN optimized by satin bowerbird optimization (SBO). After segmentation, feature extraction was implemented on the processed image. The main features were then selected using the SBO algorithm to trim additional information. At last, the results were presented by performing an SVM into two groups of normal and melanoma cases. The method's results were authenticated by comparison with other techniques from the literature to indicate its performance.
As the research background shows, the application of deep learning, including CNNs in medical images processing, especially in skin lesion diagnosis, has been exponentially increasing. The present study proposes an optimized technique of skin lesion segmentation using a new design of the Thermal Exchange Optimization algorithm which has been used to reach this purpose with higher performance. The overall technique for the proposed technique has been shown in Figure 1.
Therefore, the main contributions of the present study can be highlighted as follows:
A new computer-aided method for the diagnosis of malignant melanoma
Segmenting the region of interest based on the Otsu method
Feature extraction to extract the beneficial features
A new optimized Deep Believe Network (DBN) to classify the melanoma cases
Optimization based on a developed version of the newly introduced Thermal Exchange Optimization (dTEO) algorithm
2. Skin Cancer Preprocessing
Medical imaging is the method and process used to create images of the human body to examine or advance medical science and is widely used in medicine [13]. Diagnostic images are the subject of many scans, examinations, and images used in medicine. Researches have shown that medical images taken are highly susceptible to vulnerability to noise and other factors such as low contrast. Here are two simple ways to do this purpose.
2.1. Noise Reduction
Images noise removal is a significant part of all medical image processing. This process tries to reduce or remove the noise from the original image. Features that expose them to noise are available on all recording devices [14]. The noise may be white or random noise. Noise reduction recreates and restores the noisy image to its original model based on ideal models in which we analyze how this method is done. The noise has a major impact on the image in certain situations, particularly through the identification of the image edge that needs differentiation which increases the influence of high-frequency pixels that specifically contain noise [15, 16].
The noise in medical images is typically Gaussian noise. Gaussian noise is dispersed in the image and is located between the image main pixels, which makes some inconsistencies for the next operations, and because the processing must be done only on the image main pixels, it requires a proper noise elimination for this purpose. Noise pixels are usually recognized as a different pixel among a group of pixels in the original image that differs from adjacent pixels in different terms such as light intensity and transparency. Wang-Mendel algorithm is an efficient fuzzy-based noise removal tool [17]. Because of the simple conception of the fuzzy techniques, they are popular for different applications. Furthermore, because of their speediness as an early-stage process, they are very useful for producing the initial fuzzy model [18]. In this method, the input and the output datasets show the behavior of the solved state of the problem. The production of the rule database is performed as follows:
Consider fuzzy isolation from the space that includes the input variables, which is achieved by the knowledge of an expert along with normalization. Then, it splits it into equivalent or unequal sections by doing a fuzzy-based method for parting the space of input variable. Then a form of the membership function is chosen and each component is given as a fuzzy package. Afterward, the membership and fuzzy set has been established to all the parts.
A set of candidate language rules are produced, which are generated by choosing the most inclusive rules for the samples.
Validity degree is assigned for the laws, which is achieved by multiplying the membership function values for all the components.
The final rules database is obtained from the rules of the candidate language set, which is done by categorizing these rules into some groups, where the groups include the rules of candidates with the same assumption. The final rule is obtained by selecting the highest degree for all the groups.
2.2. Image Contrast Enhancement
Dermoscopy images typically lack high image contrast, which makes them impossible to process next stages. The concern arises from multiple circumstances, such as the poor quality of measuring instruments and cameras, the low degree of the user interface in photography, the environmental factors, and the prevalence of noise. Sometimes, any crucial details from the images disappear from the above examples, rendering the processing too complex [19]. Contrast enhancement is a method for addressing the problems of contrast consistency. In this research, image contrast enhancement is used to enhance and help demonstrate the specifics of the cancer regions. Global enhancement of contrast using the lookup table is used in the present research. For categorizing and storing the received images on the disk, an 8-bit lookup table is used. The strategy is usually developed as follows [20]:
(1) |
where PDFIn and PDFout represent the probability density function of the input and the output corrected images, respectively, and PDFmin and PDFMax represent the minimum and the maximum probability density levels, respectively. Figure 2 shows an example of the preprocessed image for more clarification.
3. Skin Cancer Segmentation
3.1. Color Space
The RGB is the most common and basic color space in images processing, which usually is used for displaying images. Due to the dependence of these three colors, which are considered as base colors in the RGB color space, as well as their dependence on the intensity of ambient light, this color space is less used for main processing. To compensate for this shortcoming, the XYZ color model has been employed. This color space is a quantifiable link between the physiologically supposed colors in human color vision and wavelengths distributions in the electromagnetic visible spectrum . In this color space, the values of X and Z give color information, and Y defines the luminance. To convert color space from RGB to XYZ, the following equation has been used [21]:
(2) |
The most significant benefit of XYZ color model is that it is completely independent of the device.
3.2. Method of Segmentation
In dermoscopy images, the red color space is the only dimension in RGB color space that gives nearly the strength of the images. As mentioned before, both X and Z values give related information from the color for the XYZ color space. Therefore, the red (R) dimension and the X dimension are just normalized for the segmentation; that is [21],
(3) |
On any pixel of the input images, these normalization values are performed. After this normalization, the Otsu thresholding is used for giving a low-cost segmentation in terms of time complexity.
The Otsu technique is a popular method for proper thresholding of the images. The method is based on the conception of the variance of the intergroup maximization and the intragroup minimizing of the pixels. The global threshold has a problem when the image background resolution is inadequate. A local threshold can be used to eliminate the heterogeneity effect. This problem is solved by image preprocessing to eliminate inhomogeneities and apply a global threshold to the processed image. Based on Otsu technique, threshold level is comprehensively searched to provide minimum in-between variance class, which is expressed as follows:
(4) |
where ωi denotes the probability of two separate classes with a threshold value of t. σi2 in this case is the value of the variance of these classes. In fact, Otsu shows that minimizing the amount of variance in a class is like maximizing interclass variance:
(5) |
where vi2 describes the variance and mi signifies the mean value that can be updated intermittently. This technique is briefly given as follows:
Evaluate image histogram and its intensity level probability
Give initial values of ωi(0) and vi(0) for all levels of possible threshold (t = 1, 2,… )
Renew ωi and mi
Calculate vb2(t)
The optimal threshold here is the maximum of σb2(t).
Then, a morphology operation including closing, opening, and filling is used for better results. In this stage, by applying the mathematical filling operator, the extra image holes are filled. This operator is as follows [22]:
(6) |
where A and B represent the area and the constructing component, respectively.
Then, the opening operation has been established on the filled image for eliminating the ignitor particulars with no changes on other gray surfaces. This operator is as follows:
(7) |
Finally, for connecting the narrow parts, the closing operation is performed based on the following equation:
(8) |
Here, the structural element in this study is set as a 5 × 5 identity matrix. Algorithm 1 shows the pseudocode of the image segmentation.
Figure 3 displays some samples of image segmentation using the suggested segmentation.
4. Features Extraction
Feature extraction is the process of collecting more detailed information from the image. The features extraction in this part of processing is utilized for extracting the main features of the segmented skin cancer area to simplify the diagnosis successful. There are numerous features for feature extraction. This research uses statistical features and texture features, and geometric features have been adopted for this purpose. Table 1 gives the features utilized here.
Table 1.
Feature name | Formula | Feature name | Formula |
---|---|---|---|
Mean | 1/MN∑i=1M∑j=1Np(i, j) | Contrast | ∑i=1M∑j=1Np2(i, j) |
Variance | 1/MN∑i=1M∑j=1N(p(i, j) − μ) | Perimeter | ∑i=1M∑j=1NBp(i, j) |
Standard deviation | Entropy | −∑i=1M∑j=1Np(i, j)log p(i, j) | |
Area | ∑i=1M∑j=1Np(i, j) | Solidity | Area/Convex Area |
Rectangularity | Area/b × a | Correlation | ∑i=1M∑j=1Np(i, j) − μrμc/σrσc |
Irregularity index | 4π × Area/Perimeter2 | Invariant moments | φ 1=η20+η02 |
Form factor | Area/a2 | φ 2=(η20 − η02)2+4η112 | |
φ 3=(η30 − 3η12)2+(3η21 − μ03)2 | |||
Eccentricity | 2/a(a2 − b2)0.5 | Energy | ∑i=1M∑j=1Np2(i, j) |
Elongation | Homogeneity | ∑i=1M∑j=1Np(i, j)/1+|i − j| |
In the above table, MN describes the size of image, Bp describes the length of external side in the pixel boundary, p(i, j) represents the intensity values of the pixels at position, μ defines the mean value, σ describes the standard deviation, and a and b define the major axis and the minor axis, respectively. However, some achieved features are not useful for the process and just increase the method's complexity. To prevent this issue, a feature selection procedure has been utilized. To do so, in this study, an optimized technique has been used. For giving optimal results of features selection, the following cost function has been considered:
(9) |
where FP describes the false positive, FN describes the false negative, TP defines true positive, and TN defines the true negative. The idea is to minimize the above equation based on a developed optimization method.
5. Extreme Learning Machine (ELM)
One of the neural network models which has recently received many considerations is the ELM model. Some of the advantages for ELM are their high speed in learning, their simple using, and their ability to be used in numerous activation functions and nonlinear kernel functions. The ELM model can provide an integrated template with a wide variety of feature transmissions that can be used in the hidden layer, which can be utilized directly for regression and categorization. This algorithm is a learning method for Single Hidden Neural Networks (SHNN) based on initializing the input biases and weights randomly and the output weights evaluation. Thus, the network can be trained in just a few steps. Figure 4 shows the arrangement of a simple ELM network.
For a classification problem with D dimension and N number of training samples [23],
(10) |
where x(n) ∈ ℝD and t(n) ∈ ℝK.
An ELM-based feed-forward neural network can be formulated as follows:
(11) |
where g(·) describes the activation function, bm describes the bias of the mth hidden neuron, M signifies the hidden neurons, wm=[wm1, wm2,…, wmD] determines the input weight vector which connects the input neurons to the mth the neuron of the hidden layer, and βm=[βm1, βm2,…, βmK] determines the vector of weight that connects the mth neuron.
This conception is given as follows:
(12) |
where
(13) |
Meanwhile, H defines a nonsquare matrix. The training samples quantity is bigger than the hidden neurons quantity. Therefore, to solve this problem, the following formulation has been used:
(14) |
where H† describes the generalized Moore-Penrose matrix inverse.
Thus, the ELM networks can be shortened to three main steps:
Initializing the input weights and bias with random values
Evaluation of the hidden neurons' outputs matrix H
Evaluation of the hidden neurons' outputs weights matrix by equation (16)
6. The Developed Thermal Exchange Optimization Algorithm
Optimization is the process of using specific techniques to solve the optimization problems. However, in recent years, the performance of the classic optimization methods due to increasing the complexity and nonlinearity of them has been decreased or even failed [24]. Recently, several methods have been introduced for resolving these problems [25]. Metaheuristics are one class of these methods that have been popular due to their simple and suitable results in various problems. The metaheuristic algorithms are derived from various phenomena from nature, human society, animal hunting behavior, and so forth. Several algorithms have been proposed in this field [26–29], for example, World Cup Optimizer [30], Ant Lion Optimizer (ALO) [31], Chimp Optimization Algorithm [32], Harris Hawks Optimization [33], and mayfly optimization algorithm [34]. In the present research, a novel modified model of the Thermal Exchange Optimization (TEO) algorithm has been presented to achieve optimal results for the considered methodology [5]. The TEO algorithm is a novel metaheuristic technique that is inspired by the temperature of the objects and their position that is switched between cold and warm places, indicating the updated positions. In the following, more explanation about this algorithm has been provided.
6.1. The Newton Cooling Law
The tests indicated that the rate of cooling was roughly related to the temperature difference between the environment and the heated object. This is formulated as in the following equation:
(15) |
where A describes the area of the body surface that transmits heat, Q signifies the heat, α defines the coefficient of heat transfer that is reliant on different cases like object geometry, heat transfer mode, and surface state, and Tb and Ts represent the temperature of the body and the ambient temperature, respectively.
The heat loss in time dt is α × A × (Ts − T) dt, which states the variation in kept heat as the temperature drops dT; in other words,
(16) |
where d signifies the density (kg/m3), c defines the specific heat (J/kg/K), and V describes the volume (m3).
So,
(17) |
where TM describes the initial high temperature. Equation (19) is valid if when α × A × t/V × d × c is T-independant,
(18) |
So, by considering a constant ζ,
(19) |
So, consequently, by reformulating equation (19),
(20) |
6.2. The Algorithm
In Thermal Exchange Optimization, some of the candidates have been assumed as the cooling objects and the other remaining candidates are assumed as the environment; afterward, the reverse operation has been done. In TEO algorithm, like any other metaheuristic algorithm, the candidates are first generated randomly. So, the initial temperature for each object is formulated by the following equation:
(21) |
where rnd signifies a random vector limited between 0 and 1, Ti0 describes the initial solution vector of the ithobject, andTmin and Tmax represent the minimum and maximum limitations of the candidates.
The randomly generated candidates are then performed to the cost fiction to evaluate their cost value. After that, the position of some best T candidate vectors is saved as Thermal Memory (TM) to be utilized then for improving the algorithm's efficacy by less computational cost. Afterward, some best TM candidates have been given and the equal numbers of population with the worst values have been eliminated.
The candidates are separated into two equal types that are shown in Figure 5.
For more clarification, T1 determines the environment object for Tn/2+1 cooling object, and, inversely, if any object has lower value than ζ, the temperature exchanging is performed slowly. Therefore, ζ will be achieved by the following formula:
(22) |
Another important term in the algorithm is time, which is associated with the iteration quantity. This parameter is formulated in the following:
(23) |
To give an exploration term to the algorithm, the environmental temperature changing has been modeled, which is given in the following equation:
(24) |
where m1 and m2 describe the control variables and Ti'e signifies the object earlier temperature which has been modified to Tie.
By considering the former models, the new temperature of the objects is updated as follows:
(25) |
The next term is Pr, which is used to indicate whether the cooling objects' element should be altered or not.
The Pr components are compared with R(i)(i = 1,2,…, n), which has a randomly distributed value between 0 and 1. If R(i) < Pr, the ith candidate will be randomly chosen as 1-dimensional and the value is reformulated by the following:
(26) |
where Ti,j determines the jth variable of the ith candidate and Tjminand Tjmax are the minor and the major limitations of the variable number j, respectively. At last, the algorithm is stopped when stopping criteria are checked to terminate the algorithm.
6.3. Developed TEO Algorithm
Although TEO algorithm has numerous benefits in providing the best global solution, it also has some drawbacks which could be resolved. The main drawback of the algorithm could be its local minimum results and low convergence in some cases. Numerous improvements were introduced for resolving the exploration of the metaheuristics. In this research, chaotic theory has been utilized as the first improvement. The reason for using chaos in this algorithm is to improve the algorithm's efficiency to resolve the local optimization followed by an advanced speed of convergence. In the considered dTEO algorithm, the modified randk is replaced with the “rnd,” i.e.,
(27) |
where k signifies the number of iterations.
The other improvement for the algorithm to give a proper trade-off between exploration and exploitation in the algorithm using the Gaussian mutation mechanism is formulated below [35]:
(28) |
where σ2 signifies the variance of the Gaussian PDF and μ determines the Gaussian distribution expectation. This difference is applied to the considered location by renewing the TEO algorithm (Ti,jnew) as follows:
(29) |
where γ describes a decreasing random value between 0 and 1 and g(0,1) describes the standard Gaussian distribution.
6.4. Algorithm Authentication
In this section, the proposed developed Thermal Exchange Optimization (dTEO) algorithm that is stated in the previous section has been programmed by the MATLAB 2017b environment and then validated based on a 16 GB RAM Core™ i7/4720HQ 1.60 GHz processor. The proposed dTEO algorithm validation is based on applying it to some standard test cases. The results are also compared with some other approaches, that is, Multiverse Optimizer (MVO) [36], Locust Swarm Optimization (LSO) [37], Spotted Hyena Optimizer (SHO) [38], and the original TEO [39], to show the proposed method's efficacy. Table 2 shows the details of the studied functions.
Table 2.
# | Test function | Minimum value | Boundary |
---|---|---|---|
1 | F 1=∑n=1Nxn2 | 0 | −∞ ≤ x ≤ ∞ |
2 | F 2=∑n=1N−1(100 × [xn+1 − xn2]2+[1 − xn]2) | 0 | −∞ ≤ xn ≤ ∞ |
3 | 0 | −∞ ≤ xn ≤ ∞ | |
4 | F 4=x sin(4x)+1.1y sin (2y) | −18.5547 | 0 ≤ x, y ≤ 10 |
5 | F 5=[∑n=1Nnxn4]+Nn(0,1) | Varies | −∞ ≤ x ≤ ∞ |
6 | F 6=10N+∑n=1N[xn2 − 10 cos(2πxn)] | 0 | −∞ ≤ xn ≤ ∞ |
7 | 0 | −∞ ≤ xn ≤ ∞ | |
8 | −0.5231 | −∞ ≤ x, y ≤ ∞ |
In Table 3, the parameters settings of all studied algorithms for use in this study are given below.
Table 3.
For determining the performance of the algorithm, they performed on the benchmark functions and two measurement indicators including the minimum value and the standard values are extracted from the results to show the algorithms' accuracy and reliability. It should be noted that, for all algorithms, the population size and the number of iterations are set as 120 and 100, respectively. The simulations have been repeated 20 times to give consistent results. Table 4 illustrates the simulation results.
Table 4.
Function | Algorithm | ||||||
---|---|---|---|---|---|---|---|
LSO [37] | SHO [38] | MVO [36] | ALO [31] | TEO | dTEO | ||
f 1 | Min | 3.837e – 24 | 2.1201e – 28 | −2.2800e – 27 | 3.3197e – 28 | 3.4511e – 28 | 8.2193e – 31 |
Std | 2.559e – 19 | 2.4937e – 27 | 4.0865e – 28 | 2.9938e – 29 | 2.0073e – 30 | 2.37928e – 32 | |
| |||||||
f 2 | Min | 7.2874e – 5 | 7.564e – 4 | 4.824e – 4 | 2.56384e – 5 | 4.5463e – 6 | 6.7811e – 5 |
Std | 3.2084e – 6 | 4.18297e – 5 | 2.1167e – 5 | 3.5918e – 6 | 5.0648e – 7 | 2.1253e – 5 | |
| |||||||
f 3 | Min | −7.2553 | −8.15754 | −10.00 | −7.19376 | −10.00 | −10.00 |
Std | 0.35 | 0.46 | 0.27 | 0.15 | 0.15 | 0.10 | |
| |||||||
f 4 | Min | −5.2846 | −18.053 | −17.1146 | −16.3927 | −18.1106 | −18.1683 |
Std | 3.126 | 1.294 | 2.391 | 4.190 | 1.630 | 1.01 | |
| |||||||
f 5 | Min | 11.46e – 10 | 2.597e – 16 | 2.18769e – 7 | 5.0913e – 9 | 2.8096e – 23 | 3.7938e – 22 |
Std | 8.942e – 11 | 4.0973e – 17 | 2.89430e – 8 | 6.5514e – 10 | 2.8897e – 25 | 7.1937e – 24 | |
| |||||||
f 6 | Min | 4.276e – 11 | 2.1953e – 12 | 2.1967e – 21 | 3.1957e – 10 | 2.0054e – 18 | 5.6124e – 23 |
Std | 7.297e – 12 | 1.5374e – 14 | 4.2849e – 23 | 6.8091e – 11 | 2.3169e – 19 | 4.6498e – 24 | |
| |||||||
f 7 | Min | 2.623e – 15 | 3.3733e – 10 | 5.0406e – 8 | 3.7628e – 13 | 3.6781e – 9 | 6.39482e – 17 |
Std | 1.167e – 16 | 4.1967e – 12 | 4.9364e – 9 | 3.1936e – 15 | 4.1945e – 10 | 2.22864e – 19 | |
| |||||||
f 8 | Min | 0.0067 | −0.1472 | −0.3492 | −0.5846 | −0.5790 | −0.5273 |
Std | 0.653 | 0.467 | 0.385 | 0.805 | 0.152 | 0.100 |
As can be seen from Table 4, the suggested dTEO has the least value of the minimum value for the minimization problems. This means that the suggested method provides the best accuracy toward the other state-of-the-art methods for solving the studied functions. Plus, based on the results achieved for the standard deviation, it is definite that the suggested method has the minimum value of that. This shows that using the proposed method has the highest reliability against the comparative techniques in this study. So, we can conclude that using this algorithm in this study is a good idea in terms of accuracy and reliability.
7. Optimized ELM Network Based on Algorithm
The classification of the final extracted features is performed based on the proposed optimized ELM network. The main task of the ELM classifier here is to determine the skin cancer dermoscopy image into two groups of normal and malignant melanoma classes.
In this study, activation functions of the network are considered for enhancement. Generally, the significant effect of activation function on the ANNs is too clear, because a network with various activation functions gives good generalization capacities. This study considers a sigmoid activation function as one of the popular functions in this area. The formulation for this function is explained as follows:
(30) |
For improving the efficiency of the ELM-based classifier, different sigmoid functions have been utilized in the hidden neurons. In other words, different values are utilized foras and bs. Figure 6 displays the impact of as and bs on the function shape.
In other words, the problem of finding the suitable parameter of the sigmoid function can be considered as an optimization problem. Here, we used a developed Thermal Exchange Optimization algorithm to refine the issue. However, both “as” and “bs” have almost equal effect on the function shape; we used the “bs” for optimization of the network. Therefore, two optimization policies have been used.
The input weights have been first initialized. Then, the developed Thermal Exchange Optimization algorithm is utilized for optimal parameters selection by assuming the initial weights. The objective function of the developed TEO is based on the square error between the output of the desired value and the network by the following:
(31) |
where n and k describe the numbers of training samples and the output layers, respectively, and yji and dji represent the output of the network and the desired value, respectively.
The primary target of the present study is to utilize the mentioned optimized ELM network for diagnosis of the skin cancer. The optimization is using a newly designed metaheuristic, namely, the developed Thermal Exchange Optimization algorithm.
8. Results and Discussion
In this paper, a new optimized methodology is used to detect the skin cancer from dermoscopy images.
8.1. The Database
For validating the proposed skin cancer identification system, there are many databases. This analysis uses a well-known dataset, called SIIM-ISIC Melanoma. The images are generated in DICOM format to be retrieved by popular libraries and include both image and metadata. Two formats of TFRecord and JPEG are considered for the dataset. The size of the images is uniform (1024 × 1024). Some examples of the utilized images in the study are displayed in Figure 7.
8.2. Results and Discussion
As mentioned before, the present study is authenticated by the ACS database and the results have been verified by some different methodologies. The simulations are performed on a laptop with MATLAB 2018b environment with the following information: 16 GB RAM and Core™ i7-4720HQ-1.60 GHz processor. The pipeline methodology of the method is explained in the following:
Start
Apply preprocessing
Apply noise reduction
Apply image contrast enhancement
Perform skin cancer segmentation
Apply features extraction based on texture features, statistical features, and the geometric features
Perform dTEO-based ELM classifier for the final diagnosis
Return the results
End
In the presented dTEO-based ELM classifier, 80% of data are set to train data, and the remaining 20% are set to test the data. The training step of the network is assumed by 700 iterations. The training process has been iterated 15 times to achieve a confidential result. The validation has been performed using 5 measurement indexes: specificity, sensitivity, accuracy, PPV, and NPV, which are given as follows:
(32) |
The results are compared with 7 other methods, fractal analysis [40], CNN [41], Delaunay Triangulation [42], Side-by-Side method [43], Genetic Algorithm [44], fusion method [45], and SVM [46], to verify the method's performance in different terms.
Table 5 tabulates the performance validation for the suggested technique compared with other methods. The results show the mean value of 30 runs for the methods.
Table 5.
Method | Performance metric | ||||
---|---|---|---|---|---|
Accuracy | Sensitivity | Specificity | PPV | NPV | |
Fractal analysis [40] | 88.23 | 91.18 | 88.23 | 80.88 | 92.65 |
CNN [41] | 82.35 | 83.82 | 61.76 | 66.18 | 83.82 |
Delaunay Triangulation [42] | 86.76 | 89.70 | 86.76 | 77.94 | 88.23 |
Side-by-Side method [43] | 67.64 | 83.82 | 58.82 | 58.82 | 83.82 |
Genetic Algorithm [44] | 80.88 | 79.412 | 79.41 | 75.00 | 85.29 |
Fusion method [45] | 83.82 | 82.35 | 76.47 | 70.58 | 80.88 |
SVM [46] | 83.82 | 83.82 | 76.47 | 75.00 | 89.70 |
Proposed method | 92.65 | 91.18 | 89.70 | 86.76 | 94.12 |
As is concluded from Table 5, the suggested method with 95% accuracy provides the uppermost precision among the other compared techniques. Also, with 95% of sensitivity, which is the highest against other methods, the reliability of the proposed method has been proved. This can be proved also for NPV, PPV, and specificity toward the others. The better results of NPV and PPV indicate higher commonness of the condition to determine the probability of a test diagnosing cancer. Also, better results of the specificity and sensitivity for the suggested technique show higher prevalence-independent results for the algorithm. For more clarification, the results of Table 5 are shown in Figure 8.
9. Conclusions
In recent years, skin cancer has been recognized as the most dangerous and common type of cancer in humans. There are different types of skin cancer. Melanoma is a common type of skin cancer, where early detection can be helpful in its treatment and can suggestively stop death from this deadly skin cancer. Designing an approach that facilitates the skin cancer detection in the early stages is very useful and valuable. In this study, an optimized pipeline procedure was utilized for the optimal detection of melanoma from dermoscopy images. In the proposed method, after preprocessing of the input dermoscopy images based on noise reduction and contrast enhancement, the region of interest was segmented. Afterward, feature extraction was implemented on the segmented images to extract useful features from them. Finally, an optimized Deep Believe Network (DBN) was utilized to separate images into two classes of healthy and cancerous. The optimization of the DBN was using a new metaheuristic method called the developed Thermal Exchange Optimization algorithm to improve the network efficiency in terms of reliability and accuracy. So, the main contribution of the proposed method is to use a newly developed version of the newly introduced Thermal Exchange Optimization for the diagnosis of malignant melanoma. The main advantage of using this technique is that the results showed that it improved the system's efficiency in both terms of accuracy and precision or, in other words, its reliability during different runs. The performance of the proposed technique was authenticated by comparing it with 7 other methods: fractal analysis, CNN, Delaunay Triangulation, Side-by-Side method, Genetic Algorithm, fusion method, and SVM. Simulation results specified that, based on some various performance indexes, the suggested procedure gave the best results against the compared techniques. The main limitation of the proposed method is that, due to different soft techniques, it needs a lot of time. In the future study, we will work on designing a method to simplify the method from a theoretical methodology to a real-time system for practical applications.
Acknowledgments
This study was supported by specialty construction project of high level higher vocational colleges and Technical Skills Expert Project of Dongguan Polytechnic (Y17040323).
Data Availability
The data are available at https://www.kaggle.com/c/siim-isic-melanoma-classification.
Conflicts of Interest
The authors declare that there are no conflicts of interest in this research.
References
- 1.Bao S., Ebadi A., Toughani M., et al. A new method for optimal parameters identification of a PEMFC using an improved version of monarch butterfly optimization algorithm. International Journal of Hydrogen Energy . 2020;45(35):17882–17892. doi: 10.1016/j.ijhydene.2020.04.256. [DOI] [Google Scholar]
- 2.Xu Z., Sheykhahmad F. R., Ghadimi N., Razmjooy N. Computer-aided diagnosis of skin cancer based on soft computing techniques. Open Medicine . 2020;15(1):860–871. doi: 10.1515/med-2020-0131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Liu Q., Liu Z., Yong S., Jia K., Razmjooy N. Computer-aided breast cancer diagnosis based on image segmentation and interval analysis. Automatika . 2020;61(3):496–506. doi: 10.1080/00051144.2020.1785784. [DOI] [Google Scholar]
- 4.Costa A., Kieffer Y., Scholer-Dahirel A., et al. Fibroblast heterogeneity and immunosuppressive environment in human breast cancer. Cancer Cell . 2018;33(3):463.e10–479.e10. doi: 10.1016/j.ccell.2018.01.011. [DOI] [PubMed] [Google Scholar]
- 5.Yu D., Wang Y., Liu H., Jermsittiparsert K., Razmjooy N. System identification of PEM fuel cells using an improved elman neural network and a new hybrid optimization algorithm. Energy Report . 2019;5:1365–1374. doi: 10.1016/j.egyr.2019.09.039. [DOI] [Google Scholar]
- 6.Rajinikanth V., Couceiro M. S. RGB histogram based color image segmentation using firefly algorithm. Procedia Computer Science . 2015;46:1449–1457. doi: 10.1016/j.procs.2015.02.064. [DOI] [Google Scholar]
- 7.Acharya U. R., Fernandes S. L., WeiKoh J. E., et al. Automated detection of alzheimer’s disease using brain MRI images-a study with various feature extraction techniques. Journal of Medical Systems . 2019;43(9):p. 302. doi: 10.1007/s10916-019-1428-9. [DOI] [PubMed] [Google Scholar]
- 8.Raja N. S. M., Fernandes S. L., Dey N., Satapathy S. C., Rajinikanth V. Contrast enhanced medical MRI evaluation using tsallis entropy and region growing segmentation. Journal of Ambient Intelligence and Humanized Computing . 2018:1–12. doi: 10.1007/s12652-018-0854-8. [DOI] [Google Scholar]
- 9.Li Y., Esteva A., Kuprel B., Novoa R., Ko J., Thrun S. Skin cancer detection and tracking using data synthesis and deep learning. 2016. https://arxiv.org/abs/1612.01074 .
- 10.Esteva A., Kuprel B., Novoa R. A., et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature . 2017;542(7639):115–118. doi: 10.1038/nature21056. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Jafari M. H., Karimi N., Nasr-Esfahani E., et al. Skin lesion segmentation in clinical images using deep learning. Proceedings of the 2016 23rd International Conference on Pattern Recognition (ICPR); December 2016; Cancun, Mexico. IEEE; [DOI] [Google Scholar]
- 12.Mohamed A., Mohamed W. A., Zekry A. H. Deep learning can improve early skin cancer detection. International Journal of Electronics and Telecommunications . 2019;65 [Google Scholar]
- 13.Pugalenthi R., Rajakumar M. P., Ramya J., Rajinikanth V. Evaluation and classification of the brain tumor MRI using machine learning technique. Journal of Control Engineering and Applied Informatics . 2019;21(4):12–21. [Google Scholar]
- 14.Satapathy S. C., Sri Madhava Raja N., Rajinikanth V., Ashour A. S., Dey N. Multi-level image thresholding using otsu and chaotic bat algorithm. Neural Computing and Applications . 2018;29(12):1285–1307. doi: 10.1007/s00521-016-2645-5. [DOI] [Google Scholar]
- 15.Anoraganingrum D. Cell segmentation with median filter and mathematical morphology operation. Proceedings of the 10th International Conference on Image Analysis and Processing; September 1999; Venice, Italy. IEEE; [DOI] [Google Scholar]
- 16.Loupas T., McDicken W. N., Allan P. L. An adaptive weighted median filter for speckle suppression in medical ultrasonic images. IEEE Transactions on Circuits and Systems . 1989;36(1):129–135. doi: 10.1109/31.16577. [DOI] [Google Scholar]
- 17.Rajinikanth V., Satapathy S. C. Segmentation of ischemic stroke lesion in brain MRI based on social group optimization and fuzzy-tsallis entropy. Arabian Journal for Science and Engineering . 2018;43(8):4365–4378. doi: 10.1007/s13369-017-3053-6. [DOI] [Google Scholar]
- 18.Dey N., Rajinikanth V., Ashour A., Tavares J. M. Social group optimization supported segmentation and evaluation of skin melanoma images. Symmetry . 2018;10(2):p. 51. doi: 10.3390/sym10020051. [DOI] [Google Scholar]
- 19.Fernandes S. R., de Assis J. T., Estrela V. V., et al. Advances in Multidisciplinary Medical Technologies—Engineering, Modeling and Findings . Berlin, Germany: Springer; 2020. Nondestructive diagnosis and analysis of computed microtomography images via texture descriptors; pp. 249–261. [DOI] [Google Scholar]
- 20.Singh K. B., Mahendra T. V., Kurmvanshi R. S., Rama Rao C. V. Image enhancement with the application of local and global enhancement methods for dark images. Proceedings of the 2017 International Conference on Innovations in Electronics, Signal Processing and Communication (IESC); April 2017; Shillong, India. IEEE; [DOI] [Google Scholar]
- 21.Carney M. N., Johnston W. M. A novel regression model from RGB image data to spectroradiometric correlates optimized for tooth colored shades. Journal of Dentistry . 2016;51:45–48. doi: 10.1016/j.jdent.2016.05.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Maragos P. Differential morphology and image processing. IEEE Transactions on Image Processing . 1996;5(6):922–937. doi: 10.1109/83.503909. [DOI] [PubMed] [Google Scholar]
- 23.Cao J., Zhang K., Luo M., Yin C., Lai X. Extreme learning machine and adaptive sparse representation for image classification. Neural Networks . 2016;81:91–102. doi: 10.1016/j.neunet.2016.06.001. [DOI] [PubMed] [Google Scholar]
- 24.Ramezani M., Bahmanyar D., Razmjooy N. A new optimal energy management strategy based on improved multi-objective antlion optimization algorithm: applications in smart home. SN Applied Sciences . 2020;2(12):1–17. doi: 10.1007/s42452-020-03885-7. [DOI] [Google Scholar]
- 25.Tian M.-W., Yan S.-R., Han S.-Z., Nojavan S., Jermsittiparsert K., Razmjooy N. New optimal design for a hybrid solar chimney, solid oxide electrolysis and fuel cell based on improved deer hunting optimization algorithm. Journal of Cleaner Production . 2020;249 doi: 10.1016/j.jclepro.2019.119414.119414 [DOI] [Google Scholar]
- 26.Yanda L., Yuwei Z., Razmjooy N. Optimal arrangement of a micro-CHP system in the presence of fuel cell-heat pump based on metaheuristics. International Journal of Ambient Energy . 2020:1–12. doi: 10.1080/01430750.2020.1758779. [DOI] [Google Scholar]
- 27.Yuan Z., Wang W., Wang H., Razmjooy N. A new technique for optimal estimation of the circuit-based PEMFCs using developed sunflower optimization algorithm. Energy Report . 2020;6:662–671. doi: 10.1016/j.egyr.2020.03.010. [DOI] [Google Scholar]
- 28.Guo Y., Dai X., Jermsittiparsert K., Razmjooy N. An optimal configuration for a battery and PEM fuel cell-based hybrid energy system using developed krill herd optimization algorithm for locomotive application. Energy Report . 2020;6:885–894. doi: 10.1016/j.egyr.2020.04.012. [DOI] [Google Scholar]
- 29.Yang Z., Liu Q., Zhang L., Dai J., Razmjooy N. Model parameter estimation of the PEMFCs using improved barnacles mating optimization algorithm. Energy . 2020;212 doi: 10.1016/j.energy.2020.118738.118738 [DOI] [Google Scholar]
- 30.Navid R., Khalilpour M., Ramezani M. A new meta-heuristic optimization algorithm inspired by FIFA world cup competitions: theory and its application in PID designing for AVR system. Journal of Control, Automation and Electrical Systems . 2016;27(4):419–440. [Google Scholar]
- 31.Mani M., Bozorg-Haddad O., Chu X. Advanced Optimization by Nature-Inspired Algorithms . Berlin, Germany: Springer; 2018. Ant lion optimizer (ALO) algorithm; pp. 105–116. [DOI] [Google Scholar]
- 32.Khishe M., Mosavi M. R. Chimp optimization algorithm. Expert Systems with Applications . 2020;149 doi: 10.1016/j.eswa.2020.113338.113338 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Heidari A. A., Mirjalili S., Faris H., Aljarah I., Mafarja M., Chen H. Harris hawks optimization: algorithm and applications. Future Generation Computer Systems . 2019;97:849–872. doi: 10.1016/j.future.2019.02.028. [DOI] [Google Scholar]
- 34.Zervoudakis K., Tsafarakis S. A mayfly optimization algorithm. Computers & Industrial Engineering . 2020;145 doi: 10.1016/j.cie.2020.106559.106559 [DOI] [Google Scholar]
- 35.Higashi N., Iba H. Particle swarm optimization with gaussian mutation. Proceedings of the 2003 IEEE Swarm Intelligence Symposium. SIS’03 (Cat. No. 03EX706); April 2003; Indianapolis, IN, USA. IEEE; [DOI] [Google Scholar]
- 36.Mirjalili S., Mirjalili S. M., Hatamlou A. Multi-verse optimizer: a nature-inspired algorithm for global optimization. Neural Computing and Applications . 2016;27(2):495–513. doi: 10.1007/s00521-015-1870-7. [DOI] [Google Scholar]
- 37.Cuevas E., Fausto F., González A. New Advancements in Swarm Algorithms: Operators and Applications . Berlin, Germany: Springer; 2020. The locust swarm optimization algorithm; pp. 139–159. [DOI] [Google Scholar]
- 38.Dhiman G., Kumar V. Spotted hyena optimizer: a novel bio-inspired based metaheuristic technique for engineering applications. Advances in Engineering Software . 2017;114:48–70. doi: 10.1016/j.advengsoft.2017.05.014. [DOI] [Google Scholar]
- 39.Kaveh A., Dadras A. A novel meta-heuristic optimization algorithm: thermal exchange optimization. Advances in Engineering Software . 2017;110:69–84. doi: 10.1016/j.advengsoft.2017.03.014. [DOI] [Google Scholar]
- 40.Dobrescu R., Dobrescu M., Mocanu S., Popescu D. Medical images classification for skin cancer diagnosis based on combined texture and fractal analysis. WISEAS Transactions on Biology and Biomedicine . 2010;7(3):223–232. [Google Scholar]
- 41.Zhang N., Cai Y.-X., Wang Y.-Y., Tian Y.-T., Wang X.-L., Badami B. Skin cancer diagnosis based on optimized convolutional neural network. Artificial Intelligence in Medicine . 2020;102 doi: 10.1016/j.artmed.2019.101756.101756 [DOI] [PubMed] [Google Scholar]
- 42.Pennisi A., Bloisi D. D., Nardi D., Giampetruzzi A. R., Mondino C., Facchiano A. Skin lesion image segmentation using delaunay triangulation for melanoma detection. Computerized Medical Imaging and Graphics . 2016;52:89–103. doi: 10.1016/j.compmedimag.2016.05.002. [DOI] [PubMed] [Google Scholar]
- 43.Babino G., Lallas A., Agozzino M., et al. Melanoma diagnosed on digital dermoscopy monitoring: a side-by-side image comparison is needed to improve early detection. Journal of the American Academy of Dermatology . 2020;85(3):619–625. doi: 10.1016/j.jaad.2020.07.013. [DOI] [PubMed] [Google Scholar]
- 44.Salem C., Azar D., Tokajian S. An image processing and genetic algorithm-based approach for the detection of melanoma in patients. Methods of Information in Medicine . 2018;57(1):74–80. doi: 10.3412/ME17-01-0061. [DOI] [PubMed] [Google Scholar]
- 45.Das R., De S., Bhattacharyya S., Platos J., Snasel V., Hassanien A. E. Data augmentation and feature fusion for melanoma detection with content based image classification. Proceedings of the International Conference on Advanced Machine Learning Technologies and Applications; January 2020; Cairo, Egypt. [DOI] [Google Scholar]
- 46.Alquran H., Qasmieh I. A., Alqudah A. M., et al. The melanoma skin cancer detection and classification using support vector machine. Proceedings of the 2017 IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies (AEECT); October 2017; Aqaba, Jordan. IEEE; [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The data are available at https://www.kaggle.com/c/siim-isic-melanoma-classification.