Skin lesion classification of dermoscopic images using machine learning and convolutional neural network

Bhuvaneshwari Shetty; Roshan Fernandes; Anisha P Rodrigues; Rajeswari Chengoden; Sweta Bhattacharya; Kuruva Lakshmanna

doi:10.1038/s41598-022-22644-9

. 2022 Oct 28;12:18134. doi: 10.1038/s41598-022-22644-9

Skin lesion classification of dermoscopic images using machine learning and convolutional neural network

Bhuvaneshwari Shetty ¹, Roshan Fernandes ², Anisha P Rodrigues ², Rajeswari Chengoden ³, Sweta Bhattacharya ³, Kuruva Lakshmanna ^3,^✉

PMCID: PMC9616944 PMID: 36307467

Abstract

Detecting dangerous illnesses connected to the skin organ, particularly malignancy, requires the identification of pigmented skin lesions. Image detection techniques and computer classification capabilities can boost skin cancer detection accuracy. The dataset used for this research work is based on the HAM10000 dataset which consists of 10015 images. The proposed work has chosen a subset of the dataset and performed augmentation. A model with data augmentation tends to learn more distinguishing characteristics and features rather than a model without data augmentation. Involving data augmentation can improve the accuracy of the model. But that model cannot give significant results with the testing data until it is robust. The k-fold cross-validation technique makes the model robust which has been implemented in the proposed work. We have analyzed the classification accuracy of the Machine Learning algorithms and Convolutional Neural Network models. We have concluded that Convolutional Neural Network provides better accuracy compared to other machine learning algorithms implemented in the proposed work. In the proposed system, as the highest, we obtained an accuracy of 95.18% with the CNN model. The proposed work helps early identification of seven classes of skin disease and can be validated and treated appropriately by medical practitioners.

Subject terms: Skin diseases, Computational biology and bioinformatics

Introduction

A skin lesion is a growth or appearance of the skin that is abnormal concerning the surrounding skin. Primary and secondary skin lesions are the two types of skin lesions. Primary skin lesions are abnormal skin conditions that can develop over time or be present at birth. Secondary skin lesions can develop from primary skin lesions that have been exacerbated or altered. When a mole is scraped until it bleeds, the crust that forms, as a result, develops a secondary skin lesion¹. Dermatologists propose one of three treatments for afflicted skin, depending on the type of lesion: home care, medicines, or surgery. Regardless of ways innocent they appear; a few sorts of skin lesions may be pretty risky to the patients, since they will indicate the presence of malignancy and require surgical removal. Melanoma is the most dangerous type of skin cancer; as soon as it has spread, it’s deadly, however, it is treatable in its early stages. As a result, a precise diagnosis of skin patches is essential to protect patients’ growths and ensure that they receive timely treatment².

Machine Learning methods could be used to automate the analysis, resulting in a system and framework in the medical field that would aid in providing contextual relevance, improving clinical reliability, assisting physicians in communicating objectively, reducing errors related to human fatigue, lowering mortality rates, lowering medical costs, and more easily identifying diseases. A machine learning method that can categorize both malignant and benign pigmented skin lesions is a step toward achieving these goals³. In the proposed work, Convolutional Neural Networks (CNN) and Machine Learning algorithms are used to accurately classify pigmented skin lesions in dermoscopic images to detect malignant skin lesions as early as feasible.

The HAM10000 dataset which consists of 10015 images has been used in the proposed work.The HAM10000 dataset is a vast collection of dermoscopic images of pigmented skin lesions which are very common from multiple sources⁴. Datasets with significant class imbalances are fairly common in the medical industry. It is the same with this data set. In the proposed work, it proved to be a significant challenge. The dataset images have a resolution of 600 × 450 pixels and are saved as JPEG formats. They are manually cropped and cantered around the lesion, as well as modified for visual contrast and color reproduction, at first. Each image and patient had seven features, namely, age of the patient, sex of the patient, lesion id which is a unique identifier for a particular type of lesion, image id which is a unique identification number for an image, dx type for technical validation, Skin lesion’s geographical location, and a diagnostic skin lesion category which is a classification of skin lesions that can be used to diagnose a condition.

The patients were mostly between the ages of 35 and 70. The ground truth of the data set was represented by the technical validation field category, which revealed how the skin lesion diagnosis was made. Ground truths were divided into four categories by the researchers, namely, Histopathology, Confocal, Follow-up, and Consensus. In the Histopathology category, dermatopathologists diagnosed excised lesions histopathologically. All images were manually evaluated with the relevant histopathologic diagnosis and confirmed for plausibility by the researchers. In the Confocal category, the reflectance confocal microscopy method is used that provides near-cellular resolution, and it was used to confirm the presence of some benign keratoses on the face. In the Follow-up category, the researchers recognized images as proof of biological benignity, if nevi examined with digital dermoscopy confirmed no modifications during 3 follow-up visits or 1.5 years. The consensus category consists of normal benign instances with no follow-up or histology, as well as examples in which two experts have given the same unequivocal benign diagnosis.

Histopathology was used to diagnose more than half of the skin lesions. The back, lower limbs, and trunk are all significantly impacted skin cancer locations, as demonstrated in the data set’s localization distribution. In terms of diagnostic skin lesion categories, the data set included seven different classes. Figure 1 depicts a selection of sample images from the HAM10000 dataset for each of the classes. The following are the seven categories:

Sample images for the seven skin lesion categories from HAM10000 dataset.

Actinic Keratoses [akiec]: Types of squamous cell carcinoma that are noninvasive and can be treated locally without surgery (327 images are available in the data set).
Basal Cell Carcinoma [bcc]: A type of epithelial skin cancer that seldom spreads but, if left untreated, can be fatal. (514 images are available in the data set).
Benign Keratosis-like Lesions [bkl]: Seborrheic keratoses, lichen-planus like keratoses, and solar lentigo, correlate to a seborrheic keratosis or a sun lentigo with regression and inflammation, are all examples of “benign keratosis” (1099 images are available in the data set).
Dermatofibroma [df]: Skin lesions that are either benign growth or an inflammatory response to minor trauma (115 images are available in the data set).
Melanoma [mel]: Melanoma is a cancerous tumour that develops from melanocytes and can take many different forms. If caught early enough, it can be treated with a basic surgical procedure (1113 images are available in the data set).
Melanocytic Nevi [nv]: Skin lesions are benign neoplasms of melanocytes and appear in a variety of shapes and sizes. From a dermatoscopic standpoint, the variants may differ dramatically (6705 images are available in the data set).
Vascular Lesions [vasc]: Cherry angiomas, angiokeratomas, and pyogenic granulomas are examples of benign or malignant angiomas. (142 images are available in the data set).

The proposed work is organized as follows: Section “Literature survey” contains information on similar research in this field. Section “Proposed methodology” details the methodology performed on the data. Section “Results and discussion” discusses all of the model’s performances; Section “Conclusion and future work” contains the conclusions derived from the research and the future work and next steps for this project.

Literature survey

Dhivyaa et al.¹ have proposed a model that can produce feature maps of high resolution that can be used to assist in the preservation of the image’s spatial information. On two separate datasets, the authors proposed Random Forest and Decision Tree algorithms for skin class categorization. Polat, Kemal, and Kaan Onur Koc⁵ have proposed a system that uses no filtering and feature extraction. Authors claim to have obtained very encouraging results in the identification of skin lesions. Kumar et al.⁶ have proposed a system with RGB color-space, GLCM, and Local Binary Pattern (LBP) methods for pre-processing and image segmentation that uses fuzzy-c clustering and obtained encouraging results in the identification of skin lesions. Adegun and Viriri⁷ have proposed a system with an encoder-decoder Conditional Random Field (CRF) module for contour refining and lesion boundary localization that uses a linear combination of Gaussian kernels for paired edge potentials. A Fully Convolutional Network (FCN) with hyper-parameters tuning gave good results.

Data augmentation strategies have been proposed by Srinivasu et al.⁸ to balance various forms of lesions to the same range of images. The proposed model, which is predicated on the LSTM and MobileNet V2 approaches, was found to be effective in classifying and detecting skin diseases with little effort and computational resources. When using CNN transfer learning, Mahbod et al.⁹ confirmed that image size affects skin lesion categorization performance. They also showed that image cropping outperforms image resizing in terms of performance. Finally, the best classification performance is demonstrated using a simple ensembling strategy that merges the findings from images clipped at six scales and three fine-tuned CNNs. Zhang et al.¹⁰ presented an efficiency result of CNN that was optimized using an upgraded version of the whale optimization technique. This technique is used to find the best weights and biases in the network to reduce the difference between the network output and the desired output.

Hameed et al.¹¹ proposed a Multi-Class Multilevel (MCML) classification technique inspired by the “divide and conquer” strategy. The proposed classification algorithm combines machine learning and deep learning methods. Hasan et al.¹² proposed the DSNet, an automatic semantic segmentation network for skin lesions. To reduce the number of parameters and make the network lighter, they used a separable depth-wise convolution. Hosny et al.¹³ used pre-trained AlexNet with transfer learning. As initial values, the parameters from the original model were used. and the weights of the last three replaced layers were randomly initialized. ISIC 2018, the most recent public dataset, was used to test the suggested technique. Chatterjee et al.¹⁴ used SVM with RBF to identify three lesions by extracting form, fractal dimension, texture, and color variables using fractal-based regional texture analysis (FRTA). Pereira et al.¹⁵ proposed integrating gradients with the local binary patterns (LBP) technique to increase the performance of skin lesion classification algorithms and to further exploit the border-line properties of the lesion segmentation mask.

A kernel sparse coding approach for both segmentation and classification of skin lesions¹⁶. A skin lesion detection system was proposed by Garcia-Arroyo et al.¹⁷ with fuzzy histogram thresholding, lesions were segmented. Using the ABCD rule, Zaqout¹⁸ has developed a model capable of partitioning and classifying skin pictures. In the majority of these efforts, the ABCD rule is followed, which includes image pre-processing, segmentation to locate the lesion, feature extraction, and classification. TDS is a dermoscopy score that assists in the diagnosis of the lesion condition. Khan et al.¹⁹ have proposed a system that uses the deep learning model MASK-RCNN for segmentation and pre-trained DenseNet for classification. Using YOLOv5, Shelatkar et al.²⁰ describe a deep learning-based method for classifying and identifying brain tumours. To extract the features, a transfer learning concept is used which is then exposed to selection and classification stages. Khan et al.²¹ developed an automated method for skin lesion categorization that used pre-trained RESNET-50 and RESNET-101 deep neural network (DCNN) with transfer learning was employed for feature extraction and optimal feature selection based on the kurtosis-controlled principle component (KcPCA). The information was then combined and the best features were chosen, which were then fed into a supervised learning algorithm—SVM of kernel function radial basis function (RBF) for classification.

Khan et al.²² developed a segmentation and classification framework based on deep learning. For skin lesion segmentation, a MASK R-CNN-based architecture with a Resnet50 feature pyramid network (FPN) is used. The final mask is then generated by mapping connected layer-based features. A 24-layer convolutional neural network architecture is built during the classification phase, with activation based on the display of higher characteristics. Finally, the best CNN features are delivered to softmax classifiers for final classification. Harris Hawks Optimization with Deep Learning Model for Detection of Diabetic Retinopathy was proposed by Gundluru et al.²³. Tajeddin et al.²⁴ used highly discriminative characteristics to classify skin melanoma. For lesion segmentation, the authors started with contour propagation. To extract features, lesions were mapped in log-polar space using Daugman’s transformation based on the peripheral area. Finally, the various classifiers used in the proposed work were evaluated.

To construct a dermoscopic skin image recognition system, Yu et al.²⁵ recommended CNN and the local descriptor encoding approach. To extract skin lesion features from images, the authors utilized ResNet101 and ResNet50. Using a Fisher vector (FV) and the collected ResNet features, a global image representation was generated. Finally, a Chi-squared kernel was applied in an SVM for classification. Melanoma was categorized into three groups based on the thickness of the lesion²⁶. The researchers utilized two categorization schemata: one classified lesion as thin or thick, and the other separated them into thin, moderate, and thick categories. To categorize the lesion data, a combination of ANN and logistic regression algorithms is used. Using a cloud computing system, Rajput et al.²⁷ suggested diabetes diagnosis to Indians living in rural locations. To construct a breast cancer detection system, Abbas et al.² proposed the Extremely Randomized Tree and Whale Optimization Algorithm (WOA) to present a unique approach called BCD-WERT for efficient feature selection and classification. To detect ships from satellite imaginary, deep CNN with YOLOv3 for object detection with SHA-256 hashing for security to the detected images³. For the purpose of diagnosing heart disease, Reddy et al.²⁸ suggested a hybrid genetic algorithm and a fuzzy logic classifier.

To conclude, many researchers have contributed to classifying the skin lesion categories using distinct machine learning and deep learning approaches. Also, they have worked on a variety of data sets. The proposed work mainly aims at classifying the skin lesion categorization on the HAM10000 dataset, which is used by a few researchers. The proposed work concentrates on categorizing the skin lesion into seven classes using different machine learning and Convolutional Neural Network technique and obtained comparatively better results.

The main contributions of the proposed work

The HAM10000 dataset images are highly unbalanced. For instance, in the dataset, we observe that Dermatofibroma (df) skin lesion class has a count of 115 images which is the smallest size. The advantage of the proposed work is that we developed a model which is computationally much efficient as compared to existing work as they have considered the entire dataset. This is because a majority of the existing work has considered the entire unbalanced dataset and then they have augmented it to make sure the dataset is balanced. Here they have considered the small class training set (For instance Dermatofibroma (df) skin lesion class) which is augmented around 50 times, which we have avoided in our proposed work. The other contributions include:

Resizing the medical dermoscopic lesion images to reduce memory consumption and improve latency.
Data augmentation to overcome the limited number of images and reduce overfitting.
Global feature descriptors allow the system to extract skin lesion features efficiently even with a little training dataset.
The proposed customized convolution neural network has hyperparameters to train the model and also, the softmax function is used at the end of the fully connected layer of the CNN. Hence the proposed CNN model works well with binary and multi-class detection.
To make the model more trustworthy, quicker, and error-free, different evaluation metrics, and 10-fold cross-validation are applied.
Pipeline for developing a support tool for skin lesion medical practitioners.

Proposed methodology

We developed a fully automated approach for detecting and classifying skin lesions using Machine Learning and customized Convolutional Neural Networks. The proposed work concentrated on pre-processing and classification. The standard HAM10000 dataset is used in the proposed work which contains 10015 skin lesion images divided into seven categories. The steps involved in the proposed work is depicted in Fig. 2.

Image pre-processing

The proposed work applied the following image pre-processing steps.

Step 1 - ordering the dataset

As the dataset images are out of order, sorting each image within each folder by the seven diseases is the necessary step. ’Image id’ and ’dx’ were the most crucial parameters for arranging the images in this scenario. In the dataset, we observe that df skin lesion count has a number 115 which is the smallest size. As a result, choosing 100 images per class and using a dataset of 100*7 images to train the model is insufficient to gain better classification accuracy. As a result, more data will be generated, and data augmentation will be used to achieve this task.

Step 2 - Image resizing

All the images in the folder are resized to 220*220 before processing into different machine learning models.
For the customized CNN model, images are scaled to 96 × 96 with a depth of 3 to speed up the process. Then we have converted the images into a NumPy array to get the value of each pixel of the image. Then we normalized the pixel values to a range of 0–1. The LabelBinarizer class allows us to input class labels that are in string form in the dataset, convert those class labels into one-hot encoded vectors, and then convert them back into a human-readable form from the integer class label prediction of Keras CNN.

Step 3 - Data augmentation

Data augmentation is a technique for generating new “data”. To train the machine learning models, the proposed method used Horizontal Flip augmentation i.e., shifting all pixels of an image in a horizontal direction. As a result, models with data augmentation are more likely to learn more differentiating characteristic features than models without data augmentation. We have 200 images from each class after augmentation and trained the model with a dataset of 200*7 images. Figure 3 depicts the sample images after Horizontal Flip augmentation.
In the CNN model since we are working with a finite number of data points (each class has 200 images), we have applied random transformations (rotations, shearing, etc.) to train. Each epoch had the same number of images as the original images. Overfitting is also avoided by data augmentation.

Feature extraction

Global Feature Descriptors are used to quantify an image in its entirety. These don’t have the concept of interest points and thus, take the entire image for processing. The color of the skin lesion image is quantified using a Colour Histogram. The shape of the skin lesion is quantified using Hu Moments. The texture of the skin lesion is quantified using a Haralick Texture. These features are chosen because the color, shape, and texture are the only properties that dominate in the lesion zone. The feature extraction experiment works with one image at a time, extracting three global features, concatenating them into a single global feature, and saving it in HDF5 format with its label.

Data splitting

The OpenCV application was used to validate the machine learning models. The total number of images obtained from the HAM1000 dataset for Machine Learning model training is 700 (100 images from each class), 560 of which images represent 80% for training and 140 images represent 20% for testing. Due to the massive class imbalance revealed by the data set, this was necessary. After augmenting the dataset, 1400 images (200 images from each class) are used for ML model training, with 1120 images accounting for 80% of the training and 280 images accounting for 20% of the testing.

Image classification

The proposed work used machine learning models and a Convolutional Neural Network model to train and test the image dataset and evaluated the performance using the various parameters, namely, Accuracy, Precision, Recall, and F1-score. The various machine learning models used in the proposed work include Decision Tree (DT), Random Forest (RF), Support Vector Machine (SVM), K-Nearest Neighbor (KNN), Logistic Regression (LR), Gaussian Naïve Bayes (NB), Linear Discriminant Analysis (LDA) at the end, we have compared the various models in terms of the evaluation parameters. The hyperparameters used by the machine learning algorithms are shown in Table 1.

Table 1.

Machine learning model’s hyperparameter.

Classifier	Hyperparameter value
LR	$r a n d o m_s t a t e$ = 9
LDA	solver = ’svd’
KNN	$n_n e i g h b o r s$ = 5
DT	Estimators = 100
RF	$n_e s t i m a t o r s$ = 200, $r a n d o m_s t a t e$ = 0
GaussianNB	$v a r_s m o o t h i n g$ = 1e−09
SVM	Kernel = ’linear’, c = 1, $r a n d o m_s t a t e$ = 0

Open in a new tab

Convolutional neural network

In contrast to a standard neural net, a CNN learns detailed patterns by applying filters to the raw pixels of an image. To build a CNN, Tensorflow and Keras libraries were used to build and implement the model in Python 3.7.9. A high-level overview of CNN Architecture is shown in Fig. 4. The layers and hyperparameters employed in the network are summarized in Table 2.

A high-level overview of CNN Architecture.

Table 2.

CNN layers and hyperparameters

Layer	Hyperparameters
Conv2D	32 filters, 3 × 3 filter size, ReLU activation, same padding, followed by batch normalization
MaxPool2D	3 × 3 pool size to reduce image spatial dimensions quickly from 96 × 96 to 32 × 32
Dropout (Core Layer)	0.25 Neurons
Conv2D	64 filters, 3 × 3 filter size, ReLU activation, same padding
Conv2D	64 filters, 3 × 3 filter size, ReLU activation, following the same padding, batch normalization is performed
MaxPool2D	2 × 2 pool size
Dropout (Core Layer)	0.25 Neurons
Conv2D	128 filters, 3 × 3 filter, ReLU activation, following the same padding, batch normalization is performed
Conv2D	128 filters, 3 × 3 filter size, ReLU activation, same padding followed by batch normalization
MaxPool2D	2 × 2 pool size
Dropout (Core Layer)	0.25 Neurons
Flatten (Core Layer)	–
Dense	1024 Units, ReLU sctivation, and batch normalization
Dropout (Core Layer)	0.5 Neurons
Dense	7 Units, softmax activation

Open in a new tab

Model hyperparameters

To get a better model evaluation, certain common hyperparameter values are used. The hyperparameter values utilized in the CNN model are highlighted in Table 3. The following section explains why the values of the hyperparameters were chosen in the proposed work: Optimizer: Adam is the most widely used optimization method for training deep neural networks today because it is simple to use, computationally efficient, and effective when dealing with enormous amounts of data and parameters. Loss Function: The Multi-Class calculates the loss value using the “categorical cross-entropy” loss function. Epochs: The epoch count is 150. Found that 150 epochs result in a model with low loss and no overfitting to the training set through experimentation (or not overfitted as best as we can). Batch Size: Several early tests with batch sizes of 5, 10, 20, and 40 found that batch size 32 produced the best results. Learning Rate: The rate of learning is initially set to 0.001. The “step” we take along the gradient is controlled by the learning rate. The smaller the value, the smaller the step, and the larger the value, the bigger the step.

Table 3.

CNN model’s hyperparameters

Hyperparameter	Value
Optimizer	Adam
Loss function	Categorical cross-entropy
Epochs	150
Batch dize	32
Learning rate	0.001–0.00001

Open in a new tab

Results and discussion

All of the Machine Learning models and CNN were trained and tested on a Windows 10 computer with an Intel i5 processor and 8GB of RAM. The models were created with Spyder 5 and Python 3.7.9, with Keras, Imutils, and cv2Numpy as dependencies. The accuracy of the machine learning models with two methods involving/not involving augmentation is shown in Table 4.

Table 4.

Accuracy of the machine learning models.

	LR	LDA	KNN	DT	RF	NB	SVM
Without augmentation for (100*7) images	0.48929	0.35179	0.41786	0.44286	0.58571	0.34107	0.42143
With horizontal flip augmentation for (200*7) images	0.58125	0.57589	0.48393	0.68660	0.87321	0.363393	0.53125

Open in a new tab

From the Table 4 results, we have observed that the Random Forest Machine algorithm provides better accuracy compared to other machine learning algorithms. The experimental results of CNN and Machine learning outcomes with the use of model accuracy and weighted average of precision, recall and F1-score is shown in Table 5.When using K-fold cross-validation, the accuracy measure is the mean of the accuracies of the K-models, not simply the accuracy of one model.

Table 5.

Model accuracy, weighted average of precision, recall and F1-score.

Metrics	Model
Metrics	CNN	RF	DT	LR	LDA	SVM	KNN	NB
Accuracy (%)	94	87	68	58	57	53	48	36
Precision (%)	88	94	75	56	62	54	54	49
Recall (%)	85	94	74	55	58	50	50	36
F1-score (%)	86	94	74	55	54	50	50	35

Open in a new tab

Table 6 gives the customized CNN model’s accuracy and loss for training and testing sets. As seen in this table, the proposed customized CNN model has a performance difference of 9% between the training and testing accuracy. After 150 epochs of training, the model achieved low loss with minimal overfitting. We could also improve our accuracy by adding more training data.

Table 6.

CNN model training and validation.

CNN model
	Train	Test
Accuracy (%)	95.18	86.43
Loss	0.1483	0.4690

Open in a new tab

Figure 5 depict the CNN model’s training history versus the number of epochs. This is an important visualization to ensure that the model continued to train with each epoch, improving its accuracy and reducing losses while aiming to optimize the objective function. The figures are visible in the code. The testing sets accuracy, recall, precision, and f1-score associated with the model are shown in Table 7.

CNN Model’s Accuracy/loss.

Model’s performance measure comparison.

Table 7.

Multi-class classification report of the customised CNN model.

	Precision	Recall	F1-score	Support
0-Actinic keratoses	0.89	0.68	0.77	37
1-Basal cell carcinoma	0.55	0.88	0.67	33
2-Benign keratosis-like lesions	0.89	0.94	0.92	35
3-Dermatofibroma	0.88	0.81	0.84	43
4-Melanoma	0.97	0.89	0.93	44
5-Melanocytic nevi	0.93	0.86	0.89	44
6-Vascular lesions	0.95	0.89	0.92	44
Macro avg	0.87	0.85	0.85	280
Weighted avg	0.88	0.85	0.86	280

Open in a new tab

In Fig. 6 we compare proposed CNN and Machine learning outcomes with the use of model accuracy and weighted average of precision, recall, and f1-score. This is the graphical representation of Table 5. Table 8 gives the comparison of the accuracy obtained in the proposed work with the recent existing work on the HAM10000 dataset. The first row in this table gives the accuracy of the proposed CNN model.

Table 8.

Comparison of proposed work with recent existing techniques on the HAM10000 dataset.

Existing work versus proposed work	Accuracy (%)
Proposed CNN	95.18
EW-FCM+wide-shufflenet²⁹	84.80
Shifted MobileNetV2³⁰	81.90
Shifted GoogLeNet³⁰	80.50
Shifted 2-Nets³⁰	83.20
MobileNetV2-LSTM⁸	85.34
24 layered CNN²²	86.50
InceptionV3³¹	91.56
ResNetXt101³¹	93.20
InceptionResNetV2³¹	93.20
Xception³¹	91.47
NASNetLarge³¹	91.11
9 layered CNN³²	80.00
Resnet50, Restnet101+KcPCA+SVM RBF²¹	89.80
Vgg16+googLeNet ensemble³³	81.50
Modified MobileNet³⁴	83.93

Open in a new tab

We have also implemented Inception V3, a pre-trained model, and compared the accuracy with the proposed work. The InceptionV3 model resulted in an accuracy of 95.14%. Even though there is not much difference in the accuracy, the proposed CNN model has computationally less overhead in terms of total training time as compared to the InceptionV3 model due to more layers in the network.

Conclusion and future work

The proposed work has applied machine learning and CNN techniques to classify the skin lesion images. The experiments were conducted on the HAM10000 dataset. The machine learning and the customized CNN techniques were evaluated after the experiments based on Accuracy, Precision, Recall, and F1-Score. Before the training/testing phase, the images were pre-processed, then separated into feature and target values, and formed data augmentation. The results show that the customized CNN has obtained an accuracy of 95.18%, which is better than the proposed machine learning algorithms. This suggests that the proposed CNN has a better classification performance for the HAM10000 data set. The proposed work has been compared with the recent existing work on the same data set and proved to obtain better accuracy with minimum loss and errors. As a future work, researchers can improve CNN architecture and implementation by fine-tuning hyper parameters such as the number of layers, type of layers, and hyper parameter values for the layers and can explore other pre-trained CNN models. Researchers can also focus on image segmentation and Skin lesion categorization in real-time with better accuracy and minimum time.

Author contributions

B.S. and A.P.R. have carried out the experiments; R.F., R.C. and S.B. have equally contributed towards the revision process and manuscript preparation; K.L. has validated the results and reviewed the entire article.

Funding

We would like to thank the School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, India - 632014 for paying APC for this study.

Data availability

HAM10000 dataset used in the experiment is available on web publicly at https://www.kaggle.com/datasets/kmader/skin-cancer-mnist-ham10000.

Competing interests

The authors declare no competing interests.

Footnotes

The original online version of this Article was revised: The Funding section in the original version of this Article was omitted. The Funding section now reads: “We would like to thank the School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, India - 632014 for paying APC for this study.”

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Change history

12/19/2022

A Correction to this paper has been published: 10.1038/s41598-022-26516-0

References

1.Dhivyaa CR, et al. Skin lesion classification using decision trees and random forest algorithms. J. Ambient Intell. Hum. Comput. 2020 doi: 10.1007/s12652-020-02675-8. [DOI] [Google Scholar]
2.Abbas S, Jalil Z, Javed AR, Batool I, Khan MZ, Noorwali A, Gadekallu TR, Akbar A. BCD-WERT: A novel approach for breast cancer detection using whale optimization based efficient features and extremely randomized tree algorithm. PeerJ Comput. Sci. 2021;7:e390. doi: 10.7717/peerj-cs.390. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Gadamsetty S, Ch R, Ch A, Iwendi C, Gadekallu T. Hash-based deep learning approach for remote sensing satellite imagery detection. Water. 2022;14:707. doi: 10.3390/w14050707. [DOI] [Google Scholar]
4.Tschandl P, Rosendahl C, Kittler H. The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions. Sci. Data. 2018;5:180161. doi: 10.1038/sdata.2018.161. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Polat K, Koc KO. Detection of skin diseases from dermoscopy image using the combination of convolutional neural network and one-versus-all. J. Artif. Intell. Syst. 2020;2(1):80–97. doi: 10.33969/AIS.2020.21006. [DOI] [Google Scholar]
6.Kumar M, et al. A de-ann inspired skin cancer detection approach using fuzzy c-means clustering. Mob. Netw. Appl. 2020;25:1319–1329. doi: 10.1007/s11036-020-01550-2. [DOI] [Google Scholar]
7.Adegun A, Viriri S. FCN-based DenseNet framework for automated detection and classification of skin lesions in dermoscopy images. IEEE Access. 2020;8:150377–150396. doi: 10.1109/ACCESS.2020.3016651. [DOI] [Google Scholar]
8.Srinivasu PN, et al. Classification of skin disease using deep learning neural networks with MobileNet V2 and LSTM. Sensors. 2021;21(8):2852. doi: 10.3390/s21082852. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Mahbod A, Schaefer G, Wang C, Dorffner G, Ecker R, Ellinger I. Transfer learning using a multi-scale and multi-network ensemble for skin lesion classification. Comput. Methods Progr. Biomed. 2020;193:105475. doi: 10.1016/j.cmpb.2020.105475. [DOI] [PubMed] [Google Scholar]
10.Zhang N, Cai Y, Wang Y, Tian Y, Wang X, Badami B. Skin cancer diagnosis based on optimized convolutional neural network. Artif. Intell. Med. 2020;102:101756. doi: 10.1016/j.artmed.2019.101756. [DOI] [PubMed] [Google Scholar]
11.Hameed N, Shabut AM, Ghosh MK, Hossain MA. Multi-class multi-level classification algorithm for skin lesions classification using machine learning techniques. Expert Syst. Appl. 2020;141:112961. doi: 10.1016/j.eswa.2019.112961. [DOI] [Google Scholar]
12.Hasan K, Dahal L, Samarakoon PN, Tushar FI, Martí R. DSNet: Automatic dermoscopic skin lesion segmentation. Comput. Biol. Med. 2020;120:103738. doi: 10.1016/j.compbiomed.2020.103738. [DOI] [PubMed] [Google Scholar]
13.Hosny KM, Kassem MA, Foaud MM. Classification of skin lesions into seven classes using transfer learning with AlexNet. J. Digit. Imaging. 2020;33:1325–1334. doi: 10.1007/s10278-020-00371-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Chatterjee S, Dey D, Munshi S. Integration of morphological preprocessing and fractal-based feature extraction with recursive feature elimination for skin lesion types classification. Comput. Methods Progr. Biomed. 2019;178:201–218. doi: 10.1016/j.cmpb.2019.06.018. [DOI] [PubMed] [Google Scholar]
15.Pereira PMM, Fonseca-Pinto R, Paiva RP, Assuncao PAA, Tavora LMN, Thomaz LA, Faria SMM. Skin lesion classification enhancement using border-line features–The melanoma vs nevus problem. Biomed. Signal Process. Control. 2020;2020:57. [Google Scholar]
16.Moradi N, Mahdavi-Amiri N. Kernel sparse representation based model for skin lesions segmentation and classification. Comput. Methods Programs Biomed. 2019;182:105038. doi: 10.1016/j.cmpb.2019.105038. [DOI] [PubMed] [Google Scholar]
17.Garcia-Arroyo JL, Garcia-Zapirain B. Segmentation of skin lesions in dermoscopy images using fuzzy classification of pixels and histogram thresholding. Comput. Methods Programs Biomed. 2019;168:11–19. doi: 10.1016/j.cmpb.2018.11.001. [DOI] [PubMed] [Google Scholar]
18.Zaqout I. Diagnosis of skin lesions based on dermoscopic images using image processing techniques. J. Pattern Recogn. Sel. Methods Appl. 2019;2019:189–204. [Google Scholar]
19.Khan MA, et al. Attributes based skin lesion detection and recognition: A mask RCNN and transfer learning-based deep learning framework. Pattern Recogn. Lett. 2021;143:58–66. doi: 10.1016/j.patrec.2020.12.015. [DOI] [Google Scholar]
20.Shelatkar T, Urvashi D, Shorfuzzaman M, Alsufyani A, Lakshmanna K. Diagnosis of brain tumor using light weight deep learning model with fine-tuning approach. Comput. Math. Methods Med. 2022 doi: 10.1155/2022/2858845. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Khan, M. A., Javed, M. Y., Sharif, M., Saba, T. & Rehman, A. Multi-model deep neural network based features extraction and optimal selection approach for skin lesion classification. In International Conference on Computer and Information Sciences (ICCIS) 1–7. 10.1109/ICCISci.2019.8716400 (2019).
22.Khan MA, Kadry S, Zhang YD, Akram T, Sharif M, Rehman A, Saba T. Pixels to classes: Intelligent Learning framework for multiclass skin lesion localization and classification. Comput. Electr. Eng. 2021;90:106956. doi: 10.1016/j.compeleceng.2020.106956. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Gundluru N, Rajput DS, Lakshmanna K, Kaluri R, Shorfuzzaman M, Uddin M, Rahman Khan MA. Enhancement of detection of diabetic retinopathy using Harris Hawks optimization with deep learning model. Comput. Intell. Neurosci. 2022 doi: 10.1155/2022/8512469. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Tajeddin NZ, Asl BM. Melanoma recognition in dermoscopy images using lesion’s peripheral region information. Comput. Methods Programs Biomed. 2018;163:143–153. doi: 10.1016/j.cmpb.2018.05.005. [DOI] [PubMed] [Google Scholar]
25.Yu Z, Jiang X, Zhou F, Qin J, Ni D, Chen S, Lei B, Wang T. Melanoma recognition in dermoscopy images via aggregated deep convolutional features. IEEE Trans. Biomed. Eng. 2019;66:1006–1016. doi: 10.1109/TBME.2018.2866166. [DOI] [PubMed] [Google Scholar]
26.Sáez A, Sánchez-Monedero J, Gutiérrez PA, Hervás-Martínez C. Machine learning methods for binary and multiclass classification of melanoma thickness from dermoscopic images. IEEE Trans. Med. Imaging. 2016;35:1036–1045. doi: 10.1109/TMI.2015.2506270. [DOI] [PubMed] [Google Scholar]
27.Rajput DS, Basha SM, Xin Q, Gadekallu TR, Kaluri R, Lakshmanna K, Maddikunta PKR. Providing diagnosis on diabetes using cloud computing environment to the people living in rural areas of India. J. Ambient Intell. Humaniz. Comput. 2022;13(5):2829–2840. doi: 10.1007/s12652-021-03154-4. [DOI] [Google Scholar]
28.Reddy GT, Reddy M, Lakshmanna K, Rajput DS, Kaluri R, Srivastava G. Hybrid genetic algorithm and a fuzzy logic classifier for heart disease diagnosis. Evol. Intell. 2020;13(2):185–196. doi: 10.1007/s12065-019-00327-1. [DOI] [Google Scholar]
29.Hoang L, Lee SH, Lee EJ, Kwon KR. Multiclass skin lesion classification using a novel lightweight deep learning framework for smart healthcare. Appl. Sci. 2022;12(5):2677. doi: 10.3390/app12052677. [DOI] [Google Scholar]
30.Thurnhofer-Hemsi K, López-Rubio E, Domínguez E, Elizondo DA. Skin lesion classification by ensembles of deep convolutional networks and regularly spaced shifting. IEEE Access. 2021;9:112193–112205. doi: 10.1109/ACCESS.2021.3103410. [DOI] [Google Scholar]
31.Chaturvedi SS, Tembhurne JV, Diwan T. A multi-class skin cancer classification using deep convolutional neural networks. Multimed. Tools Appl. 2020;79(39):28477–28498. doi: 10.1007/s11042-020-09388-2. [DOI] [Google Scholar]
32.Nugroho AA, Slamet I, Sugiyanto Skins cancer identification system of HAMl0000 skin cancer dataset using convolutional neural network. AIP Conf. Proc. 2019;2202(1):020039. doi: 10.1063/1.5141652. [DOI] [Google Scholar]
33.Mobiny A, Singh A, Van Nguyen H. Risk-aware machine learning classifier for skin lesion diagnosis. J. Clin. Med. 2019;8(8):1241. doi: 10.3390/jcm8081241. [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Sae-Lim, W., Wettayaprasit, W., Aiyarak, P. Convolutional neural networks using MobileNet for skin lesion classification. In 2019 16th International Joint Conference on Computer Science and Software Engineering (JCSSE) 242–247 (2019).

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

HAM10000 dataset used in the experiment is available on web publicly at https://www.kaggle.com/datasets/kmader/skin-cancer-mnist-ham10000.

[CR1] 1.Dhivyaa CR, et al. Skin lesion classification using decision trees and random forest algorithms. J. Ambient Intell. Hum. Comput. 2020 doi: 10.1007/s12652-020-02675-8. [DOI] [Google Scholar]

[CR2] 2.Abbas S, Jalil Z, Javed AR, Batool I, Khan MZ, Noorwali A, Gadekallu TR, Akbar A. BCD-WERT: A novel approach for breast cancer detection using whale optimization based efficient features and extremely randomized tree algorithm. PeerJ Comput. Sci. 2021;7:e390. doi: 10.7717/peerj-cs.390. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR3] 3.Gadamsetty S, Ch R, Ch A, Iwendi C, Gadekallu T. Hash-based deep learning approach for remote sensing satellite imagery detection. Water. 2022;14:707. doi: 10.3390/w14050707. [DOI] [Google Scholar]

[CR4] 4.Tschandl P, Rosendahl C, Kittler H. The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions. Sci. Data. 2018;5:180161. doi: 10.1038/sdata.2018.161. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR5] 5.Polat K, Koc KO. Detection of skin diseases from dermoscopy image using the combination of convolutional neural network and one-versus-all. J. Artif. Intell. Syst. 2020;2(1):80–97. doi: 10.33969/AIS.2020.21006. [DOI] [Google Scholar]

[CR6] 6.Kumar M, et al. A de-ann inspired skin cancer detection approach using fuzzy c-means clustering. Mob. Netw. Appl. 2020;25:1319–1329. doi: 10.1007/s11036-020-01550-2. [DOI] [Google Scholar]

[CR7] 7.Adegun A, Viriri S. FCN-based DenseNet framework for automated detection and classification of skin lesions in dermoscopy images. IEEE Access. 2020;8:150377–150396. doi: 10.1109/ACCESS.2020.3016651. [DOI] [Google Scholar]

[CR8] 8.Srinivasu PN, et al. Classification of skin disease using deep learning neural networks with MobileNet V2 and LSTM. Sensors. 2021;21(8):2852. doi: 10.3390/s21082852. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR9] 9.Mahbod A, Schaefer G, Wang C, Dorffner G, Ecker R, Ellinger I. Transfer learning using a multi-scale and multi-network ensemble for skin lesion classification. Comput. Methods Progr. Biomed. 2020;193:105475. doi: 10.1016/j.cmpb.2020.105475. [DOI] [PubMed] [Google Scholar]

[CR10] 10.Zhang N, Cai Y, Wang Y, Tian Y, Wang X, Badami B. Skin cancer diagnosis based on optimized convolutional neural network. Artif. Intell. Med. 2020;102:101756. doi: 10.1016/j.artmed.2019.101756. [DOI] [PubMed] [Google Scholar]

[CR11] 11.Hameed N, Shabut AM, Ghosh MK, Hossain MA. Multi-class multi-level classification algorithm for skin lesions classification using machine learning techniques. Expert Syst. Appl. 2020;141:112961. doi: 10.1016/j.eswa.2019.112961. [DOI] [Google Scholar]

[CR12] 12.Hasan K, Dahal L, Samarakoon PN, Tushar FI, Martí R. DSNet: Automatic dermoscopic skin lesion segmentation. Comput. Biol. Med. 2020;120:103738. doi: 10.1016/j.compbiomed.2020.103738. [DOI] [PubMed] [Google Scholar]

[CR13] 13.Hosny KM, Kassem MA, Foaud MM. Classification of skin lesions into seven classes using transfer learning with AlexNet. J. Digit. Imaging. 2020;33:1325–1334. doi: 10.1007/s10278-020-00371-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR14] 14.Chatterjee S, Dey D, Munshi S. Integration of morphological preprocessing and fractal-based feature extraction with recursive feature elimination for skin lesion types classification. Comput. Methods Progr. Biomed. 2019;178:201–218. doi: 10.1016/j.cmpb.2019.06.018. [DOI] [PubMed] [Google Scholar]

[CR15] 15.Pereira PMM, Fonseca-Pinto R, Paiva RP, Assuncao PAA, Tavora LMN, Thomaz LA, Faria SMM. Skin lesion classification enhancement using border-line features–The melanoma vs nevus problem. Biomed. Signal Process. Control. 2020;2020:57. [Google Scholar]

[CR16] 16.Moradi N, Mahdavi-Amiri N. Kernel sparse representation based model for skin lesions segmentation and classification. Comput. Methods Programs Biomed. 2019;182:105038. doi: 10.1016/j.cmpb.2019.105038. [DOI] [PubMed] [Google Scholar]

[CR17] 17.Garcia-Arroyo JL, Garcia-Zapirain B. Segmentation of skin lesions in dermoscopy images using fuzzy classification of pixels and histogram thresholding. Comput. Methods Programs Biomed. 2019;168:11–19. doi: 10.1016/j.cmpb.2018.11.001. [DOI] [PubMed] [Google Scholar]

[CR18] 18.Zaqout I. Diagnosis of skin lesions based on dermoscopic images using image processing techniques. J. Pattern Recogn. Sel. Methods Appl. 2019;2019:189–204. [Google Scholar]

[CR19] 19.Khan MA, et al. Attributes based skin lesion detection and recognition: A mask RCNN and transfer learning-based deep learning framework. Pattern Recogn. Lett. 2021;143:58–66. doi: 10.1016/j.patrec.2020.12.015. [DOI] [Google Scholar]

[CR20] 20.Shelatkar T, Urvashi D, Shorfuzzaman M, Alsufyani A, Lakshmanna K. Diagnosis of brain tumor using light weight deep learning model with fine-tuning approach. Comput. Math. Methods Med. 2022 doi: 10.1155/2022/2858845. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR21] 21.Khan, M. A., Javed, M. Y., Sharif, M., Saba, T. & Rehman, A. Multi-model deep neural network based features extraction and optimal selection approach for skin lesion classification. In International Conference on Computer and Information Sciences (ICCIS) 1–7. 10.1109/ICCISci.2019.8716400 (2019).

[CR22] 22.Khan MA, Kadry S, Zhang YD, Akram T, Sharif M, Rehman A, Saba T. Pixels to classes: Intelligent Learning framework for multiclass skin lesion localization and classification. Comput. Electr. Eng. 2021;90:106956. doi: 10.1016/j.compeleceng.2020.106956. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR23] 23.Gundluru N, Rajput DS, Lakshmanna K, Kaluri R, Shorfuzzaman M, Uddin M, Rahman Khan MA. Enhancement of detection of diabetic retinopathy using Harris Hawks optimization with deep learning model. Comput. Intell. Neurosci. 2022 doi: 10.1155/2022/8512469. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR24] 24.Tajeddin NZ, Asl BM. Melanoma recognition in dermoscopy images using lesion’s peripheral region information. Comput. Methods Programs Biomed. 2018;163:143–153. doi: 10.1016/j.cmpb.2018.05.005. [DOI] [PubMed] [Google Scholar]

[CR25] 25.Yu Z, Jiang X, Zhou F, Qin J, Ni D, Chen S, Lei B, Wang T. Melanoma recognition in dermoscopy images via aggregated deep convolutional features. IEEE Trans. Biomed. Eng. 2019;66:1006–1016. doi: 10.1109/TBME.2018.2866166. [DOI] [PubMed] [Google Scholar]

[CR26] 26.Sáez A, Sánchez-Monedero J, Gutiérrez PA, Hervás-Martínez C. Machine learning methods for binary and multiclass classification of melanoma thickness from dermoscopic images. IEEE Trans. Med. Imaging. 2016;35:1036–1045. doi: 10.1109/TMI.2015.2506270. [DOI] [PubMed] [Google Scholar]

[CR27] 27.Rajput DS, Basha SM, Xin Q, Gadekallu TR, Kaluri R, Lakshmanna K, Maddikunta PKR. Providing diagnosis on diabetes using cloud computing environment to the people living in rural areas of India. J. Ambient Intell. Humaniz. Comput. 2022;13(5):2829–2840. doi: 10.1007/s12652-021-03154-4. [DOI] [Google Scholar]

[CR28] 28.Reddy GT, Reddy M, Lakshmanna K, Rajput DS, Kaluri R, Srivastava G. Hybrid genetic algorithm and a fuzzy logic classifier for heart disease diagnosis. Evol. Intell. 2020;13(2):185–196. doi: 10.1007/s12065-019-00327-1. [DOI] [Google Scholar]

[CR29] 29.Hoang L, Lee SH, Lee EJ, Kwon KR. Multiclass skin lesion classification using a novel lightweight deep learning framework for smart healthcare. Appl. Sci. 2022;12(5):2677. doi: 10.3390/app12052677. [DOI] [Google Scholar]

[CR30] 30.Thurnhofer-Hemsi K, López-Rubio E, Domínguez E, Elizondo DA. Skin lesion classification by ensembles of deep convolutional networks and regularly spaced shifting. IEEE Access. 2021;9:112193–112205. doi: 10.1109/ACCESS.2021.3103410. [DOI] [Google Scholar]

[CR31] 31.Chaturvedi SS, Tembhurne JV, Diwan T. A multi-class skin cancer classification using deep convolutional neural networks. Multimed. Tools Appl. 2020;79(39):28477–28498. doi: 10.1007/s11042-020-09388-2. [DOI] [Google Scholar]

[CR32] 32.Nugroho AA, Slamet I, Sugiyanto Skins cancer identification system of HAMl0000 skin cancer dataset using convolutional neural network. AIP Conf. Proc. 2019;2202(1):020039. doi: 10.1063/1.5141652. [DOI] [Google Scholar]

[CR33] 33.Mobiny A, Singh A, Van Nguyen H. Risk-aware machine learning classifier for skin lesion diagnosis. J. Clin. Med. 2019;8(8):1241. doi: 10.3390/jcm8081241. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR34] 34.Sae-Lim, W., Wettayaprasit, W., Aiyarak, P. Convolutional neural networks using MobileNet for skin lesion classification. In 2019 16th International Joint Conference on Computer Science and Software Engineering (JCSSE) 242–247 (2019).

PERMALINK

Skin lesion classification of dermoscopic images using machine learning and convolutional neural network

Bhuvaneshwari Shetty

Roshan Fernandes

Anisha P Rodrigues

Rajeswari Chengoden

Sweta Bhattacharya

Kuruva Lakshmanna

Abstract

Introduction

Figure 1.

Literature survey

The main contributions of the proposed work

Proposed methodology

Figure 2.

Image pre-processing

Step 1 - ordering the dataset

Step 2 - Image resizing

Step 3 - Data augmentation

Figure 3.

Feature extraction

Data splitting

Image classification

Table 1.

Convolutional neural network

Figure 4.

Table 2.

Model hyperparameters

Table 3.

Results and discussion

Table 4.

Table 5.

Table 6.

Figure 5.

Table 7.

Figure 6.

Table 8.

Conclusion and future work

Author contributions

Funding

Data availability

Competing interests

Footnotes

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases