. 2022 Jul 13;12:893972. doi: 10.3389/fonc.2022.893972

Table 5.

Different methods for improving model generalization ability and robustness.

Ref.	Dataset	Highlights	Limitations	Performance
(95)	DermIS DermQuest	Investigated the advantages of large-scale supervised pre-training for medical imaging applications.	In addition to the analysis of the weights and features of the model, it is necessary to conduct a comprehensive analysis of other features such as network structure to explore the importance of pre-training.	Accuracy: 0.871 (DermIS)
(95)	DermIS DermQuest			Accuracy: 0.974 (DermQuest)
(96)	HAM10000 MoleMap	Proposed transfer learning and adversarial learning in skin disease classification to improve the generalization ability of models to new samples and reduce cross-domain shift.	When the data domain and target domain are significantly different, the method’s overall accuracy suffers.	Accuracy: 0.909
(96)	HAM10000 MoleMap			AUC: 0.967
(97)	HAM10000	Performed adversarial training on MobileNet and VGG-16 using the innovative attacking models FGSM and PGD for skin cancer classification.	The number of datasets tested for this experiment is very limited, and there may be local optimizations.	Accuracy: 0.7614
(98)	ISIC-2016	Proposed a comprehensive deep learning framework combining adversarial training and transfer learning for melanoma classification. At the same time, focal loss was introduced to iteratively optimize the network to better learn hard samples.	This method does not consider more types of skin diseases, and it had a high computational cost.	Accuracy: 0.812
(98)	ISIC-2016			Sensitivity: 0.918
(99)	ISIC2017 HAM10000	Presented a Multi-view Filtered Transfer Learning approach to extract useful information from the original samples for domain adaption, thereby improving representation ability for skin disease image.	The effectiveness of this domain adaptation method should be validated on more dermatology datasets.	Accuracy: 0.918
(99)	ISIC2017 HAM10000			AUC: 0.879
(100)	ISBI-2017, PH²	Proposed an adversarial training method combined with attention module to enhance the robustness of the model in skin-disease classification and segmentation.	Due to the limited amount of training data and the unclear boundaries of skin disease images, the model still suffers from under-segmentation and over-segmentation.	Accuracy: 0.968
				Sensitivity: 0.962
				Specificity: 0.941
(101)	ISIC-2018	Using seven universal adversarial perturbations to investigate the vulnerability of the classification model.	This method does not perform adversarial training on more skin disease datasets, so the robustness of its model needs to be further improved.	Accuracy: 0.873
(102)	ISIC-2019	Proposed Monte Carlo dropout, Ensemble MC dropout, and Deep Ensemble for uncertainty quantification.	Further optimization of the robustness of the model is required, and the model should also be tested for noise detection to provide a confidence score.	Accuracy: 0.90
(102)	ISIC-2019			AUC: 0.945
(103)	ISIC Archive MED-NODE Dermofit	Proposed a transfer learning method to address the shortage of data in skin lesion images. Also, they utilized a hybrid deep CNN model to accurately extract features and ensure training stability while avoiding overfitting.	The model requires a considerable amount of computational resources while also lacking the diversity of domains.	Accuracy: 0.853
(103)	ISIC Archive MED-NODE Dermofit			F1 score: 0.891
(104)	HAM10000, Dermofit, Derm7pt, MSK PH², SONIC, UDA	Proposed to improve the generalization performance of the model by combining data augmentation and domain alignment. Designed a Bayesian generative model for continual learning based on a fixed pretrained feature extractor.	Due to the privacy of medical images, this trained model may underperform on ethnic groups with a small proportion of the population.	Accuracy: 0.670
(105)	Skin7, Skin40		To increase the method’s overall performance, better pre-training of the extractor can be investigated.	Mean class recall: 0.65