BTC-fCNN: Fast Convolution Neural Network for Multi-class Brain Tumor Classification

Basant S Abd El-Wahab; Mohamed E Nasr; Salah Khamis; Amira S Ashour

doi:10.1007/s13755-022-00203-w

. 2023 Jan 2;11(1):3. doi: 10.1007/s13755-022-00203-w

BTC-fCNN: Fast Convolution Neural Network for Multi-class Brain Tumor Classification

Basant S Abd El-Wahab ^1,^✉, Mohamed E Nasr ¹, Salah Khamis ¹, Amira S Ashour ¹

PMCID: PMC9807719 PMID: 36606077

Abstract

Timely prognosis of brain tumors has a crucial role for powerful healthcare of remedy-making plans. Manual classification of the brain tumors in magnetic resonance imaging (MRI) images is a challenging task, which relies on the experienced radiologists to identify and classify the brain tumor. Automated classification of different brain tumors is significant based on designing computer-aided diagnosis (CAD) systems. Existing classification methods suffer from unsatisfactory performance and/or large computational cost/ time. This paper proposed a fast and efficient classification process, called BTC-fCNN, which is a deep learning-based system to distinguish between different views of three brain tumor types, namely meningioma, glioma, and pituitary tumors. The proposed system’s model was applied on MRI images from the Figshare dataset. It consists of 13 layers with few trainable parameters involving convolution layer, 1 × 1 convolution layer, average pooling, fully connected layer, and softmax layer. Five iterations including transfer learning and five-fold cross-validation for retraining are considered to increase the proposed model performance. The proposed model achieved 98.63% average accuracy, using five iterations with transfer learning, and 98.86% using retrained five-fold cross-validation (internal transfer learning between the folds). Various evaluation metrics were measured to evaluate the proposed model, such as precision, F-score, recall, specificity and confusion matrix. The proposed BTC-fCNN model outstrips the state-of-the-art and other well-known convolution neural networks (CNN).

Keywords: Brain tumor classification, Convolution neural network, Average pooling layer, Convolution layer, Transfer learning

Introduction

Brain tumors are life-threatening having various types classified as benign and malignant. The malignant tumors have degree of malignancy that can be categorized into glioma, meningioma, and pituitary. For accurate and fast diagnosis, computer-aided diagnosis (CAD) systems become a must, especially with the advancement of deep learning networks that attract scientists to implement them for supporting healthcare [1–3]. From the clinical perspectives, the improvements in the image enhancement, object detection, and image classification pulled the consideration for early disease diagnosis and treatment plans [4–6]. To provide various views of tissues and organs, such as the brain, the magnetic resonance imaging (MRI) has been effectively enacted to analyse, monitor, diagnose and treat brain tumors.

For efficient diagnosis, several methods for brain tumor classification based on MRI images have been conducted. For example, Cheng et al. [7] implemented brain tumor classification for glioma, pituitary, and meningioma, using Gray level co-occurrence matrix (GLCM), and bag-of-words (BoW) model. The results reported 91.28% classification accuracy using BOW model. Furthermore, Ismael et al. [8] classified same three brain tumor types by combining statistical features and neural network, which achieved 91.9% classification accuracy. Ari et al. [9] classified the benign and malignant tumors using local smoothing, and nonlocal means procedures to remove noise, then applying the extreme learning machine local receptive fields (ELM-LRF). The results depicted 97.18% classification accuracy, 97.12% specificity, and 96.80% sensitivity. Gumaei et al. [10] established brain tumor classification depending on a hybrid feature extraction using principal component analysis (PCA) with normalized descriptors, followed by regularized extreme learning machine achieving 94.23% accuracy.

In contrast, different deep learning-based models were implemented for brain tumor classification, for example, Sajjad et al. [11] implemented brain tumor classification using pre-trained convolution neural network (CNN) with data augmentation based on VGG-19. This model was fine-tuned to provide 94.58% classification accuracy, 88.41% sensitivity, and 96.12% specificity. Kutlu et al. [12] established a classification model based on AlexNet CNN with 10 layers using different trainable parameters on 300 images, namely 100 glioma, 100 meningioma, and 100 pituitary tumors. Moreover, a pre-trained VGG19 model was proposed by Swati et al. [13] for brain tumor classification of the same brain tumor types achieving a mean accuracy of 94.82%. Excitation and squeeze ResNet model were implemented by Ghosal et al. [14] for brain tumor classification with 89.93%, and 93.83% accuracies without and data augmentation, respectively. Anaraki et al. [15] proposed a classification model based on the structure of the CNN, which consisted of convolution layers, max-pooling layers, and a fully connected layer with genetic algorithm. The results demonstrated accuracy of 94.2% on classifying the three tumor types with 90.9% accuracy for classifying three grades of Glioma. Deepak et al. [16] introduced a pre-trained CNN network with transfer learning to classify the three classes of the brain tumor using a pre-trained GoogleNet leading to 97.1% accuracy.

Another trend based on combining two paths of the CNN was designed by Alshayeji et al. [17] for classification, which consisted of convolutional, dropout, max-pooling, batch normalization, flatten, and dense layers with applying Bayesian optimization. It reported an accuracy of 97.37%. Kakarla et al. [18]. exhibited brain tumor classification network based on CNN structure with eight layers, which achieved classification accuracy 97.42%, 97.41% precision, 97.42% recall, and 95.09% Jaccard. Kumar et al. [19] developed a classification network based on pre-trained ResNet-50 by replacing the output layer with average pooling and softmax layers for 97.08% classification accuracy with data augmentation, and 97.48% without augmentation. Table 1 summaries the previously mentioned techniques for brain tumor classification.

Table 1.

Different classification techniques for brain tumor diagnosis

Reference	Method	Number of images in the dataset	Limitations	Accuracy %
Cheng et al. [7]	BoW, intensity histogram and GLCM	3064	Elevated computational complicatedness	91.28
Cheng et al. [7]	Ring from partition for classification	3064	Elevated computational complicatedness	91.28
Ismael et al. [8]	The histogram and the GLCM for feature extraction	3064	Elevated computational complicatedness	91.9
Ismael et al. [8]	ANN for classification	3064	Elevated computational complicatedness	91.9
Ari et al. [9]	ELM-LRF for classification	108	Small dataset	97.18
	ELM-LRF for classification		Inappropriate for another training dataset
	Watershed segmentation for segmentation		Inappropriate for another training dataset
Gumaei et al. [10]	(PCA) with GIST descriptors for feature extraction	3064	Elevated computational complicatedness	94.23
Gumaei et al. [10]	Regularized extreme learning machine for classification	3064	Elevated computational complicatedness	94.23
Sajjad et al. [11]	VGG19 with data augmentation	3064	Elevated computational cost	94.58
Sajjad et al. [11]	VGG19 with data augmentation	3064	Large storage required	94.58
Kutlu et al. [12]	Based on AlexNet	300	Small dataset	98.6
			Elevated computational cost
			Large storage requirements
Swati et al. [13]	VGG with fine tuning	3064	Elevated computational cost	94.82
Swati et al. [13]	VGG with fine tuning	3064	Large storage requirements	94.82
Ghosal et al. [14]	Based on AlexNet	3049	Elevated computational cost	93.83
Ghosal et al. [14]	Based on AlexNet	3049	Large storage requirements	93.83
Anaraki et al. [15]	CNN with Genetic Algorithm	3064	Elevated computational cost	94.2
Anaraki et al. [15]	CNN with Genetic Algorithm	3064	Large storage requirements	94.2
Deepak et al. [16]	GoogleNet with Transfer Learning	3064	Time-consuming	97.1
			Elevated computational cost
			Large storage requirements
Alshayeji et al. [17]	Aggregation of two paths from CNN	3064	Time-consuming	97.37
			Elevated computational cost
			Large storage requirements
Kakarla et al. [18]	Average pooling convolutional neural network	3064	Time-consuming	97.42
			Elevated computational cost
			Large storage requirements
Kumar et al. [19]	ResNet-50 with Global Average Pooling at the output layer	3064	Time-consuming	97.48
			Elevated computational cost
			Large storage requirements

Layer type	Number of filters	Kernel size	Output size	Number of parameter
Conv2D	32	3 × 3	(254, 254, 32)	320
Conv2D	10	1 × 1	(254, 254, 10)	330
AveragePooling2D	–	2 × 2	(127, 127, 10)	0
Conv2D	10	1 × 1	(127, 127, 10)	110
Conv2D	32	3 × 3	(125, 125, 32)	2912
Conv2D	10	1 × 1	(125, 125, 10)	330
AveragePooling2D	-	2 × 2	(62, 62, 10)	0
Conv2D	10	1 × 1	(62, 62, 10)	110
Conv2D	32	3 × 3	(60, 60, 32)	2912
Conv2D	10	1 × 1	(60, 60, 10)	330
AveragePooling2D	–	2 × 2	(30, 30, 10)	0
Flatten	–	–	9000	0
Dense	–	–	64	576,064
Softmax	–	–	3	195
Total parameters	583,613
Trainable parameters	583,613

No. of filter	Accuracy %		Loss	F1-score %	Precision %	Recall %	Specificity %	Time (Sec.)	Number of model parameters
16 filters	Mean	90.96	0.28	90.02	90.18	89.97	95.27	49.43	933,587
16 filters	SD	± 1.55	± 0.04	± 1.73	± 1.65	± 1.87	± 0.86	± 6.51	933,587
10 filters	Mean	93.08	0.24	92.21	92.47	92.01	96.34	41.43	583,613
10 filters	SD	± 0.44	± 0.03	± 0.64	± 0.58	± 0.86	± 0.29	± 1.61	583,613
8 filters	Mean	91.25	0.29	90.13	90.72	89.85	95.34	37.56	466,987
8 filters	SD	± 1.81	± 0.07	± 2.09	± 2.13	± 2.16	± 0.89	± 3.68	466,987

Fold	Accuracy %	Loss	F1-score %	Precision %	Recall %	Specificity %	Time (Sec.)
1	92.82	0.24	91.75	92.39	91.26	96.11	42.33
2	93.64	0.21	93.11	92.81	93.43	96.75	39.53
3	92.82	0.26	92.15	92.39	91.92	96.19	42.88
4	92.66	0.28	91.49	91.61	91.41	96.08	40.78
5	93.46	0.22	92.52	93.15	92.03	96.56	40.64
Mean	93.08	0.24	92.21	92.47	92.01	96.34	41.43
SD	± 0.44	± 0.03	± 0.64	± 0.58	± 0.86	± 0.29	± 1.61

Fold	Accuracy %	Loss	F1-score %	Precision %	Recall %	Specificity %	Time (Sec.)
1	97.72	0.07	97.66	97.48	97.84	98.83	42.17
2	97.55	0.11	97.41	97.41	97.41	98.72	41.69
3	97.88	0.06	97.63	97.29	98.01	98.97	40.21
4	97.72	0.09	97.51	97.42	97.59	98.85	37.79
5	96.73	0.11	96.16	96.86	95.66	98.22	33.19
Mean	97.52	0.09	97.27	97.29	97.31	98.72	39.01
SD	± 0.46	± 0.02	± 0.63	± 0.25	± 0.95	± 0.29	± 3.67

Fold	Accuracy %	Loss	F1-score %	Precision %	Recall %	Specificity%	Time (Sec.)
1	99.02	0.03	98.82	98.74	98.89	99.55	41.71
2	98.21	0.05	98.03	97.79	98.27	99.14	41.68
3	98.53	0.05	98.42	98.54	98.31	99.23	40.31
4	98.53	0.06	98.28	98.29	98.28	99.28	41.65
5	98.69	0.04	98.58	98.68	98.49	99.29	41.39
Mean	98.59	0.05	98.43	98.41	98.45	99.29	41.35
SD	± 0.29	± 0.01	± 0.29	± 0.39	± 0.26	± 0.15	± 0.59

Fold	Accuracy %	Loss	F1-score %	Precision %	Recall %	Specificity%	Time (Sec.)
1	98.37	0.06	98.29	98.42	98.18	99.14	28.76
2	98.37	0.05	98.13	98.05	98.21	99.18	28.79
3	98.21	0.05	97.98	97.85	98.13	99.15	28.98
4	99.35	0.03	99.29	99.33	99.25	99.64	28.99
5	98.86	0.05	98.66	98.59	98.74	99.46	28.46
Mean	98.63	0.05	98.47	98.45	98.51	99.31	28.79
SD	± 0.47	± 0.01	± 0.52	± 0.57	± 0.49	± 0.23	± 0.22

Fold	Accuracy %	Loss	F1-score %	Precision %	Recall %	Specificity%	Time (Sec.)
1	98.21	0.06	97.99	98.13	97.88	99.07	29.09
2	99.18	0.04	99.05	98.88	99.23	99.62	28.51
3	98.37	0.05	98.05	98.14	97.98	99.19	28.69
4	99.02	0.03	98.95	98.91	98.98	99.51	28.67
5	98.37	0.05	98.28	98.19	98.38	99.18	28.91
Mean	98.63	0.05	98.46	98.45	98.49	99.31	28.77
SD	± 0.44	± 0.01	± 0.51	± 0.41	± 0.59	± 0.24	± 0.23

Class	F1-score%	Precision%	Recall%	Specificity%	Accuracy %
Fold 1
Meningioma	96.99	96.67	97.32	98.92	98.53
Glioma	98.41	98.58	98.23	98.79
Pituitary	100	100	100	100
Fold 5
Meningioma	97.67	97.67	97.67	99.38	99.02
Glioma	98.94	98.93	98.94	99.09
Pituitary	100	100	100	100

Fold	Accuracy%	Loss	F1-score%	Precision%	Recall%	Specificity%	Time (sec)
1	90.54	0.29	89.83	91.08	89.19	94.66	66.75
2	92.33	0.20	91.58	91.41	91.78	96.14	82.53
3	90.21	0.33	89.16	88.66	89.78	95.07	82.39
4	92.66	0.34	91.96	92.26	91.77	96.22	82.35
5	92.81	0.25	91.37	91.61	91.15	96.15	82.62
Mean	91.71	0.28	90.78	91.01	90.73	95.65	79.33
SD	± 1.24	± 0.06	± 1.22	± 1.38	± 1.19	± 0.73	± 7.03

Fold	Accuracy%	Loss	F1-score%	Precision%	Recall%	Specificity%	Time (Sec.)
1	91.19	0.36	90.34	90.26	90.42	95.42	84.57
2	94.62	0.17	93.91	93.83	94.02	97.27	83.39
3	91.84	0.27	91.22	90.74	91.86	96.05	83.55
4	91.19	0.34	90.25	91.04	89.71	95.27	83.31
5	90.69	0.27	89.53	89.82	89.29	95.13	84.19
Mean	91.91	0.28	91.05	91.14	91.06	95.83	83.81
SD	± 1.57	± 0.07	± 1.71	± 1.58	± 1.92	± 0.88	± 0.55

Fold	Accuracy%	Loss	F1-score%	Precision%	Recall%	Specificity%	Time (Sec.)
1	95.27	0.13	94.64	94.34	95.01	97.66	82.59
2	96.41	0.13	96.09	96.31	95.93	98.14	60.11
3	95.92	0.13	95.66	95.69	95.64	97.87	61.97
4	95.11	0.12	94.49	94.45	94.55	97.42	60.62
5	97.22	0.11	96.79	96.82	96.79	98.61	82.64
Mean	95.99	0.12	95.53	95.52	95.58	97.94	69.59
SD	± 0.86	± 0.01	± 0.97	± 1.11	± 0.86	± 0.46	± 11.91

Fold	Accuracy%	Loss	F1-score%	Precision%	Recall%	Specificity%	Time (Sec.)
1	95.76	0.17	95.19	95.17	95.22	97.84	49.39
2	96.25	0.11	95.95	96.05	95.87	98.07	82.61
3	96.41	0.09	95.88	95.81	95.96	98.19	82.59
4	96.25	0.14	95.93	95.92	95.95	98.09	82.62
5	95.75	0.10	95.22	95.13	95.31	97.76	49.01
Mean	96.08	0.12	95.63	95.62	95.66	97.99	69.24
SD	± 0.31	± 0.03	± 0.39	± 0.43	± 0.37	± 0.18	± 18.29

Fold	Accuracy%	Loss	F1-score%	Precision%	Recall%	Specificity%	Time (Sec.)
1	97.23	0.08	97.05	97.05	97.05	98.56	82.59
2	97.39	0.13	97.13	97.21	97.06	98.64	60.76
3	97.23	0.05	96.88	96.57	97.22	98.69	82.62
4	97.06	0.07	96.77	96.94	96.61	98.42	60.29
5	99.02	0.03	98.86	98.71	99.01	99.54	56.39
Mean	97.59	0.07	97.34	97.29	97.39	98.77	68.53
SD	± 0.81	± 0.04	± 0.86	± 0.82	± 0.93	± 0.44	± 12.96

Fold	Accuracy%	Loss	F1-score%	Precision%	Recall%	Specificity%	Time (Sec.)
1	98.53	0.07	98.37	98.34	98.39	99.24	58.64
2	96.74	0.13	96.61	96.27	96.99	98.42	58.39
3	97.39	0.08	97.15	97.31	97.01	98.64	58.94
4	98.04	0.07	97.78	98.13	97.46	98.86	58.49
5	97.39	0.06	97.09	96.99	97.19	98.71	58.85
Mean	97.62	0.08	97.41	97.41	97.41	98.77	58.66
SD	± 0.69	± 0.03	± 0.68	± 0.85	± 0.58	± 0.31	± 0.23

Fold	Accuracy%	Loss	F1-score%	Precision%	Recall%	Specificity%	Time (Sec.)
1	91.19	0.06	90.16	89.68	90.72	95.59	85.12
2	97.88	0.06	97.65	97.82	97.65	98.87	86.25
3	98.85	0.04	98.74	98.76	98.72	99.39	80.31
4	98.53	0.06	98.48	98.55	98.43	99.24	69.68
5	98.03	0.05	97.73	97.47	97.73	99.07	65.34

Fold	Accuracy%	Loss	F1-score%	Precision%	Recall%	Specificity%	Time (Sec.)
1	97.23	0.07	96.92	96.81	97.04	98.63	55.16
2	98.04	0.05	97.85	97.49	98.23	99.05	51.49
3	97.72	0.05	97.62	97.61	97.64	98.81	56.47
4	97.23	0.09	96.85	96.91	96.79	98.47	52.36
5	98.37	0.05	98.28	98.39	98.17	99.13	50.65
Mean	97.72	0.06	97.51	97.44	97.57	98.82	53.23
SD	± 0.51	± 0.02	± 0.61	± 0.63	± 0.65	± 0.28	± 2.48

Model		Accuracy %	Loss	F1-score %	Precision %	Recall %	Specificity %	Time (Sec.)
The initial proposed model (case 1)	Mean	93.08	0.24	92.21	92.47	92.01	96.34	41.43
The initial proposed model (case 1)	SD	± 0.44	± 0.03	± 0.64	± 0.58	± 0.86	± 0.29	± 1.61
The initial proposed model (case 2)	Mean	98.63	0.05	98.46	98.45	98.49	99.31	28.77
The initial proposed model (case 2)	SD	± 0.44	± 0.01	± 0.51	± 0.41	± 0.59	± 0.24	± 0.23
The final proposed BTC-fCNN model (case 3)	Mean	98.86	0.05	98.77	98.72	98.83	99.41	26.92
The final proposed BTC-fCNN model (case 3)	SD	± 0.45	± 0.02	± 0.53	± 0.48	± 0.59	± 0.28	± 0.23
Case 1 without 1 × 1conv. layer	Mean	91.71	0.28	90.78	91.01	90.73	95.65	79.33
Case 1 without 1 × 1conv. layer	SD	± 1.24	± 0.06	± 1.22	± 1.38	± 1.19	± 0.73	± 7.03
Case 2 without 1 × 1conv. layer	Mean	97.62	0.08	97.41	97.41	97.41	98.77	58.66
Case 2 without 1 × 1conv. layer	SD	± 0.69	± 0.03	± 0.68	± 0.85	± 0.58	± 0.31	± 0.23
Case 3 without 1 × 1conv. layer	Mean	97.72	0.06	97.51	97.44	97.57	98.82	53.23
Case 3 without 1 × 1conv. layer	SD	± 0.51	± 0.02	± 0.61	± 0.63	± 0.65	± 0.28	± 2.48

Model		Accuracy %	Time (Sec.)	Number of model parameters
VGG16	Mean	92.07	975.15	165,730,115
VGG16	SD	± 0.61	± 134.70	151,015,427
VGG19	Mean	93.05	1189.35	171,039,811
VGG19	SD	± 0.94	± 129.52	151,015,427
InceptionV3	Mean	80.35	1637.19	23,904,035
InceptionV3	SD	± 2.42	± 204.06	2,101,251
ResNet50	Mean	74.48	428.53	23,593,859
ResNet50	SD	± 2.15	± 9.17	6,147
MobileNet	Mean	89.16	555.74	4,256,867
MobileNet	SD	± 0.98	± 17.88	1,028,003
The initial proposed model in case 1	Mean	93.08	41.43	583,613
The initial proposed model in case 1	SD	± 0.44	± 1.61	583,613
The initial proposed model in case 2	Mean	98.63	28.77	583,613
The initial proposed model in case 2	SD	± 0.44	± 0.23	583,613
The final proposed BTC-fCNN model in case 3	Mean	98.86	26.92	583,613
The final proposed BTC-fCNN model in case 3	SD	± 0.45	± 0.23	583,613

Reference	Model	Accuracy %
Gumaei et al. [10]	Regularized extreme learning machine	94.23
Sajjad et al. [11]	VGG19 with extensive data augmentation	94.58
Anarki et al. [15]	CNN with genetic algorithm	94.20
Swati et al. [13]	VGG19 with fine tuning	94.82
Deepak et al. [16]	GoogleNet with transfer learning	97.10
Alshayeji et al. [17]	Aggregation of two paths from CNN	97.37
Kakarla et al. [18]	Average pooling convolutional neural network	97.42
Kumar et al. [19]	ResNet-50 with Global Average Pooling at the output layer	97.48
The proposed BTC-fCNN model (case 3)	The proposed model with retraining the model in each fold during five folds	98.86

PERMALINK

BTC-fCNN: Fast Convolution Neural Network for Multi-class Brain Tumor Classification

Basant S Abd El-Wahab

Mohamed E Nasr

Salah Khamis

Amira S Ashour

Abstract

Introduction

Table 1.

Methodology

Brain Tumor Dataset

Fig. 1.

Traditional Convolution Neural Network

One by One Convolution Layer

Transfer Learning

Proposed CNN- Based Models for Multi-Class Classification

Fig. 2.

The Initial Proposed Model (Case 1)

Table 2.

The Initial Proposed Model (Case 2)

The Final Proposed BTC-fCNN Model (Case 3)

Fig. 3.

Evaluation Metrics

Experimental Simulation Results

The proposed model evaluation

The results of the Initial of Proposed Model (Case 1)

Fig. 4.

Fig. 5.

Table 3.

The Results of the Initial Proposed Model (Case 2)

Table 4.

Table 5.

Table 6.

Table 7.

Table 8.

The Results of the Final Proposed BTC-fCNN Model (Case 3)

Fig. 6.

Fig. 7.

Table 9.

Table 10.

Table 11.

Fig. 8.

Comparative Study of Proposed System Without 1 × 1 Convolution Layer

Proposed Model Without Transfer Learning nor 1 × 1 Convolution Layer

Fig. 9.

Table 12.

Fig. 10.

Proposed Model with Transfer Learning and Without 1 × 1 Convolution Layer

Table 13.

Table 14.

Table 15.

Table 16.

Table 17.

Proposed Model with Retrained Fivefold Cross Validation and Without 1 × 1 Convolution Layer

Table 18.

Table 19.

Table 20.

Comparative Study with Well-Known CNN Networks

Table 21.

Discussion

Table 22.

Conclusion

Funding

Data Availability

Declarations

Conflict of interest

Footnotes

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases