Thyroid Disease Prediction Using Selective Features and Machine Learning Techniques

Rajasekhar Chaganti; Furqan Rustam; Isabel De La Torre Díez; Juan Luis Vidal Mazón; Carmen Lili Rodríguez; Imran Ashraf

doi:10.3390/cancers14163914

. 2022 Aug 13;14(16):3914. doi: 10.3390/cancers14163914

Thyroid Disease Prediction Using Selective Features and Machine Learning Techniques

Rajasekhar Chaganti ^1,^†, Furqan Rustam ^2,^†, Isabel De La Torre Díez ^3,^*, Juan Luis Vidal Mazón ^4,^5,⁶, Carmen Lili Rodríguez ^4,⁷, Imran Ashraf ^8,^*

Editors: Stefania Masone, Nunzio Velotti

PMCID: PMC9405591 PMID: 36010907

Abstract

Simple Summary

The study presents a thyroid disease prediction approach which utilizes random forest-based features to obtain high accuracy. The approach can obtain a 0.99 accuracy to predict ten thyroid diseases.

Abstract

Thyroid disease prediction has emerged as an important task recently. Despite existing approaches for its diagnosis, often the target is binary classification, the used datasets are small-sized and results are not validated either. Predominantly, existing approaches focus on model optimization and the feature engineering part is less investigated. To overcome these limitations, this study presents an approach that investigates feature engineering for machine learning and deep learning models. Forward feature selection, backward feature elimination, bidirectional feature elimination, and machine learning-based feature selection using extra tree classifiers are adopted. The proposed approach can predict Hashimoto’s thyroiditis (primary hypothyroid), binding protein (increased binding protein), autoimmune thyroiditis (compensated hypothyroid), and non-thyroidal syndrome (NTIS) (concurrent non-thyroidal illness). Extensive experiments show that the extra tree classifier-based selected feature yields the best results with 0.99 accuracy and an F1 score when used with the random forest classifier. Results suggest that the machine learning models are a better choice for thyroid disease detection regarding the provided accuracy and the computational complexity. K-fold cross-validation and performance comparison with existing studies corroborate the superior performance of the proposed approach.

Keywords: machine learning, thyroid prediction, forward feature selection, bidirectional feature elimination

1. Introduction

Thyroid disease incidences have been on the rise in recent times. The thyroid gland has one of the most important functions in regulating metabolism. Irregularities in the thyroid gland can lead to different abnormalities; two of the most common are hyperthyroidism and hypothyroidism. A large number of people are diagnosed with thyroid diseases such as hypothyroidism and hyperthyroidism yearly [1]. The thyroid gland produces levothyroxine (T4) and triiodothyronine (T3) and insufficient thyroid hormones may lead to hypothyroidism and hyperthyroidism [2]. Many approaches are proposed to detect thyroid disease diagnosis in the literature. A proactive thyroid disease prediction is essential to properly treat the patient at the right time and save human lives and medical expenses. Due to the technological advancements in data processing and computation, machine learning and deep learning techniques are applied to predict the thyroid diagnosis in the early stages and classify the thyroid disease types hypothyroidism, hyperthyroidism, etc.

Due to the advancement in technologies such as data mining, big data, image and video processing, and parallel computing, the healthcare domain benefited from leveraging technology in many healthcare areas for human well-being [3]. The range of data mining-based health care applications may include the early detection of diseases and diagnosis, prediction of virus outbreaks, drug discovery and testing, health care data management, and patient personalized medicine recommendations, etc. [4]. Health care professionals strive to identify the diseases in the early stages so that proper treatment can be provided to the patients and cures the disease within a short time and with less expenditure. Thyroid disease is one of the diseases which impacts a sizeable human population worldwide. According to the world-leading professional association (American thyroid association), 20 million Americans have some form of thyroid disease [5]. Twelve percent of the US population is diagnosed with a thyroid condition at least once in a lifetime. These statistics signify that thyroid-based disease should not be taken lightly. Improving the health care practices to detect and prevent thyroid diseases using advanced technologies is highly desired.

Existing research works predominantly focus on binary classification problems where the subjects are classified into thyroid patients or health subjects, while multiclass-based detection works are only a few. Even for those, the focus is on three categories including normal, hypothyroidism, and hyperthyroidism. For the most part, the emphasis is placed on the optimization of machine learning and deep learning models and the feature selection part is under-studied or completely ignored for a thyroid disease problem. Despite the high accuracy reporting approaches, such approaches are tested on samples under 1000, and results are not validated. The classification in terms of the patient status like treatment condition, health condition, and general health issues based categorization is desired to predict the patient thyroid condition effectively and proactively treat the patient. Moreover, the performance comparison of machine learning and deep learning models is not carried out. This study aims at working on these issues and makes the following contributions

A novel machine learning-based thyroid disease prediction approach is proposed that focus on the multi-class problem. Contrary to previous studies that focus on the binary or three-class problem, this study considers a five-class disease prediction problem.
Four feature engineering approaches are investigated in this study to analyze their efficacy for the problem at hand. It includes forward feature selection (FFS), backward feature elimination (BFE), bidirectional feature elimination (BiDFE), and machine learning-based feature selection using an extra tree classifier.
For experiments, five machine learning models are selected based on their reported performance for disease prediction, including random forest (RF), logistic regression, support vector machine (SVM), AdaBoost (ADA), and Gradient boosting machine (GBM). Moreover, three deep learning models are adopted as well, which include convolutional neural network, long short-term memory (LSTM) network, and CNN-LSTM. Performance is evaluated in terms of confusion matrix, 10-fold cross-validation, and standard deviation, in addition to accuracy, precision, recall, and F1 score.

The remainder of this article is organized as follows. Section 2 discusses the state-of-the-art works to detect and classify thyroid diseases. Section 3 presents the proposed methodology to address the thyroid disease prediction problem. This section also includes feature selection methods, machine learning techniques used in the article, and dataset description considered for this study. Section 4 describes the experimental results obtained in our study and comparison with prior art studies. Section 5 concludes the article with our contributions.

2. Literature Review

With recent technological advancements in data processing and computation, machine learning and deep learning techniques have been used in several research studies for thyroid disease prediction. Prediction of this disease at its early stages and its classification into cancer, Hypothyroidism, or Hyperthyroidism is helpful for timely treatment and recovery. The literature survey is performed using peer-reviewed article databases such as google scholar and Scopus. The searches were performed within the scope of the last five years to identify the recent works in our study. The keywords “Thyroid disease”, “Thyroid cancer”, “machine learning”, and “deep learning” combinations were used to select the relevant articles. As the number of retrieved results is much more for finding the relevant articles, we have further tuned the search queries and used a strict keyword search. Overall, more than 100 relevant articles were identified during our first screening. We further analyzed those articles and shortlisted 25 articles that are closely relevant to our work. Machine learning and deep learning methods are used both for thyroid disease detection and thyroid cancer detection. As the process of applying these methods is different for both tasks, they are discussed separately.

2.1. Thyroid Cancer Detection

The study [6] leveraged the least absolute shrinkage and selection operator (LASSO) and LR model to select the malignant thyroid nodule-associated ultrasonic characteristics. Then, RF is applied along with a scoring system to classify the malignant thyroid nodules. The logistic lasso regression (LLR) with RF obtained the best performance with 82% accuracy. Another study [7] performed machine learning-based prediction of the BRAF mutation presence in the confirmed cancer thyroid nodules. The authors selected 96 thyroid nodule ultrasonic images for this study. 86 radiomic features were extracted from the images, and three models, LR, SVM, and RF were applied to predict the presence of the BRAF mutation. The classification accuracy is reported as 64.3% for all three models. Idarraga et al. [8] performed machine learning-based thyroid nodule malignancy prediction using the ultrasonic and fine-needle aspiration (FNA) feature to avoid false-negative diagnosis in the early stages of thyroid cancer. The RF technique performed better than other techniques like decision tree (DT) and gradient descent (GD). All the above-mentioned works’ performance is not optimal to predict the thyroid cancer diagnosis and still has room for performance improvement.

2.2. Thyroid Disease Prediction

Several thyroid disease detection and classification approaches have been presented in the literature. For example, Garcia et al. [9] predicted the high probable molecules initiating the thyroid hormone homeostasis using machine learning algorithms RF, LR, GBM, SVM, and deep neural networks (DNN). The early prediction of the molecules is helpful for further testing in the first stages of thyroid disease. The molecular events were obtained from ToxCast datasets for running the experiments. The article reported that Thyroid Peroxidase (TPO) and Thyroid Hormone receptor (TR) achieved the best predictive performance with an F1 score of 0.83 and 0.81, respectively. The authors in [10] utilized the image processing techniques and feature selection methods to pick the important features from the dataset and achieve the best performance for thyroid disease prediction.

The thyroid disease classification is also a significant problem to be solved in the health industry. Razia et al. [11] compared the performance of various machine learning algorithms to classify Thyroid disease into normal, Hypothyroidism, or hyperthyroidism categories. The authors obtained the datasets from the University of California Irvine (UCI) machine learning library. The dataset contains 7200 samples, and each sample has 21 attributes. The authors reported that DT outperformed the SVM, NB, and multilinear regression (MLR) with 99.23%. However, multi-classification is limited to three categories, and limited information is provided on data preprocessing to assess the applicability of the results for real-time datasets. A multi-kernel SVM is proposed in the paper [12] to classify thyroid diseases. The authors mentioned that the multi-kernel SVM achieved 97.49% performance accuracy on UCI thyroid datasets. The improved gray wolf optimization performs the feature selection and enhances the performance.

A study [13] performed multiclass hypothyroidism using selective features and machine learning algorithms. Hypothyroidism is classified into four categories. The results show that RF performed well with 99.81% accuracy compared to the SVM, KNN, and DT algorithms. However, the authors did not mention the performance of their proposed methodology for thyroid disease classification. Another study [14] tested three feature selection methods along with SVM, DT, RF, LR, and Naive Bayes (NB) to make early predictions for hypothyroidism. Three feature selection methods, recursive feature selection (RFE), univariate feature selection (UFS), and principal component analysis (PCA), are tested in combination with ML algorithms. The RFE combination with ML algorithms performed better than other feature selection methods. All the five ML algorithms obtained 99.35% accuracy when combined with RFE feature selection. However, the data sample size is very small, with only 519 records. A large-scale dataset is needed to evaluate the effectiveness of their method.

The authors [15] evaluated the performance of the thyroid disease classification using various machine learning algorithms. SVM, RF, DT, NB, LR, K nearest neighbor (KNN), and MLP are used for disease prediction. A dataset sample of 1250 is taken from hospitals and laboratories in Iraq. The MLP predicted the thyroid classification with 96.4% accuracy. However, there is still room for performance improvement. Hosseinzadeh et al. [16] proposed a multiple multi-layer perception (MMLP) technique to classify thyroid diseases. When the MMLP is applied along with a set of six networks, the accuracy is improved by 0.7% compared to a single MLP. Although MMLP obtained 99% classification accuracy on large dataset samples, training deep learning techniques like MMLP is costly and needs high computational resources to train faster. The KNN with various distance functions is implemented to test the thyroid disease detection in [17]. The chi-square and L1-based featured selection methods were used to select the optimal features before applying the KNN with Euclidean and Cosine distances. The authors reported that KNN obtained promising results. However, the tested sample size is very small, with 590 samples in total.

Mishra et al. [18] applied the ML techniques sequential minimal optimization (SMO), DT, RF, and K-star classifier to predict hypothyroid disease. A sample size of unique 3772 records is considered for this study. The authors reported that RF and DT performed better than the other two techniques, with accuracy scores of 99.44% and 98.97%. However, the authors did not consider hyperthyroid predication. Alyas et al. [19] performed a comparative analysis of the machine learning techniques DT, RF, KNN, and artificial neural network (ANN) to detect thyroid disease. The tests were conducted on the largest dataset and considered both sampled and unsampled data for thyroid disease prediction. RF obtained the best prediction with 94.8% accuracy. However, the authors did not perform the thyroid disease type prediction tests. Researchers also applied deep learning models to predict thyroid disease classification. For instance, the authors [20] used a deep neural network (DNN) to predict the thyroid disease classification. The performance evaluation is done on the UCI dataset of 3152 unique samples. The authors reported 99.95% accuracy when using DNN to classify thyroid disease. However, a large dataset is required to train the model for performance evaluation properly. Additionally, more computing resources are needed to train the deep learning models.

Table 1 provides the comparative analysis of the existing works discussed in this section. Various datasets are used in the literature to evaluate the performance of thyroid disease detection. However, most of the datasets given in Table 1 are not standard datasets for performance evaluation and comparison with the existing work. Therefore, we elected a well-known UCI dataset for our study. Although tremendous work has been done in the above studies with high accuracy results to detect and classify thyroid disease, detailed research on the feature selection is not well explored for thyroid disease classification problems. Besides, the performance results reported in the context of thyroid disease classification accuracy are insufficient, and there is still scope for improvement. Furthermore, all the prior works classify thyroid problems into three categories (normal, hypothyroidism, or hyperthyroidism). The classification in terms of the patient status like treatment condition, health condition, and general health issues based categorization is desired to predict the patient thyroid condition effectively and proactively to treat the patient. Moreover, the detailed evaluation of the machine learning and deep learning-based techniques for thyroid disease classification and their performance comparison is not well discussed in the state-of-the-art. So, we propose a feature selection-based, highly accurate, multiclass supportive thyroid disease classification solution to overcome those limitations and provide a detailed performance comparison of machine learning and deep learning-based solutions.

Table 1.

Summary of the systematic analysis of the state-of-the-art thyroid disease studies.

Authors	Year	Sample Size	Dataset Source	Model	Classes	Evaluation Metrics	Results
[9]	2020	-	ToxCast	LR RF SVM XGB ANN	2	F1-score	(TPO) XGB-83% and (TR) RF-81%
[11]	2018	7200 samples, 21 attributes	UCI	SVM, Multiple Linear Regression(MLR), NB and DT	2	Accuracy	MLR 91.59% SVM 96.04% Naive Bayes 6.31% Decision Trees 99.23%
[12]	2020	7547, 30 features	UCI	multi-kernel SVM	3	Accuracy, Sensitivity, and Specificity	Accuracy (97.49%), Sensitivity (99.05%), and Specificity (94.5%)
[13]	2021	3771 samples, 30 attributes	UCI	DT, KNN, RF, and SVM	4	Accuracy	KNN 98.3% SVM 96.1% DT 99.5% RF 99.81%
[14]	2021	519 samples	diagnostic center Dhaka, Bangladesh	SVM, DT, RF, LR, and NB. Recursive Feature Selection (RFE), Univariate Feature Selection (UFS) and PCA	4	Accuracy	RFE, SVM, DT, RF, LR accuracy—99.35%
[15]	2021	1250 with 17 attributes	external hospitals and laboratories	SVM, RF, DT, NB, LR, KNN, MLP, linear discriminant analysis (LDA) and DT	3	Accuracy	DT 90.13, SVM 92.53 RF 91.2 NB 90.67 LR 91.73 LDA 83.2 KNN 91.47 MLP 96.4
[16]	2021	7200 patients, with 21 features	UCI	multiple MLP	3	Accuracy	multiple MLP 99%
[17]	2021	690 samples, 13 features	datasets from KEEL repo and District Headquarters teaching hospital, Pakistan	KNN without feature selection, KNN using L1-based feature selection, and KNN using chi-square-based feature selection	3	Accuracy	KNN 98%
[18]	2021	3772 and 30 attributes	UCI	RF, sequential minimal optimization (SMO), DT, and K-star classifier	2	Accuracy	K = 6, RF 99.44%, DT 98.97%, K-star 94.67%, and SMO 93.67%
[19]	2022	3163	UCI	DT, RF, KNN, and ANN	2	Accuracy	Best performance Accuracy RF 94.8%
[21]	2022	215 with 5 features	UCI	KNN, XGB, LR, DT	3	Accuracy	KNN 81.25 XGBoost 87.5 LR 96.875 DT 98.59
[20]	2022	3152, 23 features	UCI	DNN	2	Accuracy	Accuracy 99.95%

Attribute	Description	Data Type
age	age of the patient	(int)
sex	sex patient identifies	(str)
on_thyroxine	whether patient is on thyroxine	(bool)
query on thyroxine	whether patient is on thyroxine	(bool)
on antithyroid meds	whether the patient is on antithyroid meds	(bool)
sick	whether patient is sick	(bool)
pregnant	whether patient is pregnant	(bool)
thyroid_surgery	whether patient has undergone thyroid surgery	(bool)
I131_treatment	whether patient is undergoing I131 treatment	(bool)
query_hypothyroid	whether the patient believes they have hypothyroid	(bool)
query_hyperthyroid	whether the patient believes they have hyperthyroid	(bool)
lithium	whether patient * lithium	(bool)
goitre	whether patient has goitre	(bool)
tumor	whether patient has tumor	(bool)
hypopituitary	whether patient * hyperpituitary gland	(float)
psych	whether patient * psych	(bool)
TSH_measured	whether TSH was measured in the blood	(bool)
TSH	TSH level in blood from lab work	(float)
T3_measured	whether T3 was measured in the blood	(bool)
T3	T3 level in blood from lab work	(float)
TT4_measured	whether TT4 was measured in the blood	(bool)
TT4	TT4 level in blood from lab work	(float)
T4U_measured	whether T4U was measured in the blood	(bool)
T4U	T4U level in blood from lab work	(float)
FTI_measured	whether FTI was measured in the blood	(bool)
FTI	FTI level in blood from lab work	(float)
TBG_measured	whether TBG was measured in the blood	(bool)
TBG	TBG level in blood from lab work	(float)
referral_source		(str)
target	hyperthyroidism medical diagnosis	(str)
patient_id	unique id of the patient	(str)

Condition	Diagnosis Class	Count
hyperthyroid	hyperthyroid (A)	147
	T3 toxic (B)	21
	toxic goiter (C)	6
	secondary toxic (D)	8
hypothyroid	hypothyroid (E)	1
	primary hypothyroid (F)	233
	compensated hypothyroid (G)	359
	secondary hypothyroid (H)	8
binding protein:	increased binding protein (I)	346
binding protein:	decreased binding protein (J)	30
general health	concurrent non-thyroidal illness (K)	436
replacement therapy:	underreplaced (M)	111
	consistent with replacement therapy (L)	115
	overreplaced (N)	110
antithyroid treatment:	antithyroid drugs (O)	14
	I131 treatment (P)	5
	surgery (Q)	14
miscellaneous:	discordant assay results (R)	196
	elevated TBG (S)	85
	elevated thyroid hormones (T)	0
no condition	(-)	6771

Class	Prepossessed Count	Final Count
Normal	6771	400
primary hypothyroid	233	233
increased binding protein	346	346
compensated hypothyroid	359	359
concurrent non-thyroidal illness	436	436

age	sex	on_thyroxine	query_on_thyroxine	on_antithyroid_meds	sick	pregnant	thyroid_surgery
29	F	f	f	f	f	f	f
71	F	t	f	f	f	f	f
61	M	f	f	f	t	f	f
88	F	f	f	f	f	f	f
I131_treatment	query_hypothyroid	query_hyperthyroid	lithium	goitre	tumor	hypopituitary	psych
f	t	f	f	f	f	f	f
f	f	f	f	f	f	f	f
f	f	f	f	f	f	f	f
f	f	f	f	f	f	f	f
TSH_measured	TSH	T3_measured	T3	TT4_measured	TT4	T4U_measured	T4U
t	0.3	f		f		f
t	0.05	f		t	126	t	1.38
t	9.799999	t	1.2	t	114	t	0.84
t	0.2	t	0.4	t	98	t	0.73
FTI_measured	FTI	TBG_measured	TBG	referral_source	target	patient_id
f		f		other	-	$8.41 \times 10^{8}$
t	91	f		other	I	$8.41 \times 10^{8}$
t	136	f		other	G	$8.41 \times 10^{8}$
t	134	f		other	K	$8.41 \times 10^{8}$

Class	Hyper-Parameters	Tuning Range
LR	solver = liblinear, C = 5.0	solver = {liblinear, saga, sag}, C = {1.0 to 8.0}
SVM	kernel = ‘linear’, C = 5.0	kernel = {‘linear’, ‘poly’, ‘sigmoid’} C = {1.0 to 8.0}
RF	n_estimators = 200, max_depth = 20	n_estimators = {10 to 300}, max_depth = {2 to 50}
GBM	n_estimators = 200, max_depth = 20, learning_rat = 0.5	n_estimators = {10 to 300}, max_depth = {2 to 50}, learning_rat = {0.1 to 0.9}
ADA	n_estimators = 200, max_depth = 20, learning_rat = 0.5	n_estimators = {10 to 300}, max_depth = {2 to 50}, learning_rat = {0.1 to 0.9}

Target Class	Training	Testing	Total
“_” (0)	325	75	400
F (1)	190	43	233
G (2)	280	79	359
I (3)	271	75	346
K (4)	353	83	436

Model	Accuracy	Precision	Recall	F1 Score
RF	0.98	0.98	0.98	0.98
GBM	0.97	0.98	0.98	0.98
ADA	0.97	0.97	0.97	0.97
LR	0.85	0.85	0.85	0.85
SVM	0.85	0.85	0.85	0.85

Model	Accuracy	Precision	Recall	F1 Score
RF	0.97	0.97	0.96	0.96
GBM	0.97	0.97	0.96	0.96
ADA	0.93	0.92	0.92	0.92
LR	0.83	0.83	0.82	0.82
SVM	0.92	0.92	0.92	0.92

Model	Accuracy	Precision	Recall	F1 Score
RF	0.96	0.96	0.95	0.95
GBM	0.92	0.92	0.91	0.91
ADA	0.83	0.84	0.83	0.83
LR	0.83	0.83	0.82	0.82
SVM	0.92	0.92	0.92	0.92

Model	Accuracy	Precision	Recall	F1 Score
RF	0.99	0.99	0.99	0.99
GBM	0.98	0.98	0.98	0.98
ADA	0.97	0.97	0.97	0.97
LR	0.87	0.88	0.87	0.87
SVM	0.92	0.92	0.92	0.92

Feature	Model	Accuracy	SD	Time
Original	RF	0.94	+/−0.10	1.689
	GBM	0.93	+/−0.13	3.831
	ADA	0.93	+/−0.08	1.758
	LR	0.84	+/−0.13	0.330
	SVM	0.88	+/−0.12	243.126
FS	RF	0.93	+/−0.10	0.440
	GBM	0.90	+/−0.14	1.349
	ADA	0.89	+/−0.08	0.743
	LR	0.78	+/−0.13	0.330
	SVM	0.90	+/−0.15	210.65
BE	RF	0.93	+/−0.11	0.601
	GBM	0.90	+/−0.14	1.380
	ADA	0.87	+/−0.07	0.635
	LR	0.78	+/−0.13	0.111
	SVM	0.90	+/−0.15	173.80
BiDFE	RF	0.93	+/−0.03	0.677
	GBM	0.90	+/−0.02	8.733
	ADA	0.89	+/−0.06	0.617
	LR	0.78	+/−0.06	0.111
	SVM	0.90	+/−0.04	42.496
ML FS	RF	0.94	+/−0.01	1.689
	GBM	0.93	+/−0.13	3.831
	ADA	0.93	+/−0.08	1.758
	LR	0.84	+/−0.13	0.330
	SVM	0.91	+/−0.13	365.51

Model	Hyperparameters
LSTM	Embedding (4000, 100, input_length = …) Dropout (0.5) LSTM (128) Dense (5, activation = ‘softmax’)
CNN	Embedding (4000, 100, input_length = …) Conv1D (128, 5, activation = ‘relu’) MaxPooling1D (pool_size = 5) Activation (‘relu’) Dropout (rate = 0.5) Flatten() Dense (5, activation = ‘softmax’)
CNN-LSTM	Embedding (4000, 100, input_length = …) Conv1D (128, 5, activation = ‘relu’) MaxPooling1D (pool_size = 5) LSTM (100) Dense (5, activation = ‘softmax’)
loss = ‘categorical_crossentropy’, optimizer = ‘adam’, epochs = 100, batch_size = 16

Feature	Model	Accuracy	Precision	Recall	F1 Score
Original	LSTM	0.84	0.84	0.83	0.83
	CNN	0.93	0.94	0.92	0.93
	CNN-LSTM	0.90	0.90	0.88	0.88
FS	LSTM	0.62	0.63	0.59	0.59
	CNN	0.86	0.87	0.84	0.85
	CNN-LSTM	0.77	0.78	0.73	0.74
BE	LSTM	0.57	0.61	0.54	0.54
	CNN	0.86	0.87	0.84	0.84
	CNN-LSTM	0.86	0.87	0.84	0.85
BiDFE	LSTM	0.83	0.83	0.80	0.80
	CNN	0.85	0.84	0.81	0.82
	CNN-LSTM	0.87	0.88	0.84	0.86
ML FS	LSTM	0.57	0.63	0.54	0.55
	CNN	0.89	0.89	0.87	0.88
	CNN-LSTM	0.92	0.91	0.91	0.91

Model	FFS	BFE	BiDFE	MLFS	Original
LSTM	44.975	87.842	98.067	66.361	170.28
CNN	83.088	37.796	131.48	30.852	56.436
CNN-LSTM	150.53	65.992	214.96	47.922	97.662

PERMALINK

Thyroid Disease Prediction Using Selective Features and Machine Learning Techniques

Rajasekhar Chaganti

Furqan Rustam

Isabel De La Torre Díez

Juan Luis Vidal Mazón

Carmen Lili Rodríguez

Imran Ashraf

Roles

Abstract

Simple Summary

Abstract

1. Introduction

2. Literature Review

2.1. Thyroid Cancer Detection

2.2. Thyroid Disease Prediction

Table 1.

3. Proposed Methodology

Figure 1.

3.1. Dataset Acquisition

Table 2.

Table 3.

Table 4.

Table 5.

Table 6.

3.2. Feature Selection

Figure 2.

3.2.1. Forward Feature Selection

3.2.2. Backward Feature Elimination

3.2.3. Bi-Directional Elimination

3.2.4. Machine Learning Feature Selection

Figure 3.

3.3. Machine Learning Models

Table 7.

4. Results and Discussion

Table 8.

4.1. Results Using Original Feature Set

Table 9.

4.2. Performance of Models with FFS

Table 10.

Figure 4.

4.3. Results Using BFE Features

Table 11.

Figure 5.

4.4. Models’ Performance Using BiDFE Features

Table 12.

4.5. Performance of Models Using MLFS Features

Table 13.

4.6. K-Fold Cross-Validation for Models

Table 14.

4.7. Deep Learning Models Results

Table 15.

Figure 6.

Figure 7.

Figure 8.

Table 16.

Table 17.

4.8. Limitations of Current Study

4.9. Comparison with Other Studies

Table 18.

4.10. Discussion on Hyperthyroidism and Hypothyroidism

5. Conclusions

Author Contributions

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Funding Statement

Footnotes

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases