Abstract
The migration and predation of grasshoppers inspire the grasshopper optimization algorithm (GOA), which can be applied to practical problems. The binary grasshopper optimization algorithm (BGOA) is used for binary problems. To improve the algorithm’s exploration capability and the quality of its solutions, this paper modifies the step size in BGOA: the step size is expanded, and three new transfer functions are proposed based on this improvement. To demonstrate the effectiveness of the algorithm, comparative experiments with BGOA, binary particle swarm optimization (BPSO), and the binary gray wolf optimizer (BGWO) are conducted. The improved algorithm is tested on 23 benchmark test functions, and the Wilcoxon rank-sum and Friedman tests are used to verify its validity. The results indicate that the optimized algorithm is significantly better than the compared algorithms on most functions. For the application, this paper selects 23 UCI datasets for feature selection, where the improved algorithm yields higher accuracy and fewer features.
Keywords: grasshopper optimization, binary version, transfer function, feature selection
1. Introduction
Recent years have witnessed a spurt in the development of informatics, and the data scale of applications such as statistical analysis and data mining is becoming larger and larger. Accordingly, the number of features obtained from datasets is also increasing. However, some features may be irrelevant or redundant and independent of the final classification goal [1]. Therefore, it is necessary to reduce the dimensionality of the data and obtain representative features before the classification task. Data preprocessing can smooth out noisy and incomplete data, detect redundancy, and improve robustness. As an essential preprocessing step, feature selection (FS) can effectively clean and remove useless data features [2]. Thus, FS plays an essential role in dimensionality reduction and in improving classification performance.
FS is an effective strategy to reduce dimensionality and eliminate noisy and unreliable data [3]. It refers to finding feature-related subsets from a large set of attributes. There are $2^N$ possible feature subsets in a dataset with N features. Davies proved that the search for the smallest subset of features is an NP-hard problem, which means there is no guarantee of finding an optimal solution other than by an exhaustive search [4,5]. However, when the number of features is large, an exhaustive search cannot be applied in practice because of the huge amount of computation. Therefore, researchers are committed to using heuristic search algorithms to find suboptimal solutions. Many studies have attempted to model feature selection as a combinatorial optimization problem, where the objective function can be the classification accuracy or some other criterion that considers the best trade-off between the number of extracted features and efficiency [6].
Meta-heuristic algorithms are used to find optimal or satisfactory solutions to complex optimization problems [7,8,9]. Their principles are derived from knowledge of relevant behaviors and experiences in biological, physical, and other systems. In 1991, an Italian scholar proposed the theory of ant colony optimization (ACO) [10]; since then, swarm intelligence has been formally proposed as a theory. Swarm intelligence takes advantage of group information and has been extensively used in optimization problems. In 1995, some scholars presented the particle swarm optimization (PSO) algorithm [11], and research on this subject then developed rapidly. Cat swarm optimization, based on feline predation strategies, was introduced in 2006 [12]. In 2010, fish migration optimization (FMO) emerged, which integrated migration and swimming models into the optimization process [13]. In 2017, Saremi et al. proposed the grasshopper optimization algorithm (GOA) [14]. GOA solves optimization problems by mathematically modeling and simulating the behavior of grasshopper swarms in nature. Compared with other existing algorithms, GOA has a higher search efficiency and a faster convergence speed. It has also been used to solve continuous structural design problems such as finding the best shape for 52-bar and 3-bar trusses. Over recent years, more sophisticated algorithms have been put forward, such as the sparrow search algorithm (SSA) [15], the seagull optimization algorithm (SOA) [16], quasi-affine transformation evolution with external archive (QUATRE-EAR) [17], and the polar bear optimization algorithm (PBO) [18].
Nonetheless, many optimization problems, such as FS, are discrete. Conventional methods cannot satisfy practical needs, so binary algorithms are needed to solve such problems. To date, scholars have proposed many binary algorithms and achieved fruitful results. Among them, the well-known PSO algorithm and its binary variants have been applied to feature selection [19,20,21,22]. A binary whale optimization algorithm was presented to handle discrete problems in [23]. The binary fish migration optimization algorithm (ABFMO) [24] and the improved binary symbiotic organism search algorithm (IBSOS) using transfer functions have also been used to solve the FS problem [25]. Accordingly, the pigeon-inspired optimization algorithm (PIO) and the gray wolf optimization algorithm (GWO) have been improved for better application in feature selection [26,27,28]. The pigeon-inspired optimization algorithm simulates the pigeons’ homing behavior; based on the binary pigeon-inspired optimization algorithm (BPIO), Tian et al. proposed an improved binary pigeon-inspired optimization (IBPIO) [29], offering a new speed update equation and achieving excellent results. Additionally, the binary approach enabled GWO to be applied to discrete problems [30,31]. The novel binary gray wolf optimization algorithm (BGWO) added a new parameter update equation to enhance the search capability [32]; besides, the authors gave five transfer functions for feature selection on UCI datasets. Beyond that, the binary version of GOA has also been used to solve the FS problem [33]. Hichem et al. proposed a novel binary GOA (NBGOA) by modeling position vectors as binary vectors in [34]. Pinto et al. [35] developed a binary GOA based on the percentile concept for solving the multidimensional knapsack problem (MKP). Moreover, BGOA-M, a binary GOA based on the mutation operator, was introduced for the FS problem [36].
The sigmoid transfer function is a common method for converting algorithms to binary versions [37,38]. Some scholars suggested an improved binary EO (BEO) for FS problems using the sigmoid function [39]. The authors of [6] presented a binary MPA (BMPA) and its improved versions using the sigmoid and eight transfer functions. In the binary grasshopper optimization algorithm (BGOA), the authors used the sigmoid transfer function to convert the space to binary [36], and it has been well applied in feature selection. However, there is a weakness in the original BGOA: the conversion probability of the position only covers a small range, which cannot satisfy the exploration requirement of the algorithm. Thus, this paper presents an improved BGOA to avoid this situation. In the first place, the improved BGOA optimizes the step size of the original BGOA. Secondly, two sigmoid-based and one V-shaped transfer function are proposed based on the new step size. To evaluate the effectiveness of the improved algorithm, 23 well-known datasets are used for experiments. For the performance analysis, the improved BGOA is compared with BGOA, BPSO, and BGWO. Experiments show that the proposed algorithm performs better than the original BGOA on the FS problem. The main contributions are as follows:
The range of step size variables in the original BGOA is optimized.
Three new transfer functions and two position conversion formulas are proposed based on the new step size.
The efficiency of the improved algorithm is examined by several experiments on 23 benchmark functions [40].
The improved algorithm achieves satisfactory results in feature selection application.
The rest of this paper is organized as follows. Section 2 presents the preliminaries, which contain GOA and the original BGOA. Section 3 presents the improved version of BGOA. Section 4 shows the effect of the improved BGOA on 23 benchmark functions. Section 5 describes the application of the improved BGOA to feature selection. Section 6 analyzes the results of feature selection. Section 7 gives a discussion of this paper.
2. Preliminaries
GOA has been maturely applied to continuous problems, and its binary variants have also been gradually refined. This section introduces the standard GOA and the BGOA based on the sigmoid transfer function.
2.1. GOA
Grasshoppers are incompletely metamorphosed insects whose life cycle consists of three stages: egg, nymph, and adult. They are a worldwide agricultural pest and generally occur individually; nevertheless, they are swarming organisms prone to periodic population outbreaks and can migrate over long distances. Grasshoppers spend most of their lives in the nymph and adult stages. Adult grasshoppers have strong hind legs and cause tremendous damage to agriculture, forestry, and animal husbandry. They are adept at jumping and flying through the air with a wide range of movement. In addition to migration, grasshoppers are also characterized by their predation process. Nature-inspired optimization algorithms have two phases: exploration and exploitation. Exploration is a large-scale search to prevent falling into a local optimum, while exploitation is a small-scale search to find the optimal solution [41,42]. Grasshoppers can instinctively perform these two steps to find their target. Furthermore, according to the grasshopper’s characteristics, GOA has a unique adaptive mechanism that can effectively regulate the global and local search process with high search accuracy. This phenomenon is mathematically modeled by Saremi et al. [43]:
$X_i = S_i + G_i + A_i$    (1)
where $X_i$ represents the position of the i-th grasshopper, $S_i$ represents the social interaction between the i-th grasshopper and the other individuals, $G_i$ is the gravitational influence, and $A_i$ is the wind influence. Each operator is multiplied by a random number from 0 to 1 to enhance the randomness, as shown in Equation (2):
$X_i = r_1 S_i + r_2 G_i + r_3 A_i$    (2)
The details of the social interaction operator $S_i$ are as follows:
$S_i = \sum_{j=1, j \neq i}^{N} s(d_{ij}) \, \hat{d}_{ij}$    (3)
where $d_{ij} = |x_j - x_i|$ is the distance between the i-th and j-th grasshoppers, the function s calculates the intensity of the social interaction, and $\hat{d}_{ij} = \frac{x_j - x_i}{d_{ij}}$ is the unit vector from the i-th grasshopper to the j-th grasshopper, where $x_i$ and $x_j$ represent the positions of the i-th and j-th grasshoppers, respectively. The s function is defined as shown in Equation (4):
$s(r) = f e^{-r/l} - e^{-r}$    (4)
where f is the attraction strength and l is the attractive length scale. A negative value of s indicates mutual repulsion, a positive value indicates mutual attraction between grasshoppers, and 0 means that they are in their comfort zone. The value of f is 0.5 and the value of l is 1.5. When two grasshoppers are too far apart, the force between them vanishes; therefore, the distance has to be normalized. In the original paper, gravity is not taken into account, and the wind direction is assumed to be toward the best value. The final position formula is shown in Equation (5). The pseudocode of GOA is given in Algorithm 1.
$X_i^d = c \left( \sum_{j=1, j \neq i}^{N} c \, \frac{ub_d - lb_d}{2} \, s\!\left(|x_j^d - x_i^d|\right) \frac{x_j - x_i}{d_{ij}} \right) + \hat{T}_d$    (5)
| Algorithm 1: Pseudocode of GOA |
where $ub_d$ and $lb_d$ are the upper and lower bounds of the d-th dimension, respectively, $\hat{T}_d$ is the value of the d-th dimension of the best solution found so far, and N represents the population size. The calculation of c in Equation (5) is explained below:
$c = c_{max} - t \, \frac{c_{max} - c_{min}}{T}$    (6)
where $c_{max}$ is the maximum value, $c_{min}$ is the minimum value, t is the current iteration, and T is the total number of iterations. It is easy to see that c becomes smaller as the number of iterations increases. The first (outer) c narrows the search area around the target as the iterations proceed, while the second (inner) c reduces the attraction or repulsion between grasshoppers. In this paper, $c_{max} = 1$ and $c_{min} = 0.00004$.
From Equation (5), we can see that the new position of the i-th grasshopper is related not only to its current position but also to the current positions of all other grasshoppers and the interaction forces between individuals. The adaptive mechanism of the algorithm can balance the global and local search, giving it an excellent optimization-seeking ability.
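As a concrete illustration, the following minimal Python sketch implements Equations (4)–(6) for one position update. The function and variable names (goa_step, positions, target) are illustrative rather than taken from the original implementation, and the mapping of the distance into [1, 4] follows common public GOA implementations rather than a formula stated here.

```python
import numpy as np

def s_func(r, f=0.5, l=1.5):
    """Social force of Equation (4), with f = 0.5 and l = 1.5."""
    return f * np.exp(-r / l) - np.exp(-r)

def c_coeff(t, T, c_max=1.0, c_min=0.00004):
    """Decreasing coefficient c of Equation (6)."""
    return c_max - t * (c_max - c_min) / T

def goa_step(positions, target, c, ub, lb):
    """One GOA position update per Equation (5).
    positions : (N, D) array of grasshopper positions (illustrative name)
    target    : (D,) best solution found so far (T-hat in Equation (5))
    """
    N, D = positions.shape
    new_positions = np.empty_like(positions)
    for i in range(N):
        social = np.zeros(D)
        for j in range(N):
            if j == i:
                continue
            dist = np.linalg.norm(positions[j] - positions[i])
            unit = (positions[j] - positions[i]) / (dist + 1e-12)
            # map the distance into [1, 4], as done in public GOA codes,
            # so the s function is evaluated outside its flat region
            r = 2.0 + dist % 2.0
            social += c * (ub - lb) / 2.0 * s_func(r) * unit
        new_positions[i] = c * social + target
    return np.clip(new_positions, lb, ub)
```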
2.2. BGOA
The search space of GOA is continuous; thus the position can be moved freely. However, in binary space, the position can only take the value 0 or 1. Mafarja et al. [36] used the sigmoid transfer function to implement the binary conversion:
$T(\Delta X_t^d) = \dfrac{1}{1 + e^{-\Delta X_t^d}}$    (7)
where $\Delta X$ is the first part of Equation (5) (the bracketed term multiplied by c), similar to the velocity variable in the PSO algorithm, and is called the step size. The absolute value of $\Delta X$ can be considered as the distance between the updated position of the grasshopper and the target position in the d-th dimension. A conversion probability is obtained from the transfer function, and accordingly the formula for updating the grasshopper’s position is changed through Equation (7) and Equation (8):
$X_{t+1}^d = \begin{cases} 1, & r < T(\Delta X_{t+1}^d) \\ 0, & \text{otherwise} \end{cases}$    (8)
where r is a random number in [0, 1] and $X_{t+1}^d$ is the position in the d-th dimension after the t-th iteration.
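For illustration, Equations (7) and (8) can be written in a few lines of Python. This is a minimal sketch; the names delta_x (for the step size) and rng (for the random generator) are assumptions made for the example only.

```python
import numpy as np

def sigmoid_transfer(delta_x):
    """Equation (7): conversion probability computed from the step size."""
    return 1.0 / (1.0 + np.exp(-np.asarray(delta_x)))

def binary_position(delta_x, rng=None):
    """Equation (8): each bit becomes 1 when a uniform r in [0, 1] falls below T(delta_x)."""
    rng = np.random.default_rng() if rng is None else rng
    prob = sigmoid_transfer(delta_x)
    r = rng.random(prob.shape)
    return (r < prob).astype(int)
```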
3. Analysis and the Improvement of Binary Grasshopper Optimization Algorithm
The standard GOA algorithm and its modified versions have achieved good results on continuous problems. Feature selection can be seen as a binary problem of selecting an appropriate 0/1 string, whose length equals the total number of features in the original dataset: 0 represents an unselected attribute and 1 a selected attribute. Additionally, the transfer function is a common and classical method for converting a continuous space into a binary one [44].
The original BGOA uses the step size and a transfer function for binary conversion and obtains reasonable results. From the analysis in Section 2, we know that the parameter c, and hence the step size, becomes smaller as the number of iterations increases. After debugging the code and keeping the decimal places, the range of the step size is found to be [−0.3, 0.4], which means that the conversion probability always falls within a small part of [0, 1]; the curve is shown in Figure 1. Beyond that, the parameter r in Equation (8) is a random number, which may not be conducive to position updates in the early exploration stage. The ideal behavior is that individuals in the population can randomly flip their positions in the binary pattern. To avoid this situation, this paper improves the performance of BGOA by modifying the transfer function: a new step size variable and three improved transfer functions are proposed. The first two transfer functions are based on the sigmoid transfer function, and the third is a V-shaped function.
Figure 1.
Curve of the value range of the step size $\Delta X$.
The step size is modified to consider both the range of the population positions and the uniformity with which conversion probabilities fall on either side of 0.5, to ensure fairness. When the step size takes a value close to 6, the conversion probability of the sigmoid is nearly 1. Therefore, the step size is enlarged by a factor of B = 20, which changes its range to approximately [−6, 6], and the new transfer functions are proposed based on this range. These transfer functions approach their two extremes of 0 and 1 on [−6, 6], which provides strong randomness in converting the binary position, and their values on both sides of the 0 point are evenly distributed.
When the grasshopper updates its position, the transition probability is obtained according to Equation (9), which we refer to as BGOAS1:
| (9) |
Equation (10) is called BGOAS2:
| (10) |
and Equation (11) is called BGOAV:
| (11) |
The new position is derived according to Equation (12) or Equation (13):
| (12) |
| (13) |
From the above description, we can see that the new position of a grasshopper depends on the current positions of all grasshoppers and is finally derived from the position conversion probability. Compared with the existing BGOA, the methods proposed in this paper have better exploration ability and randomness. The new transfer functions are shown in Figure 2 and Figure 3, and the pseudocode of the new BGOA is displayed in Algorithm 2.
| Algorithm 2: Pseudocode of new BGOA |
Figure 2.
BGOAS1 and BGOAS2 transfer functions.
Figure 3.
BGOAV transfer function.
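Before moving to the experiments, the sketch below illustrates the general idea behind the new transfer functions under stated assumptions: the step size is enlarged by the factor B = 20, and a sigmoid-shaped and a generic V-shaped curve are applied to the enlarged step. These forms are illustrative stand-ins and not necessarily the exact BGOAS1/BGOAS2/BGOAV formulas of Equations (9)–(11).

```python
import numpy as np

B = 20  # enlargement factor for the step size, as set in this paper

def scaled_step(delta_x):
    """Scale the original step size (roughly [-0.3, 0.4]) into about [-6, 6]."""
    return B * np.asarray(delta_x)

def sigmoid_like_transfer(delta_x):
    """Illustrative sigmoid-shaped transfer on the enlarged step;
    the exact BGOAS1/BGOAS2 formulas are those of Equations (9) and (10)."""
    return 1.0 / (1.0 + np.exp(-scaled_step(delta_x)))

def v_shaped_transfer(delta_x):
    """Illustrative V-shaped transfer; the exact BGOAV formula is Equation (11)."""
    return np.abs(np.tanh(scaled_step(delta_x)))
```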
4. Experimental Results
The validity of the new algorithm is verified in this section. Many excellent test suites exist, such as the benchmarks of the BBOB workshop, which support algorithm developers and practitioners by automating benchmarking experiments for black-box optimization algorithms [45,46,47]. This manuscript uses 23 benchmark test functions to demonstrate the effectiveness of the proposed algorithm. Among them, f1–f7 are unimodal benchmark functions, f8–f13 are multimodal benchmark functions, and f14–f23 are fixed-dimension benchmark functions. The details of each test function are presented in Table 1, Table 2 and Table 3: Space is the search space of the population, Dim is the function’s dimension, and TM is its theoretical optimum. The settings of all parameters required by the algorithms are given in Table 4.
Table 1.
Unimodal test functions.
| Num | Name | Space | Dim | TM |
|---|---|---|---|---|
| 1 | Sphere | [−100, 100] | 30 | 0 |
| 2 | Schwefel’s function 2.22 | [−10, 10] | 30 | 0 |
| 3 | Schwefel’s function 1.2 | [−100, 100] | 30 | 0 |
| 4 | Schwefel’s function 2.21 | [−100, 100] | 30 | 0 |
| 5 | Rosenbrock | [−30, 30] | 30 | 0 |
| 6 | Step | [−100, 100] | 30 | 0 |
| 7 | Dejong’s noisy | [−1.28, 1.28] | 30 | 0 |
Table 2.
Multimodal test functions.
| Num | Name | Space | Dim | TM |
|---|---|---|---|---|
| 8 | Schwefel | [−500, 500] | 30 | −12,569 |
| 9 | Rastringin | [−5.12, 5.12] | 30 | 0 |
| 10 | Ackley | [−32, 32] | 30 | 0 |
| 11 | Griewank | [−600, 600] | 30 | 0 |
| 12 | Generalized penalized 1 | [−50, 50] | 30 | 0 |
| 13 | Generalized penalized 2 | [−50, 50] | 30 | 0 |
Table 3.
Fixed-dimension test functions.
| Num | Name | Space | Dim | TM |
|---|---|---|---|---|
| 14 | Fifth of Dejong | [−65, 65] | 2 | 1 |
| 15 | Kowalik | [−5, 5] | 4 | 0.00030 |
| 16 | Six-hump camel back | [−5, 5] | 6 | −1.0316 |
| 17 | Branins | [−5, 5] | 2 | 0.398 |
| 18 | Goldstein–Price | [−2, 2] | 2 | 3 |
| 19 | Hartman 1 | [0, 10] | 3 | −3.86 |
| 20 | Hartman 2 | [0, 1] | 6 | −3.32 |
| 21 | Shekel 1 | [0, 1] | 4 | −10.1532 |
| 22 | Shekel 2 | [0, 1] | 4 | −10.4028 |
| 23 | Shekel 3 | [0, 1] | 4 | −10.5363 |
Table 4.
Parameters and values.
| Parameter | Value |
|---|---|
| Cmax | 1 |
| Cmin | 0.00004 |
| C1 | 2 |
| C2 | 2 |
| wmax | 0.9 |
| wmin | 0.2 |
| SigmaMax | 1 |
| SigamaMin | 0.1 |
| Vmax | 6 |
| Vmin | −6 |
| popnum | 30 |
| Max_iter | 500 |
The improved algorithm is compared with BGOA, BPSO, and BGWO. The mean and standard deviation (std) of the test functions are given in Table 5 and Table 6. If the improved algorithm performs better than or the same as the original algorithm, the better result is shown in bold font. For example, for f12, the values obtained by BGOAS1, BGOAS2, and BGOAV are smaller than that of BGOA, so the first three values are indicated in bold font.
Table 5.
The result of mean values.
| Functions | BGOA_S1 | BGOA_S2 | BGOA_V | BGOA | BPSO | BGWO |
|---|---|---|---|---|---|---|
| f1 | 10.0000 | 9.0000 | 7.8000 | 10.0000 | 4.6000 | 3.2000 |
| f2 | 0.0000 | 0.8000 | 0.0000 | 0.6000 | 0.0000 | 0.0000 |
| f3 | 0.4000 | 0.0000 | 0.0000 | 0.2000 | 0.0000 | 0.0000 |
| f4 | 0.2000 | 0.6000 | 0.0000 | 0.2000 | 0.0000 | 0.2000 |
| f5 | 20.8000 | 2.4000 | 0.0000 | 20.0000 | 0.0000 | 0.0000 |
| f6 | 2.8500 | 1.2500 | 1.2500 | 2.4500 | 1.2500 | 1.2500 |
| f7 | 0.0062 | 0.2078 | 0.0016 | 1.0064 | 0.0055 | 0.0002 |
| f8 | −4.2074 | −4.2074 | −4.2074 | −4.2074 | −3.8708 | −3.8708 |
| f9 | 0.8000 | 0.6000 | 0.0000 | 0.4000 | 0.0000 | 0.2000 |
| f10 | 1.0267 | 1.0267 | 0.3422 | |||
| f11 | 0.0861 | 0.0197 | 0.0000 | 0.0394 | 0.0000 | 0.0000 |
| f12 | 4.1233 | 4.1862 | 4.1233 | 4.4846 | 4.1233 | 4.1233 |
| f13 | 0.0400 | 0.0400 | 0.0600 | |||
| f14 | 12.6705 | 12.6705 | 12.6705 | 12.6705 | 12.6705 | 12.6705 |
| f15 | 0.1484 | 0.1484 | 0.1484 | 0.1484 | 0.1484 | 0.1484 |
| f16 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| f17 | 27.7029 | 27.7029 | 27.7029 | 27.7029 | 27.7029 | 27.7029 |
| f18 | 600.0000 | 600.0000 | 600.0000 | 600.0000 | 600.0000 | 600.0000 |
| f19 | −0.3348 | −0.3348 | −0.3348 | −0.3348 | −0.3348 | −0.3348 |
| f20 | −0.1343 | −0.1196 | −0.1657 | −0.0989 | −0.1657 | −0.1469 |
| f21 | −4.2205 | −5.0552 | −5.0552 | −5.0552 | −5.0552 | −5.0552 |
| f22 | −3.4172 | −5.0877 | −5.0877 | −5.0877 | −5.0877 | −5.0877 |
| f23 | −5.1285 | −5.1285 | −5.1285 | −5.1285 | −5.1285 | −5.1285 |
If the improved algorithm works better than or the same as the original BGOA, then we put the good result in bold font.
Table 6.
The result of std values.
| Functions | BGOA_S1 | BGOA_S2 | BGOA_V | BGOA | BPSO | BGWO |
|---|---|---|---|---|---|---|
| f1 | 1.4142 | 1.2247 | 0.8367 | 1.4142 | 0.8944 | 1.4832 |
| f2 | 0.0000 | 0.4472 | 0.0000 | 0.5477 | 0.0000 | 0.0000 |
| f3 | 0.5477 | 0.0000 | 0.0000 | 0.4472 | 0.0000 | 0.0000 |
| f4 | 0.5477 | 0.5477 | 0.0000 | 0.4472 | 0.0000 | 0.4472 |
| f5 | 44.3080 | 2.1909 | 0.0000 | 44.7214 | 0.0000 | 0.0000 |
| f6 | 0.8944 | 0.0000 | 0.0000 | 1.0954 | 0.0000 | 0.0000 |
| f7 | 0.0046 | 0.4485 | 0.0018 | 1.4092 | 0.0053 | 0.0001 |
| f8 | 0.4609 | 0.4609 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| f9 | 0.4472 | 0.5477 | 0.0000 | 0.5477 | 0.0000 | 0.4472 |
| f10 | 0.9373 | 0.9373 | 0.0000 | 0.7653 | 0.0000 | 0.0000 |
| f11 | 0.0887 | 0.0441 | 0.0000 | 0.0540 | 0.0000 | 0.0000 |
| f12 | 0.0000 | 0.1405 | 0.0000 | 0.4947 | 0.0000 | 0.0000 |
| f13 | 0.0548 | 0.0548 | 0.0000 | 0.0548 | 0.0000 | 0.0000 |
| f14 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| f15 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| f16 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| f17 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| f18 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| f19 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| f20 | 0.0195 | 0.0215 | 0.0000 | 0.0272 | 0.0000 | 0.0615 |
| f21 | 1.8663 | 2.2858 | 0.0000 | 2.2858 | 0.0000 | 0.0000 |
| f22 | 1.8676 | 0.0000 | 0.0000 | 2.2877 | 0.0000 | 0.0000 |
| f23 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
If the improved algorithm works better than or the same as the original BGOA, then we put the good result in bold font.
As can be seen, the improved algorithm has an obvious advantage over the original BGOA in the mean fitness values obtained on the first 13 test functions and is competitive with BPSO and BGWO. This indicates that the improved algorithm is more effective at solving high-dimensional problems. On the fixed-dimension functions, the values obtained by the six algorithms are almost the same, which illustrates that the improved strategy offers no particular advantage on low-dimensional problems.
For the unimodal test functions, there is only one optimal solution; consequently, they can effectively check the convergence rate of the algorithms. Table 5 and Table 6 show that the proposed BGOAV obtains the best or equal-best mean and std on most of the unimodal functions (f2–f6) and clearly outperforms the original BGOA on f1 and f7 as well. On f2, f3, and f6, BGOAS1 or BGOAS2 also reaches the best value among the considered algorithms.
Functions f8–f13 are multimodal test functions. These functions have many local optima and are suitable for testing an algorithm’s ability to avoid them. BGOAS1, BGOAS2, and BGOAV perform well on these functions, and BGOAV achieves the best or equal-best mean and standard deviation among the proposed variants. On f9 and f11, BGOAV reaches the theoretical optimum. On f8 and f12, the proposed methods are as close to or closer to the optimum than BPSO and BGWO. Moreover, for f13, the best result is obtained by BGOAV, BPSO, and BGWO. In other words, the proposed strategy of enlarging the step size produces good results and helps prevent the algorithm from falling into local optima.
Functions f14–f23 are fixed-dimension functions. It is evident from the results that the mean and standard deviation obtained by all algorithms are almost the same; only on f20 do the results differ noticeably. This shows that on the fixed-dimension functions the new algorithm has no special advantage over BGOA, BPSO, and BGWO, which is due to the low dimensionality and simple structure of these functions, whereas the improved strategies are better suited to high-dimensional and complex problems.
To judge whether the results of the improved strategies differ from those of the other algorithms, the Wilcoxon rank-sum test and the Friedman test were performed at a 5% significance level. The null hypothesis is that there are no significant differences between the algorithms; if the p-value is smaller than 0.05, the null hypothesis is rejected. It can be seen from Table 7 that on several of the functions f1–f4, p-values smaller than 0.05 are obtained, which indicates significant differences from BGOA on those functions. The data in Table 8 show that the p-value is not greater than 0.05 for f1–f3 and f5–f13, which is strong evidence against the null hypothesis and suggests significant differences between the algorithms on these 12 functions. Combined with the results in Table 5 and Table 6, it can be argued that the improved methods outperform the compared algorithms overall.
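Such tests can be reproduced, for example, with SciPy (the paper does not state which statistical tool was used); in the sketch below, the per-run fitness arrays are placeholders rather than the actual experimental data.

```python
from scipy.stats import ranksums, friedmanchisquare

# placeholder arrays: fitness values of repeated runs on one benchmark function
bgoas1_runs = [10, 9, 11, 10, 10]
bgoas2_runs = [9, 8, 10, 9, 9]
bgoav_runs  = [8, 7, 8, 9, 7]
bgoa_runs   = [12, 11, 13, 12, 11]
bpso_runs   = [5, 4, 6, 5, 4]
bgwo_runs   = [3, 4, 3, 2, 4]

# Wilcoxon rank-sum test: pairwise comparison at the 5% significance level
stat, p = ranksums(bgoas1_runs, bgoa_runs)
print(f"rank-sum p-value: {p:.4f}")

# Friedman test: joint comparison of all six algorithms
stat, p = friedmanchisquare(bgoas1_runs, bgoas2_runs, bgoav_runs,
                            bgoa_runs, bpso_runs, bgwo_runs)
print(f"Friedman p-value: {p:.4f}")
```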
Table 7.
p-value of Wilcoxon rank-sum test.
| Functions | BGOA_S1 | BGOA_S2 | BGOA_V | BPSO | BGWO |
|---|---|---|---|---|---|
| f1 | 0.0556 | 0.7619 | 0.0556 | 0.0079 | 0.0079 |
| f2 | 0.5238 | 0.2063 | 0.0476 | 0.0476 | 0.0476 |
| f3 | 0.7143 | 0.2063 | 0.0476 | 0.0476 | 0.0476 |
| f4 | 0.5238 | 1 | 0.1667 | 0.1667 | 1 |
| f5 | 1 | 1 | 1 | 1 | 1 |
| f6 | 0.1667 | 1 | 1 | 1 | 1 |
| f7 | 0.6905 | 0.8413 | 1 | 0.6905 | 0.6905 |
| f8 | 1 | 1 | 0.4444 | 0.4444 | 0.4444 |
| f9 | 1 | 1 | 0.4444 | 0.4444 | 0.4444 |
| f10 | 0.5238 | 1 | 1 | 1 | 1 |
| f11 | 0.0476 | 0.4444 | 1 | 1 | 1 |
| f12 | 0.2857 | 0.5238 | 0.1667 | 0.1667 | 0.1667 |
| f13 | 1 | 0.5238 | 0.4444 | 0.4444 | 0.4444 |
| f14 | 1 | 1 | 1 | 1 | 1 |
| f15 | 1 | 1 | 1 | 1 | 1 |
| f16 | 1 | 1 | 1 | 1 | 1 |
| f17 | 1 | 1 | 1 | 1 | 1 |
| f18 | 1 | 1 | 1 | 1 | 1 |
| f19 | 1 | 1 | 1 | 1 | 1 |
| f20 | 0.3810 | 0.6825 | 0.1667 | 0.1667 | 1 |
| f21 | 1 | 1 | 1 | 1 | 1 |
| f22 | 1 | 1 | 1 | 1 | 1 |
| f23 | 1 | 1 | 1 | 1 | 1 |
Table 8.
The results of Friedman test.
| Friedman | Sum of Squares | Degree of Freedom | Mean Squares | p-Value |
|---|---|---|---|---|
| f1 | 73.2 | 5 | 14.62 | 0.0006 |
| f2 | 77.3 | 5 | 15.46 | 0.0004 |
| f3 | 70.7 | 5 | 14.14 | 0.0011 |
| f4 | 0 | 5 | 0 | 1 |
| f5 | 80.3 | 5 | 16.06 | 0.0003 |
| f6 | 72.1 | 5 | 14.42 | 0.0008 |
| f7 | 73.1 | 5 | 14.62 | 0.0009 |
| f8 | 69.7 | 5 | 13.94 | 0.0011 |
| f9 | 78.1 | 5 | 15.62 | 0.0004 |
| f10 | 72.8 | 5 | 14.56 | 0.0008 |
| f11 | 71.5 | 5 | 14.3 | 0.0010 |
| f12 | 73 | 5 | 14.6 | 0.0008 |
| f13 | 78 | 5 | 15.6 | 0.0004 |
| f14 | 0 | 5 | 0 | 1 |
| f15 | 0 | 5 | 0 | 1 |
| f16 | 0 | 5 | 0 | 1 |
| f17 | 0 | 5 | 0 | 1 |
| f18 | 0 | 5 | 0 | 1 |
| f19 | 0 | 5 | 0 | 1 |
| f20 | 1.5 | 5 | 5 | 0.4159 |
| f21 | 6 | 5 | 1.2 | 0.0752 |
| f22 | 1.5 | 5 | 0.3 | 0.4159 |
| f23 | 1.5 | 5 | 0.3 | 0.4159 |
It is easy to see that the improved strategies promote the exploration and exploitation of BGOA. Moreover, it heightens the competitiveness of the algorithm in finding optimal solutions to functions. In the next section, this paper applies the improved algorithm to a real problem to study its practicability in FS.
5. Application of Feature Selection
Feature selection is a major step in the pre-processing stage of data mining. It can remove irrelevant and redundant data from the dataset [48]. Researchers usually look for methods with high precision and few selected features. In this section, the improved strategies (BGOAS1, BGOAS2, BGOAV) are applied to feature selection for classification problems. It will be shown that the improved strategies obtain better results and yield more accurate feature subsets.
Twenty-three datasets, each with different attributes and instance data, are selected from the UCI machine learning repository [49] for feature selection. In addition, this paper uses a wrapper-based method for feature selection. Detailed information on the 23 datasets is given in Table 9.
Table 9.
The details of datasets.
| S.no. | Datasets | Instances | Number of Classes (k) | Number of Features (d) | Size of Classes |
|---|---|---|---|---|---|
| 1 | Air | 359 | 3 | 64 | 107, 103, 149 |
| 2 | Appendicitis | 106 | 2 | 7 | 21, 85 |
| 3 | Austra | 690 | 2 | 14 | 395, 295 |
| 4 | Balancescale | 625 | 3 | 4 | 49, 288, 288 |
| 5 | Blood | 748 | 2 | 4 | 570, 178 |
| 6 | Breast | 277 | 2 | 9 | 196, 81 |
| 7 | Breast_gy | 277 | 2 | 9 | 196, 81 |
| 8 | Bupa | 345 | 2 | 6 | 145, 200 |
| 9 | Cleve | 296 | 2 | 13 | 160, 139 |
| 10 | Cloud | 1024 | 2 | 10 | 627, 403 |
| 11 | Diabetes | 768 | 2 | 8 | 268, 500 |
| 12 | Ecoli | 336 | 8 | 8 | 143, 77, 2, 2, 35, 20, 5, 52 |
| 13 | Glass | 214 | 6 | 9 | 29, 76, 70, 17, 13, 9 |
| 14 | Heartstatlog | 270 | 2 | 13 | 150, 120 |
| 15 | Jain | 373 | 2 | 2 | 276, 97 |
| 16 | phoneme | 5404 | 2 | 5 | 1586, 3818 |
| 17 | Robotnavigation | 5456 | 4 | 25 | 826, 2097, 2205, 328 |
| 18 | Seeds | 210 | 3 | 7 | 70, 70, 70 |
| 19 | segmentation | 210 | 7 | 18 | 30, 30, 30, 30, 30, 30, 30 |
| 20 | Sonar | 208 | 2 | 60 | 97, 111 |
| 21 | Thyroid | 215 | 3 | 5 | 150, 35, 30 |
| 22 | Vowel | 871 | 6 | 3 | 72, 89, 172, 151, 207, 180 |
| 23 | zoo | 101 | 7 | 16 | 41, 20, 5, 13, 4, 8, 10 |
The K-Nearest Neighbor (KNN) algorithm is one of the most commonly used classification algorithms in data mining [50]. KNN is a supervised learning method with a simple mechanism: given a testing sample, find the K nearest training samples according to some distance metric, and then use these K “neighbors” to make the prediction. Typically, majority voting is used, assigning the test sample to the class that occurs most frequently among its K neighbors. The distance metric between samples is generally the Euclidean or Manhattan distance [51]. The computation is shown in Equation (14):
$L_p(x_i, x_j) = \left( \sum_{l=1}^{n} \left| x_i^{(l)} - x_j^{(l)} \right|^p \right)^{1/p}$    (14)
where p is a variable constant: when p = 1, $L_p$ represents the Manhattan distance, and when p = 2, $L_p$ refers to the Euclidean distance. Here $x_i$ and $x_j$ represent two different instances in the set, and n is the number of attributes.
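A direct implementation of Equation (14) takes only a few lines; the function name below is an arbitrary choice for illustration.

```python
import numpy as np

def minkowski_distance(x_i, x_j, p=2):
    """Equation (14): p = 1 gives the Manhattan distance, p = 2 the Euclidean distance."""
    diff = np.abs(np.asarray(x_i) - np.asarray(x_j))
    return float(np.sum(diff ** p) ** (1.0 / p))
```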
The basic idea of cross-validation is to split the original data into training and testing sets [52]; the former is used to train the model, and the testing set is used for validation. K-fold cross-validation divides all the samples into K equally sized subsets and then traverses the K subsets in turn: each time, the current subset is used as the validation set and all the other samples are used as the training set to train and evaluate the model. Finally, the average of the K evaluations is taken as the final evaluation criterion of the model. K rarely exceeds 20, and 10 is generally sufficient [53].
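The cross-validated error rate used later in the fitness function can be estimated, for instance, with scikit-learn's KNN classifier and 10-fold cross-validation. This is a minimal sketch rather than the authors' code; X_selected and y are assumed to hold the dataset restricted to the selected features and its labels, and the number of neighbors is an illustrative choice.

```python
import numpy as np
from sklearn.neighbors import KNeighborsClassifier
from sklearn.model_selection import cross_val_score

def cv_error_rate(X_selected, y, folds=10, n_neighbors=5):
    """Mean classification error of KNN under K-fold cross-validation.
    n_neighbors is an illustrative choice, not a value stated here."""
    knn = KNeighborsClassifier(n_neighbors=n_neighbors)
    accuracy = cross_val_score(knn, X_selected, y, cv=folds, scoring="accuracy")
    return 1.0 - float(np.mean(accuracy))
```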
The error rate and accuracy of classification are crucial evaluation indicators in classification prediction. This paper uses Equation (15) as the fitness function:
$Fitness = \alpha \cdot ErrorRate + (1 - \alpha) \cdot \dfrac{SeF}{AlF}$    (15)
where ErrorRate denotes the classification error rate after K-fold cross-validation, which is explained in Equation (16). The parameter $\alpha$ is taken as 0.99, SeF is the number of features in the selected subset, and AlF is the total number of features of the dataset:
$ErrorRate = \dfrac{Enum}{Enum + Cornum}$    (16)
where Enum and Cornum are the numbers of incorrectly and correctly classified samples, respectively.
It can be seen from Equation (15) that the fitness function aims to find the combination of features with the maximum classification performance and the minimum number of selected features. The task is converted into a minimization problem by using the error rate instead of the classification accuracy and the selected-feature ratio instead of the unselected-feature ratio.
The wrapper-based method for feature selection directly uses the performance of the final model as the evaluation criterion for feature subsets. In this paper, KNN is used as the classifier to evaluate the quality of the selected features. The improved BGOAS1, BGOAS2, and BGOAV are used as search methods that adaptively search the feature space for subsets with a better evaluation criterion. A single dimension in the search space represents one feature, so a grasshopper’s position represents a combination of features, i.e., a solution. Note that a better feature evaluation corresponds to a smaller fitness value in Equation (15).
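Putting Equations (15) and (16) together, the wrapper fitness of one binary solution could look like the sketch below, which reuses the hypothetical cv_error_rate helper from the previous example; only the 0.99 weight is taken from the paper.

```python
import numpy as np

ALPHA = 0.99  # weight of the error rate in Equation (15)

def fitness(solution, X, y):
    """Equations (15) and (16): weighted sum of the CV error rate and the
    selected-feature ratio; `solution` is the 0/1 position of one grasshopper."""
    mask = np.asarray(solution, dtype=bool)
    if not mask.any():
        return 1.0  # no feature selected: assign the worst possible fitness
    error = cv_error_rate(X[:, mask], y)  # hypothetical helper sketched earlier
    return ALPHA * error + (1.0 - ALPHA) * mask.sum() / mask.size
```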
6. Results of Feature Selection
The improved algorithms, BGOA, BPSO, and BGWO are applied to feature selection. All population sizes are set to 30, the number of iterations is 100, and each algorithm is run 15 times on each dataset. The value of K in KNN is taken as 10. Table 10 shows the feature-selection fitness values, Table 11 records the number of selected features, and Table 12 describes the accuracy of feature selection. The Wilcoxon rank-sum and Friedman tests on the mean accuracy and fitness values are reported in Table 13 and Table 14.
Table 10.
The result of fitness value.
| Dataset | BGOA_S1 | BGOA_S2 | BGOA_V | BGOA | BPSO | BGWO |
|---|---|---|---|---|---|---|
| Air | 0.07380 | 0.05781 | 0.08481 | 0.07234 | 0.08249 | 0.07068 |
| Appendicitis | 0.13522 | 0.13996 | 0.14689 | 0.09244 | 0.14039 | 0.13982 |
| Austra | 0.30696 | 0.31836 | 0.32391 | 0.32767 | 0.32356 | 0.32189 |
| Balancescale | 0.22665 | 0.15280 | 0.19935 | 0.17176 | 0.18391 | 0.17874 |
| WDBC | 0.04557 | 0.04067 | 0.04772 | 0.04700 | 0.05628 | 0.05696 |
| Blood | 0.23635 | 0.23644 | 0.23634 | 0.23404 | 0.22684 | 0.23165 |
| Breast | 0.24452 | 0.24101 | 0.25303 | 0.23613 | 0.24856 | 0.24672 |
| Breast_gy | 0.20910 | 0.20922 | 0.22714 | 0.23690 | 0.21706 | 0.21775 |
| Bupa | 0.33957 | 0.32704 | 0.36654 | 0.31300 | 0.33118 | 0.32752 |
| Cleve | 0.17600 | 0.18140 | 0.20421 | 0.19038 | 0.17440 | 0.18164 |
| Cloud | 0.00514 | 0.01155 | 0.01852 | 0.01020 | 0.01450 | 0.01450 |
| Diabetes | 0.24661 | 0.25597 | 0.27359 | 0.24301 | 0.26049 | 0.26189 |
| Segmentation | 0.11190 | 0.10644 | 0.11683 | 0.12310 | 0.11451 | 0.11996 |
| Thyroid | 0.05067 | 0.05067 | 0.06083 | 0.08206 | 0.06728 | 0.06933 |
| Heartstatlog | 0.16154 | 0.16343 | 0.19129 | 0.16682 | 0.15647 | 0.17342 |
| Ecoli | 0.15588 | 0.13980 | 0.16796 | 0.16516 | 0.15877 | 0.16499 |
| Glass | 0.57879 | 0.57890 | 0.60825 | 0.56943 | 0.57092 | 0.57003 |
| Jain | 0.01000 | 0.01000 | 0.02855 | 0.05000 | 0.05000 | 0.05000 |
| Vowel | 0.14382 | 0.14220 | 0.14974 | 0.17170 | 0.17422 | 0.17374 |
| Seeds | 0.04764 | 0.05504 | 0.06730 | 0.04882 | 0.06160 | 0.06160 |
| Sonar | 0.18299 | 0.18421 | 0.21231 | 0.17410 | 0.20257 | 0.19467 |
| Balance scale | 0.21900 | 0.17074 | 0.25572 | 0.19357 | 0.19930 | 0.19926 |
| Zoo | 0.04597 | 0.04576 | 0.05720 | 0.03232 | 0.05771 | 0.06155 |
If the improved algorithm works better than or the same as the original BGOA, then we put the good result in bold font.
Table 11.
The number of selected features.
| Dataset | BGOA_S1 | BGOA_S2 | BGOA_V | BGOA | BPSO | BGWO |
|---|---|---|---|---|---|---|
| Air | 33.66667 | 33.33333 | 31.00000 | 32.66667 | 36.00000 | 40.33333 |
| Appendicitis | 5.66667 | 5.33333 | 1.00000 | 6.00000 | 8.33333 | 6.66667 |
| Austra | 2.66667 | 4.33333 | 3.00000 | 2.33333 | 2.00000 | 2.00000 |
| Balancescale | 2.33333 | 6.00000 | 0.33333 | 4.00000 | 4.66667 | 5.66667 |
| WDBC | 3.33333 | 4.00000 | 3.66667 | 4.00000 | 4.00000 | 4.00000 |
| Blood | 12.00000 | 15.00000 | 16.66667 | 3.33333 | 12.33333 | 13.66667 |
| Breast | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.00000 | 0.33333 |
| Breast_gy | 3.33333 | 5.33333 | 4.66667 | 5.33333 | 4.00000 | 4.00000 |
| Bupa | 2.33333 | 5.33333 | 5.66667 | 2.66667 | 4.00000 | 3.00000 |
| Cleve | 3.33333 | 4.00000 | 4.66667 | 4.00000 | 3.66667 | 4.00000 |
| Cloud | 5.00000 | 6.00000 | 6.33333 | 6.00000 | 6.33333 | 6.00000 |
| Diabetes | 1.00000 | 2.33333 | 3.33333 | 1.33333 | 1.66667 | 1.66667 |
| Segmentation | 4.00000 | 5.66667 | 4.00000 | 3.66667 | 2.33333 | 3.33333 |
| Thyroid | 7.66667 | 10.00000 | 8.33333 | 8.00000 | 8.00000 | 8.33333 |
| Heartstatlog | 2.66667 | 2.66667 | 2.33333 | 1.66667 | 1.66667 | 2.00000 |
| Ecoli | 5.66667 | 8.00000 | 8.33333 | 5.00000 | 6.66667 | 6.66667 |
| Glass | 5.00000 | 5.33333 | 5.33333 | 4.66667 | 4.33333 | 4.66667 |
| Jain | 3.00000 | 3.66667 | 5.33333 | 2.66667 | 3.66667 | 5.00000 |
| Vowel | 2.00000 | 2.00000 | 1.66667 | 2.00000 | 2.00000 | 2.00000 |
| Seeds | 3.00000 | 3.00000 | 3.00000 | 3.00000 | 3.00000 | 3.00000 |
| Sonar | 2.00000 | 2.66667 | 3.33333 | 2.00000 | 2.33333 | 2.33333 |
| Balancescale | 13.66667 | 25.66667 | 28.33333 | 32.66667 | 26.66667 | 31.66667 |
| Zoo | 3.33333 | 4.00000 | 2.66667 | 4.00000 | 4.00000 | 4.00000 |
If the improved algorithm works better than or the same as the original BGOA, then we put the good result in bold font.
Table 12.
The accuracy of feature selection.
| Dataset | BGOA_S1 | BGOA_S2 | BGOA_V | BGOA | BPSO | BGWO |
|---|---|---|---|---|---|---|
| Air | 0.93077 | 0.94687 | 0.91923 | 0.95072 | 0.94277 | 0.95877 |
| Appendicitis | 0.95714 | 0.95714 | 0.94286 | 0.98571 | 0.96667 | 0.95714 |
| Austra | 0.86726 | 0.86488 | 0.85595 | 0.92024 | 0.86726 | 0.86786 |
| Balancescale | 0.69162 | 0.68275 | 0.67306 | 0.67012 | 0.67696 | 0.68247 |
| WDBC | 0.77947 | 0.85576 | 0.80789 | 0.87183 | 0.85904 | 0.86448 |
| Blood | 0.95801 | 0.96397 | 0.95741 | 0.97002 | 0.96239 | 0.96402 |
| Breast | 0.76126 | 0.76118 | 0.76127 | 0.75364 | 0.76122 | 0.76055 |
| Breast_gy | 0.75675 | 0.76254 | 0.74965 | 0.78263 | 0.76175 | 0.76368 |
| Bupa | 0.79140 | 0.79465 | 0.77693 | 0.76623 | 0.79491 | 0.78833 |
| Cleve | 0.66261 | 0.67639 | 0.63761 | 0.70561 | 0.68356 | 0.69033 |
| Cloud | 0.82611 | 0.82143 | 0.79865 | 0.82389 | 0.84206 | 0.83310 |
| Diabetes | 0.99581 | 0.99070 | 0.98466 | 0.99628 | 0.99351 | 0.99351 |
| Segmentation | 0.75595 | 0.74860 | 0.72870 | 0.76832 | 0.74115 | 0.74626 |
| Thyroid | 0.89127 | 0.89810 | 0.88667 | 0.89381 | 0.90286 | 0.89810 |
| Heartstatlog | 0.95556 | 0.95556 | 0.94444 | 0.93556 | 0.95111 | 0.95333 |
| Ecoli | 0.84123 | 0.84113 | 0.81326 | 0.84464 | 0.86228 | 0.84444 |
| Glass | 0.84976 | 0.86649 | 0.83804 | 0.86123 | 0.86546 | 0.86141 |
| Jain | 0.41873 | 0.41937 | 0.39159 | 0.41619 | 0.42048 | 0.42921 |
| Vowel | 1.00000 | 1.00000 | 0.97958 | 1.00000 | 1.00000 | 1.00000 |
| Seeds | 0.86483 | 0.86647 | 0.85885 | 0.87189 | 0.86924 | 0.86974 |
| Sonar | 0.95476 | 0.94825 | 0.93683 | 0.96365 | 0.95270 | 0.95270 |
| Balancescale | 0.81746 | 0.81825 | 0.79032 | 0.84540 | 0.81016 | 0.82286 |
| Zoo | 0.78721 | 0.83763 | 0.74843 | 0.84887 | 0.84285 | 0.84288 |
If the improved algorithm works better than or the same as the original BGOA, then we put the good result in bold font.
Table 13.
The result of Wilcoxon rank-sum test.
| Dataset | BGOA_S1 | BGOA_S2 | BGOA_V | BPSO | BGWO |
|---|---|---|---|---|---|
| Air | 0.1746 | 0.4444 | 0.0079 | 0.0397 | 0.0079 |
| Appendicitis | 0.5397 | 0.4762 | 0.6508 | 0.8095 | 0.2460 |
| Austra | 0.7460 | 0.3095 | 0.8413 | 1.0000 | 0.4206 |
| Balancescale | 0.5714 | 1.0000 | 0.6905 | 0.0952 | 0.0317 |
| WDBC | 0.5873 | 1.0000 | 0.5714 | 0.4524 | 0.1746 |
| Blood | 1.0000 | 0.0079 | 0.1508 | 0.1429 | 0.4603 |
| Breast | 0.5714 | 0.4206 | 0.7302 | 0.0952 | 0.5873 |
| Breast_gy | 1.0000 | 0.1667 | 0.6905 | 0.1508 | 0.3095 |
| Bupa | 0.7460 | 0.2540 | 0.3095 | 0.0079 | 0.0079 |
| Cleve | 0.7460 | 0.1508 | 0.0317 | 0.0079 | 0.0079 |
| Cloud | 0.8571 | 0.3095 | 0.2381 | 0.8413 | 0.8413 |
| Diabetes | 0.1508 | 0.4206 | 1.0000 | 0.1508 | 0.1508 |
| Segmentation | 0.1349 | 0.0079 | 0.1508 | 0.0079 | 0.0317 |
| Thyroid | 0.0714 | 1.0000 | 0.1190 | 0.4762 | 0.1190 |
| Heartstatlog | 0.5476 | 0.6825 | 0.4365 | 0.0079 | 0.0159 |
| Ecoli | 0.0635 | 0.5079 | 0.0556 | 0.0079 | 0.0079 |
| Glass | 0.1349 | 0.8413 | 0.4206 | 0.0476 | 0.8016 |
| Jain | 1.0000 | 1.0000 | 1.0000 | 1.0000 | 1.0000 |
| Vowel | 0.5476 | 0.3968 | 1.0000 | 0.0079 | 0.0079 |
| Seeds | 0.9524 | 0.7063 | 0.2302 | 0.1190 | 0.0397 |
| Sonar | 0.1984 | 0.5476 | 0.0952 | 0.0556 | 0.0317 |
| Balancescale | 0.1508 | 0.3095 | 0.1032 | 0.0079 | 0.0079 |
| Zoo | 0.4921 | 1.0000 | 1.0000 | 0.0476 | 0.2063 |
Table 14.
The result of Friedman test.
| Dataset | Sum of Squares | Degree of Freedom | Mean Squares | p-Value |
|---|---|---|---|---|
| Air | 5.2 | 2 | 2.6 | 0.2548 |
| Appendicitis | 0.4 | 2 | 0.2 | 0.0916 |
| Austra | 2.8 | 2 | 1.4 | 0.8557 |
| Balancescale | 2.8 | 2 | 1.4 | 0.0823 |
| WDBC | 0.7 | 2 | 0.35 | 0.0382 |
| Blood | 3.6 | 2 | 1.8 | 0.4060 |
| Breast | 1.9 | 2 | 0.95 | 0.1132 |
| Breast_gy | 0.4 | 2 | 0.2 | 0.8995 |
| Bupa | 3.6 | 2 | 1.8 | 0.0427 |
| Cleve | 0 | 2 | 0 | 0.1257 |
| Cloud | 0 | 2 | 0 | 0.1018 |
| Diabetes | 0.3 | 2 | 0.15 | 0.4830 |
| Segmentation | 4.9 | 2 | 2.45 | 0.0342 |
| Thyroid | 4.8 | 2 | 2.4 | 0.0513 |
| Heartstatlog | 0 | 2 | 0 | 0.6151 |
| Ecoli | 0.4 | 2 | 0.2 | 0.2311 |
| Glass | 1.2 | 2 | 0.6 | 0.0663 |
| Jain | 2.8 | 2 | 1.4 | 0.0174 |
| Vowel | 3.6 | 2 | 1.8 | 0.0427 |
| Seeds | 2.8 | 2 | 1.4 | 0.6151 |
| Sonar | 0.4 | 2 | 0.2 | 0.0427 |
| Balancescale | 3.6 | 2 | 1.8 | 0.1546 |
| Zoo | 1.9 | 2 | 0.95 | 0.0513 |
Table 10 shows that the new strategies have clear advantages. The improved strategies obtain better fitness values than the original BGOA on 15 datasets, and on the Air, Austra, Breast, and Segmentation datasets the new strategies also outperform BPSO. The original BGOA obtains the best result on only five datasets: Appendicitis, Breast, Bupa, Diabetes, and Glass. The number of selected features reported in Table 11 also supports the claim that the improved algorithm performs better than the compared algorithms; it is worth mentioning that BGOAS1 selects the smallest subset of features on 14 datasets, and on most of the 23 datasets the improved algorithms obtain results that are better than or equal to those of BGOA, BPSO, and BGWO. The accuracy of feature selection is shown in Table 12. BGOAS1 achieves notably high accuracy on several datasets, including Balancescale, Bupa, Cloud, Diabetes, and Heartstatlog; compared with the original algorithm, the accuracy of feature selection is improved by about 3%. Accordingly, BGOAS2 and BGOAV obtain higher accuracy than BGOA on 10 and 6 datasets, respectively. Among them, the accuracy on the Vowel dataset reaches 1. On the Air, Appendicitis, Breast, WDBC, and Zoo datasets, the performance of the six algorithms is comparable.
Table 13 lists the Wilcoxon rank-sum test comparing the new strategies and the other algorithms with the original BGOA on the feature selection results. On several datasets, such as Air, Cleve, Segmentation, and Ecoli, p-values smaller than 5% are obtained. From the Friedman test in Table 14, the p-values on the WDBC, Bupa, Segmentation, Jain, Vowel, and Sonar datasets are smaller than 0.05. Therefore, it can be considered that there are significant differences between the algorithms on these datasets. The results in these tables support the validity and feasibility of the improved algorithm.
7. Discussion
The binary grasshopper optimization algorithm solves discrete problems such as feature selection. This paper presented three improved versions of the binary grasshopper optimization algorithm for feature selection. A new step size variable and three transfer functions were introduced to improve the algorithm’s exploration capability in binary space. In addition, several tests on 23 benchmark test functions were carried out to verify the algorithm’s feasibility; the improved algorithm shows preferable performance on high-dimensional functions. Subsequently, simulation experiments on feature selection were conducted: on 23 UCI datasets, KNN and 10-fold cross-validation are adopted to address the wrapper-based feature selection problem. The improved algorithms are more competitive than the original BGOA, BPSO, and BGWO regarding fitness values and the selected subsets.
It should be noted that the method in this paper has so far been applied only to feature selection; however, it can also address other binary combinatorial optimization problems, including task scheduling and the traveling salesman problem. Apart from that, the excellent benchmarks of the BBOB workshop may be very useful for the further improvement of BGOA; hence, more in-depth studies using these benchmarks to examine the algorithm will be conducted in the future. Finally, the improved algorithm does not perform well on low-dimensional functions, and the binary conversion increases the computing time. Future work will focus on shortening the running time of the algorithm and improving its ability to solve low-dimensional problems.
Author Contributions
Conceptualization, G.-L.W.; formal analysis, A.-Q.T.; methodology, T.L.; resources, S.-C.C.; writing—original draft, G.-L.W.; writing—review & editing, J.-S.P. All authors have read and agreed to the published version of the manuscript.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Publicly available datasets were analyzed in this study. This data can be found here: http://archive.ics.uci.edu/ml/index.php, (accessed on 5 September 2021).
Conflicts of Interest
The authors declare no conflict of interest.
Funding Statement
This research received no external funding.
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1. Alasadi S.A., Bhaya W.S. Review of data preprocessing techniques in data mining. J. Eng. Appl. Sci. 2017;12:4102–4107.
- 2. Hinchey M.G., Sterritt R., Rouff C. Swarms and Swarm Intelligence. Computer. 2007;40:111–113. doi: 10.1109/MC.2007.144.
- 3. Gopika N., ME A.M.K. Correlation based feature selection algorithm for machine learning; Proceedings of the 2018 3rd International Conference on Communication and Electronics Systems (ICCES); Coimbatore, India. 15–16 October 2018; pp. 692–695.
- 4. Kmimech H., Telmoudi A.J., Sliman L., Nabli L. Genetic-based approach for minimum initial marking estimation in labeled Petri nets. IEEE Access. 2020;8:22854–22861. doi: 10.1109/ACCESS.2020.2967342.
- 5. Bjerkevik H.B., Botnan M.B., Kerber M. Computing the interleaving distance is NP-hard. Found. Comput. Math. 2019;20:1–35. doi: 10.1007/s10208-019-09442-y.
- 6. Ghaemi M., Feizi-Derakhshi M.R. Feature selection using forest optimization algorithm. Pattern Recognit. 2016;60:121–129. doi: 10.1016/j.patcog.2016.05.012.
- 7. Sun X.X., Pan J.S., Chu S.C., Hu P., Tian A.Q. A novel pigeon-inspired optimization with QUasi-Affine TRansformation evolutionary algorithm for DV-Hop in wireless sensor networks. Int. J. Distrib. Sens. Netw. 2020;16:1–15. doi: 10.1177/1550147720932749.
- 8. Song P.C., Chu S.C., Pan J.S., Yang H. Simplified Phasmatodea population evolution algorithm for optimization. Complex Intell. Syst. 2021:1–19. doi: 10.1007/s40747-021-00402-0.
- 9. Sun L., Koopialipoor M., Armaghani D.J., Tarinejad R., Tahir M. Applying a meta-heuristic algorithm to predict and optimize compressive strength of concrete samples. Eng. Comput. 2019;37:1133–1145. doi: 10.1007/s00366-019-00875-1.
- 10. Stützle T., Dorigo M. ACO algorithms for the traveling salesman problem. Evol. Algorithms Eng. Comput. Sci. 1999;4:163–183.
- 11. Eberhart R., Kennedy J. Particle swarm optimization; Proceedings of the ICNN’95-International Conference on Neural Networks; Perth, WA, Australia. 27 November–1 December 1995; pp. 1942–1948.
- 12. Chu S.C., Tsai P.W., Pan J.S. Pacific Rim International Conference on Artificial Intelligence. Volume 4099. Springer; Berlin/Heidelberg, Germany: 2006. Cat swarm optimization; pp. 854–858.
- 13. Pan J.S., Tsai P.W., Liao Y.B. Fish migration optimization based on the fishy biology; Proceedings of the 2010 Fourth International Conference on Genetic and Evolutionary Computing; Shenzhen, China. 13–15 December 2010; pp. 783–786.
- 14. Mirjalili S.Z., Mirjalili S., Saremi S., Faris H., Aljarah I. Grasshopper optimization algorithm for multi-objective optimization problems. Appl. Intell. 2018;48:805–820. doi: 10.1007/s10489-017-1019-8.
- 15. Xue J., Shen B. A novel swarm intelligence optimization approach: Sparrow search algorithm. Syst. Sci. Control Eng. 2020;8:22–34. doi: 10.1080/21642583.2019.1708830.
- 16. Dhiman G., Kumar V. Seagull optimization algorithm: Theory and its applications for large-scale industrial engineering problems. Knowl.-Based Syst. 2019;165:169–196. doi: 10.1016/j.knosys.2018.11.024.
- 17. Meng Z., Pan J.S. QUasi-Affine TRansformation Evolution with External ARchive (QUATRE-EAR): An enhanced structure for differential evolution. Knowl.-Based Syst. 2018;155:35–53. doi: 10.1016/j.knosys.2018.04.034.
- 18. Połap D., Woźniak M. Polar bear optimization algorithm: Meta-heuristic with fast population movement and dynamic birth and death mechanism. Symmetry. 2017;9:203. doi: 10.3390/sym9100203.
- 19. Putri D.A., Kristiyanti D.A., Indrayuni E., Nurhadi A., Hadinata D.R. Comparison of Naive Bayes Algorithm and Support Vector Machine using PSO Feature Selection for Sentiment Analysis on E-Wallet Review. J. Phys. Conf. Ser. 2020;1641:012085. doi: 10.1088/1742-6596/1641/012085.
- 20. Tran B., Xue B., Zhang M. A new representation in PSO for discretization-based feature selection. IEEE Trans. Cybern. 2017;48:1733–1746. doi: 10.1109/TCYB.2017.2714145.
- 21. Tawhid M.A., Dsouza K.B. Hybrid binary bat enhanced particle swarm optimization algorithm for solving feature selection problems. Appl. Comput. Inform. 2020;6:117–136. doi: 10.1016/j.aci.2018.04.001.
- 22. Amoozegar M., Minaei-Bidgoli B. Optimizing multi-objective PSO based feature selection method using a feature elitism mechanism. Expert Syst. Appl. 2018;113:499–514. doi: 10.1016/j.eswa.2018.07.013.
- 23. Hussien A.G., Hassanien A.E., Houssein E.H., Amin M., Azar A.T. New binary whale optimization algorithm for discrete optimization problems. Eng. Optim. 2020;52:945–959. doi: 10.1080/0305215X.2019.1624740.
- 24. Pan J.S., Hu P., Chu S.C. Binary fish migration optimization for solving unit commitment. Energy. 2021;226:120329. doi: 10.1016/j.energy.2021.120329.
- 25. Du Z.G., Pan J.S., Chu S.C., Chiu Y.J. Improved Binary Symbiotic Organism Search Algorithm With Transfer Functions for Feature Selection. IEEE Access. 2020;8:225730–225744. doi: 10.1109/ACCESS.2020.3045043.
- 26. Emary E., Zawbaa H.M., Grosan C., Hassenian A.E. Afro-European Conference for Industrial Advancement. Volume 334. Springer; Berlin/Heidelberg, Germany: 2015. Feature subset selection approach by gray-wolf optimization; pp. 1–13.
- 27. Blanco A.L., Chaparro N., Rojas-Galeano S. An urban pigeon-inspired optimiser for unconstrained continuous domains; Proceedings of the 2019 8th Brazilian Conference on Intelligent Systems (BRACIS); Salvador, Brazil. 15–18 October 2019; pp. 521–526.
- 28. Bolaji A.L., Babatunde B.S., Shola P.B. Soft Computing: Theories and Applications. Volume 583. Springer; Berlin/Heidelberg, Germany: 2018. Adaptation of binary pigeon-inspired algorithm for solving multidimensional knapsack problem; pp. 743–751.
- 29. Pan J.S., Tian A.Q., Chu S.C., Li J.B. Improved binary pigeon-inspired optimization and its application for feature selection. Appl. Intell. 2021;51:8661–8679. doi: 10.1007/s10489-021-02302-9.
- 30. Moazzami M., Hosseini S.J.a.D., Shahinzadeh H., Gharehpetian G.B., Moradi J. SCUC Considering Loads and Wind Power Forecasting Uncertainties Using Binary Gray Wolf Optimization Method. Majlesi J. Electr. Eng. 2018;12:15–24.
- 31. Safaldin M., Otair M., Abualigah L. Improved binary gray wolf optimizer and SVM for intrusion detection system in wireless sensor networks. J. Ambient Intell. Humaniz. Comput. 2021;12:1559–1576. doi: 10.1007/s12652-020-02228-z.
- 32. Hu P., Pan J.S., Chu S.C. Improved binary grey wolf optimizer and its application for feature selection. Knowl.-Based Syst. 2020;195:105746. doi: 10.1016/j.knosys.2020.105746.
- 33. Meraihi Y., Gabis A.B., Mirjalili S., Ramdane-Cherif A. Grasshopper optimization algorithm: Theory, variants, and applications. IEEE Access. 2021;9:50001–50024. doi: 10.1109/ACCESS.2021.3067597.
- 34. Hichem H., Elkamel M., Rafik M., Mesaaoud M.T., Ouahiba C. A new binary grasshopper optimization algorithm for feature selection problem. J. King Saud-Univ.-Comput. Inf. Sci. 2019;34:316–328. doi: 10.1016/j.jksuci.2019.11.007.
- 35. Pinto H., Peña A., Valenzuela M., Fernández A. Computer Science On-Line Conference. Springer; Berlin/Heidelberg, Germany: 2018. A binary grasshopper algorithm applied to the knapsack problem; pp. 132–143.
- 36. Mafarja M., Aljarah I., Faris H., Hammouri A.I., Ala’M A.Z., Mirjalili S. Binary grasshopper optimisation algorithm approaches for feature selection problems. Expert Syst. Appl. 2019;117:267–286. doi: 10.1016/j.eswa.2018.09.015.
- 37. Roodschild M., Sardiñas J.G., Will A. A new approach for the vanishing gradient problem on sigmoid activation. Prog. Artif. Intell. 2020;9:351–360. doi: 10.1007/s13748-020-00218-y.
- 38. Koyuncu I. Implementation of high speed tangent sigmoid transfer function approximations for artificial neural network applications on FPGA. Adv. Electr. Comput. Eng. 2018;18:79–86. doi: 10.4316/AECE.2018.03011.
- 39. Gao Y., Zhou Y., Luo Q. An efficient binary equilibrium optimizer algorithm for feature selection. IEEE Access. 2020;8:140936–140963. doi: 10.1109/ACCESS.2020.3013617.
- 40. Jamil M., Yang X.S. A literature survey of benchmark functions for global optimisation problems. Int. J. Math. Model. Numer. Optim. 2013;4:150–194. doi: 10.1504/IJMMNO.2013.055204.
- 41. Tian A.Q., Chu S.C., Pan J.S., Cui H., Zheng W.M. A compact pigeon-inspired optimization for maximum short-term generation mode in cascade hydroelectric power station. Sustainability. 2020;12:767. doi: 10.3390/su12030767.
- 42. Tian A.Q., Chu S.C., Pan J.S., Liang Y. A novel pigeon-inspired optimization based MPPT technique for PV systems. Processes. 2020;8:356. doi: 10.3390/pr8030356.
- 43. Saremi S., Mirjalili S., Lewis A. Grasshopper optimisation algorithm: Theory and application. Adv. Eng. Softw. 2017;105:30–47. doi: 10.1016/j.advengsoft.2017.01.004.
- 44. Islam M.J., Li X., Mei Y. A time-varying transfer function for balancing the exploration and exploitation ability of a binary PSO. Appl. Soft Comput. 2017;59:182–196. doi: 10.1016/j.asoc.2017.04.050.
- 45. Hansen N., Auger A., Ros R., Mersmann O., Tušar T., Brockhoff D. COCO: A Platform for Comparing Continuous Optimizers in a Black-Box Setting. Optim. Methods Softw. 2021;36:114–144. doi: 10.1080/10556788.2020.1808977.
- 46. Finck S., Hansen N., Ros R., Auger A. Real-Parameter Black-Box Optimization Benchmarking 2009: Presentation of the Noiseless Functions. Citeseer; Princeton, NJ, USA: 2010. Technical report.
- 47. Hansen N., Auger A., Ros R., Finck S., Pošík P. Comparing results of 31 algorithms from the black-box optimization benchmarking BBOB-2009; Proceedings of the 12th Annual Conference Companion on Genetic and Evolutionary Computation; Portland, OR, USA. 7–11 July 2010; pp. 1689–1696.
- 48. Hidaka S., Smith L.B. Packing: A geometric analysis of feature selection and category formation. Cogn. Syst. Res. 2011;12:1–18. doi: 10.1016/j.cogsys.2010.07.004.
- 49. Tanveer M., Gautam C., Suganthan P.N. Comprehensive evaluation of twin SVM based classifiers on UCI datasets. Appl. Soft Comput. 2019;83:105617. doi: 10.1016/j.asoc.2019.105617.
- 50. Chuang L.Y., Chang H.W., Tu C.J., Yang C.H. Improved binary PSO for feature selection using gene expression data. Comput. Biol. Chem. 2008;32:29–38. doi: 10.1016/j.compbiolchem.2007.09.005.
- 51. Emary E., Zawbaa H.M., Hassanien A.E. Binary grey wolf optimization approaches for feature selection. Neurocomputing. 2016;172:371–381. doi: 10.1016/j.neucom.2015.06.083.
- 52. Browne M.W. Cross-validation methods. J. Math. Psychol. 2000;44:108–132. doi: 10.1006/jmps.1999.1279.
- 53. Jung Y. Multiple predicting K-fold cross-validation for model selection. J. Nonparametr. Stat. 2018;30:197–215. doi: 10.1080/10485252.2017.1404598.