Competitive Particle Swarm Optimization for Multi-Category Text Feature Selection

Jaesung Lee; Jaegyun Park; Hae-Cheon Kim; Dae-Won Kim

doi:10.3390/e21060602

2019 Jun 18;21(6):602. doi: 10.3390/e21060602


PMCID: PMC7515086 PMID: 33267316

Abstract

Multi-label feature selection is an important task for text categorization. This is because it enables learning algorithms to focus on essential features that foreshadow relevant categories, thereby improving the accuracy of text categorization. Recent studies have considered the hybridization of evolutionary feature wrappers and filters to enhance the evolutionary search process. However, the relative effectiveness of feature subset searches of evolutionary and feature filter operators has not been considered. This results in degenerated final feature subsets. In this paper, we propose a novel hybridization approach based on competition between the operators. This enables the proposed algorithm to apply each operator selectively and modify the feature subset according to its relative effectiveness, unlike conventional methods. The experimental results on 16 text datasets verify that the proposed method is superior to conventional methods.

Keywords: multi-label text categorization, feature selection, hybrid search, evolutionary algorithm, particle swarm optimization

1. Introduction

Text categorization involves the identification of the categories associated with specified documents [1,2,3,4]. According to the presence or frequency of words within a document, the so-called bag-of-words model represents each document as a word vector [5]. Each word vector is then assigned to multiple categories because, in general, a document is relevant to multiple sub-concepts [6,7,8]. Text datasets are composed of a large number of words. However, not all the words are useful for solving the associated problem. Irrelevant and redundant words can confound a learning algorithm, deteriorating the performance of text categorization [9]. To resolve these issues, conventional methods have attempted to identify a subset of important words by discarding unnecessary ones prior to text categorization [10,11,12,13]. Thus, multi-label feature selection can be an effective preprocessing step for improving the accuracy of text categorization.

Given a set of word features $F = \{f_{1}, \dots, f_{d}\}$, multi-label feature selection involves the identification of a subset $S \subset F$, or a solution, composed of $n \ll d$ features that are significantly relevant to the label set $L = \{l_{1}, \dots, l_{|L|}\}$. To solve this task, conventional approaches use feature wrappers and filters. At the risk of selecting ineffective features for the learning algorithm to be used subsequently, filters can rapidly identify a feature subset that is mostly composed of important features based on the intrinsic properties of the data [14]. In contrast, wrappers directly determine the superiority of candidate feature subsets by using a specific learning algorithm. Moreover, they generally outperform the filters in terms of the learning performance [10]. Notwithstanding their essential differences, devising an effective search method is important in both approaches. This is because the algorithm must locate the final feature subset from a vast search space specified by thousands of word features.

As an effective search method for feature wrappers, population-based evolutionary algorithms are frequently used in conventional studies because of their stochastic global search capability [15]. These evolutionary algorithms evaluate the fitness of a feature subset based on the categorization performance of the learning algorithm. Furthermore, an evolutionary operator such as a mutation operator modifies the feature subset. Moreover, recent studies have reported that the search capability of an evolutionary algorithm can be further improved through hybridization with a filter [16,17]. Specifically, the feature filter operator can rapidly improve the feature subset by considering only the intrinsic properties of the data, particularly when the solution is overwhelmed by unnecessary features [18].

To achieve an effective hybrid search, the fitness of the feature subset modified by an evolutionary or filter operator must be improved. However, the fitness of a feature subset is not always improved after modification. This is because the evolutionary operator exhibits random properties, and the filter operator is independent of the fitness evaluation function [17,19,20,21]. If the fitness is not improved after modification by an operator, the modified feature subset is discarded, and the computations performed to evaluate its fitness are wasted. A preferred hybrid search is one in which the modification of a feature subset by each operator always improves the fitness, thus avoiding wasted computation. If an algorithm could ascertain the fitness after modification by each operator without evaluating the feature subset, it could decide in advance which operator to apply to the feature subset. However, this is infeasible in practice [20]. The second-best option may be a method that estimates the relative effectiveness of each operator based on the fitness of the feature subsets already computed in the previous iteration and then decides which operator to apply. According to our experiments, selective engagement of operators can significantly increase the effectiveness of a hybrid search, yet little attention has been paid to this aspect in recent studies.

To overcome the problems described above, we devise a competitive particle swarm optimization (PSO) algorithm. Unlike conventional PSOs, the proposed method applies each operator selectively based on a novel process for estimating the effectiveness of each operator for each particle. As a result, the particles can be separated into two groups depending on which operator is to be applied in the next iteration. Then, based on the fitness of the particles in each group, a tournament is run. Its results decide which operators will be applied in the next iteration by changing their memberships. Consequently, the proposed method competitively engages each operator in a feature subset search through a fitness-based tournament of the feature subset in each iteration. Our contributions are as follows:

We propose a novel competitive particle swarm optimization for the multi-label feature selection problem by employing an information-theoretic multi-label feature filter as a filter operator.
To selectively apply the evolutionary and filter operators, we propose a new process for estimating their relative effectiveness based on a fitness-based tournament of the feature subsets in each iteration.
To demonstrate the superiority of the information-theoretic measure for improving the search capability, we employ both an information-theory-based feature filter and a frequency-based feature filter and conduct an in-depth analysis.

Our experiments revealed that the proposed method outperformed conventional methods. This indicates the effectiveness of the proposed estimation process and the information-theoretic feature filter operator.

2. Related Work

In the field of text categorization, feature selection is a crucial task because the feature space is generally high-dimensional. Conventional feature selection methods can be largely categorized into feature filters and feature wrappers. Feature filter methods assess the importance of features using a score function such as the $\chi^{2}$ statistic, information gain, or mutual information [14]. The top-n features with the highest scores are then selected. Uysal and Gunal [22] proposed a distinguishing feature selector that investigates the relationship between the absence or presence of a word within a document and the correct label for that document. Rehman et al. [23] proposed a normalized difference measure to remedy a problem of the balanced accuracy measure, which omits the relative document frequency in the classes. Tang et al. [24] proposed a maximum discrimination method based on a new measure for multiple distributions, namely the Jeffreys-multi-hypothesis divergence. However, these methods exhibit limited categorization accuracy because they do not interact with the subsequent learning algorithm.
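The score-then-select pattern of a feature filter can be illustrated with a minimal sketch. It assumes binary word-presence features and a single binary label, and it uses the standard $\chi^{2}$ statistic of a 2x2 contingency table; the function names `chi2_score` and `top_n_features` and the toy data are hypothetical and are not taken from any of the cited methods.

```python
def chi2_score(column, labels):
    """Chi-square statistic of the 2x2 table: word presence vs. one binary label."""
    a = sum(1 for x, y in zip(column, labels) if x and y)          # word present, label on
    b = sum(1 for x, y in zip(column, labels) if x and not y)      # word present, label off
    c = sum(1 for x, y in zip(column, labels) if not x and y)      # word absent, label on
    d = sum(1 for x, y in zip(column, labels) if not x and not y)  # word absent, label off
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    # A zero denominator means the word or the label is constant: no evidence either way.
    return 0.0 if denom == 0 else n * (a * d - b * c) ** 2 / denom


def top_n_features(docs, labels, n):
    """Score every feature, then keep the n best (ties broken by lower index)."""
    num_features = len(docs[0])
    scores = [chi2_score([doc[j] for doc in docs], labels) for j in range(num_features)]
    ranked = sorted(range(num_features), key=lambda j: (-scores[j], j))
    return ranked[:n], scores


# Toy corpus (assumed): 4 documents, 3 words; words 0 and 1 track the label, word 2 does not.
docs = [[1, 0, 1], [1, 0, 0], [0, 1, 1], [0, 1, 0]]
labels = [1, 1, 0, 0]
selected, scores = top_n_features(docs, labels, n=2)
print(selected, scores)
```

Note that the learning algorithm never appears in this procedure, which is why a filter is fast but cannot account for how the selected features interact with the subsequent classifier.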

In contrast, feature wrapper methods evaluate the discriminative power of feature subsets based on a specific learning algorithm and select the best feature subset. Among feature wrapper methods, population-based evolutionary algorithms are widely used for text feature selection owing to their stochastic global search capability. Aghdam et al. [25] applied ant colony optimization to text feature selection. Meanwhile, Lin et al. [26] proposed an improved cat swarm optimization algorithm to reduce the computation time of their originally proposed method. Lu et al. [27] demonstrated the enhanced performance of PSO based on a functional constriction factor and an inertia weight. However, unlike feature filters, these methods generally require significant computational resources for identifying a high-quality feature subset because of their randomized mechanism [28].

To resolve this issue, recent studies have considered hybrid approaches that combine an evolutionary feature wrapper with a filter. These hybrid methods can be categorized into two types according to how the filter operator is applied. One type applies the filter operator to initialize the population of the evolutionary algorithm during the initialization step. For example, Lu and Chen [21] initialized the candidate feature subsets of a small world algorithm using the $\chi^{2}$ statistic and information gain. Meanwhile, Mafarja and Mirjalili [18] initialized ants in a binary ant lion optimizer using a quick reduct and an approximate entropy reduct based on rough set theory. Although this approach lets the algorithm start its search from a promising region, the population can lack diversity, resulting in premature convergence. In addition, these algorithms can fail to refine the final feature subset because the filter operator is not engaged in the final stage of the search.
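As a rough illustration of the first type of hybridization, the sketch below seeds an initial population by sampling features without replacement, with probability proportional to a precomputed filter score. This is only one plausible scheme under stated assumptions; it is not the procedure of [21] or [18], and `init_population`, the score values, and the small smoothing constant are all hypothetical.

```python
import random


def init_population(scores, subset_size, num_particles, seed=0):
    """Draw feature subsets whose members are sampled in proportion to filter scores."""
    rng = random.Random(seed)
    population = []
    for _ in range(num_particles):
        candidates = list(range(len(scores)))
        # The small constant (an assumption) keeps zero-score features selectable,
        # which preserves a little diversity in the initial population.
        weights = [s + 1e-9 for s in scores]
        subset = []
        for _ in range(subset_size):
            pick = rng.choices(range(len(candidates)), weights=weights, k=1)[0]
            subset.append(candidates.pop(pick))  # remove so a feature is picked only once
            weights.pop(pick)
        population.append(sorted(subset))
    return population


filter_scores = [4.0, 4.0, 0.0, 1.0, 0.5]  # hypothetical filter scores for 5 features
population = init_population(filter_scores, subset_size=2, num_particles=3)
print(population)
```

Because high-scoring features dominate the draw, most particles start near the same region of the search space, which makes the diversity concern raised above easy to see.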

The second type of hybrid approach applies the filter operator to modify the feature subset in each iteration during the search process. Ghareb et al. [16] proposed an enhanced genetic algorithm by modifying the crossover and mutation operations by using the ranks of features obtained from six filter methods. Lee et al. [29] proposed an exploration operation that uses a filter to select important features from among those not selected by a genetic operator. Then, a new feature subset is generated. Moradi and Gholampour [30] constructed an enhanced binary PSO using correlation information. Meanwhile, Mafarja and Mirjalili [31] improved the whale optimization algorithm using simulated annealing for the local search. Dong et al. [19] enhanced the genetic algorithm using granular information to address feature selection in high-dimensional data with a low sample size. Zhou et al. [32] proposed a hybrid search that adjusts the influence of the feature filter according to the degree of convergence. However, these methods exhibit limited performance because the evolutionary and filter operators are not engaged selectively. Table 1 presents a brief summary of conventional feature-selection approaches.

Table 1.

Brief summary of conventional feature selection approaches.

Approach	Advantages	Disadvantages
Filter methods	Rapid identification of a feature subset	Lower performance than wrappers
Wrapper methods	Higher performance than filters	High computational complexity
Hybrid methods (first type)	Search starts in a promising region	Premature convergence
Hybrid methods (second type)	Improved search capability	Randomized engagement of operators
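The second type of hybrid approach, in which a filter modifies the feature subset during the search, can be sketched as a simple swap step. The rule below (exchange the lowest-scoring selected feature for the highest-scoring unselected one) is an assumption chosen for brevity and does not reproduce any specific cited operator; `filter_step` and the scores are hypothetical.

```python
def filter_step(subset, scores):
    """Swap the weakest selected feature for the strongest unselected one, if that helps."""
    selected = set(subset)
    unselected = [j for j in range(len(scores)) if j not in selected]
    if not selected or not unselected:
        return sorted(selected)
    worst_in = min(selected, key=lambda j: (scores[j], j))
    best_out = max(unselected, key=lambda j: (scores[j], -j))
    # The decision uses filter scores only; the wrapper's fitness is never consulted.
    if scores[best_out] > scores[worst_in]:
        selected.remove(worst_in)
        selected.add(best_out)
    return sorted(selected)


filter_scores = [4.0, 4.0, 0.0, 1.0, 0.5]  # hypothetical filter scores for 5 features
improved = filter_step([2, 3], filter_scores)
unchanged = filter_step([0, 1], filter_scores)
print(improved, unchanged)
```

Since the step never looks at the fitness function, the modified subset may or may not score better under the learning algorithm, which is the reason an unconditional application of the filter operator can waste fitness evaluations.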

Table 2.

Terms	Meanings
C	The evolution-based particle group
F	The filter-based particle group
$m_{c}$	The number of the evolution-based particles
$m_{f}$	The number of the filter-based particles
$E_{c}$	The fitness values for feature subsets generated from C
$E_{f}$	The fitness values for feature subsets generated from F
u	The number of spent fitness function calls (FFCs)
v	Maximum number of permitted FFCs
S	The best feature subset
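Using the notation in the table above, the competition between the two particle groups can be illustrated with a minimal sketch. The exact tournament rule is not given in this part of the text, so the rule used here (compare the best fitness found in each group and move one particle slot from the losing group to the winning group) is an assumption for illustration only, as are the function name `update_membership`, the lower-is-better default, and the floor of one particle per group.

```python
def update_membership(m_c, m_f, E_c, E_f, lower_is_better=True):
    """Shift one particle slot from the losing group to the winning group.

    m_c, m_f: current sizes of the evolution-based and filter-based groups.
    E_c, E_f: fitness values of the feature subsets produced by each group.
    """
    best = min if lower_is_better else max
    best_c, best_f = best(E_c), best(E_f)
    if best_c == best_f:
        return m_c, m_f  # draw: keep the current split
    c_wins = (best_c < best_f) if lower_is_better else (best_c > best_f)
    # Assumed safeguard: never empty a group, so both operators stay in play.
    if c_wins and m_f > 1:
        return m_c + 1, m_f - 1
    if not c_wins and m_c > 1:
        return m_c - 1, m_f + 1
    return m_c, m_f


# Hypothetical fitness values (for example, a loss to be minimized).
after_c_win = update_membership(5, 5, E_c=[0.30, 0.28], E_f=[0.35, 0.33])
after_floor = update_membership(1, 9, E_c=[0.40], E_f=[0.20, 0.30])
print(after_c_win, after_floor)
```

The total number of particles stays constant, and only fitness values that were already computed are reused, so the decision itself costs no additional fitness function calls.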

Table 3.

Dataset	$|W|$	$|F|$	Type	$|L|$	Card.	Den.	Distinct.	Domain
RCV1 (S1)	6000	945	Numeric	101	2.880	0.029	1028	Text
RCV1 (S2)	6000	945	Numeric	101	2.634	0.026	954	Text
RCV1 (S3)	6000	945	Numeric	101	2.614	0.026	939	Text
RCV1 (S4)	6000	945	Numeric	101	2.484	0.025	816	Text
RCV1 (S5)	6000	945	Numeric	101	2.642	0.026	946	Text
Arts	7484	1157	Numeric	26	1.654	0.064	599	Text
Business	11,214	1096	Numeric	30	1.599	0.053	233	Text
Computers	12,444	1705	Numeric	33	1.507	0.046	428	Text
Education	12,030	1377	Numeric	33	1.463	0.044	511	Text
Entertainment	12,730	1600	Numeric	21	1.414	0.067	337	Text
Health	9205	1530	Numeric	32	1.644	0.051	335	Text
Recreation	12,828	1516	Numeric	22	1.429	0.065	530	Text
Reference	8027	1984	Numeric	33	1.174	0.036	275	Text
Science	6428	1859	Numeric	40	1.450	0.036	457	Text
Social	12,111	2618	Numeric	29	1.279	0.033	361	Text
Society	14,512	1590	Numeric	27	1.670	0.062	1054	Text

Table 4.

Dataset	Proposed	EGA+CDM	bALO-QR	CSO
RCV1 (S1)	0.029 ± 0.001	0.030 ± 0.001	0.030 ± 0.000	0.029 ± 0.001✓
RCV1 (S2)	0.027 ± 0.001✓	0.028 ± 0.003	0.027 ± 0.001	0.027 ± 0.001
RCV1 (S3)	0.026 ± 0.000✓	0.027 ± 0.001	0.027 ± 0.001	0.026 ± 0.001
RCV1 (S4)	0.024 ± 0.001✓	0.025 ± 0.001	0.025 ± 0.001	0.024 ± 0.001
RCV1 (S5)	0.026 ± 0.001✓	0.028 ± 0.003	0.028 ± 0.001	0.026 ± 0.001
Arts	0.061 ± 0.001✓	0.067 ± 0.002	0.069 ± 0.002	0.066 ± 0.002
Business	0.030 ± 0.001✓	0.036 ± 0.004	0.034 ± 0.001	0.034 ± 0.002
Computers	0.042 ± 0.002✓	0.051 ± 0.004	0.046 ± 0.001	0.047 ± 0.001
Education	0.043 ± 0.001✓	0.048 ± 0.002	0.048 ± 0.002	0.047 ± 0.001
Entertainment	0.059 ± 0.002✓	0.069 ± 0.003	0.065 ± 0.001	0.065 ± 0.001
Health	0.039 ± 0.001✓	0.050 ± 0.003	0.047 ± 0.001	0.047 ± 0.002
Recreation	0.058 ± 0.001✓	0.070 ± 0.003	0.067 ± 0.002	0.065 ± 0.001
Reference	0.031 ± 0.001✓	0.040 ± 0.003	0.037 ± 0.002	0.037 ± 0.001
Science	0.036 ± 0.001✓	0.043 ± 0.003	0.042 ± 0.001	0.042 ± 0.001
Social	0.026 ± 0.001✓	0.042 ± 0.004	0.032 ± 0.002	0.032 ± 0.001
Society	0.057 ± 0.001✓	0.065 ± 0.004	0.064 ± 0.001	0.063 ± 0.001
Avg. Rank	1.06✓	3.88	2.94	2.13
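The "Avg. Rank" row can be reproduced in principle by ranking the methods on each dataset and averaging the ranks per method. The sketch below uses three rows of the table above (Arts, Computers, Recreation) and assumes that lower values are better for this measure and that ties receive the mean of the tied positions; the tie rule and the helper name `average_ranks` are assumptions, and the published ranks were presumably computed over all 16 datasets on unrounded values.

```python
def average_ranks(rows, lower_is_better=True):
    """Average per-dataset ranks of each method (rank 1 = best)."""
    k = len(rows[0])
    totals = [0.0] * k
    for row in rows:
        order = sorted(range(k), key=lambda j: row[j], reverse=not lower_is_better)
        ranks = [0.0] * k
        i = 0
        while i < k:
            # Find the run of tied values and give each the mean of their positions.
            j = i
            while j + 1 < k and row[order[j + 1]] == row[order[i]]:
                j += 1
            mean_rank = (i + j) / 2 + 1
            for t in range(i, j + 1):
                ranks[order[t]] = mean_rank
            i = j + 1
        totals = [a + b for a, b in zip(totals, ranks)]
    return [t / len(rows) for t in totals]


# Columns: Proposed, EGA+CDM, bALO-QR, CSO (mean values copied from the table above).
rows = [
    [0.061, 0.067, 0.069, 0.066],  # Arts
    [0.042, 0.051, 0.046, 0.047],  # Computers
    [0.058, 0.070, 0.067, 0.065],  # Recreation
]
avg = average_ranks(rows)
print([round(r, 2) for r in avg])
```

On these three rows the proposed method is ranked first every time, so its average rank is 1.0.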

Table 5.

Dataset	Proposed	EGA+CDM	bALO-QR	CSO
RCV1 (S1)	0.573 ± 0.151✓	0.637 ± 0.129	0.621 ± 0.134	0.648 ± 0.125
RCV1 (S2)	0.513 ± 0.013✓	0.654 ± 0.023	0.580 ± 0.015	0.599 ± 0.019
RCV1 (S3)	0.609 ± 0.206✓	0.718 ± 0.150	0.671 ± 0.174	0.683 ± 0.168
RCV1 (S4)	0.591 ± 0.216✓	0.696 ± 0.160	0.671 ± 0.175	0.672 ± 0.174
RCV1 (S5)	0.603 ± 0.210✓	0.695 ± 0.161	0.656 ± 0.182	0.652 ± 0.185
Arts	0.649 ± 0.181✓	0.712 ± 0.149	0.710 ± 0.149	0.712 ± 0.149
Business	0.383 ± 0.410✓	0.399 ± 0.409	0.398 ± 0.400	0.396 ± 0.406
Computers	0.415 ± 0.009✓	0.469 ± 0.006	0.445 ± 0.009	0.448 ± 0.007
Education	0.598 ± 0.012✓	0.661 ± 0.008	0.616 ± 0.020	0.639 ± 0.016
Entertainment	0.536 ± 0.017✓	0.605 ± 0.019	0.563 ± 0.015	0.586 ± 0.015
Health	0.726 ± 0.342✓	0.774 ± 0.282	0.764 ± 0.300	0.778 ± 0.238
Recreation	0.553 ± 0.011✓	0.739 ± 0.013	0.675 ± 0.013	0.675 ± 0.011
Reference	0.690 ± 0.262✓	0.715 ± 0.243	0.718 ± 0.241	0.715 ± 0.243
Science	0.630 ± 0.024✓	0.707 ± 0.018	0.696 ± 0.027	0.696 ± 0.023
Social	0.439 ± 0.197✓	0.571 ± 0.152	0.472 ± 0.186	0.490 ± 0.179
Society	0.447 ± 0.014✓	0.510 ± 0.017	0.489 ± 0.019	0.479 ± 0.016
Avg. Rank	1.00✓	3.75	2.31	2.94

Table 6.

Dataset	Proposed	EGA+CDM	bALO-QR	CSO
RCV1 (S1)	0.198 ± 0.010✓	0.176 ± 0.011	0.168 ± 0.013	0.124 ± 0.013
RCV1 (S2)	0.243 ± 0.013✓	0.177 ± 0.011	0.179 ± 0.014	0.157 ± 0.018
RCV1 (S3)	0.227 ± 0.018✓	0.161 ± 0.004	0.178 ± 0.019	0.168 ± 0.014
RCV1 (S4)	0.267 ± 0.016✓	0.170 ± 0.007	0.192 ± 0.014	0.183 ± 0.019
RCV1 (S5)	0.234 ± 0.013✓	0.187 ± 0.009	0.191 ± 0.016	0.165 ± 0.012
Arts	0.195 ± 0.012✓	0.094 ± 0.008	0.099 ± 0.008	0.106 ± 0.012
Business	0.680 ± 0.010✓	0.662 ± 0.009	0.654 ± 0.008	0.656 ± 0.011
Computers	0.424 ± 0.007✓	0.369 ± 0.010	0.388 ± 0.006	0.391 ± 0.008
Education	0.122 ± 0.010✓	0.075 ± 0.008	0.109 ± 0.012	0.085 ± 0.018
Entertainment	0.267 ± 0.011✓	0.173 ± 0.007	0.220 ± 0.011	0.188 ± 0.011
Health	0.502 ± 0.010✓	0.410 ± 0.017	0.397 ± 0.019	0.423 ± 0.020
Recreation	0.235 ± 0.014✓	0.045 ± 0.004	0.111 ± 0.011	0.119 ± 0.008
Reference	0.387 ± 0.015✓	0.360 ± 0.010	0.352 ± 0.009	0.350 ± 0.013
Science	0.130 ± 0.011✓	0.075 ± 0.007	0.064 ± 0.010	0.070 ± 0.017
Social	0.533 ± 0.015✓	0.340 ± 0.021	0.471 ± 0.014	0.449 ± 0.025
Society	0.357 ± 0.043✓	0.290 ± 0.019	0.254 ± 0.012	0.211 ± 0.041
Avg. Rank	1.00✓	3.19	2.75	3.06

Table 7.

Dataset	Proposed	EGA+CDM	bALO-QR	CSO
RCV1 (S1)	0.017 ± 0.007✓	0.009 ± 0.002	0.012 ± 0.007	0.012 ± 0.005
RCV1 (S2)	0.099 ± 0.012✓	0.011 ± 0.005	0.087 ± 0.010	0.087 ± 0.004
RCV1 (S3)	0.115 ± 0.018✓	0.025 ± 0.005	0.093 ± 0.009	0.102 ± 0.005
RCV1 (S4)	0.150 ± 0.008✓	0.033 ± 0.014	0.120 ± 0.016	0.126 ± 0.016
RCV1 (S5)	0.094 ± 0.014✓	0.013 ± 0.003	0.082 ± 0.012	0.091 ± 0.011
Arts	0.151 ± 0.010✓	0.058 ± 0.007	0.071 ± 0.007	0.075 ± 0.006
Business	0.527 ± 0.014✓	0.514 ± 0.016	0.507 ± 0.011	0.512 ± 0.011
Computers	0.351 ± 0.011✓	0.299 ± 0.011	0.316 ± 0.010	0.319 ± 0.009
Education	0.094 ± 0.011✓	0.047 ± 0.009	0.074 ± 0.007	0.064 ± 0.013
Entertainment	0.228 ± 0.010✓	0.130 ± 0.009	0.188 ± 0.010	0.176 ± 0.022
Health	0.389 ± 0.010✓	0.307 ± 0.016	0.314 ± 0.009	0.308 ± 0.054
Recreation	0.192 ± 0.010✓	0.020 ± 0.003	0.093 ± 0.013	0.106 ± 0.016
Reference	0.345 ± 0.011✓	0.321 ± 0.006	0.316 ± 0.011	0.294 ± 0.074
Science	0.109 ± 0.014✓	0.053 ± 0.008	0.048 ± 0.005	0.055 ± 0.011
Social	0.488 ± 0.016✓	0.287 ± 0.022	0.432 ± 0.016	0.412 ± 0.012
Society	0.284 ± 0.015✓	0.215 ± 0.012	0.179 ± 0.028	0.157 ± 0.021
Avg. Rank	1.00✓	3.56	2.81	2.63

Table 8.

Dataset	Proposed	EGA+CDM	bALO-QR	CSO
RCV1 (S1)	0.037 ± 0.002✓	0.038 ± 0.001	0.039 ± 0.001	0.040 ± 0.001
RCV1 (S2)	0.034 ± 0.002✓	0.037 ± 0.003	0.037 ± 0.001	0.036 ± 0.000
RCV1 (S3)	0.033 ± 0.002✓	0.037 ± 0.001	0.037 ± 0.003	0.036 ± 0.001
RCV1 (S4)	0.033 ± 0.002✓	0.036 ± 0.002	0.035 ± 0.001	0.034 ± 0.001
RCV1 (S5)	0.034 ± 0.001✓	0.036 ± 0.001	0.035 ± 0.001	0.035 ± 0.001
Arts	0.080 ± 0.002✓	0.092 ± 0.005	0.089 ± 0.001	0.088 ± 0.002
Business	0.028 ± 0.001✓	0.029 ± 0.001	0.029 ± 0.001	0.029 ± 0.001
Computers	0.042 ± 0.001✓	0.045 ± 0.001	0.044 ± 0.001	0.044 ± 0.001
Education	0.052 ± 0.001✓	0.060 ± 0.002	0.057 ± 0.001	0.056 ± 0.001
Entertainment	0.078 ± 0.004✓	0.088 ± 0.003	0.088 ± 0.004	0.083 ± 0.002
Health	0.038 ± 0.001✓	0.049 ± 0.002	0.047 ± 0.002	0.046 ± 0.001
Recreation	0.090 ± 0.003✓	0.115 ± 0.006	0.102 ± 0.003	0.100 ± 0.005
Reference	0.034 ± 0.001✓	0.038 ± 0.001	0.037 ± 0.001	0.037 ± 0.001
Science	0.047 ± 0.003✓	0.053 ± 0.002	0.051 ± 0.001	0.050 ± 0.001
Social	0.026 ± 0.001✓	0.036 ± 0.001	0.028 ± 0.001	0.029 ± 0.001
Society	0.060 ± 0.001✓	0.064 ± 0.002	0.062 ± 0.001	0.062 ± 0.001
Avg. Rank	1.00✓	3.75	2.88	2.38

Table 9.

Dataset	Proposed	EGA+CDM	bALO-QR	CSO
RCV1 (S1)	0.531 ± 0.016✓	0.704 ± 0.026	0.602 ± 0.018	0.614 ± 0.014
RCV1 (S2)	0.526 ± 0.009✓	0.715 ± 0.023	0.612 ± 0.017	0.611 ± 0.017
RCV1 (S3)	0.521 ± 0.018✓	0.727 ± 0.010	0.598 ± 0.020	0.606 ± 0.014
RCV1 (S4)	0.484 ± 0.025✓	0.698 ± 0.011	0.589 ± 0.020	0.567 ± 0.018
RCV1 (S5)	0.512 ± 0.030✓	0.692 ± 0.014	0.580 ± 0.025	0.588 ± 0.029
Arts	0.542 ± 0.011✓	0.633 ± 0.021	0.637 ± 0.018	0.626 ± 0.019
Business	0.131 ± 0.008✓	0.132 ± 0.007	0.133 ± 0.006	0.131 ± 0.007
Computers	0.416 ± 0.010✓	0.455 ± 0.009	0.441 ± 0.006	0.439 ± 0.009
Education	0.594 ± 0.012✓	0.636 ± 0.014	0.598 ± 0.013	0.620 ± 0.020
Entertainment	0.527 ± 0.019✓	0.591 ± 0.016	0.556 ± 0.019	0.569 ± 0.022
Health	0.326 ± 0.014✓	0.433 ± 0.017	0.422 ± 0.017	0.398 ± 0.023
Recreation	0.541 ± 0.018✓	0.741 ± 0.025	0.661 ± 0.019	0.666 ± 0.021
Reference	0.450 ± 0.018✓	0.511 ± 0.017	0.507 ± 0.014	0.502 ± 0.012
Science	0.582 ± 0.018✓	0.689 ± 0.025	0.663 ± 0.016	0.674 ± 0.021
Social	0.355 ± 0.014✓	0.512 ± 0.021	0.386 ± 0.017	0.421 ± 0.020
Society	0.433 ± 0.011✓	0.479 ± 0.018	0.470 ± 0.014	0.463 ± 0.015
Avg. Rank	1.00✓	3.88	2.63	2.50

Table 10.

Dataset	Proposed	EGA+CDM	bALO-QR	CSO
RCV1 (S1)	0.275 ± 0.009✓	0.214 ± 0.007	0.220 ± 0.006	0.215 ± 0.011
RCV1 (S2)	0.305 ± 0.014✓	0.198 ± 0.010	0.242 ± 0.016	0.243 ± 0.013
RCV1 (S3)	0.320 ± 0.020✓	0.202 ± 0.006	0.251 ± 0.010	0.258 ± 0.010
RCV1 (S4)	0.343 ± 0.014✓	0.215 ± 0.009	0.266 ± 0.009	0.275 ± 0.010
RCV1 (S5)	0.309 ± 0.013✓	0.206 ± 0.006	0.256 ± 0.012	0.256 ± 0.012
Arts	0.362 ± 0.009✓	0.275 ± 0.012	0.283 ± 0.009	0.284 ± 0.014
Business	0.686 ± 0.007	0.686 ± 0.010✓	0.680 ± 0.010	0.681 ± 0.008
Computers	0.475 ± 0.008✓	0.427 ± 0.010	0.441 ± 0.010	0.441 ± 0.008
Education	0.337 ± 0.009✓	0.286 ± 0.011	0.315 ± 0.012	0.318 ± 0.013
Entertainment	0.418 ± 0.018✓	0.336 ± 0.015	0.362 ± 0.014	0.362 ± 0.009
Health	0.545 ± 0.013✓	0.449 ± 0.011	0.462 ± 0.019	0.466 ± 0.012
Recreation	0.379 ± 0.009✓	0.210 ± 0.007	0.263 ± 0.017	0.285 ± 0.023
Reference	0.493 ± 0.012✓	0.437 ± 0.007	0.437 ± 0.016	0.447 ± 0.009
Science	0.340 ± 0.017✓	0.246 ± 0.011	0.254 ± 0.017	0.270 ± 0.014
Social	0.583 ± 0.016✓	0.435 ± 0.021	0.543 ± 0.015	0.519 ± 0.021
Society	0.422 ± 0.014✓	0.392 ± 0.010	0.398 ± 0.011	0.402 ± 0.011
Avg. Rank	1.06✓	3.81	2.81	2.31

Table 11.

Dataset	Proposed	EGA+CDM	bALO-QR	CSO
RCV1 (S1)	0.025 ± 0.016✓	0.012 ± 0.002	0.011 ± 0.006	0.013 ± 0.006
RCV1 (S2)	0.114 ± 0.009✓	0.011 ± 0.003	0.090 ± 0.012	0.099 ± 0.009
RCV1 (S3)	0.129 ± 0.017✓	0.011 ± 0.004	0.108 ± 0.007	0.111 ± 0.009
RCV1 (S4)	0.166 ± 0.016✓	0.023 ± 0.005	0.120 ± 0.014	0.126 ± 0.012
RCV1 (S5)	0.113 ± 0.009✓	0.008 ± 0.003	0.090 ± 0.014	0.092 ± 0.012
Arts	0.190 ± 0.011✓	0.118 ± 0.009	0.143 ± 0.009	0.140 ± 0.020
Business	0.528 ± 0.008	0.527 ± 0.015	0.526 ± 0.013	0.529 ± 0.011✓
Computers	0.372 ± 0.010✓	0.323 ± 0.011	0.338 ± 0.011	0.340 ± 0.007
Education	0.247 ± 0.009✓	0.186 ± 0.016	0.197 ± 0.019	0.214 ± 0.010
Entertainment	0.326 ± 0.018✓	0.231 ± 0.021	0.243 ± 0.017	0.276 ± 0.013
Health	0.408 ± 0.014✓	0.315 ± 0.015	0.325 ± 0.011	0.352 ± 0.014
Recreation	0.270 ± 0.041✓	0.086 ± 0.010	0.137 ± 0.016	0.146 ± 0.017
Reference	0.427 ± 0.015✓	0.376 ± 0.007	0.379 ± 0.013	0.386 ± 0.015
Science	0.228 ± 0.030✓	0.153 ± 0.015	0.176 ± 0.015	0.179 ± 0.013
Social	0.520 ± 0.011✓	0.333 ± 0.018	0.482 ± 0.017	0.468 ± 0.014
Society	0.296 ± 0.014✓	0.274 ± 0.012	0.281 ± 0.010	0.289 ± 0.014
Avg. Rank	1.06✓	3.88	3.00	2.06

Table 12.

Evaluation Measure	Friedman Statistic	Critical Value ($\alpha = 0.05$)
Hamming loss	101.914	2.812
One-error	63.304	2.812
Multi-label accuracy	24.520	2.812
Subset accuracy	34.557	2.812
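The reported statistics appear consistent with the Iman-Davenport F form of the Friedman test, evaluated on rounded average ranks. This identification is an inference from the numbers and is not stated in the text above. As a check under that assumption, the sketch below plugs the average ranks of the first results table (1.06, 3.88, 2.94, 2.13) into the formula with 16 datasets and 4 methods; the function name `iman_davenport` is hypothetical.

```python
def iman_davenport(avg_ranks, num_datasets):
    """Iman-Davenport F statistic derived from the Friedman chi-square.

    chi2 = 12N / (k(k+1)) * (sum_j R_j^2 - k(k+1)^2 / 4)
    F    = (N - 1) * chi2 / (N(k - 1) - chi2)
    """
    k = len(avg_ranks)
    n = num_datasets
    chi2 = 12 * n / (k * (k + 1)) * (sum(r * r for r in avg_ranks) - k * (k + 1) ** 2 / 4)
    return (n - 1) * chi2 / (n * (k - 1) - chi2)


# Average ranks of Proposed, EGA+CDM, bALO-QR, CSO from the first results table.
f_stat = iman_davenport([1.06, 3.88, 2.94, 2.13], num_datasets=16)
print(round(f_stat, 2))
```

The result is close to the 101.914 listed for Hamming loss. Under the same assumption the statistic is compared against an F distribution with $(k - 1)$ and $(k - 1)(N - 1)$, that is, 3 and 45 degrees of freedom, which agrees with the listed critical value of 2.812; any value above it rejects the hypothesis that all methods perform equally.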

Table 13.

Evaluation Measure	Friedman Statistic	Critical Value ($\alpha = 0.05$)
Hamming loss	61.632	2.812
One-error	81.314	2.812
Multi-label accuracy	51.484	2.812
Subset accuracy	114.668	2.812
