Abstract
A grey wolf optimizer for modular neural networks (MNNs) with a granular approach is proposed. The proposed method performs optimal granulation of data and designs modular neural network architectures for human recognition; to prove its effectiveness, benchmark databases of ear, iris, and face biometric measures are used in tests and comparisons against other works. The design of a modular granular neural network (MGNN) consists of finding the optimal parameters of its architecture: the number of subgranules, the percentage of data for the training phase, the learning algorithm, the goal error, the number of hidden layers, and their numbers of neurons. Nowadays, there is a great variety of approaches and new techniques within the evolutionary computing area that have emerged to help find optimal solutions to problems or models, and bioinspired algorithms are part of this area. In this work a grey wolf optimizer is proposed for the design of modular granular neural networks, and the results are compared against a genetic algorithm and a firefly algorithm in order to determine which of these techniques provides better results when applied to human recognition.
1. Introduction
In this paper, a grey wolf optimizer for modular granular neural networks (MGNNs) is proposed. The main goal of this optimizer is the design of modular neural network architectures using a granular approach; to evaluate its effectiveness, these modular granular neural networks are applied to one of the most important pattern recognition problems, human recognition. Human recognition has long been a widely studied area, and its study mainly lies in finding the techniques and biometric measures that allow a trustworthy identification of persons in order to protect information or areas [1, 2]. Some of the most used biometric measures are face [3, 4], iris [5], ear [6, 7], voice [8], vein pattern [9], hand geometry [10], signature [11], and gait [12], among others.
On the other hand, among the most used techniques are those that belong to the soft computing category, such as artificial neural networks [13, 14], fuzzy logic [15], computational vision [16], granular computing [17, 18], data mining [19], and evolutionary computation [20, 21]. Within the evolutionary computation area, bioinspired algorithms are one type of method. The already well-known genetic algorithm (GA) [22, 23], ant colony optimization (ACO) [24], particle swarm optimization (PSO) [25], bat algorithm (BA) [26], grey wolf optimizer (GWO) [27], harmony search (HS) [28], gravitational search algorithm (GSA) [29], and firefly algorithm (FA) [30, 31], just to mention a few, belong to this category.
It is important to mention that some soft computing techniques, such as neural networks and fuzzy logic, can achieve better performance when combined with a bioinspired algorithm than when they are used individually. When two or more techniques are combined, the resulting system is called a hybrid intelligent system [7, 32]. In this paper a hybrid intelligent system is proposed using modular neural networks (MNNs), granular computing (GrC), and a grey wolf optimizer (GWO). The optimization of artificial neural networks (ANNs) using a grey wolf optimizer was already proposed in [33–36]. These works applied their methods to classification and function approximation, where the optimal initial weights of a neural network are sought using the grey wolf optimizer.
A modular neural network is an improvement of the conventional artificial neural network in which a task is divided into subtasks and an expert module learns some of these subtasks without communication with the other modules; this technique allows building systems that are resistant to failures and that can work with large amounts of information. This kind of network has usually been applied to human recognition based on biometric measures, classification problems, and time series prediction [40]. On the other hand, granular computing defines granules as classes or subsets used in complex applications to build computational models where large amounts of data and information are handled [19, 41, 42]. In this work granular computing is applied to granulate the information into subsets that also define the number of modules of a modular neural network; the combination of modular neural networks and granular computing was already proposed in [7, 37, 38], where the advantages of modular granular neural networks over conventional neural networks and modular neural networks were widely demonstrated. In [7], the modular granular neural network architectures were designed using an improvement of the genetic algorithm, a hierarchical genetic algorithm (HGA), whose main difference is the use of control genes that allow activating and deactivating other genes, which helps in solving complex problems. That design consisted in the optimization of the number of modules (subgranules), the percentage of data for the training phase, the learning algorithm, the goal error, and the number of hidden layers with their respective numbers of neurons. In [38], a firefly algorithm was proposed for MGNN optimization using an expert submodule for each division of the image. In [37], modular granular neural network architectures were also designed using a firefly algorithm, but without an expert submodule for each division of the image. In this work, the design of MGNN architectures is performed with a grey wolf optimizer and applied to human recognition based on ear, face, and iris, and statistical comparisons are performed to define which of these optimization techniques is better for MGNN optimization.
This paper is organized as follows. In Section 2, the proposed method is described. The results achieved by the proposed method are explained in Section 3. In Section 4, statistical comparisons of results are presented. Finally, conclusions are given in Section 5.
2. Proposed Method
The proposed hybrid intelligent method is described in this section; this method uses modular neural networks with a granular approach, and their architectures are designed by a grey wolf optimizer.
2.1. General Architecture of the Proposed Method
The proposed method uses modular granular neural networks; this kind of artificial neural network was proposed in [7] and [37], and its optimization was performed using, respectively, a hierarchical genetic algorithm and a firefly algorithm. In this work, the optimization is performed using a grey wolf optimizer, and a comparison among the HGA, the FA, and the GWO is carried out to determine which of these techniques is better for MGNN optimization. As a main task, the optimization technique has to find the number of subgranules (modules), and as a preprocessing step each image is divided into 3 regions of interest; these regions are described later. Figure 1 illustrates the granulation process used in this work and proposed in [7], where a database represents a whole granule. This granule can be divided into "m" subgranules (modules), where this parameter (m) can take values up to a certain limit set depending on the application, and each subgranule can have a different size; for example, when this granulation is applied to human recognition, each subgranule can contain a different number of persons that the corresponding submodule will learn. The grey wolf optimizer in this work performs the optimization of the granulation, the hidden layers, and the other parameters described later.
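To make this granulation concrete, the following is a minimal Python sketch that splits a database of persons into contiguous subgranules of different sizes, as the "persons per module" columns of the results tables suggest; the function name and the random choice of the module count are illustrative, since in the proposed method these values come from the optimizer.

```python
import random

def granulate(num_persons, max_modules, seed=0):
    """Split a database of persons (one whole granule) into m subgranules
    (modules) of possibly different sizes."""
    rng = random.Random(seed)
    m = rng.randint(1, max_modules)   # number of subgranules, normally chosen by the optimizer
    cuts = sorted(rng.sample(range(1, num_persons), m - 1)) if m > 1 else []
    bounds = [0] + cuts + [num_persons]
    return [list(range(bounds[i] + 1, bounds[i + 1] + 1)) for i in range(m)]

# Example: 77 persons (as in the ear database) split into up to 10 modules
for i, persons in enumerate(granulate(77, 10), start=1):
    print(f"Module #{i}: persons {persons[0]} to {persons[-1]}")
```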
2.1.1. Description of the Grey Wolf Optimizer
This algorithm is based on the hunting behavior of grey wolves and was proposed in [27]. A wolf pack usually has between 5 and 12 wolves, and each pack has a dominance hierarchy in which the leaders are called alphas; these wolves make the most important decisions of the pack. The complete social dominance hierarchy is illustrated in Figure 2.
This algorithm is based on 5 points: social hierarchy, encircling prey, hunting, attacking prey, and search for prey. These points are explained as follows.
Social Hierarchy. The best solution is alpha (α), the second best solution is beta (β), the third best solution is delta (δ), and the rest of the population are considered as omega (ω), where the omega solutions follow alpha, beta, and delta wolves.
Encircling Prey. During the hunt, grey wolves encircle their prey. This encircling behavior can be modeled mathematically using the equations

$$\vec{D} = \left| \vec{C} \cdot \vec{X}_p(t) - \vec{X}(t) \right|, \qquad \vec{X}(t+1) = \vec{X}_p(t) - \vec{A} \cdot \vec{D}, \tag{1}$$
where $\vec{A}$ and $\vec{C}$ are coefficient vectors, $\vec{X}_p$ is the prey position vector, $\vec{X}$ is the position vector of a grey wolf, and $t$ is the current iteration. The vectors $\vec{A}$ and $\vec{C}$ are calculated by

$$\vec{A} = 2\vec{a} \cdot \vec{r}_1 - \vec{a}, \qquad \vec{C} = 2\vec{r}_2, \tag{2}$$
where $\vec{r}_1$ and $\vec{r}_2$ are random vectors with values in $[0, 1]$ and $\vec{a}$ is a vector whose components decrease linearly from 2 to 0 over the iterations.
Hunting. It is assumed that alpha, beta, and delta are the best solutions and therefore have knowledge about the location of the prey; these three solutions are saved, and the positions of the other search agents are updated according to the positions of the best search agents. This part is mathematically represented by

$$\vec{D}_\alpha = \left|\vec{C}_1 \cdot \vec{X}_\alpha - \vec{X}\right|, \quad \vec{D}_\beta = \left|\vec{C}_2 \cdot \vec{X}_\beta - \vec{X}\right|, \quad \vec{D}_\delta = \left|\vec{C}_3 \cdot \vec{X}_\delta - \vec{X}\right|,$$
$$\vec{X}_1 = \vec{X}_\alpha - \vec{A}_1 \cdot \vec{D}_\alpha, \quad \vec{X}_2 = \vec{X}_\beta - \vec{A}_2 \cdot \vec{D}_\beta, \quad \vec{X}_3 = \vec{X}_\delta - \vec{A}_3 \cdot \vec{D}_\delta,$$
$$\vec{X}(t+1) = \frac{\vec{X}_1 + \vec{X}_2 + \vec{X}_3}{3}. \tag{3}$$
Attacking Prey (Exploitation). The vector $\vec{a}$ decreases from 2 to 0 over the iterations, so $\vec{A}$ takes random values in the interval $[-a, a]$; as a result, the next position of a search agent will be any position between its current position and the prey.
Search for Prey (Exploration). There are different components that allow divergence and a good exploration. The divergence is mathematically modeled by requiring $|\vec{A}| > 1$, which obliges solutions to diverge from the prey and perform a global search; meanwhile, $\vec{C}$ contains values in the interval $[0, 2]$ and provides random weights to the prey in order to favor exploration and avoid local optima. In Pseudocode 1, the pseudocode of the grey wolf optimizer is shown.
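Since the listing of Pseudocode 1 is not reproduced here, the following is a minimal Python sketch of the optimizer as described by (1)–(3), including the zero-error stopping condition mentioned later; all names are illustrative and the sketch is not the authors' implementation.

```python
import numpy as np

def grey_wolf_optimizer(objective, lower, upper, n_agents=10, max_iter=30, seed=0):
    """Minimal GWO sketch following [27]: alpha, beta, and delta (the three best
    wolves) guide the position updates of the remaining (omega) wolves."""
    rng = np.random.default_rng(seed)
    lower, upper = np.asarray(lower, float), np.asarray(upper, float)
    dim = lower.size
    wolves = rng.uniform(lower, upper, size=(n_agents, dim))   # random initial population
    fitness = np.array([objective(w) for w in wolves])

    for t in range(max_iter):
        order = np.argsort(fitness)
        alpha, beta, delta = (wolves[order[k]].copy() for k in range(3))
        if fitness[order[0]] == 0:                  # stop early on perfect recognition
            break
        a = 2.0 * (1.0 - t / max_iter)              # component a decreases linearly from 2 to 0
        for i in range(n_agents):
            new_pos = np.zeros(dim)
            for leader in (alpha, beta, delta):
                r1, r2 = rng.random(dim), rng.random(dim)
                A, C = 2.0 * a * r1 - a, 2.0 * r2   # coefficient vectors, as in (2)
                D = np.abs(C * leader - wolves[i])  # encircling distance, as in (1)
                new_pos += leader - A * D           # estimate guided by this leader, as in (3)
            wolves[i] = np.clip(new_pos / 3.0, lower, upper)
            fitness[i] = objective(wolves[i])
    best = int(np.argmin(fitness))
    return wolves[best], fitness[best]
```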
2.1.2. Description of the Grey Wolf Optimizer for MGNN
The grey wolf optimizer seeks to optimize modular granular neural network architectures. The optimized parameters are as follows:
Number of subgranules (modules).
Percentage of data for the training phase.
Learning algorithm (backpropagation algorithm for training the MGNN).
Goal error.
Number of hidden layers.
Number of neurons of each hidden layer.
Each parameter is represented by a dimension in each solution (search agent), and the total number of dimensions of each solution is determined using the following equation:

$$\text{Dimensions} = 2 + m\,(3 + h), \tag{4}$$
where m is the maximum number of subgranules that the grey wolf optimizer can use and h is the maximum number of hidden layers per module that the optimizer can use to perform the optimization. The variables mentioned above can be established depending on the application or the database, and the values used in this work are given in the next section. In Figure 3, the structure of each search agent is shown.
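As a rough illustration of how a search agent can be mapped back to an architecture, the sketch below assumes that the two global parameters come first and are followed by one block of 3 + h dimensions per module; this ordering is an assumption for illustration, since Figure 3 is not reproduced here.

```python
def decode_search_agent(agent, m_max=10, h_max=10):
    """Map a flat vector of length 2 + m_max * (3 + h_max) to MGNN parameters.
    The ordering of dimensions is assumed, not taken from Figure 3."""
    num_modules = int(round(agent[0]))             # number of subgranules (modules)
    train_percentage = agent[1]                    # percentage of data for the training phase
    modules, pos = [], 2
    for _ in range(m_max):
        goal_error = agent[pos]
        learning_alg = int(round(agent[pos + 1]))  # 1 = SCG, 2 = GDX, 3 = GDA
        hidden_layers = int(round(agent[pos + 2]))
        neurons = [int(round(n)) for n in agent[pos + 3:pos + 3 + hidden_layers]]
        modules.append({"goal_error": goal_error,
                        "learning_algorithm": learning_alg,
                        "hidden_layers": hidden_layers,
                        "neurons": neurons})
        pos += 3 + h_max                           # each module block occupies 3 + h_max dimensions
    return num_modules, train_percentage, modules[:num_modules]
```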
This optimizer aims to minimize the recognition error, and the objective function is given by the equation

$$f = \frac{\displaystyle\sum_{i=1}^{m}\left(\frac{\sum_{j=1}^{n_m} X_j}{n_m}\right)}{m}, \tag{5}$$

where $m$ is the total number of subgranules (modules), $X_j$ is 0 if the module provides the correct result for the $j$th image and 1 if not, and $n_m$ is the total number of data/images used in the testing phase of the corresponding module.
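A direct transcription of (5), under the reading that the error is the average of the per-module error rates over their own test images (the function name is illustrative):

```python
def recognition_error(module_results):
    """Objective function of (5): module_results[i] holds one value per test image
    of module i, 0 if that image was recognized correctly and 1 if not."""
    m = len(module_results)
    return sum(sum(x) / len(x) for x in module_results) / m

# Two modules: the first recognizes all 3 of its test images,
# the second misses 1 of its 4 test images -> error = (0 + 0.25) / 2 = 0.125
print(recognition_error([[0, 0, 0], [0, 1, 0, 0]]))
```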
2.2. Proposed Method Applied to Human Recognition
One of the most important parameters of the architecture is its learning algorithm; backpropagation algorithms are used in the training phase to perform the learning, and 3 variations of this algorithm can be selected by the proposed optimizer: gradient descent with scaled conjugate gradient (SCG), gradient descent with adaptive learning rate and momentum (GDX), and gradient descent with adaptive learning rate (GDA). These 3 algorithms were selected because they have demonstrated to be among the fastest algorithms and because better performances and results have been obtained with them [6, 7, 37–39].
The main comparisons for the proposed method are against the optimizations proposed in [7, 37, 38]. In the first one a hierarchical genetic algorithm is developed, and in the second and third works a firefly algorithm is developed to perform the MGNN optimization. To have a fair comparison, the number of individuals/fireflies and the number of generations/iterations used in [7, 37, 38] are the same as those used by the proposed method in this work; for the GWO these values are, of course, the number of search agents and the number of iterations. In Table 1, the values of the parameters used for each optimization algorithm are presented.
Table 1.
As mentioned, the number of dimensions is established using (4), where values such as h and m are set depending on the application. For this work, as in [7, 37, 38], the minimum and maximum values used for the search space of each optimizer are shown in Table 2 (a code sketch of these limits follows the table). The optimization techniques also have two stopping conditions: when the maximum number of iterations/generations is reached and when the best solution has an error value equal to zero. In Figure 4, the diagram of the proposed method is shown.
Table 2.
Parameters of MNNs | Minimum | Maximum |
---|---|---|
Modules (m) | 1 | 10 |
Percentage of data for training | 20 | 80 |
Error goal | 0.000001 | 0.001 |
Learning algorithm | 1 | 3 |
Hidden layers (h) | 1 | 10 |
Neurons for each hidden layer | 20 | 400
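A minimal sketch of how the limits of Table 2 can be turned into lower and upper bound vectors for the optimizer, assuming the agent layout sketched earlier (2 global dimensions followed by 3 + h dimensions per module); the grouping of dimensions is an assumption.

```python
import numpy as np

# Search-space limits per parameter type, taken from Table 2
LIMITS = {
    "modules":       (1, 10),
    "train_percent": (20, 80),
    "error_goal":    (0.000001, 0.001),
    "learning_alg":  (1, 3),        # 1 = SCG, 2 = GDX, 3 = GDA
    "hidden_layers": (1, 10),
    "neurons":       (20, 400),
}

def build_bounds(m_max=10, h_max=10):
    """Lower/upper bound vectors matching the assumed search-agent layout."""
    keys = ["modules", "train_percent"]
    for _ in range(m_max):
        keys += ["error_goal", "learning_alg", "hidden_layers"] + ["neurons"] * h_max
    lower = np.array([LIMITS[k][0] for k in keys], dtype=float)
    upper = np.array([LIMITS[k][1] for k in keys], dtype=float)
    return lower, upper

lower, upper = build_bounds()
print(lower.size)   # 2 + 10 * (3 + 10) = 132 dimensions
```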
2.3. Data Selection, Databases, and Preprocessing
The description of the databases, data selection for each phase (training and testing), and the applied preprocessing are presented below.
2.3.1. Data Selection
To understand the data selection, it is important to mention that MGNNs, like MNNs and conventional ANNs, have two important phases:
First phase: the neural network learns information or patterns.
Second phase: the neural network simulates other pieces of information that were not given to it for learning.
As observed, data selection is an important part of the neural network, and for this reason a new method to select information or images was proposed in [7]. In this data selection, the percentage of data for the training phase (a value between 20% and 80%) is converted to a number of images (depending on the number of images per person in the database), and the images for each phase are selected at random. Figure 5 illustrates an example in which a person has 4 images (as in the ear database) and 2 of them are selected for the training phase.
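A small sketch of this selection step follows; the rounding of the percentage to a whole number of images is an assumption, since the exact rule is not stated here.

```python
import random

def select_images(images_per_person, train_percent, seed=0):
    """Convert the optimized training percentage into a number of images and
    pick them at random; the remaining images go to the testing phase."""
    rng = random.Random(seed)
    n_train = int(round(images_per_person * train_percent / 100.0))
    indices = list(range(1, images_per_person + 1))
    train = sorted(rng.sample(indices, n_train))
    test = [i for i in indices if i not in train]
    return train, test

# e.g., 4 images per person (ear database) and 50% for training
train, test = select_images(4, 50)
print(train, test)   # two randomly chosen images for training, the other two for testing
```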
2.3.2. Database of Ear
The ear database is from the Ear Recognition Laboratory of the University of Science & Technology Beijing (USTB). The database contains 77 persons, where each person has 4 images of one ear. The image dimensions are 300 × 400 pixels, in BMP format [47]. A sample of the images is shown in Figure 6.
2.3.3. Database of Face (ORL)
The ORL database contains 40 persons, and each person has 10 images. This database is from the AT&T Laboratories Cambridge; each image has dimensions of 92 × 112 pixels, in PGM format. Figure 7 shows a sample of the images of this database [48].
2.3.4. Database of Face (FERET)
The FERET database [49] contains 11338 images from 994 persons; each image has dimensions of 512 × 768 pixels, in PGM format. Figure 8 shows a sample of the images of this database.
2.3.5. Database of Iris
The iris database [50] contains 77 persons, and each person has 14 images. The image dimensions are 320 × 280 pixels, in JPEG format. Figure 9 shows a sample of the images of this database.
2.3.6. Preprocessing
The preprocessing applied to these databases is simple because the focus of the proposed method is the optimization of the granulation. For the ear database, the ear image is manually cut, the new image is resized to 132 × 91 pixels, and the image is automatically divided into three regions of interest (helix, shell, and lobe); this preprocessing was already performed in [7]. For the FERET database, the Viola-Jones algorithm [51, 52] was used to detect the face in each image; each image is then resized to 100 × 100 pixels, converted to grayscale, and automatically divided into three regions (front, eyes, and mouth). For the iris database, the method developed by Masek and Kovesi [53] is used to obtain the coordinates and radius of the iris and pupil in order to cut out the iris; each image is resized to 21 × 21 pixels and finally automatically divided into three parts. For the ORL database, each image is automatically divided into three regions of interest (front, eyes, and mouth). The preprocessing of these databases is shown in Figure 10.
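A simplified sketch of the face preprocessing described above, assuming OpenCV for grayscale conversion and resizing and an equal-thirds split into the three regions; the actual region boundaries are not specified here, so the split proportions are an assumption, and the Viola-Jones face detection step is omitted.

```python
import cv2  # OpenCV, assumed available

def preprocess_face(image, size=(100, 100)):
    """Convert to grayscale, resize (FERET-style, 100 x 100 pixels), and divide
    the face into three horizontal regions (front, eyes, mouth)."""
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY) if image.ndim == 3 else image
    resized = cv2.resize(gray, size)
    third = resized.shape[0] // 3
    front = resized[:third, :]           # upper region
    eyes = resized[third:2 * third, :]   # middle region
    mouth = resized[2 * third:, :]       # lower region
    return front, eyes, mouth
```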
3. Experimental Results
The proposed method is applied to human recognition, and the results achieved are shown in this section. The main comparisons of the proposed method are against a hierarchical genetic algorithm proposed in [7] and a firefly algorithm proposed in [37, 38]; the ear database is used in [7, 38], while the iris database is used in [37], and in all of them architectures of MGNNs are optimized. In [7, 38], two optimized tests for the MGNNs were performed; these tests are replicated in this work (30 trials/runs for each test), and to summarize, only the 5 best results are shown. In [37], two optimized tests for the MGNNs were performed; the second test is replicated in this work (20 trials/runs), and to summarize, also only the 5 best results are shown. For the ORL and FERET databases, 5 and 4 trials/runs, respectively, were performed to compare with other works.
3.1. Ear Results
The results achieved, using the ear database, are presented in this section. Each test is described as follows:
Test #1: the search space for the percentage of data for the training phase is limited up to 80%; that is, the optimization technique can select up to this percentage of images of the total number of images per person.
Test #2: in this test the search space for the percentage of data for the training phase is limited up to 50%.
3.1.1. Test #1 Results for the Ear
In this test, the proposed grey wolf optimizer can use up to 80% of data for the training phase to design the MGNN architectures. In Table 3, the best 5 results using the proposed method in this work are shown.
Table 3.
Trial | Images (training) | Images (testing) | Number of hidden layers and number of neurons | Persons per module | Rec. rate | Error
---|---|---|---|---|---|---
1 | 80% (1, 2, and 3) |
20% (4) |
5 (126, 96, 179, 239, 37) 4 (188, 196, 93, 171) 5 (109, 107, 110, 168, 29) |
Module #1 (1 to 12) Module #2 (13 to 40) Module #3 (41 to 77) |
100% (77/77) |
0 |
| ||||||
2 | 69% (2, 3 and 4) |
31% (1) |
5 (222, 238, 113, 27, 75) 4 (151, 53, 99, 79) 2 (209, 31) 2 (144, 71) 4 (30, 218, 194, 199) 4 (25, 81, 239, 20) 5 (237, 43, 83, 102, 128) |
Module #1 (1 to 5) Module #2 (6 to 21) Module #3 (22 to 31) Module #4 (32 to 46) Module #5 (47 to 63) Module #6 (64 to 73) Module #7 (74 to 77) |
100% (77/77) |
0 |
| ||||||
3 | 66% (2, 3, and 4) |
34% (1) |
5 (141, 70, 120, 158, 242) 4 (124, 55, 23, 243) 3 (96, 186, 213) 4 (28, 62, 51, 42) 1 (223) |
Module #1 (1 to 34) Module #2 (35 to 40) Module #3 (41 to 44) Module #4 (45 to 75) Module #5 (76 to 77) |
100% (77/77) |
0 |
| ||||||
4 | 74% (2, 3, and 4) |
26% (1) |
5 (139, 97, 200, 121, 231) 5 (204, 114, 164, 216, 138) 5 (195, 137, 124, 71, 86) 5 (144, 70, 92, 220, 63) 5 (119, 176, 154, 167, 161) 4 (199, 162, 96, 65) |
Module #1 (1 to 6) Module #2 (7 to 29) Module #3 (30 to 50) Module #4 (51 to 58) Module #5 (59 to 71) Module #6 (72 to 77) |
100% (77/77) |
0 |
| ||||||
5 | 63% (2, 3, and 4) |
37% (1) |
5 (136, 183, 149, 193, 161) 5 (181, 132, 175, 140, 155) |
Module #1 (1 to 68) Module #2 (69 to 77) |
100% (77/77) |
0 |
The behavior of trial #4 is shown in Figure 11, where the best, the average, and the worst results of each iteration are shown. In Figure 12, alpha (first best solution), beta (second best solution), and delta (third best solution) behavior of trial #4 are shown. This trial was one of the fastest trials to obtain an error value equal to zero.
In Figure 13, the recognition errors obtained by the proposed grey wolf optimizer, the HGA proposed in [7], and the FA proposed in [38] are shown.
In all the trials performed by the grey wolf optimizer an error equal to zero is obtained. In Table 4, a comparison of results between the proposed method and the work in [7, 38] is shown.
Table 4.
The average convergence of the 30 trials/runs of each optimization technique is shown in Figure 14, where it can be observed that the GWO always found an error equal to zero within the first 5 iterations, whereas the HGA and the FA did not reach this value in some runs.
3.1.2. Test #2 Results for Ear
In this test, the proposed grey wolf optimizer can use up to 50% of data for the training phase to design the MGNN architectures. In Table 5, five architectures with the best results are shown.
Table 5.
Trial | Images (training) | Images (testing) | Number of hidden layers and number of neurons | Persons per module | Rec. rate | Error
---|---|---|---|---|---|---
2 | 43% (2 and 3) |
57% (1 and 4) |
5 (115, 49, 187, 122, 194) 5 (182, 139, 50, 217, 54) 5 (132, 182, 56, 187, 159) 5 (167, 132, 121, 123, 219) 4 (116, 195, 54, 174) 5 (157, 108, 166, 95, 88) 5 (116, 119, 76, 121, 94) 5 (102, 58, 69, 111, 42) |
Module #1 (1 to 9) Module #2 (10 to 22) Module #3 (23 to 33) Module #4 (34 to 36) Module #5 (37 to 51) Module #6 (52 to 63) Module #7 (64 to 75) Module #8 (76 to 77) |
96.75% (149/154) |
0.0325 |
| ||||||
4 | 48% (2 and 3) |
52% (1 and 4) |
4 (98, 136, 165, 141) 3 (176, 104, 215) 4 (142, 222, 65, 28) 5 (97, 139, 129, 99, 28) 4 (225, 83, 188, 34) |
Module #1 (1 to 26) Module #2 (27 to 39) Module #3 (40 to 55) Module #4 (56 to 65) Module #5 (66 to 77) |
96.75% (149/154) |
0.0325 |
| ||||||
7 | 49% (2 and 3) |
51% (1 and 4) |
5 (201, 84, 169, 113, 131) 5 (199, 189, 62, 159, 151) 5 (104, 129, 88, 166, 66) 5 (123, 96, 52, 26, 67) 5 (125, 141, 86, 77, 105) 5 (121, 145, 87, 122, 31) 5 (36, 126, 146, 143, 145) 5 (126, 140, 88, 173, 206) |
Module #1 (1 to 5) Module #2 (6 to 17) Module #3 (18 to 32) Module #4 (33 to 34) Module #5 (35 to 40) Module #6 (41 to 51) Module #7 (52 to 63) Module #8 (64 to 77) |
96.75% (149/154) |
0.0325 |
| ||||||
8 | 39% (2 and 3) |
61% (1 and 4) |
5 (125, 75, 69, 114, 140) 5 (138, 157, 101, 164, 98) 5 (76, 78, 86, 135, 70) 4 (74, 53, 57, 73) 5 (123, 55, 75, 125, 143) 5 (99, 118, 149, 224, 67) 5 (130, 184, 156, 180, 153) |
Module #1 (1 to 11) Module #2 (12 to 14) Module #3 (15 to 27) Module #4 (28 to 33) Module #5 (34 to 43) Module #6 (44 to 57) Module #7 (58 to 77) |
96.75% (149/154) |
0.0325 |
| ||||||
14 | 40% (2 and 3) |
60% (1 and 4) |
5 (58, 26, 159, 123, 106) 5 (157, 156, 197, 22, 112) 4 (215, 78, 97, 220) 5 (120, 68, 219, 194, 58) 5 (142, 185, 141, 33, 187) 5 (108, 160, 61, 100, 54) |
Module #1 (1 to 12) Module #2 (13 to 20) Module #3 (21 to 40) Module #4 (41 to 52) Module #5 (53 to 66) Module #6 (67 to 77) |
96.75% (149/154) |
0.0325 |
The behavior of trial #2 is shown in Figure 15, where the best, the average, and the worst results of each iteration are shown. In Figure 16, the alpha (first best solution), beta (second best solution), and delta (third best solution) behaviors of trial #2 are shown. This trial was one of the best trials, where a recognition error equal to 0.0325 is obtained.
In Figure 17, the recognition errors obtained by the proposed grey wolf optimizer, the HGA proposed in [7], and the FA proposed in [38] for test #2 are shown. It can be seen that the results obtained by the grey wolf optimizer and the firefly algorithm are more stable than those of the HGA.
In Table 6, a comparison of results between the proposed method and [7, 38] is shown. The best result is obtained by the HGA, but the average is slightly improved by the firefly algorithm; meanwhile the worst errors are improved by the proposed method and the firefly algorithm.
Table 6.
The average convergence of the 30 trials/runs of each optimization technique is shown in Figure 18, where the HGA generally tends to stagnate more than the GWO and the FA.
3.2. Face Results (ORL)
The results achieved, using the ORL database, are presented in this section. For this database 2 tests were also performed, but to compare with other works the percentage of data for the training phase is set fixed. Each test is described as follows:
Test #1: the percentage of data for the training phase is set to 80%.
Test #2: the percentage of data for the training phase is set to 50%.
3.2.1. Test #1 Results for Face
In this test, the proposed grey wolf optimizer uses 80% of data for the training phase to design the MGNN architectures. In Table 7, five architectures with the best results are shown.
Table 7.
Trial | Images (training) | Images (testing) | Number of hidden layers and number of neurons | Persons per module | Rec. rate | Error
---|---|---|---|---|---|---
1 | 80% (1, 2, 3, 4, 7, 8, 9, and 10) |
20% (5 and 6) |
5 (109, 109, 69, 74, 210) 5 (175, 32, 170, 214, 86) 4 (117, 52, 134, 197) 4 (190, 162, 99, 81) 5 (111, 130, 247, 160, 64) 4 (111, 250, 116, 127) |
Module #1 (1 to 4) Module #2 (5 to 12) Module #3 (13 to 15) Module #4 (16 to 24) Module #5 (25 to 33) Module #6 (34 to 40) |
100% (80/80) |
0 |
| ||||||
2 | 80% (1, 3, 4, 5, 6, 7, 8, and 10) |
20% (2 and 9) |
5 (52, 188, 138, 154, 71) 5 (216, 183, 74, 142, 112) 5 (73, 204, 139, 94, 114) 5 (101, 124, 144, 207, 133) 4 (96, 205, 157, 238) 5 (46, 160, 86, 119, 105) 5 (138, 169, 152, 146, 48) 5 (32, 65, 173, 156, 56) |
Module #1 (1 to 5) Module #2 (6 to 15) Module #3 (16 to 17) Module #4 (18 to 19) Module #5 (20 to 29) Module #6 (30 to 32) Module #7 (33 to 38) Module #8 (39 to 40) |
100% (80/80) |
0 |
| ||||||
3 | 80% (1, 2, 4, 5, 7, 8, 9, and 10) |
20% (3 and 6) |
5 (158, 67, 80, 49, 124) 5 (138, 72, 51, 87, 218) 5 (138, 176, 108, 21, 139) 5 (136, 46, 66, 41, 68) 5 (182, 40, 246, 104, 45) 5 (126, 202, 171, 45, 228) 5 (228, 153, 133, 199, 85) 4 (98, 140, 72, 188) |
Module #1 (1 to 3) Module #2 (4 to 5) Module #3 (6 to 13) Module #4 (14 to 18) Module #5 (19 to 23) Module #6 (24 to 25) Module #7 (26 to 30) Module #8 (31 to 40) |
100% (80/80) |
0 |
| ||||||
4 | 80% (1, 3, 4, 5, 7, 8, 9, and 10) |
20% (2 and 6) |
5 (39, 55, 21, 84, 210) 1 (224) 3 (98, 204, 243) 5 (61, 86, 237, 49) 2 (199, 62) 1 (180) 5 (206, 29, 240, 215, 105) |
Module #1 (1 to 7) Module #2 (8 to 9) Module #3 (10 to 12) Module #4 (13 to 17) Module #5 (18 to 26) Module #6 (27 to 34) Module #7 (35 to 40) |
100% (80/80) |
0 |
| ||||||
5 | 80% (1, 2, 3, 5, 6, 7, 8, and 10) |
20% (4 and 9) |
5 (75, 156, 197, 128, 233) 5 (225, 87, 193, 58, 182) 5 (161, 240, 36, 157, 151) 5 (228, 222, 64, 102, 132) 5 (161, 50, 80, 175, 105) 5 (150, 105, 194, 122, 80) 5 (121, 116, 122, 88, 42) 5 (66, 210, 92, 48, 179) |
Module #1 (1 to 4) Module #2 (5 to 13) Module #3 (14 to 16) Module #4 (17 to 23) Module #5 (24 to 26) Module #6 (27 to 29) Module #7 (30 to 31) Module #8 (32 to 40) |
100% (80/80) |
0 |
The behavior of trial #5 is shown in Figure 19, where the best, the average, and the worst results of each iteration are shown. In Figure 20, the alpha (first best solution), beta (second best solution), and delta (third best solution) behaviors of trial #5 are shown. This trial was one of the fastest trials to obtain an error value equal to zero.
In Figure 21, the recognition rates obtained by [4, 38, 39] and the proposed grey wolf optimizer are shown. The proposed method and the firefly algorithm proposed in [38] achieve a recognition rate of 100%.
In Table 8, a comparison of results is presented. The best result is obtained by the works in [38, 39] and the proposed method, but the average and the worst error are improved by the proposed method and the firefly algorithm.
Table 8.
3.2.2. Test #2 Results for Face
In this test, the proposed grey wolf optimizer uses 50% of data for the training phase to design the MGNN architectures. In Table 9, the best 5 results using the proposed method in this work are shown.
Table 9.
Trial | Images (training) | Images (testing) | Number of hidden layers and number of neurons | Persons per module | Rec. rate | Error
---|---|---|---|---|---|---
1 | 50% (2, 3, 4, 7, and 9) |
50% (1, 5, 6, 8 and, 10) |
5 (139, 149, 64, 49, 69) 5 (112, 89, 137, 112, 203) 5 (109, 141, 115, 142, 206) 5 (69, 183, 84, 33, 233) 5 (43, 127, 176, 236, 39) 5 (124, 192, 92, 92, 193) 5 (70, 188, 227, 165, 98) 5 (75, 79, 128, 171, 159) |
Module #1 (1 to 5) Module #2 (6 to 12) Module #3 (13 to 17) Module #4 (18 to 22) Module #5 (23 to 30) Module #6 (31 to 34) Module #7 (35 to 36) Module #8 (37 to 40) |
99% (198/200) |
0.0100 |
| ||||||
2 | 50% (1, 2, 4, 5, and 7) |
50% (3, 6, 8, 9 and, 10) |
5 (141, 99, 172, 88, 81) 4 (198, 101, 244, 148) 5 (159, 31, 175, 125, 168) 5 (31, 90, 125, 116, 111) 5 (102, 107, 110, 87, 21) 5 (113, 78, 55, 184, 209) 5 (248, 108, 150, 88, 40) 4 (119, 136, 90, 126) 3 (213, 71, 127) 4 (207, 131, 182, 48) |
Module #1 (1 to 7) Module #2 (8 to 12) Module #3 (13 to 15) Module #4 (16 to 18) Module #5 (19 to 21) Module #6 (22 to 23) Module #7 (24 to 30) Module #8 (31 to 33) Module #9 (34 to 38) Module #10 (39 to 40) |
98.50% (197/200) |
0.0150 |
| ||||||
3 | 50% (3, 5, 7, 8, and 10) |
50% (1, 2, 4, 6, and 9) |
4 (60, 37, 220, 169) 5 (84, 106, 155, 187, 182) 5 (33, 222, 144, 23, 123) 5 (199, 85, 38, 78, 103) 5 (63, 143, 89, 191, 93) 5 (122, 189, 135, 95, 181) 5 (91, 194, 227, 119, 130) 3 (188, 124, 238) 5 (44, 105, 217, 102, 199) 5 (114, 129, 24, 140, 208) |
Module #1 (1 to 2) Module #2 (3 to 7) Module #3 (8 to 10) Module #4 (11 to 16) Module #5 (17 to 21) Module #6 (22 to 23)Module #7 (24 to 27) Module #8 (28 to 31) Module #9 (32 to 35) Module #10 (36 to 40) |
98% (196/200) |
0.0200 |
| ||||||
4 | 50% (3, 4, 7, 9, and 10) |
50% (1, 2, 5, 6 and 8) |
5 (52, 173, 68, 176, 133) 5 (143, 202, 54, 67, 55) 5 (82, 142, 191, 47, 183) 5 (205, 115, 95, 143, 218)5 (95, 142, 73, 47, 117) 5 (182, 86, 87, 113, 102) 5 (40, 115, 98, 95, 120) 5 (196, 181, 82, 69, 154) 5 (97, 117, 142, 216, 65) 5 (153, 155, 91, 48, 124) |
Module #1 (1 to 3) Module #2 (4 to 6) Module #3 (7 to 9) Module #4 (10 to 13) Module #5 (14 to 15) Module #6 (16 to 22) Module #7 (23 to 27) Module #8 (28 to 31) Module #9 (32 to 35) Module #10 (36 to 40) |
99% (198/200) |
0.0100 |
| ||||||
5 | 50% (2, 3, 5, 8, and 9) |
50% (1, 4, 6, 7, and 10) |
5 (128, 150, 50, 26, 73) 5 (145, 149, 49, 69, 58) 5 (129, 58, 124, 86, 70) 5 (127, 69, 126, 139, 69) 5 (33, 174, 146, 137, 218) 5 (137, 95, 232, 187, 97) 5 (101, 104, 158, 66, 95) 5 (142, 207, 48, 140, 51) 5 (79, 157, 191, 129, 222) 5 (199, 102, 148, 103, 49) |
Module #1 (1 to 2) Module #2 (3 to 4) Module #3 (5 to 13) Module #4 (14 to 18) Module #5 (19 to 20) Module #6 (21 to 25) Module #7 (26 to 30) Module #8 (31 to 33) Module #9 (34 to 35) Module #10 (36 to 40) |
98% (196/200) |
0.0200 |
The behavior of trial #1 is shown in Figure 22, where the best, the average, and the worst results of each iteration are shown. In Figure 23, the alpha (first best solution), beta (second best solution), and delta (third best solution) behaviors of trial #1 are shown. This trial was one of the best trials, where an error of recognition equal to 0.0100 is obtained.
In Figure 24, the recognition rates obtained by [3, 38, 39, 43] and the proposed method are shown.
In Table 10, a comparison of results between the proposed method and the other works is shown. The best and the worst errors are improved by the proposed method and the firefly algorithm, but the average recognition rate is slightly improved only by the proposed method.
Table 10.
3.3. Iris Results
In this test, the proposed grey wolf optimizer uses up to 80% of data for the training phase to design the MGNN architectures, as in [37, 44]. In Table 11, five architectures with the best results are shown.
Table 11.
Trial | Images (training) | Images (testing) | Number of hidden layers and number of neurons | Persons per module | Rec. rate | Error
---|---|---|---|---|---|---
1 | 79% (1, 2, 3, 5, 6, 8, 10, 11, 12, 13, and 14) |
21% (4, 7, and 9) |
5 (133, 205, 93, 203, 184) 4 (112, 198, 134, 97) 5 (39, 159, 68, 76, 119) 2 (158, 148) 5 (183, 139, 135, 51, 72) 4 (224, 168, 148, 195) 5 (152, 170, 65, 47, 55) 5 (114, 218, 162, 85, 107) 3 (86, 205, 172) |
Module #1 (1 to 15) Module #2 (16 to 22) Module #3 (23 to 34) Module #4 (35 to 45) Module #5 (46 to 47) Module #6 (48 to 49) Module #7 (50 to 64) Module #8 (65 to 74) Module #9 (75 to 77) |
99.57% (230/231) |
0.0043 |
| ||||||
2 | 75% (2, 3, 4, 5, 6, 8, 9, 10, 12, 13, and 14) |
25% (1, 7, and 11) |
5 (97, 66, 149, 117, 144) 5 (69, 210, 77, 70, 203) 4 (159, 102, 153, 152) 5 (35, 171, 134, 124, 101) 3 (167, 166, 169) 5 (198, 64, 80, 176, 131) 3 (81, 80, 227) 4 (106, 114, 89, 148) |
Module #1 (1 to 4) Module #2 (5 to 15) Module #3 (16 to 23) Module #4 (24 to 31) Module #5 (32 to 46) Module #6 (47 to 58) Module #7 (59 to 62) Module #8 (63 to 77) |
100% (231/231) |
0 |
| ||||||
6 | 76% (1, 2, 3, 4, 5, 6, 8, 9, 12, 13 and, 14) |
24% (7, 10, and 11) |
4 (73, 210, 138, 49) 5 (119, 161, 63, 96, 112) 3 (180, 135, 77) 5 (124, 164, 177, 216, 94) 5 (129, 123, 215, 88, 100) 5 (65, 89, 69, 144, 80) 5 (67, 110, 112, 200, 134) 3 (86, 72, 160) |
Module #1 (1 to 3) Module #2 (4 to 13) Module #3 (14 to 30) Module #4 (31 to 40) Module #5 (41 to 51) Module #6 (52 to 60) Module #7 (61 to 65) Module #8 (66 to 77) |
99.57% (230/231) |
0.0043 |
| ||||||
7 | 78% (1, 2, 3, 4, 5, 6, 7, 8, 10, 11, and 13) |
22% (9, 12, and 14) |
5 (168, 99, 94, 156, 175) 4 (90, 122, 124, 122) 5 (129, 32, 159, 174, 50) 4 (218, 93, 237, 71) 5 (117, 36, 167, 143, 52) 5 (135, 60, 226, 140, 112) 5 (169, 117, 95, 36, 96) 5 (97, 71, 225, 147, 176) 3 (162, 170, 139) |
Module #1 (1 to 4) Module #2 (5 to 16) Module #3 (17 to 20) Module #4 (21 to 37) Module #5 (38 to 46) Module #6 (47 to 51) Module #7 (52 to 71) Module #8 (72 to 73) Module #9 (74 to 77) |
99.57% (230/231) |
0.0043 |
| ||||||
11 | 78% (1, 2, 3, 4, 5, 6, 7, 8, 10, 13, and 14) |
22% (9, 11, and 12) |
5 (86, 162, 217, 168, 168) 4 (167, 189, 62, 193) 5 (115, 53, 154, 105, 79) 3 (62, 89, 134, 87)4 (119, 142, 105, 204) 3 (128, 115, 175, 127)5 (147, 197, 61, 110, 217) 3 (142, 164, 96, 141) 5 (140, 104, 57, 108, 122) |
Module #1 (1 to 4) Module #2 (5 to 8) Module #3 (9 to 16) Module #4 (17 to 32) Module #5 (33 to 39) Module #6 (40 to 46) Module #7 (47 to 57) Module #8 (58 to 68) Module #9 (69 to 77) |
100% (231/231) |
0 |
The behavior of trial #2 is shown in Figure 25, where the best, the average, and the worst results of each iteration are shown. In Figure 26, the alpha (first best solution), beta (second best solution), and delta (third best solution) behaviors of trial #2 are shown. This trial was one of the best trials, where an error of recognition equal to 0 is obtained.
In Figure 27, the errors of recognition obtained by [37, 44] and the proposed method are presented.
In Table 12, a comparison of results is presented. The best, the average, and the worst errors are improved by the proposed method.
Table 12.
The average convergence of the 20 trials/runs of each optimization technique is shown in Figure 28; although these techniques do not tend to stagnate for a long time, the GWO tends to converge faster and with better results.
3.4. Summary Results
A summary of results and a comparison with other works using the same databases and neural networks are shown in this section. The testing time of a set of images depends on the number of images and their size, but the training time also depends on the neural network architecture (number of hidden layers, neurons in each hidden layer, and number of modules) and on learning factors (initial weights and error goal, among others). An approximation of the training and testing times for each search agent is shown in Figures 29 and 30, respectively.
In Table 13 a summary of each database setup is shown. It can be noticed that the iris database has more images in each test, but its images are smaller than those of the other databases; for this reason the training and testing times for this database are the smallest. In the case of the ear database, the number of images is smaller than in the other databases but its images are bigger, so the training and testing times tend to increase.
Table 13.
Database | Number of persons | Max. images per person (training) | Max. images per person (testing) | Image size (pixels)
---|---|---|---|---
Ear | 77 | 3 | 3 | 132 × 91 |
ORL | 40 | 9 | 9 | 92 × 112 |
FERET | 200 | 6 | 6 | 100 × 100 |
Iris | 77 | 13 | 13 | 21 × 21 |
In Table 14, the summary of results obtained using the GWO applied to the ear, face, and iris database is shown.
Table 14.
Method | Number of images for training | Recognition rate (best) | Recognition rate (average) | Recognition rate (worst)
---|---|---|---|---
Proposed method (ear database) | 3 (up to 80%) | 100% | 100% | 100%
Proposed method (ear database) | 2 (up to 50%) | 96.75% | 96.15% | 95.45%
Proposed method (ORL database) | 8 (up to 80%) | 100% | 100% | 100%
Proposed method (ORL database) | 5 (up to 50%) | 99% | 98.50% | 98.50%
Proposed method (FERET database) | (up to 80%) | 98% | 92.63% | 88.17%
Proposed method (iris database) | (up to 80%) | 100% | 99.31% | 98.70%
In [7], modular granular neural networks are proposed and compared with conventional neural networks, using a hierarchical genetic algorithm to design the neural network architectures. In [38], the design of modular granular neural network architectures using a firefly algorithm is proposed. In [45], the architectures of modular neural networks are designed using a hierarchical genetic algorithm but without a granular approach; that is, the number of modules and the number of persons learned by each module were always left fixed. In Table 15, the comparisons among the optimized results obtained using the proposed method and other optimized works are presented, where the average for the ear database was improved by the proposed method (test #1, using 3 images) and by the firefly algorithm (test #2, using 2 images).
Table 15.
Method | Number of images for training | Best (%) | Average (%) | Worst (%)
---|---|---|---|---
Sánchez and Melin [7] (ANN) | 3 | 100% | 96.75% | —
Melin et al. [45] (MNN) | 3 | 100% | 93.82% | 83.11%
Sánchez and Melin [7] (MGNN) | 3 | 100% | 99.69% | 93.5%
Sánchez et al. [38] (FA) | 3 | 100% | 99.89% | 98.05%
Proposed method (MGNN) | 3 | 100% | 100% | 100%
Sánchez and Melin [7] (ANN) | 2 | 96.10% | 88.53% | —
Sánchez and Melin [7] (MGNN) | 2 | 98.05% | 94.81% | 79.65%
Sánchez et al. [38] (FA) | 2 | 97.40% | 96.82% | 95.45%
Proposed method (MGNN) | 2 | 96.75% | 96.15% | 95.45%
In Table 16, the 4-fold cross-validation results for the ear database are shown, where for each training set 3 images for each person were used.
Table 16.
Experiment 1 | Experiment 2 | Experiment 3 | Experiment 4 | Average
---|---|---|---|---
100% | 100% | 94.81% | 93.51% | 97.07%
In [43], a neural network based on a conjugate gradient algorithm (CGA) and principal component analysis is proposed. In [3], principal component analysis (PCA) and linear discriminant analysis (LDA) are used. In [38], a firefly algorithm is developed to design modular granular neural network architectures. In [39], a modular neural network with a granular approach is used, but in that work the granulation is performed using nonoptimized trainings to assign a complexity level to each person and to form subgranules with persons that have the same complexity level; that method was recommended for databases with a large number of persons. In [4], a comparison of fuzzy edge detectors based on the image recognition rate as a performance index calculated with neural networks is proposed. In Table 17, the comparisons among the optimized results obtained using the proposed method and other optimized works for the face database are presented, where the best, average, and worst values were improved for this database by the proposed method and the firefly algorithm in test #1 (using 8 images); in test #2 (using 5 images), the average is improved only by the proposed method.
Table 17.
Method | Images for training | Best (%) | Average (%) | Worst (%)
---|---|---|---|---
Mendoza et al. [4] (FIS) | 8 | 97.50% | 94.69% | 91.50%
Sánchez et al. [38] (FA) | 8 | 100% | 100% | 100%
Sánchez et al. [39] (MGNNs + complexity) | 8 | 100% | 99.27% | 98.61%
Proposed method | 8 | 100% | 100% | 100%
Azami et al. [43] (CGA + PCA) | 5 | 96.5% | 95.91% | 95.37%
Ch'Ng et al. [3] (PCA + LDA) | 5 | 96.5% | 94.75% | 94%
Sánchez et al. [38] (FA) | 5 | 99% | 98.30% | 98%
Sánchez et al. [39] (MGNNs + complexity) | 5 | 98.43% | 97.59% | 94.55%
Proposed method | 5 | 99% | 98.5% | 98%
In Table 18, the 5-fold cross-validation results are shown, where for each training set 4 images for each person were used.
Table 18.
Experiment 1 | Experiment 2 | Experiment 3 | Experiment 4 | Experiment 5 | Average
---|---|---|---|---|---
95.42% | 94.58% | 96.67% | 97.92% | 97.92% | 96.50%
In [46], a method based on a scale invariant feature transform (SIFT) is proposed. In Table 19, the comparisons among the results obtained using the proposed method and that work for the FERET database are presented.
Table 19.
Method | Number of persons | Number of images | Recognition rate
---|---|---|---
Wang et al. [46] (SIFT) | 50 | 7 | 86%
Proposed method | 50 | 7 | 98%
Wang et al. [46] (SIFT) | 100 | 7 | 79.7%
Proposed method | 100 | 7 | 92.33%
Wang et al. [46] (SIFT) | 150 | 7 | 79.1%
Proposed method | 150 | 7 | 92%
Wang et al. [46] (SIFT) | 200 | 7 | 75.7%
Proposed method | 200 | 7 | 88.17%
In Table 20, the 5-fold cross-validation results are shown, where for each training set 4 images for each person were used.
Table 20.
Number of persons | Experiment 1 | Experiment 2 | Experiment 3 | Experiment 4 | Experiment 5 | Average
---|---|---|---|---|---|---
50 | 93.33% | 95.33% | 94.00% | 94.67% | 94.67% | 94.40% |
100 | 83.67% | 88.33% | 89.00% | 91.33% | 92.00% | 88.87% |
150 | 79.78% | 86.44% | 87.78% | 90.22% | 89.33% | 86.71% |
200 | 76.17% | 83.00% | 82.83% | 84.50% | 85.83% | 82.47% |
In [44] and [37], a hierarchical genetic algorithm and a firefly algorithm are, respectively, proposed to optimize modular granular neural networks using the iris as a biometric measure. The main difference with these works is that in neither of them is there a subdivision of each image, as there is in the proposed method, where submodules are experts in parts of the image. In Table 21, the comparison between the optimized results obtained using the proposed method and the other optimized works is presented.
Table 21.
In Table 22, the 5-fold cross-validation results are shown, where for each training set 11 images for each person were used.
Table 22.
Experiment 1 | Experiment 2 | Experiment 3 | Experiment 4 | Experiment 5 | Experiment 6 | Average
---|---|---|---|---|---|---
98.27% | 99.13% | 98.27% | 96.97% | 97.84% | 96.97% | 97.91%
4. Statistical Comparison of Results
The results obtained by the proposed method appear to be better than those of the other works; statistical t-tests are now performed in order to verify whether there is enough evidence to say that the results of the proposed method are better. In these t-tests, the recognition rates and errors previously presented were used.
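As an illustration of the kind of two-sample t-test reported in Tables 23–28, the sketch below uses SciPy and assumes unequal variances; the sample values are placeholders, not data from this paper.

```python
from scipy import stats

# Recognition rates (%) over the trials of two methods (placeholder values only)
proposed  = [100.0, 100.0, 100.0, 100.0, 100.0]
reference = [99.27, 99.50, 98.61, 99.75, 99.22]

t_value, p_value = stats.ttest_ind(reference, proposed, equal_var=False)
print(f"t = {t_value:.2f}, p = {p_value:.4f}")
# Improvement is claimed when |t| exceeds the critical value for the chosen
# significance level (equivalently, when the p value is small enough).
```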
4.1. Statistical Comparison for Test #1
In Table 23, the values obtained in the t-tests between the proposed method and [7] and [38] are shown, where the t-values were, respectively, 1.38 and 1.41; this means that there is not sufficient evidence to say that the ear results for test #1 were improved with the proposed method.
Table 23.
Method | N | Mean | Standard deviation | Standard error of the mean | Estimated difference | t-value | P value | Degrees of freedom
---|---|---|---|---|---|---|---|---
Sánchez and Melin [7] (MGNN) | 30 | 0.0030 | 0.0121 | 0.0022 | 0.003 | 1.38 | 0.1769 | 29
Proposed method | 30 | 0 | 0 | 0 | | | |
Sánchez et al. [38] (MGNN) | 30 | 0.00108 | 0.00421 | 0.00077 | 0.001082 | 1.41 | 0.169 | 29
Proposed method | 30 | 0 | 0 | 0 | | | |
In Figure 31, the distribution of the samples is shown, where it can be observed that the samples are very close to each other.
For the ORL database in test #1, the values obtained in the t-tests between the proposed method and [4, 39] are shown in Table 24. The t-values were 4.12 and 2.42; this means that there is sufficient evidence to say that the results were improved using the proposed method. In Figure 32, the distribution of the samples is shown. It can be observed that the samples of [39] are widely separated from each other.
Table 24.
Method | N | Mean | Standard deviation | Standard error of the mean | Estimated difference | t-value | P value | Degrees of freedom
---|---|---|---|---|---|---|---|---
Mendoza et al. [4] (MG + FIS2) | 4 | 94.69 | 2.58 | 1.3 | −5.31 | −4.12 | 0.026 | 3
Proposed method | 4 | 100 | 0 | 0 | | | |
Sánchez et al. [39] (MGNNs + complexity) | 5 | 99.27 | 0.676 | 0.30 | −0.73 | −2.42 | 0.072 | 4
Proposed method | 5 | 100 | 0 | 0 | | | |
For the FERET database, the different values obtained in the t-test between the proposed method and [46] are shown in Table 25. The t-value was 4.24; this means that there is sufficient evidence to say that the results were improved using the proposed method. In Figure 33, the distribution of the samples is shown.
Table 25.
Method | N | Mean | Standard deviation | Standard error of the mean | Estimated difference | t-value | P value | Degrees of freedom
---|---|---|---|---|---|---|---|---
Wang et al. [46] (SIFT) | 4 | 80.13 | 4.29 | 2.1 | −12.50 | −4.24 | 0.00547 | 6
Proposed method | 4 | 92.63 | 4.05 | 2.0 | | | |
For the iris database, the different values obtained in the t-test between the proposed method and [44] and [37] are shown in Table 26. The t-values were, respectively, 3.18 and 5.62; this means that there is sufficient evidence to say that the results were improved using the proposed method.
Table 26.
Method | N | Mean | Standard deviation | Standard error of the mean | Estimated difference | t-value | P value | Degrees of freedom
---|---|---|---|---|---|---|---|---
Sánchez and Melin [44] | 20 | 98.68 | 0.779 | 0.17 | −0.624 | −3.18 | 0.0035 | 29
Proposed method | 20 | 99.30 | 0.407 | 0.091 | | | |
Sánchez et al. [37] | 20 | 98.22 | 0.758 | 0.17 | −1.083 | −5.62 | 1.8623E − 06 | 38
Proposed method | 20 | 99.30 | 0.407 | 0.091 | | | |
In Figure 34, the distribution of the samples is shown. It can be observed that samples of [44] are more separated from each other than in [37].
4.2. Statistical Comparison for Test #2
In Table 27, the values obtained in the t-tests between the proposed method and [7] and [38] for the ear database are shown, where the t-values were, respectively, 2.09 and −5.70; this means that there is sufficient evidence to say that the ear results were improved with the proposed method only with respect to [7].
Table 27.
Method | N | Mean | Standard deviation | Standard error of the mean | Estimated difference | t-value | P value | Degrees of freedom
---|---|---|---|---|---|---|---|---
Sánchez and Melin [7] (MGNN) | 30 | 0.0518 | 0.0345 | 0.0063 | 0.01328 | 2.09 | 0.045 | 29
Proposed method | 30 | 0.03853 | 0.00449 | 0.00082 | | | |
Sánchez et al. [38] (FA) | 30 | 0.03182 | 0.00462 | 0.00084 | −0.00671 | −5.70 | 4.1926E − 07 | 57
Proposed method | 30 | 0.03853 | 0.00449 | 0.00082 | | | |
In Figure 35, the distribution of the samples is shown, where it can be observed that the samples of [7] and the proposed method are closer to each other than those of the proposed method and [38]. The distribution of the samples of the proposed method and [38] seems to be uniform.
The values obtained in the t-tests for the face database between the proposed method and [43], [3], [38], and [39] are shown in Table 28. The t-values were, respectively, 8.96, 5.90, 0.67, and 1.15; this means that only in comparison with [3, 43] is there sufficient evidence to say that the face results were improved using the proposed method.
Table 28.
Method | N | Mean | Standard deviation | Standard error of the mean | Estimated difference | t-value | P value | Degrees of freedom
---|---|---|---|---|---|---|---|---
Azami et al. [43] (CGA + PCA) | 5 | 95.91 | 0.409 | 0.18 | −2.590 | −8.96 | 1.9091E − 05 | 8
Proposed method | 5 | 98.50 | 0.500 | 0.22 | | | |
Ch'Ng et al. [3] (PCA + LDA) | 4 | 94.75 | 1.19 | 0.60 | −3.750 | −5.90 | 0.004 | 3
Proposed method | 5 | 98.50 | 0.500 | 0.22 | | | |
Sánchez et al. [38] (FA) | 5 | 98.30 | 0.447 | 0.20 | −0.20 | −0.67 | 0.523 | 8
Proposed method | 5 | 98.50 | 0.500 | 0.22 | | | |
Sánchez et al. [39] (MGNNs + complexity) | 5 | 97.59 | 1.71 | 0.76 | −0.94 | −1.15 | 0.314 | 4
Proposed method | 5 | 98.50 | 0.500 | 0.22 | | | |
In Figure 36, the distribution of the samples is shown, where it can be observed that the samples are very close between the proposed method and [38, 39].
5. Conclusions
In this paper, the design of modular granular neural network architectures using a grey wolf optimizer is proposed. The design of these architectures consists of the number of modules, the percentage of data for the training phase, the error goal, the learning algorithm, the number of hidden layers, and their respective numbers of neurons. As its objective function, the optimizer seeks to minimize the recognition error when the proposed method is applied to human recognition, where benchmark databases of ear, face, and iris biometric measures were used to prove the effectiveness of the proposed method. Statistical comparisons were performed to determine whether there is sufficient evidence of improvement using the proposed method, mainly against previous works in which a hierarchical genetic algorithm and a firefly algorithm were developed and MGNNs were also used, although more comparisons with other works were also performed. As a conclusion, the proposed method has been shown to improve recognition rates in most of the comparisons, especially when the granular approach is not used. An improvement provided by the grey wolf optimizer over the genetic algorithm and the firefly algorithm lies in the fact that the former keeps the three best solutions (alpha, beta, and delta), and the other search agents update their positions based on them; in contrast, the genetic algorithm only has a single best solution in each iteration, and the firefly algorithm updates the positions of the fireflies by evaluating pairs of fireflies, where, if one firefly is not better than the other, its move is random. This allows the GWO to have greater stability in its trials and in its results. It is important to mention that the results shown in this work were obtained using different databases; this proves that the proposed method is easily adaptable to the number of persons and the number of images, independently of the biometric measure used. In future works, the proposed method will seek to reduce the complexity of the MGNN architectures and to minimize the percentage of information and the number of subgranules needed to design MGNNs.
Conflicts of Interest
The authors declare that there are no conflicts of interest regarding the publication of this paper.
References
- 1.Jain A. K., Nandakumar K., Ross A. 50 years of biometric research: accomplishments, challenges, and opportunities. Pattern Recognition Letters. 2016;79:80–105. doi: 10.1016/j.patrec.2015.12.013. [DOI] [Google Scholar]
- 2.Ross A., Jain A. K. Human recognition using biometrics: an overview. Annals of Telecommunications. 2007;62:11–35. [Google Scholar]
- 3.Ch'Ng S. I., Seng K. P., Ang L.-M. Modular dynamic RBF neural network for face recognition. Proceedings of the 2012 IEEE Conference on Open Systems, ICOS 2012; October 2012; mys. [DOI] [Google Scholar]
- 4.Mendoza O., Melin P., Castillo O., Castro J. R. Comparison of fuzzy edge detectors based on the image recognition rate as performance index calculated with neural networks. Studies in Computational Intelligence. 2010;312:389–399. doi: 10.1007/978-3-642-15111-8_24. [DOI] [Google Scholar]
- 5.Patil A. M., Patil D. S., Patil P. Iris recognition using gray level co-occurrence matrix and hausdorff dimension. International Journal of Computer Applications. 2016;133(8):29–34. doi: 10.5120/ijca2016907953. [DOI] [Google Scholar]
- 6.Gutierrez L., Melin P., Lopez M. Modular neural network integrator for human recognition from ear images. Proceedings of the International Joint Conference on Neural Networks, IJCNN 2010; July 2010; Barcelona, Spain. [DOI] [Google Scholar]
- 7.Sánchez D., Melin P. Optimization of modular granular neural networks using hierarchical genetic algorithms for human recognition using the ear biometric measure. Engineering Applications of Artificial Intelligence. 2014;27:41–56. doi: 10.1016/j.engappai.2013.09.014. [DOI] [Google Scholar]
- 8.Agrawal M., Raikwar T. Speech recognition using signal processing techniques. International Journal of Engineering and Innovative Technology. 2016;5(8):65–68. [Google Scholar]
- 9.Soni M., Gupta S., Rao M. S., Gupta P. Vein pattern-based verification system. International Journal of Computer Science and Information Security. 2010;8(1):58–63. [Google Scholar]
- 10.Bakshe R. C., Patil A. M. Hand geometry as a biometric for human identification. International Journal of Science and Research. 2015;4(1):2744–2748. [Google Scholar]
- 11.Khuwaja G. A., Laghari M. S. Offline handwritten signature recognition. World Academy of Science, Engineering and Technology. 2011;59:1300–1303. [Google Scholar]
- 12.Jhapate M., Dixit M. An Efficient Human Identification on the Biometric Gait Recognition System using the Inner Angle of the Triangle. International Journal of Computer Applications. 2016;136(13):19–22. doi: 10.5120/ijca2016908451. [DOI] [Google Scholar]
- 13.Heaton J. Deep Learning and Neural Networks. Vol. 3. CreateSpace Independent Publishing Platform; 2015. [Google Scholar]
- 14.Iovine J. Understanding Neural Networks The Experimenter's Guide. 2nd. Images; 2012. [Google Scholar]
- 15.Zadeh L. A., Kacprzyk J. Fuzzy Logic for the Management of Uncertainty. Wiley-Interscience: 1992. [Google Scholar]
- 16.Pietikäinen M., Hadid A., Zhao G., Ahonen T. Computer Vision Using Local Binary Patterns. Vol. 40. Springer; 2011. [DOI] [Google Scholar]
- 17.Pedrycz W., Chen S.-M. Granular computing and intelligent systems: Design with Information Granules of Higher Order and Higher Type. Intelligent Systems Reference Library. 2011;13 doi: 10.1007/978-3-642-19820-5. [DOI] [Google Scholar]
- 18.Yao Y. Perspectives of granular computing. Proceedings of the 2005 IEEE International Conference on Granular Computing; July 2005; Beijing, China. pp. 85–90. [DOI] [Google Scholar]
- 19.Lin T. Y., Yao Y. Y., Zadeh L. A. Data Mining, Rough Sets and Granular Computing. Physica. 2002 [Google Scholar]
- 20.Ashlock D. Evolutionary Computation for Modeling and Optimization. Springer; 2006. [Google Scholar]
- 21.Simon D. Evolutionary Optimization Algorithms. John Wiley & Sons; 2013. [Google Scholar]
- 22.Holland J. H. Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence. Oxford, UK: University of Michigan Press; 1975. [Google Scholar]
- 23.Man K. F., Tang K. S., Kwong S. Genetic Algorithms: Concepts and Designs. London, UK: Springer; 1999. [DOI] [Google Scholar]
- 24.Dorigo M. Optimization, Learning and Natural Algorithms [Ph.D. thesis]. Milano, Italy: Politecnico di Milano; 1992. [Google Scholar]
- 25.Mech L. D. Alpha status, dominance, and division of labor in wolf packs. Canadian Journal of Zoology. 1999;77(8):1196–1203. doi: 10.1139/cjz-77-8-1196. [DOI] [Google Scholar]
- 26.Xin-She Y., Xingshi H. Bat algorithm: literature review and applications. International Journal of Bio-Inspired Computation. 2013;5(3):141–149. doi: 10.1504/IJBIC.2013.055093. [DOI] [Google Scholar]
- 27.Mirjalili S., Mirjalili S. M., Lewis A. Grey wolf optimizer. Advances in Engineering Software. 2014;69:46–61. doi: 10.1016/j.advengsoft.2013.12.007. [DOI] [Google Scholar]
- 28.Geem Z. W., Yang X.-S., Tseng C.-L. Harmony search and nature-inspired algorithms for engineering optimization. Journal of Applied Mathematics. 2013;2013:2. doi: 10.1155/2013/438158.438158 [DOI] [Google Scholar]
- 29.Rashedi E., Nezamabadi-pour H., Saryazdi S. GSA: a gravitational search algorithm. Information Sciences. 2010;213:267–289. doi: 10.1016/j.ins.2009.03.004. [DOI] [Google Scholar]
- 30.Yang X.-S. Firefly algorithms for multimodal optimization. Proceedings of the 5th Symposium on Stochastic Algorithms, Foundations and Applications; 2009; pp. 169–178. [Google Scholar]
- 31.Yang X.-S., He X. Firefly algorithm: recent advances and applications. International Journal of Swarm Intelligence. 2013;1(1):36–50. doi: 10.1504/IJSI.2013.055801. [DOI] [Google Scholar]
- 32.Farooq M. Genetic Algorithm Technique in Hybrid Intelligent Systems for Pattern Recognition. International Journal of Innovative Research in Science, Engineering and Technology. 2015;04(04):1891–1898. doi: 10.15680/IJIRSET.2015.0404012. [DOI] [Google Scholar]
- 33.Hassanin M. F., Shoeb A. M., Hassanien A. E. Grey wolf optimizer-based back-propagation neural network algorithm. Proceedings of the 2016 12th International Computer Engineering Conference (ICENCO); December 2016; Cairo, Egypt. pp. 213–218. [DOI] [Google Scholar]
- 34.Mirjalili S. How effective is the Grey Wolf optimizer in training multi-layer perceptrons. Applied Intelligence. 2015;43(1):150–161. doi: 10.1007/s10489-014-0645-7. [DOI] [Google Scholar]
- 35.Mosavi M., Khishe M., Ghamgosar A. Classification of sonar data set using neural network trained by Gray Wolf Optimization. Neural Network World. 2016;26(4):393–415. doi: 10.14311/NNW.2016.26.023. [DOI] [Google Scholar]
- 36.Parsian A., Ramezani M., Ghadimi N. A hybrid neural network-gray wolf optimization algorithm for melanoma detection. Biomedical Research. 2017;28(8):3408–3411. [Google Scholar]
- 37.Sánchez D., Melin P., Carpio J., Puga H. A firefly algorithm for modular granular neural networks optimization applied to iris recognition. Proceedings of the 2016 International Joint Conference on Neural Networks, IJCNN 2016; July 2016; Vancouver, Canada. pp. 139–144. [DOI] [Google Scholar]
- 38.Sánchez D., Melin P., Castillo O. Optimization of modular granular neural networks using a firefly algorithm for human recognition. Engineering Applications of Artificial Intelligence. 2017;64:172–186. doi: 10.1016/j.engappai.2017.06.007. [DOI] [Google Scholar]
- 39.Sánchez D., Melin P., Castillo O. Optimization of modular granular neural networks using a hierarchical genetic algorithm based on the database complexity applied to human recognition. Information Sciences. 2015;309:73–101. doi: 10.1016/j.ins.2015.02.020. [DOI] [Google Scholar]
- 40.Happel B. L. M., Murre J. M. J. Design and evolution of modular neural network architectures. Neural Networks. 1994;7(6-7):985–1004. doi: 10.1016/S0893-6080(05)80155-8. [DOI] [Google Scholar]
- 41.Li D., Du Y. Artificial Intelligence with Uncertainty. Boca Raton, Fla, USA: Chapman & Hall; 2007. [Google Scholar]
- 42.Zadeh L. A. Some reflections on soft computing, granular computing and their roles in the conception, design and utilization of information/intelligent systems. Soft Computing - A Fusion of Foundations, Methodologies and Applications. 1998;2(1):23–25. doi: 10.1007/s005000050030. [DOI] [Google Scholar]
- 43.Azami H., Malekzadeh M., Sanei S. A new neural network approach for face recognition based on conjugate gradient algorithms and principal component analysis. Journal of mathematics and computer Science. 2013;6:166–175. [Google Scholar]
- 44.Sánchez D., Melin P. Hierarchical modular granular neural networks with fuzzy aggregation. 1st. Springer; 2016. [DOI] [Google Scholar]
- 45.Melin P., Sánchez D., Castillo O. Genetic optimization of modular neural networks with fuzzy response integration for human recognition. Information Sciences. 2012;197:1–19. doi: 10.1016/j.ins.2012.02.027. [DOI] [Google Scholar]
- 46.Wang Y. Y., Li Z. M., Wang L., Wang M. A scale invariant feature transform based method. Journal of Information Hiding and Multimedia Signal Processing. 2013;4(2):73–89. [Google Scholar]
- 47. Database of the Ear Recognition Laboratory, University of Science & Technology Beijing (USTB), http://www.ustb.edu.cn/resb/en/index.htm.
- 48. AT&T Laboratories Cambridge, The ORL Database of Faces, https://www.cl.cam.ac.uk/research/dtg/attarchive/facedatabase.html.
- 49.Phillips P. J., Moon H., Rizvi S. A., Rauss P. J. The FERET evaluation methodology for face-recognition algorithms. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2000;22(10):1090–1104. doi: 10.1109/34.879790. [DOI] [Google Scholar]
- 50. Database of Human Iris. Institute of Automation of Chinese Academy of Sciences (CASIA). Found on the Web page: http://www.cbsr.ia.ac.cn/english/IrisDatabase.asp.
- 51.Viola P., Jones M. Rapid object detection using a boosted cascade of simple features. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition; December 2001; pp. I511–I518. [Google Scholar]
- 52.Viola P., Jones M. J. Robust real-time face detection. International Journal of Computer Vision. 2004;57(2):137–154. doi: 10.1023/B:VISI.0000013087.49260.fb. [DOI] [Google Scholar]
- 53.Masek L., Kovesi P. MATLAB Source Code for a Biometric Identification System Based on Iris Patterns. The School of Computer Science and Software Engineering, The University of Western Australia; 2003. [Google Scholar]