Abstract
In view of the fact that there are disadvantages in that the class number must be determined in advance, the value of learning rates are hard to fix, etc., when using traditional competitive neural networks (CNNs) in electronic noses (E-noses), an optimized CNN method was presented. The optimized CNN was established on the basis of the optimum class number of samples according to the changes of the Davies and Bouldin (DB) value and it could increase, divide, or delete neurons in order to adjust the number of neurons automatically. Moreover, the learning rate changes according to the variety of training times of each sample. The traditional CNN and the optimized CNN were applied to five kinds of sorted vinegars with an E-nose. The results showed that optimized network structures could adjust the number of clusters dynamically and resulted in good classifications.
Keywords: electronic nose, competitive neural networks, optimize
1. Introduction
The volatile odor of substances such as alcohol, tobacco, tea, food, etc. is closely linked to their quality. The electronic nose (EN) imitates an animal’s olfactory mechanism, which tests the volatile smell of food to detect the quality of certain foods. After their development over decades, ENs have become an objective and reliable tool for food quality testing applied to alcohol [1], fruits and vegetables [2], tea [3], meat [4] and other food industry products.
An EN is composed of a group of sensor arrays and some form of pattern recognition algorithm. The single sensor is unable to recognize certain complex odors. In order to increase the measuring accuracy of the sensors, researchers use gas sensors with partial selectivity to constitute an array and adopt an appropriate algorithm. Therefore, pattern recognition plays an important role in EN technology [5].
Presently, the pattern recognition algorithms which are applied to EN can be divided into two types—linear algorithms and nonlinear algorithms—according to the relationship between input variables and output variables. Examples of the former are k-nearest neighbor (k-NN) [6], linear discriminate analysis (LDA) [7–10], cluster analysis (CA) [11], principal component analysis (PCA) [12–17], Least Square Regression (LSR) [18–20] and of the latter, back propagation artificial neural network (BP-ANN) [21–23], probabilistic neural network (PNN) [24,25], Support Vector Machine [26], Radial Basis Function(RBF) [27], and self-organizing map (SOM) [28]. Among these algorithms, the neural network algorithm which is based on a biological neural network composition principle, with its self-organization, self-learning and parallel processing has been used widely in EN applications.
A competitive neural network (CNN) is a neural network clustering method. It has many merits like other neural network algorithms. Moreover, it has the merit that its learning algorithm is simple and fast. Consequently it is used widely in ENs. However, it also has many disadvantages as do most neural network algorithms:
It must determine the number of clusters first, namely fix the number of output neurons.
Once the network is successfully trained, the network will bear in mind the typical pattern. In the future, we can only use the network to identify these same types of samples. If a new sample is encountered, the sample can be attributed to its closest typical class. When the user cannot determine the number of samples in advance, the accuracy of competitive network identification results will be greatly reduced.
It selects initial weights randomly, and sometimes improper selection can lead to a slow convergence and incorrect sorting results.
The selection rule of the learning rate has a conflict between the convergence speed and the stability of the system.
These shortcomings restrict the application of the algorithm in electronic noses. For example, when evaluating the grade of tobacco, spices, and food freshness with electronic noses, the classification number of samples is not predictable and sometimes a new sample is not the same as the original samples stored in the network. However, it is also classified as one of them. Initial weights and learning rates that are selected randomly will make the electronic nose classification generate an undesirable result. In conclusion, there is a perceived need to improve the current competitive neural network algorithms in order to obtain more intelligent, and more practical ones.
This paper presents an open CNN structure which in terms of the DB value [29] determines the number of output neurons, specifically, the best number of clusters. The learning rate adjustment method and the selection of initial weights are also discussed. Finally, the optimized algorithm was applied to the classification of five kinds of vinegar with an EN, and the results showed that the network had a good dynamic classification; the network structure was stable, and quickly converged.
2. Experimental
2.1. Materials and Equipment
The experiment used five different kinds of vinegar samples: Zilin mature vinegar (ZiLin Food Co., Ltd.), Jiangcheng white vinegar (Jilin Brewing Industry Group Co., Ltd.), Lao Caichen aromatic vinegar (Lao Cai Chen Food Co., Ltd.), Liu Biju rice vinegar (Liu Biju Food Co., Ltd.), and Haitian fruit vinegar (Haitian Flavoring Food Co., Ltd.).
In this study, a self-made EN system was used to test these vinegar samples. The core part was the gas sensor array, the specific sensor models included were: TGS 822, TGS 813, TGS 821, TGS 830, TGS 831, TGS 832, TGS 825, TGS 826 (produced by Tian Jin Figaro Electronic Co., China). The response signals of these sensors were in the 0∼5 V range, so the system did not need to amplify the signal. The sensor array was placed in the sample room. The sample room was a transparent 4,000 mL glass bottle, equipped with temperature-humidity sensor and gas mixing device. The system used an integrated HMT323 temperature and humidity sensor, which was produced by Vaisala Co. Its probe is small and flexible and thus easy to install. The measurement ranges of the sensor are: −40 °C to 80 °C, 0–100% RH. The gas mixing device used a small 1W fan to mix gas inside the room. The room was contained good air tightness so that various gas environments could be simulated. A typical data acquisition (DAQ) card iUSBU12086 (produced by HYIEK Automation Inc., Waterloo, Ontario, Canada) was employed as the A/D converter in the system. It can implement 8 Single-Ended, 12-Bit Analog Input Conversion, with a 32 k samples/s rate and a ≤0.1% conversion error. A schematic of the electronic nose is shown in Figure 1.
2.2. Experimental Methods
Before starting the equipment, the system needed to be preheated. When the response signals of these sensors were stable, we took a 10 mL sample of vinegar of each brand and put it into the evaporating dishes, in succession. Next we turned on the built-in fan to speed up the evaporation rate of the gas in order to make the gas concentrations in the sample room more uniform. In 40 seconds, the sensor signal has an obvious ascendant tendency; the data was collected and transferred to the computer through the data acquisition card. This data collection was maintained for 2 minutes, meanwhile, the switch of the fan was also controlled according to the data from the integrated temperature and humidity sensor lest any great change in temperature and humidity affect the results of the experiments. In the end, the data during the stable response was selected as the characteristic value. The characteristic value was first normalized, and then the data was put into the pattern recognition algorithm for classification. After each test, the system kept a fan on for a while, to reduce the adsorption of the previous sample on the sensor array and in order to prepare for the next test.
3. Competitive Neural Network
Competitive neural networks imitate excitement, competition, inhibition and other mechanisms in biological neural networks to establish the network. The mode involves unsupervised network training, with parallel processing, simple learning algorithms, self-organization, and self-adaptive capacity, etc. The specific structure is shown in Figure 2.
The competitive neural network is composed of two layers. The first layer is the input layer; the number of neurons is the same as the dimension of input samples. The second layer is the output layer, also known as the competitive layer. The neurons in this layer are the same number as the kinds of the samples. The network structure has a two-way connection. The connective weights can be represented as W = (wij, i = 1, 2, ... m; j = 1, 2, ..., n), where wij represents the competitive weight of the input neuron i and the competitive neuron j. The specific learning methods are:
Confirm the specific network structure: fix the number of the neurons in the input layer and the competitive layer, and then, the weights and the learning rate are assigned the random numbers in [0, 1] as the initial value.
- Supposing that the data of the input sample is vector: X = [×1, ×2... ×n]T, we can calculate the Euclidean distance for all neurons in two layers:
(1) - The output neuron which has the minimum value is the winner. Then the weights which are connected to it are adjusted to a favorable direction for its future success. This is seen in Formula (2):
(2) Calculate the value of the error function Et : Et = Σ[w(t) –w(t + 1)]2, if the value is less than the given threshold, the training is stopped, otherwise return to Step (2), until it meets the minimum error value.
Ultimately, each network layer weight vector of neurons is adjusted to the nearest value of a certain type of input vectors. When the test sample is put in, the network will attribute it to the closest of its kind. According to the experimental data, we selected the feature data of four kinds of samples (Zilin mature vinegar, Jiangcheng white vinegar, Lao Caichen aromatic vinegar, Liu Biju rice vinegar.), then put the data into the traditional competition in order to be classified. Figure 3 is the result when the network used random initial weight values, took a fixed learning rate η = 0.4, and the given number of output neurons was four. Figure 3 shows how the test samples of Zilin mature vinegar, Jiangcheng white vinegar, Lao Caichen aromatic vinegar and Liu Biju rice vinegar were put into the network. The samples of Lao Caichen aromatic vinegar and Jiangcheng white vinegar were judged as the same sample.
Figure 4 shows the error convergence of the traditional CNN, because the initial weight and the learning rate were selected randomly, the value of error function dropped extremely slowly, even if the training times reached the maximum (3,000), it still could not achieve the objective error ɛ = 0.001 .
The initial weight and the learning rate were adjusted until the network could classify the samples correctly. After saving the adjusted network, the fifth sample (Haitian fruit vinegar) was identified. The sample was not recognized as a new class, but rather was classified into the Liu Biju rice vinegar group as shown in Figure 5.
4. Method of Optimizing the Competitive Neural Network
To optimize it this paper presents an open competitive neural network architecture style. The specific learning methods are as follows.
4.1. Confirm Initial Connection Weights
Firstly, the initial connection weights have a great influence on the convergence and learning rates. If the learning vector is a finite part of the whole pattern space, while the connection weights are distributed randomly in all directions, there will be many differences between the input and the weight vectors which will result in the convergence rate slowing or not converging. Therefore, this design gave all wij (i = 1, 2, ... m; j = 1, 2, ... n) the same initial value. Because of this, the initial values are close to the normalized characteristic values of each sample, thus reducing the time of the input vector selecting the weight vector in the initial stage to enhance the rate of adjustment of the weight vector.
4.2. Adjustment of Learning Rate
Learning rate η refers to the rate of change of connecting weight vectors to the input sample. Learning rate affects the training results of the network greatly. According to the results of a large number of experiments, it is known that if η is too small, it will result in the convergence rate slowing, however, if it is too big, it will cause the structure of the network to become unsteady, so we made ηa function, shown in Equation 3, as follows: it can make η small in the beginning stage, with the training times increasing η augmented slowly step by step, in the end, the value of η is decreased gradually.
(3) |
where t is for the current training times, c is the number of training required for each sample, T is the total times for the training, N is the number of categories for the current sample. The result of the experiment shows that the adjustment method of the learning rate not only could stabilize the structure of the network but also could ensure fast convergence.
4.3. Adjust the Number of Neurons
Here we introduced the DB value which was proposed by Davies and Bouldin and used to determine the optimal clustering of a number. The specific definition of DB is:
(4) |
where n is the number of clusters, di (j) is the average distance between class i (j) samples and their cluster centers ci (j), d (ci, cj) is the distance between the cluster center ci and cj. The cluster center of each class is the farther, and the most effective is the better. When the DB value reaches the minimum, the classification effect is the best.
The most appropriate number of output neurons is determined according to the DB values, then merging, splitting or deleting the output neurons can occur. The concrete method is executed as follows:
- The method of merging neurons is that the comparability of the weight vectors is computed first. If the value of the comparability exceeds a certain threshold, we merge the two output neurons. The comparability is calculated as follows:
(5) (6)
The weights of merged neuron are showed as follows:(7)
where ni and nj represent the number of i samples and j samples, respectively.(8) - The method of splitting neurons is that the split neuron which has the largest volume of super ball [30] into two neurons; the ball’s volume is computed by Formula (9):
where k is the category mark of xi. On the assumption that wm is selected to be split, the new divided neuron weight vector wm1 and wm2 are shown as:(9) (10)
where θ is an empirical constant in (0,1), σk is the variance vector of splitting neurons.(11) The method of deleting the neuron is to remove the one which doesn’t have any samples, then moving the other neurons to new locations, and the number of clusters decreases.
- When the network structure is stored, and a new sample is added, firstly, we compute the comparability between the new sample and former sample as seen in Formula 5, if the value of the comparability exceeds a certain threshold, determine it to belong to this kind, otherwise, add a new neuron:
(12)
4.4. Main Steps of Optimized Competitive Neural Network
The main steps of the optimized structure of neural network are as follows:
Give the number of output neurons the initial value N.
Set values of initial weight wij.
The samples are classified by the traditional competitive neural network. Firstly, compute the comparability between the sample and any weights, if the values are all very small, increase a new neuron as Formula (12), and then go to Step (3), otherwise, go to Step (4).
If the output neurons do not have a corresponding sample, delete the node and reduce the number of output neurons, then repeat Steps (3), otherwise go to Step (5)
Calculate the value of the current DB (k). If DB(k) –DB(k –1) > α (α is empirical value), then go to Step (6). Otherwise calculate the error function of each weight, if it reaches the threshold value, the algorithm will stop, otherwise, go to Step (3).
Calculate the comparability among the weights of all neurons, if the comparability is greater than the threshold, combine neurons as seen in Formula (8), and the number of category is reduced 1, after that go to Step (3), otherwise, go to Step (7).
Calculate the volume of super ball, and choose the largest one to split the node according to Formula (10), (11), and the number of categories is reduced 1, and then go to step (3).
5. Application of the Optimized Competitive Neural Network
Set the parameters θ = 0.1, ɛ = 0.001, comparability λ = 0.65, threshold of DB α = 0.028, initial number of clusters N = 2, the total training time T = 3000. Because the parameters have been changed, the network structure is entirely different from the previous traditional CNN, that this, it is a new network. In succession, we confirmed the validity of the optimized CNN as follows:
First, we selected four kind samples (Zilin mature vinegar, Jiangcheng white vinegar, Lao Caichen aromatic vinegar, Liu Biju rice vinegar), the same as the traditional CNN, and put their feature data into the optimized network. The samples were separated completely. The results are shown in Figure 6.
The number variation of the output nodes is shown in Figure 7. It shows that during the number of output nodes, four was the maximum density, finally, the number of output nodes is stably four, indicating that the optimized network can adjust well to the number of categories.
Figure 8 shows that the DB value achieved a stable minimum through continuous adjustment, namely, the number of categories was the most reasonable at this time.
Figure 9 shows that the optimized learning rate was changed along with the training times. The revised direction was corrected in the training process, which made the value of error function decrease rapidly, and the training objective was reached in step 459. The training speed of the optimized network was much faster than the traditional CNN (Figure 4).
After saving the network, the fifth sample (Haitian fruit vinegar) was identified. The results are shown in Figure 10. The new sample was correctly determined as a new category with the optimized network by judging the comparability, so the optimization purpose was reached.
In order to verify the stability and the reliability of the network, we made a repeated independent measurement trial with a new set of samples (Zilin mature vinegar, Jiangcheng white vinegar, Liu Biju rice vinegar Haitian fruit vinegar), and the classifying result is shown as Figure 11.
After the network was saved, we put the fifth sample (Lao Caichen aromatic vinegar) into the network as seen above, and the classifying result is shown in Figure 12. We can see the performance of the e-nose coupled to the Optimized Competitive Neural Network method is good.
6. Conclusions
The paper introduces an optimized competition neural network implemented by setting the initial weights, adjusting the learning rates, and adjusting the number of neurons according to the DB value. The optimized CNN was used to recognize the vinegar samples with EN and received a good classification effect. Therefore, the optimized algorithm can be applied in EN and make EN more intelligent.
Acknowledgments
This work was supported by Jilin Province Education Department Research Program of China (2011, NO.79) to Hong Men and Haiping Zhang.
References
- 1.Santonico M, Bellincontro A, Santis DD, Di Natale C, Mencarelli F. Electronic nose to study postharvest dehydration of wine grapes. Food Chem. 2010;3:789–796. [Google Scholar]
- 2.Gómez AH, Wang J, Hu G, Pereira AG. Monitoring storage shelf life of tomato using electronic nose technique. J. Food Eng. 2008;4:625–631. [Google Scholar]
- 3.Tudu B, Jana A, Metla A, Ghosh D, Bhattacharyya N, Bandyopadhyay R. Electronic nose for black tea quality evaluation by an incremental RBF network. Sens. Actuat. B Chem. 2009;1:90–95. [Google Scholar]
- 4.El Barbri N, Mirhisse J, Ionescu R, El Bari N, Correig X, Bouchikhi B, Llobet E. An electronic nose system based on a micro-machined gas sensor array to assess the freshness of sardines. Sens. Actuat. B Chem. 2009;2:538–543. [Google Scholar]
- 5.Bicego M, Tessari G, Tecchiolli G. A comparative analysis of basic pattern recognition techniques for the development of small size electronic nose. Sens. Actuat. B Chem. 2002;85:137–144. [Google Scholar]
- 6.Yolanda GM, Concepcion CO, Jose LPP. Electronic nose based on metal oxide semiconductor sensors and pattern recognition techniques: Characterization of vegetable oils. Anal. Chim. Acta. 2001;449:69–80. [Google Scholar]
- 7.Delpha C, Lumbreras M, Siadat M. Discrimination and identification of a refrigerant gas in a humidity controlled atmosphere containing or not carbon dioxide: Application to the electronic nose. Sens. Actuat. B Chem. 2004;98:46–53. [Google Scholar]
- 8.Zakaria A, Shakaff AYM, Adom AH, Ahmad M, Masnan MJ, Aziz AHA, Fikri NA, Abdullah AH, Kamarudin LM. Improved Classification of Orthosiphon stamineus by Data Fusion of Electronic Nose and Tongue Sensors. Sensors. 2010;10:8782–8796. doi: 10.3390/s101008782. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Burian C, Brezmes J, Vinaixa M, Cañellas N, Llobet E, Vilanova X, Correig X. MS-electronic nose performance improvement using the retention time dimension and two-way and three-way data processing methods. Sens. Actuat. B Chem. 2010;2:759–768. [Google Scholar]
- 10.Musatov VY, Sysoev VV, Sommer M, Kiselev I. Assessment of meat freshness with metal oxide sensor microarray electronic nose: A practical approach. Sens. Actuat. B Chem. 2010;1:99–103. [Google Scholar]
- 11.Xu Z, Shi X, Lu S. Integrated sensor array optimization with statistical evaluation. Sens. Actuat. B Chem. 2010;149:239–244. [Google Scholar]
- 12.Hernández GA, Wang J, Hu G, Pereira AG. Monitoring storage shelf life of tomato using electronic nose technique. J. Food Eng. 2008;4:625–631. [Google Scholar]
- 13.Wongchoosuk C, Lutz M, Kerdcharoen T. Detection and classification of human body odor using an electronic nose. Sensors. 2009;9:7234–7249. doi: 10.3390/s90907234. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Hidayat W, Shakaff AY, Ahmad MN, Hamid-Adom A. Classification of agar wood oil using an electronic nose. Sensors. 2010;10:4675–4685. doi: 10.3390/s100504675. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Rodriguez SD, Monge ME, Olivieri AC, Negri RM, Bernik DL. Time dependence of aroma pattern emitted by an encapsulated essence studied by means of electronic nose and chemo metric analysis. Food Res. Int. 2010;3:797–804. [Google Scholar]
- 16.Zampetti E, Pantalei S, Scalese S, Bearzotti A, Cesare F de, Spinella C, Macagnano A. Biomimetic sensing layer based on electrospun conductive polymer webs. Biosens. Bioelectron. 2011;5:2460–2465. doi: 10.1016/j.bios.2010.10.032. [DOI] [PubMed] [Google Scholar]
- 17.Ziyatdinov A, Marco S, Chaudry A, Persaud K, Caminal P, Perera A. Drift compensation of gas sensor array data by common principal component analysis. Sens. Actuat. B Chem. 2010;2:460–465. [Google Scholar]
- 18.Tikk K, Haugen JE, Andersen HJ, Aaslyng MD. Monitoring of warmed-over flavour in pork using the electronic nose—Correlation to sensory attributes and secondary lipid oxidation products. Meat Sci. 2008;80:1254–1263. doi: 10.1016/j.meatsci.2008.05.040. [DOI] [PubMed] [Google Scholar]
- 19.Song S, Zhang X, Hayat K, Jia C, Xia S, Zhong F, Xiao Z, Tian H, Niu Y. Correlating chemical parameters of controlled oxidation tallow to gas chromatography-mass spectrometry profiles and e-nose responses using partial least squares regression analysis. Sens. Actuat. B Chem. 2010;147:660–668. [Google Scholar]
- 20.Sohn JH, Atzeni M, Zeller L, Pioggia G. Characterisation of humidity dependence of a metal oxide semiconductor sensor array using partial least squares. Sens. Actuat. B Chem. 2008;131:230–235. [Google Scholar]
- 21.Bucak İÖ, Karlık B. Hazardous odor recognition by CMAC based neural networks. Sensors. 2009;9:7308–7319. doi: 10.3390/s90907308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Bhattacharya N, Tudu B, Jana A, Ghosh D, Bandhopadhyaya R, Bhuyan M. Preemptive identification of optimum fermentation time for black tea using electronic nose. Sens. Actuat. B Chem. 2008;1:110–116. [Google Scholar]
- 23.Markom MA, Shakaff AY, Adom AH, Ahmad MN, Hidayat W, Abdullah AH, Fikri NA. Intelligent electronic nose system for basal stem rot disease detection. Comput. Electron. Agric. 2009;66:140–146. [Google Scholar]
- 24.Aleixandre M, Lozano J, Gutiérrez J, Sayago I, Fernández MJ, Horrillo MC. Correlating chemical parameters of controlled oxidation tallow to gas chromatography-mass spectrometry profiles and e-nose responses using partial least squares regression analysis. Sens. Actuat. B Chem. 2008;131:71–76. [Google Scholar]
- 25.Dutta R, Hines EL, Gardner JW. Tea quality prediction using a tin oxide-based electronic nose: an artificial intelligence approach. Sens. Actuat. B Chem. 2003;94:228–237. [Google Scholar]
- 26.Wang X, Ye M, Duanmu CJ. Classification of data from electronic nose using relevance vector machines. Sens. Actuat. B Chem. 2009;140:143–148. [Google Scholar]
- 27.Yin Y, Yu H, Zhang H. A feature extraction method based on wavelet packet analysis for discrimination of Chinese vinegars using a gas sensors array. Sens. Actuat. B Chem. 2008;134:1005–1009. [Google Scholar]
- 28.Sohn JH, Pioggia G, Craig IP, Stuetz RM, Atzeni MG. Identifying major contributing sources to odour annoyance using a non-specific gas sensor array. Sens. Actuat. B Chem. 2009;102:305–312. [Google Scholar]
- 29.Davis DL, Boulin DW. A cluster separation measure. IEEE Trans Patt Anal Mach Int. 1979;PAMI-1:224–227. [PubMed] [Google Scholar]
- 30.Gath I, Geve AB. Unsupervised Optimal Fuzzy Clustering. IEEE Trans. Patt. Anal. Mach. Int. 1989;11:773–880. [Google Scholar]