Abstract
Due to the high degree of strong coupling and nonlinearity of marine lysozyme fermentation process, it is difficult to accurately model the mechanism. In order to achieve real‐time online measurement and effective control of bacterial concentration during fermentation, a generalized predictive control method based on least squares support vector machines is proposed. The particle swarm optimization least squares support vector machine (PSO‐LS‐SVM) model of lysozyme concentration is established by optimizing the regularization parameters and the kernel parameters of the least squares support vector machine by particle swarm optimization. To avoid the nonlinear problems in predictive control, the model is linearized at each sampling point and the generalized predictive algorithm is used to predict the bacteria concentration of lysozyme. The experimental simulation shows that the least squares support vector machine model with particle swarm optimization can achieve good prediction effect. The linearized model performs generalized predictive control, which makes the total activity of the enzyme increased from 60% to 80% and the yield improved by 30%.
Keywords: bacteria concentration, generalized predictive control, least squares support vector machine, lysozyme, particle swarm optimization
1. INTRODUCTION
Due to the characteristics of nonlinear, time‐varying, and high dimension in the process of microbial fermentation, parameter detection is the key to determine the optimal control point in each stage of fermentation process (Zhong, He, Pi, & Sun, 2005). With the limitation of hardware detection technology, the bacteria concentration of marine lysozyme is difficult to measure online in real time and is prone to large errors when effective control is applied (Boulkaibet, Belarbi, Bououden, Marwala, & Chadli, 2017; Wang et al., 2000). Generalized predictive control (GPC) has strong robustness, which is applicable to stochastic systems and online identification. It has the strategies of moving horizon optimization, multi‐step prediction, and feedback compensation in the predictive control algorithm, which means that the feedback compensation control is obtained by optimizing a certain performance index in the moving finite time interval,while the requirements of the mathematical model are higher when improving the accuracy of predictive control. (Araúzo‐Bravo et al., 2004; Yang, Li, Ding, Guo, & Hao, 2012; Zhu, Liu, Sun, & Wang, 2010). The least squares support vector machine (LS‐SVM) replaces the inequality constraint condition in SVM standard algorithm by using equality constraints and overcomes the dimensionality disaster problem of classical quadratic programming method for solving SVM (Li, Su, & Chu, 2007; Liu, Jiang, & Fang, 2008). In the actual lysozyme fermentation process, the fermentation process is a slow time‐varying process, which does not require high real‐time performance because of the physiological characteristics of lysozyme itself. Therefore, although the lag caused by the LS‐SVM‐based prediction model is longer than that caused by the SVM‐based prediction model, it does not affect the bacterial concentration prediction. (Huang, Zhai, Sui, & Chai, 2010; Suykens & Vandewalle, 1999; Wang, Zhen, & Zhu, 2013). However, the regularization parameter C and the kernel parameter σ of the LS‐SVM model have a great influence on the fitting precision and generalization ability. Particle swarm optimization (PSO) is a population‐based stochastic optimization method, which can simultaneously search for more regions in the solution space of the target function to be optimized, and solves the problem of LS‐SVM parameter selection (Li, Tang, & Liu, 2010; Yan & Cui, 2013). Therefore, this paper proposes a nonlinear model for establishing bacteria concentration after optimization of LS‐SVM using PSO. To avoid solving nonlinear problems in predictive control (Liu, Su, & Zhu, 2004; Mahmoodi, Poshtan, Jahed‐Motlagh, & Montazeri, 2008; Xi, Li, & Lin, 2013), the obtained LS‐SVM nonlinear model is linearized at each sampling point, the generalized predictive control algorithm is used to solve multi‐step prediction and process control is performed on its prediction parameters.
2. BACTERIA CONCENTRATION MODELING ANALYSIS
Lysozyme is an important enzyme preparation, which can hydrolyze mucopolysaccharide in pathogenic biomass. According to the bacteriolytic characteristics of lysozyme, it can be used in medical treatment, food preservation, and bioengineering. Especially in food preservation, it has been widely used in aquatic products, meat products, cakes, sake, wine, and beverages to replace chemically synthesized food preservatives (Ren et al., 2013; Wang et al., 2000; Zhao, Bai, Zhang, & Wu, 2010). However, the bacteria concentration is too high or too low, which can make the fermentation broth viscous or dilute, and the poor mass transfer conditions will make the product enzyme difficult to synthesize in the fermentation process. Therefore, reasonable control of bacteria concentration can increase enzyme activity and yield. Through the in‐depth analysis of the process mechanism, the substrate feed rate has a great influence on the bacteria concentration, and the reasonable feed rate can improve the product activity (Huang, Sun, Sun, Liu, & Nie, 2013; Zhu, He, Sun, & Wang, 2013). The lysozyme concentration model can be expressed in the following nonlinear form:
(1) |
where f(g) represents a complex nonlinear function.
3. ESTABLISHMENT OF LS‐SVM MODEL BASED ON PSO OPTIMIZATION
3.1. Establishment of LS‐SVM model
There is given a training set {x i, y i} with N data, and x i is input data, y i is output data, x i ⊂ R n, y i ⊂ R, i = 1, 2, ···, N.
The LS‐SVM model can use the following functions in the eigenspace:
(2) |
where ϕ(•):R n → R nh is a function that maps the input data of the original space to the higher‐dimensional eigenspace, w is weight vector, δ is constant deviation, w ∊ R nh, δ ∊ R.
The LS‐SVM regression optimization problem is as follows:
(3) |
The constraint is as follows:
(4) |
where e i is error variable, e i ∊ R, C is regularization parameter, C > 0.
In solving the above optimization problem, the Lagrangian function is introduced as:
(5) |
where α i is the Lagrange multiplier, α i ∊ R.
The optimization problem solved according to the KKT condition has the following solution:
(6) |
where y = [y 1, y 2, ···, y N], α = [α 1, α 2, ···, α N], Γij = φ(x i)T φ(x j) = k(x i, x), i, j = 1, 2, ···, N, k(x, x) is a kernel function, I is unit matrix.
In this paper, the Gauss radial basis function (RBF) is used as a kernel function (Lu & Yang, 2007; Zhu, Ling, Wang, Hao, & Ding, 2018). After obtaining δ and α in Equation (6), w can be further calculated, and the nonlinear model obtained by LS‐SVM is as follows:
(7) |
When solving the above equation, the kernel parameter σ and the regularization parameter C have a great influence on the model fitting accuracy and generalization ability, in order to achieve the prediction effect, the two variables need to be PSO optimized.
3.2. PSO‐based parameter optimization
The basic idea of the particle swarm algorithm is to find the optimal solution through information transmission and information sharing among individuals in a group (Gu, Zhao, & Wu, 2010; Yao, Cai, & Zhang, 2009). Assuming that in a D‐dimensional search space, population X = (X 1, X 2, ···, X n) consists of n particles, where the i‐th particle is represented as a D‐dimensional vector X i = (x i1, x i2, ···, x iD)T that is the position of the i‐th particle in the D‐dimensional search space. According to the objective function, the fitness value corresponding to each particle position X i can be calculated, which represents the pros and cons of the particle. The optimal position of the i‐th particle is P i = (P i1, P i2, ···, P iD)T, whose corresponding fitness value is called the individual optimal solution P best,i; the optimal position of the population is P g = (P g1, P g2, …, P gD), whose corresponding fitness value is called the global optimal solution G best,i. The search speed of particle i is V i = (V i1, V i2, ···, V iD)T, and the particle updates its speed and position through individual value and group extremum during the iterative process as follows:
(8) |
where w is the inertia weight, d = 1, 2, ···, n, V id is the particle velocity, c 1 and c 2 are acceleration factors, r 1 and r 2 are random numbers distributed in the range of [0,1].
In order to prevent the blind search of particles, whose position and speed are limited to a certain interval [−X max, X max], [−V max, V max].
3.3. Establishment of LS‐SVM model based on PSO optimization
To sum up, the specific steps of the least squares support vector machine modeling based on PSO are as follows:
A set of {C, σ} is randomly generated to establish the LS‐SVM regression model. The particle swarm dimension is set to 2, the number of particles in each particle swarm is 20, the number of iterations is 150, c 1 = 1.5, c 2 = 1.7, and the regularization parameter C and the kernel parameter σ are selected within the optimization range of 0~2,000 and 0.01~100, respectively.
-
The average absolute percentage error is chosen as the fitness function of the PSO algorithm, whose expression is as follows:
where y i and are the actual value and model prediction value, respectively, and N is the total number of training data.(9) According to the size of each particle value, {C, σ} is substituted into the LS‐SVM reconstruction regression model, and the corresponding fitness value of each particle can be obtained from Equation (9) through the calculation results of the calibration sample.
According to calculating the fitness value of each particle, which is compared with the fitness value of individual optimal solution P best,i and global optimal solution G best,i. If it is better than P best,i and G best,i, update P best,i and G best,i, otherwise keep the original data.
According to the PSO optimization Equations (8) and (9), the velocity and position of the particles are adjusted to produce new species.
Check the end condition. If the condition is satisfied, the optimization is ended; otherwise, go to step (3) until the maximum number of iterations is satisfied.
The LS‐SVM is assigned to the {C, σ} obtained after the optimization is completed. The prediction model is established by using the test data, whose prediction result of the test sample is obtained.
The LS‐SVM model that has been optimized is linearized at the sampling point x 0 by using the Taylor formula, and the linearization model can be obtained through the method as follows:
(10) |
where A(z −1) = 1 + a 1 z −1 + ··· + a n z −n, B(z −1) = 1 + b 1 z −1 + ··· + b m z −m, ∂ is a constant.
4. GENERALIZED PREDICTION ALGORITHM FOR BACTERIA CONCENTRATION
After a simple model transformation, the constant ∂ is discretized, and the following controlled autoregressive integral moving (CARIMA) average model is obtained as follows:
(11) |
where Δ = 1−z −1 is a difference operator, ɛ(t) is an unrelated random sequence that represents the effect of random noise, and the discretized constant ∂ is contained in the random sequence ɛ(t).
After continuing processing according to the standard GPC method (Deng, Huang, Fei, Zhen, & Jiang, 2014; Guo, Chen, Zhu, & Hu, 2002), the multi‐step prediction vector expression that can output the predicted value is as follows:
(12) |
where is the forecast output, U = [Δu(t), ···, Δu(t + L−1)]T is the control increment, F = [f 1(t), f 2(t), ···, f P(t)]T is a vector consisted of the free phases in the output prediction sequence, is the unit step coefficient, P is the prediction time domain, and L is the control time domain.
The moving horizon optimization performance index at t time in GPC takes the following form:
(13) |
where E is the mathematical expectation, w is the expected reference value of the object output, N 1 and N 2 are the initial and final values of the optimization time domain, respectively. λ(j) is a control weighting coefficient that is zero or a very small number, which can be increased until a satisfactory control effect is obtained if the control system is stable, but the control variable changes greatly in the actual selection (Liu, 2007.). The parameter λ(j) is generally set as a constant λ.
The reference trajectory is introduced to track it well for the output value:
(14) |
(15) |
where β is the adjustment factor in interval [0, 1), y r is the reference trajectory, and y s is the set value for next moment.
When W = [w(t + 1), ···, w(t + P)]T, the formula (13) can be expressed as:
(16) |
When , the control amount can be obtained as follows:
(17) |
where d T is the first line of (G T G + λI)−1 G T.
The generalized predictive control block diagram of marine lysozyme bacteria concentration based on LS‐VM is shown in Figure 1.
5. TEST AND RESULT ANALYSIS
The experimental data are from the fermentation control system platform of Jiangsu University. The fermenter model is RT‐100L‐Y, and the fermented variety is lysozyme. Batch fermentation experiments are performed according to the medium formulation provided by the fermentation process. After high‐temperature sterilization of the fermenter steam, the tank pressure is controlled at 0.04 MPa by adjusting the gas output, the temperature is set at 32°C, the stirring speed is 400 r/min, the dissolved oxygen range is 35%–40%, and the pH is set at 7.3. In the experimental fermentation conditions, the control system collects the data of the substrate feed rate f that is obtained by the flow meter every hour and transmits it from the lower computer to the upper computer to form a database (Zhu et al., 2010). Under normal fermentation conditions, the bacteria concentration is measured by dry weight method. The fermentation broth is centrifuged at 20 ml/hr, washed with distilled water, and centrifuged twice; then, it is transferred to a constant‐weight measuring flask, dried to constant weight at 105°C, and weighed; the bacteria concentration (g/L) can be calculated (Sun, Wang, Huang, & Ji, 2010). According to the data collected by the upper computer, a batch of data is taken from one fermentation cycle, and 10 batches of data are extracted. The first nine batches of data are used as the training sample set, and the last batch is used as a test set. The simulation results are shown in Figures 2 and 3.
In the comparison of the prediction models in Figures 2 and 3, the LS‐SVM prediction model based on PSO optimization is obviously better than LS‐SVM model in fitting degree and prediction precision and has good modeling ability. Where the optimized parameters after PSO optimization are C = 508.06 and σ = 8.32.
After data preprocessing, the modeling method introduced in this paper is used to train the data, which is verified the fitting degree and prediction accuracy with test data, and select the root mean square error (RMSE) and maximum absolute error (MAXE) as the evaluation criteria for model prediction accuracy.
(18) |
(19) |
where y i and are the actual value and model prediction value, respectively, and N is the total number of training data. Two types of model simulation results are shown in Table 1.
Table 1.
Prediction model | Bacteria concentration | |
---|---|---|
RMSE | MAXE | |
PSO‐LS‐SVM model | 0.1032 | 0.486 |
LS‐SVM model | 0.7835 | 1.493 |
LS‐SVM: least squares support vector machine; MAXE: maximum absolute error; PSO‐LS‐SVM: particle swarm optimization least squares support vector machine; RMSE: root mean square error.
By using the linearization method of this paper, the established LS‐SVM model is linearized and identified by Taylor:
(20) |
where − 10 ≤ Δu ≤ 10, 10 ≤ u ≤ 55.
During the accelerated and peak period of enzyme production, the cell increased logarithmically, and the activity and yield of the enzyme could be improved by controlling the bacteria concentration in this period. The predictive control is performed at the first 360 min of logarithmic growth period, where the prediction time domain is P = 5, the control time domain is N u = 3, the initial output is u = 15, the initial increment is Δu = 4.5, the initial output is y = 14.5, and the simulation step length is 1 min. In the acceleration period of enzyme production, the bacteria concentration is not too high, which is set to 20 g/L; at the peak period of enzyme production, the growth rate of the bacteria is slowed down due to the rapid consumption of the substrate, which is set to 35 g/L to accelerate the substrate feeding rate and improve the enzyme activity and yield. The simulation results are shown in Figures 4 and 5.
From Figures 4 and 5, it can be seen that the LS‐SVM‐based predictive control outputs the better tracking reference trajectory and the output is relatively stable, and the error between predicted control value and set value is mostly at [−0.5, 0.5], the minority is at [‐1, 1]., which is consistent with the control requirements for the bacteria concentration in practical engineering.
Figures 4 and 5 are model‐based simulation results, so the input can be set as a step signal to study and analyze the performance of the control method. In the actual fermentation process, the bacterial concentration will not be changed rapidly, which is a slow time‐varying process, so the reference input also needs to be a relatively slow rising process. Figures 6 and 7 are control process diagrams of the actual fermentation process.
Under such control, the bacteria concentration grows fast during the growing phase and remains high during the producing phase, which is good for the enzyme productivity. The total activity of the enzyme is increased from 60% to 80%, and the yield is improved by 30% in the actual fermentation process.
6. CONCLUSION
In this paper, the generalized predictive control based on least squares support vector machine is proposed. After the regularization parameter C and kernel parameter σ of the model are optimized by using the particle swarm optimization algorithm, the LS‐SVM model of the bacterial concentration is established, which has high prediction accuracy and high fitting degree. To avoid solving nonlinear problems, the LS‐SVM model is linearized at each sampling point, and the generalized predictive control algorithm is used to solve the multi‐step prediction. The experimental results show that the method has good adaptability and robustness to the control of bacterial concentration in the fermentation process. It can be applied to the control of physicochemical parameters and biological indicators in the general fermentation process.
CONFLICT OF INTEREST
The authors declare that they have no conflict of interests.
ETHICAL STATEMENT
This study does not involve any human or animal testing.
ACKNOWLEDGMENT
This research is especially grateful for the technical and financial support of the Key R&D Program in Zhenjiang City: R&D on soft‐sensing and control of key parameters for microbial fermentation (SH2017002), National Science Research Foundation of CHINA (41376175), Natural Science Foundation of Jiangsu Province (BK20140568, BK20151345), and Natural Science Research Foundation of Higher Education of Jiangsu Province (17KJB510008).
Zhu X, Zhu Z. The generalized predictive control of bacteria concentration in marine lysozyme fermentation process. Food Sci Nutr. 2018;6:2459–2465. 10.1002/fsn3.850
REFERENCES
- Araúzo‐Bravo, M. J. , Cano‐Izquierdo, J. M. , Gómez‐Sánchez, E. , López‐Nieto, M. J. , Dimitriadis, Y. A. , & López‐Coronado, J. (2004). Automatization of a penicillin production process with soft sensors and an adaptive controller based on neuro fuzzy systems. Control Engineering Practice, 12(9), 1073–1090. 10.1016/j.conengprac.2003.11.002 [DOI] [Google Scholar]
- Boulkaibet, I. , Belarbi, K. , Bououden, S. , Marwala, T. , & Chadli, M. (2017). A new T‐S fuzzy model predictive control for nonlinear processes. Expert Systems With Applications, 88, 132–151. 10.1016/j.eswa.2017.06.039 [DOI] [Google Scholar]
- Deng, L. , Huang, Y. , Fei, M. R. , Zhen, M. , & Jiang, J. (2014). Improved generalized predictive control and its application in temperature system. Chinese Journal of Scientific Instrument, 35(5), 1057–1064. [Google Scholar]
- Gu, Y. P. , Zhao, W. J. , & Wu, Z. S. (2010). Least squares support vector machine algorithm. Journal of Tsinghua University, 50(7), 1063–1066. [Google Scholar]
- Guo, J. , Chen, Q. W. , Zhu, R. J. , & Hu, W. L. (2002). Adaptive predictive control of a class of nonlinear system. Control Theory and Applications, 19(1), 68–72. [Google Scholar]
- Huang, Y. H. , Sun, L. N. , Sun, Y. K. , Liu, G. H. , & Nie, W. H. (2013). Soft sensor of generalized dynamic fuzzy neural network for marine protease fermentation process based on dynamic data exchange. Transactions of the Chinese Society of Agricultural Engineering, 29(19), 268–276. [Google Scholar]
- Huang, X. Y. , Zhai, G. J. , Sui, L. F. , & Chai, H. Z. (2010). Study on the detection of abnormal sounding data based on LS‐SVM. Acta Oceanologica Sinica, 29(6), 115–120. 10.1007/s13131-010-0082-3 [DOI] [Google Scholar]
- Li, L. J. , Su, H. Y. , & Chu, J. (2007). Generalized predictive control with online least squares support vector machines. Acta Automatica Sinica, 33(11), 1182–1188. 10.1360/aas-007-1182 [DOI] [Google Scholar]
- Li, Y. J. , Tang, X. J. , & Liu, J. H. (2010). Application of least square support vector machine based on particle swarm optimization in quantitative analysis of gas mixture. Spectroscopy and Spectral Analysis, 30(3), 774–778. [PubMed] [Google Scholar]
- Liu, G. B. (2007). Parameter selection in the generalized predictive control algorithm. Journal of Heilongjiang Institute Technology, 21(2), 68–70. [Google Scholar]
- Liu, B. , Jiang, Z. , & Fang, K. (2008). LS‐SVM based stable generalized predictive control. Control Conference, 7, 58–61. [Google Scholar]
- Liu, B. , Su, H. Y. , & Zhu, J. (2004). Predictive control algorithm based on least squares support vector machines. Control and Decision, 19(12), 1399–1402. [Google Scholar]
- Lu, D. N. , & Yang, M. Y. (2007). Nonlinear model predictive control based on RBF networks. Journal of Zhejiang University of Technology, 35(2), 123–126. [Google Scholar]
- Mahmoodi, S. , Poshtan, J. , Jahed‐Motlagh, M. R. , & Montazeri, A. (2008). Nonlinear model predictive control of a pH neutralization process based on Wiener–Laguerre model. Chemical Engineering Journal, 146, 328–337. [Google Scholar]
- Ren, X. Y. , Hu, Y. Q. , Hu, Q. L. , Yang, S. B. , Yu, H. X. , Zhen, G. , & Yang, Z. J. (2013). Application of lysozyme in preservation of aquatic products. Science and Technology of Food Industry, 34(8), 390–394. [Google Scholar]
- Sun, Y. K. , Wang, B. , Huang, Y. H. , & Ji, X. F. (2010). Soft‐sensing method for L‐lysine fermentation process based on FDLS‐SVM. Chinese Journal of Scientific Instrument, 31(2), 404–409. [Google Scholar]
- Suykens, J. , & Vandewalle, J. (1999). Least squares support vector machine classifiers. Neural Processing Letters, 9(3), 293–300. 10.1023/A:1018628609742 [DOI] [Google Scholar]
- Wang, Y. J. , Sun, M. , Zhang, Y. B. , Hong, Y. G. , Hao, J. H. , Liu, X. P. , & Wang, C. B. (2000). Studies on preparation and characteristic of the marine low temperature lysozyme. Marine Fisheries Research, 21, 54–63. [Google Scholar]
- Wang, Z. H. , Zhen, Y. , & Zhu, J. (2013). A preliminary study on thunderstorm forecast with LS‐SVM method. Journal of Tropical Meteorology, 19(1), 104–108. [Google Scholar]
- Xi, Y. G. , Li, D. W. , & Lin, S. (2013). Model predictive control–status and challenges. Acta Automatica Sinica, 39(3), 222–236. 10.1016/S1874-1029(13)60024-5 [DOI] [Google Scholar]
- Yan, Z. C. , & Cui, C. L. (2013). An intelligent model predicting the damage depth of coal seam floor based on LS‐SVM optimized by PSO. Journal Applied Sciences, 13(11), 1954–1959. [Google Scholar]
- Yang, J. S. , Li, F. , Ding, H. T. , Guo, K. H. , & Hao, B. Q. (2012). Vehicle yaw stability control based on generalized predictive control. Journal of Agricultural Machinery, 43(1), 1–5. [Google Scholar]
- Yao, Q. Z. , Cai, J. , & Zhang, J. L. (2009). Simultaneous feature selection and LS‐SVM parameters optimization algorithm based on PSO. Word Congress on Computer Science and Information Engineering, 5(6), 723–727. [Google Scholar]
- Zhao, D. B. , Bai, Y. H. , Zhang, X. Y. , & Wu, Y. J. (2010). Research progress of modification of natural biological preservative lysozyme. China Food Additives, 5, 200–204. [Google Scholar]
- Zhong, W. M. , He, G. L. , Pi, D. Y. , & Sun, Y. X. (2005). SVM with quadratic polynomial kernel function based nonlinear model one‐step‐ahead predictive control. Chinese Journal of Chemical Engineering, 13(3), 373–379. [Google Scholar]
- Zhu, X. L. , He, Y. J. , Sun, M. , & Wang, Y. J. (2013). Soft sensing algorithm of marine lysozyme mycelium fermentation bacteria concentration based on SUKF. Journal of Jiangsu University (Nature Science Edition), 34(6), 699–703. [Google Scholar]
- Zhu, X. L. , Ling, J. , Wang, B. , Hao, J. H. , & Ding, Y. H. (2018). Soft‐sensing modeling of marine protease fermentation process based on improved PSO‐RBFNN. CIESC Journal, 69, 1221–1227. [Google Scholar]
- Zhu, X. L. , Liu, Y. F. , Sun, M. , & Wang, Y. J. (2010). Development of intelligent control system for bioreactor of marine microbial enzymes. Transactions of the Chinese Society of Agricultural Engineering, 26(9), 185–191. [Google Scholar]