Abstract
The control of the open loop unstable systems with nonlinear structure is challenging work. In this paper, for the first time, we present a sand cat swarm optimization (SCSO) algorithm-based state feedback controller design for open-loop unstable systems. The SCSO algorithm is a newly proposed metaheuristic algorithm with an easy-to-implement structure that can efficiently find the optimal solution for optimization problems. The proposed SCSO-based state feedback controller can successfully optimize the control parameters with efficient convergence curve speed. In order to show the performance of the proposed method, three different nonlinear control systems such as an Inverted pendulum, a Furuta pendulum, and an Acrobat robot arm are considered. The control and optimization performances of the proposed SCSO algorithm are compared with well-known metaheuristic algorithms. The simulation results show that the proposed control method can either outperform the compared metaheuristic-based algorithms or have competitive results.
Keywords: State feedback control, Nonlinear systems, Trajectory control, Metaheuristic algorithms, Sand cat swarm optimization (SCSO)
1. Introduction
There has been extensive study of the open-loop unstable benchmarks such as inverted pendulums in modern control. Its nonlinear and underactuated yet simple structure makes it an interesting and challenging problem for the control engineers to handle. There are a number of robots, spacecraft, and marine vessels that can be represented by their models, and as a result, they can ignite the curiosity of the researchers and experts in the field of nonlinear control [29], [45]. From the structural point of view, they can be categorized as pendulum carts which can move along the x-axis with a stick installed to the pivot point of the cart, a pendulum that can have motions on both x and y horizons like Furuta pendulum, and a spatial three-dimensional pendulum capable of moving in directions [41]. To evaluate the control performance of the inverted pendulums there exist numerous linear and nonlinear control strategies such as proportional integral derivative (PID), state-feedback controller, fuzzy Q-learning, sliding mode controller, type-2 fuzzy controller, Lyapunov-based and model predictive path integral control [2], [11], [18], [25], [27], [36], [40]. Among these control strategies, PID can be considered as the simplest one with three parameters to be tuned for optimal performance, but when it comes to controlling nonlinear complicated systems traditional PID controllers cannot perform well enough [17]. To conquer this, some other classical methods like linear quadratic regulator (LQR) and state feedback can be beneficial [3].
One of the main challenges for these intelligent and traditional control techniques is associated with the parameter tuning involved in their structure. Usually, based on the operator's knowledge and experience, these parameters are tuned (empirical gain tuning). For various control features such as good and fast transient response, reduced overshoots, settling times, and steady-state errors an optimal solution is exceedingly challenging to obtain. It is possible to think of this parameter tuning issue as an optimization problem in which a variety of nature-inspired and evolutionary algorithms can play a vital role in solving them.
In the last decades, many metaheuristic algorithms have been presented. The real-world problem with different dimensions is being resolved by the metaheuristic algorithms. The metaheuristic algorithms have the ability to find the best or near optimum solution. Based on the No-Free-Lunch (NFL) [44] there is no algorithm capable of finding all possible optimization problems' solutions. There exist many metaheuristic algorithms, however, they can be distinguished in terms of hindering them in trapping local optimum, convergence speed. Also, there are some other metrics to evaluate each metaheuristic algorithm's performance. Most newly proposed metaheuristic algorithms try to find the near-optimum solution with new inspirational rules [43]. The main objective of these algorithms is the trade-off between two phases: exploration and exploitation.
Generally, the metaheuristic algorithms are divided into, evolution-based, physics-based, and swarm intelligence methods [32]. The evolution-based algorithms (EA) are inspired by the evolutionary rules in nature. The genetic algorithm (GA) [8], differential evolution (DE) [28], and Bio-geography-based optimizer (BBO) [21] are among the algorithms in the EA category. The physics-based algorithms originated from biological or physical phenomena. The gravitational search algorithm (GSA) [30], Big-Bang Big-Crunch (BBBC) [6], and black hole (BH) algorithms [9] are imitated by physical events. The swarm intelligence (SI) algorithms are inspired by the social behaviors of humans. Particle swarm optimization (PSO) [24], artificial bee colony (ABC) [15], ant colony optimization (ACO) [5], sand cat swarm optimization (SCSO) [33], grey wolf optimization (GWO) [23], and whale optimization algorithm (WOA) [22] belong to the SI category. The functionality of the metaheuristic algorithms is given in Fig. 1
Figure 1.
Metaheuristic algorithms mechanism.
In the context of classical control, there are gain-tuning methods for controllers based on various optimization algorithms. In the work done by [26], PID gains are calculated for an inductive system with a linear mathematical model in which the GA's fitness function is formed based on the settling time, rise time, and overshoot of the time response of the system. A hybrid particle swarm optimization (PSO) and differential evolution (DE) algorithm has been applied to a linearized tank-liquid level model to obtain the optimal PID gains to control the system in [24]. In the domain of modern control, PSO is used to design a linear quadratic regulator (LQR) for a three-phase inverter [38]. Big-Bang Big-Crunch (BBBC) is employed to optimize the PID and sliding mode controller gains in controlling the nonlinear model of a spatial inverted pendulum [41], [42]. Whale optimization algorithm (WOA) [34] is used to calculate the state feedback controller gains for a linear model of a thermal system. For the state feedback controller design, grey wolf optimization (GWO) is also applied to the control problem of a permanent magnet machine [35]. In the context of nonlinear systems [1] has used the Improved cuckoo optimization algorithm to solve the system of the nonlinear equations.
In this study, to learn and optimize the gains of the state feedback controller for nonlinear unstable systems of inverted pendulum, Furuta pendulum, and Acrobat robot for the first time the sand cat swarm optimization (SCSO) is proposed. Most recently, the SCSO as a novel nature-based algorithm is developed in [33]. The SCSO algorithm has several advantages over other metaheuristic algorithms, such as the exploration versus exploitation trade-off, the efficient performance in the convergence curve, and ease of implementation on different problems. Therefore, the SCSO method has been preferred and employed in some engineering problems [12], [13], [14], [20], [31]. Regardless of its recent successful performances, its applicability in real-time control problems of complex nonlinear systems has not been investigated yet. To fill this gap and benefit from the above-mentioned advantages of the SCSO, we apply it to the control engineering domain.
In this paper the contributions motivated by the need for novel and efficient optimization algorithms for developing optimal controllers in engineering problems can be summarized as follows:
-
1.
A lightweight yet powerful algorithm
A new state feedback controller design based on SCSO algorithm is proposed for the nonlinear unstable systems, reducing the tuning difficulties for optimal gains. The justification for hiring SCSO is its algorithmic simplicity owning to the fact that it has few parameters, making it easy to implement.
-
2.
Applications in real-world problems
The proof of the convergence of the SCSO algorithm is shown via various simulations when applied to highly nonlinear complex systems, which confirms the effectiveness of this algorithm in real-world engineering problems.
-
3.
Numerous comparisons
To verify the validity of the proposed SCSO-based control design, its performance has been compared with well-known optimization algorithms such as PSO, BBBC, WOA, and GWO in which it exhibited either better or close performance.
The rest of the paper is planned as follows. The proposed SCSO-based control algorithm and the physical model of the systems are presented in Section 2 as fundamentals. The simulation result and analysis are given in Section 3. Finally, the conclusions and future observations are written in Section 4.
2. Fundamentals
In the first part of this section, a general overview of the SCSO optimization algorithm will be provided. In the second part, we consider the equations of motions of three nonlinear systems and their control performance will be examined using some well-known metaheuristic algorithms. Finally, the proposed SCSO-based control algorithm will be explained.
2.1. The metaheuristic SCSO algorithm
The sand cat swarm optimization (SCSO) algorithm is inspired by the sand cat behaviors in nature [33]. The SCSO algorithm mimics the search and hunting mechanism of this animal. The sand cat has some specific characteristics in comparison to the domestic cats such as the ability to detect the low frequency noises, live in the harsh environment of deserts, and own specific methods for hunting. There are not any differences in the appearance of both cat types, but the sand cat has a high density of fur in the palms and soles. As mentioned earlier, the sand cat has a special method in the foraging and hunting mechanism. They predict the location of the prey according to their extraordinary ability in detecting low-frequency noises. The main characteristic of the sand cat is to hear noises with frequencies below . The SCSO algorithm tries to find a solution close to the optimum. Based on the NFL theorem [44], one cannot find a metaheuristic algorithm capable of finding an optimum solution to each individual problem. Among the available metaheuristic algorithms, however, the SCSO algorithm has been shown to be effective with reliable performance in most of the optimization problems.
For implementing the SCSO algorithm, the first step is the initialization of the parameter search space. This step is similar to all population-based metaheuristic algorithms. The search space is populated randomly between the lower and upper boundaries of the parameters according to the limitations of the given optimization problem. The assigned parameter ranges determine the borders of the search space. In each iteration of the algorithm, the SCSO algorithm controls the boundaries of each parameter in the search space. The number of rows in the search space is equal to the number of the search agents and the number of columns representing the number of the dimensions of the problem. The randomly populated search space includes a list of candidate solutions that the metaheuristic algorithm should update and finally find a solution near the optimum value from the search space. For this and to evaluate the candidate solutions, a well-defined fitness function (cost) is required for each specific optimization problem. Based on the problem's goal, which could be either a maximization or minimization one, the metaheuristic algorithm tries to guide the search mechanism towards the optimum solution. In the next step, the search agents perform a continuous exploration within the search space to update their positions and move towards the regions where the optimal solution is placed, (reaching near to the prey's location). Here, the search and hunting mechanism of each metaheuristic algorithm is different.
In the SCSO algorithm, the searching for prey is empowered by each sand cat's ability in the low-frequency noise emission. The sensitivity range (R) of each search agent is predefined between 2 to 0. The parameter indicates the general sensitivity range which decreases from 2 to 0 according to Equations. (1), (2), (3). In the equation (1) the , which is associated with the sand-cat's ability to catch low-frequencies below 2 KHz, is assumed to be 2, is the current iteration number, and is the iterations maximum value. indicates the current position, indicates the best position, and is the random position according to Equations (4), (5a), (5b).
In the SCSO algorithm, each search agent moves in a circular manner, in this way to move in different directions to search for possible global solutions, the random angle α between 0 to 360 is considered as a function. The main structural equations of the SCSO algorithm are given in Equation (6).
| (1) |
| (2) |
| (3) |
| (4) |
| (5a) |
| (5b) |
| (6) |
2.2. Physical systems model
To show the performance of the optimization algorithms we apply them to three different nonlinear systems; Inverted pendulum, Furuta pendulum, and Acrobat Robot. In the following, the mathematical formulations of the systems along with their parameters and states will be given.
2.2.1. Inverted pendulum
The inverted pendulum system in modern control is a good example of a benchmark experiment for an open-loop unstable system which is being considered for different linear and nonlinear control applications. Its working principle generally is based on a moving cart over a rail with a stick (pendulum) attached to it. The control objective here is to stabilize the pendulum in a vertical position by applying desired control signals to the cart, considering the limitations on the position of the cart as well as the angle of the pendulum with respect to the vertical position. The continuous state space of the inverted pendulum is composed of four state variables as which stand for pendulum's angle and its angular velocity, cart position and cart velocity, respectively. The control objective is to design a suitable feedback controller for this system capable of stabilizing the pendulum's angle in the vertical position as well as bringing the cart position to the origin. The dynamical equations of the system are given in Equations (7a), (7b) as shown by [4]
| (7a) |
| (7b) |
where the model parameters of the system are summarized in Table 1. For each state, a stabilizing feedback gain is learned by the optimization algorithms. To have a fair comparison between the metaheuristic algorithms, the same quadratic cost function is used which encompasses the information regarding the state variables and the generated control signal. This quadratic function is defined as:
| (8) |
with a positive definite weight matrix defined as and a positive penalty value for the control signal u. For this cost function to have a global minimum, the matrices and have to be positive definite. A matrix is positive definite if it is symmetric and all its eigenvalues are positive. If these matrices are positive definite, then the function J is a strictly convex function. A strictly convex function has a unique global minimum, which can be found by taking the derivative of the function and setting it to zero. This ensures that the optimization algorithm can converge to the optimal solution. It is also important to check that the domain of and u is closed and bounded.
Table 1.
Inverted pendulum's model parameters.
| M | Cart mass (kg) | 0.5 |
| m | Stick mass (kg) | 0.5 |
| l | Stick length (m) | 0.6 |
| b | Friction constant (N/m × s) | 0.1 |
| g | Gravity (m/s2) | 9.81 |
2.2.2. Furuta pendulum
The Furuta pendulum is an underactuated nonlinear system whose structure is composed of a rotating arm with a pendulum attached to it. For this system also, we consider four continuous states consisting of the arm angle, arm angular velocity, pendulum angle, and its angular velocity represented as . The governing equations of the Furuta pendulum with respect to the arm and pendulum angles are represented in [7] and given as follows:
| (9) |
where is obtained by Equation (10)
| (10) |
The parameters of the Furuta pendulum model in Equation (9) are defined according to Table 2.
Table 2.
Furuta pendulum's model parameters.
| m1 | pendulum mass (kg) | 5.38 × 10−2 |
| L0 | Arm length (m) | 0.215 |
| g | Gravity (m/s2) | 9.81 |
| l1 | Distance to pendulum's center of gravity | 0.113 |
| J1 | pendulum Inertia | 1.98 × 10−4 |
| I0 | Arm Inertia | 1.75 × 10−2 |
2.2.3. Acrobat robot
A simple structure of the 2-link robot known also as the Acrobat robot is given in Fig. 2, where it is used to obtain the dynamical equations of the robot, assuming that the mass of the links is and to be equal to , their lengths are and equal to , joint angles of the links as and . These links are controlled by two motors with torques and . Using the Lagrangian formulation, the general form of the acrobat robot's dynamics can be written as given in Equation (11)
| (11) |
where M, V, and G stand for inertia matrix, Coriolis, and gravity vectors, respectively. One can extract the dynamics of the acrobat robot as shown in Equation (12), which are elaborately covered in [19].
| (12) |
To control the acrobat robot, Proportional-Derivative Computed Torque (PD-CT) is used and its working principle is based on the feedback linearization [37]. For this, first, an error trajectory is defined as Equation (13):
| (13) |
The nonlinear controller which ensures tracking the desired motion trajectory can be written as Equation (14):
| (14) |
where is composed of nonlinear terms defined by Equation (15):
| (15) |
with friction components represented by . Taking the control signal as a PD controller will yield the PD-CT controller as provided in Equation (16):
| (16) |
The structure of the resulting PD-CT control loop is given in Fig. 3, where the optimization algorithms are responsible for learning the controller gains. It is built with a multiloop structure, consisting of an inner nonlinear feedback linearization loop and an exterior unity-gain tracking loop. It is important to keep in mind that for each joint there is an outer loop. In this closed-loop control system, we have
The desired trajectory is also assumed to be a sinusoidal one with a period of 2 and amplitude of 0.1 according to Equation (17):
| (17) |
We defined the cost function of the applied optimization algorithms considering the error trajectory as:
| (18) |
where Q is a diagonal matrix as ;
Figure 2.

Mechanism of the acrobat robot.
Figure 3.
Block diagram representation of the PD-CT with the online optimization algorithms.
2.3. SCSO-based controller design
The fundamental principle of operation of the SCSO method is to find the possible optimal solutions from a random search space (seeking for prey and attacking them). The objective of the algorithm based on the problem at hand can be either minimization or maximization of an appropriate cost function. Fig. 4 demonstrates SCSO's flowchart. According to this flowchart and based on the following steps, the optimal control parameters for the nonlinear unstable systems can be determined.
Figure 4.
Flowchart for the SCSO algorithm.
Step 1 The first step is to assign the initial parameters for the SCSO-based control algorithm as well as define the lower and upper bounds for the unknown parameters in the search space; For the inverted pendulum and Furuta pendulum, the parameters to be obtained are the gains of the state feedback controller, which for these systems we have in total 4 controller gains. For the acrobat robot, the PD gains will be obtained. Moreover, the number of search agents, and the maximum number of iterations should be specified. Then, the initialization of the random parameter space according to the boundary conditions can be done based on Equation (19):
| (19) |
where ub and lb are the upper and lower boundaries for each parameter to be learned and dim is the dimension of each parameter.
Step 2 In step two, each search agent calculates the fitness value of the appropriate possible solution based on the available problem. For the inverted pendulum and Furuta pendulum, the relevant fitness function is given in Equation (8) and for the acrobat robot it is given in Equation (18).
Step 3 SCSO is initiated here in order to determine the optimum solution. On the basis of the fitness function, the best score and the best position of the optimum solution are determined. Accordingly, the SCSO algorithm updates the search agent positions based on Equation (1). It is worthwhile mentioning that selecting an appropriate performance index for the metaheuristic algorithm is crucial.
Step 4 As part of the SCSO algorithm, a predefined stopping criterion is assigned. In our case, the stopping criteria are met when the iteration maximum value is reached.
3. Simulation and result analysis
A metaheuristic algorithm typically produces different results depending on the values of its control parameters, and it is often found that different control parameters produce dissimilar results depending on their values. In this section first, the initialization of the compared metaheuristic algorithms such as GWO, WOA, PSO, and BBBC with the SCSO is presented. Then, they are applied to the nonlinear models of the inverted pendulum, Furuta pendulum, and the acrobat robot to learn a suitable state-feedback controller gain vector to stabilize and control them. Their control behavior is then illustrated as time response plots and the stabilizing gains and resulted cost values are provided in relevant tables. Finally, the section will be concluded with some discussions regarding the extracted observations from the results.
3.1. Parameter initialization
For all the considered meta-heuristic algorithms including the proposed SCSO algorithm, it is assumed that each one is run with a population size of 30 and a number of iterations of 500. The simulation configuration composed of the different initial parameters and specific parameter ranges for each meta-heuristic algorithm, is given in Table 3 To have a fair evaluative comparison, the environment in which all the experiments are conducted, is considered under the same conditions. All the simulations are implemented using a Core processor, 8 GB of RAM in the MATLAB 2020 environment.
Table 3.
Initialization of the algorithms.
| Algorithm | Parameter | Value |
|---|---|---|
| SCSO | Sensitivity Range (rG) | [0 2] |
| Phase Control Range (R) | [−2rG 2rG] | |
| GWO | a linearly decreasing vector (a) | [0 2] |
| A coefficient vector | [0 2] | |
| C coefficient vector | 2 × rand(0,1) | |
| WOA | a decreases linearly from 2 to 0 | [0 2] |
| A coefficient vector | [0 2] | |
| PSO | Weight w | [0.4 0.9] |
| Acceleration constants c1,c2 | 2 | |
| BBBC | Iteration parameter Ng | 30 |
| Space limiting parameter β | 0.15 | |
| Acceleration constant c1 | [1 15] | |
| Acceleration constant c2 | [2 20] | |
3.2. Result analysis
To show the effectiveness of the proposed SCSO parameter optimization for the controller design of physical systems, a comparison based on different metaheuristic algorithms such as PSO, BBBC, WOA, and GWO in three complex nonlinear systems as represented in Section 2.2, has been made. To control the Inverted pendulum and Furuta pendulum, we design two different full-state feedback controllers according to the number of states involved for each system. A state feedback controller is represented according to Equation (20):
| (20) |
where k stands for a gain vector of the controller, which is responsible for stabilizing the pendulum angles, and u is the resulting control signal. Specifically for the Inverted pendulum, another control objective is to steer the system's cart position towards the origin from any given initial cart position, as well. The time response variations of the inverted pendulum for the obtained state vector by metaheuristic algorithms are given in Fig. 5, where the simulation time is chosen to be 10 sec with a sampling time of sec. It should be emphasized that for a more accurate portrayal of the plots, we showed only 3 seconds of the simulation in this figure. To obtain the time response plots of the system, an initial state is assigned as . Although according to Fig. 5, the performance of the applied meta-heuristic algorithms for stabilizing the states of the Inverted pendulum is very close to each other, we can discuss the performance of each algorithm according to the desired specifications of the system. Specifically, when considering Fig. 5a, in terms of the undershoots and overshoots of the angle deviations, BBBC demonstrates better performance. Among the algorithms, GWO decreased the rise time of the system's response, whereas WOA resulted in a greater rise time. Although PSO and SCSO exhibited similar performances, PSO seems to result in a less undershoot for the angle response. For the pendulum's cart position shown in Fig. 5c, the performance of GWO can be distinguished from the other algorithms, where it shows a reduced rise-time, settling-time, and overshoot. The behavior of the control signal for the Inverted pendulum, is also shown in Fig. 6. As observed from this figure, less initial control effort devoted to controlling the inverted pendulum belongs to WOA, SCSO, PSO, BBBC, and GWO, respectively (the less control effort, the better the controller design). Moreover, one can observe the velocity behavior of the pendulum's angle and the cart in Figs. 5b, 5d.
Figure 5.
Time responses for the Inverted pendulum.
Figure 6.
Control signals for the Inverted pendulum.
The simulation results of the Furuta pendulum are shown in Fig. 7. To have a clear and reliable comparison, specifically for Fig. 7a, assuming the starting time for the comparison is around 150, in terms of the overshoots, the PSO owns the first place by having the minimum value, SCSO and GWO are in the second place, and BBBC has the maximum value of the overshoot. In Fig. 7b, SCSO owns the second-best performance after GWO, in having a better settling time.
Figure 7.
Time responses for the Furuta pendulum.
For hardware setup implementations, the behavior of the output control signals of the systems is crucial. When considering Fig. 7c, the control signal variations for GWO, WOA, and PSO are relatively large, and in real-world physical experiments depending on the limitations of the available actuators, they could not be realistic. Whereas, these variations for BBBC and SCSO are small, which could give better results in real-world experiments.
For the Acrobat robot, the resulted trajectory error plots, for the given sinusoidal inputs are given in Fig. 8. As can be seen, while PSO outperforms the other algorithms, SCSO, WOA, GWO, and BBBC show a similar trend in tracking the inputs. In terms of the algorithmic analysis we can discuss the convergence of the applied metaheuristic optimization methods according to Fig. 9 and the given Table 4, Table 5, Table 6, where for each system the obtained state feedback parameters and the resulting cost values, are given. According to our results, there are similar functional behaviors in all of these nonlinear system models as they are in the uni-modal fixed function models; the best or optimal solution for these types of functions can only have one global solution. One of the objectives of our study is to examine the metaheuristic algorithms' capacity for exploration and exploitation by the knowledge that finding a global optimum solution requires a good level of exploitation. The optimized gains for the pendulum's angle, angular velocity, the cart's position, and its linear velocity are given in Table 4 and represented by where . The functionality of the implemented metaheuristic optimization algorithms is based on an appropriate cost function. The converged value of the cost function for each algorithm is also given in Table 4 which is calculated according to Equation (8). As it is obvious from this table, the cost value of the SCSO algorithm is better than the others.
Figure 8.
Trajectory error and the desired trajectory for the acrobat.
Figure 9.
Convergence plots for the system models.
Table 4.
Optimized parameters and the converged cost function for the Inverted pendulum.
| Algorithm | k1 | k2 | k3 | k4 | Cost |
|---|---|---|---|---|---|
| SCSO | −106.5173 | −15.3428 | −24.3197 | −16.5111 | 6.06e − 37 |
| GWO | −199.7924 | −28.3381 | −54.4326 | −34.7815 | 1.78e − 36 |
| WOA | −90.1191 | −13.1692 | −19.5456 | −13.3067 | 1.42e − 34 |
| PSO | −126.4230 | −18.1780 | −29.7044 | −20.1036 | 1.38e − 35 |
| BBBC | −195.7704 | −28.2128 | −47.3587 | −32.1615 | 4.65e − 32 |
Table 5.
Optimized parameters and the converged cost function for the Furuta.
| Algorithm | k1 | k2 | k3 | k4 | Cost |
|---|---|---|---|---|---|
| SCSO | −30.9614 | −8.6242 | −60.8159 | −7.3316 | 3.14e − 69 |
| GWO | −102.1752 | −27.9689 | −192.2926 | −23.1986 | 1.39e − 66 |
| WOA | −98.7970 | −23.3394 | −161.9467 | −19.3776 | 1.04e − 63 |
| PSO | −101.0838 | −27.7087 | −191.1006 | −23.0322 | 6.70e − 70 |
| BBBC | −38.5966 | −12.1674 | −60.6569 | −10.6516 | 1.73e − 28 |
Table 6.
Optimized parameters and the converged cost function for the acrobat robot.
| Algorithm | kp | kd | Cost |
|---|---|---|---|
| SCSO | 117.3465 | 200 | 3.5629e − 08 |
| GWO | 117.84117 | 200 | 3.5636e − 08 |
| WOA | 117.5706 | 200 | 3.5634e − 08 |
| PSO | 200 | 107.1564 | 6.3848e − 08 |
| BBBC | 117.5924 | −12.1674 | 3.5636e − 08 |
The obtained results of the Furuta pendulum, which is designed to rotate an arm attached to a pendulum, are also shown in Table 5. Similar to Table 4, there are gain parameters to be optimized, which should control the arm angle, its angular velocity, the pendulum angle, and its angular velocity. According to Table 5, the SCSO algorithm also provides better results in comparison to other algorithms for finding optimized parameters. For this system, the cost function values are obtained using Equation (8).
Table 6 shows the result that was obtained for the acrobat robot system model during the simulation process. In this system model, there are two gain parameters that control the angles shown in Fig. 2. According to the obtained results, the cost values are competitive. For this system, we have used the cost function given in Equation (18). It is worth noting that, except for BBBC, the other algorithms have crossed their parameter boundaries. Based on Fig. 9a, 9b, and 9c convergence curves of the metaheuristic algorithms are presented. As can be seen from this figure, the SCSO algorithm's convergence rate, in terms of keeping the trade-off between the exploration and exploitation phases, is satisfactory.
3.3. Discussions
Although in Section 3.2 we provide the time-response and convergence properties of the proposed SCSO control algorithm and showed its effectiveness when compared to some optimization algorithms, there still exist some inquiries that should be explored.
-
•
In our simulations, we did not incorporate the effect of the disturbances, parameter uncertainties, and noises involved, and ideal conditions are taken. Actually, to design robust control and optimization algorithms that can be better integrated with real-world system models, the role of the disturbances and the uncertainties are indispensable.
-
•
As the control algorithms for our systems, the classical state-feedback, and PD-CT are hired. Other sophisticated traditional and intelligent controllers could also have been used like a hybrid-neural network-based sliding mode control (SMC) [16], nonlinear SMC [39], and Fuzzy control [10].
-
•
The obtained result for the robotic arm falls into the boundaries of the initial parameters except for the BBBC algorithm. This is because of the exploration-exploitation behavior of the metaheuristic algorithms. This can be overcome by modifying the range of the parameters defined. Assigning the bounds for the parameter spaces can be tricky and problem-specific, therefore it is crucial to have a firm grasp of the issue at hand before you start making assumptions.
-
•
Based on the obtained result for each nonlinear system model, the metaheuristic algorithms get the optimum value as a cost function. However, the obtained optimum cost for the acrobat robot for all the algorithms is in a competitive manner, which shows the sensitivity and accuracy of the algorithms.
4. Conclusions and future observations
In this paper, for the first time, the SCSO-based state feedback controller is proposed and successfully applied to three nonlinear physical systems. In this direction, first, we explore the stabilization performance of the applied optimization-based control strategy in the Inverted pendulum and Furuta pendulum. Then the trajectory tracking control of an Acrobat robot is investigated. To put our methodology in context and demonstrate its efficiency, a thorough comparison has been made with some famous optimization algorithms such as BBBC, PSO, WOA, and GWO in terms of both control performance and convergence. The main tasks of the current paper can be pointed out in the following:
-
•
The proposed SCSO-state feedback controller is implemented in the controller design for the nonlinear, unstable, and underactuated models of the Inverted pendulum and Furuta pendulum, where their control is challenging work. The resulting control can successfully stabilize the Inverted pendulum's angle and cart position. Similarly, the Furuta pendulum managed to stabilize both the pendulum and arm angles based on the designed controller.
-
•
To guarantee the tracking error for given sinusoidal reference inputs to the Acrobat robot, an SCSO-based PD-CT controller is designed.
-
•
The results of the simulations for both the convergence and control performance reveal the competitiveness of the proposed SCSO-based control design with some famous optimization algorithms. In relation to the time response features of the systems, we can infer that the SCSO-based control can even perform better than the tested optimization algorithms, or its performance is sufficient to match those of the competitors and achieve the intended control objectives.
-
•
From the algorithmic point of view, SCSO inherits a simple structure and is easy to implement, which entitles it as a suitable candidate for real-world control and engineering problems. Moreover, it can ensure an accurate convergence by retaining a balance between exploration and exploitation.
As a future research path, we first intend to extend the optimization idea from the classical to the advanced intelligent control algorithms and apply the SCSO algorithm to learn the model of the fuzzy controllers which may demand a high-dimensional parameter optimization space. The next step is to investigate the robustness of the proposed SCSO algorithm in terms of disturbances, parameter uncertainties, and noises applied to the system model.
Funding statement
This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.
CRediT authorship contribution statement
Vahid Tavakol Aghaei and Amir SeyyedAbbasi: Conceived and designed the experiments; Performed the experiments; Analyzed and interpreted the data; Wrote the paper.
Jawad Rasheed: Contributed reagents, materials, analysis tools or data; Analyzed and interpreted the data.
Adnan M. Abu-Mahfouz: Contributed reagents, materials, analysis tools, or data; Analyzed and interpreted the data.
Declaration of Competing Interest
The authors declare no conflict of interest.
Contributor Information
Vahid Tavakol Aghaei, Email: vahid.aghaei@istinye.edu.tr.
Amir SeyyedAbbasi, Email: amir.seyyedabbasi@istinye.edu.tr.
Jawad Rasheed, Email: jawad.rasheed@nisantasi.edu.tr.
Adnan M. Abu-Mahfouz, Email: a.abumahfouz@ieee.org.
Data availability
No data was used for the research described in the article.
References
- 1.Abdollahi Mahdi, Bouyer Asgarali, Abdollahi Davoud. Improved cuckoo optimization algorithm for solving systems of nonlinear equations. J. Supercomput. 2016;72(3):1246–1269. [Google Scholar]
- 2.Tavakol Aghaei Vahid, Onat Ahmet, Eksin Ibrahim, Guzelkaya Mujde. 2015 European Control Conference (ECC) 2015. Fuzzy PID controller design using Q-learning algorithm with a manipulated reward function; pp. 2502–2507. [DOI] [Google Scholar]
- 3.Bezci Yüksel Ediz, Tavakol Aghaei Vahid, Akbulut Batuhan Ekin, Tan Deniz, Allahviranloo Tofigh, Fernandez-Gamiz Unai, Noeiaghdam Samad. Classical and intelligent methods in model extraction and stabilization of a dual-axis reaction wheel pendulum: a comparative study. Results Eng. 2022;16 doi: 10.1016/j.rineng.2022.100685. [DOI] [Google Scholar]
- 4.Dann Christoph, Neumann Gerhard, Peters Jan. Policy evaluation with temporal differences: a survey and comparison. J. Mach. Learn. Res. January 2014;15(1):809–883. [Google Scholar]
- 5.Dorigo Marco, Birattari Mauro, Stutzle Thomas. Ant colony optimization. IEEE Comput. Intell. Mag. 2006;1(4):28–39. [Google Scholar]
- 6.Erol Osman K., Eksin Ibrahim. A new optimization method: Big Bang–Big Crunch. Adv. Eng. Softw. 2006;37(2):106–111. [Google Scholar]
- 7.Fantoni Isabelle, Lozano Rogelio. Springer-Verlag; 2001. Non-Linear Control for Underactuated Mechanical Systems. [Google Scholar]
- 8.Goldberg David E. Pearson Education India; 2013. Genetic Algorithms. [Google Scholar]
- 9.Hatamlou Abdolreza. Black hole: a new heuristic optimization approach for data clustering. Inf. Sci. 2013;222:175–184. [Google Scholar]
- 10.Ben Hazem Zied, Fotuhi Mohammad Javad, Bingül Zafer. Development of a Fuzzy-LQR and Fuzzy-LQG stability control for a double link rotary inverted pendulum. J. Franklin Inst. 2020;357(15):10529–10556. doi: 10.1016/j.jfranklin.2020.08.030. [DOI] [Google Scholar]
- 11.Homburger Hannes, Wirtensohn Stefan, Reuter Johannes. 2022 30th Mediterranean Conference on Control and Automation (MED) 2022. Swinging up and stabilization control of the Furuta pendulum using model predictive path integral control; pp. 7–12. [DOI] [Google Scholar]
- 12.Iraji Amin, Karimi Javad, Keawsawasvong Suraparb, Nehdi Moncef L. Minimum safety factor evaluation of slopes using hybrid chaotic sand cat and pattern search approach. Sustainability. 2022;14(13):8097. [Google Scholar]
- 13.Jia Shiduo, Kang Xiaoning, Cui Jinxu, Tian Bowen, Xiao Shuwen. Hierarchical stochastic optimal scheduling of electric thermal hydrogen integrated energy system considering electric vehicles. Energies. 2022;15(15) doi: 10.3390/en15155509. [DOI] [Google Scholar]
- 14.Jovanovic Dijana, Marjanovic Marina, Antonijevic Milos, Zivkovic Miodrag, Budimirovic Nebojsa, Bacanin Nebojsa. 2022 International Conference on Artificial Intelligence in Everything (AIE) 2022. Feature selection by improved sand cat swarm optimizer for intrusion detection; pp. 685–690. [DOI] [Google Scholar]
- 15.Dervis Karaboga. Artificial bee colony algorithm. Scholarpedia. 2010;5(3):6915. [Google Scholar]
- 16.Keighobadi Jafar, Hosseini-Pishrobat Mehran, Faraji Javad. Adaptive neural dynamic surface control of mechanical systems using integral terminal sliding mode. Neurocomputing. 2020;379:141–151. doi: 10.1016/j.neucom.2019.10.046. [DOI] [Google Scholar]
- 17.Hwan Kim Jun, Oh S.J. A fuzzy PID controller for nonlinear and uncertain systems. Soft Comput. 2000;4(2):123–129. [Google Scholar]
- 18.Tufan Kumbasar, Hagras Hani. Big Bang–Big Crunch optimization based interval type-2 fuzzy PID cascade controller design strategy. Inf. Sci. 2014;282:277–295. doi: 10.1016/j.ins.2014.06.005. [DOI] [Google Scholar]
- 19.Lewis F.L., Yesildirak A., Jagannathan Suresh. Taylor & Francis, Inc.; USA: 1998. Neural Network Control of Robot Manipulators and Nonlinear Systems. [Google Scholar]
- 20.Li Yiming, Wang Gencheng. Sand cat swarm optimization based on stochastic variation with elite collaboration. IEEE Access. 2022;10:89989–90003. doi: 10.1109/ACCESS.2022.3201147. [DOI] [Google Scholar]
- 21.Mirjalili Seyedali. Evolutionary Algorithms and Neural Networks. Springer; 2019. Biogeography-based optimisation; pp. 57–72. [Google Scholar]
- 22.Mirjalili Seyedali, Lewis Andrew. The whale optimization algorithm. Adv. Eng. Softw. 2016;95:51–67. [Google Scholar]
- 23.Mirjalili Seyedali, Mohammad Mirjalili Seyed, Lewis Andrew. Grey wolf optimizer. Adv. Eng. Softw. 2014;69:46–61. [Google Scholar]
- 24.Moharam Amal, El-Hosseini Mostafa A., Ali Hesham A. Design of optimal PID controller using hybrid differential evolution and particle swarm optimization with an aging leader and challengers. Appl. Soft Comput. 2016;38:727–737. doi: 10.1016/j.asoc.2015.10.041. [DOI] [Google Scholar]
- 25.Montoya Oscar Danilo, Gil-Gonzalez Walter. Nonlinear analysis and control of a reaction wheel pendulum: Lyapunov-based approach. Int. J. Eng. Sci. Technol. 2020;23(1):21–29. doi: 10.1016/j.jestch.2019.03.004. [DOI] [Google Scholar]
- 26.Neath Michael J., Swain Akshya K., Madawala Udaya K., Thrimawithana Duleepa J. An optimal PID controller for a bidirectional inductive power transfer system using multiobjective genetic algorithm. IEEE Trans. Power Electron. 2014;29(3):1523–1531. doi: 10.1109/TPEL.2013.2262953. [DOI] [Google Scholar]
- 27.Ouyang Huimin, Hu Jinxin, Zhang Guangming, Mei Lei, Deng Xin. Sliding-mode-based trajectory tracking and load sway suppression control for double-pendulum overhead cranes. IEEE Access. 2019;7:4371–4379. doi: 10.1109/ACCESS.2018.2888563. [DOI] [Google Scholar]
- 28.Price Kenneth V. Handbook of Optimization. Springer; 2013. Differential evolution; pp. 187–214. [Google Scholar]
- 29.Rahimi Afshin, Kumar Krishna Dev, Alighanbari Hekmat. Fault isolation of reaction wheels for satellite attitude control. IEEE Trans. Aerosp. Electron. Syst. 2020;56(1):610–629. doi: 10.1109/TAES.2019.2946665. [DOI] [Google Scholar]
- 30.Rashedi Esmat, Nezamabadi-Pour Hossein, Saryazdi Saeid. GSA: a gravitational search algorithm. Inf. Sci. 2009;179(13):2232–2248. [Google Scholar]
- 31.Seyyedabbasi Amir. 2022 International Conference on Theoretical and Applied Computer Science and Engineering (ICTASCE) IEEE; 2022. Solve the inverse kinematics of robot arms using sand cat swarm optimization (SCSO) algorithm; pp. 127–131. [Google Scholar]
- 32.Seyyedabbasi Amir. WOASCALF: a new hybrid whale optimization algorithm based on sine cosine algorithm and levy flight to solve global optimization problems. Adv. Eng. Softw. 2022;173 [Google Scholar]
- 33.Seyyedabbasi Amir, Kiani Farzad. Sand cat swarm optimization: a nature-inspired algorithm to solve global optimization problems. Eng. Comput. 2022 doi: 10.1007/s00366-022-01604-x. [DOI] [Google Scholar]
- 34.Swamy Simhadri Kumara, Mohanty Banaja, Kumar Panda Sanjaya. Comparative performance analysis of 2DOF state feedback controller for automatic generation control using whale optimization algorithm. Optim. Control Appl. Methods. 2019;40(1):24–42. doi: 10.1002/oca.2462. [DOI] [Google Scholar]
- 35.Sun Xiaodong, Jin Zhijia, Cai Yingfeng, Yang Zebin, Chen Long. Grey wolf optimization algorithm based state feedback control for a bearingless permanent magnet synchronous machine. IEEE Trans. Power Electron. 2020;35(12):13631–13640. doi: 10.1109/TPEL.2020.2994254. [DOI] [Google Scholar]
- 36.Tavakol Aghaei Buse ilayda Komurcu Vahid, Saka Didar, Aydogan Bahar, Kizilca Gizem, Erener Seda. 2021 29th Signal Processing and Communications Applications Conference (SIU) 2021. Modeling and simulation of the control performance of a reaction wheel pendulum; pp. 1–4. [DOI] [Google Scholar]
- 37.Tavakol Aghaei Vahid, Ağababaoğlu Arda, Yıldırım Sinan, Onat Ahmet. A real-world application of Markov chain Monte Carlo method for Bayesian trajectory control of a robotic manipulator. ISA Trans. 2022;125:580–590. doi: 10.1016/j.isatra.2021.06.010. [DOI] [PubMed] [Google Scholar]
- 38.Ufnalski Bartlomiej, Kaszewski Arkadiusz, Grzesiak Lech M. Particle swarm optimization of the multioscillatory LQR for a three-phase four-wire voltage-source inverter with an LC output filter. IEEE Trans. Ind. Electron. 2015;62(1):484–493. doi: 10.1109/TIE.2014.2334669. [DOI] [Google Scholar]
- 39.Wadi Ali, Lee Jin-Hyuk, Romdhane Lotfi. 2018 11th International Symposium on Mechatronics and Its Applications (ISMA) 2018. Nonlinear sliding mode control of the Furuta pendulum; pp. 1–5. [DOI] [Google Scholar]
- 40.Wang Jia-Jun. Simulation studies of inverted pendulum based on PID controllers. Simul. Model. Pract. Theory. 2011;19(1):440–449. doi: 10.1016/j.simpat.2010.08.003. Modeling and performance analysis of networking and collaborative systems. [DOI] [Google Scholar]
- 41.Wang Jia-Jun, Kumbasar Tufan. Big Bang-Big Crunch optimized hierarchical sliding-mode control of X-Z inverted pendulum. Simul. Model. Pract. Theory. 2018;86:25–35. doi: 10.1016/j.simpat.2018.05.002. [DOI] [Google Scholar]
- 42.Wang Jia-Jun, Kumbasar Tufan. Optimal PID control of spatial inverted pendulum with Big Bang–Big Crunch optimization. IEEE/CAA J. Autom. Sin. 2020;7(3):822–832. doi: 10.1109/JAS.2018.7511267. [DOI] [Google Scholar]
- 43.Wang Jiao, Li Yan, Hu Gang, Yang MingShun. An enhanced artificial hummingbird algorithm and its application in truss topology engineering optimization. Adv. Eng. Inform. 2022;54 [Google Scholar]
- 44.Wolpert David H., Macready William G. No free lunch theorems for optimization. IEEE Trans. Evol. Comput. 1997;1(1):67–82. [Google Scholar]
- 45.Zhang Pengfei, Wu Zhengxing, Dong Huijie, Tan Min, Yu Junzhi. Reaction-wheel-based roll stabilization for a robotic fish using neural network sliding mode control. IEEE/ASME Trans. Mechatron. 2020;25(4):1904–1911. doi: 10.1109/TMECH.2020.2992038. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
No data was used for the research described in the article.








