Skip to main content
Sensors (Basel, Switzerland) logoLink to Sensors (Basel, Switzerland)
. 2021 Dec 29;22(1):243. doi: 10.3390/s22010243

Stabilization of the Cart–Inverted-Pendulum System Using State-Feedback Pole-Independent MPC Controllers

Lotfi Messikh 1, El-Hadi Guechi 1, Sašo Blažič 2,*
Editors: Jose Machado, Katarzyna Antosz, Vijaya Kumar Manupati, Yi Ren, Rochdi El Abdi, Dariusz Mazurkiewicz, Marina Ranga, Pierluigi Rea, Emilia Villani, Erika Ottaviano
PMCID: PMC8749679  PMID: 35009786

Abstract

In this paper, a pole-independent, single-input, multi-output explicit linear MPC controller is proposed to stabilize the fourth-order cart–inverted-pendulum system around the desired equilibrium points. To circumvent an obvious stability problem, a generalized prediction model is proposed that yields an MPC controller with four tuning parameters. The first two parameters, namely the horizon time and the relative cart–pendulum weight factor, are automatically adjusted to ensure a priori prescribed system gain margin and fast pendulum response while the remaining two parameters, namely the pendulum and cart velocity weight factors, are maintained as free tuning parameters. The comparison of the proposed method with some optimal control methods in the absence of disturbance input shows an obvious advantage in the average peak efficiency in favor of the proposed SIMO MPC controller at the price of slightly reduced speed efficiency. Additionally, none of the compared controllers can achieve a system gain margin greater than 1.63, while the proposed one can go beyond that limit at the price of additional degradation in the speed efficiency.

Keywords: cart–inverted pendulum (CIP) system, explicit control scheme (ECS), cascade control scheme, model predictive control (MPC), coefficient diagram method (CDM), coincident pole placement method (CPP)

1. Introduction

The cart–inverted pendulum (CIP) system that belongs to the class of fast single-input, multiple-output (SIMO), under-actuated systems and satisfies a set of complicated characteristics, such as fourth-order highly nonlinear dynamics, open-loop instability, state coupling, and non-minimum-phase (NMP) behavior, provides many challenging problems to standard and modern control techniques [1]. In the context of the CIP system stabilization, moving the cart from an initial position to a final destination while keeping the pendulum erected in the upright position has been extensively studied in the past, and many output-feedback and (static and dynamic) state-feedback control techniques have been developed to solve it. However, solving this task efficiently in the framework of linear static state-feedback control (SFC) to ensure prescribed system gain margin in addition to good time response behavior with pole-independent parameter tuning is a subject that still needs further investigation.

There are different types of control methods that have been applied to the inverted pendulum systems [1], including model predictive control (MPC) and non-MPC methods. With regard to the complicated characteristics of the inverted pendulum plants, the needed controlled system performance, and the limited control input effort resource, time-domain optimization techniques, such as the MPC [2,3,4,5,6], seem to be one of the most convenient ways to tackle the above control problem, especially when state and control input constraints are considered. The key feature of the MPC method is based on the following three successive steps [3]: (i) the explicit use of a model and system measurements to predict the future behavior of the controlled variables over a specified future time horizon, (ii) the calculation of a control sequence minimizing a cost function, and (iii) the application of the first control signal of the sequence for a given time before returning to step (i). MPC algorithms differ amongst themselves in the model used to represent the plant, the cost function to be minimized, the optimization method, and the adopted horizon time size and partition. Depending on the optimization problem underhand, they can lead to explicit or non-explicit control schemes. For fast, NMP, and under-actuated systems, such as the CIP system, the above design issues appear to be more challenging when dealing with the design of SIMO MPC controllers, especially if the stabilization requirements are to obtain (i) prescribed system gain margin, (ii) short CIP settling time with insignificant overshoot and undershoot, and (iii) reduced control effort. Examples of MPC and non-MPC methods are described in the next sections.

Based on the linearized CIP dynamics about the upright (unstable) equilibrium point and the linear MPC theory, many linear MPC control schemes have also been designed to solve the CIP stabilization problem. In [7,8], the concept of predictive pole placement was established, and the application of its intermittent linear quadratic formulation to an inverted pendulum was successfully realized in [9], showing good control performance. In [10], a mathematical model of the PS600 CIP system was derived and linearized. Then, a model predictive controller was designed on the basis of a linearized discrete model and a quadratic cost function. The controller was verified in both simulations and real-time experiments. In [11], a linear model predictive control with a quadratic cost function was designed and experimentally validated on a rotary inverted pendulum apparatus to study the effect of the input disturbance. In [12], a cascade MPC CIP stabilization controller was derived from the minimizing of two separate pendulum and cart-associated quadratic functions. The inner and outer controllers are tuned to obtain a double critically damped behavior for the inner and outer loops using a set of two adjusted parameters. Nonlinear MPC techniques have been also proposed to stabilize the CIP system [13,14,15,16]. Although these techniques have shown promising performance in tracking and stabilization problems, they are the most complicated control techniques in implementation due to the difficulties in obtaining an accurate nonlinear model, in adjusting the quadratic cost function weight factors, and in choosing or developing appropriate dedicated online optimization methods. The linear MPC technique shows fewer implementation difficulties, especially for explicit control schemes, in comparison to the nonlinear MPC at a price of reduced performance. Therefore, whenever linear MPC shows good performance for the considered problem, it usually is favored.

On the other hand, there are many proposed (MPC or non-MPC) linear explicit control schemes (ECS) to stabilize the CIP system, where the control input is evaluated directly in a single step and applied at the same time on the cart control input. These ECS include the coincident pole placement (CPP) [17], dominant pole placement (DPP) [18,19], two proportional-integral-derivatives (TPID) [20,21,22,23], and linear quadratic regulator (LQR) [22,23,24]. From the linear control theory point of view, the design task to satisfy some prescribed time (i.e., steady-state and transient) response performance may be regarded as a pole placement problem, especially when using CPP, DPP, and LQR methods. Once this problem is solved off-line by specifying the pole locations a priori with a pole-dependent method, as in the case of the CPP and DPP, or a posteriori with a pole-independent method, as in the LQR method, and the control parameter computation is also conducted off-line, the ECS-based control can be implemented easily in hardware and run in real-time. The design phase with controller tuning, which consists of determining the pole locations for the CPP and DPP and the weight matrices for the LQR, is the main challenge of such methods. It is usually performed by trial and error and depends on the designer’s experience [23]. In general, the use of such a tuning method to obtain some prescribed requirements not only takes much time but also does not guarantee that the best solution possible is found. To solve these difficulties, advanced numerical tuning algorithms, such as the genetic and particle swarm optimization algorithms have been proposed for automatic parameter tuning [24]. Notice that with a reduced set of parameters, the tuning difficulty becomes less problematic. Therefore, if a controller with few tuning parameters shows good performance for the considered problem, this can be seen as a huge practical advantage.

As can be seen, there are several interesting attempts to design linear static SFC for the CIP system stabilization in the form of MPC or non-MPC control schemes. MPC control schemes are more significant, because they can be considered optimal for a specified cost function. Notice that an ECS can be considered as an MPC method if there is a correspondence between the ECS gains and MPC parameters. The comparison, conducted in [25], between the MPC and LQR has shown that the MPC method is more suitable for the trajectory tracking task and smoothing in the control input, while the LQR is more convenient for fixed-value control and disturbance rejection, but it may generate adverse and rapid changes in the control signal. However, for both approaches, MPC and non-MPC, the presence of real NMP zeros in the cart part of the fourth-order linearized CIP transfer function limits the robustness performance and prevents the achievement of monotonic cart step responses. In order to obtain the best possible performance, the optimal choice of the SFC gains needs to be addressed. An important contribution in this line for pole-dependent non-MPC SFC controllers is attributed to the authors of [17], who proposed an analytical formula for optimal tuning of the SFC gains for the CIP system. In the derivation of the formula, the authors promote a priori a coincident-pole structure, which has a single tuning parameter (see Appendix C) for the closed-loop poles before maximizing the worst gain margin associated with the CIP output signals. In doing so, it is clear that the adopted configuration will impose an upper limit for the achievable system gain margin and prevent the controller from exploiting other possible pole configurations that may be more helpful in specifying a priori a prescribed gain margin and in reducing the impact of the closed-loop CIP zeros on its performance. In addition to the CIP stabilization problem, designing controllers that achieve non-overshooting/undershooting for all-pole systems (i.e., systems only having poles in their transfer functions) or minimum overshooting/undershooting for non-all-pole systems (i.e., systems having poles and zeroes in their transfer functions) have received considerable attention [26,27,28,29,30]. In [26], for example, the authors studied the overshoot of an all-pole fourth-order system with respect to the variation of pole locations, where the poles are parameterized with two damping ratios and two undamped natural frequencies. One of the main obtained results states that the system step input overshoot remains unchanged if the ratio of their two natural frequencies is kept constant. The other main result states that the overshoot of the considered system does not have monotonicity with respect to each damping ratio or their sum. The above statements no longer hold for our CIP system, which is a non-all-pole system. In this case, and due to the presence of a single real NMP zero, achieving the cart step response monotonicity with linear SFC is almost impossible [29,30], and it appears natural, as a possible objective, to turn toward the design of controllers that achieve as little undershoot/overshoot as possible while maintaining good robustness stability.

On the other hand, addressing the SFC gains tuning problem to achieve monotonic step responses with pole-independent tuning methods can be performed for all-pole systems using the well-known coefficient diagram method (CDM) [27,28,31]. In this method, controllers are designed via the assignment of the so-called characteristic ratios and generalized time constant (GTC), which may have a strong physical relationship with the damping (i.e., overshoot) and speed of response of the closed-loop system, respectively. With non-all-pole systems, the complexity of pole–zero interaction makes the standard CDM no longer valid. However, in the case of a non-all-pole system with one pair of jω-axis zeroes, the authors of [28] showed the possibility to obtain monotonic step response if the GTC is kept above a certain lower bound. For the CIP system, the cart system can be decomposed into the difference between a non-all-pole system with one pair of jω-axis zeroes and an all-pole system. Since it is impossible to ensure the cart monotonic step response, one can suggest as another possible objective to ensure the monotonicity of the non-all-pole system in the context of Manabe form, as is done in [28], without considering the impact of the remaining all-pole system (see Appendix A). In such a case, the robustness stability issue must be addressed, as well as the overshoot and undershoot response requirements (see Appendix A).

To the authors’ knowledge, the attempts to meet a prescribed system gain margin, short CIP settling time with insignificant overshoot and undershoot, and reduced control effort with pole-independent parameter tuning have not been considered before in the context of linear static SIMO MPC SFC control schemes. The proposed MPC controller is designed with a generalized prediction model to circumvent an obvious stability problem [32] and a quadratic cost function with four control parameters. The first two parameters, namely the horizon time and the relative cart–pendulum weight factor, are automatically adjusted to ensure a priori prescribed system gain margin and fast pendulum response, while the remaining two parameters, namely the pendulum and cart velocity weight factors, are maintained as free tuning parameters to tackle the damping and the reduced control effort problem. In contrast to the existing state-feedback methods to stabilize the CIP system, this work has the following main distinguishing features:

  • From the theoretical point of view, a new robust pole-independent SIMO MPC controller with only two adjusted control parameters is proposed to solve the fourth-order CIP stabilization problem under full state availability, known CIP parameters, and a pendulum mass that is negligible in comparison to the cart mass. It is shown in this paper how to constraint the SFC controllers to ensure a priori a prescribed CIP system gain margin and how to constrain the MPC controller to have a fast pendulum response in comparison to the cart response, i.e., satisfying a two-time-scale structure in which the closed-loop pendulum subsystem responds faster than the closed-loop cart subsystem [33]. This contribution leads effectively to the reduction in the number of free tuned parameters from four to two. Another contribution, mainly inspired by the work of [17], is to derive optimal controllers that maximize the system gain margin for the standard CDM and cascade methods (see Appendix A, Appendix B and Appendix C). Finally, in order to obtain the optimal MPC weight factors and to allow performance comparison, the indices of speed and average peak efficiencies are introduced for characterizing the closed-loop CIP transient responses.

  • From a practical point of view, for a prescribed system gain margin, the impact of the velocity cost function weight factors on the closed-loop system transient performance in the context of the proposed MPC method can be evaluated off-line, and the obtained trends can be easily clarified using two-dimensional graphical and contour plot representations. Some useful guidelines for rapid weighting factor adjustment are developed for the proposed method in the presence (or absence) of disturbance input. Additionally, standard and advanced numerical tuning algorithms can benefit from the obtained reduced two-dimensional space search to achieve global optimality for a given criterion. Such a situation helps in rapidly obtaining a solution to the parameter tuning problem and also helps in checking easily its optimality.

The paper is organized as follows. Section 2 defines the CIP models and states the problem under consideration. Section 3 deals with the SIMO MPC controller design. Section 4 provides simulations results, and finally, Section 5 concludes this paper.

2. CIP Models and Problem Statement

2.1. Nonlinear Inverted Pendulum Dynamics

The CIP system consists of a cart and a rigid rod pendulum with a pivot mounted on the top of the cart, as shown in Figure 1. Under the action of the horizontal force that is regarded as the control input u(t), the cart moves left or right on a one-dimensional bounded track, whereas the pendulum swings in the vertical plane determined by the track. It is assumed that no friction exists in the system between the cart and the track or between the cart and the pendulum. The cart is characterized by a mass M, and the pendulum is characterized by a length L of the mass-less rod and a point mass m located at the free end of the pendulum rod.

Figure 1.

Figure 1

The inverted pendulum system.

To describe the dynamics, a state vector x=(x1,x2,x3,x4)T is used, where the state signals x1, x2, x3, and x4 are the pendulum angle (measured clockwise w.r.t the upward vertical), the angular velocity of the pendulum, the cart position, and the cart velocity, respectively. Using Newton’s laws, it is possible to derive and describe the dynamics of the CIP system explicitly by the following fourth-order SIMO under-actuated and highly nonlinear dynamical system (see Appendix D):

x˙1=x2x˙2=uL1cosx1(M+m)gL1sinx1+mx22sinx1cosx1M+mmcos2x1x˙3=x4x˙4=+umgsinx1cosx1+mLx22sinx1M+mmcos2x1. (1)

Notice that, in the above model, the pendulum and cart accelerations are driven with opposite direction input control forces over the upper half-plane of the pendulum angle. This makes our stabilization task more challenging.

2.2. Generalized Linear Prediction Model

Under the three assumptions taken from [12], the assumption of the light mass at the end of the rod, and the control input domination assumption

sinx1x1;cosx11;gL1>>x22;M>>m;mg|x1|<<|u|, (2)

the nonlinear dynamical model (1) can be approximated by the following linear model:

[x˙1x˙2x˙3x˙4]=[0100a100000010000][x1x2x3x4]+[0b10+b2]u, (3)

with

a1=+gL1;b1=1ML;b2=1Mgb1=a1b2. (4)

Notice that the parameters a1, b1, and b2 are positive. It will be shown later in the paper that all the assumptions given in (2) are indeed satisfied. From (3), one can deduce the following unstable transfer functions:

H1(s)=X1(s)U(s)=b1s2a1H3(s)=X3(s)U(s)=+b2s2. (5)

Let us now define the following generalized linear CIP model with k{1,+1}, which can be seen as a generalization or modification of model (3):

[x˙1x˙2y˙3y˙4]=[0100a100000010000][x1x2y3y4]+[0b10+kb2]u, (6)

where the pendulum subsystem model remains unchanged when compared to the original model (3) while the cart subsystem model is substituted by a fictitious one. This last subsystem, with the new states y3 and y4, is driven at each instant t with the control input ku. Using the above-adopted model (6), and assuming y3=x3 and y4=x4 at the instant t, we define the following pendulum and cart generalized prediction models:

x^1(t+h)=A1x1(t)+B1x2(t)E1ux^2(t+h)=A2x1(t)+B2x2(t)E2uy^3(t+h)=x3(t)+hx4(t)+kE3uy^4(t+h)=x4(t)+kE4u, (7)

where h is a positive constant horizon time. If u(t)=u is constant on the interval [t,t+h], then by solving (6) and using the last equality of (4), we obtain (7) with the following parameters:

A1=cosh(a11/2h)A2=a11/2sinh(a11/2h)B1=a11/2sinh(a11/2h)B2=cosh(a11/2h)E1=2a11b1sinh2(a11/2h/2)E2=a11/2b1sinh(a11/2h)E3=0.5b2h2E4=b2h. (8)

Notice that the parameters A1, A2, B1, B2, E1, E2, E3, and E4 are positive.

2.3. Problem Statement

For the CIP system stabilization, we are interested in using the following static state-feedback control law (SFC):

u(t)=+Nx(t)N3x3d=+N1x1(t)+N2x2(t)+N3[x3(t)x3d]+N4x4(t), (9)

where N=(N1,N2,N3,N4) is the control gain vector to be determined, and x3d is the constant cart reference position. The main result of this paper is how to simplify the control tuning based on the good robustness properties of the control system and the optimized two-time controller structure. The controllability of the linear system (3) ensures the existence of a set of SFC having the form (9) that can achieve the stabilization of the CIP system in the vicinity of the unstable equilibrium point. Combining (5) and the Laplace transform of (9) yields:

F1(s)=X1(s)X3d(s)=+b1N3s2P(s)F3(s)=X3(s)X3d(s)=b2N3(s2a1)P(s)Fu(s)=U(s)X3d(s)=N3s2(s2a1)P(s)P(s)=s4+(b1N2b2N4)s3+(b1N1b2N3a1)s2+a1b2N4s+a1b2N3. (10)

The obtained linearized closed-loop CIP system is of the fourth order and has the basic configuration shown in Figure 2. Obviously, each controller belonging to the class (9) can be interpreted as the combination of two PD controllers (one for the pendulum subsystem and the other for the cart subsystem).

Figure 2.

Figure 2

State-feedback linearized CIP control system.

Remark 1. 

The fourth-order transfer functions (10) must at least be stable. The Routh necessary and sufficient stability conditions for such a system are given in (11). From these conditions and the positivity of the parameters (4), it follows that the controller gains(N1,N2,N3,N4)must be positive.

b1N2b2N4>0b1N1b2N3a1>0N4>0N3>0b1N2b2N4>N4N2gN1N2N3N4a1b1N2. (11)

Remark 2. 

The cart transfer function F3(s)has two real opposite zeroes, z1,2=±a11/2. The single real NMP zero, z2=+a11/2, provokes the appearance of an undesirable initial undershoot in the cart step response. The amplitude of this undershoot grows to infinity if the settling time is reduced to 0 [30]. This behavior therefore places a limitation on the cart speed of response. Overshoot in the above response is another undesirable effect that may be reduced with the undershoot phenomenon if an appropriate selection of SFC controller gains is conducted.

Remark 3. 

From (10), it can be seen that the pendulum and cart transfer functionsF1(s)andF3(s)have fixed zeroes and adjustable poles. Thus, whatever the method used to determine the SFC gains, it is always considered as a pole placement method. In the sequel, we shall refer to the SFC control method (9) as pole-dependent and pole-independent if its gain tuning method specifies a priori and a posteriori the closed-loop pole locations of (10), respectively.

In this paper, we investigate a subclass of SFC controllers (9), where the designed controller carries all the key features of SIMO MPC controllers and can be implemented in an ECS fashion. As it is depicted in Figure 3, the considered controller receives as its inputs the cart reference position, x3d, and the state vector, x=(x1,x2,x3,x4), and produces as its single output the control signal, v, which is fed directly to the control input u of the CIP system. In our investigation, we need to consider the following assumption.

Figure 3.

Figure 3

The proposed CIP SIMO MPC control system.

Assumption 1:

  • the state vector x=(x1,x2,x3,x4) is measurable;

  • the parameters (M,m,L,x3d) are known constants; and

  • the set of hypotheses in Equation (2) is satisfied.

Under Assumption 1, the CIP stabilization problem may be formulated in two steps as follows. In the first step, restrict the class of SFC controllers so as to satisfy the following property:

  • Property 1: The SFC controller (9) ensures robust stability with a prescribed closed-loop gain margin, GMmin.

In the second step, and under the above gain margin constraint and the generalized prediction model (7) and (8), design a pole-independent SIMO MPC controller such that the following properties are satisfied:

  • Property 2: The closed-loop CIP system satisfies a two-time-scale structure in which the closed-loop pendulum subsystem responds faster than the closed-loop cart subsystem [33].

  • Property 3: In the absence of disturbance input, the controller ensures low cart settling time T>0 without excessive peaking (undershoot and overshoot) phenomenon in the CIP input and output responses.

3. SIMO MPC Controller Design

To design robust SIMO MPC controllers, we first developed Equation (23), associated with the SFC gain margin property, using the stability conditions (11) and the prescribed gain margin, GMmin. Then, Property 1 holds when the above-mentioned equation is satisfied. Next, based on the generalized prediction model (7–8) and the quadratic cost function (24) that is characterized by a set of MPC parameters φ=(h,ρ1,r,ρ2), i.e., the horizon time and the positive weight factors to be defined later, we establish the explicit relationship (26) between φ and N. Finally, we propose a pole-independent tuning method that relies primarily on choosing the parameter set φ of the cost function as a starting point of the optimal control. This method leads to determining the pole locations a posteriori, i.e., at its final stage. Notice that the controller design developed hereafter uses the linearized CIP model (3) in the neighborhood of the equilibrium point, where it is assumed to be justified at least from the approximation point of view. The validity of the adopted design methods is demonstrated from the control point of view in Section 4, where the nonlinear model (1) is used instead of its linearized version.

3.1. Closed-Loop SGM Constraint

Let us consider the following change of variables:

N1=α12N2N3=α34N4α=α12α34, (12)

where α12>0 and α34>0. Introducing (12) in the last condition of (11) gives:

b1N2>[1+b1b2gb1N2αa1]b2N4. (13)

This means that there exists a critical gain N4 that depends on the fixed value of N2. The critical positive gain b2N4i>0 at which the system becomes marginally stable is:

b2N4i=b1N21+b1b2gb1N2αa1>b2N4. (14)

From (14), we define the cart-loop gain margin as follows:

GMC=b2N4ib2N4=b1N2b2N411+a1b1N2αa1. (15)

On the other hand, the critical positive gains b1N2iα>0 and yi=b1N2iαa1 at which the system becomes marginally stable is deduced from (13) as the positive solutions of the following system of equations:

yi2+[a1b2N4α]yigb1N4α=0yi=b1N2iαa1, (16)

with

yi=12[b2N4αa1]+12[b2N4αa1]2+4gb1N4α>0b1N2iα=yi+a1. (17)

Using the last equality gb1=a1b2 of (4), (17) reduces to:

yi=b2N4α>0b1N2iα=b2N4α+a1. (18)

From (18), we define the pendulum-loop gain margin as follows:

GMP=b1N2αb1N2iα=b1N2b2N411+a1b2N4α. (19)

Taking into account the worst case, the system gain margin index can be defined as follows [17]:

GM=min(GMP,GMC)GM={GMC if (b1N2b2N4)α<a1GMP if (b1N2b2N4)αa1. (20)

Since the gain margins, GMC and GMP, must be greater than 1, from (16) and (19), we deduce, respectively:

(b1N2b2N4)α=a1+(GMC1)αb2N4a1(b1N2b2N4)α=GMPa1+(GMP1)αb2N4a1. (21)

Regarding (21), the system gain margin (20) reduces to:

GM=GMP=b1N2b2N411+a1b2N4α. (22)

The following remark states some of the obvious properties of the closed-loop system gain margin.

Remark 4. 

Consider the fourth-order closed-loop system in (10) with the change of variables (12) and the system gain margin (22). Then, the following properties hold: (i) with fixed parametersα and b1N2, GM decreases with the increase in b2N4; (ii) with fixed parameters α and b2N4, GM increases with the increase in b1N2; and (iii) with fixed parameters b1N2 and b2N4, GM remains unchanged for any value γ if N1 and N3 are increased by the amounts N2γ and N4γ, respectively.

Remark 5. 

Given the value ofGMminand using Equation (22), Property 1 can be satisfied under the following constraint:

(b1N2GMminb2N4)α=GMmina1, (23)

whereαis defined in (12).

3.2. Deriving the SIMO MPC Control Law

To derive the SIMO MPC control law, we first adopt the generalized linear prediction model (7) and (8) as a prediction model for the CIP system. Then, we consider the following quadratic cost function:

JMPC(t,h)=12e12(t+h)+12ρ1e22(t+h)+12re32(t+h)+12rρ2e42(t+h)e1(t+h)=0x^1(t+h)e2(t+h)=0x^2(t+h)e3(t+h)=x3dy^3(t+h)e4(t+h)=x˙3dy^4(t+h)=0y^4(t+h), (24)

where φ=(h,ρ1,r,ρ2)+4 is the set of positive MPC parameters, e1(t+h) and e2(t+h) are the predicted pendulum angle and velocity errors, while e3(t+h) and e4(t+h) are the predicted cart position and velocity errors. Now, given the cart destination x3d and the prediction models (7), we obtain the optimal input v0, which minimizes the value of the quadratic cost function (24). Recall that the references associated with x1, x2, and x4 are assumed to be zero. Substituting (7) into (24) and setting the gradient of JMPC(t,h) with respect to v to zero yields:

v0(t)=+N1x1(t)+N2x2(t)+N3[x3(t)x3d]+N4x4(t), (25)

with the SIMO MPC controller gains given below:

N1=D1A1E1+D1ρ1A2E2N2=D1B1E1+D1ρ1B2E2N3=D1r×kE3N4=D1r×k(hE3+ρ2E4)D=E12+ρ1E22+r×k2(E32+ρ2E42). (26)

Notice that the positivity of (ρ1,r,rρ2) together with the positivity of (8) imply the positivity of N1 and N2 in addition to the fact that the gains N3 and N4 have an undefined sign that follows the one associated with the value of k.

Remark 6. 

Since the signs ofN3andN4are identical to the sign ofk, the designed SIMO MPC controller that is defined by (23) and (24) does not ensure the stability condition (11) when using the trivial prediction model, i.e., the generalized prediction model (10) withk=+1. To circumvent such a drawback, we putk=1in the above model. This modification is equivalent to the substitution of the original cart subsystem model by another cart subsystem model that is driven at each instant and from the same state point with an opposite control input sign. The modification of the cost function while keeping the trivial prediction model is another option to solve the encountered stability problem. However, this option is more complicated and is not considered in this paper.

3.3. Time-Scale Structure Constraint

Here, we are interested in designing a SIMO MPC controller that ensures Property 2, i.e., a two-time-scale structure, without using a priori targeted closed-loop pole locations. To this end, the CIP control system of Figure 2 is transformed to the configuration of Figure 4, where a virtual reference x1d and a new gain K are introduced for the closed-loop pendulum subsystem. The pendulum transfer function to the considered reference and the value of K that leads to a unity static gain are given by:

F1F(s)=X1(s)X1d(s)=Kb1s2+b1N2s+b1N1a1K=N1+a1b11=N1+Mg. (27)

Figure 4.

Figure 4

Equivalent state-feedback linearized CIP control system.

The GTC associated with (10) and (27) are given, respectively, by:

τC=a1b2N4a1b2N3=N4N3=1α34τP=b1N2b1N1a1=1α12MgN21>τPL=1α12, (28)

where τPL is a lower bound for the pendulum GTC.

For the sake of developing a general tuning method that is valid for a large class of CIP systems, let us define the following normalized parameters:

h0=a11/2hρ10=a1ρ1ρ20=a1ρ2r0=a12g2rr12=a11/2α12r34=a11/2α34φ0=(h0,ρ10,r0,ρ20), (29)

where φ0 is the set of normalized positive MPC cost function parameters. Substituting (29) into (28) and (26), we obtain:

r12=N10N20=a11/2τPL1r34=N30N40=a11/2τC1, (30)
N1=a1b11N10D01N2=a11/2b11N20D01N3=a1b21N30D01N4=a11/2b21N40D01, (31)

with

N10=2sinh2(h0/2)cosh(h0)+ρ10sinh2(h0)=4sinh4(h0/2)+2sinh2(h0/2)+ρ10sinh2(h0)N20=2sinh(h0)sinh2(h0/2)+ρ10cosh(h0)sinh(h0)N30=0.5r0h02N40=r0(0.5h02+ρ20)h0D0=4sinh4(h0/2)+ρ10sinh2(h0)+r0(0.25h02+ρ20)h02. (32)

Now, combining (30) and (32) yields:

r12=a11/2τPL1=4sinh4(h0/2)+2sinh2(h0/2)+ρ10sinh2(h0)2sinh(h0)sinh2(h0/2)+ρ10cosh(h0)sinh(h0). (33)

In order to speed the pendulum response, we plan to maximize (33) by a proper choice of the horizon time. More precisely, for a given positive value ρ10, setting the gradient of r12 with respect to h0 to zero yields:

h0=ln[1+ρ10+ρ102+ρ10(1+2ρ10)(2+ρ10)1ρ102]. (34)

From (34) and Figure 5, it is clear that the parameter ρ10 increases with the increase in h0 and satisfies 0<ρ10<1. Concerning the amount r12, it is obvious that this parameter decreases toward 1 with the increase in h0.

Figure 5.

Figure 5

Evolution of (r12, ρ10, ρ20min) versus h0.

Now, introducing the expressions of α12 and α34 from (29) together with the expressions of N2 and N4 from (31) in (23) gives

(N20GMminN40)(r12r34)=GMminD0. (35)

Substituting the expressions of N40 and D0 from (32) in (35) and solving for r0 gives:

r0=GMmin1N20(r12r34)4sinh4(h0/2)ρ10sinh2(h0)(r12r34)(0.5h02+ρ20)h0+(0.25h02+ρ20)h02. (36)

Since h0 and ρ10 are linked by the relationship (34) and the fact that r0 and (h0,ρ10,ρ20) are linked by the relationship (36), we only need to specify two parameters among the elements of the set φ0 to achieve the controller design. In our case, we shall fine-tune the weight factor vector ρ0=(ρ10,ρ20) inside an appropriate domain. We have already shown that 0<ρ10<1, and the interval associated with ρ20 remains to be determined. To this end, it is reasonable to satisfy Property 2 by assuming that the GTC of the closed-loop cart subsystem is lower than the GTC of the pendulum subsystem. According to (30) and (32) and the fact that r0>0, this leads us to impose the constraint:

0<r34=0.5h00.5h02+ρ20<r12, (37)

from which we obtain:

ρ20min<ρ20<ρ20min=12(r121h0)h0. (38)

A lower bound ρ20min, which is shown in Figure 5 as a function of h0, is then imposed to ρ20 to ensure Property 2.

Now, let us define a set of five indices to evaluate the CIP transient response performance: Px1(ρ0), the maximum absolute value of the pendulum angle response; Vx3(ρ0), the cart response overshoot; Dx3(ρ0), the cart response undershoot; Pu(ρ0), the maximum absolute value of the control input signal; and tcs(ρ0), the cart settling time at 5%. To derive general comments on the behavior of the CIP system, it is useful to introduce the change variable s=a11/2z and the gains (31) in (10) to obtain the following transfer functions:

F1(z)=L1N30D01z2P(z)F3(z)=N30D01(z21)P(z)Fu(s)=MgL1N30D01z2(z21)P(z)=P(z)=z4+(N20D01N40D01)z3+(N10D01N30D011)z2+N40D01z+N30D01. (39)

With the above processing, all the considered CIP transient response characteristics remain unchanged, except the one associated with the modified (normalized) cart settling time tcsn, which is now linked to the original cart settling time tcs by the relationship:

tcsn=a11/2tcs. (40)

Regarding the transfer functions (39), it appears naturally useful to normalize the indices associated with the pendulum angle response and control input signal as follows:

Px1n=LPx1Pun=Pu/(MgL1). (41)

Now, we are ready to summarize in Algorithm 1 how to design the proposed SIMO controller and how to obtain the closed-loop linearized CIP performance when the system gain margin GMmin and the weight vector ρ0=(ρ10,ρ20) are chosen.

Algorithm 1: Controller design and closed-loop linearized CIP system evaluation.
  1. Set the worst system gain margin, GMmin, and choose the weight vector ρ0=(ρ10,ρ20);

  2. Evaluate (h0,r12,r34,r0,ρ20min) using (34), (33), (37), (36), and (38), respectively;

  3. If r0 < 0 or ρ20ρ20 min, go to step 7;

  4. Evaluate (N10, N20, N30, N40, D0) using (32);

  5. Evaluate the step responses of (39) with (L,Mg) = (1,1);

  6. Evaluate the indices (Px1n,Vx3n,Dx3n,Pun,tcsn) from the obtained step responses;

  7. End.

3.4. Parameter Tuning

In this section, parameter tuning for the weight vector ρ0 is performed to ensure the validity of Property 3. To this end, the peaking and speed constraints need to be considered simultaneously. To solve this problem, a time-domain optimization technique that involves tackling two issues, namely the choice of the performance criterion and the mean to optimize it, is proposed hereafter. For the first issue, we have found it useful, regarding the desirable Property 3, to define two proposed sound and scaled-free indices. These indices are the speed efficiency SE(ρ0) and the average peak efficiency SE(ρ0) and are defined for a given standard (reference) state-feedback method (SSF) and our control method as follows:

SE(ρ0)=100P4,SSFP4(ρ0)+P4,SSFGE(ρ0)=1003i=13Pi,SSFPi(ρ0)+Pi,SSF, (42)

where P1(ρ0)=Px1(ρ0), P2(ρ0)=Px3(ρ0), P3(ρ0)=Pu(ρ0), and P4(ρ0)=tcs(ρ0). The index Pi,SSF has the same interpretation as the index Pi(ρ0), with the proposed control method replaced by the SSF one. Obviously, an efficiency index greater than 50% indicates that the proposed control method outperforms the SSF method; otherwise, degradation in the performance of our method in comparison to the SSF one is noted. Then, taking into account the worst efficiency case, we may formulate the setting problem as a maximin optimization model as follows:

(ρ10,g,ρ20,g)=arg(ρ10,ρ20)max0ρ10ρ10,max0ρ20ρ20,maxJ1(ρ0)J1(ρ0)=min(SE(ρ0),λ1GE(ρ0)), (43)

where λ=0 if speed efficiency is the only concern, and λ=1 if peaking and speed efficiency are both considered in the optimization. It should be noted that (43) is a continuous nonlinear optimization model with a highly nonlinear (possibly discontinuous) objective function J1(ρ0), and it is difficult to know a priori whether such a function is unimodal or multimodal before starting the optimization. To avoid erroneous solutions, problem (43) has to be solved to global optimality in the considered parameter space domain. Then, to solve the second issue, we opt for global optimization using an exhaustive search over the domain that is defined by 0ρ10ρ10,max and 0ρ20ρ20,max. This is interesting, since there are only two tuning parameters.

4. Numerical Simulations

Numerical simulations are conducted in several separate sections to show the potential of the proposed pole-independent SIMO MPC controller (SIM) and its advantage in comparison to the pole-independent standard CDM controller (CDM), the pole-dependent CASC MPC controller (CAS), and the coincident pole controller (CPP). The definitions of the CDM, CAS, and CPP controllers are given in Appendix A, Appendix B and Appendix C, respectively. These controllers are chosen for their efficiencies and for the reduced number of adjusted parameters, which leads to yield, without significant difficulty, a guarantee of the overall best performance for each tuned controller. Based on the linearized CIP system, we begin the numerical simulation in Section 4.1 with the establishment of some guidelines that help in choosing the SIM weight factors for a prescribed system gain margin in the absence of disturbance input. These guidelines are quite general, since they can be applied to any linear CIP model of the form (3), as long as Assumption 1 stays satisfied. Additionally, they are very useful since they inform us how to tune the SIM weight factors to get some desirable transient closed-loop linearized CIP system performance. Next, to make a fair comparison, we tune, for a given physical CIP system, the best possible CDM, CAS, and CPP controller so as to obtain a maximum system gain margin for each tuned controller. For the SIM controller, the system gain margin is set a priori as the best one so far obtained by the above controllers, and the SIM weight factors are tuned according to (43). In Section 4.3 and Section 4.4, we compare the obtained controllers on the nonlinear CIP system in the absence and presence of disturbance input.

4.1. Guidelines for Weighting Factor Adjustment

To obtain some insights into how to choose the weight vector ρ0=(ρ10,ρ20) for different system gain margin values in the absence of disturbance input, we conducted 50,000 simulations with Algorithm 1 (obtained by using a coarse grid discretization with 50 points for ρ10 in the interval 0<ρ10<0.5, 50 points for ρ20 in the interval 0<ρ20<5, and 20 points for GMmin in the interval 1<GMmin<3). Figure 6 and Figure 7 show the obtained contour plots for the considered transient response characteristics for GMmin=1.63. To generate the contours, we constructed a grid interpolant using the Matlab function “griddedInterpolant” with the “pchip” option and interpolate the considered index with 103 spacing. To avoid dummy solutions, only the contours that have more than 103 points were retained.

Figure 6.

Figure 6

CIP control performance GMmin=1.63. (a) Normalized pendulum maximum absolute value index; (b) cart overshoot and undershoot indices.

Figure 7.

Figure 7

CIP control performance for GMmin=1.63. (a) Normalized control input maximum absolute value index; (b) normalized cart settling time index.

From Figure 6 and Figure 7, one can observe that the choice of the weight factors cannot be performed independently to address the peaking phenomenon and the cart speed of response issues simultaneously. Additionally, a strong correlation between the contours of Px1n and Dx3 is noticed. For the peaking phenomenon, it is clearly seen that the increase in ρ20 while maintaining ρ10 constant reduces without ambiguity the cart overshoot Vx3(ρ0). Ensuring a non-overshooting behavior for the cart subsystem thus appears possible with a high enough value of ρ20. For a constant ρ10, the peaking phenomenon indices, namely Px1n, Dx3, and Pun, are only reduced when ρ0 is located above their associated red line frontiers, as shown in Figure 6 and Figure 7. The evaluation of these frontiers is performed without considering the interpolation processing. Ensuring a reduced undershooting behavior for the cart subsystem appears to be possible with the increase in ρ20. In addition to that, one can also clearly observe that the increase in ρ10 while maintaining ρ20 constant contribute, without ambiguity, to reduce the control input effort Pun and to increase the cart overshoot Vx3(ρ0), as long as the value of ρ10 still far enough from the frontier that it defines the validity of the MPC controller (see step 3 of Algorithm 1). For a constant ρ20, the increase in ρ10 may lead to an increase in Px1n and Dx3. Concerning the cart speed of response, there are two separate regions, i.e., the upper region UR and the lower region LR, where the settling time can be reduced below the value 20. These regions are located above the red line, where the increase in ρ20 while maintaining ρ10 constant reduces the peaking phenomenon. Table 1 shows the associated region frontier locations and transient performance intervals. Obviously, choosing ρ0 in the UR appears to be more recommended than choosing it in the LR.

Table 1.

Transient response performance on the UR and LR frontiers.

ρ 10 ρ 20 P x1n Vx3 (%) Dx3 (%) P un
LR 0.03 − 0.10 0.38 − 0.96 3.05 − 5.61 12.32 − 26.55 3.15 − 7.68 0.11 − 0.47
UR 0.02 − 0.16 0.87 − 3.23 0.97 − 2.80 0.36 − 6.31 0.82 − 3.61 0.02 − 0.22

When the cart speed is the only concern, i.e., when using the proposed tuning method (43) with λ=0, there is no need to select an SSF method, and the best-achieved cart settling time, tcsn,b, and its associated weight factor, ρ10,b, are defined as follows:

tcsn,b=min0ρ100.50ρ205tcsn(ρ10,ρ20,GMmin)ρ10,b=argρ10min0ρ100.50ρ205tcsn(ρ10,ρ20,GMmin). (44)

Figure 8 shows the above indices together with the cart overshoot and undershoot as a function of the prescribed system gain margin. It is clearly seen that the best-achieved cart settling time is obtained with GMmin1.2; additionally, the increase in GMmin above this value leads to an increase in the optimal cart settling time and a reduction in the cart undershoot. The cart overshoot remains between 4 and 5%, while the optimal weight factor ρ10,b remains, in all studied cases, under 0.12. As we shall see in the next section, the considered controllers for the comparison task do not exceed a system gain margin of 1.63, while our SIM controller can go beyond this limit, as is indicated in Figure 8.

Figure 8.

Figure 8

Impact of the gain margin on the cart transient of response. (a) Optimal cart control subsystem performance versus gain margin; (b) weight factor versus gain margin setting for high cart speed of response.

4.2. Disturbance-Free Parameter Tuning

Now, let us consider the nonlinear CIP system (1) with a set of physical parameters M=2.4 kg, m=0.23 kg, L=0.36 m, g=9.81 m/s2, and a cart track length limited between ±0.5m [23]. Since the behavior of the linearized CIP system is considered quite similar to the behavior of the nonlinear CIP system in the vicinity of the equilibrium point, the tuning of the controllers is thus based only on the linearized system. Figure 9 and Figure 10 show the evolution of the system gain margin together with the associated linearized cart transient performance for the considered controllers. In contrast to non-undershooting cart response behavior, ensuring a non-overshooting cart response behavior is a feasible objective for all controllers. Reducing the cart response undershoot can be undertaken at the price of an increase in the cart settling time and/or deterioration in the system gain margin performance. Regarding the system gain margin trend, it is seen that the system gain margin is limited by about 1.4 for the CDM method and by about 1.6 for the CAS and CPP methods. For these methods, dependencies are typically observed between the system gain margin and the tuning parameters, which render them less flexible. Now, following [17], we retain the best tuning parameter for the CDM, CAS, and CPP as those that maximize the system gain margin. The gains of these controllers and the resulting linear CIP system transient performance are summarized in Table 2 and Table 3, respectively. The transient performances are obtained for zero initial conditions and a cart step x3d=0.1.

Figure 9.

Figure 9

Evolution of the system gain margin and the cart response performance. (a) CDM method; (b) CAS method.

Figure 10.

Figure 10

Evolution of the system gain margin and the cart response performance. (a) CPP method; (b) SIM method.

Table 2.

State-feedback controller gains and associated parameters.

N1 N2 N3 N4 Control Parameter GM
CPP 91.43 17.46 13.08 14.99 p=3.49 1.62
CDM 82.08 14.87 16.35 14.81 τ=0.90 1.39
CAS 230.9 53.31 73.72 73.72 h0=0.20 1.59
SIM 75.39 10.21 5.63 7.55 (ρ10,ρ20)=(0.09,1.5) 1.63

Table 3.

Controller performance comparison using linearized CIP model.

Px1 (deg) Vx3 (%) Dx3 (%) Pu(N) tcs (s) SE (%) GE (%)
CPP 0.93 0.00 3.83 1.31 2.14 50.0 50.0
CDM 1.44 0.03 5.92 1.68 1.47 59.2 40.8
CAS 1.63 0.00 8.22 7.53 2.36 47.5 27.6
SIM 0.62 4.51 2.40 0.56 2.55 45.6 58.6

For the proposed SIM method, one can impose a priori the system gain margin as a constraint to the controller. Using a gain margin of 1.63 and applying the proposed tuning method (43) with λ=1, we obtain ρ10,g=0.09 and ρ20,g=1.5. The obtained controller gains and their associated performances are also given in Table 2 and Table 3, respectively. Notice that the last assumption of Equation (2) is satisfied, since we have, for all controllers, N1>>mg. In comparison to the other control methods, the proposed SIM method has the best pendulum angle deviation, cart undershoot, and control input effort at the price of degradation in the cart overshoot and settling time. Using the CPP for the linearized CIP system as an SSF, we obtained the best GE at a price of a slight degradation in SE, as is indicated in Table 3. Figure 11 indicates that the obtained optimal SIM controller depends only on SE, since we always have SE<GE. In addition to that, Table 4 and Figure 11 show that the optimal SIM controller is characterized by a set of two complex conjugates poles: one of them has a real part that is equal to the CPP controller pole, and the other one has the highest real part, which somewhat explains why the SIM controller has the largest cart settling time. Finally, Figure 10 tells us that the proposed SIM method allows to further reduce the peaking phenomenon while maintaining the system gain margin constant by increasing ρ20 above 1.5 and maintaining ρ10=0.09, but this enhancement is followed by an increase in tcs. In the vicinity of ρ20=2.5, the overshoot vanishes, and the cart settling time takes a value of about 4.8 s.

Figure 11.

Figure 11

Analysis of the SIM method performance. (a) Speed and peak efficiencies; (b) pole locations.

Table 4.

Closed-loop poles for the considered controllers.

p1+ip¯1 p2+ip¯2 p3+ip¯3 p4+ip¯4
CPP 3.49 3.49 3.49 3.49
CDM 2.780.90i 2.783.82i 2.78+3.82i 2.78+0.90i
CAS 3.084.05i 1.39 23.77 3.08+4.05i
SIM 3.495.58i 0.850.84i 0.85+0.84i 3.49+5.58i

4.3. Performance Analysis without a Disturbance Input

To compare the performance of the above controllers, the reference cart position x3d is set to 0m, and all the initial state values are set to zeros, except the initial cart position, which is set to x3(0)=0.1 m. The simulation results applied to the nonlinear CIP system (1), without considering disturbance input, are shown graphically in Figure 12 and Figure 13 and numerically in Table 5. One can notice at this point that all the assumptions given by Equation (2) are satisfied. The strong similarity that exists between the performance of the closed-loop linearized CIP system in Table 3 and the performance of the closed-loop nonlinear CIP system in Table 5 for each controller confirms the potential of linear control theory in solving the considered stabilization problem. On the other hand, Figure 12 shows that the SIM method exhibits good performance in the control input effort, the pendulum angle, and the cart position undershoot at the price of relatively large cart position overshoot and settling time. Concerning Figure 13, the fact that the control input effort demand occurs essentially at the beginning of the CIP system stabilization for all the considered controllers is obviously distinguished.

Figure 12.

Figure 12

CIP output responses using CAS, CPP, CDM, and SIM methods. (a) Pendulum angle responses; (b) cart position responses.

Figure 13.

Figure 13

CIP input responses using CAS, CPP, CDM, and SIM methods. (a) Overall input responses; (b) restricted input responses.

Table 5.

Controller performance comparison using nonlinear CIP model.

Px1 (deg) Vx3 (%) Dx3 (%) Pu(N) tcs (s) SE (%) GE (%)
CPP 0.95 0.00 3.88 1.31 2.16 49.8 49.7
CDM 1.47 0.00 5.99 1.68 1.42 60.1 40.5
CAS 1.64 0.00 8.26 7.53 2.37 47.4 27.5
SIM 0.62 3.65 2.37 0.54 2.55 45.6 60.6

When the initial states begin from an equilibrium point and reach another one while the assumptions of Equation (2) are satisfied during the motion, the controlled nonlinear CIP system behaves similarly to the controlled linear CIP system. In such a situation, the performance of the controlled nonlinear CIP system can be naturally evaluated using the contour plots of Figure 6 and Figure 7 for a gain margin of 1.63 and given MPC parameters. However, when the states are far away from the equilibrium point, the classical prediction model, defined by Equations (7) and (8) together with k=+1, gives only partially consistent estimation or no longer holds at all. In the former case, which is not covered by the numerical study, it may perhaps be interesting to design the SIMO MPC controller in such a way to increase enough ρ0=(ρ10,ρ20) in the hope to simultaneously increase the pendulum gains N1 and N2 while reducing the cart gains N3 and N4, as is suggested by Equations (26) and (29). When the pendulum angle is too large, the linearized model loses its validity, and it is necessary to apply some nonlinear technique to bring the pendulum to the vicinity of the upright equilibrium state. Then, it is possible to switch to the proposed control.

4.4. Performance Analysis with a Disturbance Input

To check the sensitivity of the studied controllers to disturbance input, a Matlab-Simulink band-limited white noise n(t) with sampling time 0.01 s, seed = 23,341, and power P=103 was added to the control input signal of the nonlinear CIP system over a time interval of 120 s [23]. This disturbance input, which has an amplitude between 1.17 and 1.25, is depicted in Figure 14. Table 6 summarizes the resulting maximum absolute values of the CIP input and output signals that are obtained from the analysis of the last 100 s, i.e., in the assumed steady-state region. Regarding the disturbance input amplitude that appears in Figure 14, the controller performance of Table 5, and the maximum absolute values of Table 6, all the considered controllers appear to exhibit useful noise rejection capabilities. The CAS method, with its highest gains (see Table 2) and high transient control input effort (see Table 5), exhibits the best noise rejection capabilities in the steady-state region, whereas the proposed SIM method, with its lowest gains (see Table 2) and lowest transient control input effort (see Table 5), exhibits the worst one in the same steady-state region. Figure 15 and Figure 16 show graphically the obtained CIP output responses. Although the disturbance input level appears to dominate completely the amplitude of the SIM transient noise-free control effort, i.e., 0.54 N (see Table 5), the proposed SIM method still performs well even with a tuning method that does not take into account the presence of a disturbance input.

Figure 14.

Figure 14

Input disturbance signal.

Table 6.

Performance in a noisy situation for 20 s ≤ t ≤ 120 s.

CPP CDM CAS SIM
σx1 (deg) 0.2014 0.3107 0.0750 0.2647
σx3 (m) 0.0080 0.0078 0.0016 0.0153
σu (N) 0.4271 0.4615 0.5623 0.4203

Figure 15.

Figure 15

Pendulum angle responses with a disturbance input.

Figure 16.

Figure 16

Cart position responses with a disturbance input.

Let us now study the impact of the weight vector ρ0=(ρ10,ρ20) on the closed-loop nonlinear CIP steady-state performance when the adopted disturbance input is considered. To this end, we keep GMmin=1.63 and conduct 2500 simulations (obtained by using a coarse grid discretization with 50 points for ρ10 in the interval 0<ρ10<0.5 and 50 points for ρ20 in the interval 0<ρ20<5). Figure 17 shows the obtained contour plots for the maximum absolute values of the input and outputs nonlinear CIP system. It can be seen that the weight factor ρ10 has a strong impact on σx1, σx3, and σu, while ρ20 has only a moderate impact on σx3. In addition, one can clearly see that the increase in ρ10 reduces the CIP input index σu, but as a side effect, it increases the CIP output indices σx1 and σx3.

Figure 17.

Figure 17

CIP steady-state performance with a disturbance input using SIM controllers. (a) Maximum absolute values of the pendulum angle and cart position; (b) maximum absolute values of the control input signal.

Remark 7. 

This paper does not address the measurement noise issue explicitly. The experience with the cascade MPC controller in [17] suggests that low and moderate measurement noises are not problematic, and simple low-pass filtering can be used. This is also favored in [33]. However, high measurement noises present a challenge to the implementation of linear static state-feedback controllers. The problem lies in the need for differentiation of the pendulum angle and the cart position w.r.t. time. In such a situation, it is useful to consider sophisticated differentiations or other controller structures such as the output-feedback controller with or without observers to perform more efficient noise filtering at the price of an increase in controller design complexity.

5. Conclusions

Most of the standard non-MPC linear static state-feedback controllers that are used to stabilize the nonlinear fourth-order CIP system around its desired equilibrium points are designed using pole-dependent techniques and without referring to a meaningful and transparent cost function. They are also prone to parameter tuning difficulties when dealing with the system transient performance and stability robustness simultaneously. To circumvent these difficulties, a linear static pole-independent MPC controller with only two tuning parameters is proposed. The controller ensures a priori a prescribed system gain margin and a two-time scale structure, allowing the response of the pendulum to be fast in comparison to the cart response. Using the linearized CIP model, some guidelines are developed in the form of two-dimensional contour plots for rapid MPC parameter tuning. Compared to the optimal CMD, CAS, and CPP methods, the proposed SIM method exhibits the best performance in the absence of disturbance input at the price of a slight degradation in the cart position overshoot and settling time. Future works will be devoted to studying the region of attraction associated with the proposed controller, reducing the sensitivity of the proposed method to disturbance input by optimizing the two-time-scale structure, relaxing the hypothesis of Assumption 1, and applying a similar method for other (linearized) robotic systems, e.g., the rotary inverted pendulum robot and the CIP with multiple links. The extension of the proposed method to deal directly with nonlinear system dynamics with or without uncertainties is also an interesting research topic.

Acknowledgments

The authors would like to thank the Ministry of Higher Education and Scientific Research of Algeria for its support (PRFU A01L08UN210120220001).

Appendix A. The Standard CDM Controllers (CDM)

The CDM method is a general design tool that can be applied for example to a general fourth-order all-pole closed-loop system with the following transfer function:

G(s)=c0c4s4+c3s3+c2s2+c1s+c0, (A1)

where c0,c1,c2,c3,c4 are the coefficients of its the characteristic polynomial:

C(s)=c4s4+c3s3+c2s2+c1s+c0. (A2)

Using the CDM method, the characteristic polynomial of a fourth-order control system can be written in the following form [28]:

C(s)=1γ3γ22γ13τ4c0s4+1γ2γ12τ3c0s3+1γ1τ2c0s2+τc0s+c0, (A3)

where γ1,γ2,γ3 are the characteristic ratios, and τ is the GTC of the targeted closed-loop system. The set of adjusted parameters is then given by φ=(γ1,γ2,γ3,τ). It is well known that the step response of an all-pole system with such a characteristic polynomial has a small overshoot under the condition that the characteristic ratios are larger than 2 [28]. In addition, non-overshooting in the step response can be obtained using γ1γ1LB with γ2=γ3=2, where γ1LB=2.53 is the minimum value of γ1 that enable non-overshooting step responses. The standard CDM method, i.e., the Manabe form, is obtained by setting γ1=2.5 and γ2=γ3=2. In this case, the settling time is about 2.5τ to 3τ, and for the fourth-order control system, all the poles are exactly on the vertical line, which means that the obtained two sets of complex conjugate poles have the same pole real part and different pole imaginary parts [31].

The design of an SFC using the CDM method for the CIP system can be conducted as follows. First, the cart transfer function F3(s) is transformed to the following form:

F30(s)=F3(s)=s2a11b2N3s4+b1N2b2N4b2N3s3+b1N1b2N3a1b2N3s2+a1N4N3s+a1. (A4)

By equating the denominator of (A4) with the characteristic polynomial (A3) and using the standard CDM method, we obtain the following set of equations:

b1N2b2N4=γ3γ2γ1τ1=10τ1b1N1b2N3a1=γ3γ22γ12τ2=50τ2N3=a11b21γ3γ22γ13τ4=125a11b21τ4N4=a11b21γ3γ22γ13τ3=125a11b21τ3. (A5)

Solving (A5) gives the SFC gains as follows:

N1=(a1b1)1τ4(a12τ4+50a1τ2+125)N2=5(a1b1)1τ3(2a1τ2+25)N3=125a11b21τ4N4=125a11b21τ3, (A6)

where the GTC, i.e., τ, is the only parameter to be adjusted.

On the other hand, the transfer function (A4), which has two real zeroes, can be decomposed as follows:

F30(s)=s2a1P(s)=C1(s)C2(s)C1(s)=s2+a1P(s);C2(s)=2a1P(s), (A7)

where C1(s) is a non-all-pole system with one pair of jω-axis zeroes, and C2(s) is an all-pole system. Under the Manabe form, the authors of [28] showed the possibility to obtain a monotonic step response for C1(s) if τ is kept above a certain lower bound. However, obtaining the step response monotonicity for (A7) is almost impossible due to the presence of the NMP zero in this transfer function [29,30].

Now, combining (12), (22), and (A6) yields:

GMmin=1+55a1τ22a12τ4+50a1τ2+1000=1+55τ022τ04+50τ02+1000, (A8)

where τ0=a11/2τ. The maximum system gain margin is equal to 0.5+25/51.39 at the normalized GTC τ0=5001/44.73.

Appendix B. The Cascade MPC Controller (CAS)

In a previous paper [12], we have developed a two-horizon time cascade linear MPC controller to stabilize the CIP system. The controller scheme is composed of two MPC controllers in cascade. These controllers are tuned so as to obtain a doubly critically damped behavior for the inner and outer loops. For the purpose of a fair comparison, the cascade MPC (CAS) controller presented hereafter and shown in Figure A1 is an adapted version of the one developed in [12]. More precisely, to develop this version, we assume, as for the proposed SIMO MPC controller of this paper, that Assumption 1 is satisfied. In this case, the associated CAS pendulum prediction model, which is identical to the SIMO one, and the CAS cart prediction model, which is distinct from the SIMO one, are given hereafter:

x^1(t+h)=A1x1(t)+B1x2(t)E1ux^2(t+h)=A2x1(t)+B2x2(t)E2u (A9)
x^3(t+h2)=x3(t)+h2x4(t)+0.5gx1dx^4(t+h2)=x4(t)+gh2x1d, (A10)

where (Ai,Bi,Ei) with i=1,2 are evaluated using (8), and x1d is an unknown low-speed pendulum angle reference.

Figure A1.

Figure A1

Cascade MPC controller for CIP system stabilization.

The cascade controller design can be conducted using two successive steps. In the first step, the design of the inner MPC controller K1=(K11,K12,K13) is considered using the following quadratic cost function:

J1(t,h)=e12(t+h)+ρ1e22(t+h)e1(t+h)=x1dx^1(t+h)e2(t+h)=0x^2(t+h). (A11)

From the angle prediction model (A9), the control input is obtained by minimization of (A11) as follows:

v0(t)=K11x1(t)K12x2(t)+K13x1dK11=A1E1+ρ1A2E2E12+ρ1E22;K12=B1E1+ρ1B2E2E12+ρ1E22;K13=E1E12+ρ1E22. (A12)

Imposing the critically damped constraint to the second-order inner loop gives [12]:

K11=a1b11(d02+4)/4;K12=a11/2b11d0;K13=a1b11d02/4, (A13)

where d0=2a11/2ωn01.2h01 with the inner loop undamped natural frequency ωn00.6h1.

In the second step, the design of the outer MPC controller K2=(K21,K22,K23) is considered using a second quadratic cost function:

J2(t,h2)=e32(t+h2)+ρ2e42(t+h2)e3(t+h2)=x3dx^3(t+h2)e4(t+h2)=0x^4(t+h2). (A14)

Using the cart prediction model (A10), the remaining control input is obtained by minimization of (A13) as follows:

x1d0(t)=K21x3(t)K22x4(t)+K23x3dK21=K23=2g1h22+4ρ2;K22=2h22+4ρ2h22+4ρ2h21g1. (A15)

Assuming that K21=K22=K23=K and ωn0 is high enough to obtain, the pole dominant principle leads to approximate the fourth-order cart transfer function by a simple second-order model [12]. Tuning the gain K to obtain a critically damped behavior for the outer loop leads to determining it with the following expression:

K=K21=K22=K23=4/(4L+g). (A16)

From Figure A1, we get the following ECS:

u(t)=+N1x1(t)+N2x2(t)+N3x3(t)+N4x4(t)N3x3d, (A17)

with

N1=K11N2=K12LKK13N3=N4=KK13, (A18)

where the cascade MPC controller gains are given in Equations (A13) and (A16).

Finally, let us notice here that by construction, the obtained cascade MPC gains are positive. The trivial prediction models, defined in (A9) and (A10), appear to be sufficient to conduct the design, and there is no need to define a generalized prediction model as in the case of the SIMO MPC method. Additionally, the evaluation of the optimal system gain margin for this controller leads to solving a fourth-order polynomial equation that is preferable to solve numerically.

Appendix C. The Coincident Pole Placement Controller (CPP)

Taking into account the usefulness of driving the cart system with little or no oscillation to x3d, a standard approach to design SFC controllers consist of determining the SFC controller gains (N1,N2,N3,N4) so as to obtain a coincident real negative pole p structure for the targeted closed-loop system F3(s) [17]. In this situation, the CIP characteristic polynomial follows the configuration P(s)=(sp)4 that leads directly to evaluate the SFC controller gains by:

N1=b11a11p4+6b11p2+b11a1N2=4b11a11p34b11pN3=a11b21p4N4=4a11b21p3. (A19)

The above controller can be viewed as optimal if its pole maximizes the gain margin (22). To this end, let us define the following normalized pole parameter:

p0=a11/2p. (A20)

Then, from (31), (A19), and (A20), we obtain the set of algebraic equations:

N10D01=p04+6p02+1N20D01=4p034p0N30D01=p04N40D01=4p03. (A21)

Now, substituting (31) in (22) and combining the obtained result with (A21) yields:

GMmin=N20D01(r12r34)1+N40D01(r12r34)=1+4p025p04+2p02+1. (A22)

The maximum gain margin is equal to 0.5(1+5)1.62 at the pole location p0=51/40.6687. As it is stated in [17], the parameters (A19) that result from the above pole location choice allow one to tune the controller gains for any CIP system in the simplest fashion to ensure significantly improved performance and robustness. In this paper, we take this controller as a reference for the comparison task.

Appendix D. Derivation of the Nonlinear CIP Dynamics

In this appendix, the basic equations for the CIP system, shown in Figure 1, are given as well as relevant explanation and derivation of the system of Equation (1). It is assumed that the pendulum rigid rod is mass-less, and the whole pendulum mass m is concentrated in the center of gravity of the pendulum ball. It is also assumed that no friction exists in the system between the cart and the track or between the cart and the pendulum. The coordinates of the center of the ball at the end of the pendulum, which are also the coordinates of the center of gravity of the mass-less pendulum, are given by:

xG=x3+Lsinx1yG=Lcosx1. (A23)

Now, Newton’s second law is applied to the cart to obtain:

Mx¨3=umx¨G. (A24)

Then, substituting (A23) into (A24) and using the fact that x2=x˙1 and x4=x˙3 gives:

u=Mx˙4+m(x˙4+L[x˙2cosx1x22sinx1])=(M+m)x˙4mLx22sinx1+mLx˙2cosx1. (A25)

From (A25), it is easy to get:

x˙4=1M+m[u+mLx22sinx1mLx˙2cosx1]. (A26)

The rotational motion of the pendulum about its center of gravity involves two forces: the force due to gravity and the force due to the acceleration of the cart. Indeed, since the cart is moving, it applies a force on the pendulum. The moment of the force due to gravity is mgLsinx1, and the moment of the force due to the acceleration of the cart is mLx˙4cosx1. The equation that describes the above motion is obtained by applying the rotational version of Newton’s second law. Summing the moments about the center of gravity of the pendulum, we obtain:

mL2x˙2=mgLsinx1mLx˙4cosx1. (A27)

From (A27), it is easy to get:

x˙2=1mL2[mgLsinx1mLx˙4cosx1]. (A28)

Equations (A26) and (A28) describe the basic equations of the CIP system. Substituting (A26) into (A28) obtains:

x˙2=uL1cosx1(M+m)gL1sinx1+mx22sinx1cosx1M+mmcos2x1. (A29)

Then, substituting (A29) into (A26) obtains:

x˙4=+umgsinx1cosx1+mLx22sinx1M+mmcos2x1. (A30)

Finally, from x˙1=x2, x4=x˙3, (A29), and (A30), we obtain the state-space format (1) of the nonlinear CIP dynamics.

Author Contributions

Conceptualization, L.M.; Formal analysis, E.-H.G. and S.B.; Investigation, S.B.; Methodology, E.-H.G. and S.B.; Validation, L.M.; Writing—Original draft, L.M. and E.-H.G.; Writing—Review and editing, E.-H.G. and S.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by PHC Tassili project (project code: 43790QB) and the Slovenian Research Agency (research core funding No. P2-0219).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The study did not report any data.

Conflicts of Interest

The authors declare no conflict of interest.

Footnotes

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.Boubaker O. The inverted pendulum benchmark in nonlinear control theory: A survey. Int. J. Adv. Robot. Syst. 2013;10:1–9. doi: 10.5772/55058. [DOI] [Google Scholar]
  • 2.Magni L., Scattolini R. On the solution of the tracking problem for non-linear systems with MPC. Int. J. Syst. Sci. 2005;36:477–484. doi: 10.1080/00207720500139666. [DOI] [Google Scholar]
  • 3.Camacho E.F., Bordons C. Model Predictive Control. Springer; New York, NY, USA: 2004. [Google Scholar]
  • 4.Zarzycki K., Ławryńczuk M. Fast Real-Time Model Predictive Control for a Ball-on-Plate Process. Sensors. 2021;21:39–59. doi: 10.3390/s21123959. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Li L., Xie Z., Luo X., Li J. Trajectory Planning of Flexible Walking for Biped Robots Using Linear Inverted Pendulum Model and Linear Pendulum Model. Sensors. 2021;21:1082. doi: 10.3390/s21041082. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Ławryńczuk M., Nebeluk R. Computationally Efficient Nonlinear Model Predictive Control Using the L1 Cost-Function. Sensors. 2021;21:5835. doi: 10.3390/s21175835. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Gawthrop P.J. Linear predictive pole-placement control: Practical issues; Proceedings of the 39th IEEE Conference on Decision and Control; Sydney, Austrilia. 12–15 December 2000; pp. 160–165. [Google Scholar]
  • 8.Gawthrop P.J., Ronco E. Predictive pole-placement control with linear models. Automatica. 2002;38:421–432. doi: 10.1016/S0005-1098(01)00231-X. [DOI] [Google Scholar]
  • 9.Gawthrop P.J., Wang L. Intermittent predictive control of an inverted pendulum. Control Eng. Pract. 2006;14:1347–1356. doi: 10.1016/j.conengprac.2005.09.002. [DOI] [Google Scholar]
  • 10.Chalupa P., Bobál V. Modelling and predictive control of inverted pendulum; Proceeding of the 22nd European Conference on Modelling and Simulation; Nicosia, Cyprus. 3–6 June 2008; pp. 531–537. [Google Scholar]
  • 11.Askari M., Mohamed H.A., Moghavvemi M., Yang S.S. Model predictive control of an inverted pendulum; Proceedings of the International Conference for Technical Postgraduates; Kuala Lumpur, Malaysia. 14–15 December 2009; pp. 1–4. [DOI] [Google Scholar]
  • 12.Messikh L., Guechi E.H., Benloucif M.L. Critically damped stabilization of inverted-pendulum systems using continuous-time cascade linear model predictive control. J. Frankl. Inst. 2017;354:7241–7265. doi: 10.1016/j.jfranklin.2017.08.039. [DOI] [Google Scholar]
  • 13.Mills A., Wills A., Ninness B. Nonlinear model predictive control of an inverted pendulum; Proceeding of the American Control Conference; St. Louis, MO, USA. 10–12 June 2009; pp. 2335–2340. [Google Scholar]
  • 14.Jung S., Wen J.T. Nonlinear model predictive control for the swing-up of a rotary inverted pendulum. J. Dyn. Syst. Meas. Control. 2004;126:666–673. doi: 10.1115/1.1789541. [DOI] [Google Scholar]
  • 15.Altmannshofer S. Fast suboptimal nonlinear model predictive control of an inverted pendulum. IFAC Proc. Vol. 2012;45:442–447. doi: 10.3182/20120823-5-NL-3013.00051. [DOI] [Google Scholar]
  • 16.Magni L., Scattolini R., Åström K.J. Global stabilization of the inverted pendulum using model predictive control. IFAC Proc. Vol. 2002;35:141–146. doi: 10.3182/20020721-6-ES-1901.00592. [DOI] [Google Scholar]
  • 17.Chatterjee S., Das S.K. An analytical formula for optimal tuning of the state feedback controller gains for the cart-inverted pendulum system. IFAC-Pap. 2018;51:668–672. doi: 10.1016/j.ifacol.2018.05.112. [DOI] [Google Scholar]
  • 18.Roshdy A.A., Mokbel H.F., Lin Y.Z., Wang T. Stabilization of real inverted pendulum using pole separation factor; Proceedings of International Conference on Mechanical Engineering and Material Science; Chicago, IL, USA. 4–8 June 2012; [DOI] [Google Scholar]
  • 19.Bugeja M. Non-linear swing-up and stabilizing control of an inverted pendulum system; Proceeding of IEEE Region 8 EUROCON; Ljubljana, Slovenia. 22–24 September 2003; pp. 437–441. [DOI] [Google Scholar]
  • 20.Wang J.J. Simulation studies of inverted pendulum based on PID controllers. Simul. Model. Pract. Theory. 2011;19:440–449. doi: 10.1016/j.simpat.2010.08.003. [DOI] [Google Scholar]
  • 21.Kuczmann M. Comprehensive Survey of PID Controller Design for the Inverted Pendulum. Acta Tech. Jaurinensis. 2019;12:55–81. doi: 10.14513/actatechjaur.v12.n1.492. [DOI] [Google Scholar]
  • 22.Shehu M., Ahmad M.R., Shehu A., Alhassan A. LQR, double-PID, and pole placement stabilization and tracking control of single link inverted pendulum; Proceedings of the 5th IEEE International Conference on Control System, Computing and Engineering (ICCSCE); Penang, Malaysia. 27–28 August 2015; pp. 218–223. [Google Scholar]
  • 23.Prasad L.B., Tyagi B., Gupta H.O. Optimal control of nonlinear inverted pendulum system using PID controller and LQR: Performance analysis without and with disturbance input. Int. J. Autom. Comput. 2014;11:661–670. doi: 10.1007/s11633-014-0818-1. [DOI] [Google Scholar]
  • 24.Önen Ü., Cakan A., Ilhan I. Performance comparison of optimization algorithms in LQR controller design for a nonlinear system. Turk. J. Electr. Eng. Comput. Sci. 2019;27:1938–1953. doi: 10.3906/elk-1808-51. [DOI] [Google Scholar]
  • 25.Jezierski A., Mozaryn J., Suski D. Polish Control Conference. Springer; Cham, Switzerland: 2017. A comparison of LQR and MPC control algorithms of an inverted pendulum; pp. 65–76. [Google Scholar]
  • 26.Kim Y.C., Lee J.G. Some Remarks on the Maximum Overshoot of a Fourth-order System. Trans. Korean Inst. Electr. Eng. 2016;65:1854–1859. doi: 10.5370/KIEE.2016.65.11.1854. [DOI] [Google Scholar]
  • 27.Manabe S. Importance of coefficient diagram in polynomial method; Proceeding of the 42nd IEEE International Conference on Decision and Control; Maui, HI, USA. 9–12 December 2003; pp. 3489–3494. [Google Scholar]
  • 28.Qiao Y., Ma C. The assignment of generalized time constant for a non-all-pole system. IEEE Trans. Ind. Electron. 2015;62:4276–4287. doi: 10.1109/TIE.2015.2393832. [DOI] [Google Scholar]
  • 29.Lau K., Middleton R.H., Braslavsky J.H. Undershoot and settling time tradeoffs for nonminimum phase systems. IEEE Trans. Autom. Control. 2003;48:1389–1393. doi: 10.1109/TAC.2003.815025. [DOI] [Google Scholar]
  • 30.Stewart J., Davison D.E. On overshoot and nonminimum phase zeros. IEEE Trans. Autom. Control. 2006;51:1378–1382. doi: 10.1109/TAC.2006.878745. [DOI] [Google Scholar]
  • 31.Manabe S. Brief tutorial and survey of coefficient diagram method; Proceedings of the 4th Asian Control Conference; Singapore. 25–27 September 2002; pp. 1161–1166. [Google Scholar]
  • 32.Müller M., Worthmann K. Quadratic costs do not always work in MPC. Automatica. 2017;82:269–277. doi: 10.1016/j.automatica.2017.04.058. [DOI] [Google Scholar]
  • 33.Lee J., Mukherjee R., Khalil H.K. Output feedback stabilization of inverted pendulum on a cart in the presence of uncertainties. Automatica. 2015;54:146–157. doi: 10.1016/j.automatica.2015.01.013. [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The study did not report any data.


Articles from Sensors (Basel, Switzerland) are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES