Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 Dec 11.
Published in final edited form as: IEEE Trans Control Syst Technol. 2013 Jan 1;21(1):10.1109/TCST.2011.2177525. doi: 10.1109/TCST.2011.2177525

An Improved Formulation of Hybrid Model Predictive Control With Application to Production-Inventory Systems

Naresh N Nandola 1, Daniel E Rivera 2
PMCID: PMC3859541  NIHMSID: NIHMS529525  PMID: 24348004

Abstract

We consider an improved model predictive control (MPC) formulation for linear hybrid systems described by mixed logical dynamical (MLD) models. The algorithm relies on a multiple-degree-of-freedom parametrization that enables the user to adjust the speed of setpoint tracking, measured disturbance rejection and unmeasured disturbance rejection independently in the closed-loop system. Consequently, controller tuning is more flexible and intuitive than relying on objective function weights (such as move suppression) traditionally used in MPC schemes. The controller formulation is motivated by the needs of non-traditional control applications that are suitably described by hybrid production-inventory systems. Two applications are considered in this paper: adaptive, time-varying interventions in behavioral health, and inventory management in supply chains under conditions of limited capacity. In the adaptive intervention application, a hypothetical intervention inspired by the Fast Track program, a real-life preventive intervention for reducing conduct disorder in at-risk children, is examined. In the inventory management application, the ability of the algorithm to judiciously alter production capacity under conditions of varying demand is presented. These case studies demonstrate that MPC for hybrid systems can be tuned for desired performance under demanding conditions involving noise and uncertainty.

Index Terms: Hybrid systems, model predictive control, production-inventory systems, adaptive behavioral interventions, supply chain management

I. Introduction

Hybrid systems are characterized by interactions between continuous and discrete dynamics. The term hybrid has also been applied to describe processes that involve continuous dynamics and discrete (logical) decisions [1], [2]. Applications of hybrid systems occur in many diverse settings; these include manufacturing, automotive systems, and process control. In recent years, significant emphasis has been given to modeling [1], [2], identification [3], [4], control [5], [6], estimation [7] and optimization [8], [9] of linear and nonlinear hybrid systems. A recent review paper [10] notes that despite the considerable interest within the control engineering community for model predictive control for hybrid systems, the field has not been fully developed, and many open challenges remain. Among these is the application to new areas outside of the industrial community, and the need for novel formulations that can be effectively used in noisy, uncertain environments. This paper represents an effort to obtain a flexible MPC formulation displaying ease of tuning that is amenable to robust performance in hybrid systems, and its application in two non-traditional problem settings that can be expressed as production-inventory systems: adaptive interventions in behavioral health and inventory management in supply chains.

The production-inventory system is a classical problem in enterprise systems that has application in many problem arenas. Figure 1 shows a diagram of a production-inventory system under combined feedback-feedforward control action. The production node is represented by a pipe, while the inventory component consists of fluid in a tank. The goal is to manipulate the inflow to the production node (i.e., starts) in order to replenish an inventory that satisfies exogeneous demand. The demand signal is broken down into forecasted and unforecasted components. A substantial literature exists that examines production-inventory systems from a control engineering standpoint [11], [12], [13], [14]; recently, [15] examined both internal model control (IMC) and model predictive control (MPC) for a linear production-inventory system with continuous inputs. The hybrid production-inventory system, in which production occurs at discrete levels (or is decided by discrete-event decisions) is an important yet less examined problem; we consider it the focus of this paper.

Fig. 1.

Fig. 1

Diagrammatical representation for a classical production-inventory system under combined feedback-feedforward control.

This paper highlights two distinct application areas that can be described as hybrid production-inventory systems. The first is adaptive interventions in behavioral health, which is a topic receiving increasing attention as a means to address the prevention and treatment of chronic, relapsing disorders, such as drug abuse [16]. In an adaptive intervention, dosages of intervention components (such as frequency of counseling visits or medication) are assigned to participants based on the values of tailoring variables that reflect some measure of outcome or adherence. Recent work has shown the relationship between forms of adaptive interventions and dynamical modeling and control of production-inventory systems [17], [18], [19]; [20] presents a risk-based model predictive control approach to the problem. In practice, these problems are hybrid in nature because dosages of intervention components correspond to discrete values. These interventions have to be implemented on a participant population that may display significant levels of interindividual variability. Hence a robust problem formulation is necessary that insures that the decision policy makes appropriate decisions for all members of a population, without demanding excessive modeling effort for each individual participant.

Inventory management in supply chains also represents a fertile application area for hybrid production-inventory systems. Prior work has established the benefits of a control engineering approach over classical Economic Order Quantity (EOQ) inventory management policies [15], [21], [22], [23], [24]; [15] in particular applies Monte Carlo simulation to demonstrate the improved performance and robustness of MPC over a smoothing replenishment rule representing an advanced “order-up-to” strategy [25]. The scenario considered in this paper is that of an enterprise that must make decisions on the startup and shutdown of an auxiliary manufacturing facility in order to accomplish operational goals under changing market demand. The problem is hybrid in nature, and the dynamics of this system can be highly uncertain [26]. Its solution calls for a novel problem formulation to insure efficient use of resources and an optimal operating policy for the inventory management problem, while accounting for changing market demand.

This paper presents an improved formulation of model predictive control (MPC) for linear hybrid systems that is amenable to achieving robust performance. The proposed scheme extends the use of mixed logical and dynamical (MLD) framework [1] for the control of hybrid systems subjected to measured and unmeasured disturbances and plant-model mismatch. The formulation offers a three-degree-of-freedom tuning for linear hybrid systems that addresses the requirements of setpoint tracking, measured (anticipated) disturbance rejection, and unmeasured disturbance rejection in the closed-loop system. This multi-degree-of-freedom formulation is patterned after the work of Wang and Rivera [26], which relies on the filter parametrization developed by Lee, Morari and García [27] and Lee and Yu [28] for unmeasured disturbance rejection in non-hybrid discrete-time systems. Tuning parameters are used in lieu of weights for independently adjusting the speeds of setpoint tracking and disturbance rejection (both measured and unmeasured) for each output of the closed-loop system. Systems with and without integrating dynamics are considered to accommodate a broad base of application settings and practical performance requirements. The paper extends the presentation in [29] with additional details, as well as describes a supply chain application, thus expanding the relevance of the approach to a broader class of problems.

The paper is organized as follows: Section 2 develops the MPC formulation with the multi-degree-of-freedom tuning for MLD systems. An example relying on the classical production-inventory system from Figure 1 but with discrete production levels is used to exemplify the algorithm and the three-degree-of-freedom tuning philosophy. Two case studies, namely a hypothetical adaptive intervention based on Fast Track program, and the supply chain management problem described previously are discussed in Section 3. Summary, conclusions, and directions for future research are presented in Section 4.

II. Model Predictive Control for Hybrid Systems

Model predictive control (MPC) is a form of closed-loop control where the current value of the manipulated variables is determined online as the solution of an optimal control problem over a horizon of given length [30]. The behavior of the system over the horizon is predicted with a model and the current plant state estimate assumed as the initial state for this prediction. When information about the plant state is available at the next sampling instant, the model is updated and optimization repeated over a shifted horizon. The ability to systematically include constraints and plants with multiple inputs and outputs, the flexibility given to the user to define a cost function, and the disturbance rejection properties of MPC have made it an attractive technique in the process industries, and beyond [31].

This section describes the model form and algorithm which incorporates the three degree-of-freedom formulation; it applies to any MLD model. The section concludes with an illustrative example from a standard production-inventory system with discrete-level production starts.

A. Process Model

The MPC controller presented in this paper uses a MLD-based model framework [1] represented as follows,

x(k)=Ax(k-1)+B1u(k-1)+B2δ(k-1)+B3z(k-1)+Bdd(k-1) (1)
y(k)=Cx(k)+d(k)+ν(k) (2)
E5E2δ(k-1)+E3z(k-1)-E4y(k-1)-E1u(k-1)+Edd(k-1) (3)

where x=[xcTxdT]T,xcnxc,xd{0,1}nxd and u=[ucTudT]T,ucnuc,ud{0,1}nud,nunuc+nud represent states (both discrete and continuous) and inputs (both discrete and continuous) of the system. y ∈ ℝny is a vector of outputs, and d, d′ and ν represent measured disturbances, unmeasured disturbances and measurement noise signals, respectively. δ ∈ {0, 1}nd and z ∈ ℝnz are discrete and continuous auxiliary variables that are necessary in order to incorporate logical/discrete decisions via linear inequality constraints; these are represented by Equation (3). The dimension of these auxiliary variables and the number of linear constraints in (3) depends on the specific character of the discrete logical/discrete decisions that have to be enforced in the particular hybrid system. The conversion of logical/discrete decisions into (3) using auxiliary variables is given by Bemporad and Morari [1] in detail. In particular, Bemporad and Morari [1] rely on propositional calculus and Big-M constraints [32] to express discrete actions in terms of (3). The framework permits the user to include and prioritize constraints as well as incorporate heuristic rules in the description of the model. Because disturbances are an inherent part of any system, it is necessary to incorporate these in the controller model. Equations (1)(3) extend the MLD framework shown in [1] through the inclusion of measured and unmeasured disturbances. Furthermore, the model lumps the effect of all unmeasured disturbances on the outputs only, which is a common practice in the process control literature [26], [28]. We consider d′, the unmeasured disturbance, as a stochastic signal described by the model,

xw(k)=Awxw(k-1)+Bww(k-1) (4)
d(k)=Cwxw(k) (5)

Aw has all eigenvalues inside the unit circle while w(k) is a vector of integrated white noise. Here it is assumed that the disturbance vector d′ consists of uncorrelated components. Consequently, Bw = Cw = I and Aw = diag{α1, α2, ···, αny} where αi = 1, i = 1, 2, ···, ny for doubly-integrated (i.e., Type II, per [33]) disturbances, while αi = 0, i = 1, 2, ···, ny for single-integrating (i.e., Type I) disturbances. ny corresponds to the total number of outputs. In order to take advantage of the well understood properties of white noise signals, we consider differenced forms of the disturbance and system models and augment them as follows,

X(k)=AX(k-1)+B1Δu(k-1)+B2Δδ(k-1)+B3Δz(k-1)+BdΔd(k-1)+BwΔw(k-1) (6)
y(k)=CX(k)+ν(k) (7)

where,

X(k)=[ΔxT(k)ΔxwT(k)yT(k)]TA=[A000Aw0CAAwI];Bi=[Bi0CBi],i=1,2,3,d;Bw=[0II];C=[00I]

Here Δ * (k) = *(k) − *(k − 1) and Δw(k) is a white noise sequence.

B. MPC Problem

In this work, we rely on a quadratic cost function of the form,

Ji=1p(y(k+i)-yr)Qy2+i=0m-1(Δu(k+i))QΔu2+i=0m-1(u(k+i)-ur)Qu2+i=0p-1(δ(k+i)-δr)Qd2+i=0p-1(z(k+i)-zr)Qz2 (8)

The optimization problem consists of finding the sequence of control actions u(k), …, u(k+m−1), d(k), …, d(k+ p − 1), and z(k), …, z(k + p − 1) that minimize J

min{[u(k+i)]i=0m-1,[δ(k+i)]i=0p-1,[z(k+i)]i=0p-1}J (9)

subject to mixed integer constraints according to (3) and various process constraints,

yminy(k+i)ymax,1ip (10)
uminu(k+i)umax,0im-1 (11)
ΔuminΔu(k+i)Δumax,0im-1 (12)

p is the prediction horizon and m is the control horizon. umin, umax, and ymin, ymax are lower and upper bound on inputs and outputs, respectively. (·)r stands for reference, (·)Q(·)TQ(·) is the vector 2-norm weighted by matrix Q* and Qy, QΔu, Qu, Qd, and Qz are penalty weights on the control error, move size, control signal, auxiliary binary variables and auxiliary continuous variables, respectively. The objective function in (8) corresponds to that used by Bemporad and Morari [1] with the additional term of move suppression (the penalty term on Δu). In most practical problems, only the first three terms of the objective function (control error, move size, control signal) are considered; however, in keeping with [1] and to maintain generality of the MPC formulation, we include the last two terms that minimize the error of the auxiliary variables δ and z from user-specified reference values. Moreover, the objective function (8) and the linear hybrid system described by the MLD model (1)–(3) are governed by both binary and continuous variables. Consequently, (8) along with the linear hybrid system dynamics (1)–(3) and linear inequality constraints described in (10)–(12), the solution to (9) forms a mixed integer quadratic program (miqp).

The MPC problem described by (8)–(12) requires future predictions of the outputs and the mixed integer constraints in (3). The future predictions can be obtained by propagating (6), (7) and (3) for p steps in future, resulting in the following prediction equations,

Y(k+1)=ΦX(k)+H1U(k)+H2δ¯(k)+H3Z(k)+HdD(k)-H11u(k-1)-H21δ(k-1)-H31z(k-1)-Hd1dflt(k-1) (13)
E¯5E¯2δ¯(k)+E¯3Z(k)+E¯1U(k)+E¯4Y(k)+E¯dD(k) (14)

Equation (14) can be further simplified by substituting Inline graphic(k) and rewriting as,

E5E2δ¯(k)+E3Z(k)+E1U(k)+E4X(k)+EdD(k)-E41u(k-1)-E42δ(k-1)-E43z(k-1)-E4ddflt(k-1) (15)

where dflt represents the filtered value of the measured disturbance obtained using a discrete time filter; this is discussed in more detail in the next section. Inline graphic(k + 1), Inline graphic(k),δ̄(k), Inline graphic(k) and Inline graphic(k) are future values of outputs, inputs, auxiliary binary variables, auxiliary continuous variables and filtered measured disturbances as given below:

Y(k+1)=[yT(k+1)yT(k+2)yT(k+p)]T (16)
U(k)=[uT(k)uT(k+m-1)]T (17)
δ¯(k)=[δT(k)δT(k+p-1)]T (18)
Z(k)=[zT(k)zT(k+p-1)]T (19)
D(k)=[dfltT(k)dfltT(k+p-1)]T (20)

Φ, Inline graphic, H*, Inline graphic and Ē* are the appropriate coefficients matrices that can be generated using (3), (6) and (7); these are given in Appendix-A. Inline graphic, δ̄ and Inline graphic are the decision variables of the MPC problem described in (8)–(12) and can be found by an miqp optimizer. Inline graphic is an externally generated forecast of the measured disturbances values that is provided externally to the algorithm. Using (13)–(20), the MPC problem in (8)–(12) can be rewritten in vector form as

min{U(k),δ¯(k),Z(k)}J(Y(k+1)-Yr)Q^y2+(RuU(k)-Ru0u(k-1))Q^Δu2+(U(k)-Ur)Q^u2+(δ¯(k)-δ¯r)Q^d2+(Z(k)-Zr)Q^z2 (21)

subject to mixed integer constraints according to (15) and various process constraints,

YminY(k+1)Ymax (22)
UminU(k)Umax (23)
ΔUminΔU(k)ΔUmax (24)

where,

Q^=[Q0·0·0·Q];Ru=[I000-II000-I00-II];Ru0=[I00]

Inline graphic, Inline graphic, δ̄r and Inline graphic are the reference vectors for the outputs, inputs, auxiliary binary variables and auxiliary continuous variables as given below:

Yr=[yrT(k+1)yrT(k+2)yrT(k+p)]T (25)
Ur=[urT(k)urT(k+1)urT(k+m-1)]T (26)
δ¯r=[δrT(k)δrT(k+1)δrT(k+p-1)]T (27)
Zr=[zrT(k)zrT(k+1)zrT(k+p-1)]T (28)

Substituting (13) for Inline graphic(k + 1) into (21) and rearranging the objective function in (21) (such that one group consist of all the quadratic terms of the decision variables, while the other group consists of all the linear terms) with constraints described in (15) and (22)–(24) leads to defining the MPC problem into a standard mixed integer quadratic program (miqp) as follows:

minξJ12ξTHξ+GTξ (29)
Sξb (30)

where ξ = [ Inline graphic(k)T δ̄(k)T Inline graphic(k)T]T is the vector of the decision variables, Inline graphic and Inline graphic are coefficient matrices for the quadratic and linear terms of the objective function, respectively, and Inline graphic and b represent the coefficient matrices for the linear constraints. The coefficient matrices Inline graphic, Inline graphic, Inline graphic and b are defined using Φ, Inline graphic, H*, Inline graphic and Ē* and are given in Appendix-B. Thus (29)–(30) represent the MPC control law for linear hybrid systems with measured and unmeasured disturbances, which consists of a standard miqp problem. The size of this problem will vary with the number of inputs (u), auxiliary variables (δ and z), and mixed integer constraints in (3), as well as the length of the prediction (p) and control (m) horizons. For a particular linear hybrid system the size of the miqp problem considered here will be similar to that obtained from [1]. The tremendous advances in miqp solution techniques and commercialized solvers make it possible to solve moderate to large size miqp problems in real-time. While any miqp solver available in the market should be suitable, in this work, we have relied on the TOMLAB-CPLEX solver. Alternatively, one can also opt for a multiparametric approach ([34]) that gives an explicit solution for the constrained miqp and thus reduces the computational time.

The nominal stability and convergence properties of MPC using MLD models are discussed extensively in [1]; Theorem 1 in [1] indicates that the choice of objective function weights will guarantee stability and convergence. Because the proposed MPC algorithm is based on a MLD model description with a similar objective function to [1], these properties apply as well to the problem formulation presented in this paper. It should be noted that our algorithm relies on externally generated reference trajectories, forecasts of the measured disturbances, and estimates of (disturbance-free) initial states X(k) to achieve nominal and robust performance; these considerations are discussed in the ensuing section.

C. Three-Degree-of-Freedom Tuning Parameters

The MPC formulation considered in this work features three-degree-of-freedom tuning where the performance requirements associated with setpoint tracking, unmeasured disturbance rejection, and anticipated measured disturbance rejection can be adjusted independently. This multiple-degree-of-freedom functionality influences the controller response more intuitively and conveniently than the traditional approach of adjusting penalty weights in the objective function; this is developed as follows:

  1. Reference Trajectory and Setpoint Tracking: The output reference trajectory Inline graphic is generated using the filter
    yr(k+i)ytarget=f(q,αrj),1jny,1ip (31)
    where f(q,αrj) is a Type-I discrete-time filter for the j-th reference as given in [33]
    f(q,αrj)=(1-αrj)qq-αrj,1jny (32)

    and q is the forward shift operator. The speed of setpoint tracking can be adjusted by choosing αrj between [0,1) for each individual output j. The smaller the value for αrj, the faster the output response for setpoint tracking. This adjustment is more intuitive than adjusting move suppression weights (QΔu) that directly affect the manipulated variables and consequently, the effect on a specific controlled variable response is more difficult to predict. We note that the reference trajectories for the inputs and auxiliary binary and continuous variables ( Inline graphic, δ̄r and Inline graphic) are considered constant over the prediction horizon at their respective predefined target values.

  2. Measured Disturbance Rejection: The proposed formulation relies on an externally generated forecast of the measured disturbance as in (20); this forecast is filtered and provided as an anticipated signal to the control algorithm. The speed desired to reject measured disturbances can be adjusted independently by using a filter f(q,αdj), 1 ≤ jndist for each measured disturbance signal. Here ndist is the number of measured disturbances and αdj is a tuning parameter whose value lies between [0,1), for each j-th measured disturbance. The lower the value for αdj, the faster the speed of disturbance rejection. The form of the transfer function f(q,αdj) required depends on whether the disturbance is an asymptotically step or ramp (i.e., Type-I or Type-II signal, as defined in [33]). A Type-II filter structure should be used when integrating system dynamics are present [26]. The Type-I filter structure is given in (32) and the Type-II filter structure is given as follows [33]:
    f(q,αdj)=(β0+β1q-1++βωq-ω)×(1-αdj)qq-αdj (33)
    βk=-6kαdj(1-αdj)ω(ω+1)(2ω+1),1kω (34)
    β0=1-(β1++βω) (35)
  3. Unmeasured Disturbance Rejection: The proposed MPC problem (29)–(30) requires calculating the coefficient matrix Inline graphic for the linear term of the objective function (29) and the right-hand side of constraint equation (30) (b) at each time step. This requires initial states X(k) of the augmented system at each time instance. The augmented states X(k) can be estimated from the current measurements y(k) with unmeasured disturbance rejection achieved by a suitably designed state observer/filter. In order to decouple the effects of measured and unmeasured disturbance rejection and true multiple degree-of-freedom tuning for these controller modes, we propose a two-step procedure for estimating the augmented states X(k). In the first step, state estimation is accomplished utilizing the model, the actual (unfiltered) measured disturbance signal d, and a filter Kf as follows,
    X(kk-1)=AX(k-1k-1)+B1Δu(k-1)+B2Δδ(k-1)+B3Δz(k-1)+BdΔd(k-1) (36)
    X(kk)=X(kk-1)+Kf(y(k)-CX(kk-1)) (37)
    Since we rely on the unfiltered measured disturbance d in (36), the second term in (37) represents the effect of unmeasured disturbances only; the choice for Kf will define the speed and character of unmeasured disturbance rejection. Next, we estimate an augmented state of the system by considering the filtered measured disturbance signal dflt and the contribution of the unmeasured disturbance (i.e., the prediction error) from (37) as follows,
    Xflt(kk-1)=AXflt(k-1k-1)+B1Δu(k-1)+B2Δδ(k-1)+B3Δz(k-1)+BdΔdflt(k-1) (38)
    Xflt(kk)=Xflt(kk-1)+Kf(y(k)-CX(kk-1)) (39)
    The first term of (39) captures the effect of the filtered measured disturbances, while the second term (which is same as in (37)) captures the effect of the unmeasured disturbances. Thus, Xflt(k|k) accounts for the both filtered measured and unmeasured disturbances such that the measured disturbance tuning specified through αdj does not influence unmeasured disturbance rejection, while the unmeasured disturbance filter matrix Kf does not affect measured disturbance rejection. Finally, Xflt(k|k) is used in (B.2) and (B.4) to calculate the coefficient matrices Inline graphic and b that appear in the MPC problem (29)–(30). The optimal value of the filter gain Kf could be found by solving an algebraic Riccati equation; however, this will require estimating covariance matrices for the unmeasured disturbance and measurement noise, which may not be known accurately. Instead, we rely on the parametrization shown in Lee, Morari, and García [27] and Lee and Yu [28], which, in keeping with the setpoint tracking and measured disturbance modes, enables specifying the speed of unmeasured disturbance rejection for each output channel independently by the user. For the case of Aw = diag{α1, α2, ···, αny}, we apply the following parametrization of the filter gain:
    Kf=[0FbFa] (40)
    where
    Fa=diag{(fa)1,,(fa)ny} (41)
    Fb=diag{(fb)1,,(fb)ny} (42)
    (fb)j=(fa)j21+αj-αj(fa)j,1jny (43)

    (fa)j is a tuning parameter that lies between 0 and 1; the speed of unmeasured disturbance rejection is proportional to the tuning parameter (fa)j. As (fa)j approaches zero, the state estimator increasingly ignores the prediction error correction, and the control solution is mainly determined by the deterministic model (36) and the the feedforward anticipation signal. On the other hand, the state estimator tries to compensate for all prediction error as (fa)j approaches 1, and consequently the controller becomes extremely aggressive. Thus, by adjusting (fa)j, the user can directly influence unmeasured disturbance rejection for each output response individually, which is more intuitive than tuning with move suppression in the traditional MPC formulation.

    It should be noted that the MLD model presented in (1)–(3) can also accommodate discrete outputs. However, in this work we assume that all output signals are continuous. The filter gain matrix Kf is parameterized such that the elements corresponding to Δx(k) (which can consist of continuous as well discrete states of the system) is always 0; this is observed from (40). Hence, the original system states x(k) are never corrected for unmeasured disturbances; only the outputs y(k) are corrected for the cumulative effect of all the disturbances in the process. Thus, the proposed MPC formulation yields offset-free responses in the presence of the asymptotically ramp disturbances or asymptotically step disturbances on the outputs by choosing Aw = I or Aw = 0, respectively. This requires measurements of all the controlled outputs. Furthermore, the controller is able to reject asymptotically ramp disturbances because of the integral action resulting from the use of the augmented model in the difference form with Aw = I.

  4. Robustness considerations: Addressing performance-robustness tradeoffs is a fundamental consideration in tuning MPC controllers [35]. In the three degree-of-freedom formulation, plant-model mismatch is experienced by the control system as an unmeasured disturbance, and consequently the tuning of the unmeasured disturbance mode through (fa)j will dictate the controller response to model uncertainty [15], [36]. The robustness of the filter according to (36)–(43) under unconstrained conditions for linear continuous (non-hybrid) systems is examined in detail by Lee and Yu [28]. Lee and Yu [28] show the direct relationship between the shape and bandwidth of the sensitivity and complementary sensitivity functions for various settings of (fa)j and αi, and how formal robustness guarantees for this controller can be determined using the concept of the Structured Singular Value [37]. For the constrained case with discrete decision variables, formally assessing the robustness of MPC represents a much greater challenge, although some intutive approaches can be taken [38]. A formal proof of the effect of these tuning and disturbance parameters on robustness for a hybrid, constrained system lies beyond the scope of this paper; however, because of the direct connection between the value of (fa)j and closed-loop bandwidth, it is expected that robustness at the expense of performance will be obtained as (fa)j approaches 0. Both a simulation example (presented in the next subsection) as well as Case Study I in Section III.A are provided in support of this claim.

D. Example: Production-Inventory System

As an illustration of the algorithm we consider a classical production-inventory system (Fig. 1), the basic node of a supply chain [15]. This system can be modeled using a fluid analogy where the inventory is represented as a tank containing fluid, while the production node is represented as a pipe with throughput time θ = θ′ + 1 sampling instants and yield K. The net stock y(k) representing fluid level and the inlet pipe flow u(k) representing factory starts are the controlled and manipulated variables, respectively. The outlet flow represents the demand d(k) that is composed of an unforecasted component du(k) and a forecasted component df (kθf), with θf representing the forecast time. The dynamics of this system are given by an overall mass balance as follows,

y(k+1)=y(k)+Ku(k-θ)-d(k) (44)

where d(k) = df (kθf) + du(k) is the total customer demand. The operational goal of this system is to meet the customer demand d(k) while maintaining the net stock inventory level at a predefined target. This can be accomplished by manipulating factory starts u(k) and feedforward compensation of forecasted demand df (k). In this example, the manipulated input u(k) is restricted to assume only four values: 0, 33.33, 66.66 and 100. Thus, the plant according to (44) is a class of hybrid system with discrete inputs and continuous output y(k) which can be modeled using the MLD representation described by (1)–(3). In order to capture discreteness in the input, four binary auxiliary variables (δ1, δ2, δ3, δ4) and three continuous auxiliary variables (z1, z2, z3) are introduced with the following relationships:

δ1(k)=1z1(k)=100 (45)
δ2(k)=1z2(k)=66.66 (46)
δ3(k)=1z3(k)=33.33 (47)
i=14δi(k)=1 (48)
u(k)=i=13zi(k) (49)

The symbol “⇔” should be read as if and only if (e.g. δ1 = 1 if and only if z1 = 100 else 0 or viceversa). The conditions in (45)–(49) insure that the input u(k) takes only one of four possible values at a particular time instant. The implication (⇔) in (45)–(47) can be converted into linear inequality constraints using equivalence with propositional logic [32] followed by matrix representation as in (3). This yields a mixed logical dynamical (MLD) model for the production-inventory problem discussed above. The MLD model can then be used to formulate an MPC problem of the form (29)–(30). In this case we consider a sampling interval of T = 1 day, θf = p = 30 days, system gain K = 0.9 and throughput time of four (θ′ = 3) days.

Figure 2 shows how the parameters αr, αd and fa can be manipulated to affect the nominal speed of response and hence the robustness of the algorithm. As noted previously, the speeds of setpoint tracking and measured disturbance rejection are inversely proportional to the values of αr and αd, respectively, while the speed of unmeasured disturbance rejection is proportional to the value of fa. Initially, a step setpoint change of magnitude 300 is introduced. The solid line represent the controller performance for the three-degree-of-tuning parameters αr = 0.9, αd = 0.9 and fa = 0.3, while the dashed lines represent the controller performance for the parameters αr = 0, αd = 0 and fa = 1. It can be seen that the system achieves faster setpoint tracking for αr = 0 as compared to αr = 0.9. At day 40, a step change of magnitude 60 occurs in the customer demand d(k), which is forecasted without error (i.e., d(k) = df (k) = 60) until day 67. For αd = 0, the controller is able to quickly notice a change in the demand and takes an early feedforward action in order to maintain the inventory at the setpoint without any deviation (dashed line). On the other hand, for αd = 0.9, the rate of production slows down, resulting in a dip in inventory (solid line). At day 68, demand suddenly decreases to 30 from 60. This change in the demand yields a negative bias of 30 units in the forecast that can be considered as an unmeasured disturbance (i.e., du(k) = d(k)−df (k) = −30). In this case, feedback action is taken by the controller and by the time the controller realizes the decrease in demand, inventory has exceeded the setpoint. As a consequence of continued control action, the system response eventually settles at the setpoint. Faster unmeasured disturbance rejection is accomplished for fa = 1 than for fa = 0.3. The simulation results show how the adjustable parameters αr, αd and fa have an intuitive effect on the closed-loop response of the production-inventory system, and can be selected by the user to achieve desired levels of performance and robustness.

Fig. 2.

Fig. 2

Evaluation of tuning parameters, classical production-inventory example: (i) dashed line: αr = 0, αd = 0., fa = 1 (ii) solid line: αr = 0.9, αd = 0.9, fa = 0.3. The bottom panel shows a measured disturbance df (k) and an unmeasured disturbance du(k) that enter at days 40 and 53, respectively. The solid black line represents the total disturbance d(k), while the dashed line represents the disturbance forecast.

III. Case Studies

In this section, we examine the efficacy of the proposed algorithm on case studies taken from two non-traditional applications for control systems engineering: adaptive behavioral interventions and inventory management in supply chains. While appearing on the surface as distinct (and unrelated) application areas, both of these problems can be abstracted as hybrid production-inventory control systems, and consequently stand to benefit from the general hybrid MPC problem formulation developed in this paper. The case study on adaptive interventions (Section III.A) illustrates for the case of a discrete input the benefits of the multiple degree-of-freedom formulation on a problem involving significant uncertainty. Meanwhile, the supply chain case study (Section III.B) illustrates the benefits of the algorithm for problems involving a discrete decision based on the value of the outputs, and under circumstances when both measured and unmeasured disturbances (requiring combined feedback and feedforward control action) are present. In both case studies a comparison is made to hybrid MPC tuning using only objective function weights, without independent degrees-of-freedom.

A. Adaptive Behavioral Interventions

As a representative case study of a time-varying adaptive behavioral intervention we examine a hypothetical intervention based on the Fast Track program [39]. Fast Track was a multi-year, multi-component program designed to prevent conduct disorder in at-risk children. Youth showing conduct disorder are at increased risk for incarceration, injury, depression, substance abuse, and death by homicide or suicide. In Fast Track, some intervention components were delivered universally to all participants, while other specialized components were delivered adaptively. In this paper we focus on the adaptive intervention scenario described in [16] for assigning family counseling, which was provided to families on the basis of parental functioning. There are several possible levels of intensity, or doses, of family counseling. The idea is to vary the dose of family counseling depending on the needs of the family, in order to avoid providing an insufficient amount of counseling for very troubled families, or wasting counseling resources on families that may not need them or be stigmatized by excessive counseling. The decision about which dose of counseling to offer each family is based primarily on the family’s level of functioning, assessed by a questionnaire completed by the parents. As described in [16], based on the questionnaire and the clinician’s assessment, parental function is determined to fall in one of the following categories: very poor, poor, near threshold, or at/above threshold. A traditional approach in behavioral health is to adapt the intervention through a set of simple “IF-THEN” decision rules that are applied as follows: families with very poor functioning are given weekly counseling; families with poor functioning are given biweekly counseling; families with near threshold functioning are given monthly counseling; and families at or above threshold are given no counseling. Family functioning is reassessed at a review interval of three months, at which time the intervention dosage may change. This process goes on for three years, with twelve opportunities for a dose of family counseling to be assigned.

Rivera et al. [17] examine the links between adaptive behavioral interventions and feedback control systems, proposing means by which a control system engineering approach can optimize the delivery of these interventions. The approach followed in [17] models the intervention using a fluid analogy, represented in Figure 3. Parental function PF(k), the controlled variable, is treated as liquid in a tank, which is depleted by exogenous disturbances D(k). The tank is replenished by the intervention I(k), which is the manipulated variable. The use of a fluid analogy enables developing a mathematical model of the open-loop dynamics of the intervention using the principle of conservation of mass. This model can be described by the following difference equations that relate parental function PF(k) with the intervention I(k):

PF(k+1)=PF(k)+KII(k-θ)-D(k) (50)
D(k)=i=1ndDi(k) (51)
PFmeas(k)=PF(k)+N(k) (52)

Fig. 3.

Fig. 3

Fluid analogy corresponding to the hypothetical adaptive behavioral intervention. Parental function PF(k) is treated as material (inventory) in a tank, which is depleted by disturbances D(k) and replenished by intervention dosage I(k), which is the manipulated variable.

PF(k) is the parental function, I(k) refers to the intervention dosage (frequency of counselor home visits), KI is the intervention gain, θ represent time delay between intervention and its effect on parental function, and PFmeas(k) is the parental function measurement. D(k) is the source of parental function depletion and N(k) represents the measurement noise. I(k), the frequency of counselor visits, has a restriction such that these can only be possible either weekly, biweekly, monthly, or none at all. This problem requires imposing a restriction on the intervention I(k) such that it takes only four values: 0, Iweekly, Ibiweekly and Imonthly. Thus, the problem has inherent discreteness along with the continuous dynamics, which can be characterized by a hybrid dynamical system. The parental function system can be modeled using the MLD framework. In order to capture discreteness in the intervention, four binary auxiliary variables, δ1, δ2, δ3, δ4 and three continuous auxiliary variables, I1, I2, I3 are introduced. The detailed description of the MLD model is not shown here but can be obtained by following a similar procedure as in the production-inventory example of Section II.D.

In the case study the controller is based on a nominal model with parameters KI = 0.15 and θ = 0, prediction horizon p = 30 and control horizon m = 10. The TOMLAB-CPLEX solver is used to solve the resulting mixed integer quadratic program (miqp) optimization problem. Figure 4(a) documents the simulation results for various levels of gain and delay mismatch in the presence of a setpoint change in parental function to 50% and a simultaneous step unmeasured disturbance D(k) = 5 using the proposed MPC formulation. Tuning parameters Qy = 1, QΔu = Qu = Qd = Qz = 0, and (αr, fa) = (0, 0.3) are used for all the cases. αd is not considered as this problem is feedback-only (i.e., no measured disturbance variables are considered). Six cases involving gain and delay mismatch (ΔKI, Δθ) and representing intervention participant variability are evaluated, starting with case 1: (0, 0; nominal model, no mismatch), case 2: (21%, 0), case 3: (−15%, 0), case 4: (0%, 1), case 5: (−15%, 1), and case 6: (21%, 1). Various characteristics of the integrating system dynamics are reflected in these results. For example, a negative gain mismatch leads to sluggish response in parental function (case 3), while a positive gain mismatch yields faster response (case 2) than the nominal (case 1). On the other hand, mismatch in time delay is responsible for a slight overshoot in case 6. In spite of all these variations, the hybrid control system displays satisfactory performance in the face of significant plant-model mismatch, with stable and predominantly overdamped responses in the parental function response, displaying no offset for all the cases presented in this paper.

Fig. 4.

Fig. 4

Evaluation of the hybrid MPC formulation with 3-DoF tuning parameters ((αr, fa) = (0, 0.3), Qy = 1 and QΔu = 0, (a)) and the IF-THEN rule-based decision policy (b) for the counselor visits-parental function intervention under conditions of gain and delay mismatch. A setpoint change from 10% to 50% parental function with simultaneous step disturbance D(k) = 5 are evaluated under varying conditions of gain and delay mismatch (ΔKI, Δθ): case 1 (nominal): (0, 0), case 2: (21%, 0), case 3: (−15%, 0), case 4: (0%, 1), case 5: (−15%, 1), case 6: (21%, 1).

In order to better appreciate the performance of the proposed three-degree-of freedom MPC formulation, we analyze the problem using the “IF-THEN “ rule-based decision policy [17] and an MPC formulation that relies on constant plant-model mismatch ((αr, fa) = (0, 1.0)), with and without move suppression QΔu. Fig. 4(b) displays the simulation results using the IF-THEN rule-based decision policy. These results show that the IF-THEN decision policy leads to significant offset for the first five cases, producing desirable performance only for case 6. This policy relies only on the current measurement of parental function and does not take into account the dynamic relationship between parental function and intervention dose, which accounts for the subpar performance. Figure 5(a) and Fig. 5(b) display the responses from an MPC formulation relying on constant plant-model mismatch over a prediction horizon for QΔu = 0 (no move suppression) and QΔu = 1, respectively. For the case of no move suppression (QΔu = 0, Fig. 5(a)), parental function achieves the desired goal without offset for all six cases. However, the controlled variable responses result in overshoot and oscillations for Cases 3 – 6, and the dosage assignments in general display wide variations that would not be well tolerated in a clinical setting. Applying QΔu = 1 attenuates the aggressive nature of the controller, but the last three cases still exhibit significant overshoot, and the controller settling times display the largest variation of all the control systems considered in the case study.

Fig. 5.

Fig. 5

Evaluation of the hybrid MPC formulation relying on constant plant-model mismatch and move suppression for the counselor visits-parental function intervention under conditions of gain and delay mismatch. A setpoint change from 10% to 50% parental function with simultaneous step disturbance D(k) = 5 are evaluated under varying conditions of gain and delay mismatch (ΔKI, Δθ): case 1 (nominal): (0, 0), case 2: (21%, 0), case 3: (−15%, 0), case 4: (0%, 1), case 5: (−15%, 1), case 6: (21%, 1). (a) Qy = 1 and QΔu = 0 (b) Qy = 1 and QΔu = 1.

Figure 6 displays the values for the closed-loop performance metrics Je and JI that represent measures of cumulative deviation of parental function from the goal and cumulative intervention energy, respectively. Je and JI are defined as follows,

Fig. 6.

Fig. 6

Comparison of the performance metrics Je and JI using the proposed MPC formulation (‘*’); MPC formulation assuming a constant plant-model mismatch over the prediction horizon (‘o’ for QΔu = 1 and ‘∇’ for QΔu = 0) and IF-THEN decision policy [17] (‘+’). Tuning parameters as in Figures 4 and 5.

Je=k=1t/Ts(PF(k)-PFgoal)T(PF(k)-PFgoal) (53)
JI=k=1t/Ts(I(k))T(I(k)) (54)

where t represents the total simulation time and Ts is a sampling time. Table I, meanwhile, summarizes the mean and variance of Je and JI over all the participant cases that are evaluated. In the case of the proposed algorithm, the mean and variance of Je are 2.17×103 and 3.36×105, respectively. On the other hand, using the MPC formulation with constant plant-model mismatch these values are 3.74 × 103 and 13.8 × 105, respectively. In addition, from Figure 6 one can determine that the proposed formulation uses less intervention energy as characterized by the lower values of the performance metric JI, with mean and variance 4.79 × 104 and 17.0 × 107 as compared to the MPC formulation with constant plant-model mismatch (QΔu = 1) that yields 5.63×104 mean with 37.46×107 variance. Figure 6 also documents the values for Je and JI using the MPC formulation (denoted by circle ‘∇’ in the figure) with constant plant-model mismatch over a prediction horizon without move suppression weight. It yields lower values of Je for the first three cases and higher for last three cases than the proposed formulation with overall mean and variance 1.63 × 103 and 12.21 × 105, respectively. However, it requires higher energy for all six cases with the mean of the performance metric JI equal to 6.37 × 104 and variance equal to 19.3 × 107. Moreover, as noted earlier it is susceptible to large oscillation and overshoot (see Fig. 5(a)). Thus, it can be concluded that the proposed multiple-degree-of-freedom algorithm offers more output response uniformity than the traditional MPC formulation that assumes constant plant-model mismatch, while faced with significant uncertainties. Corresponding values for the performance metrics Je and JI using the IF-THEN based decision policy [17] are also represented in Figure 6 with a plus ‘+’. This policy requires least intervention energy of all controllers considered, with mean and variance 3.98 × 104 and 6.19 × 107 for the performance metric JI. However, the mean and variance for Je are 2.87 × 103 and 15.2 × 105, which are much higher than the proposed MPC formulation. In addition, the parental function responses result in significant offset for the first five participant cases (Fig. 4(b)). Thus, the proposed formulation assigns intervention dosages more efficiently in the presence of uncertainty and reduces waste of intervention resources, while taking parental function to a desired goal.

TABLE I.

Mean and variance of closed-loop performance metrics Je and JI using various decision policies

Decision policy Je JI

mean variance mean variance

IF-THEN 2.87 × 103 15.2 × 105 3.98 × 104 6.19 × 107
MPC (QΔu = 0) 1.62 × 103 12.21 × 105 6.37 × 104 19.34 × 107
MPC (QΔu = 1) 3.74 × 103 13.80 × 105 5.63 × 104 37.40 × 107

MPC (3-DoF) 2.17 × 103 3.36 × 105 4.79 × 104 17.0 × 107

B. Supply chain management

We noted in the Introduction that prior work has shown proof-of-concept and the benefits obtained from a control engineering approach (in general) and IMC and MPC (in particular) as decision frameworks for inventory management in supply chains [15], [22], [23], [24], [26], [36]. In this case study, we consider a supply chain management problem comprised of a production-inventory system with one inventory and two production nodes that requires a hybrid MPC solution. The production nodes consist of one primary factory and a secondary auxiliary one, as shown in Figure 7. Throughput time θ1=θ1+1 and yield K1 for the primary factory are 4 days and 0.9, respectively; for the auxiliary factory, the throughput time θ2=θ2+1 and yield K2 are 9 days and 0.8, respectively. Here we consider that the operating cost for the auxiliary factory is greater than the primary one because of the longer throughput time and lower yield. Therefore, this auxiliary production node should be accessed if and only if the primary node is running at its full capacity and unable to meet future market demand; likewise, production from the auxiliary node must be discontinued if future demand cannot justify its operation. The discrete decision of startup or shutdown of the auxiliary factory is a function of the continuous variables df (k) (demand forecast) and the work-in-progress (WIP) in the primary production node. Consequently, the system can be categorized as a hybrid system, where the inventory dynamics are governed by continuous variables (inventory, factory starts, demand and WIP) and a discrete decision (shut-down/start-up of the auxiliary factory). The dynamics of this system can be described using first principles model follows:

y(k+1)=y(k)+K1u1(k-θ1)+K2u2(k-θ2)-d(k) (55)
WIP(k+1)=i=0θ1u1(k-i) (56)

where d(k) = df (kθf)+du(k) is the total customer demand comprising forecasted and unforecasted components, u1(k) represents the starts for the primary production node, u2(k) is the starts for the auxiliary production node, and WIP represents the work-in-progress in the primary production node. In order to ensure that the auxiliary factory is activated if and only if WIP in the primary factory is at its maximum capacity, we embed the following logical condition into the dynamical model:

u2(k-θ2)0WIP(k+1)>Capmax=600 (57)

Fig. 7.

Fig. 7

Diagrammatical representation of a production-inventory system consisting of two production nodes (one primary, one secondary) and a single inventory examined in the supply chain management (SCM) case study.

The implication (⇔) in (57) can be converted into linear inequality constraints using Big-M constraints [40]. This conversion can be accomplished by introducing an auxiliary binary variable (δ) and an auxiliary continuous variable (z) followed by a MLD representation of (55)–(57) as in (1)–(3). In addition, we consider constraints on the process variables as follows:

0ui(k)200,i=1,2 (58)
0WIP(k)600 (59)
0y(k)1000 (60)

The MLD model of (55)–(57) and constraints (58)–(60) are then used to formulate an MPC problem of the form (29)–(30). The operational goal of this system is to meet the customer demand d(k) relying on production from the auxiliary factory only when necessary, while maintaining the net stock inventory level at a predefined setpoint. This can be accomplished by manipulating factory starts u1(k), u2(k) and feedforward compensation of forecasted demand df (k) simultaneously imposing the logical condition described in (57). Here we consider a sampling interval of T = 1 day, θf = p = 30 days.

In practice, it is desirable to keep the factory starts as constant as possible (i.e., avoid factory “thrash”) while maintaining inventory at desired levels in the face of uncertain customer demand [26]. In order to examine the performance of the proposed multi-degree-of-freedom MPC formulation, we consider a piecewise stochastic customer demand signal and its forecast, which are shown in Figure 8. Figure 9 demonstrates the performance of the proposed formulation that uses multi-degree-of-freedom tuning parameters, αr = 0.9, αd = 0, fa = 0.1, penalty weight parameters Qy = 1, QΔu = diag{0 0}, Qu = Qd = Qz = 0, prediction horizon p = 30 and control horizon m = 25. From the figure, it can be observed that the proposed MPC algorithm is able to satisfactorily maintain inventory at its predefined target, while producing little variation in the starts of the factories. Moreover, it is able to decide on the start-up/shutdown of the auxiliary factory in a desirable manner (i.e., by making minimal use of the auxiliary factory). This reduces operational cost while allowing the production system to meet customer demand without backorders.

Fig. 8.

Fig. 8

Customer demand (red dashed line) and demand forecast (solid black line), SCM case study.

Fig. 9.

Fig. 9

Response of net stock (y), work-in-progress (WIP), and factory starts (u1,u2) for the proposed multiple-degree-freedom formulation with tuning parameters: αr = 0.9, αd = 0, fa = 0.1 and Qy = 1, QΔu = diag{0 0}, Qu = Qd = Qz = 0 for the demand in Figure 8.

In order to assess the effectiveness of the proposed formulation, we compare its performance with the MPC formulation that relies on a constant plant-model mismatch over the prediction horizon. This formulation uses the move suppression weight (QΔu) in lieu of fa to reduce the variation in the manipulated variables (i.e., starts) of the factories. Figure 10 presents the simulation results using QΔu = diag{0 0} while keeping all other parameters as in the previous case. The responses show evidence of very poor inventory control and very high variation in the starts of the factories as compared to the proposed formulation. To reduce the variation in the starts of the factories and verify the performance against the proposed formulation, we apply various values of the move suppression weight QΔu. Table II documents the maximum (peak) value of the inventory (ymax) and the closed-loop performance metrics Je and JΔu using the MPC formulation relying on a constant plant-model mismatch over the prediction horizon for five values of QΔu between 0 to 200. The last row of Table II documents these values for the three-degree-of-freedom (3-DoF) MPC formulation. The performance metrics Je and JΔu are measures of cumulative error (e = yyr) and variation in the rate-of-change of factory starts, respectively, which are given as follows:

Je=k=1t/Ts(y(k)-yr)T(y(k)-yr) (61)
JΔu=k=1t/Ts(u(k)-u(k-1))T(u(k)-u(k-1)) (62)

Fig. 10.

Fig. 10

Response of net stock (y), Work-in-Progress (WIP), and factory starts (u1,u2) for the MPC formulation that relies on constant plant-model mismatch and move suppression tuning, with QΔu = diag{0 0} and Qy = 1, Qu = Qd = Qz = 0 for the demand in Figure 8.

TABLE II.

Closed-loop performance metrics JΔu, Je and maximum inventory (ymax) using mpc formulations relying on constant plant-model mismatch for various QΔu and a three-degree-of-freedom (3-dof) mpc formulation for fa = 0.1.

Sl. No. QΔu JΔu Je ymax

1 diag{0 0} 267.9 × 104 13.15 × 105 834.63
2 diag{1 1} 94.67 × 104 11.15 × 105 763.865
3 diag{10 10} 25.82 × 104 10.69 × 105 809.24
4 diag{100 100} 4.81 × 104 12.71 × 105 890.94
5 diag{200 200} 3.038 × 104 15.71 × 105 939.24

6 3-DoF 8.345 × 104 4.426 × 105 692.76

From Table II, it can be seen that the proposed 3-DoF formulation outperforms the constant plant-model mismatch based MPC formulation in terms of maximum peak in the inventory and Je for all values of QΔu. On the other hand, increasing the move suppression weight QΔu lowers JΔu, with the last two cases (i.e. QΔu = diag{100 100} and QΔu = diag{200 200}) yielding lower values of JΔu than the 3-DoF MPC formulation. Comparing Figure 9 with Figure 11 shows that for the 3-DoF formulation, factory starts remain constant over a significant portion of the simulation, without leading to the substantial maximum inventory peak and corresponding demands on warehouse space resulting from QΔu = diag{200 200}. Thus, it can be concluded that the proposed 3 DoF-MPC algorithm can be useful for reducing overall operating costs and efficiently managing this form of hybrid production-inventory systems.

Fig. 11.

Fig. 11

Response of net stock (y), Work-in-Progress (WIP), and factory starts (u1,u2) for the MPC formulation that relies on constant plant-model mismatch and move suppression tuning with QΔu = diag{200 200} and Qy = 1, Qu = Qd = Qz = 0 for the demand in Figure 8.

IV. Summary and Conclusions

Applications of hybrid systems are becoming increasingly common in many fields, yet many open challenges remain [10]. Adaptive behavioral interventions [17] and supply chain management [36] represent emerging application areas for control systems engineering that lie outside of the traditional industrial community, but that can be abstracted as hybrid production-inventory systems. Consequently, a control-oriented formulation for optimally managing these systems over time is relevant; this has been the primary focus of this paper.

To accomplish this goal, an improved MPC formulation for linear hybrid systems has been developed. The formulation relies on the MLD model representation that has been modified to include measured and the unmeasured disturbances that play an important role in these application domains. The controller formulation offers multiple-degree-of-freedom tuning parameters that enable the user to directly influence the speed of setpoint tracking and disturbance rejection (both measured and unmeasured) on each controlled variable; this tuning philosophy is intuitive in nature, amenable for achieving robustness, and has both fundamental and practical appeal. The applicability and efficiency of the proposed MPC formulation is demonstrated on a classical production-inventory problem example, a hypothetical adaptive behavioral intervention case study intended for improving parental function in homes of at-risk children, and a supply chain management (SCM) case study involving the need to systematically add (or remove) factory capacity in the face of varying external demand.

Simulation results in the adaptive intervention case study highlight the ability of the algorithm to be tuned to handle significant plant-model mismatch and disturbances while offering robust performance. The robustness properties of the algorithm are useful in the context of applying control engineering-based interventions to populations or cohort groups that may display wide participant variability in model parameters. In the SCM problem, the algorithm was useful in determining both the choice and magnitude of factory starts, while maintaining relatively uniform net stock inventory levels in the presence of drastic changes in market demand. The case study results showed that the proposed MPC scheme offered substantial performance and robustness benefits over a traditional MPC formulation with constant plant-model mismatch that uses move suppression weights to adjust the system response.

A formal robustness study lies beyond the scope of this paper, but represents a useful future addition to this body of work. Likewise, approaches that reduce the modeling effort required in practice and extend the functionality of the algorithm to nonlinear problems are particularly appropriate. Current efforts by the authors involve the development of controller formulations that are data-centric in nature; in this setting we are relying on the concept of model-on-demand (MoD) [41], [42], [43] and have developed a hybrid MoD-MPC approach that has been evaluated for adaptive behavioral interventions [44].

Acknowledgments

Support for this work has been provided by the Office of Behavioral and Social Sciences Research (OBSSR) of the National Institutes of Health and the National Institute on Drug Abuse (NIDA) through grants K25 DA021173 and R21 DA024266. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institute on Drug Abuse or the National Institutes of Health. Insights provided by Linda M. Collins (Methodology Center, Penn State University) and Susan A. Murphy (Statistics, University of Michigan) towards better understanding adaptive behavioral interventions are greatly appreciated.

Appendix-A. COEFFICIENT MATRICES OF THE p-step AHEAD PREDICTIONS OF THE OUTPUT IN (13) AND CONSTRAINTS IN (14) AND (15)

H1=[CB1000CAB1-CB1CB100CA2B1-CAB1CAB1-CB1CB1CAm-1B1-CAm-2B1CAm-2B1-BAm-1B1CAB1-CB1CB1CAmB1-CAm-1B1CAm-1B1-CAm-2B1CA2B1-CAB1CAB1CAp-1B1-CAp-2B1CAp-2B1-CAp-1B1CAp-m+1B1-CAp-mB1CAp-mB1] (A.1)
Hi=[CBi000CABi-CBiCBi00CA2Bi-CABiCABi-CBiCAp-1Bi-CAp-2BiCAp-2Bi-CAp-3BiCABi-CBiCBi],i=2,3,d (A.2)
Φ=[CACA2CAp] (A.3)
Hi1=[CBiCABiCA2BiCAp-1Bi]i=1,2,3,d (A.4)
E¯i=diag{Ei,,Ei},i=2,3,d (A.5)
E¯4=diag{-E4,,-E4} (A.6)
E¯5=[E5E5E5]T (A.7)
E¯1=[-E10·00··-E10·-E1] (A.8)
Ei=(E¯4H¯i+E¯i),i=1,2,3,d (A.9)
E4=E¯4Φ¯ (A.10)
E4i=E¯4H¯i1,i=1,2,3,d (A.11)
E5=E¯5 (A.12)
H¯j=[[0]nyHj(1:(p-1)ny,:)]j=1,2,3,d,11,21,31,d1 (A.13)
Φ¯=[CΦ(1:(p-1)ny,:)] (A.14)

Here ny is number of outputs, [0]ny denotes matrix with ny rows that has all the elements 0 and *(1: (p−1)ny, :) should be read as row 1 to row (p − 1)ny of the matrix * with all the columns.

Appendix-B. COEFFICIENT MATRICES OF THE OBJECTIVE FUNCTION IN (29) AND CONSTRAINTS IN (30) OF THE MPC PROBLEM

H=2[H1TQ^yH1+Q^Δu+Q^uH1TQ^yH2H1TQ^yH3H2TQ^yH1H2TQ^yH2+Q^dH2TQ^yH3H3TQ^yH1H3TQ^yH2H3TQ^yH3+Q^z] (B.1)
G=2[g1g2g3]T (B.2)

where,

g1=X(k)TΦTQ^yH1-YrTQ^yH1-UrTQ^u+D(k)THdTQ^yH1-u(k-1)T(Ru0TQΔu^Ru+H11TQ^yH1)-δ(k-1)TH21TQ^yH1-z(k-1)TH31TQ^yH1-dflt(k-1)THd1TQ^yH1g2=X(k)TΦTQ^yH2-YrTQ^yH2-δ¯rTQ^d+D(k)THdTQ^yH2-u(k-1)TH11TQ^yH2-δ(k-1)TH21TQ^yH2-z(k-1)TH31TQ^yH2-dflt(k-1)THd1TQ^yH2g3=X(k)TΦTQ^yH3-YrTQ^yH3-ZrTQ^z+D(k)THdTQ^yH3-u(k-1)TH11TQ^yH3-δ(k-1)TH21TQ^yH3-z(k-1)TH31TQ^yH3-dflt(k-1)THd1TQ^yH3S=[s1s2-s2] (B.3)

where

s1=[E1E2E3];s2=[H1H2H3Im(nu)[0]m(nu)×p(nd)[0]m(nu)×p(nz)Ru[0]m(nu)×p(nd)[0]m(nu)×p(nz)]b=[E5-E4X(k)-EdD(k)+E41u(k-1)+E42δ(k-1)+E43z(k-1)+E4ddflt(k-1)Ymax-ΦX(k)-HdD(k)+H11u(k-1)+H21δ(k-1)+H31z(k-1)+Hd1dflt(k-1)UmaxΔUmax+Ru0u(k-1)-Ymin+ΦX(k)+HdD(k)-H11u(k-1)-H21δ(k-1)-H31z(k-1)-Hd1dflt(k-1)-Umin-ΔUmin-Ru0u(k-1)] (B.4)

Here nu, nd and nz are dimensions of the inputs, the auxiliary binary variables δ and the auxiliary continuous variables z, respectively.

Contributor Information

Naresh N. Nandola, Email: nareshkumar.nandola@in.abb.com, Control Systems Engineering Laboratory, School for Engineering of Matter, Transport, and Energy, Arizona State University, Tempe, AZ 85287-6106, USA. He is currently with ABB Corporate Research Center, Bangalore, India

Daniel E. Rivera, Email: daniel.rivera@asu.edu, Control Systems Engineering Laboratory, School for Engineering of Matter, Transport, and Energy, Arizona State University, Tempe, AZ 85287-6106, USA.

References

  • 1.Bemporad A, Morari M. Control of systems integrating logic, dynamics, and constraints. Automatica. 1999;35(3):407–427. [Google Scholar]
  • 2.Nandola NN, Bhartiya S. A multiple model approach for predictive control of nonlinear hybrid systems. J Process Control. 2008;18(2):131–148. [Google Scholar]
  • 3.Nandola NN, Bhartiya S. Hybrid system identification using a structural approach and its model based control: An experimental validation. Nonlinear Anal Hybrid Syst. 2009;3(2):87–100. [Google Scholar]
  • 4.Roll J, Bemporad A, Ljung L. Identification of piecewise affine systems via mixed-integer programming. Automatica. 2004;40(1):37–50. [Google Scholar]
  • 5.Branicky MS, Borkar VS, Mitter SK. A unified framework for hybrid control: model and optimal control theory. IEEE Trans Autom Control. 1998;43(1):31–45. [Google Scholar]
  • 6.Nandola NN, Bhartiya S. A computationally efficient scheme for model predictive control of nonlinear hybrid systems using generalized outer approximation. Ind Eng Chem Res. 2009;48(12):5767–5778. [Google Scholar]
  • 7.Prakash J, Patwardhan SC, Shah SL. State estimation ad nonlinear predictive control of autonomous hybrid system using derivative free state estimators. Journal of Process Control. 2010;20(7):787–799. [Google Scholar]
  • 8.Lee CK, Singer AB, Barton PI. Global optimization of linear hybrid systems with explicit transitions. Systems & Control Letters. 2004;51(5):363–375. [Google Scholar]
  • 9.Till J, Engell S, Panek S, Stursberg O. Applied hybrid system optimization: An empirical investigation of complexity. Control Engineering Practice. 2004;12(10):1291–1303. [Google Scholar]
  • 10.Camacho EF, Ramirez DR, Limon D, Munoz de la Peña D, Alamo T. Model predictive control techniques for hybrid systems. Annual Reviews in Control. 2010;34:21–31. [Google Scholar]
  • 11.Grubbström RW, Wikner J. Inventory trigger control policies developed in terms of control theory. International Journal of Production Economics. 1996;45:397–406. [Google Scholar]
  • 12.Riddalls CE, Bennett S. Production-inventory system controller design and supply chain dynamics. International Journal of Systems Science. 2002;33(3):181–195. [Google Scholar]
  • 13.Ortega M, Lin L. Control theory applications to the production-inventory problem: a review. International Journal of Production Research. 2004;42(11):2303–2322. [Google Scholar]
  • 14.Sarimveis H, Patrinos P, Tarantilis CD, Kiranoudis CT. Dynamic modeling and control of supply chain systems: A review. Computers & Operations Research. 2008;35:3530–3561. [Google Scholar]
  • 15.Schwartz JD, Rivera DE. A process control approach to tactical inventory management in production-inventory systems. International Journal of Production Economics. 2010;125(1):111–124. [Google Scholar]
  • 16.Collins LM, Murphy SA, Bierman KL. A conceptual framework for adaptive preventive interventions. Prevention Science. 2004;5(3):185–196. doi: 10.1023/b:prev.0000037641.26017.00. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Rivera DE, Pew MD, Collins LM. Using engineering control principles to inform the design of adaptive interventions a conceptual introduction. Drug and Alcohol Dependence. 2007;88(2):S31–S40. doi: 10.1016/j.drugalcdep.2006.10.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Navarro-Barrientos JE, Rivera DE, Collins LM. A dynamical model for describing behavioural interventions for weight and body composition change. Mathematical and Computer Modelling of Dynamical Systems. 2011 Apr;17(2):183–203. doi: 10.1080/13873954.2010.520409. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Riley WT, Rivera DE, Atienza AA, Nilsen W, Allison SM, Mermelstein R. Health behavior models in the age of mobile interventions: are our theories up to the task? Translational Behavioral Medicine. 2011 Mar;1(1):53–71. doi: 10.1007/s13142-011-0021-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Zafra-Cabeza A, Rivera DE, Collins LM, Ridao MA, Camacho EF. A risk-based Model Predictive Control approach to adaptive behavioral interventions in behavioral health. IEEE Transactions in Control Systems Technology. 2011 Jun;19(4):891–901. doi: 10.1109/TCST.2010.2052256. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Silver EA, Pyke DF, Petersen R. Inventory Management and Production Planning and Scheduling. 3. Wiley; Upper Saddle River, NJ: 1998. [Google Scholar]
  • 22.Rivera DE, Pew MD. Evaluating PID control for supply chain management: a freshman design project. 44th IEEE Conference on Decision and Control and 2005 European Control Conference; Seville, Spain. December 2005.pp. 3415–3419. [Google Scholar]
  • 23.Schwartz JD. PhD thesis. Arizona State University; USA: 2008. Next generation supply chain management: control, optimization, and system identification. [Google Scholar]
  • 24.Schwartz JD, Arahal MR, Rivera DE, Smith KD. Control-relevant demand forecasting for tactical decision-making in semiconductor manufacturing supply chain management. IEEE Transactions on Semiconductor Manufacturing. 2009 Feb;22(1):154–163. [Google Scholar]
  • 25.Dejonckheere J, Disney SM, Lambrecht MR, Towill DR. Measuring and avoiding the bullwhip effect: a control theoretic approach. European Journal of Operational Research. 2003;147:567–590. [Google Scholar]
  • 26.Wang W, Rivera DE. Model predictive control for tactical decision-making in semiconductor manufacturing supply chain management. IEEE Trans Control Syst Technol. 2008 Sep;16(5):841–855. [Google Scholar]
  • 27.Lee JH, Morari M, García CE. State-space interpretation of model predictive control. Automatica. 1994;30(4):707–717. [Google Scholar]
  • 28.Lee JH, Yu ZH. Tuning of model predictive controllers for robust performance. Comput Chem Eng. 1994;18(1):15–37. [Google Scholar]
  • 29.Nandola NN, Rivera DE. A novel model predictive control formulation for hybrid systems with application to adaptive behavioral interventions. ACC; Baltimore, Maryland, USA. June 30–July 02 2010; pp. 6286–6292. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Camacho EF, Bordons C. Model predictive control. 2. Springer-Verlag; 2004. [Google Scholar]
  • 31.Prett DM, García . Butterworths Series in Chemical Engineering. Butterworths publishers; 1988. Fundamental Process Control. [Google Scholar]
  • 32.Williams HP. Model building in mathematical programming. John Wiley & Sons; New York: 1993. [Google Scholar]
  • 33.Morari M, Zafiriou E. Robust Process Control. Englewood Cliffs, NJ: Prentice-Hall; 1989. [Google Scholar]
  • 34.Dua V, Bozinis NA, Pistikopoulos EN. A multiparametric programming approach for mixed-integer quadratic engineering problems. Computers & Chemical Engineering. 2002;26(4–5):715–733. [Google Scholar]
  • 35.Garriga JL, Soroush M. Model predictive control tuning methods: a review. Ind Eng Chem Res. 2010;49:3505–3515. [Google Scholar]
  • 36.Schwartz JD, Wang W, Rivera DE. Simulation-based optimization of process control policies for inventory management in supply chains. Automatica. 2006;42(8):1311–1320. [Google Scholar]
  • 37.Zhou K, Doyle JC, Glover K. Robust and Optimal Control. PTR Prentice Hall; New Jersey: 1996. [Google Scholar]
  • 38.Rossiter JA. Model-based predictive control: A practical approach. CRC Press; Boca Raton, FL: 2003. [Google Scholar]
  • 39.Conduct Problems Prevention Research Group. A developmental and clinical model for the prevention of conduct disorders: The Fast Track program. Development and Psychopathology. 1992;4:509–528. [Google Scholar]
  • 40.Raman R, Grossmann IE. Relation between MILP modelling and logical inference for chemical process synthesis. Comput Chem Eng. 1991;15(2):73–84. [Google Scholar]
  • 41.Braun MW, Rivera DE, Stenman A. A model-on-demand identification methodology for nonlinear process systems. Int J Control. 2001;74(18):1708–1717. [Google Scholar]
  • 42.Stenman A. PhD thesis. Linköping University; Sweden: 1999. Model-on-demand: Algorithms, analysis and applications. [Google Scholar]
  • 43.Braun MW. PhD thesis. Arizona State University; USA: 2001. Model-on-demand nonlinear estimation and model predictive control: Novel methodologies for process control and supply chain management. [Google Scholar]
  • 44.Nandola NN, Rivera DE. Model-on-demand predictive control for nonlinear hybrid systems with application to adaptive behavioral interventions. 49th IEEE Conference on Decision and Control; Atlanta, Georgia. December 2010; pp. 6113–6118. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES