Abstract
The widespread adoption of closed-loop control in systems biology has resulted from improvements in sensors, computing, actuation, and the discovery of alternative sites of targeted drug delivery. Most control algorithms for circadian phase resetting exploit light inputs. However, recently identified small-molecule pharmaceuticals offer advantages in terms of invasiveness and potency of actuation. Herein, we develop a systematic method to control the phase of biological oscillations motivated by the recently identified small molecule circadian pharmaceutical KL001. The model-based control architecture exploits an infinitesimal parametric phase response curve (ipPRC) that is used to predict the effect of control inputs on future phase trajectories of the oscillator. The continuous time optimal control policy is first derived for phase resetting, based on the ipPRC and Pontryagin’s maximum principle. Owing to practical challenges in implementing a continuous time optimal control policy, we investigate the effect of implementing the continuous time policy in a sampled time format. Specifically, we provide bounds on the errors incurred by the physiologically tractable sampled time control law. We use these results to select directions of resetting (i.e. phase advance or delay), sampling intervals, and prediction horizons for a nonlinear model predictive control (MPC) algorithm for phase resetting. The potential of this ipPRC-informed pharmaceutical nonlinear MPC is then demonstrated in silico using real-world scenarios of jet lag or rotating shift work.
Keywords: Phase response curve, nonlinear dynamics, circadian oscillators, systems biology, optimal control, model predictive control, phase, entrainment
1. Introduction
A coalescing of control engineering and systems biology has resulted in widespread use of open-loop and feedback control approaches in biological and biomedical applications [1, 2]. These include identifying and modulating cellular behavior [3–5], treating diseases [6, 7], constructing synthetic biological circuits [8, 9], optimizing biomanufacturing productivity [10, 11], and formulating targeted drug delivery systems [12–15]. Unlike open-loop systems that require manual intervention at critical times to prevent deleterious outcomes, closed-loop drug delivery systems enable effective regulation of targeted biological pathways by leveraging control-relevant models, systematic prediction, or decision-making based on clinical targets. Therefore, these approaches have found use in manufacturing and medical devices. Additional improvements in the quality of closed-loop drug delivery and adherence within society has been fueled by the invention of wearable sensors [16], minimally invasive actuators, and embedded decision-making platforms [17], along with novel drug delivery mechanisms [18] or input-output pairs [19]. This paper demonstrates how selecting control input (from light-based methods to small-molecule pharmaceuticals) or controller design parameters plays an essential role in resetting the phase of mammalian circadian rhythms.
Circadian rhythms are endogenous daily oscillations in gene expression or metabolism driving temporal adaptations in most organisms. In mammals, these oscillations are generated by genetic feedback loops within each cell of the organism [20]. Since environmental signals (such as ambient light) set the time of this biological clock, mistimed environmental cues may result in adverse changes to these rhythms, and consequently, a loss of temporal regulation of the genetic architecture, and psychological and physiological pathologies [21]. In recent years, small-molecule pharmaceuticals have gained significant interest as a path toward modulating the circadian clock to reduce the effects of circadian disturbances [22, 23]. Small-molecule pharmaceuticals present critical benefits over the use of light for clock resetting including avoiding the day-time “dead-zone” where light evokes a minimal phase response, and reducing the impractical and burdensome challenge of attempting to tightly control one’s light environment. For example, light-based regulation would necessitate sustained periods of time wearing low-transmission glasses or light visors [24]. Most importantly, light dosing strategies typically take several days to reset circadian rhythms: hence the prevalence of sustained jet lag following long trips across time zones. Even state-of-the-art optimal control policies involving light necessitate as much as eight days to complete a phase resetting when allowing for regular sleep timing [25].
Pharmaceuticals are expected to enable direct manipulation of sensitive control targets within the clock, allowing more rapid clock resetting [26]. The complex nature of the circadian oscillator necessitates a control approach toward dosing strategies for these drugs, as identical stimuli applied at different times of the day may have drastically disparate effects on the clock. Modeling approaches such as those described in [27] have been employed to identify the underlying mechanistic action of these drugs. Such models may, in turn, be used to inform advanced model-based control strategies such as model predictive control (MPC) for circadian regulation.
Prior studies employing models of circadian dynamics have independently investigated optimal control-based approaches for manipulating the circadian clock. For example, nonlinear MPC and multi-target MPC for light-resetting a Drosophila circadian clock model was demonstrated in [26, 28], and complementary phase response curves (PRCs) were leveraged to identify potential control targets. Other studies using nonlinear Drosophila clock models [29, 30] proposed more efficient procedures to obtain light-based optimal control trajectories by relaxing the nonlinear control problem into a mixed-integer formulation. Recent work has extended the concept of using light-based feedback control to investigate re-entrainment of clocks in humans. Attempts have been made to pose this complex problem as a scheduling problem to obtain the best bang-off-bang strategy [24] for light inputs. Recently, the authors in [31] used lower harmonics to formulate an approximate model of the circadian oscillations in humans, and exploited the structure of the co-state equation to propose an efficient line-search algorithm to obtain optimal control sequences. Although these methods are elegant and have been demonstrated to be effective via numerous simulation studies, a significant drawback of optimal control approaches is that they are rarely immune to measurement inaccuracies, exogenous noise, or plant-model mismatch (i.e. discrepancies between the system under control and the model describing it). Further-more, without model approximation, deriving optimal control sequences remain computationally prohibitive, if not intractable, for complex high-dimensional nonlinear models [30].
The use of feedback in circadian entrainment remains a relatively unexplored problem. A notable exception is [32], where the authors developed a framework for control of circadian rhythms using short-duration pulses to shift the phase, via proportional-integral-derivative (PID) control by exploiting a low-dimensional approximate predictive model of the phase dynamics, known as a phase response curve (PRC). This low-dimensional representation of pertinent phase information and the predictive power of a ipPRC enables the construction of MPC algorithms that are well-known to be inherently robust to noise and plant-model mismatch. A prominent recent study derived a feedback control law for circadian entrainment using lookup tables from the optimal control [33]. This study focused on a light input which may be switched on and off at will, and although highly effective for light, pharmaceutical delivery under this policy would incur errors. More importantly, the main result proven herein bounds the nearness of their feedback control approach to their optimal control law. A preliminary investigation of the potential of MPC in circadian control with pharmaceuticals was undertaken in [34, 35], which demonstrated MPC of mammalian circadian rhythms via KL001, a small-molecule pharmaceutical that was recently used to manipulate (in vivo) the circadian clock in mammals [22, 23]. Although previous investigations have revealed the importance of carefully selecting prediction horizon and controller sampling time of the MPC to reset the circadian oscillator, no systematic method of selecting these design variables has been derived.
This study seeks to answer three fundamental questions:
What does the optimal control policy for pharmaceutical-driven circadian phase resetting look like?
What design considerations should be made in implementing feedback control of circadian phase? More specifically, how does the selection of sampling or switching times and the prediction horizon affect the optimality of the applied control?
How can these design considerations shape our selection of circadian therapeutics?
In answering these questions, this study presents a systematic framework for constructing ipPRC-based nonlinear MPC formalisms capable of manipulating mammalian clocks using small-molecule pharmaceuticals.
2. Phase-Reduced Model and Optimal Phase Shifting
First, the circadian oscillator model and its accompanying notation are presented.
2.1. Circadian oscillator model
Circadian dynamics are most commonly represented by limit-cycle oscillators. These oscillators are generally modeled using a class of smooth nonlinear dynamical systems of the form
(1) |
where denote states such as mRNA or protein concentrations, denote kinetic parameters, and denote control inputs. The zero-input system comprises an exponentially-attractive limit cycle that satisfies
(2) |
with period T and angular frequency ω = 2π/T. A two-dimensional limit cycle with states x = [x1, x2] is visualized in Fig. 1A.
This study is concerned with dynamics of the phase of oscillation, and so we first map every unique point on the limit cycle to a unique scalar phase . Phase is defined such that for an oscillator on the limit cycle, phase advances linearly with time. Let denote the corresponding map relating x0 to oscillator phase .
Let γ(t; x0) denote the solution of system (1) with initial condition . Thus a time-dependent phase variable
(3) |
may be established. Considering that the unperturbed oscillator traverses the limit cycle at the constant rate ω, the system phase evolves in time according to:
(4) |
The application of an exogenous input (such as a control action) will divert the state of an oscillatory system away from the limit cycle Г. In order to ascertain the phase shift incurred by such an input, we must assign a phase to points on that are not on the limit cycle Г, that are exponentially attracted to Г. This can be done by assigning them the phase of the trajectory to which they ultimately converge. The trajectory denoted θ(t; xa, u) is the time evolution of point xa ∉ Г, with dynamics given by (1) and control input u. For the zero-input case (u = 0), the state xa may be assigned the phase of the initial condition of the trajectory γ(t; x0) to which the trajectory θ(t; xa, 0) ultimately converges. The asymptotic phase of xa, denoted ϕa, is equal to the phase ϕ0 of point x0 ∈ Г, subject to
(5) |
This relationship yields the asymptotic phase map ϕa = Φ(xa). The asymptotic phase map may be used to formulate a time-dependent asymptotic phase variable
(6) |
for the zero-input case. The trajectories γ and θ with identical phase, and their asymptotic convergence, are shown schematically in Fig. 1B.
This formulation yields phase dynamics identical to the case for ϕ, with particular solutions of the form ϕ(t; 0) = ωt + ϕ0 with initial phase ϕ0. The following subsection describes how the concept of an asymptotic phase variable can be extended for the more challenging and useful case of non-zero control input u.
2.2. Infinitesimal parametric phase response curves and the phase-reduced model
For an oscillator in the neighborhood of Г, the phase dynamics resulting from a control input may be derived by the chain rule:
(7) |
which yields a first-order approximation of phase dynamics for a nonzero control input.
Pharmaceutical manipulations of the clock are generally considered to be mediated by temporary changes in one or more parameter values (e.g. increased degradation of a protein, or decreased transcriptional rate) [23, 36]. Input u(t) is incorporated into the ODE model as a time-dependent modification of parameters p
(8) |
Substituting this into (7) yields
(9) |
where is the infinitesimal parametric PRC (ipPRC) for the oscillator on the limit cycle [37]. The formulation (9) is valid up to the first-order approximation in both duration and magnitude of the control, and is referred to as the phase-reduced model.
The one-dimensional phase equation is therefore given by:
(10) |
where B(t; ϕ) is the ipPRC and u(t) is the parametric perturbation.
Assumption 1.
The ipPRC B(·) is fully known and dependent only on the asymptotic phase ϕ(t).
The ipPRC can be calculated numerically from mechanistic models of the clock for a known control input as above, or alternately could be characterized experimentally [37, 38]. Since the ipPRC is generated from a clock model, it may exhibit minor differences with the ipPRC of the mammalian circadian system. This contributed to our decision to later apply control via MPC, as it is inherently robust to plant-model mismatch. The ipPRC may be quantified directly via impulse response tests, though these experiments are challenging to perform.
A prior method of formulating asymptotic phase dynamics assumed short pulses of control inputs that relax completely to the limit cycle after each pulse [32], and thus the phase sensitivity on the limit cycle may be used. Assumption 1, rather, implies that the phase response dynamics are constant along isochrons (state-space regions of constant phase). This assumption is violated as the oscillator becomes more distant from its limit cycle, or closer to the fixed point at the center of the limit cycle. Identifying violations of this assumption may be studied in the future, and in these cases, the model should be simulated in full. This assumption has been used in prior studies as well [25, 33]. Our assumption yields a few practical advantages when implementing pharmaceutical inputs. Short pulses are less able to evoke significant phase shifts, as the realized phase shift is determined by integration of the ipPRC. Thus, the ipPRC must be of extreme magnitude to yield appreciable shifts for short pulses. Pharmaceutical inputs to the clock are not likely to have the extremely rapid pharmacokinetics needed to be approximated as short pulses.
Assumption 2.
The ipPRC map is continuous on [0, 2π] and has exactly two zero crosses in that interval; the derivatives of the ipPRC at the zero crosses have opposite sign.
This PRC form is readily found in the literature, and for stimuli of a finite duration is commonly referred to as a type 1 phase response [38, 39]. Methods for calculation of the ipPRC using limit-cycle oscillator models are described elsewhere [37]. The other phase response observed (solely) for finite-duration stimuli, a type 0 phase response, contains a discontinuity. This form is associated with a strong and extended stimulus or multiple stimuli over several days, and therefore does not correspond to the ipPRC [38–40]. Furthermore, a type 0 phase response may be generated from an ipPRC that conforms to Assumption 2 by applying a long-duration stimulus.
2.3. The phase resetting problem and optimal phase resetting via Pontryagin’s maximum principle
The phase-resetting problem involves tracking a reference oscillator with phase given by ϕr(t) ϵ [0, 2π), with the oscillator ϕ(t; u) which obeys (10), and with control input bounded by u ϵ [umin, umax]. The phase of the reference oscillator is given by:
Assumption 3.
The reference angular velocity is identical to the angular velocity of the oscillator, that is, ω ≡ ωr.
Assumption 4.
The oscillator is not perturbed by exogenous inputs (e.g. light, metabolic changes) other than control actions.
Through these assumptions we seek to achieve a phase shift rather than entrain to a dynamic phase angle or an environment that also perturbs the system under control. Future studies may approach the more complicated case in which small differences between the day length and human circadian periods (see [41]), and dynamic light input, are accounted for. This simple case of achieving a static phase shift more simply yields optimal control policies that are sufficient to provide understanding for feedback controller design.
The phase difference between the reference oscillator and the oscillator under control is:
The total desired phase shift is such that χ(tf ) = 0 where tf is the time at which the control action is finished, and therefore
In an ideal case where pharmaceutical inputs can be continuously dosed, one can derive an optimal control trajectory for entrainment via Pontryagin’s Maximum/Minimum Principle [42]. The optimal control problem may be formulated with state dynamics
(11) |
and cost functional
(12) |
for minimum-time entrainment. This cost functional does not include a penalty for introducing a drug which may have side effects or be otherwise undesirable. If the drug selected elicits dose-dependent off-target effects, the cost functional could be modified to penalize dose as well, and the performance between cases could be compared.
This yields a free-time fixed-endpoint problem, which can be solved using standard optimal control approaches [33, 42]. The Hamiltonian
(13) |
is formulated with λ is the dynamic costate with terminal condition λ(tf) = 0. Using Pontryagin’s Maximum Principle with constrained control inputs in the range [umin, umax] we can write
which yields the optimal “bang-bang” control policy
(14) |
Here, the terminal condition of the costate, λ(tf ) = 0, forces the costate to be either positive or negative for any given initial condition for all time, and so for a specific initial condition the control input depends exclusively on the sign of B(ϕ), c.f. [33].
This implies that the oscillator should be shifted in only either the negative or positive direction toward its final phase shift, but not both. For an oscillator with desired phase shift Δϕf and initial phase ϕ0, the optimal control input u⋆ is one of two admissible trajectories: either that for achieving the positive phase shift
(15) |
or the negative phase shift
(16) |
Which of these two trajectories is optimal depends upon which reaches ∆ϕf more rapidly, where ∆ϕ accumulates over time according to
(17) |
A decision boundary between advances and delays is denoted ∆ϕb (i.e., ∆ϕf > ∆ϕb indicates that delays are preferable, and ∆ϕf < ∆ϕb indicates that advances are preferable). Correspondingly, that boundary for the continuous time optimal control is denoted . Thus, the two admissible trajectories may be calculated and compared to determine optimality.
This concept is illustrated for the small-molecule pharmaceutical KL001 input in Fig. 3. For KL001 drug action, umax = 0:06 to reflect a 60% decrease in degradation rate, and umin = 0 to reflect that decreasing the degradation rate cannot result in the synthesis of new biomolecules. The ipPRC is shown in Fig. 2 Here, we visualize the two admissible trajectories as two oscillators ϕ+(t) and ϕ−(t) receiving control inputs and , respectively, causing the accumulation of phase shifts ∆ϕ+(t) and ∆ϕ−(t) moving in opposite directions around the circle of all phase shifts ∆ϕ ϵ [0, 2π) starting at ∆ϕ = 0. The time at which these oscillators meet therefore achieves any phase shift on [0, 2π). This time may be found for any initial phase ϕ0 by solving the coupled equations (10) and (17) for inputs and , with the terminal condition of ∆ϕ+(t⋆) = ∆ϕ−(t⋆).
Thus, is defined as the phase shift at which ϕ−(t) and ϕ+(t) meet. The maximum time to achieve any phase shift starting from ϕ0 is given by , and (depicted as a solid black line in Fig. 3D) yields the boundary between where a positive shift and negative shift are optimal to reach a desired ∆ϕf from phase ϕ0. The optimal control input for a given ϕ0 and ∆ϕf is therefore defined as:
(18) |
Therefore the optimal shift time may be calculated by applying this continuous time optimal control until ∆ϕ = ∆ϕf One can formally write as
(19) |
Remark. A bang-bang optimal control policy for light entrainment was shown previously in [24] and [31]. Applying these solutions, however, necessitated either a line search or a search for switching times due to the nonlinear temporal relationship between light input and driving force on the oscillator. Because pharmaceutical control of the clock can be parameterized directly within the limit cycle model, this formulation provides a simple way to compute the theoretical optimal phase resetting policy. Therefore, this provides a means for determining the relative efficacy of any applied control policy.
3. Closing the loop: designing an ipPRC-based nonlinear MPC
3.1. Motivation
Model predictive control is generally implemented by solving a finite-horizon optimal control problem at discrete sampling times, that is, for t = 0, τ, 2τ, , where τ is the sampling time. This section will demonstrate that time discretization causes a loss of optimal control in regions where the step includes a zero cross of the ipPRC, as the input cannot be adjusted during the step and the continuous-time optimal control policy will be violated (visualized in Fig. 5). Additionally, care must be taken to ensure that a controller selects the optimal direction of phase shifting (e.g., 8h advance vs. 16h delay) when only observing a finite portion of the future ipPRC. For the continuous time optimal control, that boundary is known to be , however, this is unlikely to be the correct boundary for the sampled-time case. Further, there is no guarantee that this boundary will appear from simply solving the finite-horizon optimal control.
This section explores how the formulation of the finite-horizon optimal control problem for MPC affects our ability to control the clock. For example, consider an ipPRC with a small positive region. A large sampling time in this case may result in a controller unable to access the positive ipPRC region due to the surrounding negative region being included in each step, and the inadvertent delays canceling the desired advance. This would force such a controller to use phase delays to achieve even small positive phase shifts.
It is therefore useful to compare the performance of a controller with piecewise constant control and uniformly spaced steps in sampling and switching time (the sampling-time control) to the continuous-time optimal control case to identify where inefficiency accumulates. This technique can be used to find a balance between high performance and the computational cost and physical impracticality of frequently updating the algorithm via measurement of physiological phase markers.
Additionally, the controller design will inform the suitability of a specific pharmaceutical for use. For example, a drug with a small positive ipPRC region will not be an appropriate choice for achieving large phase advances if an alternative drug with a larger positive ipPRC region exists. Alternately, the drug with the larger positive ipPRC region may be unsuitable if its pharmacokinetics are slow, such that sampling times must be very long to allow drug clearance between adjusting the dose.
3.2. Exploiting the ipPRC to choose MPC sampling time
Consider the more practical scenario of implementing control inputs parameterized as piecewise constant signals with equispaced time intervals between the knots (i.e., the switching times) of the control parameterization trajectory. This is commonly done when solving and applying MPC in continuous-time. The control input is defined as constant across time steps of uniform duration in a finite horizon, as commonly done for MPC:
(20) |
where ti are the discrete sampling times, with i = 1, … , Np where Np is the number of steps in the prediction horizon (and control horizon, in this paper). Then, ui ϵ [umin, umax] is a scalar constant, and τ = ti − ti−1 is the sampling time for the controller.
Next, we are interested in quantifying the error incurred due to piecewise constant control actions at discrete sampling times. To better understand this error, we derive bounds on two metrics: the residual phase error, and the number of additional cycles to complete reset. Definitions for and proofs of bounds on these metrics follow.
Definition 1. The residual phase error is the phase error remaining at sampling time tk, the first sampling time for which the continuous-time optimal control would have first completed the reset.
Recall that is the time at which χ(t) = 0 under the optimal control and so
(21) |
Based on Definition 1, the residual phase error is given by
(22) |
First, we provide bounds on and demonstrate that these bounds are solely dependent on the sampling time τ and the ipPRC specific to the pharmaceutical being used. To do so, let ncyc be the number of 2π cycles required to achieve the continuous time optimal control resetting using the optimal control policy (18). Furthermore, we denote ,, and Es to be the errors incurred at the step including the negative zero cross, the step including the positive zero cross, and a correction for phase advance cases, respectively.
Theorem 1 (Bounding of ). Suppose Assumptions 1–4 hold. The residual phase error incurred by the sampled- time control (20) satisfies
Proof. Note that the sampling-time control is able to identically track the continuous-time optimal control except where consecutive sampling times occur on either side of a zero-cross of the ipPRC.
By Assumption 2, the ipPRC has two zero-crosses in each [0, 2π) cycle and so this occurs at most twice per cycle. A zero cross occurs at t = t0 (where ti−1 < t0 < ti). The phase shift accumulated for a single step of the controller from ti−1 to ti is given by
(23) |
which must be solved numerically with (10) to provide ϕ(t′). Based on the arguments made in the previous section using Pontryagin’s principle, we know that the optimal control policy selects either umax or umin until the zero-cross, then the opposite following it. Thus, the maximal loss of phase advance or delay is incurred if the switching times are positioned such that the shift incurred over that time step Δϕi(ui) = 0 for the maximal input case, as this indicates that phase advances and delays cancel. The worst-case loss is therefore incurred where the controller takes no action on either side of the zero-cross, as the maximal action would also incur no shift (as shown in Fig. 5).
The maximal loss in phase shift is of equal magnitude in the positive and negative directions because competing advances and delays of identical magnitudes cancel. The maximal error incurred at a given zero-cross is
(24) |
and the error for the positive or negative zero-cross is denoted or , respectively. The maximal phase error incurred for each cycle is the sum of the maximal error from each zero-cross, all other steps are identical to the continuous-time optimal control. A demonstration of the phase-shift cancellation by competing advances and delays is shown in Fig. 5.
For phase advances, additional error is incurred by phase advances accumulating less due to the zero crosses, thus, the oscillator phase will reach the next positive ipPRC region later than it would if it could achieve the full phase advance. The additional error accumulated is equal to the maximal shift that could be achieved in the extra time that it takes to reach the positive region. The time lost per cycle for phase advances is:
(25) |
where T is the oscillatory period. This results in an additional residual phase error per cycle of
(26) |
where ∆ϕb is the decision boundary between phase advances and delays. In the case of phase delays, this quantity is 0, since the oscillator phase will reach the next negative region more rapidly due to achieving less phase delay.
These errors are caused by deviations from the optimal control policy at ipPRC zero crosses, and the sampling-time optimal control otherwise exactly follows the continuous time optimal control. Each of these errors occurs at a rate of once per cycle. Thus, we can bound the residual time phase error
(27) |
for a given τ, which concludes the proof.
The bound may be thought of as the maximal value of χ(tk) for optimal control with sampling time τ.
Remark. At first glance, it may seem that this bound is loose for the case of arbitrarily large ncyc, the number of cycles to achieve any reset is bounded by the finite scalar
the maximum number of number of cycles to achieve any continuous time optimal reset. Applying this value in (27), we may bound the residual time phase error for any sampling-time optimal reset.
Fig. 6 shows how the residual phase error bounds evolve over time for phase advances or delays, ϕ0 = 0, and τ = 2 h. This corresponds to the actual residual phase error incurred, by numerically calculating the optimal control. Although the error bound increased identically for each cycle, the actual residual phase error varies each cycle, depending upon how each zero-cross aligns with the sampling times.
Fig. 7A-B demonstrates this bound for KL001 under τ = 1, 2, 4 h, for ϕ0 = 0. The number of cycles to achieve each reset for phase advances or delays ∆ϕf ϵ [−2π, 2π] (to explicitly show advances and delays) and ϕ0 = 0 under the continuous-time optimal control for KL001 is shown in Fig. 7A. The region of this plot that corresponds to the phase resetting directions selected by the continuous-time optimal control (advances for delays for ) are denoted between the dashed lines. Notably, the number of cycles to achieve phase advances accumulates more rapidly due to the smaller positive region of the ipPRC, and as such, error will incur more quickly in phase advance regions. Fig. 7B shows the bounds on the residual phase error for τ = 1, 2, 4 h, and the actual residual phase error incurred for numerically calculating the optimal control for each shift and sampling time. As observed, the calculated residual phase errors fall within the computed bounds.
In addition to the residual phase error, it is useful to determine how much additional time it would take to reach a complete reset. Effectively, we aim to nd tadd for Finding or bounding tadd is complicated, as it necessitates knowledge of the entire discrete-time optimal control trajectory for each ϕ0 and ∆ϕf How-ever, the most additional time to reset is incurred when a
phase advance requires waiting through the negative ipPRC region (or the inverse case, a phase delay forcing a wait through the positive ipPRC region) to complete the resetting. Conveniently, we may bound the number of additional cycles to complete a phase reset without explicit knowledge of the full discrete-time optimal control.
Definition 2.The number of additional cycles to complete reset is a positive scalar that enforces the constraint
where T +,− is the accelerated/decelerated period of the oscillator induced by a phase advance/delay, respectively.
Remark. We note that T + < T < T −, and by virtue of being on a limit cycle yields
and
which hold true for any T +-periodic control input u+ that phase advances the oscillator and shortens the period, or T −-periodic control input u− that phase delays the oscillator and lengthens the period. In sum, phase advances result in transient shortening of the period, and phase delays result in a transient lengthening of the period.
The following upper bound is proposed on the number of additional cycles to complete reset.
Theorem 2 (Bounding of ).
Let and be the maximum phase delay and advance that is guaranteed to be achievable in a cycle of 2 respectively. Recall that ∆ϕb is the decision boundary between phase advances or delays. The number of additional cycles to correct for the residual phase error incurred by the sampled-time control (20) is bounded above by
(28) |
where ⌊ · ⌋ denotes the floor function.
Proof. Let denote the phase advance that are guaranteed to be achievable in a single cycle. This is more simply thought of as the maximum total phase advance or delay achievable in a cycle minus the worst-case zero-cross error. For advances, the maximum phase advance achieved in a cycle is achieved by the continuous time optimal control policy for advances:
(29) |
The phase shift that is guaranteed to be achievable for each cycle for a step size of τ is therefore
(30) |
and an equivalent method can be used to calculate .
The bound may be calculated by dividing the remaining error by the advance or delay that is guaranteed to be achieved each cycle for a sampling time of τ This implies (28).
This bound is plotted for the use of KL001 and τ = 1, 2, 4 h in Fig. 7C. For sampling times of 1 or 2 h, the residual phase error may corrected in 1–2 cycles, with the vast majority of cases needing only an additional cycle. Furthermore, because this method assumes the worst possible alignment of the control switching times for every cycle, it is unlikely that the full two additional cycles will be needed. Alternately, the 4 h sampling time potentially results in many additional cycles needed to achieve a complete reset. This is because, as shown in Fig. 5, a 4 h step could be aligned such that nearly the entire phase advance region of the ipPRC is lost. Correspondingly, is small in value. Thus, we may conclude that a sampling time of 2 h is approximately as effective as 1 h, and provides a 50% reduction in computational and sampling costs over 1 h sampling times, given our assumptions. Our controller designed in Section 4 therefore uses τ = 2 h. However, if a controller was designed to only produce phase delays, a 4 h sampling time is sufficiently near to the optimal control. This metric is therefore useful in designing controllers for a given problem and potential pharmaceutical given its ipPRC.
For these plots, dashed lines bound the phase shifts that are performed by the continuous-time optimal control to achieve complete coverage of phase shifts from 0 to 2π This implicitly assumes that the direction of optimal phase shifting is known for our choice of τ and is identical to the continuous time optimal control bound ∆ϕ⋆. The following subsection addresses the choice of resetting direction (given ∆ϕ⋆ for the continuous optimal case).
While numerical calculation of the in finite-horizon optimal control may seem advantageous, the optimizations involved are costly, as the selection of each step affects the region of the ipPRC available in the ensuing steps. Rather, we propose to formulate the MPC problem using the bounds derived from the optimal control as guidelines, and perform phase shifts by repeatedly solving the finite-horizon optimal control. It must be emphasized that because the bound we have derived is for the maximum residual phase error incurred at each zero-cross, a controller need not solve the in finite-time optimal sampling time control to be governed by the bound. Solving a finite-horizon optimal control problem is sufficient for these bounds on accumulated error to apply.
3.3. On MPC formulation and the selection of a prediction and control horizon
MPC involves repeatedly solving a finite horizon optimal control problem. For the continuous-time optimal control (in Section 2.3) the direction of phase shifting was determined by calculation of the equal-time phase shift boundary ∆ϕ⋆. That boundary may differ for the discrete-time optimal control for a given step size. However, in MPC, whether the oscillator should advance or delay is implicitly solved in the finite horizon optimization. This subsection demonstrates via simulations that the finite-horizon optimal control in MPC is dependent on the number of steps. Because the finite-horizon optimal phase shifting direction can differ from the true optimal direction, we propose a multi-staged optimization for solving the finite-horizon optimal control.
The MPC problem is formulated in a similar fashion as [34], where the prediction and control horizons are set to be identical. The prediction horizon consists of Np steps with sampling time τ For an oscillator at t = tk this yields:
(31) |
as the knots of the control trajectory defined at each of the Np steps. Eqn. (10) is used as the MPC predictor, which estimates the oscillator phase at the end of each of steps. The predictor formulation of (10) is
(32) |
Here, i is the index of the applied control steps, and k is the index of the predictor. The predicted phase error is de ned as the magnitude of the phase difference between the predicted phase and the environmental phase:
(33) |
We want the phase error to reflect the absolute phase distance between reference and controlled oscillators, and so we define
To avoid overcorrecting to for small amounts of noise in the phase estimate, we ignore phase error below a constant δϕ:
(34) |
so that imprecisions in calculating phase do not result in controller action (similar to the zone-MPC approach of [7]). Thus, the finite-horizon optimal control problem at each time ti may be solved for the optimal trajectory :
(35) |
Subject to: Eqn. (32) Eqn. (33)
for all ℓ = 1, , Np, where wϕ and wu are positive weighting scalars evaluated at the end of the time step and start of the time step, respectively, as phase error is calculated after the control is applied for that step. Weights wu are nominal so that the control input is 0 when input would incur no phase shift. After identifying the optimal piecewise control trajectory u⋆, the initial step is applied to the system (in this case the full 8-state ODE model) for t ϵ (ti, ti + τ] as is standard in model predictive control.
For the ensuing simulations, a sampling time of τ = 2 h was used, as described in the prior section. Additionally, weights wϕ = 10 and wu = 1, so that no control is delivered for marginal gains in phase shifting, and δϕ = 0.1 as in [35].
Simulations demonstrate that the optimization step in the MPC is susceptible to errors induced by choosing the slower direction to achieve a phase reset. The example in Fig. 8 shows how the finite horizon optimal control calculated for ∆ϕf = π, ϕ0 = 0 varies with Np. For short prediction horizons, the controller applies a phase advance, because more of the positive ipPRC region is immediately visible to the predictor, suggesting that this will achieve most rapid reset. By extending the prediction horizon to Np = 10 (20 h), more of the negative region of the ipPRC is available, and so the controller targets phase delays, consistent with the in finite-time optimal control for τ = 2 h. In some cases, performing the optimization with both advances and delays available results in an “indecisive” controller that changes directions during the phase shifting, causing excessive lags to reset.
One solution to the problem of inconsistent finite-horizon optimal control policies is to extend the prediction horizon such that it always observes the entirety of the reset, to guarantee that the finite-horizon optimal control calculated during MPC is equal to the in finite-horizon optimal control, that is: However, this leads to excessive computational costs, especially for small sampling times. Furthermore, the optimization formulation itself poses challenges. Minimizing the time to complete the reset is challenging, as there is no gradient along which to move unless the reset is completed during the prediction horizon, leading to cases with an in finite cost. Minimizing phase error at each step may be suboptimal as well: the fastest reset achievable may involve temporarily increasing χ(t) for cases where ∆ϕf < π and ∆ϕf >.
An alternate approach, which is employed in the following section, is to use a two-stage optimization, where the direction of the reset (advance or delay) is chosen first using a pre-selected bound ∆ϕb. Furthermore, we propose using ∆ϕb = ∆ϕ⋆, as this will ensure that the bounds on residual phase error and additional cycles to reset hold, as derived in the prior section, and thus provide guarantees on controller performance in the absence of plant-model mismatch or noise. One further important consideration is that a controller with Np = 1 will be unable to avoid ac-cumulating error at the zero-crosses, whereas a controller with a larger Np provides exibility to adjust the control inputs preceding a zero-cross to minimize the accumulated error. For this reason, the design parameters were set to Np = 5 and ∆ϕb = ∆ϕ⋆ for our controller. We note, however, that the derived bounds hold even for Np = 1.
4. In silico application of designed MPC controller for phase resetting
To demonstrate the efficacy of this control approach, this section presents two control scenarios in our in silico circadian simulator (shown schematically in Fig. 4). We note here that the predictor in our MPC formulation was the phase only model, but the system that we applied the control input to is the full 8-state ODE model. Thus, we are testing Assumption 1, since control inputs will perturb the system from the limit cycle, and control will only be viable if the phase-only model reduction accurately describes phase dynamics in state space. We parameterized the controller with τ = 2h, Np = 5, and ∆ϕb = ∆ϕ⋆, as prescribed by the design considerations in the prior sections.
The first scenario (Fig. 9) is an example of extreme jet lag, in which the environment undergoes a 5 h phase advance, followed three days later by a phase delay of 11 h This is the equivalent of a flight from Boston to London, then London to Honolulu. Here, the controller first waits for a positive region of the ipPRC rather than immediately attempting a delay, as ∆ϕf < ∆ϕb. Thus, it completes this phase advance in approximately 30 h. The 11 h delay occurs during a region where B(ϕ) < 0, and thus control input is provided immediately. This shift is completed in a total of two continuous doses, in approximately 36 h. Without a control approach, light input alone achieves a shift of 2–3h per day at most, thus, this represents a significant speed up in phase resetting. If light is delivered under optimal control, these shifts are predicted to take several days [24], thus demonstrating the potential for improvement by a pharmaceutical input.
The second scenario (Fig. 10) involves the delivery of KL001 to align an individual working on a rotating shift pattern, from [43]. Here, the time on the x-axis is local time, and the phase of the oscillator is initialized for an individual entrained to a normal light-dark cycle. In this protocol, the individual works two days of morning shift, followed by two days of evening shift, followed by two days of night shift (work shifts denoted by gray boxes. This perpetually phase-delaying schedule is easily tracked by KL001 input, as KL001 more easily achieves phase delays. Notably, by taking one large dose after the conclusion of each pair of shifts, the individual rapidly shifts to the new work phase. Each shift worked in this protocol occurs while the individual is exactly phase-aligned with their work environment. This scenario exclusively involves phase delays, and so under the analysis of Section 3 a larger sampling time of τ = 4 h (or possibly even larger) would be similarly effective.
Importantly, the possibly competing effects of light was neglected in these scenarios. This could be achieved simply by restricting light input during the phase-shifting steps, or treating light as a complementary control input, using an approach such as [25]. This is a possible topic for future study.
5. Conclusion
This paper presents tools for designing feedback control of circadian rhythms based upon an optimal control policy. Importantly, this paper provides a bound on the errors incurred by discrete sampling and update times during pharmaceutical control of circadian oscillation. These results may then be applied to compare against the efficacy of a light-based approach; see, for example, [33]. Thus, we implicitly considered the non-instantaneous pharmacokinetics of circadian pharmaceuticals. The optimal control problem in the case of known pharmacokinetics will likely necessitate computational, rather than analytic, solutions. In our simpler discrete-time formulation, one may think of the time step of each input as an approximation of the pharmacokinetics by a square wave. While this approximation is clearly non-physiological, it yields analytic solutions that can guide the search for viable pharmaceutical actuators. The principle that imprecision in following the optimal control policy is incurred when drug action occurs during ipPRC zero-crosses points indicates, for example, that a drug with slow pharmacokinetics and a short ipPRC positive region will be inefficient at achieving phase advances. Future studies will directly incorporate pharmacokinetics, which may be easily added to an MPC formulation, and then MPC solutions may be compared to performance under the optimal control policy. Further-more, we have shown the advantages provided by pharmacological actuation, as any phase reset is predicted to be achievable in under 60 h. Uniquely, we propose a formulation of the nonlinear MPC based on the bounds derived from the optimal control problem.
Since drug discovery for the manipulation of circadian rhythms is ongoing, this paper presents a perspective on which pharmaceuticals will enable control. Specifically, an ideal circadian pharmaceutical will have (i) large-magnitude positive and negative ipPRC regions, to enable bidirectional phase resetting, and (ii) rapid pharmacokinetics, to enable precise timing of its effect and minimal losses during ipPRC zero-crosses. This is in contrast to traditional pharmaceutical approaches that select for a long drug half-life to ensure its effect is sustained. Future studies should explicitly incorporate simple pharmacokinetic profiles, rather than piecewise-constant control. However, achieving closed-form solutions or error bounds for these problems is challenging.
While this study has approached control of circadian phase, circadian amplitude has been also been recognized as an important physiological parameter. Previous studies have predicted that light resetting of the clock is associated with diminished amplitudes [24, 44]. Changes in amplitude of circadian parameters may be due to deviation from the limit cycle within individual cells, or due to desynchrony among cellular oscillators during phase shifting [45]. Deviation from the limit cycle may be predicted through amplitude response curves (ARCs), and changes in synchrony may be predicted by judicious use of the ipPRC. Elsewhere, we have begun the development of MPC that maintains circadian synchrony through phase shifting [46] Future studies of circadian control may explicitly include control of circadian amplitudes in addition to phase.
Finally, the remaining link in achieving closed-loop circadian control is the development of a noninvasive circadian phase sensor. While phase may be assessed relatively simply from cellular bioluminescent reporters, noninvasive in vivo phase inference in mammals is an open challenge. Recent work in assessing phase from ambulatory monitoring has shown promise and reasonably accurate predictions [47], however, the propagation of measurement errors through such a control algorithm also remains an open question. An ideal phase sensor would rely on simple data such as solely actigraphy, rather than additional biophysical sensors (e.g. skin temperature, salivary melatonin), and remains in development. The abilities of phase inference methods and controller robustness will be of consequence for the implementation of any form of circadian feedback control mentioned herein.
Acknowledgments
1 This work was funded by: National Institutes of Health (NIH) T32-HLO7901, R01-GM096873–01, and K24-HL105664.
Biographies
John H. Abel is a Postdoctoral Research Fellow in the Department of Anesthesiology, Critical Care and Pain Medicine at Massachusetts General Hospital, and an affiliated researcher at the Picower Institute for Learning and Memory at the Massachusetts Institute of Technology (MIT). He received the B.S. degree in Chemical Engineering from Tufts University (2013), the M.S. degree in Chemical Engineering from the University of California, Santa Barbara (2015), and the Ph.D. degree in Systems Biology from Harvard University (2018). His research interests are in the application of dynamical systems and control theory to neuroscience, systems biology of circadian rhythms, and physiological closed-loop control.
Ankush Chakrabarty received a B.E. with first-class honors from Jadavpur University, India (2011), and a Ph.D. from Purdue University, West Lafayette, USA (2016), both in Electrical Engineering. Since 2018, he is a Visiting Research Scientist in the Control and Dynamical Systems Group at Mitsubishi Electric Research Laboratories (MERL), prior to which he was a Postdoctoral Fellow at Harvard University. His work leverages artificial intelligence and approximation/relaxation methods to provide computationally efficient and provably safe solutions to challenging problems arising in control and estimation, with special emphasis on model predictive control and unknown input estimation.
Elizabeth B. Klerman is an Associate Professor of Medicine at Harvard Medical School, and a Physician in the Department of Medicine at the Brigham and Women's Hospital. She received the B.S. in Interdisciplinary Science from the Massachusetts Institute of Technology (MIT), and the PhD in Physiology and M.D. degrees from Harvard University. She is a member of the Sleep Research Society, the Society for Research in Biological Rhythms, the Association for Clinical and Translational Science, and the International Academy of Astronautics. She was received Phi Beta Kappa for her undergraduate work and a NASA JSC Director’s Innovation Award as member of International Space Station Flexible Lighting Team. Dr. Klerman's current research focuses on the interaction of endocrine, circadian and sleep rhythms in normal and pathological states; and mathematical modeling of the sleep and circadian system and markers of its function.
Francis J. Doyle III is the John A. Paulson Dean of the Harvard John A. Paulson School of Engineering and Applied Sciences at Harvard University, where he also is the John A. & Elizabeth S. Armstrong Professor. He received a B.S.E. degree from Princeton, C.P.G.S. from Cambridge, and Ph.D. from Caltech, all in Chemical Engineering. He has held faculty appointments at Purdue University, the University of Delaware, and UCSB. In addition he has held visiting positions at DuPont, Weyerhaeuser, and Stuttgart University. He has been recognized as a Fellow of multiple professional organizations including: IEEE, IFAC, AIMBE, and the AAAS. He was the President for the IEEE Control Systems Society in 2015, and was the Vice President of the International Federation of Automatic Control from 2014 to 2017. In 2005, he was awarded the Computing in Chemical Engineering Award from the AIChE for his innovative work in systems biology, and in 2015 received the Control Engineering Practice Award from the American Automatic Control Council for his development of the artificial pancreas. In 2016, he was inducted as a Fellow into the National Academy of Medicine for his work on biomedical control. His research interests are in systems biology, network science, modeling and analysis of circadian rhythms, and drug delivery for diabetes.
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- [1].Lauffenburger DA, Cell signaling pathways as control modules: Complexity for simplicity?, Proc. Natl. Acad. Sci. U. S. A 97 (10) (2000) 5031–5033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [2].Bailey J, Haddad W, Drug dosing control in clinical pharmacology, IEEE Control Syst. Mag 25 (2) (2005) 35–51. [Google Scholar]
- [3].Julius AA, Halasz A, Sakar MS, Rubin H, Kumar V, Pappas GJ, Stochastic modeling and control of biological systems: The lactose regulation system of escherichia coli, IEEE Trans. Automat. Contr 53 (2008) 51–65. [Google Scholar]
- [4].Perley JP, Mikolajczak J, Harrison ML, Buzzard GT, Rundell AE, Multiple model-informed open-loop control of uncertain intracellular signaling dynamics, PLoS Comput. Biol 10 (4). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [5].Yan G, Vertes PE, Towlson EK, Chew YL, Walker DS, Schafer WR, Barabasi A-L, Network control principles pre-dict neuron function in the caenorhabditis elegans connectome, Nature 550 (7677) (2017) 519–523. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [6].El-Samad H, Kurata H, Doyle JC, Gross CA, Khammash M, Surviving heat shock: Control strategies for robustness and performance, Proc. Natl. Acad. Sci. U. S. A 102 (8) (2005) 2736–2741. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [7].Gondhalekar R, Dassau E, Doyle III FJ, Periodic zone-mpc with asymmetric costs for outpatient-ready safety of an artificial pancreas to treat type 1 diabetes, Automatica 71 (2016) 237– 246. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [8].Chen S, Harrigan P, Heineike B, Stewart-Ornstein J, El-Samad H, Building robust functionality in synthetic circuits using engineered feedback regulation, Curr. Opin. Biotechnol 24 (4) (2013) 790–796. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [9].Del Vecchio D, Abdallah H, Qian Y, Collins JJ, A blueprint for a synthetic genetic feedback controller to reprogram cell fate, Cell Syst 4 (1) (2017) 109–120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [10].Craven S, Whelan J, Glennon B, Glucose concentration control of a fed-batch mammalian cell bioprocess using a nonlinear model predictive controller, J. Process Control 24 (4) (2014) 344–357. [Google Scholar]
- [11].Giordano N, Mairet F, Gouze JL, Geiselmann J, de Jong H, Dynamical allocation of cellular resources as an optimal control problem: Novel insights into microbial growth strategies, PLoS Comput. Biol 12 (3) (2016) 1–28. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [12].Haddad WM, Hayakawa T, Bailey JM, Adaptive control for non-negative and compartmental dynamical systems with applications to general anesthesia, Int. J. Adapt. Control Signal Process 17 (3) (2003) 209–235. [Google Scholar]
- [13].Luni C, Doyle III FJ, Robust multi-drug therapy design and application to insulin resistance in type 2 diabetes, Int. J. Robust Nonlinear Control 21 (15) (2011) 1730–1741. [Google Scholar]
- [14].Yu J, Zhang Y, Yan J, Kahkoska AR, Gu Z, Advances in bioresponsive closed-loop drug delivery systems, Int. J. Pharm (2017) (online). [DOI] [PMC free article] [PubMed] [Google Scholar]
- [15].Jin X, Kim C-S, Dumont GA, Ansermino JM, Hahn J-O, A semi-adaptive control approach to closed-loop medication infusion, Int. J. Adapt. Control Signal Process 31 (2) (2017) 240–254. [Google Scholar]
- [16].Amendola S, Lodato R, Manzari S, Occhiuzzi C, Marrocco G, R d technology for iot-based personal healthcare in smart spaces, IEEE Internet Things J 1 (2) (2014) 144–152. [Google Scholar]
- [17].Zavitsanou S, Chakrabarty A, Dassau E, Doyle F III, Embedded control in wearable medical devices: Application to the artificial pancreas, Processes 4 (4) (2016) 35. [Google Scholar]
- [18].Qiu WY, Tong S, Zhang L, Sakurai Y, Myers DR, Hong L, Lam WA, Bao G, Magnetic forces enable controlled drug delivery by disrupting endothelial cell-cell junctions, Nat. Commun 8 (2017) 15594. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [19].Bakh NA, Cortinas AB, Weiss MA, Langer RS, Anderson DG, Gu Z, Dutta S, Strano MS, Glucose-responsive insulin by molecular and physical design, Nat. Chem 9 (10) (2017) 937–943. [DOI] [PubMed] [Google Scholar]
- [20].Mohawk JA, Green CB, Takahashi JS, Central and peripheral circadian clocks in mammals, Annu. Rev. Neurosci 35 (1) (2012) 445–462. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [21].Marquie J-C, Tucker P, Folkard S, Gentil C, Ansiau D, Chronic effects of shift work on cognition: findings from the visat longitudinal study, Occup. Environ. Med 72 (4) (2015) 258–264. [DOI] [PubMed] [Google Scholar]
- [22].Hirota T, Lee JW, Lewis WG, Zhang EE, Breton G, Liu X, Garcia M, Peters EC, Etchegaray J-P, Traver D, Schultz PG, Kay SA, High-throughput chemical screen identifies a novel potent modulator of cellular circadian rhythms and reveals ckiα as a clock regulatory kinase, PLoS Biol 8 (12) (2010) e1000559. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [23].Hirota T, Lee JW, John PC, Sawa M, Iwaisako K, Noguchi T, Pongsawakul PY, Sonntag T, Welsh DK, Brenner DA, Doyle FJ III, Schultz PG, Kay SA, Identfication of small molecule activators of cryptochrome, Science 337 (6098) (2012) 1094–1097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [24].Serkh K, Forger DB, Optimal schedules of light exposure for rapidly correcting circadian misalignment, PLoS Comput. Biol 10 (4) (2014) e1003523. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [25].Julius A, Yin J, Wen JT, Time-optimal control for circadian entrainment for a model with circadian and sleep dynamics, in: IEEE Conf. Decis. Control 2017, pp. 4709–4714. [Google Scholar]
- [26].Bagheri N, Stelling J, Doyle III FJ, Circadian phase resetting via single and multiple control targets, PLoS Comput. Biol 4 (7) (2008) e1000104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [27].John PC, Hirota T, Kay SA, Doyle FJ III, Spatiotemporal separation of per and cry posttranslational regulation in the mammalian circadian clock, Proc. Natl. Acad. Sci. U. S. A 111 (5) (2014) 2040–2045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [28].Bagheri N, Stelling J, Doyle III FJ, Circadian phase entrainment via nonlinear model predictive control, Int. J. Robust Nonlinear Control 17 (2007) 1555–1571. [Google Scholar]
- [29].Slaby O, Sager S, Shaik OS, Kummer U, Lebiedz D, Optimal control of self-organized dynamics in cellular signal transduction, Math. Comput. Model. Dyn. Syst 13 (5) (2007) 487–502. [Google Scholar]
- [30].Shaik O, Sager S, Slaby O, Lebiedz D, Phase tracking and restoration of circadian rhythms by model-based optimal control, IET Syst. Biol 2 (1) (2008) 16–23. [DOI] [PubMed] [Google Scholar]
- [31].Zhang J, Qiao W, Wen JT, Julius A, Light-based circadian rhythm control: Entrainment and optimization, Automatica 68 (2016) 44–55. [Google Scholar]
- [32].E mov D, Sacre P, Sepulchre R, Controlling the phase of an oscillator: A phase response curve approach, in: IEEE Conf. Decis. Control 2009, pp. 7692–7697. [Google Scholar]
- [33].Qiao W, Wen JT, Julius A, Entrainment control of phase dynamics, IEEE Trans. Automat. Contr 62 (1) (2017) 445–450. [Google Scholar]
- [34].Abel JH, Doyle III FJ, A systems theoretic approach to analysis and control of mammalian circadian dynamics, Chem. Eng. Res. Des 116 (2016) 48–60. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [35].Abel JH, Chakrabarty A, Doyle FJ III, Nonlinear model predictive control for circadian entrainment using small-molecule pharmaceuticals, IFAC-PapersOnLine 50 (1) (2017) 9864–9870. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [36].Kim JK, Forger DB, Marconi M, Wood D, Doran A, Wager T, Chang C, Walton KM, Modeling and validating chronic pharmacological manipulation of circadian rhythms, CPT Pharmacometrics Syst. Pharmacol 2 (7) (2013) e57. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [37].Taylor SR, Gunawan R, Petzold LR, Doyle FJ III, Sensitivity measures for oscillating systems: Application to mammalian circadian gene network, IEEE Trans. Automat. Contr 53 (2008) 177–188. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [38].Johnson CH, Forty years of prcs-what have we learned?, Chronobiol. Int 16 (6) (1999) 711–743. [DOI] [PubMed] [Google Scholar]
- [39].Winfree AT, The geometry of biological time, Springer-Verlag, 2001. [Google Scholar]
- [40].Kronauer RE, Forger DB, Jewett ME, Quantifying human circadian pacemaker response to brief, extended, and repeated light stimuli over the phototopic range, J. Biol. Rhythms 14 (6) (1999) 500–515. [DOI] [PubMed] [Google Scholar]
- [41].Duffy JF, Cain SW, Chang A-M, Phillips AJK, Munch MY, Gron er C, Wyatt JK, Dijk D-J, Wright KP, Czeisler CA, Sex difference in the near-24-hour intrinsic period of the human circadian timing system, Proc. Natl. Acad. Sci. U. S. A 108 (Supplement 3) (2011) 15602–15608. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [42].Kirk DE, Optimal Control Theory: An Introduction, Dover Publications, 2012. [Google Scholar]
- [43].Vetter C, Fischer D, Matera JL, Roenneberg T, Aligning work and circadian time in shift workers improves sleep and reduces circadian disruption, Curr. Biol 25 (7) (2015) 907–911. [DOI] [PubMed] [Google Scholar]
- [44].Diekman CO, Bose A, Reentrainment of the circadian pacemaker during jet lag: East-west asymmetry and the effects of north-south travel, J. Theor. Biol 437 (2018) 261–285. [DOI] [PubMed] [Google Scholar]
- [45].John PC, Taylor SR, Abel JH, Doyle FJ III, Amplitude metrics for cellular circadian bioluminescence reporters, Biophys. J 107 (11) (2014) 2712–2722. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [46].Abel JH, Chakrabarty A, Doyle FJ III, Controlling biological time: Nonlinear model predictive control for populations of circadian oscillators, in: Emerg. Appl. Control Syst. Theory. Lect. Notes Control Inf. Sci. Proceedings, Springer, 2018, pp. 123–138. [Google Scholar]
- [47].Kolodyazhniy V, Spati J, Frey S, Gotz T, Wirz-Justice A, Krauchi K, Cajochen C, Wilhelm FH, An improved method for estimating human circadian phase derived from multichannel ambulatory monitoring and artificial neural networks, Chronobiol. Int 29 (8) (2012) 1078–1097. [DOI] [PubMed] [Google Scholar]