Time optimal entrainment control for circadian rhythm

A Agung Julius; Jiawei Yin; John T Wen

doi:10.1371/journal.pone.0225988

. 2019 Dec 18;14(12):e0225988. doi: 10.1371/journal.pone.0225988

Time optimal entrainment control for circadian rhythm

A Agung Julius ^1,^2,^*, Jiawei Yin ^1,², John T Wen ^1,^2,³

Editor: Tjeerd olde Scheper⁴

PMCID: PMC6919585 PMID: 31851723

Abstract

The circadian rhythm functions as a master clock that regulates many physiological processes in humans including sleep, metabolism, hormone secretion, and neurobehavioral processes. Disruption of the circadian rhythm is known to have negative impacts on health. Light is the strongest circadian stimulus that can be used to regulate the circadian phase. In this paper, we consider the mathematical problem of time-optimal circadian (re)entrainment, i.e., computing the lighting schedule to drive a misaligned circadian phase to a reference circadian phase as quickly as possible. We represent the dynamics of the circadian rhythm using the Jewett-Forger-Kronauer (JFK) model, which is a third-order nonlinear differential equation. The time-optimal circadian entrainment problem has been previously solved in settings that involve either a reduced-order JFK model or open-loop optimal solutions. In this paper, we present (1) a general solution for the time-optimal control problem of fastest entrainment that can be applied to the full order JFK model (2) an evaluation of the impacts of model reduction on the solutions of the time-optimal control problem, and (3) optimal feedback control laws for fastest entrainment for the full order Kronauer model and evaluate their robustness under some modeling errors.

1 Introduction

The circadian rhythm is a mechanism with which living beings can synchronize their biological processes with the light and dark pattern of the terrestrial day [1]. Effectively, the circadian rhythm functions as a master clock that regulates these processes [2]. In humans, the circadian rhythm is heavily linked to various physiological processes, including sleep, metabolism, hormone secretion, and neurobehavioral processes. Disruption of the circadian rhythm is known to have negative impacts on health, ranging from fatigue in travelers with jet-lag to an increased risk of cancer in rotating shift workers [3].

Light is the strongest circadian stimulus. In the literature, there are mathematical models that capture the dynamics of the circadian rhythm and how light affects it. The most detailed models are based on the biochemical and gene regulation processes behind the circadian rhythms, such as those in [4–6]. Empirical models, such as variants of the well-known Jewett-Forger-Kronauer (JFK) model [7, 8], are simpler and capture the essential behavior of the human core body temperature oscillation and the effect of light on the phase and amplitude of this oscillation. As demonstrated in [9], the JFK model may be considered as the asymptotic case of the biochemical models in an averaged sense.

One aspect of circadian rhythm regulation that has received a lot of attention is the (re)entrainment problem, i.e., the problem of driving a misaligned circadian phase to a reference circadian phase. Such problems occur, for example, in travelers with jet-lag or workers with rotating shifts. This problem is typically expressed as an optimal control problem of a system with nonlinear dynamics. The control inputs into the system are typically light [10] and chemicals that target circadian genes [11]. Some researchers have proposed to use model predictive control to deal with the nonlinear dynamics of the circadian rhythm without any guarantee of optimality of the solutions [12–15]. In contrast, others consider the time-optimal control problem related to circadian entrainment [16]. Our prior work (c.f. [17–21]) that used the Pontryagin Maximum Principle approach falls under this category. A related work reported in [22] also posed the time-optimal control problem and solved it by assuming that the optimal light schedule would alternate between using maximally bright circadian light and darkness. In the optimal control literature, such strategies are called bang-off strategies. Subsequently, the time-optimal scheduled is computed by optimizing the switching times of the light (i.e., between light on and off).

In this paper, we solve the time-optimal control problem of the fastest entrainment on the JFK model. The JFK model for human circadian rhythm is a third-order nonlinear differential equation, which is detailed in Sec. 2. In our prior work [18, 19], we study the optimal entrainment of a reduced model with second-order dynamics, which is obtained by ignoring a (fast) part of the dynamics, which is called the Process-L. In some later work [20, 21], we study the optimal entrainment of a further reduced model, which is obtained by ignoring the amplitude of the circadian oscillation and focusing on the phase dynamics of the oscillation. These reductions were necessary to solve the time-optimal control problem; otherwise, the solution procedures that we used were not numerically stable. The contributions of this paper can be stated as follows.

We formulate a general solution for the time-optimal control problem of fastest entrainment that can be applied to the full order JFK model.
We evaluate the impacts of the ignored dynamics in the reduced-order models on the solutions of the time-optimal control problem.
We formulate optimal feedback control laws for fastest entrainment for the full order JFK model and evaluate their robustness under modeling error.

The main tool that we use to achieve these results are (i) the lower order models that allow us to compute the solutions of similar (but simpler) problems as initial approximations of the optimal solution, and (ii) the calculus of variations that allows us to formulate a functional gradient descent algorithm to minimize the objective (i.e., entrainment time) from the initial approximations above. Note that the algorithms which are proposed to calculate the (locally or globally) optimal light input for the minimum-time entrainment only depend on the known initial states (open-loop form) instead of the current circadian states during the whole entrainment time (feedback form). If the given values of the initial states are not accurate or disturbances in the light inputs or circadian states occur during the entrainment processes, the minimum-time optimal light inputs which are given as a function of time might turn out to be invalid for entrainment. The feedback implementation of the optimal light input becomes necessary for robust entrainment. To implement the feedback entrainment, we collect data from the computed open-loop optimal trajectories to learn an optimal feedback control strategy. In contrast, existing results to this problem, e.g., those reported in [22], only report on the procedure to obtain the open-loop optimal trajectories. We also demonstrate the robustness of the optimal feedback control strategy by applying it on problem setups that are not used in the training data set.

The rest of the paper is organized as follows. Section 2 covers the full order model and the definition of the time-optimal control problem. In Section 3, we define the reduced-order models and the corresponding time-optimal control problems. In Section 4, we present the algorithms that we use to solve the time-optimal control problems above. In Section 5, we evaluate the impacts of the ignored dynamics in the reduced-order models on the solutions of the time-optimal control problem. In Section 6, we present the optimal feedback control laws for the time-optimal control problem. Finally, Section 7 concludes the paper with some discussion on the main results.

2 Problem formulation

The circadian rhythm entrainment problem in this paper is studied based on the JFK model [7]. This model is proposed on the dynamics and light-induced variations in the core body temperature (CBT), comprised with two processes: (1) the Process-L simulates the transduction from the light energy received by the retina to the neuron stimulus transmitted by the retina. It is also closely linked to the light adaption process of our eyes; (2) the Process-P simulates when the SCN receives the stimulus from the retina, how does the CBT change with this light-induced stimulus. The dynamics of the CBT oscillator is formulated and normalized on the Van Der Pol limit cycle. The dynamics of this circadian rhythm model is expressed as the following ordinary differential equations:

\frac{d x}{d t} = \frac{π}{12} [x_{c} + μ (\frac{1}{3} x + \frac{4}{3} x^{3} - \frac{256}{105} x^{7}) + B],

(1)

\frac{d x_{c}}{d t} = \frac{π}{12} [q x_{c} B - {(\frac{24}{0.99729 τ_{x}})}^{2} x - k x B],

(2)

\frac{d n}{d t} = 60 (α (I) (1 - n) - β n),

(3)

α (I) = α_{0} \cdot {(\frac{I}{I_{0}})}^{p},

(4)

u = G α (I) (1 - n),

(5)

B = (1 - 0.4 x) (1 - k_{c} x_{c}) u .

(6)

Here x(t) and x_c(t) are the states of the CBT oscillator, and n(t) is the state of the process that represents retinal photoreceptor saturation (termed Process-L). The states x(t) and n(t) are normalized and do not have physical units. The unit of x_c(t) is h⁻¹. Light input that enters the receptor is represented by its intensity I(t), which drives the Process-L. The signal u(t) is the input to the circadian oscillator, downstream from the Process-L. The values of the parameters of this model are reported in Table 1 below.

Table 1. Model parameters.

Parameter	Value	Parameter	Value
μ	0.13 h⁻¹	q	1/3
k	0.55 h⁻¹	p	0.5
α₀	0.05 h⁻¹	β	0.0075 h⁻¹
G	33.75	I₀	9500 lux
τ_x	24.2 h	k_c	0.4 h

Open in a new tab

We define a 24 h- periodic light input

\begin{matrix} I_{w} (t) = {\begin{matrix} I_{w, max}, & 0 \leq t < 16, \\ 0, & 16 \leq t < 24, \end{matrix} \end{matrix}

(7)

simply simulating the natural light-dark cycle, which is treated as the reference light in the following entrainment processes. The reference state trajectories in this model, denoted as $\bar{x} (t)$ , ${\bar{x}}_{c} (t)$ and $\bar{n} (t)$ , respectively, are the stable limit cycle solution of the JFK model with the periodic input I_w(t). The time-optimal entrainment (TOE) problem is formulated as finding the light input I(t) that drives a given initial state to this periodic solution as quickly as possible. The jet-lag caused by rapid transmeridian travel is considered in this paper. We basically assume that the circadian rhythm of the entraining subject keeps synchronization with the local time in the starting point during the travel time, formally:

PROBLEM (TOE)

Given the system dynamics (1)–(6) and initial conditions

\begin{matrix} [x, x_{c}, n] (0) = [\bar{x}, {\bar{x}}_{c}, \bar{n}] (T_{l a g}), \end{matrix}

(8)

where T_lag ∈ (0, 24) is the amount of jet-lag, i.e., time shift between the destination and starting point of travel. We want to minimize T such that

\begin{matrix} {∥ [\begin{matrix} x (T) \\ x_{c} (T) \end{matrix}] - [\begin{matrix} \bar{x} (T) \\ {\bar{x}}_{c} (T) \end{matrix}] ∥}^{2} - t o l_{1} = 0, \end{matrix}

(9)

using I(t) as the optimization variable, where tol₁ = 0.01 corresponds to about 30 minutes difference in the circadian phase and is small enough to ignore. We assume that, when the entraining state trajectory reaches the reference one (achieves the stopping criterion), the reference light in (7) is applied on the subject. Thus, the entrainment light strategy in this model during the whole time is given as

\begin{matrix} \hat{I} (t) = {\begin{matrix} I^{*} (t), & t \leq T, \\ I_{w} (t), & t > T, \end{matrix} \end{matrix}

where I*(t) represents the optimal light input of the TOE problem. As $[\bar{x}, {\bar{x}}_{c}]$ is the stable periodic solution of the dynamics with the input I_w, the reference light I_w will drive the entrainment state [x, x_c] to the reference state $[\bar{x}, {\bar{x}}_{c}]$ gradually if they are close with each other. Note that n is introduced in the Process-L of the JFK model to simulate the nonlinear relation of the light input and circadian drive received by the SCN. It has a very fast time scale and very small effects on the stability of the limit cycle, which is shown in the following section. The full order model behaves just like the 2nd-order one. Therefore, n is ignored in the terminal condition (9). We also impose the constraint

\begin{matrix} 0 \leq I (t) \leq I_{max}, \end{matrix}

(10)

for all t ∈ [0, T], where I_max is a parameter representing the maximum light intensity that can be used.

Throughout the paper, we define the circadian phase(in radian) of any point on the limit cycle of the circadian oscillation [x, x_c, n] as

\begin{matrix} θ ≜ - {tan}^{- 1} (\frac{x_{c}}{x}), θ \in [0, 2 π) . \end{matrix}

(11)

3 Model simplification

To reduce the complexity of solving the optimization problem above, some simpler models have been introduced. The optimization problem was then reformulated for the simplified models. In this section, we briefly review the simplified models and compare their behaviors under the periodic light input in Eq (7).

3.1 Second-order model

In this model, we exploit the fact that the Process-L (3) has a much faster time scale than the circadian oscillator. We use quasi steady-state approximation to assume that the Process-L is always at its steady-state. This yields

\begin{matrix} n = \frac{α (I)}{α (I) + β} = \frac{α_{0} {(\frac{I}{I_{0}})}^{p}}{α_{0} {(\frac{I}{I_{0}})}^{p} + β} . \end{matrix}

(12)

Further, from (5) we have

\begin{matrix} u = \frac{G β α_{0} {(\frac{I}{I_{0}})}^{p}}{α_{0} {(\frac{I}{I_{0}})}^{p} + β} . \end{matrix}

(13)

Since u is related to I through a static nonlinear mapping (13), we can simplify the circadian oscillation model by assuming that u is the control input and removing the Process-L from the model. The resulting model is therefore

\frac{d x}{d t} = \frac{π}{12} [x_{c} + μ (\frac{1}{3} x + \frac{4}{3} x^{3} - \frac{256}{105} x^{7}) + B],

(14)

\frac{d x_{c}}{d t} = \frac{π}{12} [q x_{c} B - {(\frac{24}{0.99729 τ_{x}})}^{2} x - k x B],

(15)

B = (1 - 0.4 x) (1 - k_{c} x_{c}) u .

(16)

Further, we map the periodic input in (7) through (13) and obtain a 24 h- periodic circadian input

\begin{matrix} U_{w} (t) = {\begin{matrix} U_{w, max}, & 0 \leq t < 16, \\ 0, & 16 \leq t < 24 . \end{matrix} \end{matrix}

(17)

The reference state trajectories in this part, denoted as $\tilde{x} (t)$ and ${\tilde{x}}_{c} (t)$ , respectively, are the stable limit cycle solution of the 2nd-order model with the periodic input U_w(t).

For the 2nd-order model, we can define a surrogate for the time optimal entrainment problem (TOE) as follows:

PROBLEM (TOE-2nd Order)

Given the system dynamics (14)–(16) and initial conditions

\begin{matrix} [x, x_{c}] (0) = [\tilde{x} (T_{l a g}), {\tilde{x}}_{c} (T_{l a g})], \end{matrix}

(18)

minimize T such that

\begin{matrix} {∥ [\begin{matrix} x (T) \\ x_{c} (T) \end{matrix}] - [\begin{matrix} \tilde{x} (T) \\ {\tilde{x}}_{c} (T) \end{matrix}] ∥}^{2} - t o l_{1} = 0, \end{matrix}

(19)

using u(t) as the optimization variable. We also impose the constraint

\begin{matrix} 0 \leq u (t) \leq u_{max}, \end{matrix}

(20)

for all t ∈ [0, T], where u_max is a parameter representing the maximum circadian input that can be used.

3.2 First-order model

Exploiting the fact that the 2nd-order circadian model has a stable limit cycle, we can further reduce the 2nd-order model to one that basically only captures the dynamics of the phase of the oscillation, and thus ignores the amplitude of the oscillation. The first-order model is of the form

\begin{matrix} \frac{d θ}{d t} = ω_{0} + f (θ) u, \end{matrix}

(21)

where θ (in radians) is the circadian phase. The parameter ω₀ is the so-called free running frequency, whose value ω₀ = 2π/24.2 rad/h is chosen to match the frequency of the limit cycle (with u(t) ≡ 0) in the second-order model (14)–(16). The input u is the circadian input with the same interpretation as in (13). The function f(θ) is called the phase response function (or phase response curve) in the literature [16, 20, 23, 24]. Based on (21), we can interpret this function as a map from the timing of the introduction of an impulsive input to the resulting shift in the circadian phase.

We compute the phase response function f(θ) with the following procedure.

Step 1
Obtain the free running periodic solution to (14)–(16) by setting u(t) ≡ 0. Denote this solution as $({\tilde{x}}_{f} (t), {\tilde{x}}_{c f} (t))$ and the period as T_ω. We assume the timing convention occurs at ${\tilde{x}}_{f c} (0) = 0$ and ${\tilde{x}}_{f} (0)$ is at maximum.
Step 2
Choose different values of τ ∈ [0, T_ω], run the model in (14)–(16) with initial conditions $x (0) = {\tilde{x}}_{f} (τ)$ and $x_{c} (0) = {\tilde{x}}_{c f} (τ)$ with the impulsive input signal
$\begin{matrix} u_{θ} (t) = {\begin{matrix} u_{max}, & t \in [0, Δ], \\ 0, & else . \end{matrix} \end{matrix}$ (22)

The corresponding impulsing phase is defined as
$\begin{matrix} θ = - {tan}^{- 1} (\frac{{\tilde{x}}_{c f} (τ)}{{\tilde{x}}_{f} (τ)}), θ \in [0, 2 π), \end{matrix}$ (23)
where u_max = 0.2208 corresponds to I_max = 10000 lux. Here, Δ is a much shorter time interval than 24 hours. In our implementation, we choose Δ = 0.5 hour.
Step 3
Since (x, x_c) will converge back to the free running periodic orbit, we define the resulting time shift T_θ (modulo the period T_ω) such that
$\begin{matrix} lim_{t \to \infty} ∥ [\begin{matrix} x (t) \\ x_{c} (t) \end{matrix}] - [\begin{matrix} {\tilde{x}}_{f} (t + τ + T_{θ}) \\ {\tilde{x}}_{c f} (t + τ + T_{θ}) \end{matrix}] ∥ = 0 . \end{matrix}$ (24)

Then, the phase response function is given by
$\begin{matrix} f (θ) = \frac{2 π T_{θ}}{T_{ω} u_{max} Δ} . \end{matrix}$ (25)

The resulting PRC is demonstrated as Fig 1, which is used for the formulation of the 1st-order model in (21).

Forcing the first-order model in Eq (21) with the periodic circadian input U_w(t) from Eq (17) results in a periodic limit cycle (modulo 2π), where the circadian phase trajectory is denoted as $\bar{θ} (t)$ . For the first-order model, we can define a surrogate for the time-optimal entrainment problem (TOE) as follows:

PROBLEM (TOE-1st order)

Given the system dynamics (21) with an initial condition

\begin{matrix} θ (0) = \bar{θ} (T_{l a g}), \end{matrix}

(26)

minimize T such that

\begin{matrix} | θ (T) - \bar{θ} (T) | - t o l_{2} = 0 mod 2 π = 0, \end{matrix}

(27)

using u(t) as the optimization variable, where tol₂ = 0.1 also corresponds to about 30 minutes difference in the circadian phase. We also impose the constraint

\begin{matrix} 0 \leq u (t) \leq u_{max}, \end{matrix}

(28)

for all t ∈ [0, T], where u_max is a parameter representing the maximum circadian input that can be used.

3.3 Behaviors under periodic light inputs

Under the periodic inputs I_w(t) (for the full order model) and U_w(t) (for the reduced models), the system exhibits stable limit cycles. Fig 2 shows a comparison between the periodic orbits of these models for I_w,max = 100 lux (comparable to the light intensity in a dim indoor space such as office corridors or elevators [25]), and I_w,max = 10000 lux (comparable to the light intensity outdoor on a bright day). The corresponding values of U_w,max (through (13)) are 0.1028 and 0.2208, respectively. We can make the following observations from these simulation data:

The periodic orbits of the full order model and the second-order model are practically the same.
There is a relative phase shift between these models, marked by the circadian phases at sunrise. At I_w,max = 100 lux, the largest gap is 0.0974 rad (22.31 min) between the first-order and second-order models. At I_w,max = 10000 lux, the largest gap is 0.1534 rad (35.15 min) between the second-order and the full order model.
The impact of the Process-L is more significant at the higher circadian light intensity. This can be observed by comparing the waveforms of the circadian input u(t) in the full order and second-order models. At I_w,max = 10000 lux, u(t) has more pronounced spikes when the light is switched on, compared to at I_w,max = 100 lux.

4 Solution strategies for the time optimal entrainment problems

Solution strategies for TOE-2nd order and TOE-1st order have been reported in [19, 20], respectively. For completeness, we summarize the strategies in this section.

First, note that all system models are single input affine nonlinear systems. That is, they can be written in the following format

\begin{matrix} \frac{d ξ}{d t} = ϕ (ξ) + γ (ξ) υ, \end{matrix}

(29)

where $ξ \in R^{n}$ is the system state, and $υ \in R$ is the control input. We assume that the control input is bounded,

\begin{matrix} 0 \leq υ \leq υ_{max} . \end{matrix}

(30)

Further, the time optimal entrainment (TOE) problem can be cast as a time optimal control problem with terminal state constraint, which can be formulated as minimizing the objective

\begin{matrix} J = \int_{0}^{T} 1 d t, \end{matrix}

(31)

subject to the initial and terminal state constraints

ξ (0) = ξ_{init},

(32)

η (ξ (T), T) = 0 .

(33)

The function $η : R^{n} \times R \to R$ can be used to represent the fact that the terminal state must be at a time-varying target state (i.e., the reference). In terms of the entrainment problem of the three models mentioned in this paper, we define the expression of η = 0 based on (9), (19) and (27), respectively.

Following the Pontryagin’s Maximum Principle, we formulate the augmented cost function

\begin{matrix} L (ξ, υ, T, λ, κ) ≜ T + \int_{0}^{T} λ^{T} (t) (ϕ (ξ (t)) + γ (ξ (t)) υ (t) - \frac{d ξ}{d t}) d t + κ^{T} η (ξ (T), T), \end{matrix}

(34)

where $λ (t) \in R^{n}$ is the co-state and κ is the Lagrange multiplier corresponding to the terminal state constraint.

The first variation of the objective J with respect to the input signal υ(t) is given by

\begin{matrix} δ J = \int_{0}^{T} λ^{T} (t) γ (ξ (t)) δ υ (t) d t, \end{matrix}

(35)

where δυ(t) is the perturbation of υ(t). The optimal control input can be found by

\begin{matrix} υ^{*} (t) = arg min_{υ} λ^{*} {(t)}^{T} γ (ξ^{*} (t)) υ, \end{matrix}

(36)

where ξ*(t) and λ*(t) are optimal state and co-state trajectories, respectively. We can see that the optimal control input therefore necessarily follows a bang-off strategy

\begin{matrix} υ^{*} (t) = {\begin{matrix} 0, & λ^{* T} (t) γ (ξ^{*} (t)) > 0, \\ υ_{max}, & λ^{* T} (t) γ (ξ^{*} (t)) < 0 . \end{matrix} \end{matrix}

(37)

Remark 1 The case where λ*^T γ(ξ*) is zero for a time interval of nonzero length is called a singular arc. We show in [19] that singular arcs do not affect the solution of TOE-2nd Order. Further, although not presented here, it is not difficult to see that the singular arc does not exist in the TOE-1st Order as it would violate the transversality condition explained below.

The evolution of the state follows the model in (29), while the co-state evolves according to

\begin{matrix} \frac{d λ^{*}}{d t} = - {(\frac{\partial ϕ (ξ^{*})}{\partial ξ^{*}} + \frac{\partial γ (ξ^{*}) υ^{*}}{\partial ξ^{*}})}^{T} λ^{*} . \end{matrix}

(38)

Further, λ*(t) needs to satisfy its terminal condition,

\begin{matrix} {(\frac{\partial η (ξ^{*} (T^{*}), T^{*})}{\partial ξ})}^{T} κ^{*} & = λ^{*} (T^{*}), \\ 1 + λ^{* T} (T^{*}) (ϕ (ξ^{*} (T^{*})) + γ (ξ^{*} (T^{*})) υ^{*} (T^{*})) + κ^{* T} \frac{\partial η (ξ^{*} (T^{*}), T^{*})}{\partial T} & = 0 . \end{matrix}

(39)

which is also known as the transversality condition [26, 27]. Here, T* and κ* denote the optimal time and Lagrange multiplier, respectively.

To solve the optimal control problem above, we use two algorithms, which are explained below along with their strengths and weaknesses.

4.1 Direct shooting algorithm (DSA)

In this algorithm, we view the two point boundary value problem above as finding the appropriate initial co-state λ*(0). Observe that if (ξ*(t), υ*(t), λ*(t)) satisfy (29), (37), and (38), then so do (ξ*(t), υ*(t), cλ*(t)) for any positive scalar c. The right scalar c has to be chosen to satisfy the transversality condition (39). Therefore, the search space for λ*(0) can be reduced from $R^{n}$ to the unit sphere in $R^{n}$ . It turns out that for TOE-2nd Order and TOE-1st Order, we can solve the two point boundary value problem rather efficiently. For TOE-2nd Order, this means λ*(0) can be sought on a circle. For TOE-1st Order, there are only two candidates for λ*(0). The basic direct shooting algorithm can then be expressed as follows:

Step 1
Create N sample points on the unit sphere in $R^{n}$ . Denote them as λ₁, ⋯, λ_N.
Step 2
For each of i ∈ {1, ⋯, N} do: simulate ξ(t) and λ(t) forward using (29), (37), and (38), under initial conditions ξ(0) = ξ_init and λ(0) = λ_i. Terminate the simulation when the final state constraint is satisfied,
$\begin{matrix} t = T_{i}, η (ξ (T_{i}), T_{i}) = 0, \end{matrix}$ (40)
or when t reaches an upper bound T_max. The upper bound T_max can be initially set based on the time needed for open-loop entrainment, and subsequently reduced as shorter entrainment times are found.
Step 3
Several locally optimal solutions can be found using the direct shooting method by searching abundant guesses of the initial co-state values. The optimal solution with the minimum entrainment time among all found solutions is treated as the globally optimal one, i.e., the optimal entrainment time is T* = min{T_i}_i=1,⋯,N.

Note that (29), (37), (38), and (39) provide us with a necessary condition for local optimality. This direct shooting algorithm provides us with a means to search (through sampling) for all solutions that satisfy this condition and therefore has the advantage of not getting trapped in local minima. The weakness of this algorithm is that for higher-order systems, such as the full order model, the number of samples generated can be impractically large. Another weakness is the co-state dynamics in (38) is unstable for the second-order and the full order models. This means forward numerical integration of this dynamics is unreliable for long periods of time, e.g., in jet-lag cases with long entrainment time.

Representatives of the optimization results using the DSA for a jet-lag case, where a subject whose initial circadian phase is at 6 am (corresponding to θ(0) = 0.8075 in TOE-1st Order) seeks to minimize her entrainment time for a 12-hour jet-lag are shown in Fig 3. We assume that u_max = 0.1028, u_max = 0.1731 and u_max = 0.2208 (corresponding to steady circadian lighting levels of 100 lux, 1000 lux and 10000 lux, respectively). Applying the same algorithm for the corresponding TOE-2nd Order case (corresponding to x(0) = 0.8033 and x_c(0) = −0.6905) leads to a success for the bright circadian light cases (u_max = 0.2208 and 0.1731), as shown in Fig 4. For dim circadian light (u_max = 0.1028), the DSA does not converge because of the instability of numerical forward integration of the co-state dynamics in (38). This case is illustrated in Fig 5, where we can observe that even with the same initial co-state value at t = 0, λ₁ and λ₂ from the numerical forward integration diverge from the backward integration values at about 180 hours. As the optimal entrainment times with I_max = 1000 and 10000 lux are always less than 150 hours, the direct shooting algorithm has no issues in the forward integration in these cases.

Fig 3 — The black and blue curves indicate the circadian phases of the subject and the reference, respectively. The red curve indicates the optimal light input of the 1st-order model. The green dash curves give the corresponding TOE solutions of the full order model.

Fig 4 — (Top) The black and blue curves indicate the circadian phases of the subject and the reference, respectively. The red curve indicates the optimal circadian input. (Bottom) The optimal co-state trajectories.

Fig 5 — The backward integration (blue curves) is performed first. When forward integration is performed to re-trace the backward integration, the solutions (black curves) diverge.

4.2 Gradient descent algorithm (GDA)

This algorithm addresses the weaknesses of the DSA. The idea is to use gradient descent search for the optimal input υ*(t) by using the first variation in (35). The basic gradient descent algorithm can then be expressed as follows:

Step 1
Set k = 0. Choose an initial guess for control input $υ_{0} : R_{+} \to [0, υ_{max}]$ .
Step 2
Simulate ξ_k(t) forward using (29), the initial condition ξ_k(0) = ξ_init, and the control input υ_k(t) until t = T_k such that η(ξ_k(T_k), T_k) = 0.
Step 3
Find the final condition for the co-state by solving
$\begin{matrix} {(\frac{\partial η (ξ_{k} (T_{k}), T_{k})}{\partial ξ})}^{T} κ_{k} & = λ_{k} (T_{k}), \\ 1 + λ_{k}^{T} (T_{k}) (ϕ (ξ_{k} (T_{k})) + γ (ξ_{k} (T_{k})) υ_{k} (T_{k})) + κ_{k}^{T} \frac{\partial η (ξ_{k} (T_{k}), T_{k})}{\partial T} & = 0, \end{matrix}$ (41)
for κ_k and λ_k(T_k).
Step 4
Simulate λ_k(t) backward using (38) and the final condition λ_k(T_k).
Step 5
Compute the descent direction using (35), i.e.,
$\begin{matrix} Δ_{k} (t) ≜ λ_{k}^{T} (t) γ (ξ_{k} (t)), t \in [0, T_{k}] . \end{matrix}$ (42)
Step 6
Update the control input along the descent direction:
$\begin{matrix} υ_{k + 1} (t) = min (max (0, υ_{k} (t) - α_{k} Δ_{k} (t)), υ_{max}) . \end{matrix}$ (43)

The step size α_k > 0 should be chosen such that the entrainment time is improved or the same as the previous value (T_k+1 ≤ T_k). In our simulations, we use a line search (bisection search, etc.) to solve the optimization problem given as follows:
$α_{k} = arg min_{α > 0} T (min (max (0, υ_{k} (t) - α Δ_{k} (t)), υ_{max})) .$

This process would be time-consuming, but it guarantees that the updating step decreases the entrainment time as much as possible or has no effects on the entrainment time in every iteration.
Step 7
Increment k by 1. Repeat from Step 2 until convergence. In our numerical simulations, we set the stopping criterion for the gradient descent process as
$\int_{0}^{T_{k + 1}} | υ_{k} (t) - υ_{k + 1} (t) | d t \leq 10^{- 10} .$

The stopping criterion is used to verify that υ_k(t) has reached or approximately reached the fixed point and the entrainment time cannot be improved. In most cases, the gradient descent algorithm takes at most 50 iterations to reach the stopping criterion mentioned above.

We can show that the following property characterizes fixed points of the iteration procedure above.

Lemma 2 The iteration reaches a fixed point, i.e., υ_k+1(t) = υ_k(t) for any positive step size α_k in Step 6 if and only if the following condition is satisfied for all t ∈ [0, T_k]

\begin{matrix} υ_{k} (t) = 0 \Rightarrow Δ_{k} (t) \geq 0, \\ υ_{k} (t) = υ_{max} \Rightarrow Δ_{k} (t) \leq 0, \\ υ_{k} (t) \in (0, υ_{max}) \Rightarrow Δ_{k} (t) = 0 . \end{matrix}}

(44)

Proof: Assume Eq (44) is satisfied. For all t ∈ [0, T_k], when υ_k(t) = 0, we have Δ_k(t)≥0. Based on Step 6, for any positive α_k, υ_k+1(t) = υ_k(t) = 0. It can also be proved that, for any t ∈ [0, T_k], if υ_k(t) = υ_max, υ_k+1(t) = υ_max; if υ_k(t) ∈ (0, υ_max) and Δ_k(t) = 0, we must have υ_k+1(t) = υ_k(t) for any α_k > 0. Therefore, the iteration reaches a fixed point. If Eq (44) is not satisfied, for example, if υ_k(t) = 0 but Δ_k(t) < 0 for any t ∈ [0, T_k], we have υ_k+1(t) > 0 by Eq (43). We can also prove that υ_k(t) ≠ υ_k+1(t) if the second or third row of Eq (44) does not hold. Therefore, υ_k(t) is not a fixed point if it does not satisfy the condition in (44).

Note that convergence of the iteration essentially means we hit a locally optimal control input. While the locality is a drawback of this algorithm, it has an advantage over the DSA because it does not require forward simulation of (38) that is numerically unstable. We demonstrate the use of GDA in solving the 12-hour jet-lag case for TOE-2nd Order (corresponding to x(0) = 0.8033 and x_c(0) = −0.6905) under dim circadian light (u_max = 0.1028). We have shown in the previous section that DSA cannot be used to solve this problem. The TOE solution of this case is shown in the upper panels in Fig 6. Observe that since the co-states dynamics are integrated backward, we do not have the instability issue encountered in the DSA. Fig 6 also shows the TOE results of the 2nd-order model with I_max = 1000 and 10000 lux. We can observe in these cases that the amplitude suppression occurs in the oscillator of the core body temperature. The amplitude suppression regions are usually called the phaseless set [28], which, under higher light intensities, gives a ‘shortcut’ for the circadian states to follow and shortens the entrainment time. We can also observe in the amplitude suppression cases that, in the first two days, the light inputs are almost centered around the minimum value of x, i.e., the minimum value of the core body temperature. These light strategies are very similar to those in the trials in [29], whose results also prove that amplitude suppression contributes to the rapid circadian phase shift (as strong type 0 PRC) and fast entrainment under bright light.

Fig 6 — The left column are the graphs of the state trajectories, locally optimal light inputs, and the reference trajectories. The red curves indicate the optimal light inputs of the 2nd-order model. The green dash curves give the corresponding TOE solutions of the full order model. The right column are the graphs of the corresponding gradient of the entrainment time with respect to the circadian light input. Observe that, per Lemma 2, both solutions are indeed (locally) optimal.

We also apply GDA on the full order TOE problem. Again, we assume that the subject’s initial circadian phase is at 6 am (corresponding to x(0) = 0.7562, x_c(0) = −0.7591, and n(0) = 0.4061). We consider 3 values of I_max, 100 lux (dim indoor space), 1000 lux (overcast day outdoor or bright indoor space), and 10000 lux (full daylight outdoor). Fig 7 demonstrates the 12-hour minimum-time entrainment cases with I_max = 100, 1000 and 10000 lux and TOE solutions. We can also observe that the amplitude suppression occurs in the bright light cases, e.g. 1000 and 10000 lux, these results are consistent with those in the 2nd-order model. Fig 8 shows the results of applying the GDA on the 11-hour jet-lag case for the full order model, using I_max = 1000 lux. The fact that the locally optimal solutions provided by the GDA are generally not unique is demonstrated in Fig 8, where two solutions are shown to satisfy the conditions in Lemma 2. Fig 9 shows how the objective function improves over the iterations of the GDA for the two solutions shown in Fig 8.

Fig 7 — On the left column are the graphs of the state trajectories, locally optimal light input, and the reference trajectories. The red curve indicates the optimal circadian input of the full order model. On the right column are the graphs of the corresponding gradients and light inputs. Observe that, per Lemma 2, both solutions are indeed (locally) optimal.

Fig 8 — (Left) Two locally optimal solutions of TOE (full order) for the 11-hour jet-lag case with I_max = 1000 lux, obtained by using the GDA with two different initial guesses of the circadian light input signal. (Right) Graphs of the corresponding optimal circadian light inputs I*(t) and the descent direction Δ(t). We can also observe that, per Lemma 2, both solutions are (locally) optimal.

Fig 9 — The initial guess 1 in the blue curve in this figure converges to the TOE solution in the upper panels in Fig 8, the initial guess 2 in the black curve in this figure converges to the TOE solution in the lower panels in Fig 8.

4.3 Combining direct shooting and gradient descent algotihms

Local optimality of the results of GDA implies that their quality heavily depends on the initial guess of the optimal control input. On the other hand, DSA can be used effectively to find the global optimal solution for lower-order models. We then combine the use of both algorithms by using the solutions obtained using DSA on a lower-order model to warm start the GDA. Specifically, if u*(t) is the optimal circadian input found by applying DSA on a 1st- or 2nd-order model, then our initial guess for the optimal circadian light input intensity I*(t) for the full order model is determined by

\begin{matrix} I^{*} (t) = {\begin{matrix} 0, & u^{*} (t) = 0, \\ I_{max}, & u^{*} (t) = u_{max} . \end{matrix} \end{matrix}

(45)

Then, we apply the GDA to further optimize I*(t). Note that we do not assume that the GDA will converge to I*(t) that only assumes two values, 0 or I_max.

We compare the performance of the solutions that we obtained using this strategy with published results by Serkh and Forger [22]. The entrainment time for various jet-lag cases for our results and those from [22] are shown in Fig 10. In the same figure, we can also see the performance improvement between the solutions before the application of the GDA algorithm (thus, the initial guesses) and the solutions after the GDA algorithm. The comparison between the blue (results in [22]) and black curves (our results) verifies the performance of our algorithm. Also, the TOE light inputs from our algorithm (shown as the green dash curves in Fig 6) are also consistent with those in [22] (in particular, Fig S13 in [22]).

The entrainment time in Fig 10 implies that the maximum entrainment time in our gradient descent results occurs at around 12-13 hours shift (westward travel), which means that the JFK model is almost east-west symmetrical. These results are consistent with those in [22], which addressed the minimum-time entrainment problem of the same circadian model. However, if we only use the natural light-dark cycle in (7) with I_w,max = 100 lux for entrainment without any other light input, the entrainment time in Fig 11 demonstrates that the JFK model is still east-west asymmetrical as maximum entrainment time occurs at around 15 hours westward shift. These results are consistent with those in [28], which also considered the entrainment of the JFK model using the natural light-dark cycle. These results imply that the JFK model is east-west asymmetrical under natural light but the optimal light inputs for minimum-time entrainment can remove this east-west asymmetry.

The solution strategy proposed by Serkh and Forger is briefly described as follows. It assumes that the optimal light input I*(t) only has two values, 0 or I_max. This assumption is motivated by the fact that time optimal control problems with bounded input constraints generally have bang-bang or bang-off optimal solutions [26, 27]. The exceptions of this case are when the optimal solutions lie in a so-called singular arc [26, 27].

The switching time instants of the light input are parameterized as t₁, t₂, ⋯, t_N and considered as the optimization variables. The optimal control problem in TOE is reformulated as follows. To represent entrainment, the concept of isochrons is used. Essentially, this is established by using a phase function $φ : R \times R \to S^{1},$ such that two pair of states of the circadian oscillator (x₁, x_c1) and (x₂, x_c2) are considered to have the same phase if φ(x₁, x_c1) = φ(x₂, x_c2). Then, the following objective is optimized

\begin{matrix} J = T + C \cdot {(∥ [\begin{matrix} x (T) \\ x_{c} (T) \end{matrix}] ∥ - ∥ [\begin{matrix} \bar{x} (T) \\ {\bar{x}}_{c} (T) \end{matrix}] ∥)}^{2}, \end{matrix}

(46)

under the constraint

\begin{matrix} φ (x_{1} (T), x_{c 1} (T)) = φ (x_{2} (T), x_{c 2} (T)) . \end{matrix}

(47)

The constant C > 0 in (46) represents the trade-off between minimizing the end time and matching the amplitude of the circadian oscillator with that of the reference signal. The optimal switching time instants $t_{1}^{*}, t_{2}^{*}, \dots, t_{N}^{*}$ are then computed using a gradient descent technique. Compared with [22], the algorithm used in this paper is proposed without the assumption that the minimum-time optimal input is always bang-off and computes the light input during the whole entrainment process instead of only the switching times. The results in [19] and [30] show the existence of singular arcs in the minimum-time entrainment solutions in several circadian rhythm models. These singular arcs violate the assumption that the optimal input is always bang-off. Therefore, our algorithm is more general in solving the optimal solution in various models.

5 The impacts of model reduction on the TOE solutions

In Sec. 3, we discussed two steps of model reduction of the full order JFK model. First, we ignored the dynamics of the Process-L and introduced the 2nd-order model. Second, we ignored the amplitude of the oscillation and introduced the 1st-order model. Here, we study the impacts of these model reduction steps on the solutions of the time-optimal entrainment problems. The TOE solutions of the full order model are also demonstrated in Figs 3 and 6. Comparing them with the TOE solutions of the 1st-order and 2nd-order models, we observe that, when the light input is dim (100 lux in the upper panels), the light on-off switching times in the TOE solutions of the 1st-order and 2nd-order models are both very similar with those in the full order model, while the difference between the 1st-order and full order model is enlarged when the light intensity is increased (middle and lower panels in Fig 3). The TOE solutions of the 2nd-order model are close to those of the full order model even under a bright light input, as demonstrated in the lower panel in Fig 6. However, even if we apply the optimal light input from the 2nd-order model on the full order model in the form of (45) in the bright light cases, it still brings a large difference in the entrainment time from the optimal entrainment time in many cases, shown as Fig 10. Specifically, for jet-lag cases, we evaluate the performance of the solutions derived using the lower-order models when applied on the higher-order models. For this, we simply replay the computed optimal control input trajectory u*(t) on the higher-order models until we reach entrainment, which is marked by the circadian states of the subject matching those of the reference, within a certain error tolerance band. If at the end of the computed u*(t) we do not reach entrainment, we append the control input with the reference input trajectory until we reach entrainment. As a concrete example, suppose that we consider the 12-hour jet-lag case with the maximum circadian lighting level of 10000 lux, and want to replay the optimal control input trajectory u*(t) computed using the 1st-order model on the full order model. The trajectory u*(t) in this case has a duration of 121 hours, as shown in Fig 3. The same 12-hour jet-lag case for the full order model is associated with the initial conditions

[x, x_{c}, n] (0) = [0.7562, - 0.7591, 0.4061] .

We then simulate the full order JFK model with the initial conditions above under the circadian light intensity signal I*(t) obtained using Eq (45) until entrainment is reached or t = 121 hours. If we do not reach entrainment by t = 121 hours, we continue I*(t) with the reference circadian lighting,

I^{*} (t) = I_{w} (t), t > 121,

until entrainment is reached. Physically, this step means exposing the subject to the local/reference circadian lighting until his circadian rhythm is synchronized with the local/reference circadian rhythm.

Fig 12 shows the results from the procedure above. We can observe that at lower circadian light intensity the gaps between the solutions of the time optimal entrainment problem of the reduced model and those of the full order model are smaller. In particular, at I_max = 100 lux, the solutions obtained using the second order are very similar to those from the full order model. However, the solutions obtained using the first order model already deviate quite significantly from those from the higher order models. This indicates that taking into account the amplitude dynamics is relatively more important than the Process-L dynamics in solving the time optimal entrainment problem.

6 Feedback implementation of the TOE solutions

The solution strategies obtained with all of the approaches described in the previous section are open loop in nature. Given an initial state and the reference state trajectory, the entire optimal control input trajectory υ*(t) and the corresponding optimal state trajectory ξ*(t) can be computed. However, because of Bellman’s Principle of Optimality [26, 27], we also know that the optimal input can be given in terms of an optimal feedback law υ*(ξ*, t). In this section, we discuss our approach to construct such an optimal feedback law from the obtained open loop optimal solutions.

6.1 Feedback implementation of the TOE solutions—First order case

A feedback implementation of the TOE 1st-order problems amounts to expressing the optimal circadian input u*(t) as a function of the subject’s circadian phase θ(t) and the reference circadian phase $\bar{θ} (t)$ . We have obtained and reported such feedback laws in our prior publication [20].

Feedback implementation of the optimal control solutions confers more robustness to the solutions, e.g., under modeling error, compared to the open-loop implementation of the solutions. By open-loop implementation we mean simply replaying the optimal control input, as done in Sec. 5. We demonstrate this point by comparing the performance of the open-loop and feedback implementations of the solutions of TOE 1st-order on the full order model. The data are shown in Fig 13.

Fig 13 — The top, middle, and bottom graphs represent the cases of circadian light intensity of 100 lux, 1000 lux, and 10000 lux, respectively. The red and black curves represent the results from the feedback and open-loop implementations, respectively. The blue curves represent results from the full order TOE, which are shown as benchmarks.

6.2 Feedback implementation of the TOE solutions—Full order case

Applying the idea above to the solution of full order TOE means we need to compute the optimal circadian light input I* as a function of the three states of the system (x, x_c, and n) and the states of the reference system ( $\bar{x}$ , ${\bar{x}}_{c}$ , and $\bar{n}$ ). However, since the reference trajectory is periodic, we can replace the states of the reference system with its circadian phase θ_r, where the definition of the circadian phase is given in Eq (11). Further, we aim for the feedback law to be independent of n (the state of the Process L), because the implementation of such feedback law would require measuring (or estimating) n, which has much faster dynamics than the circadian rhythm.

The procedure for constructing the feedback law is as follows:

Step 1
For a given I_max, collect a number of optimal trajectories $[x^{*} (t), x_{c}^{*} (t), n^{*} (t), θ_{r} (t), I^{*} (t)]$ each with different initial conditions $[x^{*} (0), x_{c}^{*} (0), n^{*} (0)]$ .
Step 2
Generate N_s data points by sampling the trajectories. That is, each data point $[x_{i}^{*}, x_{c, i}^{*}, n_{i}^{*}, θ_{r, i}, I_{i}^{*}] \in R^{5}$ , i ∈ {1, ⋯, N_s} is a point on a trajectory described in Step 1.
Step 3
Find an interpolation function $F : R^{3} \to R$ that interpolates $[x_{i}^{*}, x_{c, i}^{*}, θ_{r, i}] \to I_{i}^{*}$ , i ∈ {1, ⋯, N_s}. For the results reported in this paper we use a feedforward neural network trained by gradient propagation and cross-entropy loss function with a single hidden layer of 32 neurons (implemented by the patternnet function in Matlab) to represent the feedback function $F : R^{3} \to {0, I_{max}}$ .

Fig 14 shows the representations of the feedback controllers that we generated using this procedure for 3 values of I_max, 100 lux (dim indoor space), 1000 lux (overcast day outdoor or bright indoor space), and 10000 lux (full daylight outdoor). The optimal trajectories for the training data are obtained by solving 23 cases of jetlag (T_lag ∈ {1, 2, ⋯, 23}) and sampled at intervals of 0.01 hour.

We evaluate the robustness of the feedback controller in three ways, as described below.

Cross-validation with new jetlag cases

We apply the learned feedback controllers on jetlag cases that are not used in the training data set. Specifically, we consider the jetlag cases with $T_{l a g} \in {\frac{1}{2}, 1 \frac{1}{2}, \dots, 23 \frac{1}{2}}$ . We compare performance of the optimal open loop solutions of the TOE as described in Sec. 4.3 for these cases with the respective performance of the learned feedback controllers. The results are shown in Fig 15. We can confirm that the feedback controllers indeed perform well in this cross-validation.

Fig 15 — The vertical axis denote entrainment time in hours.

Robustness w.r.t. n(0)

The feedback controllers ignore the influence of n(t) in computing the optimal control response. To evaluate the validity of the assumption that n(t) can be ignored, we use the feedback controllers under various initial conditions n(0). The results shown in Fig 16 confirm that n(t) plays a very small role in the performance of the feedback controlled system.

Fig 16 — Observe that n(0) (and consequently n(t)) only has little effect on the entrainment times.

Robustness w.r.t. change in I_max

We train several feedback controllers for different I_max values. As shown in Fig 14, there are significant differences between these controllers. To evaluate their generalizability, we use the learned feedback controllers with light inputs with different I_max. Specifically, we evaluate the performance of the feedback controllers that are learned using data with I_max = 10000 lux and I_max = 1000 lux on circadian light source with I_max = 5000 lux. The results are shown in Fig 17, which shows that these controllers can still perform well under the change in I_max. For example, the feedback controller for I_max = 10000 lux results in entrainment times that are less than 10 hours longer compared to the optimal open-loop solutions.

Fig 17 — Observe that the feedback controllers perform quite well compared to the actual optimal solutions (black curve).

7 Discussion and conclusions

In summary, we developed a method for constructing an optimal feedback controller for the time-optimal entrainment problem for the Jewett-Forger-Kronauer (JFK) circadian rhythm model. Our approach is based on the calculus of variations, which enables us to formulate local optimality conditions for the solutions. It also enables us to define a gradient descent algorithm based on the local optimality criteria.

Comparison of our method and the method in [22]

The optimal entrainment results from our method and the method in [22] demonstrate that our method works as well as the method in [22] in minimizing the entrainment time. The optimal solutions found in this paper demonstrate that we do not encounter singular arcs. In other words, the optimal light input for the minimum-time entrainment of the JFK model is bang-off. The method in [22] is less complex because it has fewer decision variables (i.e., the switching times), but the assumption that the minimum-time optimal light input is bang-off does not always hold, as shown in [19] and [30], due to the existence of the singular arc in the optimal solution. Our method is more general because it does not assume bang-off solutions and, then, could find an optimal solution that contains some singular arcs.

On the relevance of lower-order circadian rhythm models

Because of the local optimality property, finding the global optimum would require us to search the solution space exhaustively for solutions that satisfy these conditions. While this procedure is not feasible for the full order JFK model, we demonstrate that it can be performed successfully on reduced-order models that we discuss in this paper. Further, the resulting optimal solutions can be used as initial guesses for the optimal solution of the full order model and then improved using the gradient descent algorithm. Therefore, effectively we use the lower-order models as surrogates to the full order model to warm start the gradient descent search algorithm for the optimal solutions.

Impacts of model reduction on the TOE solutions

Two simplified versions of the JFK model have been discussed in this paper: (i) the 2nd-order model that is the result of ignoring the dynamics of the Process-L, (ii) the 1st-order model resulting from ignoring the dynamics of both the Process-L and the amplitude of the circadian oscillation. We evaluate the impacts of the model reduction on the solutions of the TOE problems by replaying the solutions for the lower-order models on the higher-order ones. The results indicate that for low light intensity, the solutions from the second-order model are practically the same as those from the full order model. Meanwhile, the solutions from the first-order model differ more significantly from those from the second-order model. This suggests that the amplitude dynamics is relatively more important than the Process-L dynamics in solving the TOE problems. Another indicator of the importance of the amplitude dynamics is the fact that in many optimal solutions for the second and full order models the oscillation amplitude is quenched along the way to reach entrainment. This is shown, e.g., in Fig 6 (bottom) and Fig 7 (bottom).

Potential impacts of amplitude suppression in circadian oscillator

The amplitude suppression phenomenon had been discussed in [29], whose experimental data showed that both the amplitude and phase of the circadian temperature were dramatically changed by the bright light pulses (7000∼12000 lux). This helps explain why the circadian oscillator is dramatically quenched in I_max = 10000 lux but almost remains on the limit cycle when I_max = 100 lux during the minimum-time entrainment processes. The amplitude suppression in the circadian oscillator is closely linked to the minimum-time entrainment with bright light in this paper and previous literature. However, CBT amplitude suppression may also have potential connections with health-related issues: the experimental results in [31] indicates that the CBT rhythm tends to flatten with increasing age. The work in [32] also demonstrates that the quenched amplitude of the CBT is closely connected with sleep disruption. Therefore, the entrainment under very bright light with potential sleep disorders may be not suitable for every individual.

Robustness of the optimal feedback controller

We use a machine learning algorithm to learn an optimal feedback controller for the (full order) TOE problem using data collected from the optimal open-loop trajectories, which are computed using the approach above. In general, we discover that the optimal feedback controller depends on I_max, the maximum circadian light intensity that is used for entrainment. We subsequently train separate optimal feedback controllers for three different values of I_max. However, we also demonstrate that the trained optimal controller is robust to some variations in I_max. Further, the trained feedback controllers depend only on the states of the circadian oscillator, but not on the state of the Process-L. We find that the state of the Process-L has little effect on controller performance.

The minimum-time optimal light strategies proposed in this paper can be implemented on the entraining travelers if the initial circadian states and local time in the destination are known. Although the light intensities of most outdoor lights are highly noisy and maintaining the same lux for hours could be unrealistic, some lighting devices (re-timer and light therapy glasses) and indoor light (the light in the hospital rooms and offices) can be used for offering stable light inputs. The robustness of entrainment against the variation in the light intensity and circadian states is improved by using the feedback entrainment strategy in Section 6, which requires the real-time measurement of the circadian states, i.e., core body temperature. For this purpose, some portable sensors had been developed for instantaneous measurements of the core body temperature [33]. The feedback implementation of the entrainment process is enabled by these sensors.

Acknowledgments

The authors would like to thank Dr. Daniel Forger for valuable discussions that led to the study and results reported in this paper.

Data Availability

All relevant data are within the paper and figures.

Funding Statement

All authors are supported by the Army Research Office (https://www.arl.army.mil/www/default.cfm?page=29) through grant W911NF-17-1-0562, and the National Science Foundation (https://nsf.gov) through an Engineering Research Center grant EEC-0812056. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

1. Aschoff J. Circadian rhythms in man. Science. 1965;148(3676):1427–1432. 10.1126/science.148.3676.1427 [DOI] [PubMed] [Google Scholar]
2. Refinetti R. Circadian Physiology. CRC Press; 2005. [Google Scholar]
3. Rea MS, Bierman A, Figueiro MG, Bullough JD. A new approach to understanding the impact of circadian disruption on human health. J Circadian Rhythms. 2008;6(1):7 10.1186/1740-3391-6-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
4. Baggs JE, Price TS, DiTacchio L, Panda S, FitzGerald GA, Hogenesch JB. Network features of the mammalian circadian clock. PLoS Biol. 2009;7(3):563–575. 10.1371/journal.pbio.1000052 [DOI] [PMC free article] [PubMed] [Google Scholar]
5. Leloup JC, Goldbeter A. A model for circadian rhythms in Drosophila incorporating the formation of a complex between the PER and TIM proteins. J Biol Rhythms. 1998;13:70–87. 10.1177/074873098128999934 [DOI] [PubMed] [Google Scholar]
6. Leloup JC, Goldbeter A. Toward a detailed computational model for the mammalian circadian clock. Proc Nati Acad Sci USA. 2003;100:7051–7056. 10.1073/pnas.1132112100 [DOI] [PMC free article] [PubMed] [Google Scholar]
7. Forger DB, Jewett ME, Kronauer RE. A simpler model of the human circadian pacemaker. J Biol Rhythms. 1999;14(6):533–538. 10.1177/074873099129000867 [DOI] [PubMed] [Google Scholar]
8. Kronauer RE, Forger DB, Jewett ME. Quantifying human circadian pacemaker response to brief, extended, and repeated light stimuli over the phototopic range. J Biol Rhythms. 1999;14(6):501–516. 10.1177/074873049901400609 [DOI] [PubMed] [Google Scholar]
9. Forger DB, Kronauer RE. Reconciling mathematical models of biological clocks by averaging on approximate manifolds. SIAM J Appl Math. 2002;62:1281–1296. 10.1137/S0036139900373587 [DOI] [Google Scholar]
10. Doyle FJ III, Gunawan R, Bagheri N, Mirsky H, To TL. Circadian rhythm: A natural, robust, multi-scale control system. Comput Chem Eng. 2006;30(10-12):1700–1711. 10.1016/j.compchemeng.2006.05.029 [DOI] [Google Scholar]
11. Abel JH, Chakrabarty A, Doyle FJ III. Controlling biological time: nonlinear model predictive control for populations of circadian oscillators. In: Tempo R, Yurkovich S, Misra P, editors. Emerging Applications of Control and Systems Theory. Lecture Notes in Control and Information Sciences. Springer; 2018. p. 123–138. 10.1007/978-3-319-67068-3_9 [DOI] [Google Scholar]
12.Mott C, Mollicone D, van Wollen M. Modifying the human circadian pacemaker using model based predictive control. In: Proc Am Control Conf. 2003; p. 453–458.
13. Bagheri N, Stelling J, Doyle FJ III. Circadian phase entrainment via nonlinear model predictive control. Int J Robust Nonlin. 2007;17(17):1555–1571. 10.1002/rnc.1209 [DOI] [Google Scholar]
14. Bagheri N, Stelling J, Doyle FJ. Circadian phase resetting via single and multiple control targets. PLoS Comput Biol. 2008;4(7):e1000104 10.1371/journal.pcbi.1000104 [DOI] [PMC free article] [PubMed] [Google Scholar]
15. Abel JH, Doyle FJ. A systems theoretic approach to analysis and control of mammalian circadian dynamics. Chem Eng Res Des. 2016;116:48–60. 10.1016/j.cherd.2016.09.033 [DOI] [PMC free article] [PubMed] [Google Scholar]
16. Forger DB. Biological clocks, rhythms, and oscillations: The theory of biological timekeeping. MIT Press; 2017. [PubMed] [Google Scholar]
17.Zhang JX, Wen JT, Julius AA. Optimal circadian rhythm control with light input for rapid entrainment and improved vigilance. In: Proc IEEE Conf Decis Control; 2012. p. 3007–3012.
18.Zhang JX, Wen JT, Julius AA. Optimal and feedback control for light-based circadian entrainment. In: Proc IEEE Conf Decis Control; 2013. p. 2677–2682.
19. Zhang JX, Qiao W, Wen JT, Julius AA. Light-based circadian rhythm control: Entrainment and optimization. Automatica. 2016;68:44–55. 10.1016/j.automatica.2016.01.052 [DOI] [Google Scholar]
20. Qiao W, Wen JT, Julius AA. Entrainment control of phase dynamics. IEEE Trans Autom Control. 2017;62(1):445–450. 10.1109/TAC.2016.2555885 [DOI] [Google Scholar]
21.Julius AA, Yin J, Wen JT. Time-optimal control for circadian entrainment for a model with circadian and sleep dynamics. In: Proc IEEE Conf Decis Control. Melbourne, Australia; 2017. p. 4709–4714.
22. Serkh K, Forger DB. Optimal schedules of light exposure for rapidly correcting circadian misalignment. PLOS Comput Biol. 2014;10(4):e1003523 10.1371/journal.pcbi.1003523 [DOI] [PMC free article] [PubMed] [Google Scholar]
23. Khalsa SBS, Jewett ME, Cajochen C, Czeisler CA. A phase response curve to single bright light pulses in human subjects. J Physiol. 2003;549:945–952. 10.1113/jphysiol.2003.040477 [DOI] [PMC free article] [PubMed] [Google Scholar]
24. Taylor SR, Gunawan R, Petzold LR, Doyle FJ III. Sensitivity measures for oscillating systems: application to mammalian circadian gene network. IEEE Trans Autom Control. 2008;53:177–188. 10.1109/TAC.2007.911364 [DOI] [PMC free article] [PubMed] [Google Scholar]
25. Moeck M, Yoon YJ. Green buildings and potential electric light energy savings. J Archit Eng. 2004;10(4):143–159. 10.1061/(ASCE)1076-0431(2004)10:4(143) [DOI] [Google Scholar]
26. Bryson AE, Ho YC. Applied optimal control. Taylor & Francis; 1975. [Google Scholar]
27. Liberzon D. Calculus of variations and optimal control theory. Princeton University Press; 2012. [Google Scholar]
28. Diekman CO, Bose A. Reentrainment of the circadian pacemaker during jet lag: East-west asymmetry and the effects of north-south travel. J Theor Biol. 2018;437:261–285. 10.1016/j.jtbi.2017.10.002 [DOI] [PubMed] [Google Scholar]
29. Jewett ME, Kronauer RE, Czeisler CA. Light-induced suppression of endogenous circadian amplitude in humans. Nature. 1991;350:59–62. 10.1038/350059a0 [DOI] [PubMed] [Google Scholar]
30.Yin J, Julius AA, Wen JT. Rapid circadian entrainment in models of circadian genes regulation. In: Proc IEEE Conf Decis Control. 2019.
31. Baehr EK, Revelle W, Eastman CI. Individual differences in the phase and amplitude of the human circadian temperature rhythm: with an emphasis on morningness-eveningness. J Sleep Res. 2000;9:117–127. 10.1046/j.1365-2869.2000.00196.x [DOI] [PubMed] [Google Scholar]
32. Kräuchi K, Wirz-Justice A. Circadian clues to sleep onset mechanisms. Neuropsychopharmacology. 2001;25:S92–S96. 10.1016/S0893-133X(01)00315-3 [DOI] [PubMed] [Google Scholar]
33.Pell R.’Earable’ sensor monitors core body temp; 2017. Available from: https://www.eenewseurope.com/news/earable-sensor-monitors-core-body-temp.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

All relevant data are within the paper and figures.

[pone.0225988.ref001] 1. Aschoff J. Circadian rhythms in man. Science. 1965;148(3676):1427–1432. 10.1126/science.148.3676.1427 [DOI] [PubMed] [Google Scholar]

[pone.0225988.ref002] 2. Refinetti R. Circadian Physiology. CRC Press; 2005. [Google Scholar]

[pone.0225988.ref003] 3. Rea MS, Bierman A, Figueiro MG, Bullough JD. A new approach to understanding the impact of circadian disruption on human health. J Circadian Rhythms. 2008;6(1):7 10.1186/1740-3391-6-7 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0225988.ref004] 4. Baggs JE, Price TS, DiTacchio L, Panda S, FitzGerald GA, Hogenesch JB. Network features of the mammalian circadian clock. PLoS Biol. 2009;7(3):563–575. 10.1371/journal.pbio.1000052 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0225988.ref005] 5. Leloup JC, Goldbeter A. A model for circadian rhythms in Drosophila incorporating the formation of a complex between the PER and TIM proteins. J Biol Rhythms. 1998;13:70–87. 10.1177/074873098128999934 [DOI] [PubMed] [Google Scholar]

[pone.0225988.ref006] 6. Leloup JC, Goldbeter A. Toward a detailed computational model for the mammalian circadian clock. Proc Nati Acad Sci USA. 2003;100:7051–7056. 10.1073/pnas.1132112100 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0225988.ref007] 7. Forger DB, Jewett ME, Kronauer RE. A simpler model of the human circadian pacemaker. J Biol Rhythms. 1999;14(6):533–538. 10.1177/074873099129000867 [DOI] [PubMed] [Google Scholar]

[pone.0225988.ref008] 8. Kronauer RE, Forger DB, Jewett ME. Quantifying human circadian pacemaker response to brief, extended, and repeated light stimuli over the phototopic range. J Biol Rhythms. 1999;14(6):501–516. 10.1177/074873049901400609 [DOI] [PubMed] [Google Scholar]

[pone.0225988.ref009] 9. Forger DB, Kronauer RE. Reconciling mathematical models of biological clocks by averaging on approximate manifolds. SIAM J Appl Math. 2002;62:1281–1296. 10.1137/S0036139900373587 [DOI] [Google Scholar]

[pone.0225988.ref010] 10. Doyle FJ III, Gunawan R, Bagheri N, Mirsky H, To TL. Circadian rhythm: A natural, robust, multi-scale control system. Comput Chem Eng. 2006;30(10-12):1700–1711. 10.1016/j.compchemeng.2006.05.029 [DOI] [Google Scholar]

[pone.0225988.ref011] 11. Abel JH, Chakrabarty A, Doyle FJ III. Controlling biological time: nonlinear model predictive control for populations of circadian oscillators. In: Tempo R, Yurkovich S, Misra P, editors. Emerging Applications of Control and Systems Theory. Lecture Notes in Control and Information Sciences. Springer; 2018. p. 123–138. 10.1007/978-3-319-67068-3_9 [DOI] [Google Scholar]

[pone.0225988.ref012] 12.Mott C, Mollicone D, van Wollen M. Modifying the human circadian pacemaker using model based predictive control. In: Proc Am Control Conf. 2003; p. 453–458.

[pone.0225988.ref013] 13. Bagheri N, Stelling J, Doyle FJ III. Circadian phase entrainment via nonlinear model predictive control. Int J Robust Nonlin. 2007;17(17):1555–1571. 10.1002/rnc.1209 [DOI] [Google Scholar]

[pone.0225988.ref014] 14. Bagheri N, Stelling J, Doyle FJ. Circadian phase resetting via single and multiple control targets. PLoS Comput Biol. 2008;4(7):e1000104 10.1371/journal.pcbi.1000104 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0225988.ref015] 15. Abel JH, Doyle FJ. A systems theoretic approach to analysis and control of mammalian circadian dynamics. Chem Eng Res Des. 2016;116:48–60. 10.1016/j.cherd.2016.09.033 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0225988.ref016] 16. Forger DB. Biological clocks, rhythms, and oscillations: The theory of biological timekeeping. MIT Press; 2017. [PubMed] [Google Scholar]

[pone.0225988.ref017] 17.Zhang JX, Wen JT, Julius AA. Optimal circadian rhythm control with light input for rapid entrainment and improved vigilance. In: Proc IEEE Conf Decis Control; 2012. p. 3007–3012.

[pone.0225988.ref018] 18.Zhang JX, Wen JT, Julius AA. Optimal and feedback control for light-based circadian entrainment. In: Proc IEEE Conf Decis Control; 2013. p. 2677–2682.

[pone.0225988.ref019] 19. Zhang JX, Qiao W, Wen JT, Julius AA. Light-based circadian rhythm control: Entrainment and optimization. Automatica. 2016;68:44–55. 10.1016/j.automatica.2016.01.052 [DOI] [Google Scholar]

[pone.0225988.ref020] 20. Qiao W, Wen JT, Julius AA. Entrainment control of phase dynamics. IEEE Trans Autom Control. 2017;62(1):445–450. 10.1109/TAC.2016.2555885 [DOI] [Google Scholar]

[pone.0225988.ref021] 21.Julius AA, Yin J, Wen JT. Time-optimal control for circadian entrainment for a model with circadian and sleep dynamics. In: Proc IEEE Conf Decis Control. Melbourne, Australia; 2017. p. 4709–4714.

[pone.0225988.ref022] 22. Serkh K, Forger DB. Optimal schedules of light exposure for rapidly correcting circadian misalignment. PLOS Comput Biol. 2014;10(4):e1003523 10.1371/journal.pcbi.1003523 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0225988.ref023] 23. Khalsa SBS, Jewett ME, Cajochen C, Czeisler CA. A phase response curve to single bright light pulses in human subjects. J Physiol. 2003;549:945–952. 10.1113/jphysiol.2003.040477 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0225988.ref024] 24. Taylor SR, Gunawan R, Petzold LR, Doyle FJ III. Sensitivity measures for oscillating systems: application to mammalian circadian gene network. IEEE Trans Autom Control. 2008;53:177–188. 10.1109/TAC.2007.911364 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0225988.ref025] 25. Moeck M, Yoon YJ. Green buildings and potential electric light energy savings. J Archit Eng. 2004;10(4):143–159. 10.1061/(ASCE)1076-0431(2004)10:4(143) [DOI] [Google Scholar]

[pone.0225988.ref026] 26. Bryson AE, Ho YC. Applied optimal control. Taylor & Francis; 1975. [Google Scholar]

[pone.0225988.ref027] 27. Liberzon D. Calculus of variations and optimal control theory. Princeton University Press; 2012. [Google Scholar]

[pone.0225988.ref028] 28. Diekman CO, Bose A. Reentrainment of the circadian pacemaker during jet lag: East-west asymmetry and the effects of north-south travel. J Theor Biol. 2018;437:261–285. 10.1016/j.jtbi.2017.10.002 [DOI] [PubMed] [Google Scholar]

[pone.0225988.ref029] 29. Jewett ME, Kronauer RE, Czeisler CA. Light-induced suppression of endogenous circadian amplitude in humans. Nature. 1991;350:59–62. 10.1038/350059a0 [DOI] [PubMed] [Google Scholar]

[pone.0225988.ref030] 30.Yin J, Julius AA, Wen JT. Rapid circadian entrainment in models of circadian genes regulation. In: Proc IEEE Conf Decis Control. 2019.

[pone.0225988.ref031] 31. Baehr EK, Revelle W, Eastman CI. Individual differences in the phase and amplitude of the human circadian temperature rhythm: with an emphasis on morningness-eveningness. J Sleep Res. 2000;9:117–127. 10.1046/j.1365-2869.2000.00196.x [DOI] [PubMed] [Google Scholar]

[pone.0225988.ref032] 32. Kräuchi K, Wirz-Justice A. Circadian clues to sleep onset mechanisms. Neuropsychopharmacology. 2001;25:S92–S96. 10.1016/S0893-133X(01)00315-3 [DOI] [PubMed] [Google Scholar]

[pone.0225988.ref033] 33.Pell R.’Earable’ sensor monitors core body temp; 2017. Available from: https://www.eenewseurope.com/news/earable-sensor-monitors-core-body-temp.

PERMALINK

Time optimal entrainment control for circadian rhythm

A Agung Julius

Jiawei Yin

John T Wen

Roles

Abstract

1 Introduction

2 Problem formulation

Table 1. Model parameters.

PROBLEM (TOE)

3 Model simplification

3.1 Second-order model

PROBLEM (TOE-2nd Order)

3.2 First-order model

Fig 1. Phase response curve generated from the 2nd-order model.

PROBLEM (TOE-1st order)

3.3 Behaviors under periodic light inputs

Fig 2.

4 Solution strategies for the time optimal entrainment problems

4.1 Direct shooting algorithm (DSA)

Fig 3. Solutions of TOE-1st order for the 12-hour jetlag case for the circadian lighting levels of 100 lux (top), 1000 lux (umax = 0.1731) (middle) and 10000 lux (bottom), obtained using the DSA.

Fig 4. Solution of TOE-2nd Order for 12 hours jetlag case with circadian lighting level of 10000 lux, obtained using DSA.

Fig 5. Numerical integrations of the co-state dynamics (38) in TOE-2nd Order.

4.2 Gradient descent algorithm (GDA)

Fig 6. Locally optimal solutions of TOE-2nd Order for the 12-hour jet-lag case for the circadian lighting levels of 100 lux (umax = 0.1028) (top row), 1000 lux (umax = 0.1731) (middle row) and 10000 lux (umax = 0.2208) (bottom row), obtained using the GDA.

Fig 7. Locally optimal solutions of TOE-full order for the 12 hours jet-lag case for the circadian lighting levels of 100 lux (top row), 1000 lux (middle row) and 10000 lux (bottom row), obtained using GDA.

Fig 8.

Fig 9. The optimization objective (i.e., entrainment time) improves over the iterations of the GDA for both of the solutions shown in Fig 8.

4.3 Combining direct shooting and gradient descent algotihms

Fig 10. Comparison of the entrainment costs for the TOE (full order) problem for various jet-lag cases with the levels of circadian light intensity of 100 lux (top), 1000 lux (middle), and 10000 lux (bottom).

Fig 11. Entrainment time by the natural light-dark cycle in (7) with Iw,max = 100 lux.

5 The impacts of model reduction on the TOE solutions

Fig 12. Entrainment time when the optimal input derived using a lower-order model is applied to the higher-order models.

6 Feedback implementation of the TOE solutions

6.1 Feedback implementation of the TOE solutions—First order case

Fig 13. Entrainment time when the optimal input derived using the first-order model is applied to the full order models.

6.2 Feedback implementation of the TOE solutions—Full order case

Fig 14. Optimal feedback control laws learned from the optimal feedforward solutions data for three levels of circadian light intensity, 100 lux (left), 1000 lux(center), and 10000 lux (right).

Cross-validation with new jetlag cases

Fig 15. Performance of the feedback optimal control compared to that of the open-loop optimal solutions for jet-lag cases with Tlag∈{12,112,⋯,2312} for three levels of circadian light intensity.

Robustness w.r.t. n(0)

Fig 16. Performance of the feedback optimal control for the 12-hour jet-lag case with three levels of circadian light intensity and various initial conditions for n(0).

Robustness w.r.t. change in Imax

Fig 17. Performance of the optimal feedback controllers trained using data from the cases of circadian light intensities of 1000 and 10000 lux (red and blue curves, respectively) applied to the case of circadian light intensity of 5000 lux.

7 Discussion and conclusions

Comparison of our method and the method in [22]

On the relevance of lower-order circadian rhythm models

Impacts of model reduction on the TOE solutions

Potential impacts of amplitude suppression in circadian oscillator

Robustness of the optimal feedback controller

Acknowledgments

Data Availability

Funding Statement

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Fig 3. Solutions of TOE-1st order for the 12-hour jetlag case for the circadian lighting levels of 100 lux (top), 1000 lux (u_max = 0.1731) (middle) and 10000 lux (bottom), obtained using the DSA.

Fig 6. Locally optimal solutions of TOE-2nd Order for the 12-hour jet-lag case for the circadian lighting levels of 100 lux (u_max = 0.1028) (top row), 1000 lux (u_max = 0.1731) (middle row) and 10000 lux (u_max = 0.2208) (bottom row), obtained using the GDA.

Fig 11. Entrainment time by the natural light-dark cycle in (7) with I_w,max = 100 lux.

Fig 15. Performance of the feedback optimal control compared to that of the open-loop optimal solutions for jet-lag cases with $T_{l a g} \in {\frac{1}{2}, 1 \frac{1}{2}, \dots, 23 \frac{1}{2}}$ for three levels of circadian light intensity.

Robustness w.r.t. change in I_max