A contraction approach to dynamic optimization problems

Leif K Sandal; Sturla F Kvamsdal; José M Maroto; Manuel Morán

doi:10.1371/journal.pone.0260257

. 2021 Nov 22;16(11):e0260257. doi: 10.1371/journal.pone.0260257

A contraction approach to dynamic optimization problems

Leif K Sandal ¹, Sturla F Kvamsdal ^2,^*, José M Maroto ^3,⁴, Manuel Morán ^4,⁵

Editor: Mohd Nadhir Ab Wahab⁶

PMCID: PMC8608347 PMID: 34807942

Abstract

An infinite-horizon, multidimensional optimization problem with arbitrary yet finite periodicity in discrete time is considered. The problem can be posed as a set of coupled equations. It is shown that the problem is a special case of a more general class of contraction problems that have unique solutions. Solutions are obtained by considering a vector-valued value function and by using an iterative process. Special cases of the general class of contraction problems include the classical Bellman problem and its stochastic formulations. Thus, our approach can be viewed as an extension of the Bellman problem to the special case of nonautonomy that periodicity represents, and our approach thereby facilitates consistent and rigorous treatment of, for example, seasonality in discrete, dynamic optimization, and furthermore, certain types of dynamic games. The contraction approach is illustrated in simple examples. In the main example, which is an infinite-horizon resource management problem with a periodic price, it is found that the optimal exploitation level differs between high and low price time intervals and that the solution time paths approach a limit cycle.

1. Introduction

Periodicity is an important characteristic of many systems that are subject to control. A rigorous treatment of periodicity in optimization problems is nontrivial because periodicity is a special case of nonautonomy [1]. Nonautonomy typically renders many optimal control problems difficult and costly to deal with, or even intractable. Thus, periodicity in applied work is often abstracted from altogether, or treated by considering the aggregate or mean forcing. For example, in many natural resource management models where natural growth is described as an annual process—modeled by a growth operator that is applied once per year—and where environmental conditions are known to have significant seasonal variations, an implicit assumption is that seasonal effects are aggregated up or averaged out [2]. But as shown in various research, modeling seasonal or periodic effects properly can lead to surprising and operationally important results [1, 3, 4].

To the best of our knowledge, periodicity in infinite-horizon optimal control problems in discrete time has not been treated formally in the literature. A large class of periodic problems is a special case of a general class that can be shown to be fix-point problems for a family of contraction operators. The contraction operator can be used to obtain the solution in an iterative procedure. The class of problems considered in this study includes the classical Bellman problem, the periodic problem formulations of initial interest [1], stochastic problems, and other, more esoteric formulations. Our key contribution is nevertheless an extension of the classical Bellman result in the special case of nonautonomy that periodicity represents.

Let us first clarify the term periodicity. By periodicity, or periodic characteristics or features of optimization problems, we presently mean conditions or structures that repeat themselves at given time intervals. These recurring conditions or structures are represented by an objective function or in the description of how the state develops over time. Perhaps the simplest illustration of such periodicity is the seasonal variation inherent in many natural systems, where growth varies over a year but where the same growth conditions arise repeatedly. But note that periodic features are not necessarily described by some trigonometric relationship, and neither are solutions periodic in any other sense than that the decision rule is the same when the same conditions arise. That is, the structure of the problem repeats itself at given time intervals. The classical discrete, infinite-time dynamic programming problem in economics has discounting as the only nonautonomic feature; that is, time enters explicitly only through the discounting of the objective function. This problem represents a problem with period 1 in our setting, which is reflected by the fact that the running value function does not explicitly depend on time.

To motivate our study of periodicity in decision problems, let us briefly mention some applied examples. These examples include demand systems subject to supply control. In particular, annual, seasonal, weekly, or daily cycles in demand are well-known for electricity [5] and energy in general, and a broad range of consumer goods have seasonal fluctuations in demand. For example, McClain and Thomas [6] considered production planning under seasonal demand, whereas Bradley and Arntzen [7] further considered inventory policy and capacity constraints. More recently, Nagaraja et al. [8] provided a brief review of the theory related to seasonal demand problems and applied it to the bullwhip effect in supply chains. There is also an extensive literature on cyclical pricing policies discussed by Besbes and Lobel [9]. Other dynamic decision problems with periodic features are found in transport and logistics systems subject to routing control (Liebchen [10] discussed the use of optimization in the periodic event-scheduling problem) or natural systems subject to management control. For example, renewable natural resources may exhibit periodicity in growth or other natural processes, as well as in prices and costs [11, 12]. In particular, Ni and Sandal [4] studied a multiseason, multistate bioeconomic model. Kvamsdal et al. [1] provided a generic treatment of periodicity in resource management problems, which can be considered a special case of our results below.

To illustrate our approach to periodic problems, we apply our derived numerical scheme to a stylized decision problem with periodicity in the objective function. This example demonstrates the feasibility of our approach and suggests significant, practical implications of explicitly accounting for periodicity. In particular, the solution of the periodic problem exhibits features that are not typically present in problems with no periodicity. We also present a solution to a simple dynamic game to further illustrate the generality of our approach.

Given the prevalence of periodic characteristics of many systems subject to control, we believe that our contribution is important and highly valuable. We show that the classical Bellman problem approach can be extended to periodic problems and that this extension is, while nontrivial, both conceptually and numerically feasible and practical. Ultimately, a broader class of problems can be treated using our approach. However, both the Bellman and periodic problems are directly applicable to real-world decision problems, and thus, we maintain our focus on these formulations. Furthermore, as the periodic problem is the motivation behind considering the problems we target in our most general result, we begin our analysis by showing how the general problem formulation suggests itself from the periodic problem setup.

The remainder of our paper is organized as follows. In the next section, we first set up a standard infinite-horizon, discrete-time optimization problem. We then generalize the problem formulation to allow for periodic variation in the problem structure and establish a set of equations that govern the problem solution. The set of equations is a special case of a general class of problems that we show are contraction problems whose solutions can be obtained through iterations. In the following section, we offer two illustrative examples: A periodic optimization problem and a dynamic game. In the dynamic game example, the notation is interpreted differently, but the equations governing the problem solution belong to the general class that we consider. In the final section, we summarize and discuss potential applications of our generalized problem formulation.

2. A contraction operator for the periodic problem

Dynamic decision problems under various periodic variations are our primary problem type of interest in this paper, and we begin our analysis by showing how the general problem formulation suggests itself from the periodic problem setup. We arrive at the periodic problem setup by allowing for periodic variations in a standard, infinite-horizon, discrete-time optimization problem.

A deterministic, infinite-horizon, autonomous, discounted, discrete-time optimization problem considers the following:

\max_{{u_{k}}_{k = 0}^{\infty}} \sum_{k = 0}^{\infty} β^{k + 1} \cdot Π (x_{k}, u_{k})

(1)

such that x_k+1 = F(x_k, u_k), u_k∈U(x_k), k = 0, 1, 2,…, and x₀∈X given. For the discount factor β, we have 0<β<1. X⊂ℝⁿ is a feasible state space, and x_k∈X is an n-dimensional dynamic state variable at the beginning of time interval k. Here, we use the term “interval” rather than “period,” and reserve the latter to denominate the periodic length characteristic (denoted T, see below). U: X→ℝ^p is a nonempty and compact valued correspondence that specifies the admissible p-dimensional controls u_k in state x_k. That is, u_k is the decision or control variable that must be decided for each instant of the infinite time sequence {t₀, t₁, t₂.…}. Π: X×U→ℝ is bounded and continuous and gives the performance measure (return) at the end of each interval. F: X×U→X is a continuous operator that governs the state variable such that x_k+1 = y_k is the state at the beginning of interval k = 1. Under these conditions, optimal controls ${u_{k}^{*}}_{k = 0}^{\infty}$ and corresponding paths ${x_{k}^{*}}_{k = 0}^{\infty}$ exist, as does the value function $V (x) = \sum_{k = 0}^{\infty} β^{k + 1} \cdot Π (x_{k}^{*}, u_{k}^{*})$ with $x_{0}^{*} = x$ . The value function is the unique fix-point of the Bellman operator $T_{B}$ , which is defined on the space of real, bounded, and continuous functions on X, denoted BC(X), and given by

T_{B} V (x) = \max_{u \in U (x)} {β \cdot Π (x, u) + β \cdot V (y)}

(2)

with V∈BC(X) and y = F(x, u). Using the operator defined in Eq (2), the Bellman equation for the problem in Eq (1) can be written simply as

V (x) = T_{B} V (x)

(3)

See Bertsekas [13] for a more general treatment of the type of problems presented in Eq (1).

We now consider a nonautonomous but periodic problem where Π_k(x, u) is the return function and F_k(x, u) is the time evolution operator for interval k. That is, the return function and the time evolution operator may vary between intervals. Sets for the feasible states (X_k⊆X) and admissible controls (U_k) may also vary between intervals. The control set may vary with the state such that U_k = U_k(x_k), but we typically omit the state argument. The problem is periodic in the sense that for a finite integer N≥1 and for all k∈ℕ, we have Π_k = Π_k+N, F_k = F_k+N, X_k = X_k+N, and U_k = U_k+N. We say that the problem is periodic with period N, where N is the smallest integer satisfying these equalities, and that the performance or return measure and the dynamic constraint functionally repeat themselves. Each period comprises k intervals. The classical outset yielding Eq (1) is then a problem with period 1.

Without adding significant complexity, we can allow for varying interval lengths. Thus, each interval has potentially different discount factor values. We write the length of interval k as T_k = t_k−t_k−1 and its discount factor as β_k. Periodicity implies T_k = T_k+N and β_k = β_k+N. Then, the length of the cycle of N intervals can be expressed as

T = \sum_{i = 1}^{N} T_{i} = t_{N} - t_{0}

(4)

The discount factor for the cycle of N intervals is $β = \prod_{i = 1}^{N} β_{i}$ . Here, $\prod_{i = 1}^{N} (\cdot)$ is the usual product operator, unrelated to the objective function elsewhere denoted by Π_k(⋅). Fig 1 accounts for interval index references.

Fig 1 — Return (Π_k) is yielded at the end of interval k, whereas the value V_k refers to the beginning of interval k.

Although a real discounted problem cannot have a periodic present value, the running value will be periodic under time discounting if β_k, involved operators (Π_k, F_k), or spaces (U_k, X_k) are periodic, as described above. As suggested above, a periodic feature repeats itself with some inherent period. If a problem includes several periodic features, the problem period N will be the least common multiple of the potentially different inherent periods of the different features.

The periodic problem intuitively suggests a set of N nested equations (see S1 Appendix):

\begin{array}{l} V_{k} (x) = \max_{u_{k} \in U_{k} (x)} {β_{k} Π_{k} (x, u_{k}) + β_{k} V_{k + 1} (x')}, k = 1, \dots, N - 1 \\ V_{N} (x) = \max_{u_{N} \in U_{N} (x)} {β_{N} Π_{N} (x, u_{N}) + β_{N} V_{1} (x')} \end{array}

(5)

In equation set (5), x′ = F_k(x, u_k) is a shorthand notation for the state variable one interval ahead. If V_k is interpreted as the value function for interval k, the equation set (5) follows from value additivity with its inherent economic logic that the present value is what you earn presently plus the discounted value of future earnings. “Earn” is not necessarily meant in its strict, monetary sense, but can be any type of utility-like flow.

In what follows, we first define generalized operators for deterministic and two stochastic formulations of optimization problems, of which the periodic problems discussed above are special cases. We then present a theorem that holds for all generalized formulations. Finally, we present a corollary that applies our theorem to the periodic problem in equation set (5).

First, we consider functional equations of the type

V (x) = T V (x)

(6)

where $V (x)$ is an N-dimensional bounded vector function in BC(X) (that is, the components are bounded and continuous), and further, x∈X⊂ℝⁿ. The components of the operator $T$ are defined as

T_{k} V (x) ≜ \max_{u \in Γ_{k} (x)} {{\hat{Π}}_{k} (x, u) + β_{k} L_{k} V (x^{'})}, k = 1, \dots, N

(7)

In (7), $L_{k}$ are Lipschitz continuous with Lipschitz constants $γ_{k}, {\hat{Π}}_{k} (x, u)$ are bounded functions, and the correspondence Γ_k(x) specifies admissible sets. ${\hat{Π}}_{k} (\cdot)$ can take on two forms depending on the timing of the return. If returns are realized at the end of each interval, as in equation set (5), they are discounted and we have ${\hat{Π}}_{k} (x, u) = β_{k} Π_{k} (x, u)$ . In contrast, if returns are realized at the beginning of each interval, they are not discounted and we have ${\hat{Π}}_{k} (x, u) = Π_{k} (x, u)$ . We have β_k∈(0,1). As we will argue below, equation set (5) is a special case of (6) when $T$ is defined by (7).

Furthermore, the definition in Eq (7) is a special case of two different stochastic formulations. Let z∈Z⊆ℝ^q be a real-valued, q-dimensional vector of stochastic elements that are realizations of a known, stochastic process (that is, the probability transition function is known and the expectation operator over z, denoted E_z, is well-defined; see Stokey et al. [14], p. 241). The stochastic elements can be present in both the return functions and the operators governing the state variables that are considered to be Markov decision processes. We thus write Π_k(x, u, z) and F_k(x, u, z), both of which are measurable. If both present and future realizations of the stochastic process are uncertain, we consider the following definition of $T$ :

T_{k} V (x) ≜ \max_{u \in Γ_{k} (x)} E_{z} {{\hat{Π}}_{k} (x, u, z) + β_{k} L_{k} V (x')}, k \in {1, \dots, N}

(8)

The definition in Eq (8) aligns with the typical formulation in Bertsekas [13].

Other problem formulations, however, consider the present realization of the stochastic process as known. Such formulations require the stochastic elements to be considered as part of the state vector. We consider s = (x, z) as an extension of the state variable and consider the following definition of $T$ :

T_{k} V (s) ≜ \max_{u \in Γ_{k} (s)} {{\hat{Π}}_{k} (s, u) + β_{k} E_{z'} L_{k} V (s')}, k \in {1, \dots, N}

(9)

In (9), s′ is a shorthand for the extended state one interval ahead, and E_z′ is the expectancy over possible realizations of the stochastic elements in the next interval, z′. The definition in (9) aligns with the typical formulation in Stokey et al. [14]. By inspection, we see that (7), the deterministic case, is a special case of both (8) and (9). The following theorem holds for all these potential definitions of $T$ , that is, (7)–(9).

Theorem: $T$ is a contraction operator on bounded vector functions if

η ≜ \max {β_{k} γ_{k} | k = 1, \dots, N} < 1

(10)

Proof: Let $V (x)$ and $W (x)$ be arbitrary elements in B(X), which is the space of bounded vector functions over X, and let ‖⋅‖ denote the sup-norm. If we write $L_{k}^{'} = β_{k} E_{z'} L_{k}$ , we have, for component k,

\begin{array}{l} T_{k} V = T_{k} (W + V - W) \\ = \max_{u \in Γ_{k} (s)} {{\hat{Π}}_{k} (s, u) + L_{k}^{'} W (s^{'}) + L_{k}^{'} V (s^{'}) - L_{k}^{'} W (s^{'})} \\ \leq T_{k} W + | | L_{k}^{'} V - L_{k}^{'} W | | \\ \leq T_{k} W + β_{k} | | L_{k} V - L_{k} W | | \\ \leq T_{k} W + β_{k} γ_{k} | | V - W | | \end{array}

(11)

The first inequality follows from the properties of the sup-norm, the second inequality follows from the expectancy operator having a Lipschitz constant of one, and the final inequality follows from the properties of the Lipschitz operator $L_{k}$ . From Eq (11), we have

T_{k} V - T_{k} W \leq β_{k} γ_{k} | | V - W | |

We can revert the roles of $V$ and $W$ in Eq (11) to obtain

T_{k} W - T_{k} V \leq β_{k} γ_{k} ‖ V - W ‖

We can thus conclude that

| T_{k} V - T_{k} W | \leq β_{k} γ_{k} | | V - W | |

(12)

Inequality (12) holds for all values of k, and we have

| | T V - T W | | \leq η | | V - W | |

(13)

where η≜max{β_kγ_k|k = 1,…,N}. That is, $T$ is a contraction operator if η<1. Q.E.D.

If $T$ is operating on continuous functions on a compact state space, then it has a unique fix-point. Because $T$ is a contraction, the fix-point can be obtained by iterations.

For our result to apply to a periodic problem, it remains to be shown that the equation set (5) is a special case of (6) and that the requirement on η holds. By definition, the left-hand sides of (5) and (6) are identical. We thus need to show that the right-hand side in (5), for all values of k, is a special case of (7), which defines the right-hand side of (6). Because we have proved that the stochastic formulations in (8) and (9) are also contractions, our result also applies to the stochastic analogous extensions of equation set (5). We summarize this result in the following corollary:

Corollary: The periodic optimization problem in equation set (5) and analog stochastic problems are contraction problems having unique solutions, that is, the value functions.

Proof: The operator defined by $L_{k} V ≜ V_{i}$ for all values of k, with i = k+1 for k∈(1,…,N−1) and i = 1 for k = N, is a Lipschitz operator with Lipschitz constant γ_k = 1. That is, (5) is a special case of (6). The parameter β_k in (5) is a discount factor, and for all values of k we have β_k<1. Thus, η<1, and the corollary follows from the theorem. Q.E.D.

The proof of the corollary can be readily modified to show that the classical Bellman problem (that is, set i = k for all k in the proof) is also a special case of (6), as is any choice for i∈{1,…,N}. Furthermore, there exists a large set of Lipschitz continuous operators that fulfill the requirements of the theorem, and there are many potential applications of (6).

Note the use of the sup-norm in the theorem above. It represents a type of worst-case scenario regarding convergence (that is, no single point in state space can “hang out”). Thus, in many applications, we expect convergence to be faster than that implied by η.

A varying interval length requires suitable adaptions of Π_k, F_k, X, and Y_k (or the comparable stochastic elements). If discounting is uniform in time and interval k represents a share δ_k of the N-cycle, such that t_k−t_k−1 = δ_k⋅(t_N−t₀), we have $β_{k} = β^{δ_{k}}$ . In many applied settings, the N-cycle represents a year, and β, the discount factor over N intervals, is then the annual discount factor. The extension to varying interval length is important, not least because it allows for reductions in dimensionality. For example, consider a problem that is formulated on an annual level, but where one month is different such that the problem is periodic. Without the option of varying the interval length, the model will require N = 12 to rigorously capture the periodic feature. With varying interval length, N = 2 suffices.

3. Examples of applications

In this section, we provide two examples that illustrate the use of our method and the proposed numerical scheme. We first return to the problem that led to the above developments: a dynamic, infinite-horizon, discounted discrete-time optimization problem with a periodic feature. In addition to demonstrating our method, the first example also shows the relevance and potential importance of considering periodic features in operational decision problems. The example has two intervals with different prices. In the second example, we consider a simple, dynamic game where two agents with different production parameters supply a good to a common market. This example can be solved exactly and we illustrate that the iterative solution converges to the exact solution. It also illustrates a different type of application than that in the first example.

3.1 Periodic price in a resource model

We consider a management model for a renewable capital stock with periodic variations in the price parameter. The question becomes how to shift resource extraction toward the high-price intervals to maximize the net present value of returns. We compare the optimal periodic solution with solutions for models where the price is assumed constant and nonperiodic. The equation governing the stock evolution is

x_{k + 1} = F (x_{k}) - u_{k}

(14)

where x_k is the capital level at the beginning of interval k, and u_k is the level of exploitation. Natural growth, represented by the function F(x_k) = δx_k/(1−x_k(1−δ)/ω), with δ = 4 and ω = 1, is a version of the Beverton–Holt growth function. δ is the growth rate and ω is the saturation point where F(x_k) =x_k. Beyond the saturation point (x_k>ω), the natural surplus growth (F(x_k)−x_k) is negative. The growth function is identical in all intervals and thus carries no interval subscript. The return function for interval k is a constant relative risk aversion utility function:

Π_{k} (x_{k}, u_{k}) = p_{k} \frac{u_{k}^{1 - γ}}{1 - γ}

(15)

The parameter p_k is an interval-specific price, and γ measures the degree of relative risk aversion. The example has two intervals with different prices, where p₁ = 1.0 and p₂ = 0.2. Furthermore, γ = 1/2. We consider the capital left for future growth, y_k = x_k+1, as our decision variable. That is, u_k is eliminated with u_k = F(x_k)−y_k. With the price entering linearly as in Eq (15), the decision is independent of price in the nonperiodic problem (that is, if p₁ = p₂, the price parameter can be factored out of the decision problem). Thus, the periodic problem poses a more complex problem than that posed by the “associated nonperiodic problems.”

We solve equation set (5) numerically, subject to Eqs (14) and (15), and derive periodic optimal feedback decision rules as functions of the capital level at the beginning of each interval: y₁(x) and y₂(x). Fig 2 reports these decision rules together with the replacement curve (the 45-degree line, y = x). If the curves of the decision rules are below the replacement curve, the capital level is effectively reduced in the given interval and for the given capital level. The replacement curve also serves as the identity map used to transfer the state between subsequent periods (y_k = x_k+1). For comparison, the figure also reports the solutions to the two nonperiodic problems (with p₁ = p₂, one case with prices equal to 1.0 and the other with prices equal to 0.2). Because the price enters linearly in the objective function, the solutions to the nonperiodic problems are, as mentioned above, independent of the price and thus identical. Finally, the figure reports a sample time path for the stock level under the periodic solution, tracing out the stock level from interval to interval.

Fig 2 — Decision rules for the periodic problem (blue solid curve y₁, red solid curve y₂), nonperiodic problems with prices 1.0 (blue dashed curve) and 0.2 (red dashed curve), replacement curve (black dashed curve), and sample path with initial x₀ = 0.1 (thin black path).

The decision rules for the periodic problem show significant and considerable discrepancies between intervals. In intervals with a high price (p₁), relatively small levels of capital are left for future growth (see the blue curve in Fig 2). Contrarily, in intervals with a low price (p₂), relatively high levels of capital are left for the future (see the red curve in Fig 2). Again, when the price is nonperiodic, the decision rule is invariant to the price level and, here, lies between the periodic decision rules (see the dashed red and blue curves in Fig 2). The sample time path starts at a chosen initial stock level (x₀ = 0.1), moves to the stock level in the first interval on the identity map via the first interval decision rule (x₁ = y₁(x₀) = 0.138), then moves to the stock level in the second interval via y₂ (x₂ = y₂(x₁) = 0.365), then moves to the stock level in the third interval via y₁ again (x₃ = y₁(x₂) = 0.199), and so on. The sample time path shows that the system under the periodic solution moves toward a stable two-period limit cycle and exhibits a more complex behavior than what can be discerned from the two nonperiodic problems that are associated with the periodic problem. The stock values of the limit cycle are approximately x_k = 0.208 at interval 1 times (high price) and x_k = 0.463 at interval 2 times (low price). In the associated nonperiodic problems with both the high and the low price, the stock level approaches x_k = 0.323.

This example suggests that taking periodicity into account has significant practical implications. A more elaborate case was studied by Ni and Sandal [4], who examined a commercial fishery management problem involving a seasonal and regional separation of the spawning stock from the remaining stock. The problem is a two-state case with two intervals of 3 and 9 months. They implemented a special case of the present approach and were the first ones to demonstrate that a no-harvest region and a seasonal closure can develop naturally as a consequence of a first-best feedback policy. Another example was examined in Kvamsdal et al. [1], which demonstrated the potential pitfalls of a heuristic approximation of a periodic feature in a dynamic decision problem.

3.2 A dynamic game of supply

In our second example, we illustrate a different type of application than that considered in the first example. We consider a simple dynamic game where two agents with different cost parameters supply a good to a common market. In this market, the agents face a price that they both influence through their supply. The agents thus play a game where the optimal supply of each agent depends on the supply of the opposite agent. The example is designed such that it can be solved exactly for a given set of parameter values. We use this feature to compare the iterative solution—based on our theorem—with the exact solution.

The index k has a different role in this example than in the periodic problem formulation (Fig 1). In the periodic problem, k refers to time intervals; however, in this dynamic game problem, k refers to agents. As we will see, the equations governing the solution of the dynamic game have the same structure as that of the equations governing the solutions of periodic problems, and thus, our approach is applicable.

Each agent k (k = 1,2) owns a capital stock x_k, which has the following dynamics:

x_{k, t + 1} = a + b x_{k, t} - u_{k}

(16)

The production u_k is supplied to the market where it obtains the price 1−∑_iu_i, i = 1,2. That is, the common price depends on the total supply to the market. The agents have private costs C_k(x_k) = c_k+d_kx_k. Then, the objective function of agent k is revenues minus costs, Π_k(x_k, u) = (1−∑_iu_i)u_k−C_k(x_k). Agent k solves the following problem:

V_{k} (x) = \max_{u_{k} \leq a + bx} {Π_{k} (x, u) + β_{k} V_{k} (x')}, x^{'} = a + b x - u_{k}, k \in {1,2}

(17)

That is, each agent maximizes the net present value of present returns, Π_k(x, u), where the price they obtain depends on the total market supply, plus the discounted future value of their capital stock. Eq (17) defines an equation set that can be solved using our iterative approach.

The significance of this example is that we can obtain a formal solution that yields an exact solution for certain parameter values. Thus, we can confirm that our contraction approach yields the correct solution. To obtain the formal solution, we make an educated guess that the value functions take the quadratic form

V_{k} (x) = θ_{1, k} x_{1}^{2} + θ_{2, k} x_{1} x_{2} + θ_{3, k} x_{2}^{2} + θ_{4, k} x_{1} + θ_{5, k} x_{2} + θ_{6, k}

(18)

where θ_i,k are coefficients to be determined. Value functions of the second order is reasonable because the involved expressions (objective functions and dynamic constraints) are at most of the second order.

The first-order conditions can be solved to derive expressions for the decision rules of the agents, yielding a solvable set of equations for the coefficients θ_i,k. That is, we take the derivative of the argument of the maximum operator in (17), substitute in Eqs (16) and (18), and put the resulting expressions equal to zero. These equations provide a solvable set of equations for the coefficients in Eq (18). The expression for V_k(x) in (18) has six unknown coefficients for each of the two values of k, and we can derive 12 equations that define the solution. The first-order conditions also provide expressions for the decision rules u_k defined in terms of the coefficients θ_i,k. The resulting algebra is of limited interest; the interested reader can, in the Supplemental material, find pseudocode to solve the problem in Maple (or the algebraic solver of choice). Below, we only report the exact solution and convergence measures for the iterative solution.

For a numeric illustration, we choose the following set of parameters:

[a, b, c_{1}, c_{2}, d_{1}, d_{2}, β_{1}, β_{2}] = [0.5, 0.5, 0.4, 0.3, 0.2, 0.1, 0.95, 0.8]

For these parameters, we obtain the following exact solution (reporting only the first two nonzero digits):

\begin{array}{l} V_{1} (x) = 0.019 x_{1}^{2} - 0.0095 x_{1} x_{2} + 0.0012 x_{2}^{2} + 0.072 x_{1} - 0.018 x_{2} + 1.0 \\ V_{2} (x) = 0.0041 x_{1}^{2} - 0.0090 x_{1} x_{2} + {0.0050 x}_{2}^{2} - 0.046 x_{1} + 0.056 x_{2} + 0.39 \end{array}

(19)

The coefficients in Eq (19) show that the value function for each agent increases with their private capital stock and decreases with the capital stock of the opposite agent. The corresponding optimal decision variables are defined in terms of the coefficients in equation set (19) and are given as the following feedback formulas:

\begin{array}{l} u_{1} (x) = 0.12 x_{1} - 0.030 x_{2} + 0.13 \\ u_{2} (x) = - 0.061 x_{1} + 0.064 x_{2} + 0.26 \end{array}

(20)

That is, the optimal supply of each agent increases with their own capital stock and decreases with that of the other agent. The two supply functions in (20) are illustrated in Fig 3. The numeric approximation obtained from the contraction scheme closely matches the solutions in equation sets (19) and (20). To obtain the contraction solution, we use a rough uniform grid (50×50 on the unit square, [0,1]²) and apply five main policy iterations, each followed by 1,000 value iterations. The largest numeric deviations from the exact solutions are as follows:

V_{1} : 0.0029, V_{2} : 0.00029, u_{1} : 0.00032, u_{2} : 0.00038

These deviations are small with regard to the grid size, and we conclude that the iterative solution has converged to the exact solution.

Fig 3 — Optimal supply for agent 1 (blue surface) and agent 2 (red surface) in the dynamic game.

4. Conclusions

We arrived at the above theorem while working on periodic optimization problems. The major innovation that facilitated our insights was the consideration of a vector function rather than a scalar value function. The use of a vector function and our theorem above may be useful in applications other than periodic optimization problems. In what follows, we will discuss some potential applications and how problems can be formulated for our method to apply. We presume here that (8) is a suitable definition of the contraction operator, but depending on the application, the definitions in (7) or (9) may be better suited.

An application closely related to periodic optimization problems is finite-time optimization problems. These problems are typically solved by backward induction, but such solutions may be cumbersome to conciliate with given initial values. In contrast, our approach can be directly applied, where each interval is represented by an element in the vector function. Any form of nonautonomy can be accommodated (as with backward induction). Thus, for interval k, we have

V_{k} (x) = \max_{u \in Γ_{k} (x)} E_{z} {{\hat{Π}}_{k} (x, u, z) + β_{k} V_{k + 1} (x')}, k \in {1, \dots, N}

(21)

With V_N+1(x) = G(x) representing the salvage value, the system of equations can be interpreted as a finite-time optimization problem with N time intervals. Our corollary applies and shows that we have a contraction problem that, in general, can be solved. Solutions are on a general feedback form that is readily conciliated with a given initial value.

As one of our examples above demonstrates, some game theory problems can also be addressed using our methodology. We consider dynamic games over infinite time with non-cooperative (self-serving) behavior of N agents, but where the decisions of other agents influence the return of each individual agent. Many common-pool resource games [15] fall within this type of games. For agent k, the problem is to maximize over one’s own decisions while taking account of the decisions of others on both the current and future returns. Furthermore, the decisions may depend on, or be restricted by, a state vector x. Elements in the state vector may be common or private goods. The problem can be formulated as follows:

V_{k} (x) = \max_{u_{k} \in Γ_{k} (x)} E_{z} {{\hat{Π}}_{k} (x, u_{k}, u_{- k}, z) + β_{k} V_{k} (x')}, k \in {1, \dots, N}

(22)

The notation ${\hat{Π}}_{k} (x, u_{k}, u_{- k}, z)$ explicitly indicates that the return for agent k depends on the agent’s own decisions (u_k) and those of all other agents (u_−k). The fact that the return function depends on the entire vector of decision variables ([u₁,…,u_N]) necessitates the consideration of a vector function ([V₁,…,V_N]). The proof of the corollary can be modified (with i = k) to show that this definition of the vector function can have a unique feedback solution. It relies on the specificities of the game and whether they imply the properties required for the various sets involved.

While known methods are applicable to decisions under uncertainty over future states, some decisions under risk of regime changes can be addressed using our approach. We think of regime changes as imposing significant changes to conditions for growth or production (utility). Say there are N different regimes, and under a given regime, the return is given by ${\hat{Π}}_{k} (x, u, z)$ , while time evolution of the state variable may be regime-dependent and given by F_k(x,u,z). Furthermore, let ω_k(x,u,z) denote a vector of probabilities for transitioning from regime k to one of the N regimes in the next period. These probabilities may differ under the different regimes and may further depend on the state variable, decision variable, or the stochastic component. Under regime k, the decision problem is as follows:

V_{k} (x) = \max_{u \in Γ_{k} (x)} E_{z} {{\hat{Π}}_{k} (x, u, z) + β_{k} \sum_{i} ω_{k i} (x, u, z) V_{i} (x^{'})}, k \in {1, \dots, N}

(23)

Here, we sum over i = 1..N. The probabilities sum to one, such that β_k<1 ensures that the above theorem holds. From this formulation, we see that the value for any given regime depends on a weighted sum of the elements in the value vector function. While the other suggested applications are clearly reminiscent of the original Bellman problem and can perhaps be perceived as “Bellman in a higher dimension,” considering a linear combination of the value vector function is a fundamentally different structure.

A general solution method to solve periodic optimization problems is potentially a valuable tool in a wide range of settings. The renewable capital example above suggests that complex and atypical dynamics arise for a relatively modest deviation from the autonomous (nonperiodic) formulation. Fig 2 shows that the optimal periodic solution approaches a long-run limit cycle. Moreover, abstracting from periodicity—for example, by using heuristic approaches, such as considering an average effect rather than a periodic effect—can lead astray. Further examples show that such heuristics have severe, adverse consequences if management decisions are based on an autonomous approximation while agents, subject to these decisions, observe and adapt to the periodic phenomenon [1]. Inter-annual or within-season inefficiencies that agree well with these examples are observed in empirical studies of fisheries and have gained considerable attention [11, 12].

Periodicity in applied work in bioeconomics is often treated as short-term (intra-seasonal) dynamics, where considerable progress has been made in developing models to analyze inter-annual or within-season inefficiencies [16, 17]. In these models, natural population growth and discounting processes are frequently ignored. As indicated by Birkenbach et al. [17], these processes are more significant for the inter-season perspective, as developed in Kvamsdal et al. [1, 3]. Our results make it possible to merge these developments and develop models that account for intra- as well as inter-seasonal dynamics.

Our theorem is an intuitive extension of the Bellman result. The classical Bellman result is valid for a scalar value function. The periodic problems given in equation set (5) and implied by (7), (8), and (9) are non-autonomous, their value functions are autonomous vector functions, and the Bellman result is not applicable to these problems. But when the periodic cycle is perceived as the time unit, periodic problems can be perceived as autonomous in a higher dimension. As our suggestions for applications to dynamic games (equation set (22)) and regime shifts (equation set (23)) demonstrate, our result applies to further problems with a genuinely different structure than that of the classical Bellman problem.

Supporting information

S1 File. Pseudo-code for Example 1.

(PDF)

Click here for additional data file.^{(293.3KB, pdf)}

S2 File. Code for Example 2.

(PDF)

Click here for additional data file.^{(346.5KB, pdf)}

S1 Appendix

(DOCX)

Click here for additional data file.^{(15.8KB, docx)}

Data Availability

All relevant data are within the paper and its Supporting Information files.

Funding Statement

Kvamsdal received financial support from the Research Council of Norway (grant no. 257630 and no. 302197). Maroto recieved financial support from Santander Bank - Universidad Complutense de Madrid (grant no. PR108/20-14).

References

1.Kvamsdal SF, Maroto JM, Morán M, Sandal LK. Bioeconomic modeling of seasonal fisheries. European Journal of Operational Research. 2020;281(2): 332–340. [Google Scholar]
2.Maroto JM, Morán M, Sandal LK, Steinshamn SI. Potential collapse in fisheries with increasing returns and stock-dependent costs. Marine Resource Economics. 2012;27: 43–63. doi: 10.5950/0738-1360-27.1.43 [DOI] [Google Scholar]
3.Kvamsdal SF, Maroto JM, Morán M, Sandal LK. A bridge between continuous and discrete-time bioeconomic models: Seasonality in fisheries. Ecological Modelling. 2017;364: 124–131. [Google Scholar]
4.Ni Y, Sandal LK. Seasonality matters: A multi-season, multi-state dynamic optimization in fisheries. European Journal of Operational Research. 2019;275(2): 648–658. [Google Scholar]
5.Cappers P, Goldman C, Kathan D. Demand response in U.S. electricity markets: Empirical evidence. Energy. 2010;35(4): 1526–1535. [Google Scholar]
6.McClain JO, Thomas J. Horizon effects in aggregate production planning with seasonal demand. Management Science. 1977;23(7): 728–736. [Google Scholar]
7.Bradley JR, Arntzen BC. The simultaneous planning of production, capacity, and inventory in seasonal demand environments. Operations Research. 1999;47(6): 795–806. doi: 10.1287/opre.47.6.795 [DOI] [Google Scholar]
8.Nagaraja CH, Thavaneswaran A, Appadoo SS. Measuring the bullwhip effect for supply chains with seasonal demand components. European Journal of Operational Research. 2015;242(2): 445–454. [Google Scholar]
9.Besbes O, Lobel I. Intertemporal price discrimination: structure and computation of optimal policies. Management Science. 2015;61(1): 92–110. [Google Scholar]
10.Liebchen C. The first optimized railway timetable in practice. Transportation Science. 2008;42(4): 420–435. [Google Scholar]
11.Smith MD. The new fisheries economics: incentives across many margins. Annual Review of Resource Economics. 2012;4: 379–429. [Google Scholar]
12.Huang L, Smith MD. The dynamic efficiency costs of common-pool resource exploitation. The American Economic Review. 2014;104(12): 4071–4103. [Google Scholar]
13.Bertsekas DP. Dynamic programming and optimal control. US: Athena Scientific; 2001. [Google Scholar]
14.Stokey N.L., Lucas R., Prescott E. Recursive methods in economic dynamics. Cambridge, US: Harvard University Press; 1989. [Google Scholar]
15.Ostrom E, Gardner R, Walker J. Rules, games, & common-pool resources. Ann Arbor, US: The University of Michigan Press; 1994. [Google Scholar]
16.Holland DS. Optimal intra-annual exploitation of the Maine lobster fishery. Land Economics. 2011;87(4): 699–711. [Google Scholar]
17.Birkenbach AM, Cojocaru AL, Asche F, Guttormsen AG, Smith MD. Seasonal harvest patterns in multispecies fisheries. Environmental and Resource Economics. 2020;75: 631–655. [Google Scholar]

PLoS One. doi: 10.1371/journal.pone.0260257.r001

Decision Letter 0

Mohd Nadhir Ab Wahab

16 Jun 2021

PONE-D-21-01335

A contraction approach to dynamic optimization problems

PLOS ONE

Dear Dr. Kvamsdal,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Jul 10 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Mohd Nadhir Ab Wahab, Ph.D.

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2.In your Data Availability statement, you have not specified where the minimal data set underlying the results described in your manuscript can be found. PLOS defines a study's minimal data set as the underlying data used to reach the conclusions drawn in the manuscript and any additional data required to replicate the reported study findings in their entirety. All PLOS journals require that the minimal data set be made fully available. For more information about our data policy, please see http://journals.plos.org/plosone/s/data-availability.

Upon re-submitting your revised manuscript, please upload your study’s minimal underlying data set as either Supporting Information files or to a stable, public repository and include the relevant URLs, DOIs, or accession numbers within your revised cover letter. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. Any potentially identifying patient information must be fully anonymized.

Important: If there are ethical or legal restrictions to sharing your data publicly, please explain these restrictions in detail. Please see our guidelines for more information on what we consider unacceptable restrictions to publicly sharing data: http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. Note that it is not acceptable for the authors to be the sole named individuals responsible for ensuring data access.

We will update your Data Availability statement to reflect the information you provide in your cover letter.

3.Thank you for submitting the above manuscript to PLOS ONE. During our internal evaluation of the manuscript, we found significant text overlap between your submission and the following working paper, of you are also authors:

https://openaccess.nhh.no/nhh-xmlui/bitstream/handle/11250/2573862/A14_17.pdf?sequence=1&isAllowed=y

We note that this working paper appears to be copyrighted by KOPINOR. Please note that should your paper be accepted, all content including images, results, and duplicated text will be published under the Creative Commons Attribution (CC BY) 4.0 license, which means that they will be freely available online, and any third party is permitted to access, download, copy, distribute, and use these materials in any way, even commercially, with proper attribution. In order to publish any previously copyrighted material, PLOS ONE requires permission from the original copyright holder of the content to publish it under the CC BY 4.0 license.

Please clarify whether the authors have received written permission from KOPINOR to publish this content specifically under the CC BY 4.0 license and upload the granted permission to the manuscript as a supporting information file.

To seek permission from the copyright owner to publish this content under the Creative Commons Attribution License (CCAL), CC BY 4.0, please contact them with the following text and PLOS ONE Request for Permission form (http://journals.plos.org/plosone/s/file?id=7c09/content-permission-form.pdf):

“I request permission for the open-access journal PLOS ONE to publish XXX under the Creative Commons Attribution License (CCAL) CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). Please be aware that this license allows unrestricted use and distribution, even commercially, by third parties. Please reply and provide explicit written permission to publish XXX under a CC BY license.”

Please upload the granted permission to the manuscript as a Supporting Information file. In the figure caption of the copyrighted figure, please include the following text: “Republished from [ref] under a CC BY license, with permission from [name of publisher], original copyright [original copyright year].”

Please note that RightsLink permission forms often impose use restrictions that are incompatible with our CC BY 4.0 license, and we are therefore unable to accept these permissions. For this reason, we strongly recommend contacting copyright holders with the PLOS ONE Request for Permission form.

Additional Editor Comments:

Please address all the comments given by the reviewers.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: N/A

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: No

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Abstract-

Add the evaluation and results achieved

Introduction-

some references are needed when term periodicity is discussed.

Try to avoid using 'see' a lot to refer to existing works.

Line107: () = (). (This should also be numbered as equation)

Line108: "...of problems of type (1)." <-- Unclear what type(1) means. Is it the equation?

Line117" "..classical outset yielding (1) is then.." again please mention Equation (1) so if it's the equation

Line121: "...The length of the cycle of intervals is then =..." - Show as Equation and number it

Line126: Fig1- the variable k does not appear in the figure. Why?

Also I suggest to have a clearer section to explain the steps taken in this research. It is a bit unclear generally.

Line233: The Section name can be improved. Separate the sections for the two examples or create a subsection, with clearer name of the examples being used.

Clearer explanation on the problem being used as example is needed.

Line287: For second example, can be put in different section. Further explanation of the problem is needed, before going into the technicalities.

Equation 15 not explained well.

Line298: Equation not numbered.

Line306: How are these set of parameters derived? Any reference on the values used here?

Equation 16 and 17 that follow after the parameter, are not well explained.

Line311: The values shown in Equation 18 do not indicate that (18) is an equation.

Outcome from Second Example is not very clear. Where is the outcome/results actually?

Line314: "We discovered the above results"... avoid these kind of phrase. Please point exactly what is referred or at which section

Line325&337&353: Number the equation

Lines 375-377: "As our suggestions for applications to dynamic games and regime shifts demonstrate, our result applies further to problems with a genuinely different structure than the classical Bellman problem" <-- Discuss this further. Why this is seen to be important discovery and how this will make you move forward with future related research?

OVERALL: The writing flow needs to be majorly improved. The structure (with sections) needs to be improved as well. It seems like the focus is put more on showing the formulas but not so much on the discussion and the steps taken on how these experiments are done.

Line401: More References needs to be added ~ at least 4-5 more recent papers

Reviewer #2: The authors present an extension of the classical Bellmann optimization algorithm to periodic control problems. The work is presented in a clear and technically sound manner. As my only minor comment, I suggest that the authors give a few more details how periodicity in optimal control problems is handled by now (they write "... is often treated in some ad-hoc manner" in the introduction). Also one or two references to such "ad-hoc" treatments would be appropriate in the first paragraph of the introduction.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Nov 22;16(11):e0260257. doi: 10.1371/journal.pone.0260257.r002

Author response to Decision Letter 0

20 Jul 2021

Response to Reviewers

Manuscript PONE-D-21-01335

A contraction approach to dynamic optimization problems

First, let us express our gratitude towards the reviewers who have devoted time and effort to review our work. Their comments and suggestions have helped us improve our manuscript considerably. Below, we account for our revisions in context of the reviewer comments; our replies are marked as such. Comments of a technical nature, such as missing equation numbers and so on, are taken care of but omitted from the list below.

Reviewer #1: Abstract-

Add the evaluation and results achieved

Reply: We have revised the abstract. Our main finding, that what we refer to as periodic problems are part of a more general class of solvable contraction problems, were already mentioned in the abstract. We have highlighted this further. In the revised abstract, we refer some of the results from the examples as well.

Introduction-

some references are needed when term periodicity is discussed.

Reply: We have added a reference to Kvamsdal et al. (2020), who also consider periodic problems, although in a narrower sense than what we do in the present manuscript. We would also like to point out that periodicity is considered a basic concept in dynamic systems. The third paragraph, beginning on line 54 in the revised manuscript, discusses the term and what we presently mean by it.

Try to avoid using 'see' a lot to refer to existing works.

Reply: We agree and have revised the manuscript accordingly.

Line108: "...of problems of type (1)." <-- Unclear what type(1) means. Is it the equation?

Reply: Yes, we mean the equation. We have revised the manuscript for clarity on this point.

Line117" "..classical outset yielding (1) is then.." again please mention Equation (1) so if it's the equation

Reply: Yes, again we mean the equation. We have revised the manuscript accordingly.

Line126: Fig1- the variable k does not appear in the figure. Why?

Reply: The variable k is the interval index. We agree that Figure 1 was unclear and have revised the figure to clarify the role of k.

Also I suggest to have a clearer section to explain the steps taken in this research. It is a bit unclear generally.

Reply: We agree that the manuscript would benefit from an overview of how it is organized. We have added, to the end of the introduction section, a paragraph that outlines the various steps in the manuscript. The paragraph starts on line 99 in the revised manuscript.

Line233: The Section name can be improved. Separate the sections for the two examples or create a subsection, with clearer name of the examples being used.

Reply: We have revised the section name and added subsections for each of the examples, in accordance with the suggestion from the reviewer.

Clearer explanation on the problem being used as example is needed.

Reply: We have added some further explanation of both examples. For the first example, we have also moved discussion of related work from the end of section 2 to the example subsection (3.1).

Line287: For second example, can be put in different section. Further explanation of the problem is needed, before going into the technicalities.

Reply: We agree that the second example required further explanation. The example is now more carefully explained before getting to the technicalities. We have also revised the example throughout.

Equation 15 not explained well.

Reply: Earlier equation (15), now equation (17), is now explained better. In particular, the potentially confusing factor, the different role of the index k from the previous example, is explained in a separate paragraph that starts on line 337 in the revised manuscript.

Line306: How are these set of parameters derived? Any reference on the values used here?

Reply: The set of parameters are simply chosen for numerical illustration. The manuscript has been revised to make this clear.

Equation 16 and 17 that follow after the parameter, are not well explained.

Reply: We have added brief explanations of the equations that now are numbered (19) and (20). Further, the supply functions in equation (20) are illustrated in the added Figure 3.

Line311: The values shown in Equation 18 do not indicate that (18) is an equation.

Reply: We agree that this is not an equation and have revised the manuscript accordingly.

Outcome from Second Example is not very clear. Where is the outcome/results actually?

Reply: In response to several of the comments from the reviewer, the second example is now better explained, and we hope the outcomes are clearer. We have illustrated the optimal decision rules in the new Figure 3. Let us hasten to add that the main purpose of the example was to show that the iterative solution converged to the exact solution. The example has been further revised on this point.

Line314: "We discovered the above results"... avoid these kind of phrase. Please point exactly what is referred or at which section

Reply: We had the main theorem in mind and have revised the manuscript accordingly.

Reply: We found this suggestion a bit confusing. The paragraph was meant to briefly summarize the discussion of the various applications that we discuss earlier in the section. What we have done in our revision, however, is to add a brief paragraph prior to this final paragraph. In the added paragraph, we discuss work in bioeconomics that could be directly influenced by our contribution.

Reply: We have revised the manuscript to improve the writing flow. Several of the revisions resulting from other, more specific comments from both reviewers have also contributed to an improved flow. Finally, we have had the manuscript processed by a professional copyeditor.

Line401: More References needs to be added ~ at least 4-5 more recent papers

Reply: Four references from the last decade has been added at various places throughout the manuscript.

Reply: While we never meant it that way, we realize that the term ‘ad-hoc’ could be understood as criticism. The point being that before a generic treatment like ours had been proposed, any approach would necessarily be ad-hoc in the literal sense of being for special purpose. We have thus avoided this term. Further, we have revised the first paragraph of the introduction and provide an example of practice that could be questioned. We also refer to some articles that illustrate, by mean of examples, that proper modeling of periodicity in optimal control problems matter. Notably, we have kept this discussion brief at this point in the manuscript. The examples are discussed somewhat further in the examples section. In addition, a brief paragraph prior to the final paragraph has been added where we discuss work in bioeconomics that could be directly influenced by our contribution.

Attachment

Submitted filename: PONE_RepliesToReviewers.pdf

Click here for additional data file.^{(95.2KB, pdf)}

PLoS One. doi: 10.1371/journal.pone.0260257.r003

Decision Letter 1

Mohd Nadhir Ab Wahab

8 Nov 2021

A contraction approach to dynamic optimization problems

PONE-D-21-01335R1

Dear Dr. Kvamsdal,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Mohd Nadhir Ab Wahab, Ph.D.

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Please address the comments given by the reviewers.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

Reviewer #1: Yes

Reviewer #2: (No Response)

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: N/A

Reviewer #2: (No Response)

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

Reviewer #1: Yes

Reviewer #2: (No Response)

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

Reviewer #1: Yes

Reviewer #2: (No Response)

**********

6. Review Comments to the Author

Reviewer #1: Revision has addressed the given comments. From the revised version, some improvements can be done:

Abstract - avoid starting first sentence with "We..." - try to rephrase some of the sentences in the abstract into a passive tone. Strongly recommend abstract to be rewritten.

Some of the section heading can be made clearer:

3. Examples (rename this section) - 3. Examples of....?

4.Final remarks -- suggest to be renamed as Conclusion

Reviewer #2: (No Response)

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Volker Ahlers

PLoS One. doi: 10.1371/journal.pone.0260257.r004

Acceptance letter

Mohd Nadhir Ab Wahab

12 Nov 2021

PONE-D-21-01335R1

A contraction approach to dynamic optimization problems

Dear Dr. Kvamsdal:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Mohd Nadhir Ab Wahab

Academic Editor

PLOS ONE

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 File. Pseudo-code for Example 1.

(PDF)

Click here for additional data file.^{(293.3KB, pdf)}

S2 File. Code for Example 2.

(PDF)

Click here for additional data file.^{(346.5KB, pdf)}

S1 Appendix

(DOCX)

Click here for additional data file.^{(15.8KB, docx)}

Attachment

Submitted filename: PONE_RepliesToReviewers.pdf

Click here for additional data file.^{(95.2KB, pdf)}

Data Availability Statement

All relevant data are within the paper and its Supporting Information files.

[pone.0260257.ref001] 1.Kvamsdal SF, Maroto JM, Morán M, Sandal LK. Bioeconomic modeling of seasonal fisheries. European Journal of Operational Research. 2020;281(2): 332–340. [Google Scholar]

[pone.0260257.ref002] 2.Maroto JM, Morán M, Sandal LK, Steinshamn SI. Potential collapse in fisheries with increasing returns and stock-dependent costs. Marine Resource Economics. 2012;27: 43–63. doi: 10.5950/0738-1360-27.1.43 [DOI] [Google Scholar]

[pone.0260257.ref003] 3.Kvamsdal SF, Maroto JM, Morán M, Sandal LK. A bridge between continuous and discrete-time bioeconomic models: Seasonality in fisheries. Ecological Modelling. 2017;364: 124–131. [Google Scholar]

[pone.0260257.ref004] 4.Ni Y, Sandal LK. Seasonality matters: A multi-season, multi-state dynamic optimization in fisheries. European Journal of Operational Research. 2019;275(2): 648–658. [Google Scholar]

[pone.0260257.ref005] 5.Cappers P, Goldman C, Kathan D. Demand response in U.S. electricity markets: Empirical evidence. Energy. 2010;35(4): 1526–1535. [Google Scholar]

[pone.0260257.ref006] 6.McClain JO, Thomas J. Horizon effects in aggregate production planning with seasonal demand. Management Science. 1977;23(7): 728–736. [Google Scholar]

[pone.0260257.ref007] 7.Bradley JR, Arntzen BC. The simultaneous planning of production, capacity, and inventory in seasonal demand environments. Operations Research. 1999;47(6): 795–806. doi: 10.1287/opre.47.6.795 [DOI] [Google Scholar]

[pone.0260257.ref008] 8.Nagaraja CH, Thavaneswaran A, Appadoo SS. Measuring the bullwhip effect for supply chains with seasonal demand components. European Journal of Operational Research. 2015;242(2): 445–454. [Google Scholar]

[pone.0260257.ref009] 9.Besbes O, Lobel I. Intertemporal price discrimination: structure and computation of optimal policies. Management Science. 2015;61(1): 92–110. [Google Scholar]

[pone.0260257.ref010] 10.Liebchen C. The first optimized railway timetable in practice. Transportation Science. 2008;42(4): 420–435. [Google Scholar]

[pone.0260257.ref011] 11.Smith MD. The new fisheries economics: incentives across many margins. Annual Review of Resource Economics. 2012;4: 379–429. [Google Scholar]

[pone.0260257.ref012] 12.Huang L, Smith MD. The dynamic efficiency costs of common-pool resource exploitation. The American Economic Review. 2014;104(12): 4071–4103. [Google Scholar]

[pone.0260257.ref013] 13.Bertsekas DP. Dynamic programming and optimal control. US: Athena Scientific; 2001. [Google Scholar]

[pone.0260257.ref014] 14.Stokey N.L., Lucas R., Prescott E. Recursive methods in economic dynamics. Cambridge, US: Harvard University Press; 1989. [Google Scholar]

[pone.0260257.ref015] 15.Ostrom E, Gardner R, Walker J. Rules, games, & common-pool resources. Ann Arbor, US: The University of Michigan Press; 1994. [Google Scholar]

[pone.0260257.ref016] 16.Holland DS. Optimal intra-annual exploitation of the Maine lobster fishery. Land Economics. 2011;87(4): 699–711. [Google Scholar]

[pone.0260257.ref017] 17.Birkenbach AM, Cojocaru AL, Asche F, Guttormsen AG, Smith MD. Seasonal harvest patterns in multispecies fisheries. Environmental and Resource Economics. 2020;75: 631–655. [Google Scholar]

PERMALINK

A contraction approach to dynamic optimization problems

Leif K Sandal

Sturla F Kvamsdal

José M Maroto

Manuel Morán

Roles

Abstract

1. Introduction

2. A contraction operator for the periodic problem

Fig 1. Interval index reference for periodic problems.

3. Examples of applications

3.1 Periodic price in a resource model

Fig 2. Decision rules for the periodic problem example.

3.2 A dynamic game of supply

Fig 3. Optimal supply for the dynamic game example.

4. Conclusions

Supporting information

Data Availability

Funding Statement

References

Decision Letter 0

Mohd Nadhir Ab Wahab

Roles

Author response to Decision Letter 0

Decision Letter 1

Mohd Nadhir Ab Wahab

Roles

Acceptance letter

Mohd Nadhir Ab Wahab

Roles

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

A contraction approach to dynamic optimization problems

Leif K Sandal

Sturla F Kvamsdal

José M Maroto

Manuel Morán

Roles

Abstract

1. Introduction

2. A contraction operator for the periodic problem

Fig 1. Interval index reference for periodic problems.

3. Examples of applications

3.1 Periodic price in a resource model

Fig 2. Decision rules for the periodic problem example.

3.2 A dynamic game of supply

Fig 3. Optimal supply for the dynamic game example.

4. Conclusions

Supporting information

Data Availability

Funding Statement

References

Decision Letter 0

Mohd Nadhir Ab Wahab

Roles

Author response to Decision Letter 0

Decision Letter 1

Mohd Nadhir Ab Wahab

Roles

Acceptance letter

Mohd Nadhir Ab Wahab

Roles

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases