Abstract
We investigate adaptive strategies to robustly and optimally control the COVID-19 pandemic via social distancing measures based on the example of Germany. Our goal is to minimize the number of fatalities over the course of two years without inducing excessive social costs. We consider a tailored model of the German COVID-19 outbreak with different parameter sets to design and validate our approach. Our analysis reveals that an open-loop optimal control policy can significantly decrease the number of fatalities when compared to simpler policies under the assumption of exact model knowledge. In a more realistic scenario with uncertain data and model mismatch, a feedback strategy that updates the policy weekly using model predictive control (MPC) leads to a reliable performance, even when applied to a validation model with deviant parameters. On top of that, we propose a robust MPC-based feedback policy using interval arithmetic that adapts the social distancing measures cautiously and safely, thus leading to a minimum number of fatalities even if measurements are inaccurate and the infection rates cannot be precisely specified by social distancing. Our theoretical findings support various recent studies by showing that (1) adaptive feedback strategies are required to reliably contain the COVID-19 outbreak, (2) well-designed policies can significantly reduce the number of fatalities compared to simpler ones while keeping the amount of social distancing measures on the same level, and (3) imposing stronger social distancing measures early on is more effective and cheaper in the long run than opening up too soon and restoring stricter measures at a later time.
Keywords: Epidemic control, COVID-19, Optimal control, Model predictive control, Robustness
1. Introduction
Social distancing is an effective way to contain the spread of a contagious disease, particularly when little is known about the virus and no vaccines or other pharmaceutical interventions are available. Social distancing and isolation (together with other non-pharmaceutical measures such as hygiene and face masks) have a direct influence on the infection rates and hence on the spread of the virus (Kissler et al., 2020, Maharaj and Kleczkowski, 2012, Maier and Brockmann, 2020). While this combination has proven effective during the last weeks, e.g. in the German outbreak of COVID-19, strict social distancing is also very costly in terms of economical and psychological damage, which naturally leads to a multi-objective decision problem.
There have been numerous approaches to model the COVID-19 outbreak and to predict future behavior for different distancing policies in simulation studies. The most commonly used modeling approaches are different extensions of the SIR (susceptible–infected–removed) model formulated either as system dynamics or as agent-based simulations (e.g. for Germany (Barbarossa et al., 2020, Dehning et al., 2020, German et al., 2020)). In many such studies, different policies are simulated and compared with respect to the goals that both the health care system is not overwhelmed such that every patient in need receives treatment and the mortality rate is kept low, and also such that the majority of people can resume social interaction as soon as possible. However, in line with Alleman, Torfs, and Nopens (2020) and others, we advocate to go from mere model predictions to (model predictive) control, since control generally offers the theory to develop and apply optimal or robust decision making under uncertainties.
While mathematical modeling and control of epidemics is a topic with rich history (see, e.g., the survey in Nowzari, Preciado, and Pappas (2016) and the references therein), there have also been numerous approaches to apply control theory to the COVID-19 spread. In Casella (2021), the author applies control theoretic principles and insights to a simple model of the outbreak to point out the difficulties of the system at hand: fast unstable dynamics with significant delays. In more recent literature, multiple works have addressed the problem of open-loop optimal control for the COVID-19 pandemic. In Bin et al., 2020, Tsay et al., 2020, for example, the authors argue in favor of ’on and off’-policies of the social distancing measures, yielding a bang–bang like optimal control strategy. Such ’on and off’-policies, where the control input switches between two states, however, could pose great challenges, amongst others, for the society, but also for production lines, supply chains and the economy in general.
In this paper, we propose optimal open-loop and feedback control strategies to handle the German COVID-19 outbreak. We employ the recently developed SIDARTHE model (Giordano et al., 2020) in order to design control policies which minimize the number of fatalities within a time horizon of two years, without using excessive social distancing measures. We also address robustness of our policies w.r.t. model and measurement uncertainties via a (robust) model predictive control (MPC) feedback strategy. Note that the following discussion and results are all based on the information and data available prior to the initial submission of this paper (May 2020).
Similar to the setup in this paper, the authors in Djidjou-Demasse, Michalakis, Choisy, Sofonea, and Alizon (2020) explore the best policy to implement while waiting for the availability of a vaccine. In their paper, they also distinguish between varying severity of symptoms (’mild’ or ‘severe’) and seek a solution to the multi-objective optimization problem of minimizing fatalities and costs due to the implementation of the control strategy itself. Their main outcome of the open-loop input strategy is qualitatively similar to our results in Section 3.2: Start with a loose strategy, soon increase all distancing measures such that the health care system capacities are never extorted and then relax the social distancing measures gradually and slowly. Another example for an open-loop optimal policy applied to the COVID-19 pandemic is presented in Kantner and Koprucki (2020) where the authors consider optimal control of the German outbreak using a slightly simpler model as the one chosen in the present paper (without distinguishing between detected and undetected individuals), which also includes an increased mortality rate if the ICU capacity is exceeded. Therein, the objective is not only to minimize the number of fatalities but also the number of susceptible individuals at the end of the time horizon, thus aiming for herd immunity. Our investigations in Section 3, however, indicate that herd immunity cannot be reached in a reasonable amount of time without overwhelming the hospital capacities. Therefore, our approach minimizes the number of fatalities after two years, with the underlying assumption that a vaccine will be available thereafter.
However, an open-loop optimal policy cannot suffice to control the COVID-19 pandemic given all the uncertainties in the spreading of the virus and the disease progression, as we will see in the numerical results. We argue, similar to Alleman et al. (2020), that an MPC-based feedback strategy is the right tool to develop optimal and robust social distancing policies, especially in the presence of model inaccuracies. By using online measurements of the current outbreak, feedback inherently introduces robustness with respect to uncertainties and disturbances to the policy. We also robustify the feedback mechanism by introducing a robust MPC-based feedback strategy for uncertain state measurements which is crucial in a situation where only a limited amount of data is available and, for example, the number of the currently infected persons can only be estimated roughly by applying different studies.
Our results are also in accordance with a very recently published joint strategy paper for Germany by authors from different German research institutions (Fraunhofer-Gesellschaft, Helmholtz Association, Leibniz Association and Max Planck Society) (Meyer-Hermann, Pigeot, Priesemann, & Schöbel, 2020). Firstly, they also state that reaching herd immunity without the availability of a vaccine would either exceed the health care capacities (with a resulting high mortality rate) or take several years (cf. our results in Section 3). Secondly, they state that the goal of wiping out the virus can only be a robust solution if this eradication would be a worldwide effort with very high social and economic costs (cf. our results in Section 3.1.1), which seems impossible to realize. Finally, they suggest an adaptive strategy for all policies influencing the infection rates with the goal to keep the spread of COVID-19 at bay while requiring the least possible restraints on the society and economy. With exactly this reasoning, we develop suitable control approaches in Section 3.2, Section 4, and Section 5 for such an ‘adaptive’ strategy.
To summarize, our key contributions are the following:
-
•
We extend the model in Giordano et al. (2020) by a mortality rate dependent on the state of the health care capacity and fit the parameters with data from Germany (Section 2).
-
•
We develop an optimization problem for finding the optimal input (in terms of setting infection rates) that minimizes the number of fatalities while keeping the costs occurring due to distancing measures low (Section 3). Moreover, we show that such an optimal input has significant advantages compared to simpler baseline policies.
-
•
We show that simply applying a precomputed (optimized) input is dangerous if the model is uncertain and explain why feedback is of utmost importance when dealing with such an unstable and uncertain system. Further, we demonstrate how such feedback can be incorporated via MPC, and we showcase the advantages of this control policy (Section 4).
-
•
We develop a robust MPC-based feedback strategy, which takes model inaccuracies, uncertain state measurements, and inexact inputs into account and can thus handle the COVID-19 outbreak cautiously and safely (Section 5).
Although based on a simple model fitted with limited data, we hope that these high-level insights inspire further investigations, possibly on more complex epidemiological models, and can ultimately help decision makers to improve and optimize their policies to mitigate the spread of epidemics while keeping the toll on the society and economy low.
2. Modeling of the COVID-19 epidemic
In this section, we describe the model of the COVID-19 epidemic that we use for our subsequent control approach. Our model is adapted from the SIDARTHE model proposed in Giordano et al. (2020) with the key differences that (i) we use more recent data to estimate new parameters to model the German COVID-19 outbreak (in Giordano et al. (2020), the Italian outbreak was considered) and (ii) we model the fact that the mortality rate increases if the number of critically ill patients exceeds the capacity of the German health care system. In Section 2.1, we describe the model of Giordano et al. (2020) and explain its ingredients. Thereafter, in Section 2.2, we provide details on our parameter estimation algorithm which fits the model to the German COVID-19 outbreak. Finally, we propose an extension of the model by increasing the mortality rate when the health care system is overwhelmed in Section 2.3.
2.1. The SIDARTHE model
The considered model based on Giordano et al. (2020) is shown in Fig. 1 and includes eight states: S - Susceptible, I - Infected (asymptomatic, undetected), D - Diagnosed (asymptomatic, detected), A - Ailing (symptomatic, undetected), R - Recognized (symptomatic, detected), T - Threatened (symptomatic with life-threatening symptoms, detected), H - Healed (immune after prior infection, detected or undetected), E - Extinct (dead, detected). In accordance with Fig. 1, the following differential equations describe the SIDARTHE model:
In the Eqs. (1), capital letters describe fractions of the whole population that are currently in the respective state. Since the model represents the whole population, the states sum up to , i.e., they must satisfy at all times. Therefore, one equation in (1) is redundant and hence, e.g., the state can be expressed via the algebraic relation instead of Eq. (1g), as it is common in the field of differential algebraic equations. In most parts of this section, we omit time arguments for simplicity. Further, Greek letters are the model parameters which are briefly summarized in the following:
Fig. 1.
Scheme of the considered model for the COVID-19 epidemic.
Adapted from [16].
-
•
describe the infection rates for susceptible individuals, i.e., the rate at which susceptible individuals are infected by the states , or , and , respectively, and hence join the state .
-
•
describe the testing rate, i.e., at which rate (asymptomatic or symptomatic) infected individuals go from undetected to detected.
-
•
describes the rate of asymptomatic (detected or undetected) infected individuals exhibiting symptoms, i.e., going from states or to or , respectively.
-
•
is the rate at which infected individuals in or develop life-threatening symptoms, i.e., join the state .
-
•
are recovery rates for individuals affected by COVID-19. The recovery rate for threatened individuals depends on , compare Section 2.3.
-
•
is the mortality rate, i.e., the rate at which individuals with life-threatening symptoms decease, and it depends on , compare Section 2.3.
Key features of the considered model for the COVID-19 pandemic compared to simpler ones (e.g., SIR models, compare (Kermack & McKendrick, 1927)) are that it distinguishes between detected and undetected cases, symptomatic and asymptomatic individuals, and it includes a separate state for patients with life-threatening symptoms (compare [16] for a more detailed explanation of the key ingredients). The present model, i.e., Eqs. (1) as well as Fig. 1, is a mild modification of the model suggested in Giordano et al. (2020). First, we reduce the number of parameters by including the following assumptions. We assume that the rate for developing (severe) symptoms is the same for detected and undetected cases, since (to this day) no effective medication of COVID-19 is known. More precisely, the transitions from to and to have the same dynamics with rate , and similarly for the respective recovery rates as well as for the transitions from to and to . Moreover, we assume that the rate at which susceptible individuals are infected is the same from states and , since the state is neglected for the parameter identification step (compare Section 2.2). Finally, as a key difference to Giordano et al. (2020), we consider -dependent rates and for threatened patients, i.e., the mortality and recovery rates depend on the current number of threatened patients. Essentially, increases and decreases if exceeds the capacity of the German health care system (see Section 2.3 for a detailed description of this model refinement).
2.2. Parameters for the German outbreak
In this section, we adjust the model parameters and the initial condition given in Giordano et al. (2020) to the COVID-19 outbreak in Germany. This is necessary, because the outbreaks in Germany and Italy evolved differently due to differences in the testing policy, the testing capacity, the health care system, the reaction of the governmental authorities, and the underlying counting method of confirmed cases.
In order to compute realistic parameters for Germany, we will use a pragmatic approach that enables us to easily include prior knowledge about relations between parameters. The approach is a least squares optimization of the available data, where prior knowledge is incorporated via hard constraints in the optimization problem. The available data is marked by a tilde and is given by:
-
•
the confirmed COVID-19 cases , deaths , and recoveries from Dong et al., 2020a, Dong et al., 2020b for the days from February 28, 2020 () to April 21, 2020 (). We filtered this data set using the Matlab function kaiser(7,3) with window length and shape factor to reduce the effect of noise corruption and having less confirmed cases during weekends. Further, we have to divide the data set by the total German population to ensure all values are normalized and are in the range .
-
•
the COVID-19 patients in ICU and how many of them died or recovered from DIVI-IntensivRegister (2020) for from March 23 () to April 21 ().1
This data set, however, is rather small compared to the complexity of the model (1) consisting of eight states and parameters. Therefore, we need to leverage additional prior knowledge in order to avoid over-fitting and ensure a realistic resulting parameter set and initial conditions. Based on other studies and the interpretation of our model states and parameters, we enforce the following assumptions.
-
•
The detection rate of asymptomatic cases is negligible, as the current German policy is to test only people showing symptoms (Jakob Simmank, 2020), i.e., .
-
•
At February 28, the initial date for our fit, there were confirmed cases, hence, we assume , , and , , appear as decision variables with .
-
•
The test rate is approximately constant. Please note that this does not mean that the absolute number of tests is constant per day, as this value is rather proportional to than to .
-
•
The infection rates and were influenced by the countermeasures that the German authorities installed to fight the spread of the pandemic. According to Dehning et al. (2020), three main events changed the spreading rates: (1) March 9 — canceling large events, (2) March 16 — closing schools and non-essential stores, and (3) March 23 — contact ban (Kontaktsperre) that prohibits groups of more than two people and requires people to maintain a distance of at least in public. Hence, there are four different policies , monotonically increasing from no countermeasures to full lockdown resulting in and . This yields the following six decision variables , , , , , and .
-
•
One of the main reasons why the COVID-19 pandemic is spreading so fast is that infectiousness peaks even before the onset of symptoms (He et al., 2020). As asymptomatic individuals have no indication of their infection, they are on top of that also not as cautious as people with symptoms. Therefore, we require when searching for realistic parameters. Further, we want to ensure that people tested positive are significantly less contagious while in quarantine, such that we require .
-
•
The percentage of confirmed COVID-19 cases is estimated in the study (Bommer & Vollmer, 2020) as 27.32% in Germany. In our model, this value approaches the constant , which is the proportion of people that develop symptoms ( to , to can be ignored as ) and get detected ( to or ); that is the percentage of confirmed accumulated cases in a steady state (). To make sure our model coincides with the findings of Bommer and Vollmer (2020), we expect to be slightly above of the estimated 27.32% as a steady state is not reached yet and the proportion of detected cases increases over time, i.e., we constrain .
-
•
The percentage of asymptomatic disease progressions was estimated at 43% in a population screening study in Iceland (Gudbjartsson et al., 2020), at 43.2% in a comprehensive testing of the whole municipality of Vo’, Italy (Lavezzo et al., 2020) and at 17.9% (Mizumoto, Kagaya, Zarebski, & Chowell, 2020) in a study regarding the cruise ship Diamond Princess. To ensure that our model has a comparable ratio, we add the constraint to the optimization problem.
-
•The (base) reproduction rate in the beginning of March was estimated as approximately (an der Heiden & Hamouda, 2020). Thus, for the parameters , with no active countermeasures we require where is given by (see Giordano et al. (2020) for details)
(2) -
•
The median of the incubation time is – days (Li et al., 2020, Linton et al., 2020, World Health Organization (WHO), 2020), which we identify as the half life period a person is in the state , i.e. . Further, the median time from onset of symptoms until intensive care is – days (Wang et al., 2020, Yang et al., 2020). Hence, we constrain the half life period of a ailing or recognized individuals to .
In the state of (1) the confirmed recovered cases are not distinguished from the undetected ones, thus we define the number of confirmed recovered cases as , with and and further the number of confirmed accumulated cases as in order to match the data and . Considering the COVID-19 patients in intensive care , a natural choice would be to identify them with threatened state , however, all deaths in the model have been in before, but in reality only half to a third of the deaths happens in ICU (DIVI-IntensivRegister, 2020, Dong et al., 2020a). Hence, as the patients in ICU are only a part of , we split into and , where represents the number of people in intensive care and are all otherwise threatened COVID-19 cases. We assume that there are no transitions from to and vice versa, such that can be modeled as
(3a) |
(3b) |
with . This more complex model with and will be approximated with a model of the form described in Section 2.1 as sketched in Fig. 2. Further, we define () to be the numbers of people that recovered (died) from .
Fig. 2.
The threatened state is split up in ICU cases and non-ICU cases which is later approximated using a lumped model.
Finally, we perform the parameter optimization by solving a least squares problem via CasADi (Andersson, Gillis, Horn, Rawlings, & Diehl, 2019) to fit , , , , , to the data , , , , , . The best fitting parameters are given in Table 1 and the resulting fit is shown in Fig. 3. Many of the constraints listed above are active at the optimal set of parameters, e.g., , which is not surprising since we use the constraints to keep the parameters in a realistic range without further regularization.
Table 1.
The parameters of the model for Germany.
Fig. 3.
Simulation of the model (1) with the estimated parameters in Table 1 scaled by compared to the actual data (DIVI-IntensivRegister, 2020, Dong et al., 2020a). The horizontal axis represents the time in days, where is February 28 and is April 21.
This fit further enables us to specify the full model state of today , which will be used in the following sections as the initial condition where corresponds to April 21
(4) |
Please note that the model is quite sensitive to changes in the parameters and one obtains quite different parameter values if, e.g., the estimated range of unknown cases or the percentage of asymptomatic cases deviate from the assumptions.
2.3. Modeling of the mortality rate
It has been recognized as a key difficulty in handling the COVID-19 pandemic that the virus is highly contagious, thus infecting large numbers of individuals. In addition, since many elderly and ill people require hospitalization and/or intensive care (Verity et al., 2020), large waves of infections can quickly exceed the capacities of local health care systems (Grasselli, Pesenti, & Cecconi, 2020). Hence, ensuring that health care resources are sufficient is a key issue in handling the outbreak (IHME COVID-19 health service utilization forecasting team, C. J. L. Murray, 2020), given that an overwhelmed health care system even correlates positively with the mortality rate (Ji, Ma, Peppelenbosch, & Pan, 2020).
In this section, we describe how the mortality and recovery rates and in (1) depend on the number of threatened patients . The basic idea is that they are constant as long as the health care system’s capacity is not at its limit, and the mortality rate (the recovery rate ) increases (decreases) significantly if it is overwhelmed. According to DIVI-IntensivRegister (2020), there are (on April 21) COVID-19 patients in an ICU and ICU spots are available. Hence, the overall ICU capacity currently available for COVID-19 patients is , and we define the relative ICU capacity as , where . We consider a constant value of for simplicity, although it is likely that it will further increase in the future.
We assume that the mortality rate increases if the number of individuals requiring treatment in an ICU exceeds , i.e., if , with as in 2. More precisely, we assume that if a patient requiring intensive care does not receive it, then the patient deceases (i.e., for such patients, the mortality rate increases and the recovery rate is zero). According to data of deceased individuals from Italy, those who were not admitted to an ICU deceased in median within days (COVID-19 Surveillance Group, 2020). Hence, we model those individuals in which are not admitted to an ICU via decaying first order dynamics with a half-life period of days, i.e, the corresponding time constant satisfies , thus leading to .
In a first approximation and hence we only modify the mortality rate in case that . In the model (1), and only occur jointly with , which leads us to the following formula for and :
If , then (5) implies and , i.e., a simple lumped model is recovered as long as the ICU capacity is not exceeded. If however , then the mortality rate increases to for those patients which require intensive care but do not receive it. Similarly, for this fraction, the recovery rate is set to zero implicitly in (5b). The individuals not receiving intensive care are not affected by this mechanism.
Clearly, the modified rates in (5) are just a simple approximation of the effect that the mortality rate increases if hospitals are overwhelmed. This modification in the model is crucial when studying the effect of loosening quarantine measures and corresponding optimal policies, as done in the remainder of this paper. Since (fortunately) the German health care system has not been overwhelmed to this date, there are no quantitative data to validate the above modification and in particular, the exact value of . Nevertheless, the refinement is confirmed qualitatively by experiences in other countries (Grasselli et al., 2020, IHME COVID-19 health service utilization forecasting team, C. J. L. Murray, 2020, Ji et al., 2020). Moreover, even a substantial change of has little effect on the overall dynamics since it only affects the exact number of fatalities. In particular, changing does not lead to a qualitative change in an optimal policy to control the outbreak as long as is sufficiently larger than and it is possible not to exceed the ICU capacity.
3. Open-loop optimal control of the COVID-19 outbreak
In this section, we discuss different policies that can be considered to keep the number of fatalities due to COVID-19 low while at the same time also impose as little constraints as possible on the public. The most significant degree of freedom currently is certainly influencing the infection rates and . Measures for influencing the infection rates include hygienic measures, face masks, and different nuances of distancing policies, up to a mandated lockdown. Therefore, we define as introduced in Section 2.2 as our input, representing distancing policies or other measures that have a direct influence on the infection rates and . We model this influence via
where a value of hence represents the policies in Germany as of mid April (lockdown) and represents no social distancing or other measures (i.e. corresponding to infection rates as in the beginning of March). Furthermore, we assume that the policies affecting the infection rates , (i.e. ) stay constant for at least one week and can only be changed every seven days. In the first subsection, we will introduce different baseline policies which can give insights into the effects of different inputs and which will serve as a comparison to the optimal controller in the following subsection. More specifically, these baseline strategies will be used to define an upper bound on the social distancing measures that the optimal control in Section 3.2 and later on the feedback strategies in Sections 4, 5 can employ to minimize the fatalities.
Test capacity
In addition to varying the infection rates and , another degree of freedom to influence the model (1) lies in adapting the testing policy. Testing individuals on COVID-19 is represented in the current model by parameters and for symptomatic and asymptomatic individuals, respectively. In the following, we assume that only a fixed number of tests can be carried out every day. If we wish to only test symptomatic individuals, this includes both symptomatic individuals infected with COVID-19 (i.e., members of the state ) and individuals suffering from other illnesses with similar symptoms. In Robert Koch Institut (2017), the Robert Koch Institute estimates numbers on influenza-like illnesses (ILI) in Germany. While the numbers show clear seasonal differences, approximately 1.3% of the population become newly infected with ILI on average per week, and approximately 37% of them see a doctor (an indication for more severe symptoms). Moreover, influenza symptoms usually last 4–5 days leaving us with an approximate average of of the population showing significant influenza-like symptoms at an arbitrary time of the year. When testing asymptomatic individuals, this includes infected persons without symptoms but also any other individual not known to be infected or healed and not showing any symptoms (i.e. , where are the confirmed healed cases, compare Section 2.2). The total amount of resources used for testing is then captured by the following cost:
(7) |
Denoting the parameter from Table 1 as , we assume a fixed bound on the amount of resources for testing and that corresponds to the nominal value for , i.e., . In the following, we assume that the current policy with respect to testing stays in place: all available tests are used on a daily basis for as many symptomatic people as tests are available. Then the testing policy used throughout this section reads (as is current practice) and
(8) |
Note that this also implies that throughout this paper the state . The allocation of tests (with the possibility of also saving test resources for later) can also be modeled as control inputs. However, in the present model the effects of temporarily saving tests (under the current resource constraints) are negligible compared to the effects of changes in the infection rates. Increasing the overall test capacity or improving the choice of test subjects (e.g. with tracing of cases), which corresponds to increasing values of /, on the other hand, can potentially improve the evolution of the pandemic significantly, since detected individuals are less contagious than undetected ones. However, increasing test capacities or better allocated testing (especially with regard to , i.e. tracing of also asymptomatic infections) is at the current stage not included in our consideration but could be addressed in future work with the presented model by choosing and making / an increasing and time-varying variable.
Control goal
Given the introduced control input , different control goals can be formulated. One such goal could be to obtain herd immunity. Herd immunity corresponds to the only stable equilibrium given no social distancing measures (i.e. with , ) and requires a large part of the society to be immune. More precisely, herd immunity is reached if , where (Giordano et al., 2020) provide a formula for calculating (see Section 3.1.1 for more details). Given our model, we can now calculate the minimum time that is needed to reach herd immunity. For this, we assume that we can choose a policy that utilizes the full health care capacity at all times. With , , , , decreases each day by . Hence,
gives a lower bound on the time required to reach herd immunity without exceeding the health care capacity given the introduced model. The herein identified model parameters yield a time span of more than six years. A stable steady state in the absence of a vaccine (i.e. herd immunity) can hence only be obtained either after many years or by overstraining the health care system and a corresponding significant rise in the number of fatalities. Therefore, our ongoing assumption throughout this section is that prior to herd immunity, a vaccine will be available and we assume the availability of the vaccine in approximately two years. Our goal is thus to find an optimal policy minimizing the number of fatalities for the next two years while imposing as little constraints as possible on the public and the economy.
In the next subsection, we simulate and discuss the following policies:
-
1.
Keeping the social distancing measures in place (or even increasing the measures) until the virus is eradicated in Germany
-
2.
Slowly (or more aggressively) loosening the distancing measures without overwhelming the health care capacities (while possibly risking a second wave).
In fact, the presented baseline policies are similar to the policies suggested by the German “Helmholtz-Initiative” in Helmholtz-Initiative ‘Systemische Epidemiologische Analyse der Covid-19-Epidemie’ (2020). We will discuss our conclusions in comparison to theirs at the end of the section.
In Section 3.2, we will then improve these baseline policies by applying optimal control techniques and we will discuss the importance and significance of the improvements.
3.1. Introducing different baseline policies
3.1.1. Consistent lockdown
In the following, we argue that a consistent lockdown strategy necessitates strong lockdown measures over a long time horizon to fully eradicate the virus as otherwise, dropping the social distancing measures too early leads to a second outbreak wave.
Based on the SIDARTHE model fitted to the German outbreak, described in Section 2.1, we simulate how long we would need to remain in lockdown and simply wait for the virus to disappear. We define the disappearance of the virus as follows: If – most probably – there is less than one active contagious case, i.e., , the virus is eradicated. It takes days, which is almost one year, until this condition is fulfilled and clearly the economical and psychological damage caused by a lockdown period this long is excessive such that staying and waiting in lockdown is not an option. With even stricter measures, such as , , we could only marginally accelerate this process to days while increasing social distancing is costly, cf. the cost function in Section 3.2. Note that the equilibrium attained under the above lockdown policy is an unstable one that is not robust to uncertainties. In particular, if only one person remains infected when the measures are suspended they could cause a new outbreak. Also, the virus may be reimported from other countries or humans might be reinfected by an interim host.
Next, we simulate the following three scenarios in all of which the German population is kept in lockdown for a predefined period of time, followed by no measures at all. The only difference is the length of the lockdown period. In the first scenario, the measures are abolished immediately (April 21). The second one keeps the current strict measures for an additional days. The third variant simulates an even longer lockdown period, ending after days counting from April 21.
In Fig. 4, we compare the three scenarios. We clearly see that in all three cases the number of currently infected people rises drastically a few days after the measures are removed independent of how long the lockdown persisted before. In any case, we experience a second outbreak wave. Staying longer in lockdown slightly delays the following peak of the share of active cases , yet the peak amplitude is almost the same in all three scenarios.
Fig. 4.
Simulation of the model (1) for a consistent lockdown of different lengths with days on the horizontal axis ( - blue, solid; - red, dash-dotted; - yellow, dashed).
This behavior can be explained as follows. If there is no one who currently has the virus, i.e., , such that , an equilibrium point is attained. The stability of the equilibrium point depends on the value of and the model parameters. In Giordano et al. (2020), the authors show that the IDART subsystem is asymptotically stable if and only if , where is the susceptible state at equilibrium for a given initial condition and the corresponding parameters, especially for and that are actively adjusted according to an underlying policy. The value of follows from the stability analysis of the linearized IDART subsystem and has the following structure with respect to and
(9) |
where are constants, see Giordano et al. (2020) for details and the definition of . Note further that the commonly stated base reproduction rate (2) is directly linked to the value via . The stability of an equilibrium that depends on the parameters changes once we adjust and . For strict measures (, ), the value is high (), such that a stable equilibrium is attained for any . This means that only a small number of people is infected by the virus before the equilibrium is attained. With no measures (, ), a stable equilibrium requires the share of susceptible people to be smaller than , i.e., herd immunity.
We can hence conclude that if , the equilibrium attained during lockdown is unstable with no measures. This means, there inevitably is a second outbreak wave once the lockdown ends. For the fitted model of the German outbreak, is attained after the first wave. Hence, at least another 70.4% of the German population get infected in the second wave before a stable equilibrium is attained. Altogether, this leaves us with the following conclusion of two possible outcomes when choosing a consistent lockdown strategy:
-
•
Strong lockdown measures over long time horizons have to be taken to eradicate the virus in Germany. However, this takes a big toll on the public and any infected person, e.g. from abroad, could spark a second wave at any point.
-
•
Any lockdown strategy that does not fully wipe out the virus inevitably yields a second outbreak wave once all measures are suspended.
3.1.2. Iterative loosening of the distancing policies
As argued in the previous subsection, keeping the lockdown policy strict can never lead to a stable equilibrium when ending the lockdown, no matter how long it did take place before all measures were suspended. Hence, many countries are now discussing or have even already started to loosen the lockdown in very small steps. Indeed, experts consulting the German government (”Nationale Akademie der Wissenschaften Leopoldina”) have recently published their recommendations concerning a possible strategy for loosening the lockdown gradually in small steps (Leopoldina, 2020).
They name the following conditions for loosening the lockdown in small steps:
-
(a)
The number of new infections remains at a low value.
-
(b)
The capacity of the health care system must not be exceeded.
-
(c)
Precautions (such as hygienic measures, face masks, distancing) remain in force.
In the following, we try to translate these recommendations into a policy for our simplified model to first analyze the results and second, to use this as a baseline policy for the optimizer in the following subsection. We implement the conditions presented above via the following policy strategy:
-
(a)
can only be decreased if, over the last days, the number of newly infected persons (i.e. ) is decreasing and has not been increased
-
(b)
can only be decreased if less than of the ICUs are occupied
-
(c)
The decrease in can only be a small decrease at a time and therefore, the interval between (lockdown) and (no measures) is divided into equidistant steps.
Additionally, we add that will be increased again (with the same step size as the decrease) if more than of the ICUs are occupied and no decrease in the amount of necessary ICU is witnessed. This policy results in four ’tuning parameters’ of the policy: , , and . In fact, it turns out that the outcome of the simulation is not very sensitive to the tuning parameters of the policy, but can be tuned to be slightly more careful or more aggressive. In the following subsection, we choose two different sets of parameters as baseline policies for the optimal control approach.
3.2. Optimal control strategy
In this section, we contrast the baseline policy from Section 3.1.2 with an optimal control policy, under idealized assumptions (exact model and state measurement). The purpose of this section is twofold: (a) Understand how an optimal policy differs qualitatively from the baseline policies. (b) Quantify the loss of performance (in terms of increased fatalities and/or unnecessary social policy ) resulting from using a suboptimal baseline policy. The degree of freedom is the input affecting the social policy and we consider the fact that the policy can only be changed every days.
Multi-objective optimal control problem
In the following, we consider the problem only for the next weeks, assuming that thereafter a vaccine might be developed that would ideally prevent (almost all) further fatalities in the future. The overall control problem can be seen as a multi-objective optimal control problem, where we wish to simultaneously minimize the number of fatalities and the societal and economical cost of the social policy measures, which will be measured by the function . We point out that due to the parametrization (6) this cost also inherently considers the infection rate . This cost function is such that the social cost of achieving an arbitrarily small infection rate grows unbounded, while for large values of incremental differences are less relevant. In order to suitably characterize an optimal solution to this multi-objective problem we use the baseline policies in Section 3.1. The resulting optimal control problem is given by (11) below, which will be explained in the following. In particular, our goal is to find an input policy that minimizes the number of accumulated fatalities, while using less resources than the baseline policy in terms of accumulated social impact of (c.f. (11d)). We point out that similar “stabilization” problems subject to resource constraints for the control of epidemic outbreaks can be found in the literature, also using a fractional cost , compare e.g. Köhler et al., 2018, Preciado et al., 2014.
When minimizing the number of accumulated fatalities, it is important to consider not only the extinct individuals at the end of the two year horizon, but to account as well for the part of the already infected individuals that will decease after the two year horizon. The reason for this is, while the availability of a vaccine at the end of the horizon might prevent future infections, it cannot cure already infected people. Hence, if we do not account for the inevitable fatalities among the individuals infected at the end of the prediction horizon, the optimal controller does not take any efforts to keep them low and as a result a lot of people would die shortly after the two year horizon. Therefore, we propose an optimization objective that includes all past and inevitable future fatalities. Based on the model (1), we know that a total of of the infected people will develop symptoms in the future and further that a total of will become threatened. Thus, assuming the capacity is not exceeded afterwards, i.e., setting constant values and , the amount of inevitable fatalities is exactly given by
(10) |
Hence, given a baseline solution from Section 3.1.2, the corresponding optimal control problem reads as follows:
Since we only change the policy every week, the index in (11) corresponds to weeks and corresponds to the objective function in (10), where the states result from simulating the system (1) with the parameters and the initial condition from Section 2 and the input over -weeks. Condition (11d) ensures that the encountered social cost is smaller than the cost of the baseline policy. This optimal control problem is such that the baseline policy is a feasible solution and thus the resulting fatalities will always be lower than that of the baseline policy. We point out that it is possible to consider a more restrictive transient constraint on the policy cost instead of (11d), which is discussed in detail in Appendix B. The optimal control problem (11) can be formulated as a nonlinear program (NLP) and is subsequently solved using CasADi (Andersson et al., 2019).
Numerical results
For comparison and to implement the constraint (11d), we use the baseline policy from Section 3.1.2 with , , , , which overall is rather cautious and does not exceed the ICU capacity. The corresponding results for the baseline policy and the optimal control strategy can be seen in Fig. 5. Although the optimal control input yields initially (first days) a slightly larger number of infected individuals and thus slightly more fatalities in the first 200 days, the number of infected individuals is subsequently significantly lower and the overall number of fatalities is reduced to only 26%. The optimal controller allows for a smooth increase of the infection rate , while keeping the number of threatened individuals () consistently below the corresponding value of the baseline policy after the first 200 days, thus yielding a small number of fatalities. The rising number of infected individuals () at the end results from the finite-horizon and will be considered later in more detail.
Fig. 5.
Optimal control strategy (blue, solid), baseline policy (red, dashed) and ICU capacity (dotted, black).
We also consider a second baseline policy using , , , , which slightly relaxes the social policy, but also exceeds the ICU capacity. The result is shown in Fig. 6. We can see that in comparison to this second baseline policy the optimal policy significantly reduces the number of fatalities to only 39%. The optimal strategy is a lot more cautious in reducing the social policy, while the baseline is more aggressive and goes back and forth between increasing and decreasing , resulting in significant violations of the ICU capacity. Furthermore, the simple baseline policy results in a second wave as the restrictions are loosened too quickly, while the optimal strategy slowly but steadily increases after the first 200 days, and thereby avoids a second wave.
Fig. 6.
More aggressive baseline: Optimal control strategy (blue, solid), baseline policy (red, dashed) and ICU capacity (dotted, black).
In both examples, a further observation should be highlighted. After an initial phase of containing the outbreak, the measures are slowly but steadily relaxed until a larger release at the end of the horizon. Similar behavior can be observed for many optimal control problems with finite horizons and is commonly referred to as “turnpike” behavior, which goes back to Dorfman, Samuelson, and Solow (1987). An explanation for this is that the consequences of decisions taken at later points in time mainly occur after the end of the horizon, such that a more aggressive policy towards the end is optimal, when only considering the finite two year horizon. Of course, one would not implement the “leaving arc” if the development of a vaccine would not be finalized after two years, since implementing such a policy may lead to an uncontrollable increase of infections towards the end of the time horizon in case that the model is inaccurate and should thus be avoided in practice. In Appendix A, we therefore discuss how adding “terminal constraints” to the optimization problem can prevent this turnpike behavior of the optimal solution, at the price of an increasing number of fatalities.
3.3. Discussion
If we compare the results in Section 3.2 with the consistent full lockdown from Section 3.1.1, we can see that it is possible to appropriately increase without exceeding the ICU capacities, while the consistent full lockdown strategy would require a lockdown that takes approximately a year to be effective. Hence, while a consistent lockdown can effectively minimize the number of deaths, this strategy is only viable in case this lockdown can be prolonged over the corresponding time horizon, unless a vaccine is developed earlier. On the other hand, both the optimal controller and the baseline controller allow for a significant relaxation of the lockdown (on average a doubling of ), without significantly increasing the number of fatalities.
In comparison to the baseline policy suggested in Section 3.1.2, the optimal control policy results in a slower but smooth loosening of the distancing policies. Without increasing the social cost over the full time horizon, this optimal policy avoids any violation of the maximum ICU capacity and hence results in a significantly smaller fatality rate. We point out that the resulting optimal policy of slowly increasing is qualitatively similar to the resulting optimal policy in Kantner and Koprucki (2020), albeit for a different control goal. It can be seen that an initially “stronger” lockdown (i.e., a smaller value of ) over a longer time period with subsequent loosening leads to a better handling of the pandemic, compared to repeated tightening and loosening of distancing measures. Moreover, a smooth and monotone loosening of distancing policies is also desirable from an economic aspect since repeated lockdowns after interim-periods of relaxed distancing guidelines may be even more damaging to the economy, compared to an initially longer lockdown.
Comparing our results with the proposed scenarios by the Helmholtz Association (Helmholtz-Initiative ‘Systemische Epidemiologische Analyse der Covid-19-Epidemie’, 2020), we find that we agree that the goal of herd immunity without overwhelming the health care capacity would require years. Our results further agree with Helmholtz-Initiative ‘Systemische Epidemiologische Analyse der Covid-19-Epidemie’ (2020) that the contact restrictions can only be loosened slowly if the health care capacity must not be overwhelmed. However, since the authors in Helmholtz-Initiative ‘Systemische Epidemiologische Analyse der Covid-19-Epidemie’ (2020) do not consider the availability of a vaccine, their conclusion is keeping or even increasing the lockdown until the number of infected persons is small and all infections can be traced efficiently and effectively via strategically allocated (and increased) testing. With the assumption of a vaccine within the next two years, we argue that a slow and smooth loosening of the lockdown does not lead to many more fatalities while decreasing the social and economic cost significantly according to our model. Concurrently, increased and strategically better allocated testing, e.g., via a contact tracing mobile app (Oliver et al., 2020), is of course highly beneficial and greatly advisable to improve the performance (even if it this was not accounted for in our model).
To summarize, it seems possible to reduce the current restrictions and thus allow to increase without exceeding the ICU capacity. Furthermore, optimized policies can significantly improve the outcome (in terms of fewer deaths and/or less social restrictions). However, the result is highly sensitive w.r.t. the change in the infection rate, while an accurate control of the infection rate (e.g. ) through governmental policies seems difficult/unrealistic. In the next section, we will therefore deal with these issues by formulating a robust control strategy that takes uncertainty in our COVID-19 model into account and uses feedback based on uncertain state information.
4. Optimal feedback control of the COVID-19 outbreak
Section 3.2 shows that an optimal control policy can significantly reduce the number of fatalities compared to a baseline policy that allows for iterative loosening of social distancing measures. This optimal control policy is computed by optimizing over all possible policies to find the one minimizing the number of fatalities predicted by the model Eqs. (1) without using stronger shutdown measures than the baseline. Hence, the policy proposed in Section 3.2 strongly relies on the accuracy of the model identified in Section 2 and thus may fail to effectively control the outbreak in case of a model mismatch. However, such a model mismatch is inevitable in practice, especially since the model itself is a simplification of a much more complex reality and the identification outlined in Section 2.2 strongly depends on the (sparse) available data and the additional prior knowledge based e.g. on further studies concerning COVID-19, which also provide only estimates. In addition, the optimal control policy relies on exact knowledge of all states and on the assumption that values for and can be exactly imposed up to arbitrary precision via social distancing measures, both of which are unrealistic assumptions when applying the policy in practice.
In this section, we show how online measurements can be utilized via feedback to effectively and robustly control the German COVID-19 outbreak in the presence of uncertain parameters. More precisely, we illustrate that the optimal open-loop policy of Section 3.2 may lead to poor performance when applied to validation models with a different set of parameters, although these validation models result from adjusting only one prior assumption in the identification and still fit the past data well. On the other hand, we show that a model predictive control (MPC) feedback strategy, based on repeatedly computing an open-loop policy for the nominal model from Section 2, is inherently robust w.r.t. model inaccuracies and successfully handles the outbreak.
Basic idea
At each time step , where corresponds to weeks and as in Section 3.2, we solve the optimization problem (11) over the time horizon using the current measurements as initial condition at week . Then, we apply the computed optimal policy over one week before solving the problem for the new measurements again. In this way, since the initial conditions in the optimal control problem are updated, a feedback mechanism is included as is standard in MPC (Rawlings, Mayne, & Diehl, 2017). As a result, the prediction horizon of the optimization problem is shrinking with each time step , such that it never exceeds the considered total time horizon of weeks, after which we assume the availability of a vaccine. Hence, since the constraint (11d) needs to hold over the whole time horizon , we replace it by
(12) |
with being the cost of the first baseline policy in Section 3.1.2. As a second modification, we adapt the bound on the social distancing cost online, depending on the predicted states, as is detailed in the following.
Online adaptation of social policy constraint
In Section 3.2, we proposed an open-loop optimal control strategy, where the inputs were the infection rates and . Loosely speaking, the control goal was to achieve a minimum number of fatalities without imposing stronger social distancing measures than a simple baseline policy (compare (11d)). Since this constraint heavily depends on the model to which the baseline is applied, a realistic setting with imperfect model knowledge should allow to adapt the constraints on the policy online in case that the nominal model is overly optimistic or pessimistic. Instead of simply requiring that the cost of the MPC-based feedback cannot exceed , we increase the maximum cost in case that the predicted number of patients requiring intensive care lies above 90% of the maximum capacity at least once during the horizon, and we decrease it in case that the number consistently lies below 10% of . This adaption is natural, as in reality one would increase the efforts to contain the outbreak if the current measures are insufficient, and on the other hand, the population cannot be expected to accept strict measures when there are only few (severe) cases across the country. Therefore, the maximum cost in week , denoted by , varies online with and is initialized as . The amount by which we change online is , where with and as in Section 2.2. If, for instance, the model predicts large numbers of future ICU patients, then the cost bound is increased by the difference between the minimum and the maximum social distancing cost, scaled by the remaining time horizon. This increase corresponds to the social cost of an additional week in full lockdown, scaled by the remaining time horizon via the factor .
MPC-based feedback strategy
The proposed MPC-based feedback strategy is summarized in Algorithm 1. In the algorithm, denotes the number of threatened individuals at time , predicted by the optimal solution of (11) at time . Essentially, the algorithm repeatedly applies the open-loop optimal control policy of Section 3.2 with the key difference that, at time , all past measurements are used in the optimization problem, thus including an online feedback. In addition, in Step 3 of the algorithm, the social policy constraints is adapted as described above.
Validation
To assess the improved robustness of Algorithm 1 compared to open-loop optimal control, we produce two validation models. More precisely, we identify two new sets of parameters A and B by proceeding exactly as in Section 2.2 with the only difference that we change the prior assumption that the stationary ratio of confirmed COVID-19 cases is in the interval . Instead, we assume that this value is in for set A and in for set B. When performing parameter identification based on these modified prior assumptions, we also obtain sets of parameters that can accurately explain the existing past data on COVID-19 cases in Germany. However, the resulting models have different dynamics and different reproduction rates for the same lockdown policy. Increasing the above ratio as in parameter set A decreases the number of infected and undetected individuals resulting in a higher reproduction rate to explain the same amount of confirmed cases. Hence, if an open-loop policy based on the nominal model (i.e., with parameters described in Section 2.2) is applied to the validation model with parameters A, then the number of infections and thus the number of fatalities increases significantly.
To illustrate this effect, we apply the open-loop optimal control policy based on the model with parameters as in Section 2.2 to the new models with parameter sets A and B. The control effort of this policy, i.e., the amount of social distancing, is constrained as in (11d) by the cost of the baseline policy when applied to the nominal model identified in Section 2.2. The results for the model with parameters A can be seen in Fig. 7. Since this validation model has a higher reproduction rate for similar inputs as explained above, the number of fatalities after weeks increases significantly compared to the simulations in Section 3. This is due to the fact that the open-loop policy is only computed once, at the beginning of the time horizon, and is then applied over the whole time span of two years without any online adaption based on new measurements. Therefore, it cannot handle the model mismatch and thus has a significantly worse performance. In addition, Fig. 7 shows the evolution under the proposed MPC-based feedback, which leads to a significantly lower number of fatalities compared to the open-loop policy. We point out that the feedback (partially) compensates the fact that the control action is computed based on the nominal model parameters from Section 2.2, which differ significantly from parameter set A. Due to the larger number of infected individuals, the maximum social cost is increased at multiple time steps, which is indicated by the step-like increases of the input. Finally, an open-loop optimal control policy is computed which is allowed to use the same amount of resources as the MPC-based feedback (in hindsight), i.e., the adapted social and economical cost . While this policy performs better than the initial open-loop policy with fewer resources, it leads to a similar number of fatalities at the end of the horizon compared to the feedback controller. However, the number of threatened patients is very large at time , which would lead to a significant increase in fatalities after the considered time period, even if a vaccine is available.
Fig. 7.
Different policies designed based on the nominal model identified in Section 2.2, applied to the validation model with parameter set A (top) and parameter set B (bottom): Open-loop optimal control strategy (‘OL’, blue dotted), Open-loop optimal control strategy with the same amount of resources as the MPC-based feedback (‘OL-resource’, red dashed), MPC-based feedback strategy as in Algorithm 1 (yellow solid), and ICU capacity (black dotted).
Fig. 7 also shows the same comparison for the model with parameter set B. In this case, since the reproduction rate is lower, the open-loop optimal policy leads to fewer fatalities than in Section 3.2. The MPC-based feedback leads to almost identical performance, but it reduces the cost budget at several time instants, i.e., it can handle the outbreak similarly well but with significantly lower social and economic costs. When restricting the budget of the open-loop optimal policy to the one of the feedback strategy, i.e., , then it leads to a dramatic increase in fatalities.
To conclude, the above discussion reveals that a combination of open-loop optimal control with feedback is inherently robust in the sense that it effectively controls the German COVID-19 outbreak even if the employed model is inaccurate. When comparing the result to an open-loop strategy, then the MPC-based feedback strategy can dramatically decrease the number of fatalities or the necessary amount of social distancing, respectively. Such robustness is an important property for applying any control strategy in a real-world scenario, where accurate model knowledge is rarely available. In the next section, we propose a more systematic robust MPC approach which explicitly takes model inaccuracies as well as uncertain state measurements and control inputs into account in order to safely and cautiously control the COVID-19 pandemic.
5. Robust and optimal feedback control of the COVID-19 outbreak
While the MPC-based feedback policy proposed in Section 4 is significantly more successful in handling the outbreak compared to a simple open-loop policy, it relies on the assumption that exact measurements of the state in (1) are available at each time step. In this section, we consider a more realistic scenario of uncertain measurements in terms of biased state estimates, and we analyze the impact on the closed-loop operation. In particular, we develop a robust MPC-based feedback strategy using interval arithmetic that takes the uncertainty into account during the predictions and thus leads to a safer policy minimizing the number of fatalities.
Biased state measurements
In the following, we consider the case where at each day instead of the true state we only obtain an estimated state , which is subject to an additional bias. In Table 2, we summarize the uncertainties in the states. For individuals in states and , the disease COVID-19 was detected by tests. Hence, their values are well known, nevertheless, we assume that they can slightly differ from the true states by , as there might be cases on the borderline between and that are hard to assign to either of the states. The number of people in ICUs is well documented. However, the state contains not only patients in ICUs () but also other infected members of the risk group (), cf. Section 2.1, such that the uncertainty we use is . We assume that the number of deaths is certain by as it includes some people that died of different causes. As the undetected cases can by definition not be measured, they must be estimated using random sampling or strategies like (Bommer & Vollmer, 2020). Therefore, the states and are much less certain, especially without symptoms (, ). Recovering from the disease is a resulting state from both rather certain states, and , and highly uncertain states, and such that overall it is uncertain itself (). The uncertainty of the state of susceptible persons results from the other states: .
Table 2.
Uncertainties in the states.
It is possible to directly use this biased state estimate in Algorithm 1 and compensate the bias through the inherent robustness in the feedback implementation. In the following, we derive an alternative robust formulation that explicitly considers the uncertainty in the prediction.
Interval predictions
First, given a biased state estimate and known bounds on the bias (Table 2), it is possible to compute interval bounds , such that the true state is guaranteed to lie in that interval, i.e., . The following formulation will predict the interval bounds and instead of using some nominal prediction. This methods is similar to interval arithmetic employed in robust MPC (Limon, Bravo, Alamo, & Camacho, 2005) and the robust moment enclosure for an SEIV epidemic model in Watkins, Nowzari, and Pappas (2019). Using the fact that the system is positive ( and all the parameters are positive), it is possible to derive an interval prediction of the form
which ensures that implies for all , given suitable bounds on the uncertain parameters in the system model (1). The detailed derivation of the interval prediction model (14) can be found in Appendix C (more precisely, Eqs. (C.4)). Since deriving reliable bounds on all parameters in the model (1) is rather difficult or unnecessarily complex, we only focus on the uncertainty associated with the infection rate . In particular, we consider an uncertainty of on the infection rate . Thereby, we explicitly consider the problem that the infection rate cannot be precisely specified via social distancing measures. We will see later in the simulations that, although we do not account for all possible mismatches in the prediction model, we nevertheless obtain the desired properties in closed loop.
Given this interval prediction model, the proposed robust formulation now predicts an interval for the different state variables and minimizes the worst-case number of fatalities based on . The overall procedure is summarized in Algorithm 2.
Numerical results
In the following simulations, we consider the extreme case where the number of estimated infected or previously infected individuals (, , , , , , ) is underestimated. The results for the robust MPC and the nominal MPC in comparison to open-loop optimal control strategies for the two validation parameter sets A and B (compare Section 4) can be seen in Fig. 8. Due to the worst-case prediction in the robust MPC, at the robust MPC already increases the resources two times, for both parameter sets, such that at the predictions satisfy .
Fig. 8.
Different policies designed based on the nominal model identified in Section 2.2, applied to the validation model with parameter set A (top) and parameter set B (bottom) with biased state measurements: Open-loop optimal control strategy (‘OL’, blue dotted), Open-loop optimal control strategy with the same amount of resources as the robust MPC-based feedback (‘OL-resource’, red dashed), nominal MPC-based feedback strategy as in Algorithm 1 with biased estimated state (yellow, dash-dot), robust MPC-based feedback strategy using Algorithm 2 (purple, solid), and ICU capacity (black dotted).
In the simulation with the model based on parameter set A, we can directly see that both the nominal MPC and the robust MPC reduce the number of fatalities compared to an open-loop optimal control strategy. Furthermore, if we compare the robust MPC and the nominal MPC, we can see that after days the nominal MPC implementation realizes that the spread is worse than initially assumed. This leads to a strong increase in social measures at . With the robust formulation, decreases almost monotonically. Furthermore, the nominal implementation results in twice the number of fatalities, while the applied resources over the two year horizon differ by less than , which corresponds to one week of lockdown. This indicates that the robust MPC, planning cautiously from the beginning, exploits its resources more efficiently by imposing stricter social distancing measures early on, which results to be beneficial in the long run.
For the second parameter set B, we can see that the worst-case robust formulation uses initially an unnecessarily high control effort. Nevertheless, overall the applied resources differ by less than compared to the nominal MPC, while the number of fatalities is still reduced by 33%. In comparison to the open-loop policies, both MPC policies require less or equally restrictive policy measures , while the number of fatalities are significantly reduced.
6. Conclusions and discussion
In the following, we summarize our findings on a high-level and highlight the main take-away messages:
-
•
Our results in Section 3 confirm the conclusions in Meyer-Hermann et al. (2020) that neither eradication of the virus nor herd immunity without the availability of a vaccine are viable solutions to handle the current COVID-19 outbreak.
-
•
Applying an optimizer to the mathematical model describing the outbreak, one can significantly reduce the number of fatalities without increasing the costs associated to decreasing the infection rate (social distancing policies, closing schools, etc.), compare Section 3.2.
-
•
Since the proposed model can never exactly predict the COVID-19 pandemic, applying a nominal optimal policy introduces unnecessary conservatism, at best, up to posing a great danger (i.e. overwhelming the health care capacities risking high mortality rates). Therefore, our findings in Section 4 support (Meyer-Hermann et al., 2020) by showing that any policy to control the COVID-19 outbreak successfully has to be an adaptive strategy. This means we need to constantly measure, monitor and estimate the current numbers and adapt our policy accordingly, i.e., feedback is necessary for reliably handling the outbreak.
-
•
If we already a priori take into account that our model includes mismatches and that all measured and estimated numbers are not exact and can have a bias, we can further improve the outcome, as shown in Section 5. More specifically, we developed a robust MPC-based feedback strategy using interval arithmetic. The application of feedback without the robust description of the considered model can lead to intermediate increases in the number of new infections necessitating another period of lockdown. On the contrary, a robust feedback strategy can take these model mismatches and other uncertainties into account and is hence able to avoid such behavior, thus significantly reducing the number of fatalities.
-
•
When looking at the qualitative results the robust MPC-based feedback offers, one can see that, accounting for the instability and uncertainty of the spread of the virus, the controller suggests a rather strict policy at the beginning and only then allows for a gradual increase in the infection rate. Keeping this loosening slow at the beginning shows a beneficial effect in the long run. This qualitative result of the robust MPC underpins also the German policy and reaction to the outbreak of COVID-19 in Germany where initially strong measures (what we here refer to as lockdown) were applied. Only very recently the German government started to loosen these measures slowly and gradually.
There are also influences on the course of the outbreak that were not taken into account in the present paper but which are important in an overall strategy towards the spread of COVID-19 (e.g. increasing testing capacities, tracking of infections, as well as investigating which measures lead to the desired infection rate). However, controlling the infection rate is certainly one of the key factors and hence, this paper contributes towards mitigating the spread of COVID-19 under manageable societal and economic costs. We hope that the proposed feedback strategies inspire further investigations in this direction and offer qualitative and high-level insights that underpin the current policies or strategy papers.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
This work was supported by Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Grants GRK 2198/1 - 277536708, AL 316/12-2 - 279734922. The authors thank the International Max Planck Research School for Intelligent Systems (IMPRS-IS) for supporting Lukas Schwenkel, Anne Koch, Julian Berberich, and Patricia Pauli.
There was no reliable data available from before March 23; the data from March 23 to March 26 contains only information about , not or ; from April 5 to April 7, there is a gap in the data due to a server migration of the DIVI Intensivregister (DIVI-IntensivRegister, 2020).
Appendix A. Optimal control formulation using terminal constraints
In order to avoid artifacts of considering a finite-horizon problem (e.g. a lot of infected people at the end of the horizon), an alternative to considering the modified cost function F from (10) is the inclusion of additional terminal constraints for the contagious population . In particular, we require that at the end of the control horizon , the number of contagious individuals in each category () should be smaller than the corresponding number from the baseline policy (c.f. (A.1a)). In addition, at the end of the horizon the number of contagious individuals should be non-increasing, which is implemented as (A.1b).
Hence, in the following, we replace the cost by the number of fatalities , and we add the following constraints to the optimal control problem (11) from Section 3.2:
(A.1a) |
(A.1b) |
Again, the index in (11) corresponds to weeks and the states correspond to the result of simulating the system (1) with the parameters and initial condition from Section 2. These terminal conditions (A.1a)–(A.1b) (which should be interpreted element-wise) ensure that the final state after the finite horizon is “better” than the baseline solution (c.f. (A.1a)) and the outbreak can be contained (c.f. (A.1b)). The simulation results with the two baseline policies shown in Fig. A.9, Fig. A.10 demonstrate that the terminal constraints indeed effectively prevent the turnpike behavior. However, the additional constraints also lead to a slight increase in the number of fatalities.
Fig. A.9.
Cautious baseline: Optimal control strategy with terminal constraints (blue, solid), baseline policy (red, dashed) and ICU capacity (dotted, black).
Fig. A.10.
Aggressive baseline: Optimal control strategy with terminal constraints (blue, solid), baseline policy (red, dashed) and ICU capacity (dotted, black).
Appendix B. Alternative average constraint formulation
We wish to briefly mention a stronger restriction on the societal cost of the optimal control strategy. In particular, instead of only restricting the cost over the considered horizon of weeks, a stronger property is to ensure that at any time , the previously accumulated policy cost is smaller than the corresponding cost of the baseline policy. This can be done by replacing condition (11d) with the following transient constraint:
(B.1) |
(B.2) |
The corresponding results for both baselines considered in Section 3.2 can be seen in Fig. B.11. In this case the number of fatalities are reduced by 33% and 37%, respectively. Thus, also for this more restrictive setting, the optimal controller can significantly reduce the number of fatalities. In addition, in the comparison to the more aggressive baseline we also see that early measures are absolutely crucial, since the two policies are essentially equivalent in the period from 110 days–150 days, i.e., including the critical time period where the ICU capacity is exceeded, but differ significantly over 30 weeks prior to the violation of the ICU capacity.
Fig. B.11.
Transient constraint on social policy: Top: cautious baseline; Bottom: aggressive baseline. Optimal control strategy (blue, solid), baseline policy (red, dashed) and ICU capacity (dotted, black).
Appendix C. Interval predictions
In the following, we derive the dynamics of the interval predictions used in (14). Note that the following property holds for any scalars , :
(C.1) |
if . Furthermore, to ensure that given , we only require
(C.2) |
and similarly
(C.3) |
for all . Essentially, we use the property (C.1) together with the fact that and the parameters are positive to ensure that (C.2), (C.3) hold in order to derive the differential equations for the interval. More precisely, assuming that every parameter is uncertain in some bounds (e.g. ) yields the following ODEs:
(C.4a) |
(C.4b) |
(C.4c) |
(C.4d) |
(C.4e) |
(C.4f) |
(C.4g) |
(C.4h) |
(C.4i) |
(C.4j) |
(C.4k) |
(C.4l) |
(C.4m) |
(C.4n) |
(C.4o) |
(C.4p) |
Since these dynamics only correspond to possibly conservative overapproximations, we can use to possibly improve the resulting bounds for using the following projections: and . In principle it would also be possible to directly set using the other states instead of simulating (C.4a)–(C.4b), but this may not necessarily ensure and .
References
- Alleman T., Torfs E., Nopens I. 2020. COVID-19: from model prediction to model predictive control. Available online https://biomath.ugent.be/sites/default/files/2020-04/Alleman_etal_v2.pdf. (Accessed 30 April 2020) [Google Scholar]
- an der Heiden M., Hamouda O. Schätzung der aktuellen Entwicklung der SARS-CoV-2-Epidemie in Deutschland - Nowcasting. Epidemiologisches Bulletin. 2020;2020(17):10–15. [Google Scholar]
- Andersson J.A., Gillis J., Horn G., Rawlings J.B., Diehl M. CasADi: a software framework for nonlinear optimization and optimal control. Mathematical Programming Computation. 2019;11(1):1–36. [Google Scholar]
- Barbarossa M.V., Fuhrmann J., Heidecke J., Varma H.V., Castelletti N., Meinke J.H., et al. 2020. A first study on the impact of current and future control measures on the spread of COVID-19 in Germany. medRxiv preprint. [DOI] [Google Scholar]
- Bin M., Cheung P., Crisostomi E., Ferraro P., Myant C., Parisini T., et al. 2020. On fast multi-shot epidemic interventions for post lock-down mitigation: Implications for simple COVID-19 models. arXiv preprint arXiv:2003.09930. [Google Scholar]
- Bommer C., Vollmer S. 2020. Average detection rate of SARS-CoV-2 infections is estimated around nine percent. Available online https://www.uni-goettingen.de/en/606540.html. (Accessed 15 April 2020) [Google Scholar]
- Casella F. Can the COVID-19 epidemic be controlled on the basis of daily test reports? IEEE Control Systems Letters. 2021;5(3):1079–1084. [Google Scholar]
- COVID-19 Surveillance Group F. 2020. Characteristics of COVID-19 patients dying in Italy, Report based on available data on March 30th, 2020. Available online: https://www.epicentro.iss.it/coronavirus/bollettino/Report-COVID-2019_30_marzo_eng.pdf. (Accessed 16 April 2020) [Google Scholar]
- Dehning J., Zierenberg J., Spitzner F.P., Wibral M., Neto J.P., Wilczek M., et al. 2020. Inferring COVID-19 spreading rates and potential change points for case number forecasts. arXiv preprint arXiv:2004.01105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- DIVI-IntensivRegister J. 2020. Tagesreport 27.03.2020 – 21.04.2020. Available online: https://www.divi.de/divi-intensivregister-tagesreport-archiv. [Google Scholar]
- Djidjou-Demasse R., Michalakis Y., Choisy M., Sofonea M.T., Alizon S. 2020. Optimal COVID-19 epidemic control until vaccine deployment. medRxiv preprint. [DOI] [Google Scholar]
- Dong E., Du H., Gardner L. 2020. 2019 Novel Coronavirus COVID-19 (2019-nCoV) data repository by johns hopkins CSSE. https://github.com/CSSEGISandData/COVID-19. (Accessed 22 April 2020) [Google Scholar]
- Dong E., Du H., Gardner L. An interactive web-based dashboard to track COVID-19 in real time. The Lancet Infectious Diseases. 2020;20(5):533–534. doi: 10.1016/S1473-3099(20)30120-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dorfman R., Samuelson P.A., Solow R.M. Courier Corporation; 1987. Linear programming and economic analysis. [Google Scholar]
- German R., Djanatliev A., Maile L., Bazan P., Hackstein H. 2020. Modeling exit strategies from COVID-19 lockdown with a focus on antibody tests. medRxiv preprint. [DOI] [Google Scholar]
- Giordano G., Blanchini F., Bruno R., Colaneri P., Filippo A.D., Matteo A.D., et al. Modelling the COVID-19 epidemic and implementation of population-wide interventions in Italy. Nature Medicine. 2020;26:855–860. doi: 10.1038/s41591-020-0883-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Grasselli G., Pesenti A., Cecconi M. Critical care utilization for the COVID-19 outbreak in Lombardy, Italy: early experience and forecast during an emergency response. Journal of the American Medical Association. 2020;323(16):1545–1546. doi: 10.1001/jama.2020.4031. [DOI] [PubMed] [Google Scholar]
- Gudbjartsson D.F., Helgason A., Jonsson H., Magnusson O.T., Melsted P., Norddahl G.L., et al. Spread of SARS-CoV-2 in the icelandic population. New England Journal of Medicine. 2020;382:2302–2315. doi: 10.1056/NEJMoa2006100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- He X., Lau E.H.Y., Wu P., Deng X., Wang J., Hao X., Lau Y.C., et al. Temporal dynamics in viral shedding and transmissibility of COVID-19. Nature Medicine. 2020;26(5):672–675. doi: 10.1038/s41591-020-0869-5. [DOI] [PubMed] [Google Scholar]
- Helmholtz-Initiative ‘Systemische Epidemiologische Analyse der Covid-19-Epidemie’ X. 2020. Stellungnahme der Helmholtz-Initiative ’Systemische Epidemiologische Analyse der COVID-19-Epidemie’. https://www.helmholtz.de/fileadmin/user_upload/01_forschung/Helmholtz-COVID-19-Papier_02.pdf. (Accessed 15 April 2020) [Google Scholar]
- IHME COVID-19 health service utilization forecasting team, C. J. L. Murray X. 2020. Forecasting COVID-19 impact on hospital bed-days, ICU-days, ventilator-days and deaths by US state in the next 4 months. medRxiv preprint . [DOI] [Google Scholar]
- Jakob Simmank J.W. 2020. So testet Deutschland. ZEIT online, https://www.zeit.de/wissen/gesundheit/2020-03/coronatests-deutschland-coronavirus-covid-19-who-pandemie#wer-soll-in-deutschland-getestet-werden. (Accessed 15 April 2020) [Google Scholar]
- Ji Y., Ma Z., Peppelenbosch M.P., Pan Q. Potential association between COVID-19 mortality and health-care resource availability. The Lancet Global Health. 2020;8(4) doi: 10.1016/S2214-109X(20)30068-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kantner M., Koprucki T. Beyond just ”flattening the curve”: Optimal control of epidemics with purely non-pharmaceutical interventions. Journal of Mathematics in Industry. 2020;10(1):1–23. doi: 10.1186/s13362-020-00091-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kermack W., McKendrick A.G. A contribution to the mathematical theory of epidemics. Proceedings of the Royal Society of London. 1927;(115):700–721. [Google Scholar]
- Kissler S., Tedijanto C., Lipsitch M., Grad Y.H. 2020. Social distancing strategies for curbing the COVID-19 epidemic. medRxiv preprint . [DOI] [Google Scholar]
- Köhler, J., Enyioha, C., & Allgöwer, F. (2018). Dynamic resource allocation to control epidemic outbreaks - A model predictive control approach. In Proc. Annual american control conference (ACC) (pp. 1546–1551).
- Lavezzo E., Franchin E., Ciavarella C., Cuomo-Dannenburg G., Barzon L., Del Vecchio C., et al. Suppression of a SARS-CoV-2 outbreak in the Italian municipality of Vo’. Nature. 2020;584(7821):425–429. doi: 10.1038/s41586-020-2488-1. [DOI] [PubMed] [Google Scholar]
- Leopoldina E. 2020. Dritte Ad-hoc-Stellungnahme: Coronavirus-Pandemie - Die Krise nachhaltig überwinden. Available online https://www.leopoldina.org/uploads/tx_leopublication/2020_04_13_Coronavirus-Pandemie-Die_Krise_nachhaltig_%C3%BCberwinden_final.pdf. (Accessed 15 April 2020) [Google Scholar]
- Li Q., Guan X., Wu P., Wang X., Zhou L., Tong Y., et al. Early transmission dynamics in wuhan, China, of novel coronavirus–infected pneumonia. New England Journal of Medicine. 2020;382(13):1199–1207. doi: 10.1056/NEJMoa2001316. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Limon D., Bravo J., Alamo T., Camacho E. Robust MPC of constrained nonlinear systems based on interval arithmetic. IEE Proceedings D (Control Theory and Applications) 2005;152(3):325–332. [Google Scholar]
- Linton N.M., Kobayashi T., Yang Y., Hayashi K., Akhmetzhanov A.R., mok Jung S., et al. Incubation period and other epidemiological characteristics of 2019 novel coronavirus infections with right truncation: A statistical analysis of publicly available case data. Journal of Clinical Medicine. 2020;9(2):538. doi: 10.3390/jcm9020538. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maharaj S., Kleczkowski A. Controlling epidemic spread by social distancing: Do it well or not at all. BMC Public Health. 2012;12(679):1–16. doi: 10.1186/1471-2458-12-679. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maier B.F., Brockmann D. Effective containment explains subexponential growth in recent confirmed COVID-19 cases in China. Science. 2020;368(6492):742–746. doi: 10.1126/science.abb4557. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Meyer-Hermann M., Pigeot I., Priesemann V., Schöbel A. 2020. Adaptive Strategien zur Eindämmung der COVID-19-Epidemie. https://www.helmholtz.de/fileadmin/user_upload/01_forschung/28-04-2020_Strategien_zur_Eindaemmung.pdf. (Accessed 29 April 2020) [Google Scholar]
- Mizumoto K., Kagaya K., Zarebski A., Chowell G. Estimating the asymptomatic proportion of coronavirus disease 2019 (COVID-19) cases on board the Diamond Princess cruise ship, Yokohama, Japan. Eurosurveillance. 2020;25(10) doi: 10.2807/1560-7917.ES.2020.25.10.2000180. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nowzari C., Preciado V.M., Pappas G.J. Analysis and control of epidemics. IEEE Control Systems Magazine. 2016;36:26–46. [Google Scholar]
- Oliver N., Letouzé E., Sterly H., Delataille S., Nadai M.D., Lepri B., et al. 2020. Mobile phone data and COVID-19: Missing an opportunity? arXiv preprint arXiv:2003.12347. [Google Scholar]
- Preciado V.M., Zargham M., Enyioha C., Jadbabaie A., Pappas G.J. Optimal resource allocation for network protection against spreading processes. IEEE Transactions on Control of Network Systems. 2014;1(1):99–108. [Google Scholar]
- Rawlings J.B., Mayne D.Q., Diehl M. Nob Hill Publishing; 2017. Model predictive control: Theory, computation, and design. [Google Scholar]
- Robert Koch Institut J.B. 2017. Epidemiologisches Bulletin Nr.27; p. 243. Available online https://www.rki.de/DE/Content/Infekt/EpidBull/Archiv/2017/Ausgaben/27_17.pdf?__blob=publicationFile. [Google Scholar]
- Tsay C., Lejarza F., Stadtherr M.A., Baldea M. 2020. Modeling, state estimation, and optimal control for the US COVID-19 outbreak. arXiv preprint arXiv:2004.06291. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Verity R., et al. Estimates of the severity of coronavirus disease 2019: a model-based analysis. The Lancet Infectious Diseases. 2020;20(6):669–677. doi: 10.1016/S1473-3099(20)30243-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang D., Hu B., Hu C., Zhu F., Liu X., Zhang J., et al. Clinical characteristics of 138 hospitalized patients with 2019 Novel Coronavirus–infected Pneumonia in Wuhan, China. Journal of the American Medical Association. 2020;323(11):1061–1069. doi: 10.1001/jama.2020.1585. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Watkins N.J., Nowzari C., Pappas G.J. Robust economic model predictive control of continuous-time epidemic processes. IEEE Transactions on Automatic Control. 2019;65:1116–1131. [Google Scholar]
- World Health Organization (WHO) N.J. 2020. Report of the WHO-China Joint Mission on Coronavirus Disease 2019 (COVID-19) Available online https://www.who.int/docs/default-source/coronaviruse/who-china-joint-mission-on-covid-19-final-report.pdf. (Accessed 15 April 2020) [Google Scholar]
- Yang X., Yu Y., Xu J., Shu H., Xia J., Liu H., et al. Clinical course and outcomes of critically ill patients with SARS-CoV-2 pneumonia in wuhan, China: a single-centered, retrospective, observational study. The Lancet Respiratory Medicine. 2020;8:475–481. doi: 10.1016/S2213-2600(20)30079-5. [DOI] [PMC free article] [PubMed] [Google Scholar]