Stochastic Modeling and Forecasting of Covid-19 Deaths: Analysis for the Fifty States in the United States

Olusegun Michael Otunuga; Oluwaseun Otunuga

doi:10.1007/s10441-022-09449-z

. 2022 Sep 16;70(4):25. doi: 10.1007/s10441-022-09449-z

Stochastic Modeling and Forecasting of Covid-19 Deaths: Analysis for the Fifty States in the United States

Olusegun Michael Otunuga ^1,^✉, Oluwaseun Otunuga ²

PMCID: PMC9483371 PMID: 36112233

Abstract

In this work, we study and analyze the aggregate death counts of COVID-19 reported by the United States Centers for Disease Control and Prevention (CDC) for the fifty states in the United States. To do this, we derive a stochastic model describing the cumulative number of deaths reported daily by CDC from the first time Covid-19 death is recorded to June 20, 2021 in the United States, and provide a forecast for the death cases. The stochastic model derived in this work performs better than existing deterministic logistic models because it is able to capture irregularities in the sample path of the aggregate death counts. The probability distribution of the aggregate death counts is derived, analyzed, and used to estimate the count’s per capita initial growth rate, carrying capacity, and the expected value for each given day as at the time this research is conducted. Using this distribution, we estimate the expected first passage time when the aggregate death count is slowing down. Our result shows that the expected aggregate death count is slowing down in all states as at the time this analysis is conducted (June 2021). A formula for predicting the end of Covid-19 deaths is derived. The daily expected death count for each states is plotted as a function of time. The probability density function for the current day, together with the forecast and its confidence interval for the next four days, and the root mean square error for our simulation results are estimated.

Keywords: Covid-19, Stochastic differential equation, Probability density function, Forecast, Aggregate death, First passage time

Introduction

Several mathematical models (Wu et al. 2020; Stutt et al. 2020; Linka et al. 2020; Okuonghae and Omame 2020; Ndairou et al. 2020; Ladde et al. 2020; Otunuga 2020; Mummert and Otunuga 2019; Otunuga 2018; Santosh 2020) have been developed to study the transmission of the COVID-19 virus caused by the virus species ”severe acute respiratory syndrome-related corona virus”, named SARS-CoV-2. The airborne transmission occurs by inhaling droplets loaded with SARS-CoV-2 particles that are expelled by infectious people. According to Wu et al. (2020), the ”severe acute respiratory syndrome coronavirus” (SARS-CoV) and the ”Middle East respiratory syndrome coronavirus” (MERS-CoV) are two other novel coronaviruses that emerged as major global health threats since 2002. Several public health interventions have been put in place to eradicate or reduce the spread of the disease. According to CDC, the first U.S. laboratory-confirmed case1 of COVID-19 in the U.S. was recorded on January 20, 2020 from the samples taken 2 days earlier in Washington state. The first COVID-19 death in the United States was first reported in the same state by CDC on February 29, 2020. As of June 24, 2021, the total number of Covid-19 cases in the United States was reported by CDC to be 33, 437, 643, resulting in about 601, 221 deaths2. On December 11 and December 18, 2020, the United States Food and Drug Administration (FDA)3 issued an Emergency Use Authorization (EUA) for the Pfizer-BioNTech and the Moderna COVID-19 vaccine, respectively, in the United States. An EUA for the third vaccine, the Johnson and Johnson (J &J) vaccine, was first issued in the United States on February 27, 20214. A pause on the usage of the J &J vaccine was recommended by CDC and the FDA on April 13, 2021 due to the serious blood clots (a condition called thrombosis) in six women between the ages of 18 and 49 years with thrombocytopenia syndrome (TTS)5 following the usage of the vaccine. As of June 7, 2021, about $51.5 %$ of the total population of the United States have received at least one dose of the vaccination, with $41.9 %$ fully vaccinated. The study done in this work is to check the effects of these interventions. That is, with these vaccines, we check if the number of deaths resulting from the Covid-19 disease slows down as time proceeds and as the number of those who are vaccinated increases.

The trajectory of the aggregate death counts in most states in the United States follows the same dynamics. At first, it follows a somewhat exponential trajectory, with its growth slowing down at some point and speeding up at other points. With public health interventions like vaccination mitigating the growth of the virus, this pattern is expected to continue until a certain steady-state is reached. This dynamic follows roughly the well known Verhulst logistic equation and its generalization (Beddington and May 1977; Li and Wang 2010; Prajneshu 1980; Pelinovsky et al. 2020; Pella and Tomlinson 1969; Wang et al. 2020). The model was first derived by Verhulst (1838) to study population growth. Some other methods (Baud et al. 2020; Bhapkar et al. 2020; Satpathy et al. 2021; Kaciroti et al. 2021) have been developed to estimate mortality following the Covid-19 infection. In this work, we consider the logistic model

\begin{matrix} d N = \frac{\bar{μ}}{K} (K - N) N d t, N (t_{0}) = N_{0}, \end{matrix}

1.1

where $t_{0} \geq 0$ , $N_{0} > 0$ , N(t) denotes the total number of Covid-19 death counts at time t, $K$ is the maximum number of Covid-19 deaths, and $\bar{μ}$ is the per capita initial growth rate, to interpret the aggregate number of COVID-19 death trajectories in the United States. We see from (1.1) that $\frac{dN}{dt} > 0$ on $(0, K)$ and $\frac{d^{2} N}{d t^{2}} = \frac{{\bar{μ}}^{2}}{K^{2}} (K - N) (K - 2 N) N$ . From this, we have $\frac{d^{2} N}{d t^{2}} > 0$ on the interval $(0, K / 2)$ , and $\frac{d^{2} N}{d t^{2}} < 0$ on $(K / 2, K)$ . That is, the trajectory of the aggregate number speeds up in the interval $(0, K / 2)$ and slows down in the interval $(K / 2, K)$ . Also, since N(t) represents the aggregate number of deaths at a given time t, it follows that the daily number of deaths is at the maximum at the time when $N (t) = K / 2$ .

From this analysis, if the current day aggregate death counts in time series is more than $K / 2$ , then we know that its growth is slowing down and the virus spread has been controlled. Otherwise, the virus is spreading, with speeding growth. The problem with using this model to analyze the aggregate number of cases is that it fails to account for the fluctuations or perturbations in the data resulting from fluctuations in the rates of infection or death. These noises/fluctuations can be caused by many factors like the rates at which Covid-19 testing is done, vaccination rates, mask use per capital, social behavior, public health intervention (Linka et al. 2020), and so on. In addition, CDC6 reported on their websites that counting exact aggregate confirmed and probable COVID-19 cases and deaths is not possible due to delays in reporting from different voluntary jurisdictions. The number of death cases reported on CDC’s website might not be complete because it takes several weeks for death records to be processed, coded, submitted, and tabulated on the National Center for Health Statistics (NCHS). As a result, these cause the counts to fluctuate substantially, with a possibility of a negative number of probable cases reported on a given day if more probable cases were disproven than were initially reported on that day.

Several authors (Lv et al. 2019; Li and Wang 2010; Lungu and Øksendal 1997; Yang et al. 2019; Gardiner 1985) have worked on model (1.1) and its extension to a stochastic model. In this work, we derive a stochastic model governing the aggregate number of Covid-19 death counts in the United States by extending model (1.1) to a stochastic case. The proposed model is better in the sense that it captures fluctuations in the aggregate counts better than the widely used logistic model (1.1) with lesser root mean square error. Our aim in this study is to determine whether the virus infection and death counts resulting from the infection are still growing sharply or slowing down. Upon analyzing the COVID-19 data collected from CDC, we see that the path of the aggregate number of Covid-19 death counts over time follows a logistic model with irregular trajectories. We assume this fluctuation is caused by many factors such as listed above, causing the per capita growth rate to fluctuate over time. We account for the fluctuations in this rate by extending the deterministic logistic model (1.1) to a stochastic model. This is done by assuming the parameter $\bar{μ}$ is not constant over time, but fluctuates about a particular mean value. In order to estimate the epidemiological parameters $K$ (the carrying capacity of Covid-19 death counts), $\bar{μ}$ (the per capita initial growth rate), and the death rate noise intensity, we derive the transition probability density function for the aggregate number of death counts and apply a Maximum Likelihood Estimate (MLE) scheme. The distribution is also used to calculate the expected aggregate count at a particular period in time. Using the parameter estimates for the fitted data, we forecast the cumulative number of Covid-19 deaths and provide a $95 %$ confidence interval for the forecast.

The organization of the work done is as follows: In Sect. 2, we derive a stochastic model describing the cumulative number of deaths by assuming $\bar{μ}$ is not constant, but changes with time and fluctuates around a mean value. We show that the model is well defined, with a unique closed-form solution. In Sect. 3, the transition probability density function for the aggregate number of death counts is derived. Using the MLE scheme, we estimate the epidemiological parameters in the model. Since the aggregate count N(t) is random, we calculate the expected number of Covid-19 aggregate death counts for each states in the United States. We show that with probability one the aggregate count remains in the interval $(0, K)$ if it starts from there. In Sect. 4, the expected first hitting time when the aggregate count N(t) reaches $K - ϵ$ is calculated for some small positive constant $ϵ > 0$ . We also estimate the expected first passage time when the aggregate death counts started slowing down. Numerical simulations, forecast, and analysis of the aggregate death counts for the fifty states in the United States are carried out in Sect. 5. The summary of the work done is given in Sect. 6.

Methodology

Data Sources

The Covid-19 aggregate death counts in the United States is collected from the United States Centers for Disease Control and Prevention (CDC) website and provided by the CDC Case Task Force7. The data was collected for the period ranging from January 22, 2020 to June 24, 2021, and it includes the date of counts, state/jurisdiction, total/aggregate number of death cases (including total confirmed and probable deaths), number of new and new probable deaths with the date and time the records were created. The definition of each of these can be found on the CDC’s website $^{7}$ .

Modeling the Covid-19 Aggregate Death Counts

In this section, we describe the dynamics of the aggregate number of deaths in the United States by extending the well-known deterministic logistic model (1.1) to a stochastic differential equation. Analysis of the data (see Figs. 1, 2) shows that its growth rate fluctuates with time. Some works (Lagarto and Braumann 2014; Mazzuco et al. 2018; Zocchetti and Consonni 1994) have been done in analyzing the distribution of mortality rate. The dynamics of male and female crude death rates of the Portuguese population over the period 1940–2009 were modeled in the work of Lagarto and Braumann (2014) using a bi-dimensional stochastic Gompertz model with correlated Wiener processes. Zocchetti and Consonni (1994) showed in their work that when the number of deaths is sufficiently elevated, the Gauss distribution (also referred to as the normal distribution) can be used as a good approximation distribution for the variability in the mortality rate. In their work, Mazzuco et al. (2018) derived a new model for mortality rate based on the mixture of a half-normal distribution with a generalization of the skew-normal distribution. The Wiener process (Khasminskii 2012; Kloeden and Platen 1995; Mao 2007; Øksendal 2003), often called Brownian motion, is a real-valued continuous-time stochastic process ${W (t) : t \geq 0}$ defined on a probability space $(Ω, F, P)$ with stationary independent Gaussian increments such that $W (0) = 0$ with probability one, $W (t + Δ t) - W (t)$ is normally distributed with mean 0 and variance $Δ t$ , and $W (t) - W (s)$ is independent of the past random variable W(u), $0 \leq u \leq s$ . In addition, the random variables ${W (t_{j}) - W (s_{j}), j = 1, 2, \dots, n}$ are jointly independent, for $0 \leq s_{1} < t_{1} \leq s_{2} < t_{2} \dots \leq s_{n} < t_{n} < \infty$ . The independence and stationarity of the increment, together with the continuous (almost everywhere) sample path of the process W(t) lead to its great tractability, making it one of the most important stochastic process in continuous time used in biological systems to model perturbed epidemiological parameters. Following a similar assumption made in the work of Prajneshu (1980), Yang et al. (2019), and Otunuga (2021b), where a logistic stochastic population model with the population’s transition probability density function were derived to study the distribution of a population subjected to a continuous spectrum of disturbances, with fluctuations in the intrinsic growth rate, we assume the dynamics of $\bar{μ}$ shouldn’t be constant over time, but instead be driven by fluctuations that can be modeled to follow a process of the form

\begin{matrix} \bar{μ} d t = μ d t + σ \circ d W (t), \end{matrix}

2.1

where W(t) is a standard Wiener process, $μ$ is the average death counts per capita initial growth rate, $σ$ is the noise intensity, and $\circ$ is the Stratonovich integral symbol (Arnold 1974). We use the Stratonovich calculus instead of the Itô calculus to describe this dynamic simply because it obeys the traditional rule of chain rule and allows white noise to be treated as a regular derivative of a Brownian or Wiener process (West et al. 1979; Wong and Zakai 1965). For more reading on the Itô and Stratonovich calculus, we direct readers to the work of Kloeden and Platen (1995) and Øksendal (2003). Substituting into (1.1), the proposed stochastic differential equation (SDE) for the aggregate number of deaths is given by

\begin{matrix} d N = \frac{μ}{K} (K - N) N d t + \frac{σ}{K} (K - N) N \circ d W (t), N (t_{0}) = N_{0}, \end{matrix}

2.2

where $N_{0} > 0$ , $K$ is the carrying death capacity, $μ$ and $σ$ are as described in (2.1). We note here that the stochasticity added in model (2.2) leads to a non-monotonic cumulative death sample path, a property that makes it able to capture irregularities in the sample path better than the deterministic counterpart, with a smaller root mean square error. The interpretation of the stochastic differential equation as a Stratonovich differential equation follows the work of Otunuga (2019, 2021a). We shall later compare model (2.2) with its deterministic equivalent (1.1) and show that model (2.2) performs better in capturing the trajectory of (and the noise in) the aggregate death counts. We convert (2.2) into a Itô stochastic differential equation as

\begin{matrix} d N = (\frac{μ}{K} (K - N) N + \frac{σ^{2}}{2 K^{2}} (K - N) (K - 2 N) N) d t + \frac{σ}{K} (K - N) N d W (t), N (t_{0}) = N_{0} . \end{matrix}

2.3

In order to show that model (2.3) is biologically feasible, we show in Theorem 2 (using Corollary 3.1 of Khasminskii (2012)) that the solution N(t) of (2.3) exists, and it remains in $(0, K)$ with probability one whenever it starts from there. A statement of the corollary, together with definition of some terminologies in the corollary are given below.

Fig. 1 — Real and simulated aggregate counts of COVID-19 death counts for the states: AR, AZ, CO, FL, GA, IA, IN, KS, KY, MA, MD, ME, MN, MT, NC, NE in the United States

Fig. 2 — Real and simulated aggregate counts of COVID-19 death counts for the states: NM, NV, OK, OR, TN, UT, WI, WY in the United States

Definition 1

L-operator.

Given a one-dimensional stochastic differential equation

\begin{matrix} d x = f (t, x) d t + g (t, x) d W (t), x (t_{0}) = x_{0}, \end{matrix}

2.4

we define the L-operator (Mao 2007) associated with (2.4) as

\begin{matrix} L = \frac{\partial}{\partial t} + f (t, x) \frac{\partial}{\partial x} + \frac{1}{2} g^{2} (t, x) \frac{\partial^{2}}{\partial x^{2}} . \end{matrix}

2.5

If L acts on a nonnegative function V(t, x) which is continuously differentiable with respect to t and twice continuously differentiable with respect to x, then

\begin{matrix} L V (t, x) = \frac{\partial V (t, x)}{\partial t} + f (t, x) \frac{\partial V (t, x)}{\partial x} + \frac{1}{2} g^{2} (t, x) \frac{\partial^{2} V (t, x)}{\partial x^{2}} . \end{matrix}

The usefulness of the expression for LV(t, x) is seen in the Itô Lemma (a Lemma which gives the formula for the stochastic analogue of the chain rule or change of variable rule in calculus), which simply states that the stochastic differential of V(t, x) is given by

\begin{matrix} d V (t, x) = L V (t, x) d t + g (t, x) \frac{\partial V (t, x)}{\partial x} d W (t) . \end{matrix}

2.6

We see here that LV(t, x) is the drift part of the differential dV(t, x).

Theorem 1

(From Corollary 3.1 of Khasminskii (2012))

Let $D_{n}$ be an increasing sequence of open sets whose closure are contained in an open set D such that $⋃ D_{n} = D$ . Suppose that the drift and diffusion coefficients f(t, x) and g(t, x), respectively, in (2.4) satisfy the Lipschitz and linear growth conditions in $(0, \infty) \times D_{n}$ and there exists a function V(t, x), twice continuously differentiable in x and continuously differentiable in t in the domain $(0, \infty) \times D$ , which satisfies

\begin{matrix} L V \leq & c V, \\ inf_{t > 0, x \in D \ D_{n}} V (t, x) \to & \infty a s n \to \infty, \end{matrix}

for some positive constant c. Then for every random variable $x (t_{0})$ independent of $W (t) - W (t_{0})$ , there exists a solution x(t) of (2.4) which is an almost surely continuous stochastic process and is unique up to equivalence8 provided that $P (x (t_{0}) \in D) = 1$ . Moreover the solution satisfies the relation

\begin{matrix} P (x (t) \in D) = 1 for all t \geq t_{0} . \end{matrix}

Since the drift and diffusion coefficients of (2.3) are non-linear, the classical existence and uniqueness theorem of SDE (Kloeden and Platen 1995; Khasminskii 2012; Mao 2007; Øksendal 2003) does not apply. We use Theorem 1 to prove the existence and uniqueness of the solution of (2.3) in the interval $(0, K)$ in Theorem 2.

Theorem 2

Let the stochastic differential equation (2.3) be given for any $t_{0} \geq 0$ and initial value $N_{0} \in (0, K)$ independent of $W (t) - W (t_{0})$ . Then there exists a unique global positive solution $N : [t_{0}, \infty) \to ℜ^{+}$ such that with probability one $N (t) \in (0, K)$ . That is,

\begin{matrix} P \{N (t) \in (0, K), t \geq t_{0}\} = 1 . \end{matrix}

Proof

Define the sequence ${D_{r}}$ by

\begin{matrix} D_{r} = (\frac{1}{r}, K - \frac{1}{r}), r = 1, 2, \dots . \end{matrix}

Clearly, ${D_{r}}$ is an increasing sequence of open sets whose closures are contained in $(0, K)$ . The drift and diffusion coefficients $b (N) = \frac{μ}{K} (K - N) N + \frac{σ^{2}}{2 K^{2}} (K - N) (K - 2 N) N$ and $g (N) = \frac{σ}{K} (K - N) N$ , respectively, of (2.3) satisfy the Lipschitz and linear growth conditions locally in $D_{r}$ . Define a function $V : (0, K) \to ℜ^{+}$ by

\begin{matrix} V (N) = \frac{K}{N (K - N)} . \end{matrix}

Applying the L-operator ((2.5)) associated with (2.3) on V, we have

\begin{matrix} L V = & \frac{\partial V}{\partial t} + b (N) \frac{\partial V}{\partial N} + \frac{g^{2} (N)}{2} \frac{\partial^{2} V}{\partial N^{2}} \\ = & \frac{μ}{K} (K - N) N (- \frac{1}{N^{2}} + \frac{1}{{(K - N)}^{2}}) + \frac{σ^{2}}{K^{2}} {(K - N)}^{2} N^{2} (\frac{1}{N^{3}} + \frac{1}{{(K - N)}^{3}}) \\ + \frac{σ^{2}}{2 K^{2}} (K - N) (K - 2 N) N (- \frac{1}{N^{2}} + \frac{1}{{(K - N)}^{2}}) \\ \leq & \frac{μ}{K} \frac{K}{K - N} + \frac{σ^{2}}{K^{2}} (\frac{K^{3}}{(K - N) N} - 3 K) + \frac{σ^{2}}{2 K^{2}} (\frac{K (K - N) N}{{(K - N)}^{2}} + \frac{2 (K - N) N^{2}}{N^{2}}) \\ \leq & C V, \end{matrix}

where $C = μ + 3 σ^{2} / 2$ . For any $x \in D \ D_{r} = (0, \frac{1}{r}] ⋃ [K - \frac{1}{r}, K)$ , we have

\begin{matrix} inf_{t > 0, x \in D \ D_{r}} V (x) = r + \frac{1}{K} \to \infty a s r \to \infty . \end{matrix}

The result follows from Theorem 1. $□$

We give the exact solution of (2.3) in the following theorem. This solution will be used in the simulation process to plot the sample path of the death counts.

Theorem 3

For any given initial value $N_{0} \in (0, K)$ , the exact solution of the SDE (2.3) is obtained as

\begin{matrix} N (t) = \frac{K}{1 + \frac{K - N_{0}}{N_{0}} Φ^{- 1} (t)}, \end{matrix}

2.7

where

\begin{matrix} Φ (t) = e^{μ (t - t_{0}) + σ (W (t) - W (t_{0}))} . \end{matrix}

Proof

As shown in Theorem (2), there is a unique global positive solution $N (t) \in (0, K)$ for all $t \geq t_{0}$ with probability one. Define

\begin{matrix} u = ln (\frac{N}{K - N}), N \in (0, K) . \end{matrix}

2.8

It follows from (2.3) and (2.6) that

\begin{matrix} d u = μ d t + σ d W (t), u (t_{0}) = u_{0}, \end{matrix}

2.9

with solution

\begin{matrix} u (t) = u_{0} + μ (t - t_{0}) + σ (W (t) - W (t_{0})) . \end{matrix}

2.10

The result follows by substituting (2.10) into (2.8) and solving for N(t). $□$

Remark 1

Theorem 2 shows that the aggregate number of deaths cannot grow past a particular value, $K$ , with probability one if its starting point is in $(0, K)$ . If $0 < N_{0} < K$ , then it follows from Theorems 2 and 3 that the feasible epidemiological region of interest for the solution N(t) is the set

\begin{matrix} T = \{n \in ℜ^{+} | 0 < n < K\} . \end{matrix}

2.11

The following theorem shows that if $μ > σ^{2} / 2$ and the process N(t) starts from $(0, K)$ , then it converges almost surely to a random variable $N_{\infty}$ with finite expectation.

Theorem 4

For any given initial condition $N_{0} \in (0, K)$ , the process N(t) is a submartingale if $μ > σ^{2} / 2$ . That is,

\begin{matrix} E [N (t) | N (s), s \leq t] \geq N (s) . \end{matrix}

Furthermore, there exists a random variable $N_{\infty}$ such that $E [(N_{\infty})] < \infty$ and

\begin{matrix} lim_{t \to \infty} N (t) = N_{\infty} . \end{matrix}

2.12

Proof

It follows from (2.2) that for $s \leq t$ , we have

\begin{matrix} N (t) = N (s) + \int_{s}^{t} (\frac{μ}{K} (K - N (u)) N (u) + \frac{σ^{2}}{2 K^{2}} (K - N (u)) (K - 2 N (u)) N (u)) d u + \int_{s}^{t} \frac{σ}{K} (K - N (u)) N (u) d W (u) . \end{matrix}

If $μ > σ^{2} / 2$ , we obtain

\begin{matrix} E [N (t) | N (s)] = & N (s) + \int_{s}^{t} (\frac{μ}{K} (K - N (u)) N (u) + \frac{σ^{2}}{2 K^{2}} (K - N (u)) (K - 2 N (u)) N (u)) d u \\ \geq & N (s) . \end{matrix}

Clearly, we see from (2.7) that $0 < N (t) < K$ for all $t \geq 0$ . Hence, $sup_{t \geq 0} E [(N, {(t)}^{+})] < \infty$ . By the Martingale Convergence Theorem, we have $E (N_{\infty}) < \infty$ and equation (2.12) is satisfied. $□$

Remark 2

The Martingale Convergence Theorem is a stochastic analogue of the Monotone Convergence Theorem. Here, we also see that a submartingale property is a stochastic analogue of a non-decreasing sequence. Since aggregate death count is expected to be increasing with time and the process N(t) is stochastic, we need a condition that will guarantee a stochastic analogue of an increasing function. Theorem 4 gives such condition. It shows that the expected aggregate death counts is an increasing function provided $N_{0} \in (0, K)$ and $μ > σ^{2} / 2$ . It has been shown in several works (Mendez et al. 2012; Otunuga 2018, 2020) that the presence of environmental perturbations can affect the dynamic nature of biological systems if the noise intensity grows beyond a certain value. The latter condition shows that the noise intensity, $σ$ , of the environmental perturbation must not be allowed to grow beyond a certain function of the average death count’s growth rate $μ$ if the submartingale property is to be maintained. Following Theorem 4, we assume for the rest of this work that the noise intensity $σ < \sqrt{2 μ}$ .

We discuss and analyze the distribution of the random aggregate number of death counts process N(t) by first deriving its transition probability density function. The distribution will be used to estimate the epidemiological parameters $K$ , $μ$ , and $σ$ , and to calculate the expected aggregate number of death counts at a particular time t in the United States. These estimates will later be used in simulating and forecasting the total death counts.

Probability Distribution of the Aggregate Number of Deaths Following (2.3)

Let $p_{N} (n | t, N_{0})$ represents the transition probability density function (PDF) for the aggregate death counts N(t) given t and $N_{0}$ . Following the results in (2.8) and (2.10), the transition probability density function $p_{N} (n | t, N_{0})$ is obtained as

\begin{matrix} p_{N} (n | t, N_{0}) = \frac{K}{n (K - n) σ \sqrt{2 π (t - t_{0})}} exp (- \frac{{(ln (\frac{n}{(K - n)}, \frac{(K - N_{0})}{N_{0}}) - μ (t - t_{0}))}^{2}}{2 (t - t_{0}) σ^{2}}), 0 < n < K . \end{matrix}

3.1

The purpose of deriving this PDF is to be able to estimate the epidemiological parameters in model (2.3) using the MLE scheme.

Parameter Estimates

Let T be a number corresponding to the current date the Covid-19 aggregate data is collected, and $t_{0} < t_{1} < \dots < t_{m} = T$ be a partition P of the interval $[t_{0}, T]$ . Denote $N (t_{j})$ by $N_{j}$ and let $N_{0}, N_{1}, N_{2}, \dots, N_{m}$ be samples satisfying (2.3) at a given time. Let $Δ t_{j} = t_{j} - t_{j - 1}$ , $j = 1, 2, \dots, m$ . The likelihood and log-likelihood functions $L (Θ | N)$ and $L (Θ | N)$ , respectively, of the samples are obtained using (3.1), the transformation (2.8), and the distribution of u in (2.10), as

\begin{matrix} L (Θ | N) = & \prod_{j = 1}^{m} p_{N} (N_{j} | t_{j}, N_{j - 1}) \\ = & \prod_{j = 1}^{m} \frac{K}{N_{j} (K - N_{j}) σ \sqrt{2 π Δ t_{j}}} exp (- \frac{{(ln (\frac{N_{j}}{(K - N_{j})}, \frac{(K - N_{j - 1})}{N_{j - 1}}) - μ Δ t_{j})}^{2}}{2 Δ t_{j} σ^{2}}), \end{matrix}

and

\begin{matrix} \begin{matrix} L (Θ | N) = & m ln K - \sum_{j = 1}^{m} ln (N_{j}, (K - N_{j})) - \frac{m}{2} ln σ^{2} - \frac{1}{2 σ^{2}} \sum_{j = 1}^{m} \frac{1}{Δ t_{j}} {(ln (\frac{N_{j}}{(K - N_{j})}, \frac{(K - N_{j - 1})}{N_{j - 1}}) - μ Δ t_{j})}^{2}, \end{matrix} \end{matrix}

3.2

where $Θ = {K, μ, σ}$ represents the parameter set to be estimated. The maximum likelihood estimates $\hat{K}$ , $\hat{μ}$ , ${\hat{σ}}^{2}$ of $K$ , $μ$ , $σ^{2}$ are estimated from (3.2) as

\begin{matrix} \begin{matrix} \hat{μ} & = & \frac{\sum_{j = 1}^{m} ln (\frac{N_{j}}{(\hat{K} - N_{j})}, \frac{(\hat{K} - N_{j - 1})}{N_{j - 1}})}{\sum_{j = 1}^{m} Δ t_{j}} = \frac{ln (\frac{N (T)}{(\hat{K} - N (T))}, \frac{(\hat{K} - N_{0})}{N_{0}})}{T - t_{0}}, \\ {\hat{σ}}^{2} & = & \frac{1}{m} \sum_{j = 1}^{m} \frac{1}{Δ t_{j}} {(ln (\frac{N_{j}}{(\hat{K} - N_{j})}, \frac{(\hat{K} - N_{j - 1})}{N_{j - 1}}) - \hat{μ} Δ t_{j})}^{2}, \end{matrix} \end{matrix}

3.3

where $\hat{K}$ satisfies

\begin{matrix} \frac{m}{\hat{K}} - \sum_{j = 1}^{m} \frac{1}{\hat{K} - N_{j}} - \frac{1}{2 {\hat{σ}}^{2}} \sum_{j = 1}^{m} \frac{1}{Δ t_{j}} (ln (\frac{N_{j}}{(\hat{K} - N_{j})}, \frac{(\hat{K} - N_{j - 1})}{N_{j - 1}}) - \hat{μ} Δ t_{j}) (\frac{N_{j} - N_{j - 1}}{(\hat{K} - N_{j - 1}) (\hat{K} - N_{j})}) = 0 . \end{matrix}

3.4

Remark 3

The initial point $N_{0}$ can also be estimated for better simulation result. In this case, the estimates $\hat{K}$ , $\hat{μ}$ , ${\hat{σ}}^{2}$ , ${\hat{N}}_{0}$ of $K$ , $μ$ , $σ^{2}$ , and $N_{0}$ reduce to

\begin{matrix} \begin{matrix} \hat{μ} & = & \frac{ln (\frac{N (T)}{(\hat{K} - N (T))}, \frac{(\hat{K} - N_{1})}{N_{1}})}{T - t_{0}}, \\ {\hat{N}}_{0} & = & \frac{\hat{K}}{1 + \frac{\hat{K} - N_{1}}{N_{1}} e^{\hat{μ} Δ t}}, \\ {\hat{σ}}^{2} & = & \frac{1}{m} \sum_{j = 1}^{m} \frac{1}{Δ t_{j}} {(ln (\frac{N_{j}}{(\hat{K} - N_{j})}, \frac{(\hat{K} - N_{j - 1})}{N_{j - 1}}) - \hat{μ} Δ t_{j})}^{2} |_{N_{0} = {\hat{N}}_{0}}), \end{matrix} \end{matrix}

3.5

where $\hat{K}$ satisfies

\begin{matrix} \frac{m}{\hat{K}} - \sum_{j = 1}^{m} \frac{1}{\hat{K} - N_{j}} - \frac{1}{2 {\hat{σ}}^{2}} \sum_{j = 1}^{m} \frac{1}{Δ t_{j}} (ln (\frac{N_{j}}{(\hat{K} - N_{j})}, \frac{(\hat{K} - N_{j - 1})}{N_{j - 1}}) - \hat{μ} Δ t_{j}) (\frac{N_{j} - N_{j - 1}}{(\hat{K} - N_{j - 1}) (\hat{K} - N_{j})}) |_{N_{0} = {\hat{N}}_{0}}) = 0 . \end{matrix}

3.6

Expected and Simulated Number of Deaths

Since the aggregate death counts N(t) is a random process, it is important to calculate the expected number of total deaths at each given time. Given the initial value $N_{0}$ , the expected aggregate number of deaths, denoted $E [N (t) |, N_{0}]$ , at time t is calculated from (3.1) as

\begin{matrix} \begin{matrix} E [N (t) | N_{0} = n_{0}] = & \int_{0}^{K} n p_{N} (n | t, N_{0}) d n = \int_{- \infty}^{\infty} \frac{K}{σ \sqrt{2 π t} (1 + \frac{K - N_{0}}{N_{0}} e^{- u})} exp [- \frac{{(u - μ t)}^{2}}{2 σ^{2} t}] d u . \end{matrix} \end{matrix}

3.7

We show in the next theorem that for each time t, the expected death counts falls in $(0, K)$ if the death counts starts there.

Theorem 5

If $N_{0} \in (0, K)$ , then $0 < E [N (t) | N_{0}] < K$ .

Proof

If $N_{0} \in (0, K)$ , then it follows from (3.7) that

\begin{matrix} E [N (t) | N_{0}] = \int_{- \infty}^{\infty} \frac{K}{σ \sqrt{2 π t} (1 + \frac{K - N_{0}}{N_{0}} e^{- u})} exp [- \frac{{(u - μ t)}^{2}}{2 σ^{2} t}] d u < \int_{- \infty}^{\infty} \frac{K}{σ \sqrt{2 π t}} exp [- \frac{{(u - μ t)}^{2}}{2 σ^{2} t}] d u = K . \end{matrix}

$□$

Theorem 5 shows that the expected aggregate number of deaths will always be in the feasible region $T$ if the initial point $N_{0}$ starts from there. The following theorem gives the total deaths expected on the long run and shows condition under which this number converges to the point $N = K$ using Theorem 5.3 of Khasminskii (2012).

Theorem 6

If $μ > σ^{2} / 2$ and $N_{0} \in (0, K)$ , then the total deaths expected on the long run is $K$ .

Proof

Consider the random process $z (t) = K - N (t)$ . It follows from (3.1) that the probability density function $p_{z} (z | t, z_{0})$ of the random variable z given the initial point $z_{0}$ is obtained as

\begin{matrix} p_{z} (z | t, z_{0}) = \frac{K}{(K - z) z σ \sqrt{2 π (t - t_{0})}} exp (- \frac{{(ln (\frac{(K - z)}{z}, \frac{z_{0}}{(K - z_{0})}) - μ (t - t_{0}))}^{2}}{2 (t - t_{0}) σ^{2}}), 0 < z < K . \end{matrix}

For any $δ > 0$ ,

\begin{matrix} 0 \leq sup_{| z_{0} | > δ, t \geq t_{0}} E [z (t) |, z_{0}] \leq \int_{- \infty}^{\infty} \frac{K}{σ \sqrt{2 π (t - t_{0})} (1 + \frac{K - δ}{δ} e^{u})} exp [- \frac{{(u - μ (t - t_{0}))}^{2}}{2 σ^{2} (t - t_{0})}] d u \to 0 as δ \to 0 \end{matrix}

and

\begin{matrix} 0 < E [z (t) |, z_{0}] < \int_{- \infty}^{\infty} \frac{K}{σ \sqrt{2 π (t - t_{0})} (\frac{K - z_{0}}{z_{0}}, e^{u})} exp [- \frac{{(u - μ (t - t_{0}))}^{2}}{2 σ^{2} (t - t_{0})}] d u = \frac{z_{0}}{K - z_{0}} exp (- (μ - \frac{σ^{2}}{2}) (t - t_{0})) . \end{matrix}

We deduce from the Squeeze Theorem that

\begin{matrix} lim_{t \to \infty} E [z (t) | z_{0}] = 0 . \end{matrix}

3.8

Hence, the result follows from Khasminskii (2012). $□$

Predicting the End of Covid-19 Death

Since N(t) denotes the aggregate death counts at a particular time t, it follows that dN/dt will describe the daily death counts. Following Theorems 2 and 6, we know that the daily death counts converges to zero as N(t) converges asymptotically to $K$ . So, in order to calculate an approximate time that people will stop dying of Covid-19, we need to calculate, in an $ϵ > 0$ neighborhood of $K$ , a time when the aggregate number $K - ϵ$ is reached. For some positive small constant $ϵ$ , define the open interval

\begin{matrix} A_{ϵ} = (K - ϵ, K) . \end{matrix}

4.1

Following Theorem 6, we plan to calculate the first-hitting-time $τ_{ϵ}$ until the process N(t) enters $A_{ϵ}$ .

Definition 2

We define the first passage time $τ_{ϵ}$ as

\begin{matrix} τ_{ϵ} = inf \{t > t_{0} : N (t) = K - ϵ\} . \end{matrix}

4.2

Let $g (t) = P (τ_{ϵ} \leq t)$ and $f_{τ_{ϵ}} (t) = d g (t) / d t$ be the First Passage Time Density (FPTD), the probability that the aggregate number of death counts N(t) has first reached a point $K - ϵ$ at exactly time t. We derive $f_{τ_{ϵ}} (t)$ and the expected first hitting time in the theorem below for small $ϵ$ .

Theorem 7

If $N_{0} \in (0, K)$ , then the probability $f_{τ_{ϵ}} (t)$ that the aggregate number of death counts N(t) has first reached a point $K - ϵ$ at exactly time t is obtained as

\begin{matrix} f_{τ_{ϵ}} (t) = \frac{1}{σ \sqrt{2 π {(t - t_{0})}^{3}}} ln (\frac{K - ϵ}{ϵ}, \frac{K - N_{0}}{N_{0}}) exp (- \frac{{(ln (\frac{K - ϵ}{ϵ}, \frac{K - N_{0}}{N_{0}}) - μ (t - t_{0}))}^{2}}{2 σ^{2} (t - t_{0})}), t > t_{0} . \end{matrix}

Furthermore, for t₀=0, the expected first hitting time $E (τ_{ϵ})$ is obtained as

\begin{matrix} E (τ_{ϵ}) = \frac{1}{μ} ln (\frac{(K - ϵ) (K - N_{0})}{ϵ N_{0}}) . \end{matrix}

4.3

Proof

\begin{matrix} P (τ_{ϵ} \leq t) = & \frac{1}{\sqrt{π}} \int_{\frac{ln (\frac{K - ϵ}{ϵ}, \frac{K - N_{0}}{N_{0}}) - μ (t - t_{0})}{σ \sqrt{2 (t - t_{0})}}}^{\infty} e^{- y^{2}} d y + \frac{e^{\frac{2 μ}{σ^{2}} ln (\frac{K - ϵ}{ϵ}, \frac{K - N_{0}}{N_{0}})}}{\sqrt{π}} \int_{\frac{ln (\frac{K - ϵ}{ϵ}, \frac{K - N_{0}}{N_{0}}) + μ (t - t_{0})}{σ \sqrt{2 (t - t_{0})}}}^{\infty} e^{- y^{2}} d y . \end{matrix}

From this, and the Fundamental Theorem of Calculus, we have

\begin{matrix} f_{τ_{ϵ}} (t) = & \frac{d}{dt} P (τ_{ϵ} \leq t) \\ = & \frac{1}{σ \sqrt{2 π {(t - t_{0})}^{3}}} ln (\frac{K - ϵ}{ϵ}, \frac{K - N_{0}}{N_{0}}) exp (- \frac{{(ln (\frac{K - ϵ}{ϵ}, \frac{K - N_{0}}{N_{0}}) - μ (t - t_{0}))}^{2}}{2 σ^{2} (t - t_{0})}), t > t_{0}, \end{matrix}

and for t₀=0,

\begin{matrix} E (τ_{ϵ}) = & \frac{1}{σ \sqrt{2 π}} \int_{0}^{\infty} (t^{- 1 / 2} ln (\frac{K - ϵ}{ϵ}, \frac{K - N_{0}}{N_{0}})) exp (- \frac{{(ln (\frac{K - ϵ}{ϵ}, \frac{K - N_{0}}{N_{0}}) - μ t)}^{2}}{2 σ^{2} t}) d t \\ = & \frac{1}{μ} ln (\frac{(K - ϵ) (K - N_{0})}{ϵ N_{0}}), 0 < ϵ < K . \end{matrix}

$□$

Equation (4.3) and Theorem 7 can be used to calculate an approximate expected time when people in the United States will stop dying of the Covid-19 by making $ϵ$ to be as small as possible.

Remark 4

For the solution $N_{σ = 0} (t) = \frac{K}{1 + \frac{K - N_{0}}{N_{0}} e^{- μ t}}$ of (1.1), the time $T_{ϵ} = \frac{1}{μ} ln (\frac{(K - ϵ) (K - N_{0})}{ϵ N_{0}})$ at which $N_{σ = 0} (t) = K - ϵ$ satisfies $T_{ϵ} = E (τ_{ϵ})$ .

As shown in Sect. 1 with respect to model (1.1), the aggregate death count’s growth speeds up in the interval $(0, K)$ and slows down in the interval $(K / 2, K)$ . This shows that the maximum number of daily deaths can be calculated as $max (\frac{dN}{dt}) = \frac{μ}{4} K$ , occurring at time $T_{K / 2} = \frac{1}{μ} ln (\frac{K - N_{0}}{N_{0}}) = E (τ_{K / 2})$ . In the next theorem, we calculate the expected first time when the aggregate death counts reaches the size $K / 2$ for the stochastic case.

Corollary 8

If $N_{0} \in (0, K)$ , we have $lim_{ϵ \to 0^{+}} E (τ_{ϵ}) = + \infty$ . That is, the process N(t) never reaches the point $K$ . Also, $E (τ_{K / 2}) = \frac{1}{μ} ln (\frac{(K - N_{0})}{N_{0}}) .$

Proof

The proof follows from (4.3) and Theorem 7. $□$

The expected first time passage $E (τ_{K / 2})$ is analogous to the deterministic time $T_{K / 2}$ when the process N(t) starts slowing down. That is, the time where the maximum number of daily deaths occurs.

As discussed in Sect. 2, we showed, with respect to the deterministic model (1.1), the current day’s aggregate death count’s growth is slowing down starting from the moment when the current day’s aggregate count is more than $K / 2$ . Otherwise, the virus is spreading, with speeding growth. Numerical results in Figs. 7 and 8 also show, for the stochastic case, that the expected aggregate count slows down starting from the time $E (τ_{K / 2})$ .

Fig. 7 — Expected aggregate counts for the states: AR, AZ, CO, FL, GA, IA, IN, KS, KY, MA, MD, ME, MN, MT, NC, NE in the United States

Fig. 8 — Expected aggregate counts for the states: NM, NV, OK, OR, TN, UT, WI, WY in the United States

Numerical Simulation and Forecast for the Aggregate Death Counts in the United States

As discussed earlier, W(t) is a Wiener process that depends continuously on $t \in [0, T]$ with independent increment property such that $W (t_{0} = 0) = 0$ , $W (t) - W (s) \sim \sqrt{t - s} N (0, 1)$ for $0 \leq s < t \leq T$ , where N(0, 1) is the standard normal distribution. Let $Δ W_{j} = W_{j} - W_{j - 1}$ , $j = 1, 2, \dots, m$ , where $W_{j + 1} = W (t_{j + 1})$ , $t_{j} = t_{0} + (j - 1) Δ t_{j}$ . We discretize the Wiener processes $Δ W_{j}$ with time step $Δ t_{j}$ , and $W (t_{j})$ as $Δ W_{j} \sim \sqrt{Δ t_{j}} N (0, 1)$ and $W (t_{j}) = W (t_{j - 1}) + Δ W_{j}$ , $j = 2, 3, \dots, m$ with $W (t_{1}) = Δ W_{1}$ .

Let ${\hat{N}}_{j}$ be the discretized aggregate death counts satisfying the solution (2.7) at time $t_{j}$ , $j = 1, 2, \dots, m$ . We estimate ${\hat{N}}_{j}$ using (2.7) as

\begin{matrix} {\hat{N}}_{j} = \frac{\hat{K}}{1 + \frac{\hat{K} - N_{0}}{N_{0}} exp (- \hat{μ} t_{j} - \hat{σ} W (t_{j}))}, j = 1, 2, \dots, m, \end{matrix}

5.1

where $\hat{K}, \hat{μ}, \hat{σ}$ are calculated in Sect. 3. In order to generate pseudo samples for each point ${\hat{N}}_{j}$ , we define $Δ W_{j}^{l} = W_{j}^{l} - W_{j - 1}^{l}$ , $j = 1, 2, \dots, m$ , $l = 1, 2, \dots, L$ , for sample size m and L number of simulations. Using Milstein scheme (Gaines and Lyons 1994), the l-th discretized solution $N_{j}^{l} \equiv N {(t_{j})}^{l}$ of (2.3) satisfies

\begin{matrix} \begin{matrix} N_{j}^{l} & = & N_{j - 1}^{l} + (\frac{\hat{μ}}{\hat{K}} (K - N_{j - 1}^{l}) N_{j - 1}^{l} + \frac{{\hat{σ}}^{2}}{2 {\hat{K}}^{2}} (\hat{K} - N_{j - 1}^{l}) (\hat{K} - 2 N_{j - 1}^{l}) N_{j - 1}^{l}) Δ t_{j} + \frac{\hat{σ}}{K} (K - N_{j - 1}^{l}) N_{j - 1}^{l} Δ W_{j}^{l} \sqrt{Δ t_{j}} \\ + \frac{{\hat{σ}}^{2}}{2 {\hat{K}}^{2}} N_{j - 1}^{l} (\hat{K} - N_{j - 1}^{l}) (\hat{K} - 2 N_{j - 1}^{l}) ({(Δ, W_{j}^{l})}^{2} - 1) Δ t_{j}, \end{matrix} \end{matrix}

5.2

for $j = 1, 2, \dots, m$ , $l = 1, 2, \dots, L$ . We use the estimate (5.1), together with the estimated parameters $\hat{K}$ , $\hat{μ}$ and $\hat{σ}$ in (3.5)–(3.6) to fit the aggregate COVID-19 death counts in the fifty states in the United States from the period when the first death count is recorded to June 20, 2021, and also to forecast from June 21, 2021 to June 24, 2021. Model (5.2) and (3.1) are used to generate the probability density function for the aggregate death counts for each time $t_{j}$ . Let ${\hat{N}}_{j, σ = 0}$ denote the deterministic equivalent of (5.1), which is the discretization of solution of (1.1) with $σ = 0$ . The parameters in ${\hat{N}}_{j, σ = 0}$ are also estimated using the Non-Linear Least Square estimate scheme (Coleman and Li 1996; May 1963) for model comparison purposes. Denote the root mean square error for the deterministic and stochastic discretization scheme by $RMSE$ and $RAMSE$ , respectively. We define

\begin{matrix} \begin{matrix} RMSE & = & {[\frac{1}{m}, \sum_{j = 1}^{m}, {({\hat{N}}_{j, σ = 0} - N (t_{j}))}^{2}]}^{\frac{1}{2}}, \\ RAMSE & = & {[\frac{1}{m}, \sum_{j = 1}^{m}, {({\hat{N}}_{j} - N (t_{j}))}^{2}]}^{\frac{1}{2}}, \end{matrix} \end{matrix}

5.3

where ${\{N (t_{j})\}}_{j = 1}^{m}$ is the real aggregate death counts data. In order to show the superiority of model (2.3) over (1.1), we compare the root mean square errors $RMSE$ and $RAMSE$ and show that model (2.3) has a smaller root mean square error.

Table 1 shows the parameter estimates for the stochastic model (2.3) together with the root mean square errors $RMSE$ and $RAMSE$ in (5.3) for the deterministic and stochastic cases, respectively, using the Covid-19 aggregate death counts in the United States for the period when the first death case is reported to June 20, 2021. Here, $N_{0}$ denotes the estimate of the starting value when the first death case is reported. The expected first time the aggregate death counts is more than half its carrying capacity is calculated. The root mean square error $RMSE$ was calculated by first estimating the parameters in the deterministic model (1.1) using the Non-Linear Least Square estimate scheme (May 1963; Coleman and Li 1996). The estimated parameters for the deterministic model are not reported in this work. Within the analysis period, we see that the expected aggregate death counts started slowing down around mid December when the first vaccine was administered for most states. A quick comparison of $RMSE$ and $RAMSE$ in Table 1 shows that the stochastic model (2.3) performs better than the deterministic model (1.1) in describing the trajectory of the aggregate count of Covid-19 in the United States. In order to minimize space, we only show the real and simulated death counts for 24 out of 50 states in the United States in Figs. 1 and 2.

Table 1.

Parameter estimates using stochastic model (2.3) for the first day Covid-19 deaths is recorded to June 20, 2021

State	$\hat{K}$	$\hat{μ}$	$\hat{σ}$	$N_{0}$	$RMSE$	$RAMSE$	$E (τ_{K / 2})$
AK	366	0.0235	0.0190	1.0172	11.3997	11.2450	250.6866 ( 23-Nov-2020)
AL	12269	0.0142	0.0111	237.8943	448.4912	434.3204	276.0435 ( 29-Nov-2020)
AR	6020.2	0.0208	0.0168	28.0941	136.1947	117.4587	257.9212 ( 07-Dec-2020)
AZ	19306	0.0137	0.0328	419.7449	996.1320	843.8170	280.1291 ( 25-Dec-2020)
CA	71273	0.0139	0.0212	648.3989	3806.2	3686.1	338.2706 ( 05-Jan-2021)
CO	6941.6	0.0128	0.0442	265.8343	438.0126	323.4614	258.6176 ( 29-Nov-2020)
CT	8824.2	0.0080	0.0434	1638.1	731.9427	539.3944	200.3560 ( 04-Oct-2020)
DE	2018.8	0.0085	0.0274	166.8309	97.1994	77.3889	288.3975 ( 08-Jan-2021)
FL	40794	0.0126	0.0000	1.2765	1191.6	1191.6	273.3731 ( 07-Dec-2020)
GA	25150	0.0113	0.0040	817.7736	645.5268	639.6664	300.7165 ( 07-Jan-2021)
HI	517.11	0.0178	0.0001	6.0115	16.7624	16.7624	249.3262 ( 05-Dec-2020)
IA	6308.2	0.0180	0.0334	58.1454	243.9217	184.7225	261.9713 ( 12-Dec-2020)
ID	2122.0	0.0210	0.0243	10.7009	49.0611	33.2558	251.9702 ( 03-Dec-2020)
IL	26890	0.0119	0.0358	1.3726	1407.4	1013.6	249.2160 ( 21-Nov-2020)
IN	14358	0.0153	0.0422	298.5343	803.2925	603.8770	255.4122 ( 27-Nov-2020)
KS	5259.6	0.0233	0.0270	6.1572	155.4604	142.3990	290.0863 ( 27-Dec-2020)
KY	8567.7	0.0130	0.0180	97.4734	202.7383	177.6680	344.4590 ( 25-Feb-2021)
LA	11488	0.0102	0.0166	1.1347	436.0073	375.6429	218.4941 ( 19-Oct-2020)
MA	19462	0.0084	0.0337	3.0196	1289.9	983.2224	210.8422 ( 15-Oct-2020)
MD	10656	0.0091	0.0335	995.9400	568.9196	397.9094	255.0706 ( 28-Nov-2020)
ME	868.01	0.0165	0.0563	7.0728	53.0840	40.3454	296.6184 ( 18-Jan-2021)
MI	22940	0.0085	0.0323	1862.2	1221.2	931.3025	292.8191 ( 20-Dec-2020)
MN	7759.4	0.0152	0.0372	170.5692	376.5030	277.8504	251.9335 ( 28-Nov-2020)
MO	9325.2	0.0196	0.0268	80.8314	302.6209	236.9409	243.3070 ( 16-Nov-2020)
MS	7649.2	0.0144	0.0230	264.3824	270.1491	214.1997	233.0122 ( 24-Oct-2020)
MT	1651	0.0237	0.0001	2.8173	26.8629	28.8074	269.1468 ( 20-Dec-2020)
NC	14141	0.0155	0.0210	207.0690	487.1274	427.1619	271.8087 ( 11-Dec-2020)
ND	1523	0.0338	0.0156	0.6061	37.0740	37.0678	231.9706 ( 14-Nov-2020)
NE	2294.8	0.0216	0.0402	11.5529	101.2707	77.5800	246.9092 ( 29-Nov-2020)
NH	1497.1	0.0106	0.0424	79.2622	88.0697	60.7974	280.1738 ( 28-Dec-2020)
NJ	26374	0.0091	0.0516	4.8141	2381.3	1602.9	180.5213 ( 07-Sep-2020)
NM	4380.8	0.0209	0.0535	22.4488	232.2403	160.4579	255.3504 ( 03-Dec-2020)
NV	6068.2	0.0146	0.0242	101.9782	225.0800	191.6449	280.5323 ( 22-Dec-2020)
NY	23267	0.0070	0.0359	3360.5	1663.7	1313.5	267.5434 ( 08-Dec-2020)
OH	23400	0.0123	0.0271	496.0223	1173	1068.3	314.31 ( 28-Jan-2021)
OK	7550.7	0.0219	0.0352	21.9981	249.1718	188.7447	267.6021 ( 12-Dec-2020)
OR	2785.0	0.0164	0.0287	25.2546	83.2097	60.4606	288.1383 ( 28-Dec-2020)
PA	2996.6	0.0113	0.0419	1240.5	1790.9	1334.7	283.8442 ( 27-Dec-2020)
RI	2986.6	0.0102	0.0375	252.4832	190.8369	143.3343	240.9393 ( 15-Nov-2020)
SC	10576	0.0141	0.0132	251.1180	391.9012	373.3641	264.1748 ( 05-Dec-2020)
SD	2027	0.0286	0.0380	0.9668	51.6401	43.1584	268.6586 ( 05-Dec-2020)
TN	12757	0.0211	0.0325	51.1127	378.6108	261.3577	263.1342 ( 08-Dec-2020)
TX	53563	0.0154	0.0215	959.9323	1975.8	1734.3	260.4598 ( 30-Nov-2020)
UT	2424.9	0.0164	0.0270	24.7834	61.0702	37.3714	279.9737 ( 27-Dec-2020)
VA	14144	0.0102	0.0229	458.0540	542.5815	471.6913	336.5766 ( 15-Feb-2021)
VT	301.44	0.0100	0.0436	11.7290	19.7914	16.1092	330.1658 ( 12-Feb-2021)
WA	6749.6	0.0095	0.0179	377.1470	224.7571	191.6103	298.7084 ( 24-Dec-2020)
WI	8047	0.0200	0.0294	41.1695	285.9822	232.0415	264.1555 ( 09-Dec-2020)
WV	2890.9	0.0231	0.0314	4.2413	67.4599	47.9361	283.0646 ( 06-Jan-2021)
WY	734	0.0340	0.0180	0.1363	14.1754	14.4533	252.5651 ( 22-Dec-2020)

Open in a new tab

Figures 1 and 2 show the real and simulated aggregate death counts for some of the fifty states in the United States. In order to forecast the aggregate death counts from June 21, 2021 to June 24, 2021, we analyze the data starting from June 4, 2021 to June 20, 2021. The parameter estimates are shown in Table 2.

Table 2.

Parameter estimates using stochastic model (2.3) from June 4, 2021 to June 20, 2021

State	$\hat{K}$	$\hat{μ}$	$\hat{σ}$	$N_{0}$	$N_{06 / 21 / 2021}$	$N_{06 / 22 / 2021}$	$N_{06 / 23 / 2021}$	$N_{06 / 24 / 2021}$
AK	368.4	0.0938	0.0179	360.41	366.75	366.87	367.01	367.11
AL	11441	0.0057	0.0001	11299	11312	11313	11314	11315
AR	6013.6	0.0131	0.0018	5842.1	5876.2	5877.7	5879.5	5881.0
AZ	28387	0.0015	31527	17665	17847	17857	17867	17877
CA	93187	0.00047	0.0001	62447	62620	62630	62639	62649
CO	6925.7	0.0337	0.0001	6593.5	6740.5	6746.4	6752.2	6757.8
CT	8279.6	0.0800	0.0001	8241.6	8270.6	8271.2	8271.9	8272.5
DE	3021.4	0.00057	0.0001	1675.0	1682.7	1683.1	1683.5	1683.9
FL	38744	0.0300	0.0039	36925	37643	37671	37703	37729
GA	28840	0.0042	0.0001	20917	21341	21364	21387	21410
HI	4824.5	0.0011	0.0001	498.64	507.73	508.24	508.75	509.26
IA	6156.3	0.0520	0.0001	6060.2	6118.2	6120.1	6121.9	6123.6
ID	2160.4	0.0295	0.0001	2101.3	2125.3	2126.3	2127.3	2128.2
IL	26192	0.0249	0.0001	25282	25603	25618	25631	25645
IN	18348	0.0014	0.0001	13702	13789	13794	13799	13804
KS	5320.7	0.0176	0.0064	5081.1	5138.8	5141.0	5144.3	5146.2
KY	7213.6	0.0634	0.0043	7166.4	7198.1	7199.0	7199.9	7200.7
LA	17722	0.0012	0.0001	10602	10696	10701	10706	10711
MA	18433	0.0088	0.00022	17898	17974	17978	17982	17985
MD	10000	0.0151	0.0001	9635.8	9720.3	9724.4	9728.4	9732.4
ME	1068.6	0.0075	0.0036	836.96	857.57	858.35	859.73	860.41
MI	21561	0.0263	0.0015	20541	20909	20925	20941	20956
MN	8094.2	0.0129	0.00039	7535.8	7644.0	7649.3	7654.8	7659.9
MO	9424.0	0.0266	0.00069	9141.8	9246.6	9251.0	9255.5	9259.8
MS	7601.6	0.0128	0.00069	7326.0	7380.2	7382.8	7385.5	7388.1
MT	2749.7	0.0024	0.0006	1628.7	1654.9	1656.1	1657.8	1658.9
NC	18353	0.0015	0.0001	13245	13342	13348	13353	13358
ND	1530.0	0.0605	0.0194	.514.2	1524.1	1524.4	1524.7	1524.9
NE	2259	0.2955	0.0978	2243.4	2258.9	2258.9	2258.9	2258.9
NH	1811.7	0.0027	0.0005	1353.2	1368.8	1369.5	1370.5	1371.2
NJ	26778	0.0158	0.0003	26259	26385	26391	26397	26403
NM	4526.7	0.0052	0.0012	4315.9	4332.9	4333.7	4334.7	4335.4
NV	10653	0.0013	0.0001	5592.0	5652.4	5655.7	5659.1	5662.4
NY	20320	0.0128	0.0003	19810	19912	19917	19922	19927
OH	20761	0.0190	0.0001	19948	20176	20187	20197	20208
OK	8005.6	0.0060	0.0026	7315.4	7373.4	7375.6	7379.4	7381.4
OR	2877.7	0.0304	0.0001	2680.4	2760.1	2763.5	2766.8	2770.0
PA	28203	0.0222	0.0008	27305	27592	27605	27618	27630
RI	2796.8	0.0064	0.0001	2717.1	2725.5	2726.0	2726.4	2726.9
SC	10210	0.0057	0.0004	9771.0	9811.3	9813.3	9815.5	9817.5
SD	2028.9	0.0982	0.0001	2020.4	2027.4	2027.6	2027.7	2027.8
TN	13509	0.0027	0.0001	12474	12520	12523	12525	12527
TX	51977	0.0236	0.0001	50572	51050	51071	51092	51112
UT	2395.8	0.0159	0.0001	2309.1	2330.1	2331.1	2332.1	2333.1
VA	11616	0.0282	0.0031	11218	11369	11375	11382	11388
VT	256.4	0.00005	0.0001	256.00	256.00	256.00	256.00	256.0
WA	12305	0.000	0.0009	5824.1	5809.7	5807.3	5807.9	5805.0
WI	8267.0	0.0263	0.0001	7942.5	8061.8	8067.0	8072.1	8077.0
WV	2882.0	0.0995	0.0002	2800.1	2868.0	2869.3	2870.5	2871.6
WY	3320.9	0.0021	0.0011	718.2092	737.1472	737.8835	739.2320	739.87

Open in a new tab

Table 2 contains parameter estimates derived using data set from June 4, 2021 to June 20, 2021. Here, $N_{0}$ denotes the estimate of the starting value for June 4, 2021. These estimates are used in the forecast for the aggregate death counts from June 21, 2021 to June 24, 2021. In Table 2, $N_{06 / 21 / 2021}$ denotes the forecast aggregate death counts estimate for June 21, 2021. The $95 %$ confidence interval for the forecast estimate is also calculated and presented in Figs. 3 and 4.

Fig. 3 — Forecast of aggregate counts of COVID-19 death counts for the states: AR, AZ, CO, FL, GA, IA, IN, KS, KY, MA, MD, ME, MN, MT, NC, NE in the United States

Fig. 4 — Forecast of aggregate counts of COVID-19 death counts for the states: NM, NV, OK, OR, TN, UT, WI, WY in the United States

Figures 3 and 4 show simulated and forecast estimate for the aggregate death counts of Covid-19 in the United States. The parameter estimates used for the simulation are given in Table 2 using the data set for June 4, 2021 to June 20, 2021. These parameters are used to forecast the aggregate death counts for June 21, 2021 to June 24, 2021.

In order to verify the validity of the obtained probability density function (3.1), we show in the following graphs the comparison of the probability density function for the random variable $N_{T}^{l}$ given in (5.2) for $l = 10, 000$ simulations with the probability density function in (3.1) by setting $t = t_{m} = T$ . The time $t = T$ corresponds to the day: June 24, 2021.

By generating the histogram of ${N_{T}^{l}}_{l = 1}^{10, 000}$ in (5.2), we show the comparison of the graphs of the probability density function for the random variable $N_{T}^{l}$ and $p_{N} (N | T, N_{0})$ in Figs. 5 and 6. The graphs show that the probability density function concentrates on a particular value. To know what this value is, we calculate the expected value of the aggregate death counts obtained in (3.7) for each of the fifty states in the United States and noticed the probability density function concentrates on the expected aggregate count $E (N (T))$ on June 24, 2021, with $t = t_{m} = T$ denoting June 24, 2021. We also noticed that this value is close to the equilibrium point $K$ , which is the maximum aggregate death counts as at the time this research is conducted. Define

\begin{matrix} N_{T, max} = {\arg \max}_{N \in (0, K)} p_{N} (N | T, N_{0}) \end{matrix}

5.4

The comparison of the value $N_{T, max}$ where the probability density function $p_{N} (N | T, N_{0})$ concentrates on, with the expected aggregate count on June 24, 2021 is shown in Table 3. The plot of the expected aggregate death counts is plotted in Figs. 7 and 8 as a function of time.

Fig. 5 — Probability density function for COVID-19 death counts for the states: AR, AZ, CO, FL, GA, IA, IN, KS, KY, MA, MD, ME, MN, MT, NC, NE on June 24, 2021

Fig. 6 — Probability density function for COVID-19 death counts for the states: NM, NV, OK, OR, TN, UT, WI, WY on June 24, 2021

Table 3.

Table containing $N_{T, max}$ with the expected aggregate count $E [N (T)]$ on June 24, 2021

State	$N_{T, max}$	$E [N (T) \|, N_{0}]$
AK	365	364
AL	11, 651	11, 603
AR	5932	5915
AZ	18, 316	17, 528
CA	65, 907	64, 308
CO	6752	6344
CT	8391	7680
DE	1700	1600
FL	37, 763	37, 592
GA	21, 786	21, 757
HI	504	503
IA	6188	6072
ID	2116	2106
IL	25, 766	24, 581
IN	14, 094	13, 583
KS	5198	5161
KY	7269	7153
LA	10, 711	10, 584
MA	18, 129	17, 085
MD	9747	9111
ME	854	780
MI	20, 256	18, 718
MN	7595	7367
MO	9234	9179
MS	7474	7401
MT	1640	1638
NC	13, 581	13, 412
ND	1526	1526
NE	2281	2257
NH	1399	1264
NJ	25, 936	24, 031
NM	4369	4279
NV	5754	5630
NY	20, 259	18, 310
OH	20, 735	19, 714
OK	7485	7408
OR	2689	2624
PA	28, 225	25, 783
RI	2822	2647
SC	10, 004	9943
SD	2026	2020
TN	12, 630	12, 507
TX	51, 732	51, 136
UT	2334	2285
VA	11, 499	10, 979
VT	267	230
WA	5828	5687
WI	7977	7899
WV	2854	2822
WY	739	738

Open in a new tab

Using the result obtained in (3.7), we plot the graph of the expected value of the aggregate death counts $E (N (t) |, N_{0})$ with time for each states in the United States in Figs. 7 and 8. The graph shows that the expected value of the aggregate death counts is increasing for $t \geq 0$ . We also see from the graphs that the expected aggregate death counts started slowing down sometimes around the month of December 2020 for most states. This number is still slowing down as at the time this analysis is carried out (June 2021).

Summary and Discussion

In this work, we study and analyze the aggregate death counts N(t) resulting from the Covid-19 virus infection in the United States. Recent studies use the deterministic logistic model to analyze this counts by assuming the death count’s per capita growth rate of the Covid-19 virus is constant over time. Our studies show that this is not the case. We assume, based on some analysis, that this growth rate can be affected by external perturbations causing fluctuations that can be modeled as a white noise described using a Wiener process. This assumption is used to modify and extend the existing logistic model to a stochastic differential equation. We analyze this model by first showing that it has a unique solution and its solution is bounded, with probability one. Analogue to an increasing function, we show that the process N(t) is a submartingale that converges almost surely to a random variable on the long run if $μ$ is greater than $σ^{2} / 2$ . By calculating the first hitting time when the aggregate death counts reach $K - ϵ$ for small $ϵ > 0$ , we calculate an approximate expected time when the death counts slow down for each state in the United States. To do this, we first calculate the probability of the first passage time $τ_{ϵ}$ described in (4.2), and later calculate the expected first passage time. Our result shows that the aggregate death counts is now slowing down as at June 2021 when this analysis is conducted. By comparing the estimate $K / 2$ with the estimate N(T) and $E [N (T) |, N_{0}]$ for the current day, June 24, 2021, our analysis shows that the Covid-19 death crisis slows down in the month of June in most states in the United States.

By deriving the transition probability density function for the process N(t), we show that the expected aggregate death counts is bounded, and approaches $K$ asymptotically. We also studied the distribution of the counts for the time T when this analysis is conducted and our result shows that the distribution concentrates on the expected aggregate count for that day (June 24, 2021).

Using the Maximum Likelihood Estimate scheme, we estimate the epidemiological parameters $K$ , $μ$ , and $σ$ using (3.3)–(3.4). These are used together with the Milstein scheme (5.2) to simulate and forecast the aggregate death counts for each states in the United States. A $95 %$ confidence interval is provided for the forecast result. To show that our model (2.3) performs better than existing model (1.1), we compare the root mean square errors $RAMSE$ and $RMSE$ for the stochastic and deterministic models, respectively, and show that $RAMSE < RMSE$ . This research is still ongoing and an update will be provided when available.

Footnotes

https://www.cdc.gov/museum/timeline/covid19.html, accessed 06.04.2022

https://covid.cdc.gov/covid-data-tracker, accessed 06.24.2021 at 9:09PM.

https://www.fda.gov/emergency-preparedness-and-response/coronavirus-disease-2019-covid-19/covid-19-frequently-asked-questions, accessed 03.09.2021.

⁴

https://www.fda.gov/emergency-preparedness-and-response/coronavirus-disease-2019-covid-19/janssen-covid-19-vaccine, accessed 06.08.2021.

⁵

https://www.healthline.com/health/vaccinations/johnson-and-johnson-vaccine, accessed 06.07.2021.

⁶

https://data.cdc.gov/Case-Surveillance/United-States-COVID-19-Cases-and-Deaths-by-State-o/9mfq-cb36/data.

⁷

https://data.cdc.gov/Case-Surveillance/United-States-COVID-19-Cases-and-Deaths-by-State-o/9mfq-cb36, accessed 06.25.2021.

⁸

Two solutions $x_{1} : [t_{0}, T] \to ℜ^{+}$ and $x_{2} : [t_{0}, T] \to ℜ^{+}$ are said to be equivalent if $P (x_{1} (t) = x_{2} (t) for all t \in [t_{0}, T]) = 1$ .

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Olusegun Michael Otunuga, Email: ootunuga@augusta.edu.

Oluwaseun Otunuga, Email: otunuga1@mail.usf.edu.

References

Arnold L. Stochastic differential equations: theory and applications. New York: Wiley; 1974. [Google Scholar]
Beddington JR, May RM. Harvesting natural populations in a randomly fluctuating environment. Science. 1977;197:463–465. doi: 10.1126/science.197.4302.463. [DOI] [PubMed] [Google Scholar]
Baud D, Qi X, Nielsen-Saines K, Musso D, Pomar Léo, Favre Guillaume. Real estimates of mortality following COVID-19 infection. Lancet Infect Dis. 2020;20(7):773. doi: 10.1016/S1473-3099(20)30195-X. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bhapkar HR, Mahalle PN, Dey N, Santosh KC. Revisited COVID-19 mortality and recovery rates: are we missing recovery time period? J Med Syst. 2020;44(12):202. doi: 10.1007/s10916-020-01668-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
Coleman TF, Li Y. An interior, trust region approach for nonlinear minimization subject to bounds. SIAM J Optim. 1996;6:418–45. doi: 10.1137/0806023. [DOI] [Google Scholar]
Gaines JG, Lyons TJ. Random generation of stochastic area integrals. SIAM J Appl Math. 1994;54(4):1132–1146. doi: 10.1137/S0036139992235706. [DOI] [Google Scholar]
Gardiner CW. Handbook of stochastic methods for physics, chemistry and the natural sciences. New York: Springer-Verlag; 1985. [Google Scholar]
Lv J, Liu H, Zou X. Stationary distribution and persistence of a stochastic predator-prey model with a functional response. J Appl Anal Comput. 2019;9(1):1–11. [Google Scholar]
Kaciroti NA, Lumeng C, Parekh V, Boulton ML. A bayesian mixture model for predicting the COVID-19 related mortality in the United States. Am J Trop Med Hyg. 2021;104(4):1484–1492. doi: 10.4269/ajtmh.20-1147. [DOI] [PMC free article] [PubMed] [Google Scholar]
Khasminskii R. Stochastic stability of differential equations. 2. Berlin Heidelberg: Springer-Verlag; 2012. p. 66. [Google Scholar]
Kloeden PE, Platen E. Numerical solution of stochastic differential equations. New York: Springer-Verlag; 1995. [Google Scholar]
Lagarto S, Braumann CA. Modeling human population death rates: ABi-dimensional stochastic Gompertz model with correlated wiener processes. In: Pacheco A, Santos R, Oliveira M, Paulino C, editors. New advances in statistical modeling and applications. Studies in theoretical and applied statistics. Cham: Springer; 2014. [Google Scholar]
Li W, Wang K. Optimal harvesting policy for general stochastic logistic population model. J Math Anal Appl. 2010;368:420–428. doi: 10.1016/j.jmaa.2010.04.002. [DOI] [Google Scholar]
Linka K, Peirlinck M, Kuhl E. The reproduction number of COVID-19 and its correlation with public health interventions. Comput Mech. 2020;66(4):1035–1050. doi: 10.1007/s00466-020-01880-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lungu EM, Øksendal B. Optimal harvesting from a population model in a Stochastic Crowded Environment. Math Biosci. 1997;145:47–75. doi: 10.1016/S0025-5564(97)00029-1. [DOI] [PubMed] [Google Scholar]
May RM. An algorithm for least-squares estimation of nonlinear parameters. SIAM J Appl Math. 1963;11(2):431–41. doi: 10.1137/0111030. [DOI] [Google Scholar]
Mao X. Stochastic differential equations and applications. 2. Chichester: Horwood; 2007. [Google Scholar]
Mazzuco S, Scarpa B, Zanotto L. A mortality model based on a mixture distribution function. Popul Stud. 2018;72(3):1–10. doi: 10.1080/00324728.2018.1439519. [DOI] [PubMed] [Google Scholar]
Mendez V, Campos D, Horsthemke W. Stochastic fluctuations of the transmission rate in the susceptible-infected-susceptible epidemic model. Phys Rev E. 2012;86:011919. doi: 10.1103/PhysRevE.86.011919. [DOI] [PubMed] [Google Scholar]
Ndairou F, Area I, Nieto JJ, Torres DFM. Mathematical modeling of COVID-19 transmission dynamics with a case study of Wuhan. Chaos Solitons Fractals. 2020;135:109846. doi: 10.1016/j.chaos.2020.109846. [DOI] [PMC free article] [PubMed] [Google Scholar]
Okuonghae D, Omame A. Analysis of a mathematical model for COVID-19 population dynamics in Lagos, Nigeria. Chaos Solitons Fractals. 2020;139:110032. doi: 10.1016/j.chaos.2020.110032. [DOI] [PMC free article] [PubMed] [Google Scholar]
Øksendal B. Stochastic differential equations, An introduction with applications. Berlin Heidelberg, New York: Springer-Verlag; 2003. [Google Scholar]
Ladde GS, Otunuga OM, Ladde NS (2020) Local lagged adapted generalized method of moments Dynamic Process. U.S. Patent Number: 10719578
Otunuga OM. Time-dependent probability distribution for the number of infection in a stochastic SIS model: case study COVID-19. Chaos Solitons Fractals. 2021;147:110983. doi: 10.1016/j.chaos.2021.110983. [DOI] [PMC free article] [PubMed] [Google Scholar]
Otunuga OM. Time-dependent probability density function for general stochastic logistic population model with harvesting effort. Phys A. 2021;573:1–33. doi: 10.1016/j.physa.2021.125931. [DOI] [Google Scholar]
Otunuga OM. Qualitative analysis of a stochastic SEITR epidemic model with multiple stages of infection and treatment. Infect Dis Modell. 2020;5:61–90. doi: 10.1016/j.idm.2019.12.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
Otunuga OM. Closed-form probability distribution of number of infections at a given time in a stochastic SIS epidemic model. Heliyon. 2019;5:1–12. doi: 10.1016/j.heliyon.2019.e02499. [DOI] [PMC free article] [PubMed] [Google Scholar]
Mummert A, Otunuga OM. Parameter identification for a stochastic SEIRS epidemic model: case study influenza. J Math Biol. 2019;79(2):705–729. doi: 10.1007/s00285-019-01374-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
Otunuga OM. Global stability for a 2n + 1 dimensional HIV/AIDS epidemic model with treatments. Math Biosci. 2018;5:138–52. doi: 10.1016/j.mbs.2018.03.013. [DOI] [PubMed] [Google Scholar]
Pelinovsky E, Kurkin A, Kurkina O, Kokoulina M, Epifanova A. Logistic equation and COVID-19. Chaos Solitons Fractals. 2020;140:110241. doi: 10.1016/j.chaos.2020.110241. [DOI] [PMC free article] [PubMed] [Google Scholar]
Pella JS, Tomlinson PK. A generalised stock-production model. Bull Int Am Trop Tuna Commun. 1969;13:421–496. [Google Scholar]
Prajneshu Time dependent solution of the logistic model for population growth in random environment. J Appl Prob. 1980;17:1083–1086. doi: 10.2307/3213218. [DOI] [Google Scholar]
Santosh KC. COVID-19 prediction models and unexploited data. J Med Syst. 2020;44:170. doi: 10.1007/s10916-020-01645-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
Satpathy S, Mangla M, Sharma N, Deshmukh H, Mohanty S. Predicting mortality rate and associated risks in COVID-19 patients. Spat Inf Res. 2021;29(4):455–464. doi: 10.1007/s41324-021-00379-5. [DOI] [Google Scholar]
Stutt Rojh, Retkute R, Bradley M, Gilligan CA, Colvin J. A modelling framework to assess the likely effectiveness of facemasks in combination with ‘lock-down’ in managing the COVID-19 pandemic. Proc R Soc A. 2020;476:20200376. doi: 10.1098/rspa.2020.0376. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wang P, Zheng X, Li J, Zhu B. Prediction of epidemic trends in COVID-19 with logistic model and machine learning technics. Chaos Solitons Fractals. 2020;139:110058. doi: 10.1016/j.chaos.2020.110058. [DOI] [PMC free article] [PubMed] [Google Scholar]
Verhulst Pierre-Francois (1838) Notice sur la loi que la population poursuit dans son accroissement. Correspondance Mathématique et Physique. 10:113-121. Retrieved 3 Dec 2014
West BJ, Bulsara AR, Lindenberg K, Seshadri V, Shuler KE. Stochastic processes with non-additive fluctuations: I. Itô and Stratonovich calculus and the effects of correlations. Phys A. 1979;97(2):211–233. doi: 10.1016/0378-4371(79)90103-1. [DOI] [Google Scholar]
Wong E, Zakai M. On the convergence of ordinary integrals to stochastic integrals. Ann Math Stat. 1965;36(5):1560–1564. doi: 10.1214/aoms/1177699916. [DOI] [Google Scholar]
Wu JT, Leung K, Leung GM. ”Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study. Lancet. 2020;35(10225):689–697. doi: 10.1016/S0140-6736(20)30260-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
Yang B, Cai Y, Wang K, Wang W. Optimal harvesting policy of logistic population model in a randomly fluctuating environment. Phys A. 2019;526:120817. doi: 10.1016/j.physa.2019.04.053. [DOI] [Google Scholar]
Zocchetti C, Consonni D. Mortality rate and its statistical properties. Med Lav. 1994;85(4):327–43. [PubMed] [Google Scholar]

[CR1] Arnold L. Stochastic differential equations: theory and applications. New York: Wiley; 1974. [Google Scholar]

[CR2] Beddington JR, May RM. Harvesting natural populations in a randomly fluctuating environment. Science. 1977;197:463–465. doi: 10.1126/science.197.4302.463. [DOI] [PubMed] [Google Scholar]

[CR3] Baud D, Qi X, Nielsen-Saines K, Musso D, Pomar Léo, Favre Guillaume. Real estimates of mortality following COVID-19 infection. Lancet Infect Dis. 2020;20(7):773. doi: 10.1016/S1473-3099(20)30195-X. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR4] Bhapkar HR, Mahalle PN, Dey N, Santosh KC. Revisited COVID-19 mortality and recovery rates: are we missing recovery time period? J Med Syst. 2020;44(12):202. doi: 10.1007/s10916-020-01668-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR5] Coleman TF, Li Y. An interior, trust region approach for nonlinear minimization subject to bounds. SIAM J Optim. 1996;6:418–45. doi: 10.1137/0806023. [DOI] [Google Scholar]

[CR6] Gaines JG, Lyons TJ. Random generation of stochastic area integrals. SIAM J Appl Math. 1994;54(4):1132–1146. doi: 10.1137/S0036139992235706. [DOI] [Google Scholar]

[CR7] Gardiner CW. Handbook of stochastic methods for physics, chemistry and the natural sciences. New York: Springer-Verlag; 1985. [Google Scholar]

[CR8] Lv J, Liu H, Zou X. Stationary distribution and persistence of a stochastic predator-prey model with a functional response. J Appl Anal Comput. 2019;9(1):1–11. [Google Scholar]

[CR9] Kaciroti NA, Lumeng C, Parekh V, Boulton ML. A bayesian mixture model for predicting the COVID-19 related mortality in the United States. Am J Trop Med Hyg. 2021;104(4):1484–1492. doi: 10.4269/ajtmh.20-1147. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR10] Khasminskii R. Stochastic stability of differential equations. 2. Berlin Heidelberg: Springer-Verlag; 2012. p. 66. [Google Scholar]

[CR11] Kloeden PE, Platen E. Numerical solution of stochastic differential equations. New York: Springer-Verlag; 1995. [Google Scholar]

[CR12] Lagarto S, Braumann CA. Modeling human population death rates: ABi-dimensional stochastic Gompertz model with correlated wiener processes. In: Pacheco A, Santos R, Oliveira M, Paulino C, editors. New advances in statistical modeling and applications. Studies in theoretical and applied statistics. Cham: Springer; 2014. [Google Scholar]

[CR13] Li W, Wang K. Optimal harvesting policy for general stochastic logistic population model. J Math Anal Appl. 2010;368:420–428. doi: 10.1016/j.jmaa.2010.04.002. [DOI] [Google Scholar]

[CR14] Linka K, Peirlinck M, Kuhl E. The reproduction number of COVID-19 and its correlation with public health interventions. Comput Mech. 2020;66(4):1035–1050. doi: 10.1007/s00466-020-01880-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR15] Lungu EM, Øksendal B. Optimal harvesting from a population model in a Stochastic Crowded Environment. Math Biosci. 1997;145:47–75. doi: 10.1016/S0025-5564(97)00029-1. [DOI] [PubMed] [Google Scholar]

[CR16] May RM. An algorithm for least-squares estimation of nonlinear parameters. SIAM J Appl Math. 1963;11(2):431–41. doi: 10.1137/0111030. [DOI] [Google Scholar]

[CR17] Mao X. Stochastic differential equations and applications. 2. Chichester: Horwood; 2007. [Google Scholar]

[CR18] Mazzuco S, Scarpa B, Zanotto L. A mortality model based on a mixture distribution function. Popul Stud. 2018;72(3):1–10. doi: 10.1080/00324728.2018.1439519. [DOI] [PubMed] [Google Scholar]

[CR19] Mendez V, Campos D, Horsthemke W. Stochastic fluctuations of the transmission rate in the susceptible-infected-susceptible epidemic model. Phys Rev E. 2012;86:011919. doi: 10.1103/PhysRevE.86.011919. [DOI] [PubMed] [Google Scholar]

[CR20] Ndairou F, Area I, Nieto JJ, Torres DFM. Mathematical modeling of COVID-19 transmission dynamics with a case study of Wuhan. Chaos Solitons Fractals. 2020;135:109846. doi: 10.1016/j.chaos.2020.109846. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR21] Okuonghae D, Omame A. Analysis of a mathematical model for COVID-19 population dynamics in Lagos, Nigeria. Chaos Solitons Fractals. 2020;139:110032. doi: 10.1016/j.chaos.2020.110032. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR22] Øksendal B. Stochastic differential equations, An introduction with applications. Berlin Heidelberg, New York: Springer-Verlag; 2003. [Google Scholar]

[CR23] Ladde GS, Otunuga OM, Ladde NS (2020) Local lagged adapted generalized method of moments Dynamic Process. U.S. Patent Number: 10719578

[CR24] Otunuga OM. Time-dependent probability distribution for the number of infection in a stochastic SIS model: case study COVID-19. Chaos Solitons Fractals. 2021;147:110983. doi: 10.1016/j.chaos.2021.110983. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR25] Otunuga OM. Time-dependent probability density function for general stochastic logistic population model with harvesting effort. Phys A. 2021;573:1–33. doi: 10.1016/j.physa.2021.125931. [DOI] [Google Scholar]

[CR26] Otunuga OM. Qualitative analysis of a stochastic SEITR epidemic model with multiple stages of infection and treatment. Infect Dis Modell. 2020;5:61–90. doi: 10.1016/j.idm.2019.12.003. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR27] Otunuga OM. Closed-form probability distribution of number of infections at a given time in a stochastic SIS epidemic model. Heliyon. 2019;5:1–12. doi: 10.1016/j.heliyon.2019.e02499. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR28] Mummert A, Otunuga OM. Parameter identification for a stochastic SEIRS epidemic model: case study influenza. J Math Biol. 2019;79(2):705–729. doi: 10.1007/s00285-019-01374-z. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR29] Otunuga OM. Global stability for a 2n + 1 dimensional HIV/AIDS epidemic model with treatments. Math Biosci. 2018;5:138–52. doi: 10.1016/j.mbs.2018.03.013. [DOI] [PubMed] [Google Scholar]

[CR30] Pelinovsky E, Kurkin A, Kurkina O, Kokoulina M, Epifanova A. Logistic equation and COVID-19. Chaos Solitons Fractals. 2020;140:110241. doi: 10.1016/j.chaos.2020.110241. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR31] Pella JS, Tomlinson PK. A generalised stock-production model. Bull Int Am Trop Tuna Commun. 1969;13:421–496. [Google Scholar]

[CR32] Prajneshu Time dependent solution of the logistic model for population growth in random environment. J Appl Prob. 1980;17:1083–1086. doi: 10.2307/3213218. [DOI] [Google Scholar]

[CR33] Santosh KC. COVID-19 prediction models and unexploited data. J Med Syst. 2020;44:170. doi: 10.1007/s10916-020-01645-z. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR34] Satpathy S, Mangla M, Sharma N, Deshmukh H, Mohanty S. Predicting mortality rate and associated risks in COVID-19 patients. Spat Inf Res. 2021;29(4):455–464. doi: 10.1007/s41324-021-00379-5. [DOI] [Google Scholar]

[CR35] Stutt Rojh, Retkute R, Bradley M, Gilligan CA, Colvin J. A modelling framework to assess the likely effectiveness of facemasks in combination with ‘lock-down’ in managing the COVID-19 pandemic. Proc R Soc A. 2020;476:20200376. doi: 10.1098/rspa.2020.0376. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR36] Wang P, Zheng X, Li J, Zhu B. Prediction of epidemic trends in COVID-19 with logistic model and machine learning technics. Chaos Solitons Fractals. 2020;139:110058. doi: 10.1016/j.chaos.2020.110058. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR37] Verhulst Pierre-Francois (1838) Notice sur la loi que la population poursuit dans son accroissement. Correspondance Mathématique et Physique. 10:113-121. Retrieved 3 Dec 2014

[CR38] West BJ, Bulsara AR, Lindenberg K, Seshadri V, Shuler KE. Stochastic processes with non-additive fluctuations: I. Itô and Stratonovich calculus and the effects of correlations. Phys A. 1979;97(2):211–233. doi: 10.1016/0378-4371(79)90103-1. [DOI] [Google Scholar]

[CR39] Wong E, Zakai M. On the convergence of ordinary integrals to stochastic integrals. Ann Math Stat. 1965;36(5):1560–1564. doi: 10.1214/aoms/1177699916. [DOI] [Google Scholar]

[CR40] Wu JT, Leung K, Leung GM. ”Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study. Lancet. 2020;35(10225):689–697. doi: 10.1016/S0140-6736(20)30260-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR41] Yang B, Cai Y, Wang K, Wang W. Optimal harvesting policy of logistic population model in a randomly fluctuating environment. Phys A. 2019;526:120817. doi: 10.1016/j.physa.2019.04.053. [DOI] [Google Scholar]

[CR42] Zocchetti C, Consonni D. Mortality rate and its statistical properties. Med Lav. 1994;85(4):327–43. [PubMed] [Google Scholar]

PERMALINK

Stochastic Modeling and Forecasting of Covid-19 Deaths: Analysis for the Fifty States in the United States

Olusegun Michael Otunuga

Oluwaseun Otunuga

Abstract

Introduction

Methodology

Data Sources

Modeling the Covid-19 Aggregate Death Counts

Fig. 1.

Fig. 2.

Definition 1

Theorem 1

Theorem 2

Proof

Theorem 3

Proof

Remark 1

Theorem 4

Proof

Remark 2

Probability Distribution of the Aggregate Number of Deaths Following (2.3)

Parameter Estimates

Remark 3

Expected and Simulated Number of Deaths

Theorem 5

Proof

Theorem 6

Proof

Predicting the End of Covid-19 Death

Definition 2

Theorem 7

Proof

Remark 4

Corollary 8

Proof

Fig. 7.

Fig. 8.

Numerical Simulation and Forecast for the Aggregate Death Counts in the United States

Table 1.

Table 2.

Fig. 3.

Fig. 4.

Fig. 5.

Fig. 6.

Table 3.

Summary and Discussion

Footnotes

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases