Spectral Method in Epidemic Time Series: Application to COVID-19 Pandemic

Jacques Demongeot; Pierre Magal

doi:10.3390/biology11121825

2022 Dec 14;11(12):1825. doi: 10.3390/biology11121825

Editors: Tianwei Yu, Jukka Finne

PMCID: PMC9775943 PMID: 36552333

Abstract

Simple Summary

This article aims to study the time series provided by the daily number of reported cases of COVID-19. During the COVID-19 pandemic, most people viewed the oscillations around the exponential growth at the beginning of an epidemic wave as a defect in the reporting of the data. The residual is probably partly due to the data reporting process (random noise). Nevertheless, a significant remaining part of such oscillations could be connected to the infection dynamics at the level of a single average patient. The central question we try to address here is: Is there some hidden information in the signal around the exponential tendency for COVID-19 data?

Abstract

Background: The age of infection plays an important role in assessing an individual’s daily level of contagiousness, quantified by the daily reproduction number. We derive an autoregressive moving average model from a daily discrete-time epidemic model based on a difference equation involving the age of infection. Novelty: The article’s main idea is to use a part of the spectrum associated with this difference equation to describe the data and the model. Results: We present some results on the identification of the model parameters when all the eigenvalues are known. This method, when applied to Japan’s third epidemic wave of COVID-19, fails to preserve the positivity of the daily reproduction numbers. This problem led us to develop an original truncated spectral method, which we apply to the Japanese data. We start by considering ten days and then extend our analysis to one month. Conclusion: We can identify the shape of the daily reproduction numbers curve throughout the contagion period using only a few eigenvalues to fit the data.

Keywords: epidemic models, time series, spectral method, spectral truncation method, phenomenological models

1. Introduction

Modeling an epidemic peak requires precise knowledge of the daily data corresponding to new cases. One of the aims of the paper is to extract the value of the average daily reproduction numbers. The daily reproduction numbers vary from individual to individual and from day to day during the period of contagiousness of an individual. These numbers depend on the age of infection, i.e., the number of days since the individual contracted the infectious disease.

From a discrete model of the evolution of new daily cases, we propose to evaluate the average number $R_{0} (d)$ of secondary infected individuals produced by a single infected individual on each day d since infection. For this purpose, in addition to the dominant eigenvalue, we estimate from the data other significant (complex) subdominant eigenvalues, which explain the modulation of the growth and allow a better fit of the model to the data.

For that purpose, we reconsider the discrete-time epidemic model with the age of infection presented in Demongeot et al. [1]. This model is a discrete-time version of the Volterra integral formulation of the Kermack–McKendrick model with age of infection [2]. The variation of the number of susceptible individuals $S (t)$ is given each day $t = t_{0}, t_{0} + 1, \dots,$ by

S (t) = S_{0} - \sum_{d = t_{0}}^{t - 1} N (d), \forall t \geq t_{0},

(1)

where $S (t)$ is the number of susceptible individuals at time t, and $N (t)$ is the daily number of new infected at time t. Throughout the paper, we use the following convention for the sum

\sum_{d = k}^{m} = 0, whenever m < k .

As a consequence, when $t = t_{0}$ , (1) gives

S (t_{0}) = S_{0} .

We assume for simplicity that the epidemic starts from a single cohort of infected at time $t_{0}$ , then the number of infectious individuals is given by

I (t) = [Γ (t - t_{0}) I_{0} + \sum_{d = 1}^{t - t_{0}} Γ (d) N (t - d)],

(2)

where $I_{0}$ is the number of infected individuals at time $t_{0}$ , and $Γ (d)$ is the probability for an infected individual to be infectious after d days of infection. In particular, we have $Γ (0) = 0$ .

We assume that the number $N (t)$ of new infected at time t is the product of the transmission rate $τ (t)$ with the number $S (t)$ of susceptible individuals and the number $I (t)$ of infectious at time t. That is,

N (t) = τ (t) S (t) I (t) .

(3)

By replacing $I (t)$ by the right hand side of (2) in (3), we obtain

N (t) = τ (t) S (t) [Γ (t - t_{0}) I_{0} + \sum_{d = 1}^{t - t_{0}} Γ (d) N (t - d)] .

(4)

Now assuming that $τ (t) = τ_{0}$ and $S (t) = S_{0}$ are constant (over a short period of time), then we define the daily reproduction numbers as

R_{0} (d) = τ_{0} S_{0} Γ (d), \forall d \geq 0 .

The quantity $R_{0} (d)$ is the average number of secondary infected produced by a single infected on the day d since infection (see [1] for more details). Therefore, the basic reproduction number is the following quantity

R_{0} = \sum_{d = 1}^{n} R_{0} (d),

(5)

where n is the maximal duration (in days) of the infection.

Moreover, when $τ (t) = τ_{0}$ and $S (t) = S_{0}$ are constant, Equation (4) becomes a linear discrete time Volterra integral equation

N (t) = \underbrace{R_{0} (t - t_{0}) I_{0}}_{(I)} + \underbrace{\sum_{d = 1}^{t - t_{0}} R_{0} (d) N (t - d)}_{(II)}, \forall t \geq t_{0},

(6)

where (I) is the number of infected produced directly by the $I_{0}$ infected individuals already present on day $t_{0}$ , and (II) is the number of new infected individuals at time t produced by the new infected individuals since day $t_{0}$ .

If we consider the first terms of the discrete time Volterra Equation (6), we obtain

\begin{matrix} N (t_{0}) & = R_{0} (0) I_{0}, \\ N (t_{0} + 1) & = R_{0} (1) I_{0} + R_{0} (1) N (t_{0}), \\ N (t_{0} + 2) & = R_{0} (2) I_{0} + R_{0} (2) N (t_{0}) + R_{0} (1) N (t_{0} + 1), \\ N (t_{0} + 3) & = R_{0} (3) I_{0} + R_{0} (3) N (t_{0}) + R_{0} (2) N (t_{0} + 1) + R_{0} (1) N (t_{0} + 2), \\ ⋮ \end{matrix}

In practice, we can assume that $R_{0} (0) = 0$ since infected individuals are not infectious immediately after being infected. Under this additional assumption, we obtain the system

\begin{matrix} N (t_{0}) & = 0, \\ N (t_{0} + 1) & = R_{0} (1) I_{0}, \\ N (t_{0} + 2) & = R_{0} (2) I_{0} + R_{0} (1) N (t_{0} + 1), \\ N (t_{0} + 3) & = R_{0} (3) I_{0} + R_{0} (2) N (t_{0} + 1) + R_{0} (1) N (t_{0} + 2), \\ ⋮ \end{matrix}

Therefore, (6) can be rewritten as a scalar delay difference equation

N (t) = R_{0} (1) N (t - 1) + \dots + R_{0} (t - t_{0} - 1) N (t_{0} + 1) + R_{0} (t - t_{0}) I_{0}, \forall t \geq t_{0} .

(7)
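The recursion in Equations (6) and (7) can be iterated directly. The following is a minimal sketch rather than the authors' implementation: the function name `simulate_volterra`, the kernel values, and the choice $I_{0} = 1$ are all hypothetical and serve only to illustrate how terms (I) and (II) combine.

```python
def simulate_volterra(r0, i0, steps):
    """Iterate the discrete Volterra equation (6).

    r0[d] holds R_0(d) for d = 0, 1, ..., n (with r0[0] = 0);
    R_0(d) is taken to be 0 for d > n.
    Returns [N(t0), N(t0 + 1), ..., N(t0 + steps)].
    """
    def r(d):
        return r0[d] if d < len(r0) else 0.0

    cases = []
    for k in range(steps + 1):  # k = t - t0
        direct = r(k) * i0      # term (I): infections caused by the initial cohort
        renewal = sum(r(d) * cases[k - d] for d in range(1, k + 1))  # term (II)
        cases.append(direct + renewal)
    return cases


# Hypothetical kernel with n = 3 and a single initial infected individual.
r0_example = [0.0, 0.5, 0.4, 0.3]
new_cases = simulate_volterra(r0_example, i0=1.0, steps=5)
```

With this kernel, term (I) vanishes from day $t_{0} + 4$ onward, so the later values depend only on the three previous days.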

Assume that the infectious period is n days. That is

R_{0} (a) = 0, \forall a \geq n + 1 .

Then by defining $t_{1} = t_{0} + n + 1$ , Equation (6) becomes

N (t) = \sum_{d = 1}^{n} R_{0} (d) N (t - d), \forall t \geq t_{1},

(8)

with the initial values

N (t) = N_{0} (t), \forall t \in [t_{1} - n, t_{1}] .

(9)

The goal of this article is to understand how to identify the daily reproduction numbers $d \in \{1, \dots, n\} \mapsto R_{0} (d)$ in (8) knowing $t \in [t_{1}, t_{2}] \mapsto N (t)$ on some finite time interval. This problem is particularly important to derive the average dynamic of infection at the level of a single patient.

One of the aims of this paper is to investigate the variations of the daily reproduction number $d \in \{1, \dots, n\} \mapsto R_{0} (d)$ during the period of contagiousness of infectious individuals.

In influenza, the daily reproduction number is not constant over this period, as shown in simulated data [3] and in real infected animals, in which a U-shaped evolution of the viral load and of symptoms such as body temperature is observed during the contagiousness period. From there, it is possible to suspect a U-shaped variation in their ability to emit the virus (aerosol transmission) and, therefore, to contaminate others [4].

After the first asymptomatic period (without contagiousness), the daily reproduction number increases. After one to three days, this number decreases due to the action of the innate immune system, the first line of defense. However, the virus overcomes this first immune defense, and the daily reproduction number increases again before the adaptive immune system acts. Then, after two to four days, the adaptive immune response becomes fully effective. The combination of these biological mechanisms produces the U- or M-shaped curve of the daily reproduction numbers.

The literature on parameter identification for epidemic models with age of infection can be divided into two groups of articles depending on the assumptions made. The first group assumes that $d \mapsto Γ (d)$ is a given function and estimates the time-dependent transmission rate $t \mapsto τ (t)$ . As a consequence, they obtain the instantaneous (daily or effective) reproduction number, which is

R_{0} (t) = τ (t) S (t) \sum_{d = 1}^{n} Γ (d) .

We refer to [5,6,7,8,9,10,11,12] (and references therein) for more results about this subject.

The second group corresponds to the assumptions considered here. That is, we assume that $t \mapsto τ (t) = τ_{0}$ and $t \mapsto S (t) = S_{0}$ are constant functions (over a short period of time) and estimate the daily reproduction number. That is the case for the discrete time model in [13] and more recently for the continuous time model in [1]. The major flaw in [13] is that the estimated $d \mapsto R_{0} (d)$ does not remain positive. We encounter the same problem in Section 3.1 when we use the full spectrum. In Section 3.2, to solve this problem, we introduce a method using the dominant and secondary eigenvalues only.

This article aims to investigate the shape of the distribution $d \mapsto R_{0} (d)$ from the data of COVID-19. In Figure 1, we illustrate the notion of U- or M-shaped distribution.

In this figure, we illustrate the notion of a U-shaped distribution in (a) and an M-shaped distribution in (b). Recall that $R_{0} (d)$ represents the ability of patients to transmit the pathogen d days after they were infected. A U- or M-shaped distribution means that patients can transmit the pathogen from the beginning of their infection, become less infectious in the middle of the infected period, and finally become infectious again at the end of the infected period. The only difference between the U and M shapes is that the M-shaped plot also includes days 0 and 8, with $R_{0} (0) = R_{0} (8) = 0$ .

The U- and M-shaped distributions are well known in the context of influenza [3,4]. In Figure 2, we present some figures reflecting patients’ viral load for COVID-19.

Viral load in COVID-19 real patients [14]. In (a), the red curve corresponds to the throat swab and the blue curve corresponds to the sputum. In (b), the curves correspond to several patients (A), (B), and (C).

Such a U shape has not yet been systematically studied in COVID-19 data, but observations of the evolution of the viral load have been made in some patients and show this U shape. Figure 2 shows such a U-shaped evolution for the viral load in real patients [14].

The present work is directly connected to the original work of Peter Whittle in 1951 [15,16], who introduced the Auto Regressive Moving Average (ARMA) model, after the seminal paper on time series by N. Wiener [17],

N (t) = \underbrace{K (1) N (t - 1) + K (2) N (t - 2) + \dots + K (n) N (t - n)}_{\text{Auto regressive part}} + \underbrace{w (t)}_{\text{Moving average part}},

(10)

where $N (t)$ is the size at time t of the population whose growth is forecasted, the kernel $d \mapsto K (d)$ has real values, n is the regression order, and here $w (t)$ stands for a noise. Equation (10) has been extensively studied under the denomination of ARMA models by many authors [18,19,20,21,22,23,24].

Here, we propose a new approach based on the spectral properties of the population growth equation to capture information from data. Our goal is to estimate the shape of the daily reproduction numbers $d \mapsto R_{0} (d)$ . Spectral methods are not new (see Priestley [20,25]), but the term usually refers to the Fourier transform, with frequencies associated with various periods corresponding to a fundamental period and its sub-multiples (harmonics). If we consider the auto regressive part only, the spectrum of the delay difference equation is determined by its characteristic equation

λ^{n} = K (1) λ^{n - 1} + K (2) λ^{n - 2} + \dots + K (n - 1) λ + K (n) .

The main idea in this article is to use these eigenvalues $λ_{1}, λ_{2}, \dots, λ_{n} \in C$ (i.e., the solutions of the characteristic equation) to identify the parameters $K (1), K (2), \dots, K (n)$ . The eigenvalues $λ_{1}, λ_{2}, \dots, λ_{n} \in C$ are estimated by a separate method. In Section 2, we will see that when all the eigenvalues are nonzero and pairwise distinct, we can compute the parameters $K (1), K (2), \dots, K (n)$ by using the eigenvalues only.

The idea of using eigenvalues in population dynamics goes back to Malthus [26], who, in 1798, first identified in a mixture of populations the one that would impose itself on the others, determined through the exponential growth of the largest exponent—this leading exponent having been called Malthusian parameter by Fisher [27]. The Malthusian growth seeming unrealistic, the saturation logistic term was introduced further by Lambert [28], and then extending the initial work by Euler [29], Lotka [30], Leslie [31], and Hahn [32] gave the current matrix form of the discrete population growth equations.

However, as far as we know, estimating the subdominant eigenvalues to characterize the system is new. So the key idea of this work is to use the dominant eigenvalue $λ_{1}$ and also the following pair of complex conjugate eigenvalues $λ_{2}, {\bar{λ}}_{2}$ as an estimator to reconstruct the kernel of the auto regressive part.

This work is motivated by the time series provided by the daily numbers of reported cases of COVID-19. During the COVID-19 pandemic, most people viewed the oscillations around the exponential growth at the beginning of an epidemic wave as a defect in the reporting of the data. The residual is probably partly due to the data reporting process (random noise). Nevertheless, a significant remaining part of such oscillations could be connected to the infection dynamics at the level of a single average patient. The central question we try to address here is: Is there some hidden information in the signal around the exponential tendency for COVID-19 data? So we consider the early stage of an epidemic phase, and we try to exploit the oscillations around the tendency in order to reconstruct the infection dynamics at the level of a single average patient.

We start by investigating the connection between a signal decomposed into a sum of damped or amplified oscillations and a renewal equation. The prototype example we have in mind is the following:

N (t) = A_{1} e^{α_{1} t} + e^{α_{2} t} [A_{2} cos (ω_{2} t) + B_{2} sin (ω_{2} t)] + C, \forall t \geq t_{1} - n,

where $A_{1}, A_{2}, B_{2}, C \in R$ , $α_{1} > 0$ , $α_{2} \in R$ , and $ω_{2} > 0$ .
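To make the link between such a signal and a difference equation concrete, the sketch below checks numerically that, at integer times, the prototype signal satisfies a linear recurrence whose characteristic roots are $e^{α_{1}}$ , $e^{α_{2} \pm i ω_{2}}$ , and 1 (the last root accounts for the constant C). The parameter values and the helper `kernel_from_roots` are hypothetical, chosen only for illustration. The resulting kernel is not required to be non-negative here, so it should not be read as a set of reproduction numbers.

```python
import cmath
import math


def kernel_from_roots(roots):
    """Return [K(1), ..., K(n)] such that x^n = sum_d K(d) x^(n-d) has the given roots."""
    coeffs = [1 + 0j]  # coefficients of prod(x - root), highest degree first
    for root in roots:
        times_x = coeffs + [0j]
        times_minus_root = [0j] + [-root * c for c in coeffs]
        coeffs = [p + q for p, q in zip(times_x, times_minus_root)]
    # Conjugate pairs make the imaginary parts cancel (up to rounding).
    return [-c.real for c in coeffs[1:]]


# Hypothetical parameter values, chosen only for illustration.
A1, ALPHA1 = 1.0, 0.05
A2, B2, ALPHA2, OMEGA2 = 0.5, 0.3, -0.1, 0.9
C = 2.0


def prototype_signal(t):
    oscillation = A2 * math.cos(OMEGA2 * t) + B2 * math.sin(OMEGA2 * t)
    return A1 * math.exp(ALPHA1 * t) + math.exp(ALPHA2 * t) * oscillation + C


roots = [
    math.exp(ALPHA1),                     # exponential growth
    cmath.exp(complex(ALPHA2, OMEGA2)),   # damped oscillation ...
    cmath.exp(complex(ALPHA2, -OMEGA2)),  # ... and its complex conjugate
    1.0,                                  # constant term C
]
kernel = kernel_from_roots(roots)
signal = [prototype_signal(t) for t in range(20)]

# Largest violation of N(t) = sum_d K(d) N(t - d) over the sampled times.
recurrence_gap = max(
    abs(signal[t] - sum(kernel[d - 1] * signal[t - d] for d in range(1, 5)))
    for t in range(4, 20)
)
```

The quantity `recurrence_gap` should be at the level of floating-point rounding, which is the discrete-time counterpart of saying that the signal solves an equation of the form (10) without noise.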

In Figure 3, we illustrate a growing function with damped oscillations (i.e., $α_{2} < 0$ ) and amplified oscillations (i.e., $α_{2} > 0$ ). It is clear from Figure 3 that a periodic function cannot represent such a signal, and extending such a signal by periodicity would be artificial. Indeed, the Fourier decomposition would only provide purely imaginary eigenvalues, which would exclude a continuation of the exponential growth (i.e., eigenvalues with non-zero real parts). To apply wavelet theory (see, for example, [33]), we would need to extend the data to negative times by symmetry with respect to the initial time $t = 0$ , and we would need a decreasing function ( $α_{1} < 0$ and $α_{2} < 0$ ).

We plot an exponentially growing function with (a) damped oscillations and (b) amplified oscillations.

Here, we are more interested in the model resulting from the data (i.e., $R_{0} (d) \geq 0$ , $\forall d = 1, \dots, n$ ) than in the fit to the data. The major problem with the Fourier method is that this method provides only eigenvalues with zero real parts (that is due to the periodicity required for this method). Such eigenvalues are well adapted to a periodic signal, but this is not suitable to describe, for example, an ever-growing function (as in Figure 3). Consequently, the Fourier method is not well adapted to derive non-negative daily reproduction numbers (i.e., $R_{0} (d) \geq 0, \forall d = 1, \dots, n$ ).

Previous analogous approaches can be found in the seismic data modeling and statistical literature, like the Wiener–Levinson predictive deconvolution (Robinson [34], Peacock and Treitel [35], Robinson and Treitel [36]), which intends to estimate the minimum phase wavelet in the data, in particular in the case where the relatively weak sampling does not make it possible to affirm the Gaussian character of the errors (Walden and Hosken [37]). If the Gaussian character of the errors can be proven, another similar approach is that of the Geometric Brownian Motion (GBM) processes (Vinod et al. [38]) used, for example, in the analysis of financial data (Ritschel et al. [39]), which are based on the model of the solution of a stochastic differential equation, multiplied by a periodic component with a Gaussian noise.

The structure of this paper is as follows: Section 2 is devoted to the materials and methods. We recall some notions of matrices and spectra. We also present some phenomenological models that will be compared to the data. Section 3 contains the results. We fit the phenomenological models to the cumulative numbers of reported cases in Japan over 10 days and 30 days. We use the eigenvalues derived from the phenomenological model, and we identify the daily reproduction numbers by using: (1) all the spectrum (see Appendix B) and (2) part of the spectrum. The last section of the paper is devoted to the discussion and the conclusion. We present in the Appendices all the mathematical aspects of the paper (see Appendix A, Appendix B, Appendix C and Appendix D).

2. Materials and Methods

2.1. Identification of the Model

The Leslie matrix associated to the difference Equation (8) is

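L = (\begin{matrix} R_{0} (1) & R_{0} (2) & \cdots & R_{0} (n - 1) & R_{0} (n) \\ 1 & 0 & \cdots & 0 & 0 \\ 0 & 1 & \ddots & \vdots & \vdots \\ \vdots & \ddots & \ddots & 0 & 0 \\ 0 & \cdots & 0 & 1 & 0 \end{matrix}) .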

(11)

The characteristic equation of (11) is

λ^{n} = \sum_{d = 1}^{n} R_{0} (d) λ^{n - d},

(12)

for $λ \in C$ , which is equivalent to (whenever $λ \neq 0$ )

1 = \sum_{d = 1}^{n} R_{0} (d) λ^{- d} .

(13)

The complex numbers satisfying the characteristic equation are called the eigenvalues of L.
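As an illustration, the sketch below builds a Leslie matrix for a small kernel, assuming the usual companion form (first row $R_{0} (1), \dots, R_{0} (n)$ , ones on the subdiagonal), and checks two facts: one multiplication by L advances the recurrence (8) by one day, and a candidate eigenvalue can be tested through Equation (13). The kernel values and function names are hypothetical.

```python
def leslie_matrix(r0):
    """Companion-form Leslie matrix for the kernel [R_0(1), ..., R_0(n)]."""
    n = len(r0)
    rows = [list(r0)]
    for i in range(1, n):
        rows.append([1.0 if j == i - 1 else 0.0 for j in range(n)])
    return rows


def apply_matrix(matrix, vector):
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


def characteristic_gap(r0, lam):
    """1 - sum_d R_0(d) lam^(-d); it vanishes when lam is a nonzero eigenvalue."""
    return 1 - sum(r * lam ** (-d) for d, r in enumerate(r0, start=1))


r0 = [0.2, 0.5, 0.3]          # hypothetical kernel whose entries sum to 1
history = [3.0, 2.0, 1.0]     # (N(t - 1), N(t - 2), N(t - 3))
next_history = apply_matrix(leslie_matrix(r0), history)  # (N(t), N(t - 1), N(t - 2))
gap_at_one = characteristic_gap(r0, 1.0)
```

Because the entries of this kernel sum to 1, `gap_at_one` is zero up to rounding, that is, $λ = 1$ is an eigenvalue.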

In Appendix A and Appendix B, we discuss the identification problem of the daily reproduction numbers $R_{0} (1), \dots, R_{0} (n)$ by using the eigenvalues of L. The main identification result of Appendix B corresponds to the formula (A3).

Definition 1.

We will say that L is a Markovian Leslie matrix if all the values $d \in [1, n] \mapsto R_{0} (d)$ are non negative, and

$\sum_{d = 1}^{n} R_{0} (d) = 1 .$

2.2. Phenomenological Model to Fit the Cumulative and the Daily Numbers of Reported Case Data

Due to Lemma A1 below, we propose the following phenomenological model to represent the data

CR (t) = {CR}_{1} e^{λ_{1} t} + {CR}_{2} e^{λ_{2} t} + {CR}_{3} e^{λ_{3} t} + \dots + {CR}_{m} e^{λ_{m} t},

(14)

where ${CR}_{1}, \dots, {CR}_{m} \in C$ are nonzero, $λ_{1} = α_{1} + i ω_{1}, \dots, λ_{m} = α_{m} + i ω_{m} \in C$ are pairwise distinct, and $m \leq n$ .

Remark 1.

In the above formula, we allow the constant terms whenever $λ_{n} = 0$ .

Assuming that the unit of time is one day, we have the following relationship between the cumulative number of cases $CR (t)$ and the daily number of cases $N (t)$

CR (t) = CR (t_{0}) + \int_{t_{0}}^{t} N (σ) d σ .

We deduce that the daily number of reported cases has the following form

N (t) = N_{1} e^{λ_{1} t} + N_{2} e^{λ_{2} t} + N_{3} e^{λ_{3} t} + \dots + N_{m} e^{λ_{m} t},

where $N_{1}, \dots, N_{m} \in C$ are nonzero, $λ_{1}, \dots, λ_{m} \in C$ are the same as in (14), and $m \leq n$ .

Since $N (t)$ is obtained from $CR (t)$ by computing the first derivative, we have the following relationship

N_{k} = {CR}_{k} λ_{k}, \forall k = 1, \dots, m .

Remark 2.

For the daily number of cases data $t \mapsto N (t)$ only a few eigenvalues will be tractable. For example, in Section 3.3, we will consider the following extension

$N (t) = N_{1} e^{λ_{1} t} + N_{2} e^{λ_{2} t} + N_{3} e^{λ_{3} t} + N_{4} e^{λ_{4} t} + w (t)$

where $w (t)$ will contain $N_{5} e^{λ_{5} t} + \dots + N_{m} e^{λ_{m} t}$ merged together with some random term.

Remark 3.

The identification of the eigenvalues $λ_{1}, \dots, λ_{m}$ as parameters of the phenomenological model is discussed in Section 3.3. So far, this problem for a finite time interval seems to be open.

We will first approach the data with the following phenomenological model.

Phenomenological model for the cumulative numbers of reported cases with $λ > 0$

We start with a first eigenvalue $λ = e^{α} > 0$ , for some $α \in R$ . The phenomenological model used to fit the cumulative numbers of reported cases has the following form

CR (t) = A e^{α (t - t_{0})} + C, for t \in [t_{0}, + \infty),

(15)

where $A \in R$ , $α \in R$ , and $C \in R$ are real numbers. For discrete times, it is equivalent to say that

CR (n) = A λ^{n} + C, for n = 0, 1, 2, \dots .

(16)

By computing the first derivative of $t \mapsto CR (t)$ , we obtain a model for the daily number of cases of the following form

N (t) = A α e^{α (t - t_{0})}, for t \in [t_{0}, + \infty) .

(17)

Once the best fit of the above phenomenological model to the data is obtained, we subtract this model from the data $t \mapsto {CR}_{Data} (t)$ and obtain a first residual

Residual (t) = {CR}_{Data} (t) - CR (t) .
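The fit-then-subtract step can be sketched as follows. The text does not specify the fitting algorithm, so this example makes an assumption: for each α on a grid, the model (15) is linear in A and C and is solved by ordinary least squares, and the α with the smallest sum-of-squares error is kept. The data are synthetic and the function name `fit_exponential` is hypothetical.

```python
import math


def fit_exponential(times, values, alphas):
    """Least-squares fit of CR(t) = A exp(alpha (t - t0)) + C, with t0 = times[0].

    For each alpha on the grid, A and C follow from simple linear regression.
    Returns (A, alpha, C) for the alpha with the smallest squared error.
    """
    t0 = times[0]
    m = len(times)
    y_mean = sum(values) / m
    best = None
    for alpha in alphas:
        x = [math.exp(alpha * (t - t0)) for t in times]
        x_mean = sum(x) / m
        sxx = sum((xi - x_mean) ** 2 for xi in x)
        if sxx == 0.0:  # alpha = 0 makes the regressor constant; skip it
            continue
        sxy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, values))
        a = sxy / sxx
        c = y_mean - a * x_mean
        sse = sum((yi - a * xi - c) ** 2 for xi, yi in zip(x, values))
        if best is None or sse < best[0]:
            best = (sse, a, alpha, c)
    _, a, alpha, c = best
    return a, alpha, c


# Synthetic cumulative data (hypothetical): exponential trend plus a small oscillation.
times = list(range(11))
cr_data = [2.0 * math.exp(0.05 * t) + 10.0 + 0.05 * math.sin(0.9 * t) for t in times]
alpha_grid = [k / 1000 for k in range(1, 201)]

a, alpha, c = fit_exponential(times, cr_data, alpha_grid)
residual = [y - (a * math.exp(alpha * (t - times[0])) + c) for t, y in zip(times, cr_data)]
```

The list `residual` plays the role of the first residual defined above. A general-purpose nonlinear least-squares routine could be used instead of the grid.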

Next we will approach the residual with the following phenomenological model.

Phenomenological model for the cumulative numbers of reported cases with $λ \in C$

Assume that the eigenvalues are two conjugate complex numbers $λ = e^{α \pm i ω} \in C$ , for some $α \in R$ and $ω \geq 0$ . The phenomenological model used to fit the cumulative numbers of reported cases has the following form

CR (t) = e^{α (t - t_{0})} [A cos (ω (t - t_{0})) + B sin (ω (t - t_{0}))] + C, for t \in [t_{0}, + \infty),

(18)

where $α \in R$ , $A \in R$ , $B \in R$ , $C \in R$ , and $ω \geq 0$ are real numbers. For discrete times, it is equivalent to say that

CR (n) = \frac{A - i B}{2} λ^{n} + \frac{A + i B}{2} {\bar{λ}}^{n} + C, for n = 0, 1, 2, \dots .

(19)

By computing the first derivative of $t \mapsto CR (t)$ , we obtain a model for the daily number of cases of the following form

N (t) = e^{α (t - t_{0})} [\hat{A} cos (ω (t - t_{0})) + \hat{B} sin (ω (t - t_{0}))], for t \in [t_{0}, + \infty),

(20)

where

\left\{\begin{matrix} \hat{A} = α A + ω B \\ \hat{B} = - ω A + α B \end{matrix}\right. \Leftrightarrow \left\{\begin{matrix} A = \frac{α \hat{A} - ω \hat{B}}{ω^{2} + α^{2}} \\ B = \frac{ω \hat{A} + α \hat{B}}{ω^{2} + α^{2}} \end{matrix}\right. .

(21)

Remark 4.

When $ω = 0$ in (18), we obtain the previous model (15).

2.3. Cumulative and Daily Number of Reported Cases for COVID-19 in Japan

Here we use the cumulative numbers of reported cases of COVID-19 in Japan taken from the WHO [40]. The data show a succession of epidemic waves followed by endemic periods. In Figure 4, black dots represent the data, the regions with a blue background correspond to epidemic phases, and the regions with a yellow background correspond to endemic phases. The region of interest for applying the method, between 19 October and 29 October 2020, is marked with light green vertical lines in the figure.

In this figure, we plot the daily number of reported cases for COVID-19 in Japan.

3. Results

3.1. Methods Applied to Ten Days Data

In this section, we will fit the phenomenological model (15) or (18) to the cumulative numbers of reported cases presented in the previous subsection. We consider a period of 10 days since the beginning of the third epidemic wave of COVID-19 in Japan. The period goes from 19 to 29 October 2020.

Step 1:
In Figure 5, we fit an exponential function (15) to the cumulative number of reported cases of COVID-19 in Japan between 19 and 29 October 2020.

In this figure, the black dots correspond to the cumulative numbers of reported cases of COVID-19 in Japan between 19 October and 29 October 2020. The red curve corresponds to the best fit of model (15) to the cumulative numbers of reported cases.

In Figure 5, the best fit of model (15) is obtained for

A_{1} = 2.88 \times 10^{4}, C_{1} = 6.42 \times 10^{4}, and α_{1} = 0.02 .

Hence,

λ_{1} = exp (α_{1}) = 1.02 .

Step 2:
Next, we consider the residual left after the previous fit,
${Residual}_{1} (t) = CR (t) - [A_{1} e^{α_{1} t} + C_{1}] .$

In Figure 6, we fit the model (18) to the first residual function $t \mapsto {Residual}_{1} (t)$ .

In this figure, the black dots correspond to the function $t \mapsto {Residual}_{1} (t)$ from 19 October to 29 October 2020. The red curve corresponds to the best fit of model (18) to ${Residual}_{1} (t)$ .

In Figure 6, the best fit of model (18) (i.e., minimizing the sum-of-squares error) is obtained for

A_{2} = 138.16, B_{2} = - 127.36, C_{2} = 11.88, α_{2} = - 0.07, and ω_{2} = 0.95 .

The period associated to $ω_{2}$ is equal to $P_{2} = \frac{2 π}{ω_{2}} = 6.609$ days. This periodic phenomenon was observed in many countries (see for example [41]). Here,

λ_{2} = exp (α_{2} + i ω_{2}) = 0.54 + 0.76 i,

λ_{3} = exp (α_{2} - i ω_{2}) = 0.54 - 0.76 i .
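The conversion from the fitted exponents to eigenvalues and to a period is a one-line computation each. The short sketch below uses the rounded parameter values quoted above, so the last digits may differ slightly from the values reported in the text (which were presumably computed from unrounded estimates).

```python
import cmath
import math

# Rounded fitted values quoted in Steps 1 and 2.
alpha_1 = 0.02
alpha_2, omega_2 = -0.07, 0.95

lambda_1 = math.exp(alpha_1)                     # dominant (real) eigenvalue
lambda_2 = cmath.exp(complex(alpha_2, omega_2))  # secondary eigenvalue
lambda_3 = lambda_2.conjugate()                  # its complex conjugate
period_2 = 2 * math.pi / omega_2                 # period of the oscillation, in days
```

Note that $|λ_{2}| = e^{α_{2}} < 1 < λ_{1}$ , so the oscillating component is damped relative to the exponential trend.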

By using

M = (\begin{matrix} λ_{1}^{- 1} & λ_{1}^{- 2} & λ_{1}^{- 3} \\ λ_{2}^{- 1} & λ_{2}^{- 2} & λ_{2}^{- 3} \\ λ_{3}^{- 1} & λ_{3}^{- 2} & λ_{3}^{- 3} \end{matrix}),

in (A3) below, with $n = 3$ , we obtain

(\begin{matrix} R_{0} (1) \\ R_{0} (2) \\ R_{0} (3) \end{matrix}) = (\begin{matrix} 2.09 \\ - 1.96 \\ 0.87 \end{matrix}) .

(22)

Since

det (M) = 1.78 i,

the components of $M^{- 1}$ are not too large, and the above result should not be too sensitive to stochastic errors. The main problem in (22) is the second component, $- 1.9625$ , which is negative and therefore makes no sense as a daily reproduction number.
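The computation leading to (22) can be sketched numerically. Formula (A3) is not reproduced in this section, so the example assumes that it amounts to solving the linear system $M R = (1, 1, 1)^{T}$ , in which each row is Equation (13) written for one eigenvalue. The solver below is a plain Gaussian elimination written for this illustration, and the eigenvalues are recomputed from the rounded parameters, so the result matches (22) only up to rounding.

```python
import cmath
import math


def solve_linear_system(matrix, rhs):
    """Gaussian elimination with partial pivoting (entries may be complex)."""
    n = len(rhs)
    aug = [list(row) + [b] for row, b in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for row in range(col + 1, n):
            factor = aug[row][col] / aug[col][col]
            for k in range(col, n + 1):
                aug[row][k] -= factor * aug[col][k]
    solution = [0j] * n
    for row in range(n - 1, -1, -1):
        tail = sum(aug[row][k] * solution[k] for k in range(row + 1, n))
        solution[row] = (aug[row][n] - tail) / aug[row][row]
    return solution


lambda_1 = math.exp(0.02)
lambda_2 = cmath.exp(complex(-0.07, 0.95))
lambda_3 = lambda_2.conjugate()
eigenvalues = [lambda_1, lambda_2, lambda_3]

# Row k of M is (lambda_k^-1, lambda_k^-2, lambda_k^-3), as in the text.
M = [[lam ** (-d) for d in (1, 2, 3)] for lam in eigenvalues]
raw_solution = solve_linear_system(M, [1.0, 1.0, 1.0])

r0 = [value.real for value in raw_solution]                       # R_0(1), R_0(2), R_0(3)
largest_imaginary_part = max(abs(value.imag) for value in raw_solution)
```

The second entry of `r0` comes out negative, which is the positivity problem discussed above. The imaginary parts vanish up to rounding because the eigenvalues come in a conjugate pair.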

3.2. Spectral Truncation Method Applied to Ten Days Data

In the previous subsection, the first two fits make perfect sense. However, adding more fits would be questionable because they become more and more random after a few steps. We could alternatively continue to fit the rest by using our phenomenological model, which would provide new eigenvalues.

The major problem in the previous section is that when we apply formula (A3) with all the eigenvalues, we obtain some $R_{0} (1), \dots, R_{0} (n)$ with negative values. Instead here, we increase the dimension n of L, and we use only the eigenvalues $λ_{1}, λ_{2}, λ_{3}$ .

3.2.1. Re-Normalizing Procedure

Assume that $λ_{1} \neq 1$ . Then, by setting

\bar{N} (t) = \frac{N (t)}{λ_{1}^{t}} \Leftrightarrow N (t) = λ_{1}^{t} \bar{N} (t)

where $t \mapsto N (t)$ is a solution of (8), we obtain the following normalized equation

λ_{1}^{t} \bar{N} (t) = \sum_{d = 1}^{n} R_{0} (d) λ_{1}^{t - d} \bar{N} (t - d), \forall t \geq t_{1},

and by dividing the above equation by $λ_{1}^{t}$ we obtain

\bar{N} (t) = \sum_{d = 1}^{n} {\bar{R}}_{0} (d) \bar{N} (t - d), \forall t \geq t_{1} .

where

R_{0} (d) = {\bar{R}}_{0} (d) λ_{1}^{d}, \forall d = 1, \dots, n .

(23)
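A small numerical check of this re-normalization is given below. It is a sketch under stated assumptions: the kernel is hypothetical and non-negative, and the dominant eigenvalue is located by bisection on Equation (13), a choice made for this example only. After dividing $R_{0} (d)$ by $λ_{1}^{d}$ as in (23), the re-normalized kernel should sum to 1.

```python
def dominant_eigenvalue(r0, lo=1e-3, hi=1e3, iterations=200):
    """Positive root of 1 = sum_d R_0(d) lam^(-d), found by bisection.

    Assumes a non-negative kernel with at least one positive entry, so that
    the gap below increases with lam, and a root inside [lo, hi].
    """
    def gap(lam):
        return 1.0 - sum(r / lam ** d for d, r in enumerate(r0, start=1))

    for _ in range(iterations):
        mid = (lo + hi) / 2.0
        if gap(mid) < 0.0:
            lo = mid  # the root lies above mid
        else:
            hi = mid
    return (lo + hi) / 2.0


def renormalize(r0, lam1):
    """Apply (23): R_0_bar(d) = R_0(d) / lam1^d."""
    return [r / lam1 ** d for d, r in enumerate(r0, start=1)]


r0 = [0.5, 0.4, 0.3]  # hypothetical kernel, R_0(1), R_0(2), R_0(3)
lam1 = dominant_eigenvalue(r0)
r0_bar = renormalize(r0, lam1)
```

Here the entries of `r0` sum to 1.2, so `lam1` is larger than 1, while the entries of `r0_bar` are non-negative and sum to 1.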

By using the procedure, we can always fix the dominant eigenvalue of L to 1 by imposing that L is Markovian (see Definition 1). Then we use the following re-normalizing procedure for the eigenvalues

λ_{1}^{🟉} = λ_{1} / λ_{1} = 1, λ_{2}^{🟉} = λ_{2} / λ_{1} = 0.53 + 0.74 i, and λ_{3}^{🟉} = 0.53 - 0.74 i .

In Figure 7, we fit these eigenvalues $λ_{2}^{🟉}$ and $λ_{3}^{🟉}$ with the spectrum of Markovian Leslie matrices L on a mesh. We observe that the fit improves when the dimension of L increases.

We plot the spectrum of the Markovian Leslie matrices L (red dots) for $n = 3, 5, 6, 7$ (respectively in (a–d)) giving the best match to the secondary eigenvalues $λ_{2}^{🟉}$ and $λ_{3}^{🟉}$ (green dots). We observe that the best-fitting secondary eigenvalues remain far away from $λ_{2}^{🟉}$ and $λ_{3}^{🟉}$ for $n = 3$ , get closer for $n = 5$ , and are very close for $n = 6$ and $n = 7$ .
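The mesh search can be illustrated in the smallest case $n = 3$ . The sketch below is not the authors' code: the mesh, the distance, and the function names are assumptions made for this example. For a Markovian kernel, 1 is always an eigenvalue, so the two remaining eigenvalues are the roots of a quadratic and can be compared directly with the target $λ_{2}^{🟉} = 0.53 + 0.74 i$ . Matching $λ_{2}^{🟉}$ is enough, since the spectrum is symmetric under complex conjugation.

```python
import cmath


def secondary_eigenvalues(r1, r3):
    """Eigenvalues other than 1 of a Markovian Leslie matrix with n = 3.

    With r1 + r2 + r3 = 1, the characteristic polynomial factors as
    x^3 - r1 x^2 - r2 x - r3 = (x - 1)(x^2 + b x + c), with b = 1 - r1, c = r3.
    """
    b, c = 1.0 - r1, r3
    root = cmath.sqrt(b * b - 4.0 * c)
    return (-b + root) / 2.0, (-b - root) / 2.0


def best_markovian_kernel(target, mesh_size=100):
    """Scan non-negative kernels (r1, r2, r3) summing to 1 on a regular mesh."""
    best_distance, best_kernel = float("inf"), None
    for i in range(mesh_size + 1):
        for j in range(mesh_size + 1 - i):
            k = mesh_size - i - j
            r1, r2, r3 = i / mesh_size, j / mesh_size, k / mesh_size
            distance = min(abs(lam - target) for lam in secondary_eigenvalues(r1, r3))
            if distance < best_distance:
                best_distance, best_kernel = distance, (r1, r2, r3)
    return best_distance, best_kernel


target = 0.53 + 0.74j  # re-normalized secondary eigenvalue from the text
best_distance, best_kernel = best_markovian_kernel(target)
```

For $n = 3$ the secondary eigenvalues of a non-negative Markovian kernel have real part $- (1 - R_{0} (1)) / 2 \leq 0$ , so they cannot come close to a target with real part 0.53. This is consistent with the poor match reported for $n = 3$ . Larger n would require a general eigenvalue solver and a higher-dimensional mesh.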

In Figure 8, we observe that, for $n \in \{3, 5, 6\}$ , there is a unique set of eigenvalues $λ_{1}, λ_{2}, λ_{3}, \dots, λ_{n}$ of L (classified with decreasing real part) minimizing the distance $| λ_{2}^{🟉} - λ_{2} |$ and $| λ_{3}^{🟉} - λ_{3} |$ . This is no longer true for $n = 7$ .

We plot the spectrum of the Leslie matrix L (red dots) when $n = 3, 5, 6, 7,$ (respectively in (a–d)) giving the best match to the secondary eigenvalues $λ_{2}^{🟉}$ and $λ_{3}^{🟉}$ (green dots). The red dots correspond to the spectrum of L for all the possible matrices L, having their second pair of eigenvalues close to the minimal distance to $λ_{2}^{🟉}$ and $λ_{3}^{🟉}$ .

3.2.2. Daily Basic Reproduction Numbers

In Figure 9, we plot the average distribution $d \mapsto R_{0} (d)$ , standard deviation (blue region), and $95\%$ confidence interval.

In this figure, we use the distributions $d \mapsto R_{0} (d)$ minimizing the distance $| λ_{2}^{🟉} - λ_{2} |$ and $| λ_{3}^{🟉} - λ_{3} |$ whenever $n = 7$ . In (a), we plot the average distribution $d \mapsto R_{0} (d)$ (red curve), standard deviation (blue region), and $95\%$ confidence interval (light blue region). In (b), we plot the 24 distributions $d \mapsto R_{0} (d)$ . In (c), we give a histogram with the multiple values of $R_{0} = \sum_{d = 1}^{n} R_{0} (d)$ . We observe that some of the $d \mapsto R_{0} (d)$ are similar to the case $n = 6$ , with a maximum on day $d = 6$ , but on average the maximum value is on day 7.

In Figure 10, we plot the daily basic reproduction numbers $R_{0} (d)$ .

We plot the daily basic reproduction numbers $R_{0} (d)$ obtained for $n = 3$ in (a), $n = 5$ in (b), $n = 6$ in (c), and $n = 7$ in (d). The distribution for $n = 7$ corresponds to the red curve in Figure 9.

According to [42], the effective $R_{0}$ in Japan was between $1.06$ and $1.14$ on 19 October 2020.

3.2.3. Applying the Model to Daily Number of Reported Cases

The model used to run the simulations is the following

N (t) = \sum_{d = 1}^{6} R_{0} (d) N (t - d), \forall t \geq t_{0} + 6,

(24)

with, according to formulas (17) and (20), the initial condition

N (t) = A_{1} ln (λ_{1}) λ_{1}^{t} + e^{α_{2} t} [{\hat{A}}_{2} cos (ω_{2} t) + {\hat{B}}_{2} sin (ω_{2} t)], \forall t = t_{0}, t_{0} + 1, \dots, t_{0} + 5,

(25)

with

{\hat{A}}_{2} = α_{2} A_{2} + ω_{2} B_{2} and {\hat{B}}_{2} = - ω_{2} A_{2} + α_{2} B_{2} .

(26)

In (24)–(26) we use the parameter values estimated in Section 3.1.
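For concreteness, the initial condition (25) and (26) can be evaluated with the rounded parameter values of Section 3.1. In this sketch, time is counted in days from $t_{0}$ (that is, $t_{0} = 0$ ), which is an assumption made here to match the form $e^{α (t - t_{0})}$ of models (15) and (18). The values of $R_{0} (d)$ for $n = 6$ are only given graphically in the text, so the recursion (24) itself is not run.

```python
import math

# Rounded fitted values from Section 3.1.
a1, alpha_1 = 2.88e4, 0.02
a2, b2, alpha_2, omega_2 = 138.16, -127.36, -0.07, 0.95

# Equation (26).
a2_hat = alpha_2 * a2 + omega_2 * b2
b2_hat = -omega_2 * a2 + alpha_2 * b2


def initial_daily_cases(t):
    """Right-hand side of (25), with t counted in days from t0 (assumption)."""
    growth = a1 * alpha_1 * math.exp(alpha_1 * t)  # A_1 ln(lambda_1) lambda_1^t
    oscillation = math.exp(alpha_2 * t) * (
        a2_hat * math.cos(omega_2 * t) + b2_hat * math.sin(omega_2 * t)
    )
    return growth + oscillation


# N(t0), ..., N(t0 + 5): the six values needed to start the recursion (24).
initial_values = [initial_daily_cases(t) for t in range(6)]
```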

In Figure 11, we plot the daily number of reported cases from 19 October to 19 November 2020 (black dots) together with the output of model (24) and (25), using the values of $R_{0} (d)$ obtained in Figure 10c (red dots).

In this figure, we plot the daily number of reported cases from 19 October to 19 November 2020 (black dots) and the output of model (24) and (25) with the values of $R_{0} (d)$ obtained in Figure 10c (red dots).

3.3. Extension of the Spectral Truncation Method over One Month

In Figure 12, we apply the AutoCorrelation Function (ACF) and the Partial AutoCorrelation Function (PACF) to the daily number of cases for Japan from 19 October to 19 November 2020. The result does not resemble any of the standard cases. In the ACF, the correlation is significant up to a lag of 7 days, while in the PACF it is significant up to 16 days.
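For readers unfamiliar with the ACF, the following sketch computes the usual sample autocorrelation of a series. The text does not say which software or estimator was used, so the estimator below (the common biased one, normalized by the lag-0 sum of squares) is an assumption, and the weekly test series is synthetic.

```python
import math


def autocorrelation(series, max_lag):
    """Sample autocorrelation for lags 0..max_lag (biased estimator)."""
    m = len(series)
    mean = sum(series) / m
    centered = [x - mean for x in series]
    variance = sum(c * c for c in centered)
    return [
        sum(centered[i] * centered[i + lag] for i in range(m - lag)) / variance
        for lag in range(max_lag + 1)
    ]


# Synthetic daily series (hypothetical) with a 7-day oscillation over 32 days.
daily_cases = [100.0 + 20.0 * math.sin(2.0 * math.pi * t / 7.0) for t in range(32)]
acf = autocorrelation(daily_cases, max_lag=10)
```

On such a series the autocorrelation is positive and large at lag 7 and negative near lags 3 and 4, which is the signature of a weekly pattern. The PACF requires an additional recursion (for example, Durbin-Levinson) and is not sketched here.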

Autocorrelation Function (ACF) (left hand side) and Partial Autocorrelation Function (PACF) (right hand side) applied to the daily number of cases for Japan between 19 October and 19 November 2020.

Step 1: In Figure 13, we fit the model
$ϕ_{1} (t) = A_{1} e^{α_{1} (t - t_{0})} + C_{1},$ (27)
with the cumulative number of reported cases data between 19 October and 19 November 2020.

In this figure, we plot the cumulative number of reported cases data between 19 October and 19 November 2020 (black dots). We plot the best fit of the model (27) to the cumulative data (red curve).

We obtain the following parameter values for the best fit

A_{1} = 7.93 \times 10^{3}, C_{1} = 8.55 \times 10^{4}, and α_{1} = 0.05 .

(28)

Step 2: Next we define as before the first residual
${Residual}_{1} (t) = CR (t) - [A_{1} e^{α_{1} (t - t_{0})} + C_{1}],$ (29)
and we fit the ${Residual}_{1} (t)$ with the model
$\begin{aligned} ϕ_{2} (t) = {} & e^{α_{2} (t - t_{0})} [A_{2} cos (ω_{2} (t - t_{0})) + B_{2} sin (ω_{2} (t - t_{0}))] \\ & + e^{α_{3} (t - t_{0})} [A_{3} cos (ω_{3} (t - t_{0})) + B_{3} sin (ω_{3} (t - t_{0}))] + C_{2} . \end{aligned}$ (30)

In Figure 14, we plot the cumulative number of reported cases data between 19 October and 19 November 2020.

The parameters of the phenomenological model $ϕ_{2} (t)$ obtained for the best fit are the following

A_{2} = 55.21, B_{2} = - 84.48, A_{3} = - 391.57, B_{3} = 88.79, C_{2} = 7.68,

(31)

and

α_{2} = 0.0501, ω_{2} = 0.91, α_{3} = - 0.02, ω_{3} = 0.3 .

(32)

The periods associated to $ω_{2}$ and $ω_{3}$ are, respectively,

P_{2} = \frac{2 π}{ω_{2}} = 6.92 days, and P_{3} = \frac{2 π}{ω_{3}} = 21.24 days .

These periods are close to multiples of 7 days.

Remark 5.

It is important to note that the period $P_{3}$ of 21 days is difficult to explain mechanistically, but this value is the smallest value giving the best fit to the data. We tried to impose some upper bounds smaller than 21 days. In such a case, $P_{3}$ is always replaced by the upper bound. This is true for all constraints less than 21 days, and for each constraint larger than 22 days, we obtain $P_{3} = 21.24$ days.

Remark 6.

It is important to note that $α_{1} = α_{2}$ . That is because, during the fit, we impose that $α_{2} \leq α_{1}$ and $α_{3} \leq α_{1}$ . This condition comes from the Perron–Frobenius theorem, and it ensures that

$| λ_{2} | \leq | λ_{1} | \text{ and } | λ_{3} | \leq | λ_{1} | .$

Indeed, $λ_{1}$ must be the spectral radius of L, and $λ_{2}, λ_{3}$ must belong to the closed disk centered at 0 with radius equal to the spectral radius of L (i.e., they have a modulus less than or equal to $λ_{1}$ ).

Eigenvalues associated to the model $ϕ_{1} (t)$ and $ϕ_{2} (t)$ : The first eigenvalue is

λ_{1} = e^{α_{1}} = 1.05 .

The second pair of complex conjugated eigenvalues is

λ_{2} = e^{α_{2}} [cos (ω_{2}) + i sin (ω_{2})] = 0.65 + 0.83 i,

and

λ_{3} = e^{α_{2}} [cos (ω_{2}) - i sin (ω_{2})] = 0.65 - 0.83 i,

and the modulus of $λ_{2}$ is

| λ_{2} | = | λ_{3} | = e^{α_{2}} = e^{α_{1}} = λ_{1} = 1.05 .

The fourth eigenvalue is

λ_{4} = e^{α_{3}} [cos (ω_{3}) + i sin (ω_{3})] = 0.94 + 0.29 i,

and the fifth eigenvalue is its conjugate

λ_{5} = e^{α_{3}} [cos (ω_{3}) - i sin (ω_{3})] = 0.94 - 0.29 i,

and the modulus of $λ_{4}$ is

| λ_{4} | = | λ_{5} | = e^{α_{3}} = 0.98 < 1.05 .
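The eigenvalues above can be recomputed from the fitted exponents with a few lines of Python. This is only a sketch: it uses the rounded values reported in (28) and (32), so the results agree with the text to about two decimal places, and the helper name `eigenvalue` is ours.

```python
import cmath
import math

# Rounded parameter values as reported in (28) and (32).
alpha1 = 0.05
alpha2, omega2 = 0.0501, 0.91
alpha3, omega3 = -0.02, 0.3

def eigenvalue(alpha, omega):
    """Return e^{alpha} [cos(omega) + i sin(omega)] = exp(alpha + i omega)."""
    return cmath.exp(complex(alpha, omega))

lam1 = math.exp(alpha1)
lam2 = eigenvalue(alpha2, omega2)
lam3 = lam2.conjugate()
lam4 = eigenvalue(alpha3, omega3)
lam5 = lam4.conjugate()

for name, lam in [("lam2", lam2), ("lam3", lam3), ("lam4", lam4), ("lam5", lam5)]:
    print(f"{name} = {lam:.2f}, modulus = {abs(lam):.2f}")
```

With these rounded inputs, $| λ_{2} |$ and $λ_{1}$ coincide only up to rounding, since the reported $α_{2} = 0.0501$ and $α_{1} = 0.05$ differ in the fourth decimal.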

Using $λ_{2}$ and $λ_{4}$ as an estimator: Next we consider all the matrices L in which the component $R_{0} (d)$ is replaced by ${\bar{R}}_{0} (d)$ , and we assume that

\sum_{d = 1}^{n} {\bar{R}}_{0} (d) = 1 .

The dominant eigenvalue of L is 1, and we look for matrices such that the second eigenvalue of L is close to

λ_{2}^{🟉} = λ_{2} / λ_{1},

and the fourth eigenvalue of L is close to

λ_{4}^{🟉} = λ_{4} / λ_{1} .

To carry out this approach, we minimize the quantity

χ (L) = max (d (λ_{2}^{🟉}, σ (L)), d (λ_{4}^{🟉}, σ (L)))

where

d (λ_{2}^{🟉}, σ (L)) = min_{λ \in σ (L)} | λ_{2}^{🟉} - λ |, and d (λ_{4}^{🟉}, σ (L)) = min_{λ \in σ (L)} | λ_{4}^{🟉} - λ |,

where $σ (L)$ is the set of all eigenvalues of L.
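The criterion $χ (L)$ can be evaluated numerically as in the following Python sketch. We assume here that the Leslie matrix has the usual companion form, with the ${\bar{R}}_{0} (d)$ on the first row and ones on the subdiagonal, which is consistent with the characteristic equation used in Appendix B; the function names and the uniform test kernel are hypothetical.

```python
import numpy as np

def leslie_matrix(r0_bar):
    """Companion-form Leslie matrix (assumed form of L)."""
    r0_bar = np.asarray(r0_bar, dtype=float)
    n = len(r0_bar)
    L = np.zeros((n, n))
    L[0, :] = r0_bar                                   # first row: kernel
    L[np.arange(1, n), np.arange(0, n - 1)] = 1.0      # subdiagonal: ageing
    return L

def dist_to_spectrum(target, spectrum):
    """d(target, sigma(L)) = min over eigenvalues of |target - lambda|."""
    return float(np.min(np.abs(spectrum - target)))

def chi(r0_bar, targets):
    """Largest distance from the target eigenvalues to the spectrum of L."""
    spectrum = np.linalg.eigvals(leslie_matrix(r0_bar))
    return max(dist_to_spectrum(t, spectrum) for t in targets)

# Hypothetical example: uniform kernel on n = 7 days, targets taken from the
# rounded eigenvalues of the text divided by lambda_1.
r0_bar_demo = np.full(7, 1.0 / 7.0)
targets_demo = [(0.65 + 0.83j) / 1.05, (0.94 + 0.29j) / 1.05]
chi_demo = chi(r0_bar_demo, targets_demo)
```

A search over nonnegative kernels summing to 1 would then retain those whose `chi` value is close to the smallest one found; the search strategy itself is not specified by this sketch.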

In Figure 15, we consider the $d \mapsto {\bar{R}}_{0} (d)$ such that the corresponding maximum satisfies

χ (L ({\bar{R}}_{0})) \leq inf_{{\hat{R}}_{0} \geq 0 : \sum {\hat{R}}_{0} (d) = 1} χ (L ({\hat{R}}_{0})) + 10^{- 2} .

In this figure, we consider the case $n = 25$ . We plot the distributions of daily basic reproduction numbers $d \mapsto {\bar{R}}_{0} (d)$ corresponding to the distributions having some secondary eigenvalues and fourth eigenvalues at a distance less than $10^{- 2}$ to the best match. The red curve is the average distribution $d \mapsto {\bar{R}}_{0} (d)$ . The blue region corresponds to the standard deviation around the mean distribution. The light blue region corresponds to the $95 %$ confidence interval.

We define

R_{0} (d) = {\bar{R}}_{0} (d) λ_{1}^{d}, \forall d = 1, \dots, n .

(33)
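The rescaling (33) can be checked numerically: if the ${\bar{R}}_{0} (d)$ sum to 1, then $\sum_{d} R_{0} (d) λ_{1}^{- d} = 1$ , so $λ_{1}$ solves the characteristic equation for $R_{0} (d)$ . The Python sketch below uses a hypothetical kernel `r0_bar_demo` with $n = 6$ and the rounded value $λ_{1} = 1.05$ .

```python
import numpy as np

lam1 = 1.05  # rounded dominant eigenvalue from the text
# Hypothetical normalized kernel (sums to 1); not a fitted distribution.
r0_bar_demo = np.array([0.30, 0.10, 0.05, 0.05, 0.20, 0.30])
d = np.arange(1, len(r0_bar_demo) + 1)

# Equation (33): R_0(d) = bar{R}_0(d) * lam1^d.
r0_demo = r0_bar_demo * lam1 ** d

# Characteristic sum at lam1 and the resulting basic reproduction number.
char_sum = np.sum(r0_demo * lam1 ** (-d.astype(float)))
basic_r0 = r0_demo.sum()
```

Since $λ_{1} > 1$ , each term is multiplied by a factor larger than 1, so the resulting sum $R_{0}$ exceeds 1.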

In Figure 16, we obtain a good description of the dynamic of infection at the individual level that confirms the one obtained over shorter periods. As expected, the average patient first loses its ability to transmit the pathogen: after decreasing from day 1 to day 4, $R_{0} (d)$ increases between day 4 and day 7, where it reaches a maximum. After day 7, $R_{0} (d)$ decays until day 9. Then a second peak arises, with a maximum on day 14. We could explain this second peak by supposing that an important transmission of pathogen still exists from day 12 to day 16. We also obtain a third peak from day 19 to day 23, with a maximum value on day 21.

In this figure, we consider the case $n = 25$ . We plot the distributions of daily basic reproduction numbers $d \mapsto R_{0} (d) = {\bar{R}}_{0} (d) λ_{1}^{d}$ , where ${\bar{R}}_{0} (d)$ is the red curve in Figure 15.

In Figure 17, we plot the spectrum of the Leslie matrix L when $d \mapsto {\bar{R}}_{0} (d)$ corresponds to the average distribution (i.e., the red curve in Figure 15).

In this figure, we consider the case $n = 25$ . We plot the spectrum of the Leslie matrix L (red dots) when $d \mapsto {\bar{R}}_{0} (d)$ corresponds to the average distribution (i.e., the red curve in Figure 15).

Recalling that, by definition, the basic reproduction number is

R_{0} = \sum_{d = 1}^{n} R_{0} (d),

we obtain the sum of the daily reproduction numbers (red curve in Figure 16)

R_{0} = 2.13 .

In Figure 18, we plot a histogram for the values of the basic reproduction number obtained by summing the distributions $d \mapsto R_{0} (d)$ from Figure 16.

In this figure, we consider the case $n = 25$ , and we plot a histogram for the values of the basic reproduction number obtained by summing the distributions $d \mapsto R_{0} (d)$ from Figure 16.

Next, we consider

N (t) = \sum_{d = 1}^{25} R_{0} (d) N (t - d), \forall t \geq t_{0} + 25,

(34)

and, according to formulas (17) and (20), with the initial condition for $t = t_{0}, t_{0} + 1, \dots, t_{0} + 24$ , we have

N (t) = A_{1} ln (λ_{1}) λ_{1}^{t} + e^{α_{2} t} [{\hat{A}}_{2} cos (ω_{2} t) + {\hat{B}}_{2} sin (ω_{2} t)] + e^{α_{3} t} [{\hat{A}}_{3} cos (ω_{3} t) + {\hat{B}}_{3} sin (ω_{3} t)],

(35)

with

{\hat{A}}_{2} = α_{2} A_{2} + ω_{2} B_{2}, {\hat{B}}_{2} = - ω_{2} A_{2} + α_{2} B_{2}, {\hat{A}}_{3} = α_{3} A_{3} + ω_{3} B_{3} and {\hat{B}}_{3} = - ω_{3} A_{3} + α_{3} B_{3} .

(36)

In (34)–(36) we use the parameter values estimated above in (28), (31), and (32).

In Figure 19, we see that the mean distribution $d \mapsto R_{0} (d)$ makes it possible to reproduce the oscillations around the trend for the daily number of cases. It is important to note that without the third peak in Figure 16 we do not obtain such a good correspondence between the model and the data.

In this figure, we plot the daily number of reported cases data between 19 October and 19 November 2020 (black dots). The red curve corresponds to $ϕ_{1}^{'} + ϕ_{2}^{'}$ , and the green dots correspond to (34) and (35) whenever $R_{0} (d)$ comes from the average distribution (i.e., the red curve in Figure 15). We observe a very good match between the green dots and the red curve (the phenomenological model).

4. Discussion

In this article, we start by investigating the connection between a signal decomposed into a sum of damped or amplified oscillations and a renewal equation. Namely, we connect the daily number of reported cases written as

N (t) = N_{1} e^{α_{1} t} [cos (ω_{1} t) + i sin (ω_{1} t)] + \dots + N_{n} e^{α_{n} t} [cos (ω_{n} t) + i sin (ω_{n} t)], \forall t \geq t_{1} - n,

with the renewal equation

N (t) = \sum_{d = 1}^{n} R_{0} (d) N (t - d), \forall t \geq t_{1} .

In the context of epidemic time series, a spectral method usually refers to the Fourier decomposition of a periodic signal. In the present paper, the data are not periodic and are composed of an exponential function (Malthusian growth) perturbed with some damped oscillating functions. So we use complex numbers with non-null real parts. We refer to Cazelles et al. [33] for more results about time series.

4.1. Data over Ten Days

We can notice in Figure 9 and Figure 10 and Table 1 that the daily reproduction number as well as the instantaneous reproduction number are estimated. Concerning the instantaneous (or effective) reproduction number $R_{e} (t)$ [43,44] estimated by [42], which equals 1.1 on 19 October 2020, the best fit corresponds to $n = 7$ days (see (c) in Figure 9). This duration of the contagiousness period (6 or 7 days) is close to the values estimated from the virulence measured in [14,45,46]. In Figure 10, we always obtain a $U$ -shaped distribution for the curve of daily reproduction numbers. This corresponds to the biphasic form of the virulence already observed in respiratory viruses, such as influenza, as recalled in the Introduction.

Table 1.

The above reproduction numbers are obtained by using the formula $R_{0} = \sum_{d = 1}^{n} R_{0} (d)$ .

n	3	5	6	7
$R_{0}$	1.02	1.04	1.06	1.07

This temporal behavior of the contagiousness can correspond to the evolution of contagious symptoms like cough or spitting, which diminish during the innate immune response, followed by a comeback of the symptoms before the adaptive immune response (whenever the innate defense has been overcome by the virus). If the innate cellular immunity has not been sufficient for eliminating the virus, the viral load increases again, causing a reappearance of the symptoms before the adaptive immunity (cellular and humoral) occurs, which results in a transient decrease in contagiousness between the two immunologic phases. The medical recommendations are, in the case of U-shaped contagiousness, never to take a transient improvement for a permanent disappearance of the symptoms and to stay at home to avoid a bacterial secondary infection that is possibly fatal.

The estimation of the daily reproduction numbers in the COVID-19 outbreak constitutes an important issue. At the public health level, publishing only the sum of the daily reproduction numbers, that is to say the basic reproduction number $R_{0}$ or the effective reproduction number $R_{e}$ , could suffice for controlling and managing the behavior of a whole population with mitigation or vaccination measures. At the individual level, it is important to know the existence of a minimum of the daily reproduction numbers, which generally corresponds to a temporary clinical improvement, after a partial success of the innate immune defense. This makes it possible to advise the patient to continue to respect his own isolation, prevention, and therapy choices (depending on his vaccination state) even if this transient clinical improvement has occurred. The present methodology also makes it possible to estimate both the individual contagiousness duration in a dedicated age class and its seasonal variations, which is crucial for optimizing the benefit–risk decisions of the public and individual health policies.

4.2. Data over One Month

Over one month, we obtain a daily reproduction number with three peaks, centered respectively on 7 days, 14 days, and 21 days. These quantities coincide with the periods of 7 days and 21 days obtained in Figure 14 when fitting the first residual, that is, after subtracting the exponential growth first fitted to the cumulative data. As far as we understand the problem, it is the period of 21 days in the data that induces the third peak. This third peak is very suspicious. Nevertheless, the data lead us to such a shape for the daily reproduction number. We also tried to run Figure 19 without the third peak, and we obtained a bad fit to the data, while with this third peak, the fit is good. One may also note that the 21-day period is insignificant for the ACF and the PACF in Figure 12.

Several possibilities exist to explain this strange shape for the daily reproduction number using the data over one month. One possible explanation is that the Japanese population should be subdivided into several groups having very different infection dynamics (at the level of a single patient). Here we have in mind the patient with a short infection period but high transmissibility (super spreaders) versus the patient with a long infection period with mild symptoms.

We suspect that such a shape for the daily reproduction number could be attributed to the delay between infection and the reporting of a case. The daily number of reported cases $D (t)$ would then be obtained from $N (t)$ , the daily number of new infected cases, by using the following model

D (t) = f \sum_{d = 1}^{q} K (d) N (t - d),

where the integer $q \geq 1$ is the maximum number of days needed to report a case, $f \in [0, 1]$ is the fraction reported, and $K (d) \geq 0$ is the probability to report a case after d days. Therefore, we must have

\sum_{d = 1}^{q} K (d) = 1 .
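The reporting model above is a discrete convolution, and a minimal Python sketch of it is given below. The function name `reported_cases`, the delay distribution `K_demo`, the reporting fraction, and the constant infection series are hypothetical values chosen only to illustrate the formula.

```python
import numpy as np

def reported_cases(new_infections, K, f):
    """D(t) = f * sum_{d=1}^{q} K[d-1] * N(t-d); NaN where history is missing."""
    K = np.asarray(K, dtype=float)
    N = np.asarray(new_infections, dtype=float)
    if np.any(K < 0) or not np.isclose(K.sum(), 1.0):
        raise ValueError("K must be nonnegative and sum to 1")
    if not 0.0 <= f <= 1.0:
        raise ValueError("f must lie in [0, 1]")
    q = len(K)
    D = np.full(len(N), np.nan)
    for t in range(q, len(N)):
        # Sum over reporting delays d = 1, ..., q.
        D[t] = f * sum(K[d - 1] * N[t - d] for d in range(1, q + 1))
    return D

# Hypothetical example: q = 3 days, 80% of cases reported, constant infections.
K_demo = [0.5, 0.3, 0.2]
D_demo = reported_cases(np.full(10, 100.0), K_demo, f=0.8)
```

With a constant input of 100 new infections per day and $f = 0.8$ , the reported series settles at 80 cases per day once $q$ days of history are available.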

4.3. Perspectives and Conclusions

In the present paper, we only consider the Japanese data in the exponential phase of the third epidemic wave.

The case of Japan seems emblematic to us, as it corresponds to a wave of well-identified new cases following a clearly characterized endemic phase. The exponential growth phenomenon being transitory, this explains the relatively limited duration of the sampling, which corresponds to a period in days during which the epidemiological parameters (such as the transmission rate) can be considered as constant. It is in such circumstances, where the Gaussian nature of the errors is difficult to prove because of the small sample size, that similar methods based on wavelets have been proposed (Walden and Hosken [37]).

The method of the present paper should be applied to several countries for each epidemic wave to obtain a more systematic study. For the moment, over one month, we obtained a shape for the daily reproduction number that follows the data very well. However, we are suspicious about the third peak. We suspect that the flaw in our analysis comes from the model itself. Such a question has been recently studied by Ioannidis and his collaborators in [47], and we believe that we are facing such modeling difficulties.

Appendix A. Non Identifiability Result

From Formula (13), we deduce that the characteristic equation (12) has exactly one positive solution. By the Perron–Frobenius theorem applied to the Leslie matrix L defined by (11), we know that (by considering the norm of linear operator) the spectral radius of L

r (L) := lim_{n \to + \infty} {∥ L^{n} ∥}_{L (R)}^{1 / n} > 0,

is the unique positive solution of (12). Moreover, all the remaining eigenvalues have a modulus smaller than or equal to $r (L)$ . We refer to ([48], Chapter 4) for more results about this subject.

Non identifiability result: Let $λ_{🟉} > 0$ and $N_{🟉} \neq 0$ . Then

N (t) = N_{🟉} λ_{🟉}^{t - t_{1}}, \forall t \geq t_{1},

is a known solution of (8) if and only if $λ_{🟉}$ is a solution of the characteristic equation.

Assume that $d \in [1, n] \mapsto R^{🟉} (d) \geq 0$ is given, and satisfies

\sum_{d = 1}^{n} R^{🟉} (d) > 0 .

Then if we define

R_{0} (a) = \frac{R^{🟉} (a)}{\sum_{d = 1}^{n} R^{🟉} (d) λ_{🟉}^{- d}}, \forall a = 1, \dots, n,

we deduce that the equation (12) is satisfied for $λ = λ_{🟉}$ , and $N (t) = N_{🟉} λ_{🟉}^{t - t_{1}}$ is a solution of (8). We conclude that a single function $N (t) = N_{🟉} λ_{🟉}^{t - t_{1}}$ is not enough to identify $R_{0} (1), R_{0} (2), R_{0} (3), \dots, R_{0} (n)$ .
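The non-identifiability can be illustrated numerically: two very different kernels, once normalized as above, both satisfy the characteristic equation at the same $λ_{🟉}$ . In the Python sketch below, the two shapes and the value $λ_{🟉} = 1.05$ are hypothetical choices, and `normalise_kernel` is our name for the normalization step.

```python
import numpy as np

def normalise_kernel(r_star, lam_star):
    """R_0(a) = R*(a) / sum_d R*(d) lam_star^{-d}."""
    r_star = np.asarray(r_star, dtype=float)
    d = np.arange(1, len(r_star) + 1, dtype=float)
    return r_star / np.sum(r_star * lam_star ** (-d))

lam_star = 1.05
d = np.arange(1, 7, dtype=float)

# Two hypothetical shapes: flat and U-shaped.
kernel_flat = normalise_kernel([1, 1, 1, 1, 1, 1], lam_star)
kernel_u = normalise_kernel([5, 1, 0, 0, 1, 5], lam_star)

# Both satisfy the characteristic equation 1 = sum_d R_0(d) lam_star^{-d}.
check_flat = np.sum(kernel_flat * lam_star ** (-d))
check_u = np.sum(kernel_u * lam_star ** (-d))
```

Both kernels therefore generate the same exponential solution $N_{🟉} λ_{🟉}^{t - t_{1}}$ , although they differ at every age of infection.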

Appendix B. Identifiability Result

Assumption A1.

Assume that $λ_{1}, \dots, λ_{n} \in C$ are nonzero complex numbers that are pairwise distinct. That is,

$λ_{i} \neq 0, \forall i = 1, \dots, n,$

and

$λ_{i} \neq λ_{j}, \text{ whenever } i \neq j .$

Remark A1.

Since the coefficients of the characteristic Equation (12) are all real, we could also impose that the conjugate of each eigenvalue belongs to the spectrum. That is

${\bar{λ}}_{i} \in \{λ_{1}, \dots, λ_{n}\}, \forall i = 1, \dots, n .$

However, that is not necessary in this subsection.

Remark A2.

When all the eigenvalues are real, the above assumption will be satisfied if and only if $λ_{1}, \dots, λ_{n} \in R$ are nonzero real numbers which are pairwise distinct. Up to a permutation, that is

$λ_{i} \neq 0, \forall i = 1, \dots, n,$

and

$λ_{1} < λ_{2} < \dots < λ_{n} .$

Lemma A1.

Let Assumption A1 be satisfied. Assume that each $λ_{i} \in C$ satisfies the characteristic Equation (12). Then the Leslie matrix L defined by (11) is diagonalizable (and invertible); moreover, for each $U_{1}, U_{2}, \dots, U_{n} \in C$ ,

$U (t) = U_{1} λ_{1}^{t} + U_{2} λ_{2}^{t} + \dots + U_{n} λ_{n}^{t}, \forall t \geq t_{1} - n,$

is a solution of (8). That is to say,

$U (t) = \sum_{d = 1}^{n} R_{0} (d) U (t - d), \forall t \geq t_{1} .$

Identification of the components $U_{i}$ from the values of $t \mapsto N (t)$ : Assume that the values of $N (t)$ are given for $t = t_{1}, \dots, t_{1} + n - 1$ . We claim that we can compute $U_{1}, U_{2}, U_{3}, \dots, U_{n} \in C$ . Indeed,

\begin{matrix} N (t_{1}) & = U_{1} λ_{1}^{t_{1}} & + U_{2} λ_{2}^{t_{1}} & + \dots + U_{n} λ_{n}^{t_{1}}, \\ N (t_{1} + 1) & = U_{1} λ_{1}^{t_{1} + 1} & + U_{2} λ_{2}^{t_{1} + 1} & + \dots + U_{n} λ_{n}^{t_{1} + 1}, \\ ⋮ \\ N (t_{1} + n - 1) & = U_{1} λ_{1}^{t_{1} + n - 1} & + U_{2} λ_{2}^{t_{1} + n - 1} & + \dots + U_{n} λ_{n}^{t_{1} + n - 1}, \end{matrix}

can be rewritten as the system

(\begin{matrix} N (t_{1}) \\ N (t_{1} + 1) \\ ⋮ \\ N (t_{1} + n - 1) \end{matrix}) = (\begin{matrix} λ_{1}^{t_{1}} & λ_{2}^{t_{1}} & \dots & λ_{n}^{t_{1}} \\ λ_{1}^{t_{1} + 1} & λ_{2}^{t_{1} + 1} & \dots & λ_{n}^{t_{1} + 1} \\ ⋮ & ⋮ & & ⋮ \\ λ_{1}^{t_{1} + n - 1} & λ_{2}^{t_{1} + n - 1} & \dots & λ_{n}^{t_{1} + n - 1} \end{matrix}) (\begin{matrix} U_{1} \\ U_{2} \\ ⋮ \\ U_{n} \end{matrix}) .

(A1)

The determinant of the above Vandermonde-like matrix is obtained by factoring $λ_{j}^{t_{1}}$ out of the j-th column, which gives

det (\begin{matrix} λ_{1}^{t_{1}} & λ_{2}^{t_{1}} & \dots & λ_{n}^{t_{1}} \\ λ_{1}^{t_{1} + 1} & λ_{2}^{t_{1} + 1} & \dots & λ_{n}^{t_{1} + 1} \\ ⋮ & ⋮ & & ⋮ \\ λ_{1}^{t_{1} + n - 1} & λ_{2}^{t_{1} + n - 1} & \dots & λ_{n}^{t_{1} + n - 1} \end{matrix})

= λ_{1}^{t_{1}} λ_{2}^{t_{1}} \dots λ_{n}^{t_{1}} det (\begin{matrix} 1 & 1 & \dots & 1 \\ λ_{1} & λ_{2} & \dots & λ_{n} \\ ⋮ & ⋮ & & ⋮ \\ λ_{1}^{n - 1} & λ_{2}^{n - 1} & \dots & λ_{n}^{n - 1} \end{matrix}),

therefore,

det (\begin{matrix} λ_{1}^{t_{1}} & λ_{2}^{t_{1}} & \dots & λ_{n}^{t_{1}} \\ λ_{1}^{t_{1} + 1} & λ_{2}^{t_{1} + 1} & \dots & λ_{n}^{t_{1} + 1} \\ ⋮ & ⋮ & & ⋮ \\ λ_{1}^{t_{1} + n - 1} & λ_{2}^{t_{1} + n - 1} & \dots & λ_{n}^{t_{1} + n - 1} \end{matrix}) = {(λ_{1} λ_{2} \dots λ_{n})}^{t_{1}} \prod_{1 \leq i < j \leq n} (λ_{j} - λ_{i}) .

Therefore, under Assumption A1, this determinant is non null, and we obtain the following result.

Proposition A1.

Let Assumption A1 be satisfied. Then we can compute the components $U_{1}, \dots, U_{n}$ as a function of the given elements of the trajectory $N (t_{1}), \dots, N (t_{1} + n - 1)$ by solving the linear system (A1), and

$(\begin{matrix} U_{1} \\ U_{2} \\ ⋮ \\ U_{n} \end{matrix}) = {(\begin{matrix} λ_{1}^{t_{1}} & λ_{2}^{t_{1}} & \dots & λ_{n}^{t_{1}} \\ λ_{1}^{t_{1} + 1} & λ_{2}^{t_{1} + 1} & \dots & λ_{n}^{t_{1} + 1} \\ ⋮ & ⋮ & & ⋮ \\ λ_{1}^{t_{1} + n - 1} & λ_{2}^{t_{1} + n - 1} & \dots & λ_{n}^{t_{1} + n - 1} \end{matrix})}^{- 1} (\begin{matrix} N (t_{1}) \\ N (t_{1} + 1) \\ ⋮ \\ N (t_{1} + n - 1) \end{matrix}) .$
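Proposition A1 amounts to solving one linear system, which can be sketched in Python as follows. The eigenvalues, the components `u_true`, and the starting time `t1_demo` are hypothetical; the function name `components_from_trajectory` is ours. We solve the system directly rather than forming the inverse, which is a numerical choice and not part of the statement.

```python
import numpy as np

def components_from_trajectory(lams, n_values, t1=0):
    """Solve (A1) for U_1..U_n given N(t1), ..., N(t1 + n - 1)."""
    lams = np.asarray(lams, dtype=complex)
    n = len(lams)
    times = t1 + np.arange(n)
    # A[k, i] = lams[i] ** (t1 + k): the Vandermonde-like matrix of (A1).
    A = lams[np.newaxis, :] ** times[:, np.newaxis]
    return np.linalg.solve(A, np.asarray(n_values, dtype=complex))

# Hypothetical data: one real eigenvalue and one conjugate pair.
lams_demo = np.array([1.05, 0.65 + 0.83j, 0.65 - 0.83j])
u_true = np.array([100.0, 3.0 - 2.0j, 3.0 + 2.0j])
t1_demo = 5
n_demo = np.array([np.sum(u_true * lams_demo ** t)
                   for t in range(t1_demo, t1_demo + 3)])

u_rec = components_from_trajectory(lams_demo, n_demo, t1=t1_demo)
```

For large $n$ or large $t_{1}$ , such Vandermonde-like matrices are typically ill-conditioned, so the recovered components may be sensitive to noise in $N (t)$ .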

Identification of the component $R_{0} (d)$ from the $λ_{i}$ : By assuming that each $λ_{i}$ is a solution of the characteristic Equation (12), we obtain

\begin{matrix} 1 & = R_{0} (1) λ_{1}^{- 1} + R_{0} (2) λ_{1}^{- 2} + \dots + R_{0} (n) λ_{1}^{- n}, \\ 1 & = R_{0} (1) λ_{2}^{- 1} + R_{0} (2) λ_{2}^{- 2} + \dots + R_{0} (n) λ_{2}^{- n}, \\ ⋮ \\ 1 & = R_{0} (1) λ_{n}^{- 1} + R_{0} (2) λ_{n}^{- 2} + \dots + R_{0} (n) λ_{n}^{- n}, \end{matrix}

(A2)

which can be rewritten in matrix form as

(\begin{matrix} 1 \\ 1 \\ ⋮ \\ 1 \end{matrix}) = (\begin{matrix} λ_{1}^{- 1} & λ_{1}^{- 2} & \dots & λ_{1}^{- n} \\ λ_{2}^{- 1} & λ_{2}^{- 2} & \dots & λ_{2}^{- n} \\ ⋮ & ⋮ & & ⋮ \\ λ_{n}^{- 1} & λ_{n}^{- 2} & \dots & λ_{n}^{- n} \end{matrix}) (\begin{matrix} R_{0} (1) \\ R_{0} (2) \\ ⋮ \\ R_{0} (n) \end{matrix}) .

Under Assumption A1 the Vandermonde-like matrix

(\begin{matrix} λ_{1}^{- 1} & λ_{1}^{- 2} & \dots & λ_{1}^{- n} \\ λ_{2}^{- 1} & λ_{2}^{- 2} & \dots & λ_{2}^{- n} \\ ⋮ & ⋮ & & ⋮ \\ λ_{n}^{- 1} & λ_{n}^{- 2} & \dots & λ_{n}^{- n} \end{matrix})

is invertible, because

det (\begin{matrix} λ_{1}^{- 1} & λ_{1}^{- 2} & \dots & λ_{1}^{- n} \\ λ_{2}^{- 1} & λ_{2}^{- 2} & \dots & λ_{2}^{- n} \\ ⋮ & ⋮ & & ⋮ \\ λ_{n}^{- 1} & λ_{n}^{- 2} & \dots & λ_{n}^{- n} \end{matrix}) = λ_{1}^{- 1} λ_{2}^{- 1} \dots λ_{n}^{- 1} det (\begin{matrix} 1 & λ_{1}^{- 1} & \dots & λ_{1}^{- (n - 1)} \\ 1 & λ_{2}^{- 1} & \dots & λ_{2}^{- (n - 1)} \\ ⋮ & ⋮ & & ⋮ \\ 1 & λ_{n}^{- 1} & \dots & λ_{n}^{- (n - 1)} \end{matrix})

hence

det (\begin{matrix} λ_{1}^{- 1} & λ_{1}^{- 2} & \dots & λ_{1}^{- n} \\ λ_{2}^{- 1} & λ_{2}^{- 2} & \dots & λ_{2}^{- n} \\ ⋮ & ⋮ & & ⋮ \\ λ_{n}^{- 1} & λ_{n}^{- 2} & \dots & λ_{n}^{- n} \end{matrix}) = λ_{1}^{- 1} λ_{2}^{- 1} \dots λ_{n}^{- 1} \prod_{1 \leq i < j \leq n} (λ_{j}^{- 1} - λ_{i}^{- 1}) \neq 0 .

Therefore, we can compute the components of the map $d \in [1, n] \mapsto R_{0} (d)$ by solving a linear system involving the eigenvalues, that is, the solutions of the characteristic equation.

Theorem A1.

Let Assumption A1 be satisfied. Then the following properties are equivalent

(i)
The set $\{λ_{1}, \dots, λ_{n}\}$ is the spectrum of the Leslie matrix L defined in (11).

(ii)
Each element of $\{λ_{1}, \dots, λ_{n}\}$ satisfies (A2).

(iii)
The elements $\{λ_{1}, \dots, λ_{n}\}$ satisfy
${(\begin{matrix} λ_{1}^{- 1} & λ_{1}^{- 2} & \dots & λ_{1}^{- n} \\ λ_{2}^{- 1} & λ_{2}^{- 2} & \dots & λ_{2}^{- n} \\ ⋮ & ⋮ & & ⋮ \\ λ_{n}^{- 1} & λ_{n}^{- 2} & \dots & λ_{n}^{- n} \end{matrix})}^{- 1} (\begin{matrix} 1 \\ 1 \\ ⋮ \\ 1 \end{matrix}) = (\begin{matrix} R_{0} (1) \\ R_{0} (2) \\ ⋮ \\ R_{0} (n) \end{matrix}) .$ (A3)
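Formula (A3) can be tested on a small example. In the Python sketch below, we start from a hypothetical kernel `r0_true` with $n = 3$ , compute the roots of the polynomial $λ^{3} - R_{0} (1) λ^{2} - R_{0} (2) λ - R_{0} (3)$ (which is the characteristic Equation (12) multiplied by $λ^{3}$ ), and then recover the kernel from these roots. The function name `kernel_from_eigenvalues` is ours.

```python
import numpy as np

def kernel_from_eigenvalues(lams):
    """Solve M(Lambda) R_0 = (1, ..., 1)^T with M[i, d-1] = lams[i]^{-d}."""
    lams = np.asarray(lams, dtype=complex)
    n = len(lams)
    d = np.arange(1, n + 1)
    M = (1.0 / lams)[:, np.newaxis] ** d[np.newaxis, :]
    return np.linalg.solve(M, np.ones(n, dtype=complex))

# Hypothetical kernel; its eigenvalues are 1 and a complex conjugate pair.
r0_true = np.array([0.2, 0.3, 0.5])
lams_demo = np.roots(np.concatenate(([1.0], -r0_true)))

r0_rec = kernel_from_eigenvalues(lams_demo)
```

Nothing in this linear solve forces the recovered values to be nonnegative, so eigenvalues estimated from data may well produce a kernel with negative entries.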

In Figure A1, we plot the location of the spectra of Markovian Leslie matrices on a mesh. We can observe the changes in the location of the spectrum depending on the dimension n. It seems that the spectrum fills more and more of the unit disk in $C$ when the dimension increases. We refer to Kirkland [49] for more results going in that direction.

Continuous dependency of the component $R_{0} (d)$ with respect to the $λ_{i}$ : Define the set $Ω \subset C^{n}$ of all the elements $Λ = \{λ_{1}, \dots, λ_{n}\} \in C^{n}$ satisfying Assumption A1. For each $Λ = \{λ_{1}, \dots, λ_{n}\} \in Ω,$ we define

M (Λ) = (\begin{matrix} λ_{1}^{- 1} & λ_{1}^{- 2} & \dots & λ_{1}^{- n} \\ λ_{2}^{- 1} & λ_{2}^{- 2} & \dots & λ_{2}^{- n} \\ ⋮ & ⋮ & & ⋮ \\ λ_{n}^{- 1} & λ_{n}^{- 2} & \dots & λ_{n}^{- n} \end{matrix}) .

Theorem A2.

Consider a sequence ${\{Λ^{m} = \{λ_{1}^{m}, \dots, λ_{n}^{m}\}\}}_{m \geq 0} \subset Ω,$ and a point $Λ^{🟉} = \{λ_{1}^{🟉}, \dots, λ_{n}^{🟉}\} \in Ω$ (i.e., all satisfying Assumption A1). Assume that

$lim_{m \mapsto + \infty} Λ^{m} = Λ^{🟉},$

then

$lim_{m \mapsto + \infty} R_{0}^{m} = R_{0}^{🟉},$

where

$R_{0}^{m} = M {(Λ^{m})}^{- 1} (\begin{matrix} 1 \\ 1 \\ ⋮ \\ 1 \end{matrix}), \forall m \in N, \text{ and } R_{0}^{🟉} = M {(Λ^{🟉})}^{- 1} (\begin{matrix} 1 \\ 1 \\ ⋮ \\ 1 \end{matrix}) .$

Proof.

We have

$(\begin{matrix} 1 \\ 1 \\ ⋮ \\ 1 \end{matrix}) = M (Λ^{m}) R_{0}^{m}, \forall m \in N, \text{ and } (\begin{matrix} 1 \\ 1 \\ ⋮ \\ 1 \end{matrix}) = M (Λ^{🟉}) R_{0}^{🟉} .$

Subtracting the two above quantities, we obtain

$0 = M (Λ^{m}) R_{0}^{m} - M (Λ^{🟉}) R_{0}^{🟉},$ (A4)

which is also equivalent to

$0 = M (Λ^{m}) R_{0}^{m} - M (Λ^{🟉}) [R_{0}^{🟉} - R_{0}^{m}] - M (Λ^{🟉}) R_{0}^{m},$

hence,

$R_{0}^{🟉} - R_{0}^{m} = M {(Λ^{🟉})}^{- 1} [M (Λ^{m}) - M (Λ^{🟉})] R_{0}^{m} .$

Setting

$L_{m} = M {(Λ^{🟉})}^{- 1} [M (Λ^{m}) - M (Λ^{🟉})],$

and using the continuity of $Λ \mapsto M (Λ)$ on $Ω$ , we obtain

$lim_{m \to + \infty} L_{m} = 0_{M_{n} (C)} .$

Now since

$R_{0}^{🟉} - R_{0}^{m} = L_{m} R_{0}^{🟉} - L_{m} [R_{0}^{🟉} - R_{0}^{m}],$

we deduce that

$∥ R_{0}^{🟉} - R_{0}^{m} ∥ \leq ∥ L_{m} ∥_{L (C^{n})} ∥ R_{0}^{🟉} ∥ + ∥ L_{m} ∥_{L (C^{n})} ∥ R_{0}^{🟉} - R_{0}^{m} ∥,$

hence, for all $m \geq 1$ large enough (i.e., satisfying $∥ L_{m} ∥_{L (C^{n})} < 1$ )

$∥ R_{0}^{🟉} - R_{0}^{m} ∥ \leq \frac{∥ L_{m} ∥_{L (C^{n})}}{1 - ∥ L_{m} ∥_{L (C^{n})}} ∥ R_{0}^{🟉} ∥,$

and the proof is completed. □

Appendix C. Identification of the Phenomenological Model

Here we assume that the daily number of reported cases has the following form

N (t) = N_{1} e^{λ_{1} t} + N_{2} e^{λ_{2} t} + N_{3} e^{λ_{3} t} + \dots + N_{m} e^{λ_{m} t},

(A5)

where $N_{1}, \dots, N_{m} \in C$ are non null, and $λ_{1}, \dots, λ_{m} \in C$ are pairwise distinct.

If we assume that $t \mapsto N (t)$ is known for all nonnegative integer values $t = 0, 1, 2, \dots,$ then we can compute the discrete Laplace transform

L (N) (λ) = \sum_{t = 0}^{\infty} e^{- λ t} N (t),

which is well defined for all $λ \in C$ such that

Re (λ) > max_{i = 1, \dots, m} Re (λ_{i}) .

By using (A5), we obtain

L (N) (λ) = \sum_{p = 1}^{m} \frac{N_{p}}{1 - e^{λ_{p} - λ}},

whenever $Re (λ) > max_{i = 1, \dots, m} Re (λ_{i})$ .

Let $k \in \{1, \dots, m\}$ be an integer such that

Re (λ_{k}) = max_{i = 1, \dots, m} Re (λ_{i}) .

Then we obtain

lim_{\begin{matrix} λ \to λ_{k} \\ Re (λ) > Re (λ_{k}) \end{matrix}} | L (N) (λ) | = + \infty .

The Laplace transform could be used to identify the unknown parameters $λ_{1}, \dots, λ_{m}$ in (A5). Then by combining this idea with linear regression of $t \mapsto e^{λ_{k} t}$ , we could identify the parameters $N_{k}$ , then step by step compute all the parameters of $N (t)$ in (A5).

In practice, we only know $t \mapsto N (t)$ on a finite time interval $t = 0, 1, 2, \dots, L$ . In that case, we can define the truncated Laplace transform as

L (N) (λ) = \sum_{t = 0}^{L} e^{- λ t} N (t)

and we have by (A5)

L (N) (λ) = \sum_{p = 1}^{m} N_{p} \frac{1 - e^{(λ_{p} - λ) (L + 1)}}{1 - e^{(λ_{p} - λ)}} .

The truncated Laplace transform does not make it possible to detect the eigenvalues $λ_{k}$ (we tested without success some examples with values of complex numbers coming from the present article). Identification of the eigenvalues $λ_{k}$ , whenever $t \mapsto N (t)$ is known only on a finite time interval, seems to be an intriguing open question.
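The closed form of the truncated transform can be compared with the direct sum in a few lines of Python. In this sketch the amplitudes, the exponents, the horizon $L = 31$ , and the evaluation point are hypothetical values, and the function names are ours.

```python
import numpy as np

def truncated_laplace_direct(n_values, lam):
    """Direct sum over t = 0, ..., L of exp(-lam * t) * N(t)."""
    t = np.arange(len(n_values))
    return np.sum(np.exp(-lam * t) * n_values)

def truncated_laplace_closed(amps, lams, lam, L):
    """Closed form: sum_p N_p (1 - z_p^(L+1)) / (1 - z_p), z_p = exp(lam_p - lam)."""
    z = np.exp(np.asarray(lams, dtype=complex) - lam)
    return np.sum(np.asarray(amps, dtype=complex) * (1 - z ** (L + 1)) / (1 - z))

# Hypothetical signal of the form (A5) with m = 2 terms.
amps_demo = np.array([100.0, 5.0], dtype=complex)
lams_demo = np.array([0.05, -0.02 + 0.3j])
L_demo = 31
t = np.arange(L_demo + 1)
n_demo = (amps_demo[np.newaxis, :]
          * np.exp(lams_demo[np.newaxis, :] * t[:, np.newaxis])).sum(axis=1)

lam_eval = 0.2 + 0.1j  # arbitrary evaluation point, distinct from the lams
direct = truncated_laplace_direct(n_demo, lam_eval)
closed = truncated_laplace_closed(amps_demo, lams_demo, lam_eval, L_demo)

# At lam = lams_demo[0] the direct (finite) sum is still finite.
at_pole = truncated_laplace_direct(n_demo, lams_demo[0])
```

Because the truncated sum has finitely many terms, it remains finite even at $λ = λ_{k}$ , in contrast with the blow-up of the full transform; this may be one reason why the eigenvalues are hard to detect in this way.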

Appendix D. About Residual 2 (t) in Section 3.3

In Figure A2, we observe that the average of ${Residual}_{2} (t) = {Residual}_{1} (t) - ϕ_{2} (t)$ is close to 0, but its histogram does not have the shape of a normal distribution. So, there might be some residual information in ${Residual}_{2} (t)$ .

Author Contributions

Conceptualization, J.D. and P.M.; methodology, P.M.; software, P.M.; writing—original draft preparation, J.D and P.M.; writing—review and editing, J.D and P.M.; All authors have read and agreed to the published version of the manuscript.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

There is no subject involved in the present study.

Data Availability Statement

No data were produced for this study.

Conflicts of Interest

The authors declare no conflict of interest.

Funding Statement

This research received no external funding.

Footnotes

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1.Demongeot J., Griette Q., Maday Y., Magal P. Kermack-McKendrick model with age of infection starting from a single or multiple cohorts of infected patients. arXiv. 20222205.15634 [Google Scholar]
2.Kermack W.O., McKendrick A.G. Contributions to the mathematical theory of epidemics: II. Proc. R. Soc. Lond. Ser. B. 1932;138:55–83. [Google Scholar]
3.Chao D.L., Halloran M.E., Obenchain V.J., Longini I.M., Jr. FluTE, a publicly available stochastic influenza epidemic simulation model. PLoS Comput. Biol. 2010;6:e1000656. doi: 10.1371/journal.pcbi.1000656. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Itoh Y., Shichinohe S., Nakayama M., Igarashi M., Ishii A., Ishigaki H., Ishida H., Kitagawa N., Sasamura T., Shiohara M., et al. Emergence of H7N9 Influenza A Virus Resistant to Neuraminidase Inhibitors in Nonhuman Primates. Antimicrob. Agents Chemother. 2015;59:4962–4973. doi: 10.1128/AAC.00793-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Alvarez L., Colom M., Morel J.D., Morel J.M. Computing the daily reproduction number of COVID-19 by inverting the renewal equation using a variational technique. Proc. Natl. Acad. Sci. USA. 2021;118:e2105112118. doi: 10.1073/pnas.2105112118. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Alvarez L., Morel J.-D., Morel J.-M. Modeling COVID-19 incidence by the renewal equation after removal of administrative bias and noise. Biology. 2022;11:540. doi: 10.3390/biology11040540. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Demongeot J., Griette Q., Magal P. SI epidemic model applied to COVID-19 data in mainland China. R. Soc. Open Sci. 2020;7:201878. doi: 10.1098/rsos.201878. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Griette Q., Demongeot J., Magal P. What can we learn from COVID-19 data by using epidemic models with unidentified infectious cases? Math. Biosci. Eng. 2021;19:537–594. doi: 10.3934/mbe.2022025. [DOI] [PubMed] [Google Scholar]
9.Griette Q., Demongeot J., Magal P. A robust phenomenological approach to investigate COVID-19 data for France. Math. Appl. Sci. Eng. 2021;2:149–218. [Google Scholar]
10.Nishiura H. Time variations in the transmissibility of pandemic influenza in Prussia, Germany, from 1918–19. Theor. Biol. Med. Model. 2007;4:20. doi: 10.1186/1742-4682-4-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Nishiura H., Chowell G. The Effective Reproduction Number as a Prelude to Statistical Estimation of Time-Dependent Epidemic Trends. In: Epidemiology G., Chowell J.M., Hyman L.M., Bettencourt A., Castillo-Chavez C., editors. Mathematical and Statistical Estimation Approaches. Springer; Dordrecht, The Netherlands: 2009. pp. 103–121. [Google Scholar]
12.Bakhta A., Boiveau T., Maday Y., Mula O. Epidemiological forecasting with model reduction of compartmental models. application to the COVID-19 pandemic. Biology. 2020;10:22. doi: 10.3390/biology10010022. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Waku J., Oshinubi K., Demongeot J. Maximal reproduction number estimation and identification of transmission rate from the first inflection point of new infectious cases waves: COVID-19 outbreak example. Math. Comput. Simul. 2022;198:47–64. doi: 10.1016/j.matcom.2022.02.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Pan Y., Zhang D., Yang P., Poon L.L.M., Wang Q. Viral load of SARS-CoV-2 in clinical samples. Lancet Infect. Dis. 2020;20:411–412. doi: 10.1016/S1473-3099(20)30113-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
15. Whittle P. Hypothesis Testing in Time Series Analysis. Almqvist & Wiksell; Uppsala, Sweden: 1951.
16. Whittle P. Prediction and Regulation. English Universities Press; London, UK: 1963.
17. Wiener N. Extrapolation, Interpolation, and Smoothing of Stationary Time Series. MIT Press; Cambridge, MA, USA: 1949.
18. Chan K.S., Tong H. A note on certain integral equations associated with non-linear time series analysis. Probab. Th. Rel. Fields. 1986;73:153–158.
19. Lim K.S., Tong H. A statistical approach to difference-delay equation modelling in ecology—Two case studies. J. Time Ser. Anal. 1983;4:239–267. doi: 10.1111/j.1467-9892.1983.tb00372.x.
20. Priestley M.B. Spectral Analysis and Time Series. Academic Press; Cambridge, MA, USA: 1981.
21. Ramsay J.O. Monotone Regression Splines in Action. Stat. Sci. 1988;3:425–441. doi: 10.1214/ss/1177012761.
22. Ramsay J., Hooker G. Dynamic Data Analysis: Modeling Data with Differential Equations. Springer; New York, NY, USA: 2017.
23. Tong H. Non-Linear Time Series: A Dynamical System Approach. Oxford University Press; Oxford, UK: 1990.
24. Tuan P.D. The estimation of parameters for autoregressive moving average models. J. Time Ser. Anal. 1984;5:53–68.
25. Priestley M.B. Evolutionary spectra and non-stationary processes. J. R. Stat. Soc. Ser. B. 1965;27:204–229. doi: 10.1111/j.2517-6161.1965.tb01488.x.
26. Malthus T.R. An Essay on the Principle of Population as It Affects the Future Improvement of Society, with Remarks on the Speculations of Mr. Godwin, M. Condorcet, and Other Writers. J. Johnson; London, UK: 1798.
27. Fisher R.A. The Wave of Advance of Advantageous Genes. Ann. Eugen. 1937;7:353–369. doi: 10.1111/j.1469-1809.1937.tb02153.x.
28. Lambert J.H. Beytrage Zum Gebrauche Der Mathematik Und Deren Anwendung. Verlage des Buchladens der Realschule; Berlin, Germany: 1765. p. 72.
29. Euler L. Recherches générales sur la mortalité et la multiplication du genre humain. Mémoires de l’Académie des Sciences de Berlin. 1767;16:144–164.
30. Lotka A.J. Relation between birth rates and death rates. Science. 1907;26:121–130. doi: 10.1126/science.26.653.21.b.
31. Leslie P.H. On the use of matrices in certain population mathematics. Biometrika. 1945;33:183–212. doi: 10.1093/biomet/33.3.183.
32. Hahn G.M. Mammalian cell populations. Math. Biosci. 1970;6:295–315. doi: 10.1016/0025-5564(70)90069-6.
33. Cazelles B., Chavez M., Magny G.C.D., Guégan J.F., Hales S. Time-dependent spectral analysis of epidemiological time-series with wavelets. J. R. Soc. Interface. 2007;4:625–636. doi: 10.1098/rsif.2007.0212.
34. Robinson E.A. Predictive deconvolution of time series with application to seismic exploration. Geophysics. 1967;32:418–484. doi: 10.1190/1.1439873.
35. Peacock K.L., Treitel S. Predictive deconvolution: Theory and practice. Geophysics. 1969;34:155–169. doi: 10.1190/1.1440003.
36. Robinson E.A., Treitel S. Geophysical Signal Analysis. Prentice-Hall, Inc.; Englewood Cliffs, NJ, USA: 1980.
37. Walden A.T., Hosken J.W.J. The nature of non-Gaussianity of primary reflection coefficients and its significance for deconvolution. Geophys. Prosp. 1986;34:1038–1066. doi: 10.1111/j.1365-2478.1986.tb00512.x.
38. Vinod D., Cherstvy A.G., Wang W., Metzler R., Sokolov I.M. Nonergodicity of reset geometric Brownian motion. Phys. Rev. E. 2022;105:L012106. doi: 10.1103/PhysRevE.105.L012106.
39. Ritschel S., Cherstvy A.G., Metzler R. Universality of delay-time averages for financial time series: Analytical results, computer simulations, and analysis of historical stock-market prices. J. Phys. Complex. 2021;2:045003. doi: 10.1088/2632-072X/ac2220.
40. Data from WHO. Available online: https://COVID19.who.int/WHO-COVID-19-global-data.csv (accessed on 20 July 2022).
41. Demongeot J., Oshinubi K., Rachdi M., Seligmann H., Thuderoz F., Waku J. Estimation of Daily Reproduction Numbers during the COVID-19 Outbreak. Computation. 2021;9:109. doi: 10.3390/computation9100109.
42. Powered by the Institute of Global Health, Faculty of Medicine, University of Geneva and the Swiss Data Science Center, ETH Zürich-EPFL. Available online: https://renkulab.shinyapps.io/COVID-19-Epidemic-Forecasting/_w_850fb011/?tab=jhu_pred&country=Japan (accessed on 20 July 2022).
43. Cori A., Ferguson N.M., Fraser C., Cauchemez S. A new framework and software to estimate time-varying reproduction numbers during epidemics. Am. J. Epidemiol. 2013;178:1505–1512. doi: 10.1093/aje/kwt133.
44. Scire J., Nadeau S., Vaughan T., Brupbacher G., Fuchs S., Sommer J., Koch K.N., Misteli R., Mundorff L., Götz T., et al. Reproductive number of the COVID-19 epidemic in Switzerland with a focus on the Cantons of Basel-Stadt and Basel-Landschaft. Swiss Med. Wkly. 2020;150:w20271. doi: 10.4414/smw.2020.20271.
45. Kawasuji H., Takegoshi Y., Kaneda M., Ueno A., Miyajima Y., Kawago K., Fukui Y., Yoshida Y., Kimura M., Yamada H., et al. Transmissibility of COVID-19 depends on the viral load around onset in adult and symptomatic patients. PLoS ONE. 2020;15:e0243597. doi: 10.1371/journal.pone.0243597.
46. Kim S.E., Jeong H.S., Yu Y., Shin S.U., Kim S., Oh T.H., Kim U.J., Kang S.J., Jang H.C., Jung S.I., et al. Viral kinetics of SARS-CoV-2 in asymptomatic carriers and presymptomatic patients. Int. J. Infect. Dis. 2020;95:441–443. doi: 10.1016/j.ijid.2020.04.083.
47. Ioannidis J.P., Cripps S., Tanner M.A. Forecasting for COVID-19 has failed. Int. J. Forecast. 2022;38:423–438. doi: 10.1016/j.ijforecast.2020.08.004.
48. Ducrot A., Griette Q., Liu Z., Magal P. Differential Equations and Population Dynamics I: Introductory Approaches. Springer Nature; Berlin, Germany: 2022.
49. Kirkland S. On the spectrum of a Leslie matrix with a near-periodic fecundity pattern. Linear Algebra Its Appl. 1993;178:261–279. doi: 10.1016/0024-3795(93)90345-O.

Data Availability Statement

No data were produced for this study.
