Abstract
It is known that the parameters in the deterministic and stochastic SEIR epidemic models are structurally identifiable. For example, from knowledge of the infected population time series I(t) during the entire epidemic, the parameters can be successfully estimated. In this article we observe that estimation will fail in practice if only infected case data during the early part of the epidemic (prepeak) is available. This fact can be explained using a well-known phenomenon called dynamical compensation. We use this concept to derive an unidentifiability manifold in the parameter space of SEIR that consists of parameters indistinguishable from I(t) early in the epidemic. Thus, identifiability depends on the extent of the system trajectory that is available for observation. Although the existence of the unidentifiability manifold obstructs the ability to exactly determine the parameters, we suggest that it may be useful for uncertainty quantification purposes. A variant of SEIR recently proposed for COVID-19 modeling is also analyzed, and an analogous unidentifiability surface is derived.
Keywords: epidemic models, SEIR models, identifiability, 92D30, 92C60
1. Introduction.
In nonlinear systems, identifiability of parameters depends critically on location in phase space. In this article, we point out a particularly vivid illustration of this fact that occurs in SEIR (susceptible, exposed, infected, removed) modeling of epidemics. While the SEIR parameters are identifiable from the infected population I(t) if the entire epidemic is observed, the ability to infer parameters from the prepeak portion of the epidemic is strictly limited, due to the approximately linear dynamics that occur early in the epidemic.
We explain this failure of identifiability in section 3, where we show that for a given instance of the infected time series I(t) early in the epidemic, there are multiple solutions with various parameter values that are approximately consistent with the same I(t). Moreover, we show that these multiple solutions form a two-dimensional unidentifiability manifold that indexes the alternative parameter sets that are consistent with I(t). The alternate parameter sets on this surface share the growth rate of the epidemic (the leading eigenvalue of the linearized system) even though their respective parameter values vary widely. Thus estimating all parameters solely from knowledge of the infected cases during the prepeak portion of the trajectory is not possible in practice with any parameter estimation algorithm.
Since the unidentifiability set is two-dimensional, it follows that two of the three unknown parameters (in the basic SEIR) must be known a priori in order to determine the third. In particular, the reproductive number R0, which is often derived from two of the SEIR parameters, is in practice not identifiable from I(t) alone.
Unidentifiability is an underappreciated issue in infectious disease modeling. The authors of the comprehensive review [16] state that mathematical modeling of epidemics “usually over-parameterizes the model and ignores parameter identifiability, which makes it difficult to directly fit such models to data.” We corroborate this opinion by showing that it is impossible in practice to determine more than one unknown SEIR parameter from observations of I(t) preceding the peak stage of the epidemic, and we exhibit the underlying mathematical reasons for this. While over-parameterization is rampant in the literature, our focus here is deliberately on a reasonably parameterized epidemic model, which suffers from unidentifiability only in a crucial region of phase space.
We will refer to this deficiency as trajectory-dependent unidentifiability. The difficulty stems from a phenomenon called dynamical compensation [24], as identified in linear compartmental models by Bellman and Aström [2] in 1970. In the terminology of [24], it is a structural unidentifiability [21, 25] in the linear model that approximates SEIR in the early stages of an epidemic and that gradually disappears as the nonlinearities become significant as the epidemic progresses (see Figure 4). Determination of the full parameter set is possible if I(t) can be observed through the peak of the infection. In fact, it is well known (see, e.g., [25]) that the parameters of SEIR are formally identifiable from the entire I(t) trajectory.
To illustrate identifiability issues that arise in applications, we employ two independent approaches to parameter estimation. One is a parameter estimation algorithm based on data assimilation from partial observations, and the other is an implementation of MCMC (Markov chain Monte Carlo) techniques [11]. Both are introduced in section 2.2. These are two choices from several alternatives that are in common usage, some of which are based directly on Bayesian inference [1], while others use data assimilation in more sophisticated ways [8, 12, 18]. The principal unidentifiability results of this article are independent of the method of parameter estimation.
Our analysis was preceded by work on dynamical compensation for linear systems (see, e.g., [27]) that shows how to find alternative parameter sets whose solutions do not change the observable I(t). These solutions are designed to match the true underlying solution even during the initial and often unobservable transient at the outset of the epidemic. However, by ignoring rapidly decaying dynamics at early stages, our analysis uncovers a larger set of alternative parameter combinations that match observations. Somewhat counterintuitively, it is exactly this expanded set of parameters, not the more restrictive parameter set [20], that appears to be explored by parameter estimation methods. This indicates that our simplifying assumptions allow us to correctly anticipate the performance of these methods (see Figure 5).
Despite the fact that the unidentifiability surface shows why exact determination of parameters is impossible during the prepeak interval, it has a useful purpose in uncertainty quantification, because it constrains the set of alternative parameters that also generate I(t). Assume a parameter estimation algorithm is used to calculate a parameter set p from an observed I(t) early in an epidemic. Since the system is unidentifiable, another algorithm may provide another parameter set p′. However, we can expect it to lie on the unidentifiability surface of p, which is a constraint. We show in section 4.3 that the systems corresponding to parameter sets chosen from the surface have dynamics much closer to the system generated by p than those chosen off the surface. By studying these nearby systems, we may be able to gain knowledge about the uncertainty of the system with estimated parameter set p′.
As the complexity of parameterized dynamical systems models has steadily increased over the past two decades, the question of identifiability of parameters has become critical. In particular, the nonlinearities inherent in modern dynamical models significantly complicate the problem, leading to the considerable recent attention on the limits and analysis of identifiability [4, 26, 5, 20, 19, 9, 15]. In this work, we address a gap in the literature that is easily overlooked by global analysis, which is whether certain parts of trajectories, such as the outset of an epidemic, can lack identifiability from limited information, even when the entire trajectory considered in full possesses identifiability. Our goal is to point out this vulnerability in a particular common case and to encourage modelers to conduct much broader searches for similar effects.
In section 2 we review the deterministic and stochastic SEIR models and introduce two parameter estimation approaches. In section 3 the notion of dynamical compensation is explored, and its existence in a linearized version of SEIR is observed. Its relevance to the problem of identifiability of parameters in the full nonlinear SEIR is noted in section 4. In section 5, the COVID-19 model of [18] is studied. An obstruction to identifiability caused by dynamical compensation is also observed in this model.
2. Identifying parameters in SEIR.
2.1. The deterministic and stochastic SEIR models.
The deterministic version of the SEIR model [10, 14] that we will consider is
(2.1) |
where the variables S, E, I, and R represent the populations of susceptible, exposed, infected, and removed patients, respectively, and denotes the total population. Time is measured in days. We use the simplest, or SEIR-without-vital-statistics, model, which assumes that N is constant with no births and deaths. There are more complex versions with additional parameters, but the identifiability issues we want to describe occur even for this simplest model. The sole nonlinearity is the βIS/N term, which moves patients from the susceptible compartment to the exposed compartment according to transmission rate coefficient β.
We will interpret the model in the following way. The parameter α is the time constant of movement from exposed to infected; thus we assume that on average, the patient spends 1/α days as exposed before transitioning to infected, where we assume viral shedding begins. We will also make the assumption that symptoms are present in patients in the I compartment, so that the case can be observable for the first time. After 1/γ days in the I compartment, on average, the patient is removed from the population and does not return to the susceptible class.
Our principal interest is in determining what information can be inferred from measured reports of infected cases I(t). We address two obvious limitations of these assumptions. First, perhaps not all infected cases are reported. Thus, the true infected number may be c1I instead of I. Second, a portion of the infected cases may be asymptomatic and are not reported for this reason. Thus, the true infected number may be c2c1I. In either case, the true number of infected may not be knowable. If the true number of infected is proportional to the reported I, the meaning of the contact transmission parameter β will be changed. However, many of the purposes of using the model, such as to forecast future I(t), may still proceed unaffected.
In addition to the deterministic version, we will also consider the SEIR model as a set of stochastic differential equations with Poisson noise. In this version, we will calculate trajectories as follows. For each time step, the right-hand side of the equations will be evaluated by selecting from a Poisson distribution and then integrated using an Euler method step. In other words, the values
are chosen to represent the contribution of the right-hand side at each step, i.e.,
(2.2) |
This version treats the SEIR model as a stochastic system for greater fidelity. However, our main conclusions about identifiability will be relevant for both the deterministic and stochastic versions.
2.2. Parameter estimation.
Parameter estimation is customarily achieved by locating, implicitly or explicitly, the optimum of some auxiliary function that measures the fitness of the parameters. In some methods, the likelihood or marginal probability is maximized, while in others, an error or loss function is minimized.
In one method for estimating parametersfrom daily reports of the single observable I(t), we will choose a particular loss function based on data assimilation and explicitly minimize it. This approach will be useful for illustrating the geometry of the minima of the loss function in two different parts of the SEIR trajectory. Our choice for the loss function will be the data assimilation error in I(t) incurred while using the proposed set of parameters to optimally reconstruct the trajectory from the observed I(t). The use of data assimilation to reconstruct unobserved variables is the basis of modern numerical weather prediction, and it has started to appear in epidemic modeling [6, 8, 12]. For the deterministic SEIR, we employ a standard EnKF (ensemble Kalman filter) [23, 22] to reconstruct the dynamics. For the stochastic SEIR, we use an EnKF tailored to Poisson noise instead of the standard Gaussian assumption. The EnKF used for this purpose is based on the PKF (Poisson Kalman filter) from [7].
Data assimilation gives a way of reconstructing all variables of a differential equations model from partial observations, for example, by measurements of one key variable. For SEIR model (2.1), if the parameters β, α, and γ are known, the observable I(t) or, alternatively, the daily changes , are in general sufficient for reconstructing the other three variables S, E, and R. Figure 1(a) shows a trajectory of a stochastic SEIR model (2.1) with parameters β = 1.1, α = 0.2, and γ = 0.5, and with initial conditions , , , . The inputs to the data assimilation algorithm are the model, the exact parameters, and the daily reports of new infections . The assimilation algorithm uses the initial condition S = 106, E = 0, I = 0, R = 0. That is, it is allowed to know the (constant) total population but no information about the initial caseload. The EnKF is used to estimate the most likely values of S(t), E(t), I(t), and R(t) given the reports . Figure 1(b) shows the resulting reconstructed trajectory, a reasonably accurate version of the original.
If the parameters are not known and incorrect parameters are used in the model, the reconstruction in general will be farther from the original. This leads to a convenient loss function to consider for the purposes of parameter estimation. Let denote the mean squared difference between the observed and the reconstructed from the EnKF, over a time interval [T1, T2]. Then minimization of L as a function of the parameters should lead to the correct, or generating, parameters.
To begin, we carried out this idea on the deterministic SEIR model (2.2) with a standard simplex minimization algorithm [17]. We started the simplex algorithm with 1000 starting guesses for the parameters β, α, and γ that varied from the exact values by about 50%. Figure 2(a) shows the cumulative results of the minimization procedure for a trajectory of length 100 days, using two different intervals of observations, [T1, T2] = [0, 50] or [50, 100], with 1000 realizations of starting parameter guesses. There is a dramatic difference, depending on whether the time interval [0, 50] or [50, 100] is used for the input I(t). The red dotted curve is a histogram of approximate parameters using from the interval [0, 50]. The black histogram uses the interval [50, 100]. While the histogram shows no identifiability on [0, 50], on the interval [50, 100] the method finds the correct parameters with less than 0.1% error on over 95% of the 1000 attempts.
The success of this simple approach to parameter estimation on [50, 100] (or the complete interval [0, 100], not shown) is due to the fact that the SEIR model (2.1) is structurally identifiable from I(t), as long as the peak of the epidemic can be observed. However, one can see that this approach fails on the outbreak part of the epidemic, as shown by the histogram in red. On the time interval [0, 50], the input I(t) is not sufficient for constraining the three parameters.
Figure 2(a) also shows a test of a completely different approach to parameter estimation. We applied MCMC to sample the posterior density of the parameters given the observations, namely for t in the same intervals as above. In the deterministic SEIR (used for the MCMC computation of the posterior) the likelihood is a product of Poisson densities, which allows easy sampling of the true posterior . In Figure 2(a) we show the three marginals of the posterior. We notice similar qualitative behavior for this estimator, namely that the parameters are identifiable from the second half [50, 100] of the epidemic(blue curve) but are almost completely unidentifiable from I(t) during the first half [0, 50] (green curve).
Figure 2(b) returns to minimization of the data assimilation error as above but applied to the stochastic SEIR model and using a Poisson-based EnKF. The histogram shows the variation over 1000 different realizations of Poisson noise. For the interval [50, 100], the variation is increased for stochastic SEIR in comparison to the deterministic SEIR, but the estimates are unbiased around the correct parameter settings. For [0, 50], no meaningful estimation occurs.
In summary, for both deterministic and stochastic versions of the SEIR model, both data assimilation-based and MCMC-based algorithms are able to identify the three parameters given I(t) from the time interval [50, 100], and fail on the interval [0, 50]. The intervals [0, 50] and [50, 100] are chosen to be representative of intervals for which identifiability fails and succeeds, respectively. Similarly chosen intervals show the same results; that is, early in an epidemic before the peak is reached, there is a structural reason why the parameters will not be identifiable. We address that reason in the next two sections.
3. Dynamical compensation in linear models.
Later we will address the fact that during the prepeak part of the epidemic, the SEIR model is approximately linear, and E and I are approximately proportional to each other. The goal of this article is to examine how this fact imposes a constraint on our ability to infer parameters from data, in particular from observations of I(t). The mechanism that causes this is called dynamical compensation. For linear compartmental systems, this phenomenon was reported as early as [2, 3].
3.1. Asymptotic behavior of linear models.
Consider a linear initial value problem consisting of a vector differential equation , satisfying initial conditions , where . Assume A has distinct real eigenvalues.Then solutions are of the form
where are the eigenvalues of A. Because of the exponential form of the solutions, as t moves away from zero, the solutions begin to closely approximate
Assuming , this means that for each for some constant ci.
Example. Consider the linear initial value problem
(3.1) |
which we write as , where
(3.2) |
Let be the diagonalization, where the columns of P are eigenvectors of A. The diagonalization exists because implies that A has distinct real eigenvalues . The solution is
(3.3) |
We can consider separate cases, depending on the constants and B2. Although the Bi have no particular physical significance, they are formally significant because they represent linear combinations of E0 and I0 that grow exponentially with exponent λi, respectively. Thus if one of the Bi is zero, the solutions ) and I(t) will evolve exactly proportionally. If both are nonzero, they will still behave asymptotically proportional to each other, with exponent λ1, the larger eigenvalue.
To be more precise, in what we will call the exactly proportional case, one or both of the Bi are zero. If , the solution is identically zero. If one of the Bi = 0 or, equivalently, the term of the solution, is absent, then for some constant c and for all t.
In what we call the approximately proportional case, both and the solution will be
meaning that asymptotically, where . Note that in all cases, , with the approximation improving exponentially in time.
3.2. Identifiability in linear systems.
A general approach to assessing identifiability in linear systems is suggested in [27]. To search for alternative solutions to (3.1) with the same output I(t) but different E(t) and different parameters , define the coordinate change z = Sx for a nonsingular matrix
Specifically, we seek an S that satisfies
for some F. The new variable z will reproduce I(t) as its second entry, using a “dynamically compensating” F(t) as its first entry, with a different set of parameters, determined below.
This equation is expressible as . From (3.3), this constraint is
Transposing yields
for all t. Now we split the argument into two cases, depending on the initial conditions (see (3.3)).
Case 1 (approximately proportional). In this case, and . Then for two different times t1, t2, the rows of the leftmost matrix in
are linearly independent. Since all matrices on the left side are nonsingular, s21 = 0 and s22 = 1, and therefore,
With this change of coordinates, we can consider the alternative system to (3.1) as , where
(3.4) |
and where we have set to match the desired form (3.1). This gives a family of alternative solutions of (3.1) sharing I(t), but with different parameters and different E(t), that are indexed by the single parameter s11. The revised E(t) is . These solutions exactly match I(t) for all t ≥ 0 and satisfy
(3.5) |
The approximately proportional case provides a one-dimensional family of alternative solutions. As promised in [27], these alternative solutions show that in the approximately proportional case, the parameters of (3.1) are unidentifiable from I(t). That is, on the basis of I(t) alone, one cannot distinguish among the infinite set of solutions of (3.5). If our information about the system (3.1) or its parameters is to be inferred from I(t), the existence of multiple solutions consistent with the observations of I(t) will make recovering the parameters effectively impossible.
Case 2 (exactly proportional). Now assume that either B1 or B2 is zero. Then for all t.
The proportionality constant c can be calculated from the equations and depends only on the parameters α, β, γ. Keeping the approximation and substituting , we get
which implies
(3.6) |
The largest solution c of this quadratic equation is real and positive, assuming that .
Lemma 1.
Let , and let c > 0 be the unique positive solution of the quadratic equation
(3.7) |
Define and . Let lie on the surface in defined by
(3.8) |
and define . Then for all satisfying (3.8), the set satisfies
(3.9) |
Proof. Set , so that .
(i) Note that the right-hand side of the first equation is
We can calculate
where we have used the notation , , and used (3.8) to arrive at the last line. Dividing by A recovers . The time derivative of F(t) is , which verifies the first differential equation of (3.9).
(ii) The right-hand side of the second equation is
by the quadratic equation (3.6). This agrees with , verifying the second differential equation. □
The significance of the lemma is that in Case 2, (3.8) reveals a two-dimensional family of solutions of (3.9) with asymptotically identical I(t), further complicating the identifiability of the parameters. There are substantially more alternative solutions in the exactly proportional Case 2, a two-dimensional set instead of a one-dimensional set found in Case 1. However, since the asymptotic convergence is exponential, and because infected case counts are often noisiest at the outset of an epidemic, the difference is likely to be insignificant in practical applications. Curiously, we observe in the next section that the alternative parameter sets found by standard estimation procedures appear to fill out the two-dimensional set found in Case 2, even though as a solution of a system of linear equations, the initial conditions are less generic than those in Case 1. The fact that the solutions that are mistakenly mirrored by a parameter estimation algorithm often correspond to nongeneric choices of solutions will be opaque to the modeler—there is no way to tell whether the solution being reproduced by data assimilation is generic or nongeneric. One can visualize the comparison in Figure 5.
4. Applications to identifiability.
In this section, we apply our knowledge of dynamical compensation in linear compartmental models from the last section to the nonlinear SEIR model. We find that in using a linear approximation valid in the prepeak portion of the epidemic, it is the exactly proportional case (Case 1 above) that turns out to be the most informative on identifiability.
4.1. Unidentifiability in prepeak SEIR.
The SEIR model (2.1) is a coupled set of nonlinear differential equations, but at the beginning of the epidemic, S ≈ N.As the first cases of exposed individuals begin to transition into the infected class, note that the second and third equations approximate a linear system
This approximation was exploited in [13] to derive a formula for the reproductive number in the case when β is unknown but the latent and infectious periods and and the exponential growth rate λ1 from (3.3) can be independently estimated.
According to the previous section, we will observe the asymptotics of the approximately linear dynamics,
for some c as t moves away from 0. In fact, this behavior is apparent in Figure 3(b), which is a magnification of panel (a). The trace of I(t) appears to be a constant proportion of E(t), and this is confirmed in Figure 3(c), where the ratio is plotted versus time.
Figure 4 shows the results of a parameter estimation computation using the data from Figure 3, which sets β = 1.1, α = 0.2, and γ = 0.5. We run data assimilation on the time interval [0, T] using only the daily case numbers as input, for various choices of T. To simplify the situation, we will fix the parameter γ = 0.5 to be the exact value and attempt to estimate β and α. We accomplish this by minimizing L(β, α, 0.5) as described in section 2.2.
The function L(β, α, 0.5), sampled at 10,000 random values, is displayed in Figure 4, projected onto the β and α axes, respectively, for ease of analysis. For “prepeak” values of T, the parameters β and α are not well estimated. As T increases and approaches the epidemic peak 60 < T < 80, the parameter estimates gradually become quite accurate. This corroborates our finding in Figure 2 that parameter estimation fails to isolate correct parameters early in the epidemic.
The lesson from Figure 4 is that as the proportion of susceptibles decreases from 1, the error bounds on the parameter estimates will grow. The parameters are identifiable for [0, T] for T well above 50 due to the fact that , and the parameter estimation will degrade continuously as T is decreased. This degradation is shown explicitly in Figure 4.
4.2. The unidentifiability manifold.
The dynamical compensation results of the previous section explain the phenomenon seen in Figure 4. The unidentifiability manifold, in this case a surface, is plotted in Figure 5. The red dots identify the parameter points whose evaluated loss function computed on the time interval [0, 50] is in the lowest 1% of points (among 10,000 random points sampled ). The points lie extremely close to the unidentifiability surface (3.8). The wide distribution of the points shows the impossibility of estimating the generating parameter set (β = 1.1, α = 0.2, γ = 0.5) with any accuracy. The color shading on the surface corresponds to reproductive number . We note that R0 is not significantly constrained by the parameters with minimal loss function.
The MCMC approach introduced in section 2.2 shows a similar story. In this case, we use observations of the deterministic model (2.2) and apply MCMC using a single realization of I(t) in the time interval [0, 50] as observable. The true parameters lie inside the envelope of the posterior, as shown in Figure 5(b). The Metropolis–Hastings algorithm within MCMC rejects thousands of proposals that do not lie on the surface and accepts only those that do.
Since the unidentifiability surface is a two-dimensional set, we conclude that even if one of the parameters is known, the other two are not identifiable—the set of possible parameters will only be reduced to a one-dimensional curve. For example, with fixed γ, the data assimilation error on the interval [0, 50] has a poorly defined minimum as a function of (β,α). To illustrate this, we fix in the unidentifiability manifold equation (3.8) to yield the curve . This curve is plotted in blue in Figure 6(a). The plotted red points are the 1% of (β, α) pairs with smallest values of the loss function. Instead of a localized ball near the true value (β, α) = (1.1, 0.2), there is a curve of pairs equally fitting the observed data, which are therefore indistinguishable from the loss function. These pairs form the flat minima of the loss function seen in Figure 4 for times T preceding the epidemic peak.
Similarly, if we fix a different parameter, we see the same phenomena when trying to estimate the other two parameters. For example, fixing , the slice through the unidentifiability manifold (3.8) is , a line. Figure 6(b) shows the line in blue, with the near-minimal pairs of the loss function shown as red dots. Finally, fixing yields the curve from the manifold (3.8), shown in Figure 6(c).
On the other hand, fixing the first two parameters on the unidentifiability surface implies that the third can be determined. That is, if we have knowledge of the true α and γ, setting and in (3.8) implies that , so there is a unique solution with those parameter settings. Thus even on the prepeak interval [0, 50] in the example, if α and γ are known, then β is structurally identifiable from the observations of I(t).
Of course, there are many other figures of merit that could be minimized to determine the parameters from the observed I(t); these could be based on data assimilation errors, maximization of likelihood, or some other probabilistic measure. However, during the prepeak part of the epidemic, they will all be susceptible to the alternative solutions that are equally compatible with I(t), implicit in dynamical compensation.
A perhaps more intuitive, if less geometric, view of the unidentifiability surface is that it is the set of parameters for which the leading eigenvalue λ1 of the resulting system is equal to the λ1 (see (3.3)) of the underlying system that generated I(t). (In fact, this leads to an alternate derivation of (3.8).) Thus, if we trust the parameter estimation algorithm to return a parameter set that is at least on the unidentifiability surface, then it will have the correct λ1. Even if the parameters are wrong, this fact can be exploited for uncertainty quantification purposes, as we discuss in the next section.
4.3. Uncertainty quantification.
The unidentifiability surface (3.8) is useful for theoretical reasons to show the impossibility of isolating the original parameter set p from the infinity of other systems that approximately share I(t) during the beginning portion of an epidemic. Next, we suggest that it may be useful in practice for uncertainty quantification.
It turns out to be a helpful fact that the unidentifiability surface generated by an arbitrary parameter set p indexes the set of parameter sets that share the observed I(t). Assume that we use a parameter estimation algorithm with input I(t), and estimate the parameter set as p′ and that lies on the surface. The roles of p and p′ are symmetric, so we can also consider that p lies on the unidentifiability surface generated by p′. That means we can reverse the roles: switch the primed and unprimed variables in (3.8), noting that c must be replaced by c′ computed from (3.7) with unprimed variables replaced with primed variables.
For instance, assume the correct parameters are but that a parameter estimation algorithm instead returns, for example, an estimate that lies on the unidentifiability surface. The set p′ given here is just for illustration; in this case it was chosen by making an arbitrary choice of α′ and γ′ and then computing the corresponding β′ lying on the surface (3.8). Next, we ignore the origin of p′ and consider what we can infer from it. In Figure 7(a), we produce 30 trajectories of the stochastic SEIR by perturbing p′ 10% to new values . We have overlaid as a yellow curve the original trajectory that produced I(t), generated by the parameters p. There is a large amount of variability in the 30 trajectories.
Figure 7(b) shows trajectories of 30 stochastic SEIR systems, where we have randomly changed α′ and γ′ by 10% to α″ and γ″, but this time we computed the″ that lies on the surface. We reiterate that the surface, being the corresponding β unidentifiability surface of p′, can be computed from p′ and is therefore known to us, even if the original p is unknown. The ensuing trajectories are much more faithful to the original system, given that they share the leading dynamical eigenvalue λ1. Thus, even starting with a mildly incorrect parameter set p′, by querying nearby points p″ on its unidentifiability surface, we see reasonable facsimiles of the underlying dynamics generated by the original parameters p.
Note that there are limitations on how far the incorrect parameters p′ can be from the original parameters p in order for the trajectories produced in this way to be representative of the original systems. In particular, the constant c in the proportionality is in general different for the new system, and so its trajectories will be different. Our informal observation is that if the alternative parameters are within about 20% of the originals, the approximating trajectories may still be useful for uncertainty quantification.
This observation opens up the possibility of using the unidentifiability surface for uncertainty quantification purposes by studying the spread of nearby solutions as a function of uncertainty in the parameters. If an uncertainty in the estimate can be determined from the algorithm generating the estimate, bootstrapping techniques can be used to move along the surface (3.8) and quantify the variance of key aspects of the family of nearby trajectories. We leave a more complete analysis of this phenomenon, and its possible utility to forecasting, to future investigation.
5. Identifiability in other SEIR-like models.
The same identifiability problems are likely to occur in models similar to SEIR. We describe the details for one such example that was proposed recently in [18].
5.1. The SEUIR model.
The epidemic model in [18] was used to represent populations in a specific city and included extra external inputs from other cities.The underlying SEIR-style model is
(5.1) |
with constant total population , where . The new variable U represents unreported infected cases, while I is reserved for reported infected cases. For SEIR, we will consider I(t) as the observable variable.
For simplicity, we rewrite the parameters as , , to arrive at the equivalent but more user-friendly system
(5.2) |
where , with parameters , and , which we call the SEUIR model. Figure 8 displays a sample trajectory of the SEUIR model.
5.2. Unidentifiability in SEUIR.
Again consider the prepeak portion of the epidemic, where . Then there is an approximating linear system
(5.3) |
which will exhibit dynamical compensation. Given our experience with SEIR, we consider solutions of (5.3) where E, U, and I are proportional, say and . One checks that if are such solutions, then , where
(5.4) |
It will be convenient in proving the lemma below to note the identities
(5.5) |
Lemma 2.
Let , and let be solutions of (5.3). Further, let , and consider the functions
where b and c are defined as in (5.4). Assume that , and d′ lie on the surface defined by
(5.6) |
where we denote , , .
Then for all satisfying (5.6) and for any , the set (, , ) satisfies
(5.7) |
Proof. The left-hand side of the first equation is
The right-hand side is
where we used the unidentifiability surface equation (5.6) and used the identity from (5.5) in the penultimate line. This matches the left-hand side.
The second and third equations use only the definitions of F and V. For the second equation,
and the right-hand side is
which agrees with the left side by (5.5). The left side of the third equation is
which matches the right side, by
by (5.5). □
Figure 9 shows a plot of the unidentifiability surface in , along with a plot of the 1% of random parameter sets (β, z, d) that have the lowest loss function values from the nonlinear SEUIR model, using I(t) as input, on the prepeak interval [0, 50]. The generating parameters were β = 0.9, z = 0.3, w = 0.2, and d = 0.5. These parameter sets will be practically indistinguishable when attempting parameter estimation with I(t) only over this interval. Here the w parameter value has been fixed at the generating value w = 0.2.
Figure 10 shows the results of repeating the sampling of the loss function while fixing w = 0.2 and a second parameter. For example, in Figure 10(a) the best 1% of parameter sets (β, z) are plotted as dots, along with the relation (5.6), with Δ d set to 0. The relation, plotted as a curve, is and matches the data accurately. In Figure 10(b), the parameter , along with in (5.6), gives the line . In Figure 10(c), with the curve is .
The identifiability problem with SEUIR is arguably worse than with SEIR, since a glance at the unidentifiability relation (5.6) shows no Δ w term. Thus the multiple solutions of Lemma 2 exist for any value of . These solutions have independent of w′, while also having adjusted F(t) and V(t) that do depend on w′. This results in an added dimension of unidentifiable parameters. In other words, Figures 9 and 10 can be reproduced identically if w′ is fixed at an inaccurate value . This means that the actual unidentifiability set is a two-dimensional set in of points satisfying (5.6) and all w′ such that .
A final comment about the SEUIR model (5.2) is that one can introduce the new variable and arrive at the equivalent SEIR system
where . This may explain the disappearance of the parameter w′ in the unidentifiability surface equation (5.6). However, under the model (5.2), the assumption is that I(t), not Y (t), is observed.
6. Discussion.
In common epidemic models, practical identifiability from the infected population variable I(t) depends strongly on what portion of the population trajectory is observed. In the prepeak interval, when S (t) ≈ N, the linear approximation to the full model admits an infinity of solutions with the same I(t) by adjusting the unobserved population variables to compensate, a property known as dynamical compensation. The combinations of parameters that allow for this compensation are given by (3.8) and (5.6) in Lemmas 1 and 2, in what we call the unidentifiability surface, or unidentifiability manifold. The multiple solutions that coexist in this scenario will defeat any parameter estimation method that relies on observing only I(t) to find the complete set of parameters. Since the unidentifiability manifold is two-dimensional, at least two more independent pieces of information are necessary to isolate any of the parameters. This fact also applies to most combinations of the parameters, such as the reproductive rate R0. These obstructions to identifiability disappear if the entire time history, including the peak of the epidemic, can be observed.
We have shown that these identifiability obstructions exist for the popular SEIR model and another, more recent model. It is likely that any other closely related versions of SEIR, including compartmental models such as SEIRS, SIRD, etc. and models that include vital dynamics, will harbor similar obstructions, due to the same phenomenon.
It is notable that the unidentifiability surfaces found for both models are codimension one in parameter space. We conclude that if all but one of the parameters are known a priori, then the unknown parameter can be determined from an estimation process, such as the minimization technique used here, even during the prepeak portion of the epidemic. We have also proposed that knowledge of the unidentifiability surface may be crucial for the development of practical uncertainty quantification for parameter estimates, although pursuit of that direction is beyond the scope of this article.
Acknowledgments.
We thank the reviewers for insightful comments that led to improvement of the article.
Funding: This work was supported by a National Institutes of Health Director’s Transformative Award (1R01AI145057) and by the National Science Foundation (DMS-1723175, DMS-1854204, and DMS-2006808). The fourth author was supported in part by the Covid Virus Seed Fund (award 8013) from the Institute for Computational and Data Sciences and the Huck Institute at the Pennsylvania State University.
REFERENCES
- [1].Adhikari R, Bolitho A, Caballero F, Cates M, Dolezal J, Ekeh T, Guioth J, et al. , Inference, Prediction and Optimization of Non-Pharmaceutical Interventions Using Compartment Models: the PyRoss Library, preprint, https://arxiv.org/abs/2005.09625, 2020.
- [2].Bellman R. and Astrom K, On structural identifiability, Math. Biosci, 7 (1970), pp. 329–339. [Google Scholar]
- [3].Berman M. and Schoenfeld R, Invariants in experimental data on linear kinetics and the formulation of models, J. Appl. Phys, 27 (1956), pp. 1361–1370. [Google Scholar]
- [4].Berthoumieux S, Brilli M, Kahn D, de Jong H, and Cinquemani E, On the identifiability of metabolic network models, J. Math. Biol, 67 (2013), pp. 1795–1832, 10.1007/s00285-012-0614-x. [DOI] [PubMed] [Google Scholar]
- [5].Chis O-T, Banga JR, and Balsa-Canto E, Structural identifiability of systems biology models: A critical comparison of methods, PloS One, 6 (2011), e27755. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [6].Chowell G, Fitting dynamic models to epidemic outbreaks with quantified uncertainty: A primer for parameter uncertainty, identifiability, and forecasts, Infectious Disease Model.,2 (2017), pp. 379–398. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [7].Ebeigbe D, Berry T, Schiff S, and Sauer T, A Poisson Kalman Filter to Control the Dynamics of Neonatal Sepsis and Postinfectious Hydrocephalus, preprint, https://arxiv.org/abs/2003.11194v1, 2020.
- [8].Engbert R, Rabe M, Kliegl R, and Reich S, Sequential data assimilation of the stochastic SEIR epidemic model for regional COVID-19 dynamics, Bull. Math. Biol, 83 (2021), 1, 10.1007/s11538-020-00834-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [9].Karlsson J, Anguelova M, and Jirstrand M, An efficient method for structural identifiability analysis of large dynamic systems, IFAC Proc. Vol., 45 (2012), pp. 941–946. [Google Scholar]
- [10].Kermack W. and McKendrick A, A contribution to the mathematical theory of epidemics, Proc. R. Soc. Lond. A, 115 (1927), pp. 700–721. [Google Scholar]
- [11].Li M, Dushoff J, and Bolker B, Fitting mechanistic epidemic models to data: A comparison of simple Markov chain Monte Carlo approaches, Statist. Methods Med. Res, 27 (2018), pp. 1956–1967. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [12].Li R, Pei S, Chen B, Song Y, Zhang T, Yang W, and Shaman J, Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-CoV-2), Science, 368 (2020), pp. 489–493. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [13].Lipsitch M, Cohen T, Cooper B, Robins J, Ma S, James L, Gopalakrishna G, et al. Transmission dynamics and control of severe acute respiratory syndrome, Science, 300 (2003), pp. 1966–1970. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [14].Lloyd A, Sensitivity of model-based epidemiological parameter estimation to model assumptions, in Mathematical and Statistical Estimation Approaches in Epidemiology, Springer, 2009, pp. 123–141. [Google Scholar]
- [15].Margaria G, Riccomagno E, and White LJ, Structural identifiability analysis of some highly structured families of statespace models using differential algebra, J. Math. Biol, 49 (2004), pp. 433–454. [DOI] [PubMed] [Google Scholar]
- [16].Miao H, Xia X, Perelson AS, and Wu H, On identifiability of nonlinear ODE models and applications in viral dynamics, SIAM Rev., 53 (2011), pp. 3–39, 10.1137/090757009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [17].Nelder J. and Mead R, A simplex method for function minimization, Comput. J, 7 (1965), pp. 308–313. [Google Scholar]
- [18].Pei S, Kandula S, and Shaman J, Differential Effects of Intervention Timing on COVID- 19 Spread in the United States, preprint, medRxiv, 2020. [DOI] [PMC free article] [PubMed]
- [19].Raue A, Karlsson J, Saccomani MP, Jirstrand M, and Timmer J, Comparison of approaches for parameter identifiability analysis of biological systems, Bioinformatics, 30 (2014), pp. 1440–1448, 10.1093/bioinformatics/btu006. [DOI] [PubMed] [Google Scholar]
- [20].Raue A, Kreutz C, Maiwald T, Bachmann J, Schilling M, Klingm uller U, and Timmer J, Structural and practical identifiability analysis of partially observed dynamical models by exploiting the profile likelihood, Bioinformatics, 25 (2009), pp. 1923–1929, 10.1093/bioinformatics/btp358. [DOI] [PubMed] [Google Scholar]
- [21].Roosa K. and Chowell G, Assessing parameter identifiability in compartmental dynamic models using a computational approach: Application to infectious disease transmission models, Theoret. Biol. Med. Model, 16 (2019), 1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [22].Schiff SJ, Neural Control Engineering: The Emerging Intersection Between Control Theory and Neuroscience, Computational neuroscience, MIT Press, 2011, 10.7551/mitpress/8436.001.0001. [DOI] [Google Scholar]
- [23].Simon D, Optimal State Estimation: Kalman, H Infinity, and Nonlinear Approaches, John Wiley & Sons, 2006. [Google Scholar]
- [24].Villaverde A. and Banga J, Dynamical compensation and structural identifiability of biological models: Analysis, implications, and reconciliation, PLoS Comput. Biol, 13 (2017), e1005878. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [25].Villaverde A, Tsiantis N, and Banga J, Full observability and estimation of unknown inputs, states and parameters of nonlinear biological models, J. R. Soc. Interface, 16 (2019), 20190043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [26].Villaverde AF, Barreiro A, and Papachristodoulou A, Structural identifiability of dynamic systems biology models, PLoS Comput. Biol, 12 (2016), e1005153. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [27].Walter E. and Lecourtier Y, Unidentifiable compartmental models: What to do?, Math. Biosci, 56 (1981), pp. 1–25. [Google Scholar]