Abstract
State‐space models (SSMs) are a popular tool for modeling animal abundances. Inference difficulties for simple linear SSMs are well known, particularly in relation to simultaneous estimation of process and observation variances. Several remedies to overcome estimation problems have been studied for relatively simple SSMs, but whether these challenges and proposed remedies apply for nonlinear stage‐structured SSMs, an important class of ecological models, is less well understood. Here we identify improvements for inference about nonlinear stage‐structured SSMs fit with biased sequential life stage data. Theoretical analyses indicate parameter identifiability requires covariates in the state processes. Simulation studies show that plugging in externally estimated observation variances, as opposed to jointly estimating them with other parameters, reduces bias and standard error of estimates. In contrast to previous results for simple linear SSMs, strong confounding between jointly estimated process and observation variance parameters was not found in the models explored here. However, when observation variance was also estimated in the motivating case study, the resulting process variance estimates were implausibly low (near‐zero). As SSMs are used in increasingly complex ways, understanding when inference can be expected to be successful, and what aids it, becomes more important. Our study illustrates (a) the need for relevant process covariates and (b) the benefits of using externally estimated observation variances for inference about nonlinear stage‐structured SSMs.
Keywords: Bayesian hierarchical models, data integration, delta smelt, Hypomesus transpacificus, parameter identifiability, San Francisco Estuary
1. INTRODUCTION
Understanding what determines changes in animal abundance through time is a basic question of population ecology and natural resource management. Estimates of recruitment and survival can facilitate this, quantifying where, when, and how populations respond to endogenous (density‐dependent) and exogeneous (external, density‐independent) factors (Turchin, 2003). However, fitting models can be difficult in practice because field‐based estimates of animal population abundances can be incorrect due to sample randomness as well as systematic biases (Staudenmayer and Buonaccorsi, 2006). Although population models addressing observation error due to sampling variability are increasingly common, the inclusion of observation bias has been much less studied because most analyses use annual time step models of data based on a single survey and the bias is assumed to cancel out. Relative biases can be especially problematic when abundances from different surveys are integrated into a single population model.
State‐space models (SSMs) offer an attractive framework to integrate disparate datasets with potentially different biases. SSMs are proving to be important population modeling tools (Newman et al., 2014) because they allow separate description of process and observation time series. The state process time series consists of unobserved values that may be viewed as an underlying description of the true, alternatively latent or hidden (Newman et al., 2006), state of a dynamic system. The observation time series consists of measurements on the state process. The explicit separation of process variation from observation error allows the flexibility needed to address survey specific errors.
One problem long recognized with some SSMs, especially the special case of normal dynamic linear models (NDLMs; West and Harrison, 2006), is the difficulty of simultaneously estimating both process variance and observation variance (Dennis et al., 2006; Knape, 2008; Knape et al., 2011; Auger‐Méthé et al., 2016). There are similar difficulties in jointly estimating process variability and observation noise variances in nonlinear population dynamics process models (de Valpine and Hilborn, 2005).
Several remedies for SSM inference have been explored. One is to formulate the SSM such that the parameters of the process variance are also parameters for the mean(s) of the state process. Examples of such parameterizations with NDLMs include Newman (1998) who used normal approximations for binomial and multinomial distributions where the expectations and variances matched those distributions, and Besbeas et al. (2002) who approximated Poisson and binomial distributions with the corresponding means and variances. A second remedy is to collect replicate observations that allow separation of observation variance from process variance (Dennis et al., 2010). A third remedy is to avoid trying to estimate the observation variance altogether by inserting external estimates of observation variance when such estimates are available (Knape et al., 2013), for example, through mark‐recapture sampling.
With few exceptions, for example, Knape et al. (2013), most assessments of SSM identifiability problems have been based on NDLMs and often for situations where the observations are annual and based on one survey type. In this paper, we examine the estimation problem of an SSM whose state process is stage‐structured, nonlinear, and non‐Gaussian, and whose observations have stage‐specific relative biases. Stage‐structured population models are an important tool to draw inference about factors affecting recruitment and survival rates (Caswell, 2001). Although some examples of their application embedded within an SSM inferential framework exist (de Valpine, 2003; de Valpine and Rosenheim, 2008), the effects of both noisy and biased data on inference for nonlinear SSMs are relatively poorly understood. Additionally, in practice, ecologists are equally concerned with estimation of coefficients relating predictor variables to vital rates, also a topic of little focus compared to historical emphasis on estimation of variance parameters and latent states.
The rest of the paper is structured as follows. In Section 2, we develop a stage‐structured SSM that contains stochastic vital rates with covariate dependency and an observation model with bias terms for some life stages. We also describe several alternate formulations for the observation model depending on whether external estimates of observation variances are available. Section 3 provides a theoretical analysis of the identifiability of the parameters in this model while Section 4 presents a simulation study investigating parameter inference in practice. Attention is given to how estimation properties differ between models that fix observation error variance using externally derived values and models that internally estimate observation error variance. Section 5 presents a case study on the fish species delta smelt (Hypomesus transpacificus) to illustrate the utility of such models and highlight practical issues that arise when fitting them. Discussion is in Section 6.
2. MODEL DESCRIPTION
Throughout we will parameterize the lognormal distribution using , where μ and σ2 are the log‐mean and log‐variance parameters, respectively. Similarly, logit‐normal distributions will be described using , where μ and σ2 are the location and squared‐scale parameters. The distribution designation of LogitNormal means that the logit transformed survival probabilities are normally distributed.
2.1. State process model
Assume a population can be partitioned into different life stages where denotes the true abundance of life stage s of time (or cohort) t. In particular, we consider a fish population that has four life stages: post‐larvae, juveniles, sub‐adults, and adults. Given an initial abundance of reproducing adults in cohort , the state process update equations are
| (1) |
| (2) |
| (3) |
| (4) |
where is time‐specific recruitment and are the life stage and time‐specific survival probabilities. Assuming (environmental) stochasticity in four processes, the vital rate models for recruitment and the three survival probabilities are
| (5) |
| (6) |
| (7) |
| (8) |
where is a vector of regression coefficients corresponding to a vector of recruitment predictor variables , and is the recruitment process variance on the log scale. The terms in the survival functions, Equations (6)‐(8), are defined analogously but with possibly different dimensions reflecting life stage‐specific numbers of covariates used in survival predictions.
2.2. Observation model
Observation error in the abundance estimates can include both bias, such that , and sampling variance (Staudenmayer and Buonaccorsi, 2006). Different choices of the conditional distribution for can be made, which in turn can affect inference (Knape et al., 2011). We chose a lognormal distribution because it ensures strictly positive abundance indices (especially useful when abundance indices are near zero), it is perhaps the most common assumption (although more often in terms of a normal distribution when working with log abundances), and because earlier work (Polansky et al., 2019) suggests it is to be preferred over a normal distribution for the case study model described in Section 5.
Due to nonidentifiability problems, we do not consider estimating biases that are both life stage and cohort specific because in that case the parameters would confound with the latent abundance estimates. Motivated by the case study in Section 5, we assume juvenile and sub‐adult abundance indices are biased by factors and , and we use the following parameterizations for the observation model equations:
| (9) |
| (10) |
| (11) |
| (12) |
With these parameterizations, the expected index is
| (13) |
and the variance is
| (14) |
Irrespective of whether a bias factor is included, the coefficient of variation (CV) is
| (15) |
from which it follows that
| (16) |
Plugging the right‐hand side of Equation (16) into Equations (9)‐(12), the observation model equations can be written as
| (17) |
where if there is no bias term present in a given life stage or survey specific observation model.
2.3. Observation model formulations
There are three scenarios for how the observation model equations can be formulated. These scenarios are presented in Table 1. Scenarios 1 and 2 depend on the availability of estimates of uncertainty (either coefficients of variation or variances) for that are derived externally to the SSM. In Scenario 3, observation variance is estimated within the SSM. The choice between Scenarios 1 and 2 may depend on whether estimated coefficients of variation or estimated variances are closer to the true values.
TABLE 1.
Different scenarios for how observation model equations can be formulated, including cases where external estimates of observation error variance are available
| Scenario | Equation and description | |
|---|---|---|
| 1 |
|
|
| Use the external coefficient of variation estimates within Equation (17). | ||
| 2 |
|
|
| Use the external abundance estimate variances but use the bias corrected latent abundance value to obtain the term in Equation (17). | ||
| 3 |
|
|
| Do not use external information about the variances of , and instead internally estimate observation variance along with the other parameters and latent states of the SSM. In this case, it is not feasible to estimate both life stage and cohort specific values, but life stage‐specific (and possibly survey method specific if multiple surveys are used to monitor a single life stage) parameters may be estimable. The observation model equations are similar to Equations (9)–(12) but with a life stage s specific observation log‐variance parameter. |
Note. The bias terms are included for generality with the understanding that if there is no bias term present in the life stage or survey specific observation model.
3. THEORETICAL IDENTIFIABILITY
We applied methods developed by Cole and McCrea (2016) to evaluate the theoretical identifiability of the parameters in the SSM described in Section 2 under various scenarios involving fixed or estimated observation error CV and the inclusion or exclusion of covariates in the state process model. An exhaustive summary vector based on (approximate) expected values and variances of the observations was derived assuming each state process had an intercept and either no covariates or one covariate. The derivative matrix, formed by taking partial derivatives of this vector with respect to the parameters of the model, was computed using Maple 17 (Maple, 2013) and code modified from the appendices of Cole and McCrea (2016). If the rank of the derivative matrix, r, equals the number of parameters, p, then all the model parameters are separately identifiable. If , then r parameters or parameter combinations are identifiable, and Maple code from Cole and McCrea (2016) specifies which single parameters and combinations of parameters are identifiable.
One analysis examined the identifiability of parameters in the SSM defined by Equations (1)–(12) assuming a single covariate was used in the modeling of each of the four state processes. This model has 18 parameters: eight state process model parameters determining expected values (ζ0, ζ1, β0, β1, η0, η1, γ0, γ1), two observation bias parameters (, ), four process noise parameters (, , ), and four observation variance parameters assuming that these parameters were time‐invariant (, , , ). Crucially, the initial state component, , was viewed as a known parameter, which seemed reasonable given the assumption of unbiased estimates of the abundance of adults in the year prior to the start of the time series. An exhaustive summary vector of length 20 (which proved to be of sufficient length) was constructed based on first‐order approximations of the expected values and variances of the observations. A subset of the vector including examples of the expectations and variances is
| (18) |
| (19) |
| (20) |
| (21) |
The matrix of first derivatives of the exhaustive summary vector with respect to each of the parameters had a full rank of 18, thus all 18 parameters were identifiable. Further details on the exhaustive summary calculations for this SSM are provided in Web Appendix A.
A second set of analyses examined the effect of state process covariates on identifiability. This was largely motivated by the fact that observations for two of the life stages in the case study (Section 5) were biased. The potential for nonidentifiability can be seen in the case of a single life stage auto‐regressive process model fit with biased estimates. For example, given ≈ and n 1 ≈ , then ≈ and difficulty in separating ψ and ζ0 is apparent. The case for multiple life stages, where unbiased estimates were available for some life stages, is more complicated and the effect of including covariates for different processes in the state process model was of interest. Focusing solely on estimability of mean parameters in the state and observation models, the effects on identifiability of a single covariate being present or absent in each of the four process models (24 = 16 combinations) were examined. In the case where the derivative matrix is not full rank, additional Maple code developed by Cole and McCrea (2016) determines which parameters or combinations of parameters are identifiable.
The results of this second set of analyses are shown in Table 2. In the most limited setting where no covariates are used to model the process dynamics (case 1), there are six parameters, but only four parameter combinations can be estimated, and the only separately identifiable parameter is the intercept, ζ0, for the process model dynamics. The identifiability of ζ0 is readily seen given is assumed known and is unbiased for : ≈ . An example of an identifiable combination, also from case 1, is , which shows that the bias in estimates of juvenile abundance cannot be separated from post‐larval survival. When a covariate is used for a given process, the intercept and slope parameters for that process are always identifiable; for example, in case 3 a covariate for post‐larval survival is included and β0 and β1 are separately identifiable. When at least two survival process models have covariates (cases 9‐16) all the parameters are individually identifiable. Whether a bias parameter is identifiable is a function of the inclusion of covariates and which life stage abundance observation is biased. For example, parameter (but not ) was identifiable when a covariate for the survival of post‐larvae to the juvenile stage was included (case 3), while was identifiable when a covariate for the survival of sub‐adults to the adult stage was included (case 5). If covariates were available for all four processes, and observations for all four life stages were biased, that is, there were also and , say, all the parameters were identifiable (cases 17‐18).
TABLE 2.
Summary of parameter identifiability results for state process mean parameters and observation bias parameters conditional on the inclusion or exclusion of covariates in a given set of state processes
| Case | State processes with covariates | Parameters to be estimated (total) | Number of identifiable parameter combinations | Identifiable singleton parameters | ||
|---|---|---|---|---|---|---|
| 1 | None | {} (6) | 4 | ζ0 | ||
| 2 | ρ | {} (7) | 5 |
|
||
| 3 |
|
{} (7) | 6 |
|
||
| 4 |
|
{} (7) | 6 |
|
||
| 5 |
|
{} (7) | 6 |
|
||
| 6 |
|
{} (8) | 7 |
|
||
| 7 |
|
{} (8) | 7 |
|
||
| 8 |
|
{} (8) | 7 |
|
||
| 9 |
|
{} (8) | 8 | All | ||
| 10 |
|
{} (8) | 8 | All | ||
| 11 |
|
{} (8) | 8 | All | ||
| 12 |
|
{} (9) | 9 | All | ||
| 13 |
|
{} (9) | 9 | All | ||
| 14 |
|
{} (9) | 9 | All | ||
| 15 |
|
{} (9) | 9 | All | ||
| 16 |
|
{} (10) | 10 | All | ||
| 17 |
|
{} (12) | 12 | All | ||
| 18 |
|
{} (11) | 11 | All |
Note. Bias parameter identifiability depends on covariate inclusion and where in the sequence of observations they occur. With juvenile and sub‐adult biases only (cases 1‐16), inclusion of a process covariate allows identifiability of the corresponding intercept and slope parameters (cases 2‐16) and all parameters are identifiable if at least two survival process models have covariates (cases 9‐16). With biases in all life stages and covariates in all processes, all parameters are identifiable (cases 17‐18).
These theoretical calculations are based on a frequentist model formulation, while in Sections 4 and 5 we apply Bayesian methods to fit the SSM. Nonidentifiability in a frequentist context implies nonidentifiability in a Bayesian context with uninformative priors, although it is possible for informative priors to help alleviate this issue (Cole and McCrea, 2016). Scenarios 1 and 2 in Table 1 assume a prior probability of 1 for the externally estimated observation variance values.
4. SIMULATION STUDY
4.1. Design
Although we identified conditions under which parameter identifiability is ensured theoretically, data‐specific features can result in practically nonidentifiable, or nearly redundant, parameters, sometimes identified by flat profile likelihoods and infinitely large confidence intervals in a frequentist context (Raue et al., 2009). We therefore used simulations to explore practical identifiability and to explore the potential benefits of fixing external estimates of as in Scenario 1 (Table 1). Datasets were generated in R (R Core Team, 2019) according to Equations (1)‐(12) with 20 cohorts. Motivated by the case study (Section 5), the post‐larval and adult life stages were assumed to have no observation bias () while the juvenile and sub‐adult life stages had observation biases less than one. Each dataset was used to fit two models, one with fixed at externally derived estimates (Scenario 1) and one with internally estimated as part of the model (Scenario 3). In the first model, the fixed values were assumed to be potentially imperfect according to the distribution , where is the true value and .
We ran 100 simulations each for , in which case is equal to the true value , and for , which represents a more realistic case where is estimated imperfectly. True values of were generated from a Uniform(0.1, 1) distribution. The recruitment and three survival processes were each functions of single covariates. To incorporate the concept of model selection in the study along with parameter estimation, we included two potential covariates per process (the true covariate used to generate data and a second covariate) in the fitted models. The true values used to generate data and the prior distributions used for model fitting are described in Web Appendix B (Table B.1). Model fitting here and in Section 5 used Bayesian methods implemented within R (R Core Team, 2019) using JAGS v4.3.0 (Plummer, 2003; 2016; Su and Yajima, 2015). Model performance was evaluated by calculating marginal posterior summary statistics as well as relative bias, that is, (posteriorMean ‐ trueValue)/trueValue. Model convergence was assessed by examining trace plots for adequate mixing and calculating Gelman‐Rubin statistics.
4.2. Results
Posterior means of the vital rate coefficients, process variance, and observation bias parameters, averaged across simulations, were similar whether observation error was externally or internally estimated and whether the level of noise in the external estimates was low () or high () (Table 3). On average, observation bias parameters were well estimated, while recruitment process variance exhibited the highest level of relative bias (Table 3). Internally estimated observation CV was also generally biased high relative to the mean true observation CV for each life stage (Figure B.1). Plugging in external estimates of observation CV resulted in lower average posterior standard deviations and relative biases, as well as reduced diffusivity in joint posteriors (Figure B.2). Latent abundance posterior means were similarly well estimated regardless of how observation CV was handled, although plugging in external observation CV estimates can lead to smaller latent abundance standard deviations (Figure B.3). The findings presented here for the case with are qualitatively similar to those for the case with .
TABLE 3.
Summary of simulation study parameter estimates
| (a) Simulation with . | (b) Simulation with . | ||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| True | Mean | SD | Rel bias | Mean | SD | Rel bias | |||||||||||||||||||
| Parameter | value |
|
|
|
|
|
|
|
|
|
|
|
|
||||||||||||
| ζ0 | 1.00 | 1.02 | 1.07 | 0.12 | 0.18 | 0.02 | 0.07 | 1.03 | 1.08 | 0.11 | 0.18 | 0.03 | 0.08 | ||||||||||||
| ζ1 | 1.00 | 0.98 | 0.97 | 0.09 | 0.12 | −0.02 | −0.03 | 0.99 | 1.00 | 0.10 | 0.12 | −0.01 | 0.00 | ||||||||||||
| ζ2 | 0.00 | 0.00 | 0.01 | 0.09 | 0.12 | NA | NA | 0.01 | 0.00 | 0.09 | 0.12 | NA | NA | ||||||||||||
| β0 | 1.30 | 1.33 | 1.28 | 0.60 | 0.67 | 0.02 | −0.01 | 1.34 | 1.27 | 0.60 | 0.68 | 0.03 | −0.02 | ||||||||||||
| β1 | 1.00 | 0.82 | 0.70 | 0.39 | 0.45 | −0.18 | −0.30 | 0.78 | 0.69 | 0.40 | 0.45 | −0.22 | −0.31 | ||||||||||||
| β2 | 0.00 | 0.01 | 0.00 | 0.36 | 0.43 | NA | NA | 0.00 | −0.02 | 0.37 | 0.43 | NA | NA | ||||||||||||
| η0 | 1.30 | 1.34 | 1.32 | 0.60 | 0.67 | 0.03 | 0.01 | 1.32 | 1.28 | 0.60 | 0.66 | 0.02 | −0.01 | ||||||||||||
| η1 | 1.00 | 0.80 | 0.68 | 0.40 | 0.46 | −0.20 | −0.32 | 0.82 | 0.71 | 0.40 | 0.45 | −0.18 | −0.29 | ||||||||||||
| η2 | 0.00 | −0.01 | 0.02 | 0.36 | 0.43 | NA | NA | −0.05 | −0.04 | 0.36 | 0.43 | NA | NA | ||||||||||||
| γ0 | 1.30 | 1.40 | 1.37 | 0.59 | 0.66 | 0.08 | 0.05 | 1.46 | 1.43 | 0.60 | 0.67 | 0.12 | 0.10 | ||||||||||||
| γ1 | 1.00 | 0.85 | 0.76 | 0.40 | 0.46 | −0.15 | −0.24 | 0.75 | 0.66 | 0.42 | 0.47 | −0.25 | −0.34 | ||||||||||||
| γ2 | 0.00 | −0.01 | −0.05 | 0.37 | 0.43 | NA | NA | 0.02 | −0.01 | 0.39 | 0.45 | NA | NA | ||||||||||||
|
|
0.05 | 0.11 | 0.15 | 0.09 | 0.12 | 1.27 | 1.93 | 0.15 | 0.15 | 0.10 | 0.11 | 2.04 | 2.01 | ||||||||||||
|
|
0.50 | 0.48 | 0.53 | 0.36 | 0.43 | −0.04 | 0.06 | 0.58 | 0.56 | 0.41 | 0.44 | 0.16 | 0.12 | ||||||||||||
|
|
0.50 | 0.52 | 0.58 | 0.38 | 0.45 | 0.03 | 0.16 | 0.53 | 0.55 | 0.39 | 0.43 | 0.07 | 0.10 | ||||||||||||
|
|
0.50 | 0.51 | 0.55 | 0.38 | 0.44 | 0.01 | 0.10 | 0.59 | 0.61 | 0.42 | 0.46 | 0.18 | 0.21 | ||||||||||||
|
|
0.50 | 0.52 | 0.55 | 0.08 | 0.15 | 0.04 | 0.10 | 0.51 | 0.53 | 0.08 | 0.14 | 0.03 | 0.07 | ||||||||||||
|
|
0.20 | 0.21 | 0.23 | 0.03 | 0.06 | 0.05 | 0.13 | 0.21 | 0.23 | 0.03 | 0.06 | 0.05 | 0.15 | ||||||||||||
Note. Average posterior mean (Mean), posterior standard deviation (SD), and relative bias (Rel Bias) were calculated across simulations for the cases with and . Externally estimated CV equals true CV when , and randomly varies from 50% to 150% of the true CV when . and represent the models with externally and internally estimated observation error, respectively. Relative biases for the coefficients ψ2, β2, η2, and γ2 are undefined because the true values are zero.
5. CASE STUDY
We used the model described in Section 2 to quantify the relative importance of salient factors posited to determine the population dynamics of delta smelt (Hypomesus transpacificus). Delta smelt are a small, nearly annual pelagic fish endemic to the interior “Delta” portion of the San Francisco Estuary (Moyle et al., 1992; 2016). Spawning typically takes place during the winter and spring, with offspring maturing through a number of life stages to eventually become spawning adults by winter of the next calendar year.
Abundance indices and coefficients of variation developed in Polansky et al. (2019) were used as observations on the abundances of each life stage starting with the 1994 birth cohort adult abundance index and including the subsequent 21 cohorts. Based on the abundance index construction work in Polansky et al. (2019), we assumed post‐larval indices, and adult indices for , were unbiased relative to an overall unknown scaling factor, that is, and for . Juvenile and sub‐adult observation biases, and , respectively, were estimated. Because the sampling method used to collect data about adult abundances prior to 2002 was the same as that used for sub‐adult data collection, and indices include length based corrections, we set for . Covariate data used to model recruitment and survival consisted of a collection of abiotic habitat condition metrics, abundance indices of bottom up and top down trophic drivers, and competitors summarized in Web Appendix C Table C.1.
The steps of assembling covariate sets to model each vital rate and the complete set of parameter posterior results are described in detail in Web Appendix C. Model validation was done in several ways, including graphical posterior checks of response residuals (Gelman and Shalizi, 2013), and q‐q plots of one step ahead forecast residuals (Smith, 1985).
Two “global” models with the same sets of covariates for each vital rate were considered: one that used external estimates of observation error CV (Scenario 1), and one that used internally estimated observation error CV (Scenario 3). Posterior summaries are provided in Table C.4. Process noise variance estimates in the model fit using Scenario 1 were higher than those estimated under Scenario 3, while the internally estimated observation coefficients of variation were generally larger than the externally derived ones (Figure C.3). As predicted by the simulation study, Scenario 1 posterior standard deviations of latent abundances and observation bias were smaller compared to those estimated under Scenario 3 (Table C.4 and Figure C.4). Posterior distributions of the bias parameters in both cases showed them to be considerably smaller than one. The model from Scenario 2 was not applied because it is similar to the model from Scenario 1, and the external estimates of abundance variance, , were expected to be as informative as the external estimates of abundance CV, .
We found a number of covariates with more support than others for each vital rate (Table C.4). Here we present a subset of these results to illustrate the most important vital rate predictions. Using the results from the global model fit with external estimates of observation error CV plugged in, and selecting (somewhat arbitrarily) a 0.80 value as the lower limit for which evidence, the posterior distribution probability that the coefficient is above (below) zero when the expected effect of a covariate is positive (negative), is considered substantial enough to report on here, the following relationships were observed: (a) recruitment was most influenced by temperature, the approximate location of the 2‐ppt isohaline during the previous fall, and adult food (note also the export‐inflow ratio had high evidence of support based on the models summarized in Table C.2); (b) post‐larval survival by outflow and turbidity; (c) juvenile survival by turbidity (Secchi depth) and temperature; and (d) sub‐adult survival by turbidity in the south Delta (south Secchi depth), a spatially localized hydrodynamics flow measure in the Old and Middle River corridor (OMR), and adult striped bass (Morone saxatilis). Of the predator/competitor indices considered here, only the effect of summer inland silverside (Menidia beryllina) abundance on post‐larval survival, and juvenile and adult striped bass on sub‐adult survival, had biologically plausible negative estimated effects.
To illustrate vital rate predictions, a model including only the covariates with the highest posterior evidence for each vital rate was constructed. Also included was an interaction between the two covariates that most impact sub‐adult survival, both of which had near one evidence of support. There is little difference between the distribution of vital rate predictions when parameter estimate uncertainty is not included, and the interquartile prediction range is considerably more bounded than the 95% prediction interval (Figure 1). Increases in sub‐adult survival with decreases in turbidity (increases in south Secchi) become more pronounced as OMR decreases (Figure 1).
FIGURE 1.

Predicted vital rates for the case study example model. In all panels the solid curved lines show expected values. In panels (a)‐(c), dark and light gray shadings show the central credible intervals for and , respectively, and include posterior parameter estimate uncertainty. The dashed and dotted lines show the 50% and 95%, respectively, central credible intervals using the mean values of the posterior. Ratios of abundance indices are shown with the last two digits of the cohort year, adjusted by the inverse of the posterior means of observation error bias when relevant. Covariate units are mean daily values over the time interval of each vital rate [Correction added on 8 October 2020, after first online publication: Author corrected labels and units]
6. DISCUSSION
Identifiability issues for SSMs fit to ecological datasets have primarily concentrated on normal dynamic linear models (de Valpine and Hilborn (2005) and de Valpine and Rosenheim (2008) provide some exceptions), with specific focus on accurately distinguishing the magnitudes of process and observation variances (Dennis et al., 2006, 2010; Knape et al., 2011; Auger‐Méthé et al., 2016), and difficulties in accurately assessing density dependence in SSMs for population dynamics (Freckleton et al., 2006; Knape, 2008). The effects of covariates and the effects of external estimates of observation variance on identifiability, particularly for nonlinear and non‐Gaussian SSMs, have not received as much attention.
For the nonlinear non‐Gaussian SSMs examined here, the utility of covariates for enabling identifiability, particularly in the case of biased observations, was clear. Without covariates, but with bias in some of the observations, nonidentifiability occurs. In some cases, one can use method of moments to determine identifiability by setting the observations to their expected values and solving the resulting system of equations. However, the algebra involved in this approach can be extremely difficult and determining what parameter combinations are identifiable can also be challenging. The recently developed methods of Cole and McCrea (2016) are attractive and elegant tools for analytically assessing parameter identifiability for SSMs in a far less algebraically tedious manner, indicating both how many parameter combinations are identifiable as well as what they are. These methods deserve routine application both to guide SSM formulation as well as for after‐the‐fact assessment. A cautionary note, however, is that theoretical identifiability does not rule out practical nonidentifiability for a given dataset (Raue et al., 2009). Simulating datasets from a hypothesized SSM and then examining the ability to estimate the known parameters can be a helpful exercise for identifying practical estimation difficulties and data specific modeling challenges. We note that data cloning (Lele et al., 2010), while not applied here, is an alternative method of determining estimability of model parameters and functions of parameters that relies only on the observed data.
Based on the simulation study, the improvement in inference when observation error CV was externally estimated (Scenario 1) compared to when it was internally estimated (Scenario 3) was relatively minor, given the presence of modeled covariate effects. The primary advantage of using external estimates was an increase in precision, particularly for the observation bias and state estimates. As expected, these gains decrease when the external estimates of observation variance are themselves measured with error. Case study results based on these two scenarios mostly mirror the simulation study findings, particularly with respect to inference about latent states and observation bias parameters. However, an important difference is that when observation error CV was estimated internally, estimation difficulties appeared to be greatly exacerbated in general, with complex joint posterior distributions containing multiple modes and ridges, particularly in the process variance and intercept parameter dimensions. One consequence was that process variance estimates were unrealistically low. The exact aspect of the case study dataset responsible for the low process noise estimates is not clear. A longer time series may help separate process variance from observation variance. However, process variance estimates were not unrealistically low in the simulation study when only 20 cohorts were used, suggesting that unidentified sources of bias or uncertainty (or both) remain in the case study when observation error CV is estimated.
A practical concern when modeling empirical data is that relationships between covariates and response variables may appear weaker than they are in reality because of noise in the covariate data, also known as error‐in‐variables (Carroll et al., 2006), or because the covariate data were summarized in a nonoptimal way (Ferguson et al., 2017). Exploratory simulations indicated that covariate noise affects estimated vital rates in the SSM presented here, making this an aspect of the model that requires further development. An estimation issue that seems particularly challenging given the model framework considered here is that as covariate effect size diminishes, nonidentifiability issues can emerge. Including covariates does not guarantee that straightforward application of computational methods will be sufficient for estimation.
The delta smelt population modeling presented here integrated data from more surveys than has been done previously. Although the sequential life stage model required addressing relative bias in abundance estimates, this model framework allowed new insights about drivers of population abundances. For example, we found flow related impacts on summer survival and lagged fall flow effects on recruitment, whereas prior analyses by Feyrer et al. (2011) were unable to precisely identify where in the life cycle the flow effects on population dynamics occur. Extending the findings by Grimaldo et al. (2009), who found that OMR and turbidity predict an index of south Delta mortality, we found these spatially localized predictor variables interact and predict population wide sub‐adult survival. A number of predator/competitor relationships with delta smelt vital rates were biologically implausible, suggesting that some cohabitant species are more influenced in the contemporary Delta by shared habitat conditions than inter‐specific interactions.
Auger‐Méthé et al. (2016) remarked that SSMs are “becoming the favoured statistical framework [in ecology] for modelling animal movement and population dynamics.” The state‐process equations describing realistic population models are often nonlinear, non‐Gaussian, and link multiple life stages. Because SSMs allow integration of multiple datasets, they are an important tool for advancing population modeling in practice. Here we focused on a particular kind of population in which different age classes are not concurrently observed. More general populations with multiple age classes observed at the same time such as those as described by matrix models can encompass a wide variety of life history strategies (Caswell, 2001), and the number of possible configurations where biases and covariates appear can be quite numerous in general. Mapping out the theoretical and practical identifiability requirements for a given SSM remains an ongoing topic of research.
Supporting information
Web Appendices, Tables, and Figures referenced in Sections 3, 4, and 5, Maple code to reproduce the theoretical identifiability assessments, and JAGS code to reproduce the results of the case study are available with this paper at the Biometrics website on Wiley Online Library.
ACKNOWLEDGMENTS
The California Department of Water Resources, the U.S. Bureau of Reclamation, and the Interagency Ecological Program (IEP) for the San Francisco Estuary provided funding for this work. The viewpoints expressed are those of the authors and do not necessarily reflect the opinions of the U.S. Department of the Interior, the U.S. Fish and Wildlife Service, or the other member agencies of the IEP. We thank William Bennett, Wim Kimmerer, Pete Smith, Perry de Valpine, David Hankin, and Diana Cole for helpful discussions. Several editors and a referee provided helpful comments on an earlier version of this manuscript.
Polansky L, Newman KB, Mitchell L. Improving inference for nonlinear state‐space models of animal population dynamics given biased sequential life stage data. Biometrics. 2021;77:352–361. 10.1111/biom.13267
Contributor Information
Leo Polansky, Email: leo_polansky@fws.gov.
Ken B. Newman, Email: ken.newman@bioss.ac.uk.
Lara Mitchell, Email: lara_mitchell@fws.gov.
DATA AVAILABILITY STATEMENT
The data that support the findings in this paper are available in the Supporting Information of this article.
REFERENCES
- Auger‐Méthé, M. , Field, C. , Albertsen, C.M. , Derocher, A.E. , Lewis, M.A. , Jonsen, I.D. , et al. (2016) State‐space models' dirty little secrets: even simple linear Gaussian models can have estimation problems. Scientific Reports, 6, 26677. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Besbeas, P. , Freeman, S.N. , Morgan, B.J.T. and Catchpole, E.A. (2002) Integrating mark–recapture–recovery and census data to estimate animal abundance and demographic parameters. Biometrics, 58, 540–547. [DOI] [PubMed] [Google Scholar]
- Carroll, R.J. , Ruppert, D. , Stefanski, L.A. and Crainiceanu, C.M. (2006) Measurement Error in Nonlinear Models: A Modern Perspective. Boca Raton, FL: Chapman & Hall/CRC. [Google Scholar]
- Caswell, H. (2001) Matrix Population Models: Construction, Analysis, and Interpretation, 2nd edition. Sunderland, MA: Sinauer Associates, Inc. Publishers. [Google Scholar]
- Cole, D.J. and McCrea, R.S. (2016) Parameter redundancy in discrete state‐space and integrated models. Biometrical Journal, 58, 1071–1090. [DOI] [PMC free article] [PubMed] [Google Scholar]
- de Valpine, P. (2003) Better inferences from population dynamics experiments using Monte Carlo state‐space likelihood methods. Ecology, 84, 3064–3077. [Google Scholar]
- de Valpine, P. and Hilborn, R. (2005) State‐space likelihoods for nonlinear fisheries time‐series. Canadian Journal of Fisheries and Aquatic Sciences, 62, 1937–1952. [Google Scholar]
- de Valpine, P. and Rosenheim, J.A. (2008) Field‐scale roles of density, temperature, nitrogen, and predation on aphid population dynamics. Ecology, 89, 532–541. [DOI] [PubMed] [Google Scholar]
- Dennis, B. , Ponciano, J.M. , Lele, S.R. , Taper, M.L. and Staples, D.F. (2006) Estimating density dependence, process noise, and observation error. Ecological Monographs, 76, 323–341. [Google Scholar]
- Dennis, B. , Ponciano, J.M. and Taper, M.L. (2010) Replicated sampling increases efficiency in monitoring biological populations. Ecology, 91, 610–620. [DOI] [PubMed] [Google Scholar]
- Ferguson, J.M. , Reichert, B.E. , Fletcher Jr., R.J. and Jager, H.I. (2017) Detecting population‐environmental interactions with mismatched time series data. Ecology, 98, 2813–2822. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Feyrer, F. , Newman, K. , Nobriga, M. and Sommer, T. (2011) Modeling the effects of future outflow on the abiotic habitat of an imperiled estuarine fish. Estuaries and Coasts, 34, 120–128. [Google Scholar]
- Freckleton, R.P. , Watkinson, A.R. , Green, R.E. and Sutherland, W.J. (2006) Census error and the detection of density dependence. Journal of Animal Ecology, 75, 837–851. [DOI] [PubMed] [Google Scholar]
- Gelman, A. and Shalizi, C.R. (2013) Philosophy and practice of Bayesian statistics. British Journal of Mathematical and Statistical Psychology, 66, 8–38. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Grimaldo, L.F. , Sommer, T. , Van Ark, N. , Jones, G. , Holland, E. , Moyle, P.B. , et al. (2009) Factors affecting fish entrainment into massive water diversions in a tidal freshwater estuary: can fish losses be managed? North American Journal of Fisheries Management, 29, 253–1270. [Google Scholar]
- Knape, J. (2008) Estimability of density dependence in models of time series data. Ecology, 89, 2994–3000. [DOI] [PubMed] [Google Scholar]
- Knape, J. , Besbeas, P. and de Valpine, P. (2013) Using uncertainty estimates in analyses of population time series. Ecology, 94, 2097–2107. [DOI] [PubMed] [Google Scholar]
- Knape, J. , Jonzén, N. and Sköld, M. (2011) On observation distributions for state space models of population survey data. Journal of Animal Ecology, 90, 1269–1277. [DOI] [PubMed] [Google Scholar]
- Lele, S.R. , Nadeem, K. and Schmuland, B. (2010) Estimability and likelihood inference for generalized linear mixed models using data cloning. Journal of the American Statistical Association, 105, 1617–1625. [Google Scholar]
- Maple (2013) Maple 17.02 . Maplesoft, a division of Waterloo Maple Inc., Waterloo, Ontario, Canada [Accessed 22 August 2019].
- Moyle, P.B. , Brown, L.R. , Durand, J.R. and Hobbs, J.A. (2016) Delta smelt: life history and decline of a once‐abundant species in the San Francisco Estuary. San Francisco Estuary and Watershed Science, 14, 1–30. [Google Scholar]
- Moyle, P.B. , Herbold, B. , Stevens, D.E. and Miller, L.W. (1992) Life history and status of delta smelt in the Sacramento‐San Joaquin Estuary, California. Transactions of the American Fisheries Society, 121, 67–77. [Google Scholar]
- Newman, K.B. (1998) State‐space modeling of animal movement and mortality with application to salmon. Biometrics, 54, 1290–1314. [Google Scholar]
- Newman, K.B. , Buckland, S.T. , Lindley, S.T. , Thomas, L. and Fernandez, C. (2006) Hidden process models for animal population dynamics. Ecological Applications, 16, 74–86. [DOI] [PubMed] [Google Scholar]
- Newman, K.B. , Buckland, S.T. , Morgan, B.J.T. , King, R. , Borchers, D.L. , Cole, D.J. , et al. (2014) Modelling Population Dynamics. Methods in Statistical Ecology. New York, NY: Springer New York. [Google Scholar]
- Plummer, M. (2003) JAGS: a program for analysis of Bayesian graphical models using Gibbs sampling. Available at: https://www.r‐project.org/conferences/DSC‐2003/Proceedings/Plummer.pdf (Accessed 10 September 2019).
- Plummer, M. (2016) rjags: Bayesian graphical models using MCMC . R package version 4‐6. Available at: https://CRAN.R‐project.org/package=rjags [Accessed 10 September 2019].
- Polansky, L. , Mitchell, L. and Newman, K.B. (2019) Using multistage design‐based methods to construct abundance indices and uncertainty measures for delta smelt. Transactions of the American Fisheries Society, 148, 710–724. [Google Scholar]
- R Core Team (2019) R: a language and environment for statistical computing . Available at: https://www.R‐project.org/ [Accessed 10 September 2019].
- Raue, A. , Kreutz, C. , Maiwald, T. , Bachmann, J. , Schilling, M. , Klingmüller, U. and Timmer, J. (2009) Structural and practical identifiability analysis of partially observed dynamical models by exploiting the profile likelihood. Bioinformatics, 25, 1923–1929. [DOI] [PubMed] [Google Scholar]
- Smith, J.Q. (1985) Diagnostic checks of non‐standard time series models. Journal of Forecasting, 4, 283–291. [Google Scholar]
- Staudenmayer, J. and Buonaccorsi, J.P. (2006) Measurement error in a random walk model with applications to population dynamics. Biometrics, 62, 1178–1189. [DOI] [PubMed] [Google Scholar]
- Su, Y.‐S. and Yajima, M. (2015) R2jags: using R to Run ‘JAGS' . R package version 0.5‐7. Available at: https://CRAN.R‐project.org/package=R2jags [Accessed 10 September 2019].
- Turchin, P. (2003) Complex Population Dynamics: A Theoretical/Empirical Synthesis. Princeton, NJ: Princeton University Press. [Google Scholar]
- West, M. and Harrison, J. (2006). Bayesian Forecasting and Dynamic Models. Berlin: Springer Science & Business Media. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Web Appendices, Tables, and Figures referenced in Sections 3, 4, and 5, Maple code to reproduce the theoretical identifiability assessments, and JAGS code to reproduce the results of the case study are available with this paper at the Biometrics website on Wiley Online Library.
Data Availability Statement
The data that support the findings in this paper are available in the Supporting Information of this article.
