Skip to main content
Wiley Open Access Collection logoLink to Wiley Open Access Collection
. 2014 Oct 14;71(1):237–246. doi: 10.1111/biom.12246

Computational Aspects of N-Mixture Models

Emily B Dennis 1,*, Byron JT Morgan 1, Martin S Ridout 1
PMCID: PMC4406156  PMID: 25314629

Abstract

The N-mixture model is widely used to estimate the abundance of a population in the presence of unknown detection probability from only a set of counts subject to spatial and temporal replication (Royle, 2004, Biometrics 60, 105–115). We explain and exploit the equivalence of N-mixture and multivariate Poisson and negative-binomial models, which provides powerful new approaches for fitting these models. We show that particularly when detection probability and the number of sampling occasions are small, infinite estimates of abundance can arise. We propose a sample covariance as a diagnostic for this event, and demonstrate its good performance in the Poisson case. Infinite estimates may be missed in practice, due to numerical optimization procedures terminating at arbitrarily large values. It is shown that the use of a bound, K, for an infinite summation in the N-mixture likelihood can result in underestimation of abundance, so that default values of K in computer packages should be avoided. Instead we propose a simple automatic way to choose K. The methods are illustrated by analysis of data on Hermann's tortoise Testudo hermanni.

Keywords: Abundance estimation, Method of moments, Multivariate negative binomial, Multivariate Poisson, Optimal design, Sampling, Temporal replication

1. Introduction

Estimating the abundance of a population is an important component of ecological research. N-mixture models can be used to estimate animal abundance from counts with both spatial and temporal replication whilst accounting for imperfect detection (Royle, 2004a). Whereas alternative sampling methods for obtaining estimates of abundance exist, such as capture–recapture, distance, removal and multiple-observer sampling, these may be expensive in effort or cost, or impractical for some species and scenarios. A benefit of the N-mixture model is the reasonably low comparative cost and effort required for data collection which does not require individuals to be identified. This is especially true of many citizen-science based monitoring programs.

Consequently, since development by Royle (2004a), many applications and extensions of the N-mixture model have been made. These include applications to various taxa, including birds (Kéry, Royle, and Schmid, 2005), mammals (Zellweger-Fischer, Kéry, and Pasinelli, 2011), and amphibians (Dodd and Dorazio, 2004; McIntyre et al., 2012). In addition, covariates have often been used to examine spatial patterns in abundance and detection (Kéry, 2008) and hence create maps of spatial abundance (Royle, Nichols, and Kéry, 2005).

Despite the popularity of the N-mixture model, few studies have made comparisons with estimates derived via alternative methods or undertaken simulation studies of performance (Kéry et al., 2005; Hunt, Weckerly, and Ott, 2012; Couturier et al., 2013). A potential issue for fitting the model using classical inference is the need to specify an upper bound, K, to approximate an infinite summation in the likelihood. We found this matter was rarely mentioned in publications. For example, McIntyre et al. (2012) used simulated data to support their amphibian study, highlighting the benefit of more sampling occasions, particularly when detection probability was low, however the value of K used was not provided. When software such as unmarked (Fiske and Chandler, 2011) written in R (R Core Team, 2014) and PRESENCE (Hines, 2011) is used for model fitting, it is possible that only default values of the bound are employed. Couturier et al. (2013) suggest bias could be induced by the choice of K for low detection probabilities.

In this article, we investigate computational aspects of fitting N-mixture models, in particular via a simulation study for scenarios where detection probability is low and/or the number of sampling occasions is small. This may be important for the study of cryptic species, and have implications for sample design: many applications to date have made only three visits, whereas in Royle (2004a) simulations were tested for five visits and an application made to data with 10 visits. When only one sampling visit is made, it is well known that the N-mixture model reduces to a thinned Poisson distribution, with only one estimable parameter, the product of mean abundance and detection probability, a feature which underlies aspects of the work which follows.

The N-mixture model is described in Section The N-Mixture Model. In Section Equivalence of the Poisson N-Mixture Model With a Multivariate Poisson Model we explain the equivalence of the Poisson N-mixture model with a multivariate Poisson distribution. We use this formulation to show that infinite estimates of abundance may arise, and provide a simple diagnostic to identify such cases. The multivariate Poisson formulation has the advantage of not requiring a constant K to be set. Section Explicit Form for the Bivariate Negative-Binomial Case provides the probability function in the bivariate negative-binomial case. In Section The Effect of the Choice of K on Fitting the N-Mixture Model: Poisson Case, we show how the choice of K in the N-mixture model interacts with the occurrence of infinite estimates of abundance, and how incorrect conclusions may arise. An automatic method for choosing K is provided. Section Moment Estimation for a Mixed-Poisson N-Mixture Model provides moment estimates and evaluates the use of two diagnostic tests for the negative-binomial case for when infinite estimates of abundance may arise. Section Application to Hermann's Tortoise Data provides an application to real data and the article ends with discussion and recommendations.

2. The N-Mixture Model

Under the study design in Royle (2004a), a set of counts is made during sampling visits Inline graphic at Inline graphic locations (sites). The population is assumed to be closed during the period of sampling and each individual is assumed to have the same detection probability p. The counts Inline graphic at site i and time t are assumed to be independent binomial random variables,

graphic file with name biom0071-0237-m1.jpg

where Inline graphic is the unknown population size at site i. To fit the model using classical inference, we assume the Inline graphic to be independent random variables with probability function Inline graphic, and then maximize the likelihood

graphic file with name biom0071-0237-m2.jpg (1)

where Inline graphic. As noted by Royle (2004a), numerical maximization of 2013 requires the replacement of the infinite summation over Inline graphic by a sum with upper limit K. The value of K may be selected by fitting the model for a succession of increasing values and selecting K when the parameter estimates appear to stabilize (Royle, 2004a). We shall consider both Poisson and negative-binomial mixing distributions.

It is our experience that the N-mixture model can produce unrealistically large estimates of abundance and we explain this feature in the article.

3. Equivalence of the Poisson N-Mixture Model With a Multivariate Poisson Model

The number of individuals observed at a site at time t can be written as the convolution of independent random variables, corresponding to those seen only once, those seen twice, etc. This natural feature of the N-mixture model can be formalized as we now show.

Let Inline graphic denote the set of non-empty subsets of Inline graphic, and let the random variable Inline graphic denote the number of individuals seen at site i only on occasion s. For example, Inline graphic denotes the individuals seen at site i on occasions 1, 2, and 4 only. Then, if we let Inline graphic denote those elements of Inline graphic that include t, we can decompose Inline graphic as

graphic file with name biom0071-0237-m3.jpg

For example, with Inline graphic, we have

graphic file with name biom0071-0237-m4.jpg

Conditional on Inline graphic, the joint distribution of the set of random variables Inline graphic is multinomial, with index Inline graphic and probabilities Inline graphic, where Inline graphic denotes the number of elements in the set s. When Inline graphic, the Inline graphic are independent Poisson random variables, with

graphic file with name biom0071-0237-m5.jpg

see Johnson, Kotz, and Balakrishnan (1997, p. 146). The thinned Poisson is the case Inline graphic.

It follows that the joint distribution of Inline graphic is multivariate Poisson (Johnson et al., 1997, Chapter 37), with

graphic file with name biom0071-0237-m6.jpg

There are Inline graphic subsets Inline graphic such that Inline graphic). Hence

graphic file with name biom0071-0237-m7.jpg

Similarly, if we let Inline graphic denote the elements of Inline graphic that include both t and u then

graphic file with name biom0071-0237-m8.jpg

There are Inline graphic subsets Inline graphic such that Inline graphic). Hence, for Inline graphic,

graphic file with name biom0071-0237-m9.jpg

and Inline graphic.

This result is a special case of Johnson et al. (1997, equation 37.88), which is stated without proof.

Example: T=2, Poisson Case

Cormack (1989) mentions this case in closed-population capture–recapture modeling of data from one site only.

Suppressing site dependence, we have

graphic file with name biom0071-0237-m10.jpg

where Inline graphic are independent with Inline graphic, where Inline graphic and Inline graphic, where Inline graphic. Note that small p would result typically in small values for Inline graphic, and as p tends to zero Inline graphic and Inline graphic become independent, so that the model reverts to a thinned Poisson.

The counts Inline graphic follow a bivariate Poisson distribution with Inline graphic, and the bivariate Poisson probability is

graphic file with name biom0071-0237-m11.jpg (2)

Including site dependence, the likelihood is

graphic file with name biom0071-0237-m12.jpg (3)

For Inline graphic the expressions of 2013 and 2011 are identical, but the likelihood of 2011 may be maximized without requiring selection of a value K.

3.1. Multivariate Poisson Distribution

For general T, let Inline graphic denote the set of all possible values Inline graphic of the random variables Inline graphic such that

graphic file with name biom0071-0237-m13.jpg

Because the random variables Inline graphic are independent, the joint probability function of Inline graphic is

graphic file with name biom0071-0237-m14.jpg

and

graphic file with name biom0071-0237-m15.jpg

There are Inline graphic elements Inline graphic such that Inline graphic, for Inline graphic. Hence

graphic file with name biom0071-0237-m16.jpg

Therefore, we can write

graphic file with name biom0071-0237-m17.jpg (4)

The case Inline graphic is given in 2011. The associated R program incorporates efficient construction of Inline graphic.

3.2. Performance of the Multivariate Poisson Model

For illustration, we investigate performance of the multivariate Poisson model via simulation from the fitted model. We assess output for the cases Inline graphic based upon 1000 simulations where Inline graphic, Inline graphic and Inline graphic. The chosen parameter values were guided by those used in Royle (2004a). The model was fitted using the optim function in the R software package (R Core Team, 2014) using the default Nelder–Mead algorithm and a tolerance value of Inline graphic. The results were checked with those from using several other optim algorithms, including simulated annealing and quasi-Newton. We observe that estimates for Inline graphic were very large in some cases (the maximum estimate from 1000 simulations was Inline graphic when Inline graphic, Inline graphic, and Inline graphic). Figure1 shows that non-positive values of a covariance diagnostic,

Figure 1.

Figure 1

Log(Inline graphic) from the bivariate Poisson model plotted against the covariance diagnostic, covInline graphic from 2008, based upon 1000 simulated datasets for Inline graphic, Inline graphic and Inline graphic. Values at which the covariance diagnostic is negative are shown by crosses. This figure appears in color in the electronic version of this article.

graphic file with name biom0071-0237-m18.jpg (5)

can identify the high estimates of Inline graphic from fitting the bivariate Poisson. Here Inline graphic denotes the mean of the product Inline graphic over S sites. Note that this (intraclass) estimate is appropriate as Inline graphic. A proof that a local maximum of the likelihood occurs at Inline graphic when Inline graphic is given in the Appendix; we are working on a general proof for Inline graphic, as well as a proof that there are no other maxima when the diagnostic is satisfied. Hence, in these instances when Inline graphic, in order to have finite Inline graphic, Inline graphic is actually infinite and the large range of high estimates of abundance obtained in practice, as in Figure1, is partly an artefact of the optimization routine stopping prematurely when the likelihood is flat.

For more than two visits (Inline graphic), the appropriate covariance diagnostic can be estimated as

graphic file with name biom0071-0237-m19.jpg (6)

where the first term consists of the average of the means of all Inline graphic pairwise products. Our conjecture that the diagnostic extends for Inline graphic is supported by Web Figure 4 which compares the covariance diagnostic 2009 with Inline graphic from the multivariate Poisson model for Inline graphic, when Inline graphic.

Performance of the covariance diagnostic is demonstrated further in Table 1, which shows close correspondence between the proportion of simulations where the diagnostic is negative and the proportion where Inline graphic is large (Inline graphic). Table 1 also shows the prevalence of infinite estimates of Inline graphic, particularly as Inline graphic, T, and p decrease. In fact for the case where Inline graphic, Inline graphic, and Inline graphic, a finite value of Inline graphic was not achievable in over half of 1000 simulations.

Table 1.

Performance of the covariance diagnostic for the multivariate Poisson model, based upon 1000 simulations for various scenarios of Inline graphic, p, and T for Inline graphic sites. EPN is the proportion of simulations when the sample covariance diagnostic was negative. EPD is the proportion of simulations where the estimate of Inline graphic

Inline graphic Inline graphic Inline graphic
Inline graphic p EPN EPD EPN EPD EPN EPD
2 0.10 0.505 0.505 0.351 0.351 0.276 0.276
2 0.25 0.225 0.224 0.090 0.089 0.033 0.033
5 0.10 0.427 0.427 0.362 0.361 0.219 0.222
5 0.25 0.167 0.167 0.084 0.084 0.017 0.020
10 0.10 0.398 0.398 0.317 0.318 0.251 0.256
10 0.25 0.180 0.181 0.066 0.066 0.038 0.038

4. Explicit Form for the Bivariate Negative-Binomial Case

The Poisson distribution may be replaced by a mixed-Poisson distribution, for which Inline graphic, when the probability of 2011 becomes

graphic file with name biom0071-0237-m20.jpg

For the negative-binomial distribution, the mixing distribution is gamma with parameters Inline graphic and

graphic file with name biom0071-0237-m21.jpg (7)

which results in the NB-2 form (Hilbe, 2011, p. 187). In this case

graphic file with name biom0071-0237-m22.jpg

Therefore the joint probability for the bivariate negative-binomial model is given by

graphic file with name biom0071-0237-m23.jpg (8)

In the parameterization of 2011, the mean and variance of the gamma distribution are Inline graphic and Inline graphic, respectively. If we now write Inline graphic for the expected value of the Poisson mean, then the variance is Inline graphic and the coefficient of variation of the Poisson mean is Inline graphic. The Poisson model arises as the limit Inline graphic, maintaining Inline graphic.

In terms of the parameters Inline graphic and Inline graphic, Inline graphic and we can write 2014 as

graphic file with name biom0071-0237-m24.jpg (9)

The case for Inline graphic follows in the same way, by integrating the expression of 2003, to give the multivariate negative-binomial probability as

graphic file with name biom0071-0237-m25.jpg

The expression Inline graphic also applies to the negative binomial case, but the Inline graphic are no longer independent.

5. The Effect of the Choice of K on Fitting the N-Mixture Model: Poisson Case

5.1. Incorrect Estimates due to the Choice of K

We now consider how the choice of K for computing the Poisson N-mixture likelihood of 2013 interacts with the occurrence of infinite estimates of Inline graphic. Output is obtained for 1000 simulations based on the parameter values used in Royle (2004a), where Inline graphic, Inline graphic and Inline graphic, but for number of sampling occasions Inline graphic. The models were again fitted using optim in the R software package. The parameters p and Inline graphic were constrained to be in range via logit and log link functions, respectively. Each simulated dataset was fitted with Inline graphic.

We see that large finite estimates of abundance can arise, in particular where the number of sampling occasions T is small (Figure2). Specifically, a proportion of simulations result in a second peak in the sampling distribution for Inline graphic and the value at which this is found increases with the value of K. Fitting the multivariate Poisson model to simulated data created under comparable scenarios for Inline graphic also produced a second peak in the sampling distribution for Inline graphic, but as described in Section Performance of the Multivariate Poisson Model, the estimates were substantially greater in the absence of the limiting value K in the N-mixture model. An increase in the number of sampling occasions reduces the incidence of high estimates of Inline graphic, which become rare for Inline graphic, as more information is available as T increases. For Inline graphic very few high estimates of Inline graphic occurred in the 1000 simulations. An increase in the number of sites also reduces the proportion of high values (Web Figure 5).

Figure 2.

Figure 2

Kernel density estimates of Inline graphic from the Poisson N-mixture model for Inline graphic sites, Inline graphic and Inline graphic based upon 1000 simulated datasets for Inline graphic, and Inline graphic. This figure appears in color in the electronic version of this article.

Thus when the N-mixture model is fitted by maximizing the likelihood of 2013, when Inline graphic should be infinite, Inline graphic is estimated as large as possible for a given value of K, and Inline graphic is restricted to be as close to zero as possible. We discuss this matter further in Web Appendix 1. The occurrence of large finite estimates of Inline graphic is similar to analogous findings of Wang and Lindsay (2005) in the context of species richness estimation.

5.2. Automatic Choice of K

For the Poisson case the covariance diagnostic identifies when infinite values of Inline graphic arise. When the diagnostic is not satisfied, K may be selected automatically, for example by ensuring that the Poisson upper tail probability is Inline graphic, so that the value of K will adapt for successive iterations according to the estimate of Inline graphic. This approach was also suggested by Guillera-Arroita et al. (2012). We have found this to be a simple and preferable alternative to fitting the model for successively larger values of K until estimates appear to stabilize.

6. Moment Estimation for a Mixed-Poisson N-Mixture Model

Suppose we have an N-mixture model in which Inline graphic follows a mixed-Poisson distribution, as in Section Explicit Form for the Bivariate Negative-Binomial Case, with

graphic file with name biom0071-0237-m26.jpg

Conditional on Inline graphic, the random variables Inline graphic are independent binomial variables, with

graphic file with name biom0071-0237-m27.jpg

Therefore, conditional on Inline graphic

graphic file with name biom0071-0237-m28.jpg

and the corresponding unconditional expectations are

graphic file with name biom0071-0237-m29.jpg (10)
graphic file with name biom0071-0237-m30.jpg (11)
graphic file with name biom0071-0237-m31.jpg (12)

It follows that

graphic file with name biom0071-0237-m32.jpg

6.1. Moment Estimation

We have the following moment estimates for Inline graphic, Inline graphic, and Inline graphic, respectively:

graphic file with name biom0071-0237-m33.jpg

Equating these to the expectations given by (10)(12) yields the following moment estimators of the parameters Inline graphic, and Inline graphic

graphic file with name biom0071-0237-m34.jpg

Because Inline graphic, we require

graphic file with name biom0071-0237-m35.jpg (13)

for a valid set of moment estimates. This is the same diagnostic as used previously in 2009.

We also require Inline graphic. The lower bound yields the new diagnostic

graphic file with name biom0071-0237-m36.jpg (14)

for a finite (moment) estimate of Inline graphic. The upper bound yields

graphic file with name biom0071-0237-m37.jpg

or

graphic file with name biom0071-0237-m38.jpg

which is a consequence of the Cauchy–Schwarz inequality and not a useful diagnostic. The bound Inline graphic given above to ensure Inline graphic and hence Inline graphic finite, gives a new diagnostic.

If we adopt a method-of-moments (MOM) approach for the bivariate Poisson distribution, p is estimated by the sample correlation of the counts, as observed also by Royle (2004b), and Inline graphic is estimated by dividing Inline graphic by this estimate of p. For more than two visits (Inline graphic), p can be estimated by the mean of all sample correlations between counts for different sampling occasions. Then Inline graphic is the sample mean of all counts divided by this estimate of p. This generalizes Holgate's (1964) work, which considered Inline graphic only. In Web Appendix 2 we assess the performance of MOM estimation as a simple method for parameter estimation compared to maximum likelihood for the N-mixture model.

6.2. Performance of the Multivariate Negative-Binomial Model

Given the proposed diagnostics for the mixed-Poisson case in Section Moment Estimation, here we assess the performance of the multivariate negative-binomial model. Simulated data were fitted as in Section Performance of the Multivariate Poisson Model but for the negative binomial case, with Inline graphic and Inline graphic. We again assume that Inline graphic equates to infinite Inline graphic. If both (13)and (14) are negative, Inline graphic is very likely to be infinite and the mean proportion with Inline graphic from 21 scenarios is 0.921 (Table 2). However performance of the diagnostics when one or more of the two diagnostics is negative is less clear. Additionally, Inline graphic may occasionally be infinite despite both diagnostics being positive and on average Inline graphic for approximately 8.5% of simulations when both diagnostics are positive. Performance for the bivariate cases where Inline graphic and Inline graphic is illustrated in Figure 3 and for the cases where Inline graphic and Inline graphic in Web Figures 6–8. We see that neither singly nor in combination do the diagnostics perform as well as the single diagnostic for the Poisson case. We see fewer instances of infinite Inline graphic for large T and p.

Table 2.

Performance of the covariance diagnostic for the multivariate negative-binomial model, based upon 1000 simulations for various scenarios of Inline graphic, p, Inline graphic, and T for Inline graphic sites. EPInline graphic, EPInline graphic, and EPInline graphic are the proportion of simulations where both diagnostics are negative, one or more diagnostic is negative, or both diagnostics are positive, respectively. EPInline graphic, EPInline graphic, and EPInline graphic are the corresponding proportions of those where Inline graphic

Inline graphic p Inline graphic T EPInline graphic EPInline graphic EPInline graphic EPInline graphic EPInline graphic EPInline graphic
2 0.10 1.25 2 0.192 0.938 0.3 0.853 0.388 0.072
2 0.10 1.25 3 0.093 0.925 0.271 0.841 0.426 0.131
2 0.10 5.00 2 0.199 0.92 0.296 0.804 0.274 0.113
2 0.10 5.00 3 0.104 0.904 0.264 0.822 0.293 0.126
2 0.25 1.25 2 0.046 0.913 0.229 0.777 0.571 0.07
2 0.25 1.25 3 0.002 1 0.138 0.681 0.71 0.048
2 0.25 5.00 2 0.064 0.953 0.184 0.826 0.411 0.097
2 0.25 5.00 3 0.011 1 0.103 0.748 0.473 0.047
5 0.10 1.25 2 0.088 0.966 0.347 0.813 0.472 0.121
5 0.10 1.25 3 0.023 1 0.333 0.757 0.52 0.113
5 0.10 5.00 2 0.139 0.935 0.305 0.803 0.282 0.128
5 0.10 5.00 3 0.064 0.906 0.252 0.829 0.343 0.143
5 0.25 1.25 2 0.006 1 0.217 0.71 0.746 0.068
5 0.25 1.25 3 0 0.137 0.533 0.843 0.047
5 0.25 5.00 2 0.038 0.763 0.193 0.741 0.555 0.05
5 0.25 5.00 3 0.002 0.5 0.108 0.694 0.678 0.028
10 0.10 1.25 2 0.032 0.969 0.342 0.813 0.596 0.139
10 0.10 1.25 3 0.005 1 0.325 0.775 0.65 0.097
10 0.10 5.00 2 0.116 0.931 0.322 0.835 0.378 0.108
10 0.10 5.00 3 0.027 0.926 0.302 0.844 0.437 0.105
10 0.25 1.25 2 0 0.193 0.674 0.806 0.069
10 0.25 1.25 3 0 0.125 0.472 0.87 0.029
10 0.25 5.00 2 0.01 0.9 0.156 0.756 0.726 0.054
10 0.25 5.00 3 0.001 1 0.09 0.656 0.817 0.026

Figure 3.

Figure 3

Diagnostic 1 (13) versus diagnostic 2 (14) from the bivariate negative binomial model, based upon 1000 simulated datasets for Inline graphic, Inline graphic, Inline graphic, and Inline graphic. Values at which Inline graphic and Inline graphic are shown by circles and crosses, respectively. This figure appears in color in the electronic version of this article.

7. Application to Hermann's Tortoise Data

Here we analyze data from a study of the threatened Hermann's tortoise Testudo hermanni in southeastern France. One hundred and eighteen sites were each surveyed three times during a period when the species is most active. Full details are provided in Couturier et al. (2013), and we briefly reassess the conclusions drawn in their article and demonstrate the effect of study design on results.

For the tortoise data, optimization of the negative-binomial model confirms that Inline graphic is infinite in the negative binomial model for these data; after 500 iterations, the estimates had reached

graphic file with name biom0071-0237-m39.jpg

As noted in Couturier et al. (2013), the fit is much improved compared to the Poisson case, with -maximum log-likelihood 540.34 versus 576.27, but at the expense of Inline graphic becoming infinite. Hence for this dataset a finite estimate of mean abundance can be obtained for the Poisson but not for the negative-binomial. Whilst the first diagnostic (13) is positive, Inline graphic, so that the Poisson estimate is finite, the additional diagnostic (14) is negative, Inline graphic.

The zero-inflated Poisson is an intermediate model between the Poisson and negative-binomial, with -maximum log-likelihood 562.13 for these data. The zero-inflated Poisson therefore provides an improvement upon the Poisson case, but still yields the finite parameter estimate Inline graphic.

To show the potential effect of study design on model performance, we inspect the sample covariance diagnostic (13) for this dataset for the Poisson case for a reduced number of sites and/or visits. Taking two of the three visits made at all sites, the diagnostic was always positive (0.97–1.17). The diagnostic based upon all three visits but a random sample of fewer sites, was negative for 1.7% and 0% of 1000 samples, respectively for Inline graphic and Inline graphic. However for only two visits, the diagnostic was negative for 9.0% and 0.8% of 1000 samples, respectively for Inline graphic and Inline graphic.

8. Discussion and Recommendations

We have shown that the N-mixture model can produce infinite estimates of abundance, particularly when working with a limited number of sampling occasions and low detection probability. The equivalence of the N-mixture model with the multivariate Poisson has been demonstrated, allowing us to understand and diagnose poor behavior of the N-mixture model.

We believe the equivalence of the Poisson N-mixture model to the multivariate Poisson distribution to be previously largely unknown, especially in statistical ecology. The multivariate Poisson model conveniently avoids the requirement to select an upper bound K. We provide code for fitting the multivariate Poisson and negative-binomial models. Possible alternative techniques for fitting the multivariate distributions include using the EM algorithm (Karlis, 2003), a composite likelihood (Jost, Brcich, and Zoubir, 2006) or a symbolic computation approach (Sontag and Zeilberger, 2010). Consequently this equivalence could also have the alternative purpose of using the N-mixture model to provide simple fitting of the multivariate Poisson and negative-binomial models for particular covariance structures.

A recent extension of the N-mixture model to open populations by including population dynamics parameters offers great potential but also requires an upper bound to be set (Dail and Madsen, 2011). Further exploration of this model via simulation to assess performance is in progress. Kéry et al. (2009) extended the N-mixture model to allow for analysis of data resulting from closed sampling periods connected by open periods and the multivariate formulations also apply in that case. Dorazio, Martin, and Edwards (2013) provide an extension in which p is given a distribution at each visit. The binomial distribution in 2013 is then replaced by a beta-binomial. This has also been considered in a Bayesian context by Martin et al. (2011). For the multivariate Poisson case this extension is dealt with by appropriate numerical integration of the probability of 2003. An increasing number of studies use a Bayesian approach for parameter estimation (Kéry et al., 2009; Graves et al., 2011). Further simulation study comparing a Bayesian approach with maximum-likelihood estimation could show whether this approach can also produce poor estimates in some scenarios. Some comparisons have been made by Toribio, Gray, and Liang (2012), based upon parameter values from Royle (2004a).

In practice, covariates are frequently used to describe variation in abundance and detection. Further analysis could determine how the inclusion of covariates might affect instances where a finite abundance estimate cannot be obtained for a model with constant abundance and detection, and in particular determine whether parameters may become identifiable.

Good experimental design is vital for occupancy studies; see for example Guillera-Arroita, Ridout, and Morgan, (2010, 2014). The same issues apply for N-mixture work, though with the different perspective of avoiding poor model-fitting behavior. If possible, study design effort should be distributed to ensure more than two visits are made to each site (in addition to including a reasonable number of sites). Alternatively a study design where more visits are made to a subset of the sites is worth exploring.

For maximum-likelihood estimation, we recommend using MOM estimates to start the iterative search for MLEs. In the Poisson case the covariance diagnostic may be used to determine when infinite estimates of abundance may arise. Infinite estimates of abundance may occur for some model choices but not others, as for the Hermann's Tortoise case study. Hence we advise fitting the model for multiple distribution choices, to identify which may provide finite estimates of abundance. An R program is available in the Supplementary Materials which allows for covariates in the detection and abundance parameters.

9. Supplementary Materials

The Web Appendices referenced in Sections Performance of the Multivariate Poisson Model, Incorrect Estimates due to the Choice of K, and Performance of the Multivariate Negative-Binomial Model, together with R code, are available with this paper at the Biometrics website on Wiley Online Library.

Acknowledgments

This work was part-funded by EPSRC grants EP/I000917/1 and EP/P505577/1. We thank Bill Link, Peter Jupp and Marc Kéry for their useful discussion, and Thibaut Couturier for supplying the Hermann's tortoise data. Useful input was provided by two referees and an Associate Editor.

Appendix

Appendix

Proof that when Inline graphic a local maximum of the likelihood occurs at Inline graphic when covInline graphic.

It is convenient here to set Inline graphic. It is simple to prove that Inline graphic, as noted in Holgate (1964). For observation Inline graphic, we write

graphic file with name biom0071-0237-m40.jpg

The profile log-likelihood function for Inline graphic is then given by

graphic file with name biom0071-0237-m41.jpg
graphic file with name biom0071-0237-m42.jpg

We deduce that

graphic file with name biom0071-0237-m43.jpg

Thus

graphic file with name biom0071-0237-m44.jpg

and

graphic file with name biom0071-0237-m45.jpg

biom0071-0237-sd1.pdf (1.4MB, pdf)
biom0071-0237-sd2.zip (4.6KB, zip)

References

  1. Cormack RM. Log-linear models for capture–recapture. Biometrics. 1989;45:395–413. [Google Scholar]
  2. Couturier T, Cheylan M, Bertolero A, Astruc G. Besnard A. Estimating abundance and population trends when detection is low andhighly variable: A comparison of three methods for the Hermann's tortoise. The Journal of Wildlife Management. 2013;77:454–462. [Google Scholar]
  3. Dail D, Madsen L. Models for estimating abundance from repeated counts of an openmetapopulation. Biometrics. 2011;67:577–587. doi: 10.1111/j.1541-0420.2010.01465.x. [DOI] [PubMed] [Google Scholar]
  4. Dodd CK, Dorazio RM. Using counts to simultaneously estimate abundance and detectionprobabilities in a salamander community. Herpetologica. 2004;60:468–478. [Google Scholar]
  5. Dorazio RM, Martin J. Edwards HH. Estimating abundance while accounting for rarity, correlatedbehavior, and other sources of variation in counts. Ecology. 2013;94:1472–1478. doi: 10.1890/12-1365.1. [DOI] [PubMed] [Google Scholar]
  6. Fiske I, Chandler RB. Journal of Statistical Software. 2011;43:1–23. doi: 10.18637/jss.v043.i01. ).Unmarked: An R package for fitting hierarchical modelsof wildlife occurrence and abundance. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Graves TA, Kendall KC, Royle JA, Stetz JB. Macleod AC. Linking landscape characteristics to local grizzly bear abundanceusing multiple detection methods in a hierarchical model. Animal Conservation. 2011;14:652–664. [Google Scholar]
  8. Guillera-Arroita G, Ridout MS. Morgan BJT. Design of occupancy studies with imperfect detection. Methods in Ecology and Evolution. 2010;1:131–139. [Google Scholar]
  9. Guillera-Arroita G, Ridout MS. Morgan BJT. Two-stage sequential Bayesian study design forspecies occupancy estimation. Journal of Agricultural, Biological,and Environmental Statistics. 2014;19:278–291. [Google Scholar]
  10. Guillera-Arroita G, Ridout MS, Morgan BJT. Linkie M. Models for species-detection data collected along transects in thepresence of abundance-induced heterogeneity and clustering in the detectionprocess. Methods in Ecology and Evolution. 2012;3:358–367. [Google Scholar]
  11. Hilbe JM. Negative Binomial Regression. 2011 .New York: Cambridge University Press. [Google Scholar]
  12. Hines JE. Program PRESENCE 4.1—Software to estimate patch occupancy andrelated parameters. 2011 .U.S. Geological Survey Patuxent Wildlife Research Center, Maryland. [Google Scholar]
  13. Holgate P. Estimation for the bivariate Poisson distribution. Biometrika. 1964;51:241–245. [Google Scholar]
  14. Hunt JW, Weckerly FW. Ott JR. Reliability of occupancy and binomial mixture models for estimatingabundance of golden-cheeked warblers (Setophaga chrysoparia) The Auk. 2012;129:105–114. [Google Scholar]
  15. Johnson NL, Kotz S. Balakrishnan N. Discrete Multivariate Distributions. 1997 and. New York: Wiley. [Google Scholar]
  16. Jost TA, Brcich RF. Zoubir AM. Estimating the parameters of the multivariate Poisson distributionusing the composite likelihood concept. The Proceedings of the 31st IEEE International Conference onAcoustics, Speech and Signal Processing. 2006 and. In Toulouse, France: IEEE. [Google Scholar]
  17. Karlis D. An EM algorithm for multivariate Poisson distribution and relatedmodels. Journal of Applied Statistics. 2003;30:63–77. [Google Scholar]
  18. Kéry M. Estimating abundance from bird counts: Binomial mixture modelsuncover complex covariate relationships. The Auk. 2008;125:336–345. [Google Scholar]
  19. Kéry M, Dorazio RM, Soldaat L, Van Strien A, Zuiderwijk A. Royle JA. Trend estimation in populations with imperfect detection. Journal of Applied Ecology. 2009;46:1163–1172. [Google Scholar]
  20. Kéry M, Royle JA. Schmid H. Modeling avian abundance from replicated counts using binomialmixture models. Ecological Applications. 2005;15:1450–1461. [Google Scholar]
  21. Martin J, Royle JA, Mackenzie DI, Edwards HH, Kéry M. Gardner B. Accounting for non-independent detection when estimating abundance oforganisms with a Bayesian approach. Methods in Ecology and Evolution. 2011;2:595–601. [Google Scholar]
  22. McIntyre AP, Jones JE, Lund EM, Waterstrat FT, Giovanini JN, Duke SD, Hayes MP, Quinn T. Kroll AJ. Empirical and simulation evaluations of an abundance estimator usingunmarked individuals of cryptic forest-dwelling taxa. Forest Ecology and Management. 2012;286:129–136. [Google Scholar]
  23. R Core Team. R: A Language and Environment for Statistical Computing. 2014 .R Foundation for Statistical Computing, Vienna, Austria. [Google Scholar]
  24. Royle JA. N-mixture models for estimating population size fromspatially replicated counts. Biometrics. 2004a;60:108–115. doi: 10.1111/j.0006-341X.2004.00142.x. [DOI] [PubMed] [Google Scholar]
  25. Royle JA. Generalized estimators of avian abundance from countsurvey data. Animal Biodiversity and Conservation. 2004b;27:375–386. [Google Scholar]
  26. Royle JA, Nichols JD. Kéry M. Modelling occurrence and abundance of species when detection isimperfect. Oikos. 2005;110:353–359. [Google Scholar]
  27. Sontag ED, Zeilberger D. A symbolic computation approach to a problem involving multivariatePoisson distributions. Advances in Applied Mathematics. 2010;44:359–377. [Google Scholar]
  28. Toribio S, Gray B. Liang S. An evaluation of the Bayesian approach to fitting the N-mixturemodel for use with pseudo-replicated count data. Journal of Statistical Computation and Simulation. 2012;82:1135–1143. [Google Scholar]
  29. Wang J-PZ, Lindsay BG. A penalized nonparametric maximum likelihood approach to speciesrichness estimation. Journal of the American Statistical Association. 2005;100:942–959. [Google Scholar]
  30. Zellweger-Fischer J, Kéry M. Pasinelli G. Population trends of brown hares in Switzerland: The role ofland-use and ecological compensation areas. Biological Conservation. 2011;144:1364–1373. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

biom0071-0237-sd1.pdf (1.4MB, pdf)
biom0071-0237-sd2.zip (4.6KB, zip)

Articles from Biometrics are provided here courtesy of Wiley

RESOURCES