Computational Aspects of N-Mixture Models

Emily B Dennis; Byron JT Morgan; Martin S Ridout

doi:10.1111/biom.12246

. 2014 Oct 14;71(1):237–246. doi: 10.1111/biom.12246

Computational Aspects of N-Mixture Models

Emily B Dennis ^1,^*, Byron JT Morgan ¹, Martin S Ridout ¹

PMCID: PMC4406156 PMID: 25314629

Abstract

The N-mixture model is widely used to estimate the abundance of a population in the presence of unknown detection probability from only a set of counts subject to spatial and temporal replication (Royle, 2004, Biometrics 60, 105–115). We explain and exploit the equivalence of N-mixture and multivariate Poisson and negative-binomial models, which provides powerful new approaches for fitting these models. We show that particularly when detection probability and the number of sampling occasions are small, infinite estimates of abundance can arise. We propose a sample covariance as a diagnostic for this event, and demonstrate its good performance in the Poisson case. Infinite estimates may be missed in practice, due to numerical optimization procedures terminating at arbitrarily large values. It is shown that the use of a bound, K, for an infinite summation in the N-mixture likelihood can result in underestimation of abundance, so that default values of K in computer packages should be avoided. Instead we propose a simple automatic way to choose K. The methods are illustrated by analysis of data on Hermann's tortoise Testudo hermanni.

Keywords: Abundance estimation, Method of moments, Multivariate negative binomial, Multivariate Poisson, Optimal design, Sampling, Temporal replication

1. Introduction

Estimating the abundance of a population is an important component of ecological research. N-mixture models can be used to estimate animal abundance from counts with both spatial and temporal replication whilst accounting for imperfect detection (Royle, ^2004a). Whereas alternative sampling methods for obtaining estimates of abundance exist, such as capture–recapture, distance, removal and multiple-observer sampling, these may be expensive in effort or cost, or impractical for some species and scenarios. A benefit of the N-mixture model is the reasonably low comparative cost and effort required for data collection which does not require individuals to be identified. This is especially true of many citizen-science based monitoring programs.

Consequently, since development by Royle (2004a), many applications and extensions of the N-mixture model have been made. These include applications to various taxa, including birds (Kéry, Royle, and Schmid, 2005), mammals (Zellweger-Fischer, Kéry, and Pasinelli, 2011), and amphibians (Dodd and Dorazio, 2004; McIntyre et al., ²⁰¹²). In addition, covariates have often been used to examine spatial patterns in abundance and detection (Kéry, 2008) and hence create maps of spatial abundance (Royle, Nichols, and Kéry, 2005).

Despite the popularity of the N-mixture model, few studies have made comparisons with estimates derived via alternative methods or undertaken simulation studies of performance (Kéry et al., 2005; Hunt, Weckerly, and Ott, ²⁰¹²; Couturier et al., ²⁰¹³). A potential issue for fitting the model using classical inference is the need to specify an upper bound, K, to approximate an infinite summation in the likelihood. We found this matter was rarely mentioned in publications. For example, McIntyre et al. (2012) used simulated data to support their amphibian study, highlighting the benefit of more sampling occasions, particularly when detection probability was low, however the value of K used was not provided. When software such as unmarked (Fiske and Chandler, 2011) written in R (R Core Team, ²⁰¹⁴) and PRESENCE (Hines, 2011) is used for model fitting, it is possible that only default values of the bound are employed. Couturier et al. (2013) suggest bias could be induced by the choice of K for low detection probabilities.

In this article, we investigate computational aspects of fitting N-mixture models, in particular via a simulation study for scenarios where detection probability is low and/or the number of sampling occasions is small. This may be important for the study of cryptic species, and have implications for sample design: many applications to date have made only three visits, whereas in Royle (2004a) simulations were tested for five visits and an application made to data with 10 visits. When only one sampling visit is made, it is well known that the N-mixture model reduces to a thinned Poisson distribution, with only one estimable parameter, the product of mean abundance and detection probability, a feature which underlies aspects of the work which follows.

The N-mixture model is described in Section The N-Mixture Model. In Section Equivalence of the Poisson N-Mixture Model With a Multivariate Poisson Model we explain the equivalence of the Poisson N-mixture model with a multivariate Poisson distribution. We use this formulation to show that infinite estimates of abundance may arise, and provide a simple diagnostic to identify such cases. The multivariate Poisson formulation has the advantage of not requiring a constant K to be set. Section Explicit Form for the Bivariate Negative-Binomial Case provides the probability function in the bivariate negative-binomial case. In Section The Effect of the Choice of K on Fitting the N-Mixture Model: Poisson Case, we show how the choice of K in the N-mixture model interacts with the occurrence of infinite estimates of abundance, and how incorrect conclusions may arise. An automatic method for choosing K is provided. Section Moment Estimation for a Mixed-Poisson N-Mixture Model provides moment estimates and evaluates the use of two diagnostic tests for the negative-binomial case for when infinite estimates of abundance may arise. Section Application to Hermann's Tortoise Data provides an application to real data and the article ends with discussion and recommendations.

2. The N-Mixture Model

Under the study design in Royle (2004a), a set of counts is made during sampling visits Inline graphic at locations (sites). The population is assumed to be closed during the period of sampling and each individual is assumed to have the same detection probability p. The counts at site i and time t are assumed to be independent binomial random variables,

where Inline graphic is the unknown population size at site i. To fit the model using classical inference, we assume the to be independent random variables with probability function , and then maximize the likelihood

(1)

where Inline graphic . As noted by Royle (2004a), numerical maximization of ²⁰¹³ requires the replacement of the infinite summation over by a sum with upper limit K. The value of K may be selected by fitting the model for a succession of increasing values and selecting K when the parameter estimates appear to stabilize (Royle, ^2004a). We shall consider both Poisson and negative-binomial mixing distributions.

It is our experience that the N-mixture model can produce unrealistically large estimates of abundance and we explain this feature in the article.

3. Equivalence of the Poisson N-Mixture Model With a Multivariate Poisson Model

The number of individuals observed at a site at time t can be written as the convolution of independent random variables, corresponding to those seen only once, those seen twice, etc. This natural feature of the N-mixture model can be formalized as we now show.

Let Inline graphic denote the set of non-empty subsets of , and let the random variable denote the number of individuals seen at site i only on occasion s. For example, denotes the individuals seen at site i on occasions 1, 2, and 4 only. Then, if we let denote those elements of that include t, we can decompose Inline graphic as

For example, with Inline graphic , we have

Conditional on Inline graphic , the joint distribution of the set of random variables is multinomial, with index and probabilities , where denotes the number of elements in the set s. When , the are independent Poisson random variables, with

see Johnson, Kotz, and Balakrishnan (1997, p. 146). The thinned Poisson is the case Inline graphic .

It follows that the joint distribution of Inline graphic is multivariate Poisson (Johnson et al., 1997, Chapter 37), with

There are Inline graphic subsets such that ). Hence

Similarly, if we let Inline graphic denote the elements of that include both t and u then

There are Inline graphic subsets such that ). Hence, for ,

and Inline graphic .

This result is a special case of Johnson et al. (1997, equation 37.88), which is stated without proof.

Example: T=2, Poisson Case

Cormack (1989) mentions this case in closed-population capture–recapture modeling of data from one site only.

Suppressing site dependence, we have

where Inline graphic are independent with , where and , where . Note that small p would result typically in small values for , and as p tends to zero and become independent, so that the model reverts to a thinned Poisson.

The counts Inline graphic follow a bivariate Poisson distribution with , and the bivariate Poisson probability is

graphic file with name biom0071-0237-m11.jpg

(2)

Including site dependence, the likelihood is

graphic file with name biom0071-0237-m12.jpg

(3)

For Inline graphic the expressions of ²⁰¹³ and ²⁰¹¹ are identical, but the likelihood of ²⁰¹¹ may be maximized without requiring selection of a value K.

3.1. Multivariate Poisson Distribution

For general T, let Inline graphic denote the set of all possible values of the random variables such that

Because the random variables Inline graphic are independent, the joint probability function of is

and

graphic file with name biom0071-0237-m15.jpg

There are Inline graphic elements such that , for . Hence

graphic file with name biom0071-0237-m16.jpg

Therefore, we can write

graphic file with name biom0071-0237-m17.jpg

(4)

The case Inline graphic is given in ²⁰¹¹. The associated R program incorporates efficient construction of .

3.2. Performance of the Multivariate Poisson Model

For illustration, we investigate performance of the multivariate Poisson model via simulation from the fitted model. We assess output for the cases Inline graphic based upon 1000 simulations where , and . The chosen parameter values were guided by those used in Royle (2004a). The model was fitted using the optim function in the R software package (R Core Team, ²⁰¹⁴) using the default Nelder–Mead algorithm and a tolerance value of . The results were checked with those from using several other optim algorithms, including simulated annealing and quasi-Newton. We observe that estimates for Inline graphic were very large in some cases (the maximum estimate from 1000 simulations was when , , and ). Figure1 shows that non-positive values of a covariance diagnostic,

Log() from the bivariate Poisson model plotted against the covariance diagnostic, cov from ²⁰⁰⁸, based upon 1000 simulated datasets for , and . Values at which the covariance diagnostic is negative are shown by crosses. This figure appears in color in the electronic version of this article.

Inline graphic — Log() from the bivariate Poisson model plotted against the covariance diagnostic, cov from ²⁰⁰⁸, based upon 1000 simulated datasets for , and . Values at which the covariance diagnostic is negative are shown by crosses. This figure appears in color in the electronic version of this article.

(5)

can identify the high estimates of Inline graphic from fitting the bivariate Poisson. Here denotes the mean of the product over S sites. Note that this (intraclass) estimate is appropriate as . A proof that a local maximum of the likelihood occurs at when is given in the Appendix; we are working on a general proof for , as well as a proof that there are no other maxima when the diagnostic is satisfied. Hence, in these instances when Inline graphic , in order to have finite , is actually infinite and the large range of high estimates of abundance obtained in practice, as in Figure1, is partly an artefact of the optimization routine stopping prematurely when the likelihood is flat.

For more than two visits ( Inline graphic ), the appropriate covariance diagnostic can be estimated as

(6)

where the first term consists of the average of the means of all Inline graphic pairwise products. Our conjecture that the diagnostic extends for is supported by Web Figure 4 which compares the covariance diagnostic ²⁰⁰⁹ with from the multivariate Poisson model for , when .

Performance of the covariance diagnostic is demonstrated further in Table 1, which shows close correspondence between the proportion of simulations where the diagnostic is negative and the proportion where Inline graphic is large (). Table 1 also shows the prevalence of infinite estimates of , particularly as , T, and p decrease. In fact for the case where , , and , a finite value of was not achievable in over half of 1000 simulations.

Table 1.

Performance of the covariance diagnostic for the multivariate Poisson model, based upon 1000 simulations for various scenarios of Inline graphic , p, and T for sites. EPN is the proportion of simulations when the sample covariance diagnostic was negative. EPD is the proportion of simulations where the estimate of


	p	EPN	EPD	EPN	EPD	EPN	EPD
2	0.10	0.505	0.505	0.351	0.351	0.276	0.276
2	0.25	0.225	0.224	0.090	0.089	0.033	0.033
5	0.10	0.427	0.427	0.362	0.361	0.219	0.222
5	0.25	0.167	0.167	0.084	0.084	0.017	0.020
10	0.10	0.398	0.398	0.317	0.318	0.251	0.256
10	0.25	0.180	0.181	0.066	0.066	0.038	0.038

Open in a new tab

4. Explicit Form for the Bivariate Negative-Binomial Case

The Poisson distribution may be replaced by a mixed-Poisson distribution, for which Inline graphic , when the probability of ²⁰¹¹ becomes

graphic file with name biom0071-0237-m20.jpg

For the negative-binomial distribution, the mixing distribution is gamma with parameters Inline graphic and

(7)

which results in the NB-2 form (Hilbe, 2011, p. 187). In this case

graphic file with name biom0071-0237-m22.jpg

Therefore the joint probability for the bivariate negative-binomial model is given by

graphic file with name biom0071-0237-m23.jpg

(8)

In the parameterization of ²⁰¹¹, the mean and variance of the gamma distribution are Inline graphic and , respectively. If we now write for the expected value of the Poisson mean, then the variance is and the coefficient of variation of the Poisson mean is . The Poisson model arises as the limit , maintaining .

In terms of the parameters Inline graphic and , and we can write ²⁰¹⁴ as

graphic file with name biom0071-0237-m24.jpg

(9)

The case for Inline graphic follows in the same way, by integrating the expression of ²⁰⁰³, to give the multivariate negative-binomial probability as

graphic file with name biom0071-0237-m25.jpg

The expression Inline graphic also applies to the negative binomial case, but the are no longer independent.

5. The Effect of the Choice of K on Fitting the N-Mixture Model: Poisson Case

5.1. Incorrect Estimates due to the Choice of K

We now consider how the choice of K for computing the Poisson N-mixture likelihood of ²⁰¹³ interacts with the occurrence of infinite estimates of Inline graphic . Output is obtained for 1000 simulations based on the parameter values used in Royle (2004a), where , and , but for number of sampling occasions . The models were again fitted using optim in the R software package. The parameters p and were constrained to be in range via logit and log link functions, respectively. Each simulated dataset was fitted with Inline graphic .

We see that large finite estimates of abundance can arise, in particular where the number of sampling occasions T is small (Figure2). Specifically, a proportion of simulations result in a second peak in the sampling distribution for Inline graphic and the value at which this is found increases with the value of K. Fitting the multivariate Poisson model to simulated data created under comparable scenarios for also produced a second peak in the sampling distribution for , but as described in Section Performance of the Multivariate Poisson Model, the estimates were substantially greater in the absence of the limiting value K in the N-mixture model. An increase in the number of sampling occasions reduces the incidence of high estimates of Inline graphic , which become rare for , as more information is available as T increases. For very few high estimates of occurred in the 1000 simulations. An increase in the number of sites also reduces the proportion of high values (Web Figure 5).

Kernel density estimates of from the Poisson N-mixture model for sites, and based upon 1000 simulated datasets for , and . This figure appears in color in the electronic version of this article.

Thus when the N-mixture model is fitted by maximizing the likelihood of ²⁰¹³, when Inline graphic should be infinite, is estimated as large as possible for a given value of K, and is restricted to be as close to zero as possible. We discuss this matter further in Web Appendix 1. The occurrence of large finite estimates of is similar to analogous findings of Wang and Lindsay (2005) in the context of species richness estimation.

5.2. Automatic Choice of K

For the Poisson case the covariance diagnostic identifies when infinite values of Inline graphic arise. When the diagnostic is not satisfied, K may be selected automatically, for example by ensuring that the Poisson upper tail probability is , so that the value of K will adapt for successive iterations according to the estimate of . This approach was also suggested by Guillera-Arroita et al. (2012). We have found this to be a simple and preferable alternative to fitting the model for successively larger values of K until estimates appear to stabilize.

6. Moment Estimation for a Mixed-Poisson N-Mixture Model

Suppose we have an N-mixture model in which Inline graphic follows a mixed-Poisson distribution, as in Section Explicit Form for the Bivariate Negative-Binomial Case, with

Conditional on Inline graphic , the random variables are independent binomial variables, with

Therefore, conditional on Inline graphic

and the corresponding unconditional expectations are

(10)

(11)

(12)

It follows that

6.1. Moment Estimation

We have the following moment estimates for Inline graphic , , and , respectively:

graphic file with name biom0071-0237-m33.jpg

Equating these to the expectations given by (10)–(12) yields the following moment estimators of the parameters Inline graphic , and

Because Inline graphic , we require

(13)

for a valid set of moment estimates. This is the same diagnostic as used previously in ²⁰⁰⁹.

We also require Inline graphic . The lower bound yields the new diagnostic

(14)

for a finite (moment) estimate of Inline graphic . The upper bound yields

which is a consequence of the Cauchy–Schwarz inequality and not a useful diagnostic. The bound Inline graphic given above to ensure and hence finite, gives a new diagnostic.

If we adopt a method-of-moments (MOM) approach for the bivariate Poisson distribution, p is estimated by the sample correlation of the counts, as observed also by Royle (2004b), and Inline graphic is estimated by dividing by this estimate of p. For more than two visits (), p can be estimated by the mean of all sample correlations between counts for different sampling occasions. Then is the sample mean of all counts divided by this estimate of p. This generalizes Holgate's (1964) work, which considered Inline graphic only. In Web Appendix 2 we assess the performance of MOM estimation as a simple method for parameter estimation compared to maximum likelihood for the N-mixture model.

6.2. Performance of the Multivariate Negative-Binomial Model

Given the proposed diagnostics for the mixed-Poisson case in Section Moment Estimation, here we assess the performance of the multivariate negative-binomial model. Simulated data were fitted as in Section Performance of the Multivariate Poisson Model but for the negative binomial case, with Inline graphic and . We again assume that equates to infinite . If both (13)and (14) are negative, is very likely to be infinite and the mean proportion with from 21 scenarios is 0.921 (Table 2). However performance of the diagnostics when one or more of the two diagnostics is negative is less clear. Additionally, Inline graphic may occasionally be infinite despite both diagnostics being positive and on average for approximately 8.5% of simulations when both diagnostics are positive. Performance for the bivariate cases where and is illustrated in Figure 3 and for the cases where and in Web Figures 6–8. We see that neither singly nor in combination do the diagnostics perform as well as the single diagnostic for the Poisson case. We see fewer instances of infinite Inline graphic for large T and p.

Table 2.

Performance of the covariance diagnostic for the multivariate negative-binomial model, based upon 1000 simulations for various scenarios of Inline graphic , p, , and T for sites. EP, EP, and EP are the proportion of simulations where both diagnostics are negative, one or more diagnostic is negative, or both diagnostics are positive, respectively. EP, EP, and EP are the corresponding proportions of those where

	p		T	EP	EP	EP	EP	EP	EP
2	0.10	1.25	2	0.192	0.938	0.3	0.853	0.388	0.072
2	0.10	1.25	3	0.093	0.925	0.271	0.841	0.426	0.131
2	0.10	5.00	2	0.199	0.92	0.296	0.804	0.274	0.113
2	0.10	5.00	3	0.104	0.904	0.264	0.822	0.293	0.126
2	0.25	1.25	2	0.046	0.913	0.229	0.777	0.571	0.07
2	0.25	1.25	3	0.002	1	0.138	0.681	0.71	0.048
2	0.25	5.00	2	0.064	0.953	0.184	0.826	0.411	0.097
2	0.25	5.00	3	0.011	1	0.103	0.748	0.473	0.047
5	0.10	1.25	2	0.088	0.966	0.347	0.813	0.472	0.121
5	0.10	1.25	3	0.023	1	0.333	0.757	0.52	0.113
5	0.10	5.00	2	0.139	0.935	0.305	0.803	0.282	0.128
5	0.10	5.00	3	0.064	0.906	0.252	0.829	0.343	0.143
5	0.25	1.25	2	0.006	1	0.217	0.71	0.746	0.068
5	0.25	1.25	3	0	–	0.137	0.533	0.843	0.047
5	0.25	5.00	2	0.038	0.763	0.193	0.741	0.555	0.05
5	0.25	5.00	3	0.002	0.5	0.108	0.694	0.678	0.028
10	0.10	1.25	2	0.032	0.969	0.342	0.813	0.596	0.139
10	0.10	1.25	3	0.005	1	0.325	0.775	0.65	0.097
10	0.10	5.00	2	0.116	0.931	0.322	0.835	0.378	0.108
10	0.10	5.00	3	0.027	0.926	0.302	0.844	0.437	0.105
10	0.25	1.25	2	0	–	0.193	0.674	0.806	0.069
10	0.25	1.25	3	0	–	0.125	0.472	0.87	0.029
10	0.25	5.00	2	0.01	0.9	0.156	0.756	0.726	0.054
10	0.25	5.00	3	0.001	1	0.09	0.656	0.817	0.026

Open in a new tab

Diagnostic 1 (13) versus diagnostic 2 (14) from the bivariate negative binomial model, based upon 1000 simulated datasets for , , , and . Values at which and are shown by circles and crosses, respectively. This figure appears in color in the electronic version of this article.

7. Application to Hermann's Tortoise Data

Here we analyze data from a study of the threatened Hermann's tortoise Testudo hermanni in southeastern France. One hundred and eighteen sites were each surveyed three times during a period when the species is most active. Full details are provided in Couturier et al. (2013), and we briefly reassess the conclusions drawn in their article and demonstrate the effect of study design on results.

For the tortoise data, optimization of the negative-binomial model confirms that Inline graphic is infinite in the negative binomial model for these data; after 500 iterations, the estimates had reached

As noted in Couturier et al. (2013), the fit is much improved compared to the Poisson case, with -maximum log-likelihood 540.34 versus 576.27, but at the expense of Inline graphic becoming infinite. Hence for this dataset a finite estimate of mean abundance can be obtained for the Poisson but not for the negative-binomial. Whilst the first diagnostic (13) is positive, , so that the Poisson estimate is finite, the additional diagnostic (14) is negative, .

The zero-inflated Poisson is an intermediate model between the Poisson and negative-binomial, with -maximum log-likelihood 562.13 for these data. The zero-inflated Poisson therefore provides an improvement upon the Poisson case, but still yields the finite parameter estimate Inline graphic .

To show the potential effect of study design on model performance, we inspect the sample covariance diagnostic (13) for this dataset for the Poisson case for a reduced number of sites and/or visits. Taking two of the three visits made at all sites, the diagnostic was always positive (0.97–1.17). The diagnostic based upon all three visits but a random sample of fewer sites, was negative for 1.7% and 0% of 1000 samples, respectively for Inline graphic and . However for only two visits, the diagnostic was negative for 9.0% and 0.8% of 1000 samples, respectively for and .

8. Discussion and Recommendations

We have shown that the N-mixture model can produce infinite estimates of abundance, particularly when working with a limited number of sampling occasions and low detection probability. The equivalence of the N-mixture model with the multivariate Poisson has been demonstrated, allowing us to understand and diagnose poor behavior of the N-mixture model.

We believe the equivalence of the Poisson N-mixture model to the multivariate Poisson distribution to be previously largely unknown, especially in statistical ecology. The multivariate Poisson model conveniently avoids the requirement to select an upper bound K. We provide code for fitting the multivariate Poisson and negative-binomial models. Possible alternative techniques for fitting the multivariate distributions include using the EM algorithm (Karlis, 2003), a composite likelihood (Jost, Brcich, and Zoubir, 2006) or a symbolic computation approach (Sontag and Zeilberger, 2010). Consequently this equivalence could also have the alternative purpose of using the N-mixture model to provide simple fitting of the multivariate Poisson and negative-binomial models for particular covariance structures.

A recent extension of the N-mixture model to open populations by including population dynamics parameters offers great potential but also requires an upper bound to be set (Dail and Madsen, 2011). Further exploration of this model via simulation to assess performance is in progress. Kéry et al. (2009) extended the N-mixture model to allow for analysis of data resulting from closed sampling periods connected by open periods and the multivariate formulations also apply in that case. Dorazio, Martin, and Edwards (2013) provide an extension in which p is given a distribution at each visit. The binomial distribution in ²⁰¹³ is then replaced by a beta-binomial. This has also been considered in a Bayesian context by Martin et al. (2011). For the multivariate Poisson case this extension is dealt with by appropriate numerical integration of the probability of ²⁰⁰³. An increasing number of studies use a Bayesian approach for parameter estimation (Kéry et al., 2009; Graves et al., ²⁰¹¹). Further simulation study comparing a Bayesian approach with maximum-likelihood estimation could show whether this approach can also produce poor estimates in some scenarios. Some comparisons have been made by Toribio, Gray, and Liang (2012), based upon parameter values from Royle (2004a).

In practice, covariates are frequently used to describe variation in abundance and detection. Further analysis could determine how the inclusion of covariates might affect instances where a finite abundance estimate cannot be obtained for a model with constant abundance and detection, and in particular determine whether parameters may become identifiable.

Good experimental design is vital for occupancy studies; see for example Guillera-Arroita, Ridout, and Morgan, (2010, 2014). The same issues apply for N-mixture work, though with the different perspective of avoiding poor model-fitting behavior. If possible, study design effort should be distributed to ensure more than two visits are made to each site (in addition to including a reasonable number of sites). Alternatively a study design where more visits are made to a subset of the sites is worth exploring.

For maximum-likelihood estimation, we recommend using MOM estimates to start the iterative search for MLEs. In the Poisson case the covariance diagnostic may be used to determine when infinite estimates of abundance may arise. Infinite estimates of abundance may occur for some model choices but not others, as for the Hermann's Tortoise case study. Hence we advise fitting the model for multiple distribution choices, to identify which may provide finite estimates of abundance. An R program is available in the Supplementary Materials which allows for covariates in the detection and abundance parameters.

9. Supplementary Materials

The Web Appendices referenced in Sections Performance of the Multivariate Poisson Model, Incorrect Estimates due to the Choice of K, and Performance of the Multivariate Negative-Binomial Model, together with R code, are available with this paper at the Biometrics website on Wiley Online Library.

Acknowledgments

This work was part-funded by EPSRC grants EP/I000917/1 and EP/P505577/1. We thank Bill Link, Peter Jupp and Marc Kéry for their useful discussion, and Thibaut Couturier for supplying the Hermann's tortoise data. Useful input was provided by two referees and an Associate Editor.

Appendix

Proof that when Inline graphic a local maximum of the likelihood occurs at when cov.

It is convenient here to set Inline graphic . It is simple to prove that , as noted in Holgate (1964). For observation , we write

The profile log-likelihood function for Inline graphic is then given by

We deduce that

Thus

and

graphic file with name biom0071-0237-m45.jpg

biom0071-0237-sd1.pdf^{(1.4MB, pdf)}

biom0071-0237-sd2.zip^{(4.6KB, zip)}

References

Cormack RM. Log-linear models for capture–recapture. Biometrics. 1989;45:395–413. [Google Scholar]
Couturier T, Cheylan M, Bertolero A, Astruc G. Besnard A. Estimating abundance and population trends when detection is low andhighly variable: A comparison of three methods for the Hermann's tortoise. The Journal of Wildlife Management. 2013;77:454–462. [Google Scholar]
Dail D, Madsen L. Models for estimating abundance from repeated counts of an openmetapopulation. Biometrics. 2011;67:577–587. doi: 10.1111/j.1541-0420.2010.01465.x. [DOI] [PubMed] [Google Scholar]
Dodd CK, Dorazio RM. Using counts to simultaneously estimate abundance and detectionprobabilities in a salamander community. Herpetologica. 2004;60:468–478. [Google Scholar]
Dorazio RM, Martin J. Edwards HH. Estimating abundance while accounting for rarity, correlatedbehavior, and other sources of variation in counts. Ecology. 2013;94:1472–1478. doi: 10.1890/12-1365.1. [DOI] [PubMed] [Google Scholar]
Fiske I, Chandler RB. Journal of Statistical Software. 2011;43:1–23. doi: 10.18637/jss.v043.i01. ).Unmarked: An R package for fitting hierarchical modelsof wildlife occurrence and abundance. [DOI] [PMC free article] [PubMed] [Google Scholar]
Graves TA, Kendall KC, Royle JA, Stetz JB. Macleod AC. Linking landscape characteristics to local grizzly bear abundanceusing multiple detection methods in a hierarchical model. Animal Conservation. 2011;14:652–664. [Google Scholar]
Guillera-Arroita G, Ridout MS. Morgan BJT. Design of occupancy studies with imperfect detection. Methods in Ecology and Evolution. 2010;1:131–139. [Google Scholar]
Guillera-Arroita G, Ridout MS. Morgan BJT. Two-stage sequential Bayesian study design forspecies occupancy estimation. Journal of Agricultural, Biological,and Environmental Statistics. 2014;19:278–291. [Google Scholar]
Guillera-Arroita G, Ridout MS, Morgan BJT. Linkie M. Models for species-detection data collected along transects in thepresence of abundance-induced heterogeneity and clustering in the detectionprocess. Methods in Ecology and Evolution. 2012;3:358–367. [Google Scholar]
Hilbe JM. Negative Binomial Regression. 2011 .New York: Cambridge University Press. [Google Scholar]
Hines JE. Program PRESENCE 4.1—Software to estimate patch occupancy andrelated parameters. 2011 .U.S. Geological Survey Patuxent Wildlife Research Center, Maryland. [Google Scholar]
Holgate P. Estimation for the bivariate Poisson distribution. Biometrika. 1964;51:241–245. [Google Scholar]
Hunt JW, Weckerly FW. Ott JR. Reliability of occupancy and binomial mixture models for estimatingabundance of golden-cheeked warblers (Setophaga chrysoparia) The Auk. 2012;129:105–114. [Google Scholar]
Johnson NL, Kotz S. Balakrishnan N. Discrete Multivariate Distributions. 1997 and. New York: Wiley. [Google Scholar]
Jost TA, Brcich RF. Zoubir AM. Estimating the parameters of the multivariate Poisson distributionusing the composite likelihood concept. The Proceedings of the 31st IEEE International Conference onAcoustics, Speech and Signal Processing. 2006 and. In Toulouse, France: IEEE. [Google Scholar]
Karlis D. An EM algorithm for multivariate Poisson distribution and relatedmodels. Journal of Applied Statistics. 2003;30:63–77. [Google Scholar]
Kéry M. Estimating abundance from bird counts: Binomial mixture modelsuncover complex covariate relationships. The Auk. 2008;125:336–345. [Google Scholar]
Kéry M, Dorazio RM, Soldaat L, Van Strien A, Zuiderwijk A. Royle JA. Trend estimation in populations with imperfect detection. Journal of Applied Ecology. 2009;46:1163–1172. [Google Scholar]
Kéry M, Royle JA. Schmid H. Modeling avian abundance from replicated counts using binomialmixture models. Ecological Applications. 2005;15:1450–1461. [Google Scholar]
Martin J, Royle JA, Mackenzie DI, Edwards HH, Kéry M. Gardner B. Accounting for non-independent detection when estimating abundance oforganisms with a Bayesian approach. Methods in Ecology and Evolution. 2011;2:595–601. [Google Scholar]
McIntyre AP, Jones JE, Lund EM, Waterstrat FT, Giovanini JN, Duke SD, Hayes MP, Quinn T. Kroll AJ. Empirical and simulation evaluations of an abundance estimator usingunmarked individuals of cryptic forest-dwelling taxa. Forest Ecology and Management. 2012;286:129–136. [Google Scholar]
R Core Team. R: A Language and Environment for Statistical Computing. 2014 .R Foundation for Statistical Computing, Vienna, Austria. [Google Scholar]
Royle JA. N-mixture models for estimating population size fromspatially replicated counts. Biometrics. 2004a;60:108–115. doi: 10.1111/j.0006-341X.2004.00142.x. [DOI] [PubMed] [Google Scholar]
Royle JA. Generalized estimators of avian abundance from countsurvey data. Animal Biodiversity and Conservation. 2004b;27:375–386. [Google Scholar]
Royle JA, Nichols JD. Kéry M. Modelling occurrence and abundance of species when detection isimperfect. Oikos. 2005;110:353–359. [Google Scholar]
Sontag ED, Zeilberger D. A symbolic computation approach to a problem involving multivariatePoisson distributions. Advances in Applied Mathematics. 2010;44:359–377. [Google Scholar]
Toribio S, Gray B. Liang S. An evaluation of the Bayesian approach to fitting the N-mixturemodel for use with pseudo-replicated count data. Journal of Statistical Computation and Simulation. 2012;82:1135–1143. [Google Scholar]
Wang J-PZ, Lindsay BG. A penalized nonparametric maximum likelihood approach to speciesrichness estimation. Journal of the American Statistical Association. 2005;100:942–959. [Google Scholar]
Zellweger-Fischer J, Kéry M. Pasinelli G. Population trends of brown hares in Switzerland: The role ofland-use and ecological compensation areas. Biological Conservation. 2011;144:1364–1373. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

biom0071-0237-sd1.pdf^{(1.4MB, pdf)}

biom0071-0237-sd2.zip^{(4.6KB, zip)}

[b1] Cormack RM. Log-linear models for capture–recapture. Biometrics. 1989;45:395–413. [Google Scholar]

[b2] Couturier T, Cheylan M, Bertolero A, Astruc G. Besnard A. Estimating abundance and population trends when detection is low andhighly variable: A comparison of three methods for the Hermann's tortoise. The Journal of Wildlife Management. 2013;77:454–462. [Google Scholar]

[b3] Dail D, Madsen L. Models for estimating abundance from repeated counts of an openmetapopulation. Biometrics. 2011;67:577–587. doi: 10.1111/j.1541-0420.2010.01465.x. [DOI] [PubMed] [Google Scholar]

[b4] Dodd CK, Dorazio RM. Using counts to simultaneously estimate abundance and detectionprobabilities in a salamander community. Herpetologica. 2004;60:468–478. [Google Scholar]

[b5] Dorazio RM, Martin J. Edwards HH. Estimating abundance while accounting for rarity, correlatedbehavior, and other sources of variation in counts. Ecology. 2013;94:1472–1478. doi: 10.1890/12-1365.1. [DOI] [PubMed] [Google Scholar]

[b6] Fiske I, Chandler RB. Journal of Statistical Software. 2011;43:1–23. doi: 10.18637/jss.v043.i01. ).Unmarked: An R package for fitting hierarchical modelsof wildlife occurrence and abundance. [DOI] [PMC free article] [PubMed] [Google Scholar]

[b7] Graves TA, Kendall KC, Royle JA, Stetz JB. Macleod AC. Linking landscape characteristics to local grizzly bear abundanceusing multiple detection methods in a hierarchical model. Animal Conservation. 2011;14:652–664. [Google Scholar]

[b8] Guillera-Arroita G, Ridout MS. Morgan BJT. Design of occupancy studies with imperfect detection. Methods in Ecology and Evolution. 2010;1:131–139. [Google Scholar]

[b9] Guillera-Arroita G, Ridout MS. Morgan BJT. Two-stage sequential Bayesian study design forspecies occupancy estimation. Journal of Agricultural, Biological,and Environmental Statistics. 2014;19:278–291. [Google Scholar]

[b10] Guillera-Arroita G, Ridout MS, Morgan BJT. Linkie M. Models for species-detection data collected along transects in thepresence of abundance-induced heterogeneity and clustering in the detectionprocess. Methods in Ecology and Evolution. 2012;3:358–367. [Google Scholar]

[b11] Hilbe JM. Negative Binomial Regression. 2011 .New York: Cambridge University Press. [Google Scholar]

[b12] Hines JE. Program PRESENCE 4.1—Software to estimate patch occupancy andrelated parameters. 2011 .U.S. Geological Survey Patuxent Wildlife Research Center, Maryland. [Google Scholar]

[b13] Holgate P. Estimation for the bivariate Poisson distribution. Biometrika. 1964;51:241–245. [Google Scholar]

[b14] Hunt JW, Weckerly FW. Ott JR. Reliability of occupancy and binomial mixture models for estimatingabundance of golden-cheeked warblers (Setophaga chrysoparia) The Auk. 2012;129:105–114. [Google Scholar]

[b15] Johnson NL, Kotz S. Balakrishnan N. Discrete Multivariate Distributions. 1997 and. New York: Wiley. [Google Scholar]

[b16] Jost TA, Brcich RF. Zoubir AM. Estimating the parameters of the multivariate Poisson distributionusing the composite likelihood concept. The Proceedings of the 31st IEEE International Conference onAcoustics, Speech and Signal Processing. 2006 and. In Toulouse, France: IEEE. [Google Scholar]

[b17] Karlis D. An EM algorithm for multivariate Poisson distribution and relatedmodels. Journal of Applied Statistics. 2003;30:63–77. [Google Scholar]

[b18] Kéry M. Estimating abundance from bird counts: Binomial mixture modelsuncover complex covariate relationships. The Auk. 2008;125:336–345. [Google Scholar]

[b19] Kéry M, Dorazio RM, Soldaat L, Van Strien A, Zuiderwijk A. Royle JA. Trend estimation in populations with imperfect detection. Journal of Applied Ecology. 2009;46:1163–1172. [Google Scholar]

[b20] Kéry M, Royle JA. Schmid H. Modeling avian abundance from replicated counts using binomialmixture models. Ecological Applications. 2005;15:1450–1461. [Google Scholar]

[b21] Martin J, Royle JA, Mackenzie DI, Edwards HH, Kéry M. Gardner B. Accounting for non-independent detection when estimating abundance oforganisms with a Bayesian approach. Methods in Ecology and Evolution. 2011;2:595–601. [Google Scholar]

[b22] McIntyre AP, Jones JE, Lund EM, Waterstrat FT, Giovanini JN, Duke SD, Hayes MP, Quinn T. Kroll AJ. Empirical and simulation evaluations of an abundance estimator usingunmarked individuals of cryptic forest-dwelling taxa. Forest Ecology and Management. 2012;286:129–136. [Google Scholar]

[b23] R Core Team. R: A Language and Environment for Statistical Computing. 2014 .R Foundation for Statistical Computing, Vienna, Austria. [Google Scholar]

[b24] Royle JA. N-mixture models for estimating population size fromspatially replicated counts. Biometrics. 2004a;60:108–115. doi: 10.1111/j.0006-341X.2004.00142.x. [DOI] [PubMed] [Google Scholar]

[b25] Royle JA. Generalized estimators of avian abundance from countsurvey data. Animal Biodiversity and Conservation. 2004b;27:375–386. [Google Scholar]

[b26] Royle JA, Nichols JD. Kéry M. Modelling occurrence and abundance of species when detection isimperfect. Oikos. 2005;110:353–359. [Google Scholar]

[b27] Sontag ED, Zeilberger D. A symbolic computation approach to a problem involving multivariatePoisson distributions. Advances in Applied Mathematics. 2010;44:359–377. [Google Scholar]

[b28] Toribio S, Gray B. Liang S. An evaluation of the Bayesian approach to fitting the N-mixturemodel for use with pseudo-replicated count data. Journal of Statistical Computation and Simulation. 2012;82:1135–1143. [Google Scholar]

[b29] Wang J-PZ, Lindsay BG. A penalized nonparametric maximum likelihood approach to speciesrichness estimation. Journal of the American Statistical Association. 2005;100:942–959. [Google Scholar]

[b30] Zellweger-Fischer J, Kéry M. Pasinelli G. Population trends of brown hares in Switzerland: The role ofland-use and ecological compensation areas. Biological Conservation. 2011;144:1364–1373. [Google Scholar]

PERMALINK

Computational Aspects of N-Mixture Models

Emily B Dennis

Byron JT Morgan

Martin S Ridout

Abstract

1. Introduction

2. The N-Mixture Model

3. Equivalence of the Poisson N-Mixture Model With a Multivariate Poisson Model

Example: T=2, Poisson Case

3.1. Multivariate Poisson Distribution

3.2. Performance of the Multivariate Poisson Model

Figure 1.

Table 1.

4. Explicit Form for the Bivariate Negative-Binomial Case

5. The Effect of the Choice of K on Fitting the N-Mixture Model: Poisson Case

5.1. Incorrect Estimates due to the Choice of K

Figure 2.

5.2. Automatic Choice of K

6. Moment Estimation for a Mixed-Poisson N-Mixture Model

6.1. Moment Estimation

6.2. Performance of the Multivariate Negative-Binomial Model

Table 2.

Figure 3.

7. Application to Hermann's Tortoise Data

8. Discussion and Recommendations

9. Supplementary Materials

Acknowledgments

Appendix

Appendix

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Computational Aspects of N-Mixture Models

Emily B Dennis

Byron JT Morgan

Martin S Ridout

Abstract

1. Introduction

2. The N-Mixture Model

3. Equivalence of the Poisson N-Mixture Model With a Multivariate Poisson Model

Example: T=2, Poisson Case

3.1. Multivariate Poisson Distribution

3.2. Performance of the Multivariate Poisson Model

Figure 1.

Table 1.

4. Explicit Form for the Bivariate Negative-Binomial Case

5. The Effect of the Choice of K on Fitting the N-Mixture Model: Poisson Case

5.1. Incorrect Estimates due to the Choice of K

Figure 2.

5.2. Automatic Choice of K

6. Moment Estimation for a Mixed-Poisson N-Mixture Model

6.1. Moment Estimation

6.2. Performance of the Multivariate Negative-Binomial Model

Table 2.

Figure 3.

7. Application to Hermann's Tortoise Data

8. Discussion and Recommendations

9. Supplementary Materials

Acknowledgments

Appendix

Appendix

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases