Skip to main content
Wiley Open Access Collection logoLink to Wiley Open Access Collection
. 2019 Apr 25;75(3):799–809. doi: 10.1111/biom.13058

Improved methods for estimating abundance and related demographic parameters from mark‐resight data

Brett T McClintock 1,, Gary C White 2, Moira A Pryde 3
PMCID: PMC6850357  PMID: 31004437

Abstract

Over the past decade, there has been much methodological development for the estimation of abundance and related demographic parameters using mark‐resight data. Often viewed as a less‐invasive and less‐expensive alternative to conventional mark recapture, mark‐resight methods jointly model marked individual encounters and counts of unmarked individuals, and recent extensions accommodate common challenges associated with imperfect detection. When these challenges include both individual detection heterogeneity and an unknown marked sample size, we demonstrate several deficiencies associated with the most widely used mark‐resight models currently implemented in the popular capture‐recapture freeware Program MARK. We propose a composite likelihood solution based on a zero‐inflated Poisson log‐normal model and find the performance of this new estimator to be superior in terms of bias and confidence interval coverage. Under Pollock's robust design, we also extend the models to accommodate individual‐level random effects across sampling occasions as a potentially more realistic alternative to models that assume independence. As a motivating example, we revisit a previous analysis of mark‐resight data for the New Zealand Robin (Petroica australis) and compare inferences from the proposed estimators. For the all‐too‐common situation where encounter rates are low, individual detection heterogeneity is non‐negligible, and the number of marked individuals is unknown, we recommend practitioners use the zero‐inflated Poisson log‐normal mark‐resight estimator as now implemented in Program MARK.

Keywords: capture‐recapture, capture resight, population size, Program NOREMARK, survival

1. INTRODUCTION

Mark‐resight methods are commonly used to estimate abundance and related demographic parameters for animal populations (eg, White and Shenk, 2001; McClintock and White, 2012). They are widely applicable across diverse taxa, including terrestrial mammals (eg, Keech et al., 2011), marine mammals (eg, Ryan et al., 2011), and birds (eg, Lyons et al., 2016). Mark‐resight methods are most often utilized when some segment of the population is marked, and non‐invasive sighting data can be simultaneously collected for both marked and unmarked individuals. Non‐invasive sighting data can include those arising from ground (eg, McClintock and White, 2007), aerial (eg, Keech et al., 2011), and camera (eg, Alonso et al., 2015) surveys. Because mark‐resight does not require repeated physical capture of individuals, the main advantage of this approach is that costs associated with marking and recapturing can be minimized. In many circumstances, it can therefore be a less‐invasive and less‐expensive alternative to traditional mark recapture for population monitoring. Relative to capture recapture, mark‐resight abundance estimators also tend to be more robust to individual heterogeneity in detection probability because marked individuals are selected independent of the sighting process (ie, “all‐zero” detection histories are observed).

Mark‐resight estimators have seen much development over the past decade (McClintock and White, 2012; Matechou et al., 2013; Alonso et al., 2015; Lyons et al., 2016), and the models that are currently implemented in the freeware Program MARK (White and Burnham, 1999) are widely used by practitioners. Because it does not require knowledge of the exact number of marked individuals in the population, one of the most popular mark‐resight models for estimating abundance and survival parameters is the zero‐truncated Poisson log‐normal estimator (ztPNE; McClintock et al., 2009; McClintock & White, 2009). Here, we identify several potential deficiencies of the ztPNE currently implemented in Program MARK. Among them are its failure to utilize valuable information about the number of marks ever released into the population and, under Pollock's robust design, its conditioning on first sighting instead of first encounter (eg., capture). The ztPNE in Program MARK also relies on an ad‐hoc adjustment term to account for marks that are sighted but not identified to individual identity (see McClintock et al., 2014). Lastly, while the ztPNE implemented in Program MARK can include individual‐level random effects in detection probability, these individual heterogeneity effects are (perhaps unrealistically) assumed to be independent across primary sampling periods.

As a solution to these potential deficiencies of ztPNE, we propose the zero‐inflated Poisson log‐normal estimator (ziPNE). By allowing for zero inflation in the marked individual sighting data, ziPNE can incorporate all marked individuals ever released into the population and thus does not need to condition on first sighting. We also extend the likelihoods for these models to accommodate unidentified marks and individual‐level random effects across primary sampling occasions. After describing ziPNE in the next section, we compare the performance of ziPNE relative to ztPNE under a broad range of simulated sampling scenarios. We then revisit the New Zealand Robin (Petroica australis) analysis of McClintock and White (2009) and compare inferences from the various estimators.

2. METHODS

There are several different study designs for the mark‐resight models in Program MARK (McClintock and White, 2012). The mark‐resight scenario under consideration consists of T sampling occasions where sightings of both marked and unmarked individuals are recorded. The observable population size (Nt) is defined as the number of individuals that are available to be observed (eg, alive and within the study area) during sighting survey t=1,,T. The population is assumed to be geographically and demographically closed within each survey, but not between surveys. To account for temporary emigration off the study area (or any other process that causes a segment of the population to be temporarily unobservable), we define the unobservable population size as the number of individuals that were alive during survey t but had temporarily emigrated off the study area between times t1 and t.

During each survey, there are nt{0,,Mt} marked individuals and Ut=Ntnt unmarked individuals in the observable population, where Mt=k=1tRk is the total number of marked individuals ever released into the population prior to survey t, and Rt is the number of (unmarked) individuals that were newly marked between times t1 and t. The mark‐resight data therefore consist of the number of resightings for each marked individual (xs,t{0,1,2,} for sMt) and the total number of unmarked individual sightings (ut=sMtxs,t), where Mt is the set of Mt marked individuals ever released into the population prior to survey t. In the common circumstance where the identity of an individual detected as marked cannot always be determined (McClintock et al., 2014), then the observed data for the marked segment of the population consist of ys,t=xs,tϵs,t for sMt and the total number of marked sightings that were not individually identified (et=sMtϵs,t). The number of marks alive and within the study area can sometimes be reasonably assumed to be known exactly (eg, if all nt marks are introduced immediately prior to sighting survey t), but more commonly nt is unknown due to attrition of marks over time from mortality and permanent emigration.

When nt is unknown for at least one sampling occasion, the zero‐truncated Poisson log‐normal estimator (ztPNE, McClintock et al., 2009; McClintock and White, 2009) can be used to estimate abundance (Nt), apparent survival from time t to time t+1(ϕt), the transition probability from an observable state (eg. in the study area) at time t to an unobservable state (eg, temporarily off the study area) at time t+1(γt), and the transition probability from an unobservable state at time t to an unobservable state at time t+1(γt). However, the ztPNE does not capitalize on all of the available information afforded by this mark‐resight study design. Notably, the model does not incorporate Mt into estimation of detection probability parameters, and, under Pollock's robust design (Kendall et al., 1997; McClintock and White, 2009), it conditions on the first sighting of each marked individual (instead of first encounter; eg, capture) for estimating the open population parameters (ϕ,γ, and γ).

To make better use of the available information, we propose the zero‐inflated Poisson log‐normal estimator (ziPNE). The assumptions of ziPNE are identical to the standard mark‐resight assumptions of ztPNE (McClintock et al., 2009; McClintock and White, 2009). Unmodeled individual sighting heterogeneity is well known to induce bias in abundance estimators, and we develop two ziPNE formulations based on whether or not individual‐level effects for sighting rates are assumed independent (“within” heterogeneity sensu McClintock et al., 2009) or not (“across” heterogeneity sensu Gimenez and Choquet, 2010) between sampling occasions.

Assuming the latent xs,t sightings follow a Poisson log‐normal distribution, we have xs,tλs,t~Poisson(λs,t),ys,tλs,t,rt~Poisson(λs,trt), and ϵs,tλs,t,rt~Poisson(λs,t(1rt)), where λs,t=exp(αt+σtzs,t) and rt is the conditional probability that a mark is individually identified (given detected as marked). The individual sighting rate (λs,t) includes a (log‐scale) mean (αt), standard deviation (σt), and latent individual‐level effect zs,t~N(0,1). When the latent individual‐level effects include time dependence, we refer to this as the “within” heterogeneity model. For an “across” heterogeneity model, we simply remove time dependence from the individual effects: λs,t=exp(αt+σtzs), where zs~N(0,1).

Under this scenario, likelihood‐based estimation of Nt and related demographic parameters is challenging for several reasons. The individual‐level effects clearly induce dependencies between Y=(ys,t) and e=(e1,,eT), and, for the “across” heterogeneity models, within Y,e, and u=(u1,,uT). A further complication for the “across” heterogeneity models is that when Rt>0 for t{2,,T}, then ys,t:T,et:T, and u1:t1 may not be independent for sMt1. One possible solution is to formulate a complete data likelihood and use data augmentation (eg, McClintock et al., 2014; Rankin et al., 2016), but such approaches can be time‐consuming and difficult to fit (see Web Appendix A). We instead adopt a practical perspective and focus our efforts on a composite likelihood (eg, Varin et al., 2011) that is amenable to maximum likelihood estimation and implementation in Program MARK. Web Table B1 includes definitions for all parameters, latent variables, and random variables in our composite likelihood.

To make the problem tractable, we propose to ignore any covariance between Y,e, and u and simply integrate over the latent Z for each of three components of a composite observed data likelihood as if they were independent:

Y,u,eU,α,σ,r,w,g,ϕ,γ,γ=s=1MTzsyszs,α,σ,r,w,g,ϕ,γ,γ[zs]×ZeZ,α,σ,r[Z]×ZuZ,U,α,σ[Z], (1)

where bracket notation [dataparameter] denotes a probability density (or mass) function. We can account for zero inflation in ys by assuming marked individuals are in one of three states at time t (1 = dead or permanent emigrant; 2 = alive, not a permanent emigrant, off the study area; 3 = alive, not a permanent emigrant, on the study area). Let wt=[1wt,wt(1gt),wtgt] be a row vector of initial state probabilities, where wt is the probability that a randomly selected individual from the Rt new marks introduced between sighting occasion t1 and t was alive and not a permanent emigrant during sighting occasion t, and gt is the conditional probability that this individual was within the study area (given alive and not a permanent emigrant). Let Ps,t be a 3×3 matrix where the diagonal elements denote the probability mass function of ys,t conditional on the state of individual s at time t, and let Gt be a 3×3 state transition probability matrix with rows corresponding to the state at time t and the columns corresponding to the state at time t+1. We then have

yszs,α,σ,r,w,g,ϕ,γ,γ=wbsPs,bsi=bsT1GiPs,i+113
Ps,t=1I(ys,t>0)0001I(ys,t>0)000ps,t,
Gt=1001ϕtϕtγtϕt(1γt)1ϕtϕtγtϕt(1γt),

where bs{1,,T} is the first sighting occasion for which individual s was marked (and thus potentially available for resighting), I() is the indicator function, 13=[1,1,1]T, and ps,t=[ys,tλs,t,rt]. By allowing ys,t for the Rt newly marked individuals to be zero inflated (with probability 1wtgt), we can include MT and condition on first encounter (instead of first sighting) in our composite likelihood.

For both the “within” and “across” models, we wish to integrate over the latent variables zs,t and zs, respectively, in Equation (1). While this is straightforward for [yszs,α,σ,r,w,g,ϕ,γ,γ],[uU,α,σ]=Z[uZ,U,α,σ][Z] and [eα,σ,r]=Z[eZ,α,σ,r][Z] pose difficulties because sums of Poisson log‐normal random variables are not of standard form (eg, Poisson) and they involve multidimensional integrals that are not amenable to numerical approximation. Similar to ztPNE, we rely on a left‐truncated normal distribution for [uU,α,σ] based on the expected value λt=Ez(xs,tλs,t)=E(xs,tλs,t)[z]dz=exp(αt+σtz)[z]dz=exp(αt+σt22) and variance ηt=varz(xs,tλs,t)=Ez(var(xs,tλs,t))+varz(E(xs,tλs,t))=λt+exp(2αt)(exp(2σt2)exp(σt2)) of a Poisson log‐normal random variable after integrating over the individual‐level effects:

[uU,α,σ]=t=1Tut*Ut,αt,σt

where ut*Ut,αt,σt~TN(0,)(Utλt,Utηt),ut*=ut is a non‐negative continuous random variable, and TN(a,b) represents a normal distribution truncated at a and b. Note that for the “across” heterogeneity model, this composite likelihood also ignores the covariance induced in u by the latent individual‐level effects.

We employ a similar left‐truncated normal distribution for [eα,σ,r]:

[eα,σ,r]=t=1Tet*αt,σt,rt (2)

where et*αt,σt,rt~TN(0,)(ntλt(1rt),ntκt),κt,λt(1rt)+exp(2αt)(1rt)2(exp(2σt2)exp(σt2)), and et*=et is a non‐negative continuous random variable. While n1 can sometimes be reasonably assumed to be known exactly (ie, n1=M1 and w1=g1=1), for all occasions where the number of marks alive and within the study area is not known exactly, we replace nt in Equation (2) with a Horvitz‐Thompson‐type estimator: n^t=sMtI(ys,t>0)pt1, where pt=1exp(exp(αt+σtz)rt)[z]dz is the probability of a marked individual being resighted at least once.

We therefore have the following composite likelihood for the “within” heterogeneity ziPNE:

Y,u,eU,α,σ,r,w,g,ϕ,γ,γ=s=1MTysα,σ,r,w,g,ϕ,γ,γ×t=1Tet*αt,σt,rtut*Ut,αt,σt, (3)

where ps,t=[ys,tλs,t,rt][zs,t]dzs,t and λs,t=exp(αt+σtzs,t). For the “across” heterogeneity ziPNE:

Y,u,eU,α,σ,r,w,g,ϕ,γ,γ=s=1MTyszs,α,σ,r,w,g,ϕ,γ,γ[zs]dzs×t=1Tet*αt,σt,rtut*Ut,αt,σt, (4)

where ps,t=[ys,tλs,t,rt] and λs,t=exp(αt+σtzs). For maximum likelihood analysis, the integrals in Equations (3) and (4) can be easily approximated using Gauss‐Hermite quadrature:

ps,t=[ys,tλs,t,rtzs,tdzs,t1πl=1Lfl[ys,tλl,t*,rt],

and

yszs,α,σ,r,w,g,ϕ,γ,γzsdzs1πl=1LflwbsPl,bs*i=bsT1GiPl,i+1*13,

where

Pl,t*=1I(ys,t>0)0001I(ys,t>0)000ys,tλl,t*,rt,

ys,tλl,t*,rt~Poisson(λl,t*rt),λl,t*=exp(αt+2σtvl), and (fl,vl) are the corresponding weights and nodes, respectively, for quadrature points l=1,,L (Abramowitz and Stegun, 1964). When nt is unknown, we have pt11πl=1Lflexp(λl,t*rt).

The size of the observable population (eg, alive and within the study area) during sighting survey t(Nt=Ut+nt) can be derived from the estimated parameters after fitting Equations (3) or (4) using maximum likelihood methods. As with all derived parameters in Program MARK, standard errors for Nt are estimated using the delta method and finite‐difference approximations of the first derivative. Program MARK also facilitates modeling any of the parameters as a function of covariates using link functions (see Section 4). We note that in a fully time‐dependent model for wt and gt(t=1,,T), only the product wTgT is estimable. When T=1, only the product w1g1 is estimable. We also note that because the “within” and “across” heterogeneity models use the same data, they can be compared using model selection criteria such as Akaike's Information Criterion (AIC; Burnham and Anderson, 2002).

3. SIMULATION STUDY

Using data generated from the exact model, we performed a series of simulation experiments to evaluate the performance of our composite ziPNE relative to the ztPNE as implemented in Program MARK (McClintock and White, 2012. Each simulated scenario included T=7,ϕt=0.95,γt=γt=0.1,wt=0.95,gt=0.9,rt{0.5,0.9},E(R1){60,120},E(U1){300,600},λt{1.5,5,100}, and τ{0,0.5,1,5}, where varz(xs,tλs,t)=λt(1+τ), such that αt=2log(λt)log(λt2+λtτ)2 and σt=log(λt2+λtτ)2log(λt) for t=1,,T. For Rt(t=2,,T), some of the Ut1 unmarked individuals became newly marked (with probability 0.02) in order to maintain a somewhat consistent marked population size over time. However, no additional unmarked individuals were introduced to the population after the first occasion, so Nt is expected to steadily decline through time due to mortality and permanent emigration. Scenarios with λt=100 represent the “best case” scenario where sighting rates are very high and individuals that are alive and within the study area are virtually guaranteed to be sighted (i.e. xs,t>0). Under this scenario, we would expect ziPNE and ztPNE to perform similarly. Scenarios with both λt=1.5 and τ=5 represent the pathological case where mean sighting rates are low and individual heterogeneity is high, whereby the vast majority of individuals are never expected to be sighted (xs,t=0) but some individuals will be sighted many times (xs,t0). In terms of α,σ, and N estimation, we would generally expect all estimators to perform less than nominally under this extreme scenario (particularly when rt=0.5). While we would not expect ϕ estimators to be biased by extreme heterogeneity coupled with low mean sighting rates, we suspect it could also be deleterious to reliable estimation of γ and γ.

For each combination, we simulated data from both the “within” and “across” heterogeneity models, thus yielding 2×2×2×2×3×4=192 simulated scenarios. For each scenario, we fit the two models corresponding to the data‐generating heterogeneity model: 1) ziPNE within heterogeneity Equation (3) and ztPNE within heterogeneity; or 2) ziPNE across heterogeneity Equation (4) and ztPNE across heterogeneity. We investigated heterogeneity model mis‐mispecification in an additional series of simulation experiments (see Web Appendix E). We assumed that n1 was known, nt(t=2,,T) was unknown, and M1=n1. For scenarios with τ=0, we fixed σt=0 for each of the fitted models. For simplicity, we also fixed w1=g1=1 (because n1 is known and n1=M1) and otherwise assumed no time dependence in parameters (eg, ϕ1=ϕ2==ϕT1). All simulations used L=101 quadrature points for numerical integration. We evaluated estimator performance based on bias, 95% confidence interval coverage of the true parameter values, and root mean squared error (RMSE). Simulation results are presented using the nested loop plot of Rücker and Schwarzer (2014). Similar to a time series plot, nested loop plots serve to present a large number of simulation results by putting all scenarios into a lexicographical order and arranging them consecutively along the horizontal axis. The quantity of interest (eg, bias, coverage, and RMSE) is then plotted on the vertical axis.

We found the “within” heterogeneity ziPNE to be far superior to ztPNE for the estimation of the closed population parameters (N,α, and σ), particularly as λt decreased, τ increased, and r decreased. In nearly all of the scenarios, ziPNE tended to be less biased with higher 95% confidence interval coverage and lower RMSE (Figure 1, Web Figures C1–C4). Although it performed much better than ztPNE, ziPNE still exhibited an increase in bias and decrease in coverage under the pathological scenarios with low mean resighting rates and extreme heterogeneity. While little difference was found between ziPNE and ztPNE in terms of apparent survival (ϕ) estimation (Figure 2), ziPNE performed much better than ztPNE in the estimation of both γ and γ (Web Figures C5‐C8).

Figure 1.

Figure 1

Nested loop plots of percent relative bias (top panel) and 95% confidence interval coverage (bottom panel) of abundance (Nt,t=1,,7) estimators based on 192 simulated scenarios. Scenarios are ordered from outer to inner loops by λt{100,5,1.5} (“lambda”; three levels, decreasing), τ{0,0.5,1,5} (“tau”; four levels, increasing), r{0.9,0.5} (two levels, decreasing), E(R1){120,60} (“Number marked”; two levels, decreasing), and E(U1){600,300} (“Number unmarked”; two levels, decreasing) such that the left‐ and right‐most scenarios are the best‐ and worst‐case sampling scenarios, respectively. Estimators include the zero‐inflated Poisson log‐normal estmator (dark orange = ziPNE within, light orange = ziPNE across) and the zero‐truncated Poisson log‐normal estimator (dark purple = ztPNE within, light purple = ztPNE across). Results are based on 400 simulated data sets for each scenario. This figure appears in color in the electronic version of this article, and color refers to that version

Figure 2.

Figure 2

Nested loop plots of absolute bias (top panel) and 95% confidence interval coverage (bottom panel) of apparent survival (ϕ) estimators based on 192 simulated scenarios. Scenarios are ordered from outer to inner loops by λt{100,5,1.5} (“lambda”; three levels, decreasing), τ{0,0.5,1,5} (“tau”; four levels, increasing), r{0.9,0.5} (two levels, decreasing), E(R1){120,60} (“Number marked”; two levels, decreasing), and E(U1){600,300} (“Number unmarked”; two levels, decreasing) such that the left‐ and right‐most scenarios are the best‐ and worst‐case sampling scenarios, respectively. Estimators include the zero‐inflated Poisson log‐normal estmator (dark orange = ziPNE within, light orange = ziPNE across) and the zero‐truncated Poisson log‐normal estimator (dark purple = ztPNE within, light purple = ztPNE across). Results are based on 400 simulated data sets for each scenario. This figure appears in color in the electronic version of this article, and color refers to that version

We did not find such drastic differences in the performance of the “across” heterogeneity ziPNE relative to ztPNE, but ziPNE did tend to perform better in the estimation of the closed population parameters (Figure 1, Web Figures C2‐C4). While the “across” ztPNE tended to exhibit slightly less bias and lower RMSE for N in the most extreme scenarios, the “across” ziPNE tended to yield higher 95% confidence interval coverage (particularly when rt=0.5; Figure 1, Web Figure C1). Nevertheless, coverage of N still tended to be less than nominal when τ>0 and rt=0.5 for all λt. For the open parameters (ϕ,γ,γ), we found that both ziPNE and ztPNE performed equally well in terms of bias and confidence interval coverage (Figure 2), but RMSE for ϕ and γ in particular tended to be larger for ztPNE (Web Figures C5‐C8).

4. EXAMPLE: NEW ZEALAND ROBIN

As an example application, we return to the New Zealand Robin (P. australis) data originally analyzed using ztPNE by McClintock and White (2009). This mark‐resight study was conducted by the New Zealand Department of Conservation on three 100 ha study areas (“Knobs Flat”, “Smithy”, and “Walker Creek”) in Fiordland National Park, New Zealand. Between September 2003 and August 2007, capture and banding of both juvenile and adult birds occurred continuously throughout the breeding season (August‐March) and intermittently prior to sighting surveys conducted in March (postbreeding) and August (prebreeding) of each year from 2005 to 2007. Because marked birds could have died or permanently emigrated during the extended capture period, the exact number of marked individuals in each population was unknown for every sighting occasion. For each occasion, the observable population for abundance estimation consisted of all juvenile and adult Robins within a study area. Sighting surveys did not begin in Smithy until August 2005, so there were T=6 sighting occasions for both Knobs Flat and Walker Creek, but only T=5 occasions for Smithy. Notably, ship rat (Rattus rattus) control was conducted on the Walker Creek study area from June 2006 to April 2007. Complete details of the study design and data can be found in McClintock and White (2009).

We performed a ziPNE analysis for comparison to the “within” ztPNE analysis of McClintock and White (2009) using Program MARK. While the original ztPNE analysis only included detection histories for sMTI(t=1Tys,t>0)=149 marked birds that were sighted at least once between March 2005 and August 2007, the ziPNE analysis includes detection histories for all MT=439 marked birds that were released into the populations prior to the August 2007 sighting occasion. Covariates examined for sighting rate and open population parameters included age class (age), sex, age class at time of capture (agecap), study area (area), time, breeding season (season), and rat control on Walker Creek. As in McClintock and White (2009), we also included an interaction between age, sex, and breeding season (age*sex*season) to investigate potential effects of reproduction on adult survival. For model selection, we used AIC adjusted for small sample sizes (AICc; Burnham and Anderson, 2002). Three covariate models for rat control on Walker Creek were examined: (1) an immediate and constant effect during the entire program (rat); (2) an immediate effect with no effect thereafter (rat1); and (3) no immediate effect with a delayed effect thereafter (rat2). We also investigated models under completely random emigration, γ=γ (Kendall et al., 1997), and models ignoring individual heterogeneity (σ=0) or temporary emigration (γ=γ=0).

Similar to McClintock and White (2009), we first investigated various models for the sighting rate parameters (α,σ, and r) under the most general (but identifiable) time‐ and group‐dependent structure for U and the open parameters (ϕ,γ, and γ). We proceeded by selecting the model structure for α,σ, and r best supported by AICc and then using only this structure for investigating more parsimonious models for ϕ,γ, and γ (for an evaluation of a similar model selection procedure, see Doherty et al., 2012). We then fit all combinations of the three best supported structures for each parameter based on each stage of this selection process. We assumed the same structure for w and ϕ and the same structure for g,γ, and γ. However, because they pertain to the entire capture period beginning in September 2003, we also investigated w1 and g1 effects (cap1) for the Knobs Flat and Walker Creek study areas, and w2 and g2 effects (cap2) for the Smithy study area. Unlike the “within” ztPNE analysis of McClintock and White (2009), we investigated both “within” and “across” heterogeneity models, and we again note that, unlike the ziPNE proposed here, the ztPNE of McClintock and White (2009) does not include the r parameter and instead relies on an ad‐hoc adjustment term for unidentified marks. When fitting the models, we found the Markovian temporary emigration models including additive age, agecap, or area effects to have suspect convergence and thus excluded these from our model set. We based our comparisons on model‐averaged parameter estimates and unconditional variances calculated from AICc weights.

AICc overwhelmingly supported “across” heterogeneity models for ziPNE, with the minimum AICc model including: agecap, sex, season, and area effects on ϕ and w; cap1 and cap2 effects on w; agecap and season effects on random emigration (ie, γ=γ) and g; age, sex, and rat effects on α; sex effects on σ; and area by time interaction effects on r. Model selection was somewhat similar for the “within” ztPNE model of McClintock and White (2009), but the ziPNE analysis exhibited considerably more model selection uncertainty (Web Table D1) and weak evidence of rat control effects on ϕ.

For ziPNE, model‐averaged mean sighting rates (λt) were the lowest in August 2007 (2.2, 95% CI: 1.92.5) and the highest on Walker Creek during rat control in March 2007 (3.2, 95% CI: 2.64). For the “across” ziPNE models, model‐averaged σ=0.52 (SE =0.18) for females, and σ=0.28 (SE =0.07) for males. Individual mark identification probabilities (r) varied by study area and time but were generally high, with model‐averaged estimates ranging from 0.81 (SE =0.07) to 1.00 (SE =0.00). Point and variance estimates for Nt tended to be larger for ziPNE, and, except for Walker Creek in August 2006, all estimated 95% confidence intervals for Nt were greater than the minimum number known alive based on territory mapping that was independent of the mark‐resight methodology (Figure 3). As with the original ztPNE analysis, the ziPNE analysis found evidence of higher apparent survival probabilities for individuals that were first captured as adults (Figure 4) when compared to individuals that were first captured as juveniles (Web Figure D1), but ziPNE exhibited less seasonal variation in adult female apparent survival relative to ztPNE. Temporary emigration parameter (γ and γ) point estimates were similar between ziPNE and ztPNE, but ziPNE was generally more precise (Web Figures D2‐D4). Estimates for w after longer capture intervals tended to be the lowest, particularly for individuals that were first captured as juveniles. Estimates for g were generally high and tended to be somewhat higher for juveniles (Web Figures D5–D6).

Figure 3.

Figure 3

Mark‐resight abundance (N) estimates and 95% confidence intervals between March 2005 and August 2007 for three New Zealand Robin (Petroica australis) populations (dark orange = Knobs Flat, light orange = Smithy, and dark purple = Walker Creek) in Fiordland National Park, New Zealand. Estimates are AICc model averaged based on the zero‐inflated model (ziPNE; solid lines) and the zero‐truncated model (ztPNE; dashed lines). Crosses (×) indicate the minimum number known alive based on territory mapping that was independent of the mark‐resight methodology. Vertical dotted lines indicate a period of rat control on the Walker Creek study area. This figure appears in color in the electronic version of this article, and color refers to that version

Figure 4.

Figure 4

Model‐averaged four‐month apparent survival (ϕ) estimates (±SE) between March 2005 and August 2007 for three New Zealand Robin populations (dark orange = Knobs Flat, light orange = Smithy, and dark purple = Walker Creek) in Fiordland National Park, New Zealand. Estimates are for males (top panel) and females (bottom panel) that were first captured as adults based on the zero‐inflated model (ziPNE; solid lines) and the zero‐truncated model (ztPNE; dashed lines). The longer intervals between August and March encompass the breeding season. Vertical dotted lines indicate a period of rat control on the Walker Creek study area. This figure appears in color in the electronic version of this article, and color refers to that version

5. DISCUSSION

We have proposed a new zero‐inflated version of the Poisson log‐normal mark‐resight estimator in Program MARK. Using simulation, we demonstrated ziPNE has superior performance relative to the zero‐truncated model of McClintock and White (2009). By accounting for zero inflation in the marked individual sighting data, ziPNE is able to utilize valuable information about detection rates provided by marked individuals that were released but never subsequently sighted. This not only makes ziPNE more robust to individual heterogeneity for reliable estimation of population size but also allows the likelihood for the open population parameters (ϕ,γ,γ) to condition on first encounter (instead of first sighting). ziPNE thus makes better use of available data and, unlike ztPNE, allows γ1 to be estimated under Pollock's robust design.

In terms of abundance estimation, previous simulation studies also found the “within” heterogenity zero‐truncated model of McClintock and White (2009) to perform increasingly poorly with low sighting rates, high levels of individual heterogenity, and low‐marked individual identification probabilities (McClintock et al., 2009; McClintock and White, 2009; McClintock et al., 2014). Our extensive simulations included a broader range of more extreme scenarios, and the new ziPNE performed near nominally and better than ztPNE in almost every scenario examined. However, the average coverage of N could still be as low as 81% for the “across” ziPNE when high levels of individual heterogeneity were combined with low mark identification probabilities. While we have not encountered actual mark‐resight data as information poor as some of the scenarios included in our simulations, these pathological sampling scenarios provide valuable insights about the limitations of our composite likelihoods. Practitioners should carefully consider study designs that increase sighting rates, decrease individual heterogeneity, and increase marked individual identification whenever possible.

Unless changes in individual behavior (or sighting survey design) effectively eliminates correlations in the marked individual sighting data, the “across” model may often be a more natural choice than the “within” model. Indeed, our New Zealand Robin example demonstrated overwhelming support for the “across” models and thus strong correlations in the individual sighting data across sighting surveys. It deserves note that both the “across” and “within” models tended to perform poorly under heterogeneity model mis‐specification (see Web Appendix E). Regardless of the model (ztPNE or ziPNE), it is therefore important to examine both “within” and “across” heterogeneity models. Fortunately, these models can be compared using standard model selection criteria such as AIC and Bayesian information criterion (BIC). In some cases, it may be desirable to simultaneously model both “within” and “across” heterogeneity; this extension remains an avenue for future research.

Our ziPNE is a composite likelihood that ignores some of the dependencies induced by individual heterogeneity among Y,e, and u in order to facilitate maximum likelihood estimation in Program MARK. While this composite likelihood proved robust under a broad range of simulation scenarios, an alternative (exact) solution can be implemented using a data‐augmented complete data likelihood in a Bayesian framework (see Web Appendix A). While relatively easy to code using standard software, poor mixing can cause such implementations to be time consuming and challenging to fit. Thus, while conceptually appealing, they can often be impractical for simulation studies or real‐world application (particularly for large T or N). To aid practitioners in determining whether or not the performance of ziPNE can be expected to meet their specific objectives under sampling scenarios not covered in our simulations, we provide R code for simulating data from the exact model and fitting ziPNE in the Supporting Information.

We recommend practitioners use ziPNE for mark‐resight analyses in Program MARK whenever the number of marks in the population is unknown due to attrition arising from mortality and permanent emigration. As demonstrated in our New Zealand Robin example, we also recommend previous ztPNE analyses be revisited with ziPNE where appropriate. Program MARK includes both individual heterogeneity model formulations (“within” and “across”), and has many additional analysis features, such as covariate modeling via link functions, model selection using AIC or BIC, multimodel inference, and plotting. MARK also provides the ability to simulate data under the mark‐resight models, a useful feature for designing studies. For advanced users, these mark‐resight models can also be implemented through the R package RMark (Laake, 2013) interface for Program MARK.

Supporting information

Supplementary Information

Supplementary Information

Supplementary Information

ACKNOWLEDGMENTS

The findings and conclusions in the manuscript are those of the author(s) and do not necessarily represent the views of the National Marine Fisheries Service, NOAA. Any use of trade, product, or firm names does not imply an endorsement by the US Government.

McClintock BT, White GC, Pryde MA. Improved methods for estimating abundance and related demographic parameters from mark‐resight data. Biometrics. 2019;75:799‐809. 10.1002/jcd.21657

References

REFERENCES

  1. Abramowitz M. and Stegun I. (1964). Handbook of Mathematical Functions, With Formulas, Graphs and Math Tables (National Bureau of Standards Applied Math Series 55). Washington, DC: U. S. Government Printing Office. [Google Scholar]
  2. Alonso, R.S. , McClintock, B.T. , Lyren, L.M. , Boydston, E.E. and Crooks, K.R. (2015). Mark‐recapture and mark‐resight methods for estimating abundance with remote cameras: a carnivore case study. PLoS ONE, 10, e0123032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Burnham, K.P. and Anderson, D.R. (2002). Model Selection and Multimodel Inference: A Practical Information‐Theoretic Approach. New York, NY: Springer‐Verlag. [Google Scholar]
  4. Doherty, P.F. , White, G.C. and Burnham, K.P. (2012). Comparison of model building and selection strategies. Journal of Ornithology, 152, 317–323. [Google Scholar]
  5. Gimenez, O. and Choquet, R. (2010). Individual heterogeneity in studies on marked animals using numerical integration: capture‐recapture mixed models. Ecology, 91, 951–957. [DOI] [PubMed] [Google Scholar]
  6. Keech, M.A. , Lindberg, M.S. , Boertje, R.D. , Valkenburg, P. , Taras, B.D. and Boudreau, T.A. (2011). Effects of predator treatments, individual traits, and environment on moose survival in Alaska. Journal of Wildlife Management, 75, 1361–1380. [Google Scholar]
  7. Kendall, W.L. , Nichols, J.D. and Hines, J.E. (1997). Estimating temporary emigration using capture‐recapture data with Pollockas robust design. Ecology, 78, 563–578. [Google Scholar]
  8. Laake, J.L. (2013). RMark: An R Interface for Analysis of Capture‐Recapture Data with MARK. AFSC Processed Rep 2013‐01, 25p. Alaska Fish. Sci. Cent., NOAA, Natl. Mar. Fish. Serv., 7600 Sand Point Way NE, Seattle WA 98115.
  9. Lyons, J.E. , Kendall, W.L. , Royle, J.A. , Converse, S.J. , Andres, B.A. and Buchanan, J.B. (2016). Population size and stopover duration estimation using mark‐resight data and Bayesian analysis of a superpopulation model. Biometrics, 72, 262–271. [DOI] [PubMed] [Google Scholar]
  10. Matechou, E. , Morgan, B. J. T. , Pledger, S. , Collazo, J.A. and Lyons, J.E. (2013). Integrated analysis of capture‐recapture‐resighting data and counts of unmarked birds at stop‐over sites. Journal of Agricultural, Biological, and Environmental Statistics, 18, 120–135. [Google Scholar]
  11. McClintock, B.T. , Hill, J.M. , Fritz, L. , Chumbley, K. , Luxa, K. and Diefenbach, D.R. (2014). Mark‐resight abundance estimation under incomplete identification of marked individuals. Methods in Ecology and Evolution, 5, 1294–1304. [Google Scholar]
  12. McClintock, B.T. and White, G.C. (2007). Bighorn sheep abundance following a suspected pneumonia epidemic in Rocky Mountain National Park. Journal of Wildlife Management, 71, 183–189. [Google Scholar]
  13. McClintock, B.T. and White, G.C. (2009). A less field‐intensive robust design for estimating demographic parameters with mark‐resight data. Ecology, 90, 313–320. [DOI] [PubMed] [Google Scholar]
  14. McClintock, B.T. and White, G.C. (2012). From NOREMARK to MARK: Software for estimating demographic parameters using mark‐resight methodology. Journal of Ornithology, 152, 641–650. [Google Scholar]
  15. McClintock, B.T. , White, G.C. , Antolin, M.F. and Tripp, D.W. (2009) Estimating abundance using mark‐resight when sampling is with replacement or the number of marked individuals is unknown. Biometrics, 65, 237–246. [DOI] [PubMed] [Google Scholar]
  16. Rankin, R.W. , Nicholson, K.E. , Allen, S.J. , Krützen, M. , Bejder, L. and Pollock, K.H. (2016). A full‐capture hierarchical Bayesian model of Pollockas closed robust design and application to dolphins. Frontiers in Marine Science, 3, 25. [Google Scholar]
  17. Rücker, G. and Schwarzer, G. (2014). Presenting simulation results in a nested loop plot. BMC Medical Research Methodology, 14, 129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Ryan, G.E. , Dove, V. , Trujillo, F. and Doherty, P.F. (2011). Irrawaddy dolphin demography in the Mekong river: An application of mark‐resight models. Ecosphere, 2, 1–15. [Google Scholar]
  19. Varin, C. , Reid, N. and Firth, D. (2011). An overview of composite likelihood methods. Statistica Sinica, 21, 5–42. [Google Scholar]
  20. White, G.C. and Burnham, K.P. (1999). Program MARK: survival estimation from populations of marked animals. Bird Study, 46, S120–S138. [Google Scholar]
  21. White, G.C. and Shenk, T.M. (2001). Population estimation with radio‐marked animals In Millspaugh, Radio Tracking and Animal Populations. San Diego, CA: Academic Press; pp. 329–350. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Information

Supplementary Information

Supplementary Information


Articles from Biometrics are provided here courtesy of Wiley

RESOURCES