SUMMARY
Population size estimation is an important research field in biological sciences. In practice, covariates are often measured upon capture on individuals sampled from the population. However, some biological measurements, such as body weight may vary over time within a subject’s capture history. This can be treated as a population size estimation problem in the presence of covariate measurement error. We show that if the unobserved true covariate and measurement error are both normally distributed, then a naïve estimator without taking into account measurement error will under-estimate the population size. We then develop new methods to correct for the effect of measurement errors. In particular, we present a conditional score and a nonparametric corrected score approach that are both consistent for population size estimation. Importantly, the proposed approaches do not require the distribution assumption on the true covariates, furthermore the latter does not require normality assumptions on the measurement errors. This is highly relevant in biological applications, as the distribution of covariates is often non-normal or unknown. We investigate finite sample performance of the new estimators via extensive simulated studies. The methods are applied to real data from a capture–recapture study.
Keywords: Capture–recapture data, Corrected score, Errors-in-variables, Weighted partial likelihood
1. Introduction
Population size estimation is an important problem in applied research such as ecology, epidemiology and bioinformatics. To estimate the unknown size of a population, capture–recapture experiments conducted on individuals in the population are often carried out. On capture, previously uncaptured individuals are marked and the marks of previously captured individuals are noted. The resulting data, often known as capture history data, are then used for fitting capture–recapture models. Individual covariates can also be collected upon capture and used in the analysis. We are motivated by a capture–recapture study conducted at the Mai Po Nature Reserve in Hong Kong on Yellow-bellied prinia (Prinia flaviventris), which is a resident bird species in Hong Kong, and also found in Pakistan, the southern Himalayan foothills, the northeastern Indian subcontinent, and Southeast Asia. In this study, 165 distinct birds were captured out of 206 captures during the non-breeding season in 1993. In addition to the capture histories, the data set includes two covariates, wing length, and fat index, that may be related to the capture probabilities. Fat index was recorded to classify the condition of total body fat which has four levels coded from 1 to 4. However, measurements of wing length may be error-prone. Estimation of the population size without taking into account covariate measurement error may be biased.
A number of different capture–recapture methods that incorporate information in the covariates have been developed in the literature; see Huggins (1989), Chao (2001); McCrea and Morgan (2015) and Liu, Li and Qin (2017). It is well-known in the capture–recapture literature that incorporating covariates in a capture–recapture analysis can enhance inference in population size estimation (Pollock, 2002). Because the data are usually collected over a fixed number of capture occasions, traditional capture–capture methods are primarily based on zero-truncated binomial regression models. For each occasion, some individuals are captured, marked (for those first captured), recorded and released back into the population. The resulting capture history of an observed individual is a vector of binary outcomes, but cannot be a zero vector. Therefore, a zero-truncated binomial type model is naturally one of the best models to fit these data.
In this study, we consider zero-truncated Poisson regression models (Van der Heijden, Cruyff and Van Houwelinge, 2003) to deal with a different type of capture–recapture data set as follows. For some animal species, such as sperm whales and grizzly bears (Wilson and Anderson, 1995), individual capture histories consist of capture (sighting) times during an observation period. The collection process for these types of data is a realization of a counting process, and is often called a continuous-time capture–recapture model. In fact, the Poisson process is the most popular counting process used for continuous-time capture–recapture models. Under a Poisson process framework, the number of captures for each individual contains all information about the inference on the population size (Schofield et al., 2018; Liu et al., 2018; Zhang and Bonner, 2020), while event times are only useful for inference on capture intensities. It follows that the number of captures is Poisson distributed and hence, a zero-truncated Poisson regression model is more suitable in continuous-time capture–recapture models.
It is well-known that measurement error in the covariates can lead to inconsistent estimates in regression analysis (Carroll et al., 2006). For example, in simple linear regression, inconsistent estimates arise due to attenuation effects caused by measurement errors. In capture–recapture studies, population size estimates will generally yield a negative bias when measurement errors are ignored. To the best of our knowledge, while effects of measurement error in population size estimation have been observed in the capture–recapture literature, there does not exist a sensible explanation of this finding. Nevertheless, various capture–recapture methods have been developed to correct for measurement error in the covariates. These include (approximate) regression calibration (Hwang and Huang, 2003), conditional score estimation (Hwang et al., 2007), parametric likelihood (Yip et al., 2005; Xi et al., 2009; Stoklosa et al., 2011), estimating equations (Huggins and Hwang, 2010), and a semi-efficient method with multiple measurements (Xu and Ma, 2014). These methods all assume the underlying model is binomial for each subject’s capture frequency. Note that Yip et al. (2005) and Xi et al. (2009) require a parametric distributional assumption on the true underlying covariate, and Xi et al. (2009) specify the parametric distributions of the unobserved covariates and measurement errors where an expectation maximization algorithm is used to find the maximum likelihood estimates. However, because the unobserved covariates are latent variables, the expectation step involves the evaluation of many numerical integrals which incurs significant computational cost, and it may be sensitive to distributional assumptions. Interestingly, very limited literature has been developed for Poisson regression, with the exception of Hwang and Huang (2007) who considered a regression calibration approach and conditional score estimation for Poisson capture–recapture modeling.
Over the last few decades, the majority of newly developed measurement error methods require the assumption of a “surrogate condition” for correcting the effects of measurement errors. Specifically, this assumption holds if measurement errors are independent of the response and other variables. However, in regression models for zero-truncated count data, such as capture–recapture data, the number of covariate measurements for a given subject may be equal to the actual count response. This implies that the measurement error variance is related to the response variable and hence the surrogate condition is not satisfied. This is seen in the Prinia flaviventris capture–recapture data where the total number of observed wing length measurements per individual is equal to the number of times the individual is captured, thus the surrogate condition is invalid when fitting measurement error models.
To address the problem of multiple measurements within a subject, Huang et al. (2011) proposed a modified conditional score approach to estimate model parameters in zero-truncated binomial and Poisson regression models. Their estimation methods were shown to be consistent, but they were computationally demanding and they did not consider population size estimation. Furthermore, these studies require a normality assumption on the measurement error, which may be inappropriate in some circumstances. In biological applications, the distribution of the observed covariates is often non-normal or unknown, so relaxing this assumption is beneficial.
In this paper, we begin by investigating the effects of measurement errors when estimating the population size. For multiple measurements within a subject, we first develop an average conditional score estimating function (Stefanski and Carroll, 1987) to estimate regression parameters and propose a corresponding population size estimator. We then develop a corrected score approach (Nakamura, 1990). The corrected score can be viewed as an unbiased estimate of the original score function where the true covariate is known. Unfortunately, in standard zero-truncated models, such as zero-truncated Poisson and binomial models, an unbiased estimate for the original score function does not exist (Stefanski, 1989). For other truncated models, the applicability of the corrected score approach remains an open research question. Here, we propose an estimating function that is correctable in the presence of measurement error. We also derive a new corrected score type estimating function with or without a parametric distributional assumption on the measurement errors. This allows us to develop consistent population size estimators. To our knowledge, this is the first time that the corrected score method has been developed for zero-truncated regression and population size estimation.
In Section 2 we introduce some notation and explain why an unadjusted population size estimate would typically yield a downward bias in the presence of measurement error. In Section 3, we develop a conditional score estimator that does not need a distributional assumption on the covariates. However, the conditional score estimator needs a normality assumption on the measurement error. To further relax this assumption, we propose a corrected score method in Section 4. A simulation study is carried out in Section 5 in which all methods are compared under various scenarios. In Section 6 we apply the methods to the Prinia flaviventris data mentioned above. Discussion and concluding remarks are given in Section 7.
2. Models and effects of measurement error
Let i = 1, …, N denote the index of an animal within a closed population where N is the unknown population size. For i = 1, …, N, let Yi be the number of captures for the ith animal from a capture–recapture experiment. Without loss of generality, Yi is positive for i ≤ n, in which n is the number of distinct observed/captured animals in the experiment. Further, let Xi and Zi represent covariates for the ith subject where Xi is a univariate variable and Zi is a vector. Suppose that Zi is a vector of 1 and any covariates that are recorded precisely (such as sex or age group). We assume that all the covariates do not vary over time; this assumption is reasonable under a closed population setting. Instead of observing Xi, we observe the error-prone measurements Wij for some j = 1, …, mi where mi is the number of measurements for the ith subject. We consider an additive measurement error model:
| (1) |
where Uij is the measurement error which is independent of all other variables. We also assume that Uij are symmetric identically distributed with zero mean for all i and j. Let be the surrogate of Xi, and write where . In practice, the number of measurements mi may be related to Yi, such that the usual surrogate condition where the measurement error is independent of the response variable (Carroll et al., 2006) does not hold (i.e., is not independent of Yi). Generally speaking, mi is equal to the number of captures Yi.
Denote , which is a vector of parameters. We consider Yi as a Pois(λi) distributed random variable with mean for all i = 1, …, N. Since we can only observe individual covariates on captured animals, or equivalently, given the condition Yi > 0, these Yi are zero-truncated variables and have the following probability distribution:
where which is the probability of the ith individual being captured at least once. If the Xi’s are available, the log-likelihood function of θ is . Hence, the score function of θ is
| (2) |
and the maximum likelihood estimator solves . Let and be the corresponding estimators of λi(Θ) and πi(θ) with substituted for θ. To estimate the population size N, the following two weighted estimators can be used:
In the context of capture-recapture modeling, is generally referred to as the Horvitz-Thompson estimator because the denominator is the probability that the unit is ever selected in the sampling process (Alho et al., 1993). Whereas is a Hansen-Hurwitz estimator since the denominator is the expected selection on one occasion, and it is summed over all selections (Hansen and Hurwitz, 1943). Both population size estimators are consistent and asymptotically normally distributed, and they have similar finite sample performance.
We define a naïve score function Gw(θ) by replacing Xi with in (2). A naïve estimator of θ, , is then established via solving Gw(θ) = 0, and we can also similarly obtain the corresponding population size estimators. These types of naïve population size estimators generally exhibit a downward bias, as seen in the simulation studies of Hwang and Huang (2007). This kind of downward bias is also usually seen in the other measurement error capture–recapture models (Hwang and Huang, 2003; Xi et al., 2009), an explanation for this observation is that the contamination, due to measurement error, reduces the degree of heterogeneity in capture-recapture models. For example, suppose that and is very large such that Wi cannot provide information for modeling heterogeneity in capture probabilities. As a consequence, the effect of the covariate would be excluded from the model, thus the data would be analyzed via a simple homogeneous capture-recapture model where λi is a constant. It can be shown that this kind of model misspecification yields a downward bias in estimating the population size (Hwang and Huggins, 2005).
In Web Appendix A, we provide a theoretical justification for this phenomenon when both Xi and Ui are normally distributed under the Poisson regression framework. Interestingly, if we consider ordinary Poisson regression with normality assumptions on Xi and Ui, then the naïve estimator for N is consistent even though the naïve estimator for θ is not. Specifically, it estimates the covariate effect of Wi through , which is not the true β in ; see Web Appendix A. In particular, the resulting estimator of is consistent and can be used as a surrogate for λi when being used in the population size estimators stated earlier. However, this argument is incorrect in our problem because under this situation, Yi is drawn from a zero-truncated Poisson model. Moreover, we can show that the naïve method generally overestimates the average of λi which implies that the naïve population size estimators would generally result in a downward bias. See Proposition 1 in Web Appendix A.
3. Conditional score estimation
In this section, we develop a conditional score (CDS) estimator that does not need a distributional assumption on the latent variables Xi. Define for all i = 1, …, N and j = 1, …, mi. Following Stefanski and Carroll (1987) and Hwang et al. (2007), it can be showed that Δij is a sufficient and complete statistic for Xi if β is known. By sufficiency, the conditional distribution of Yi given Δij, Zi is independent of Xi and can be shown to be
| (3) |
for k = 0, 1, 2, …, where .
For capture–recapture data, we observe the data up to the individuals i ≤ n. Hence, for all i ≤ n and k = 1, 2, 3, …, we have
By plugging into Δij, the conditional score estimator for θ, denoted as , solves
| (4) |
where ϕc(θ; Yi, Zi, Δi) is the average of for j = 1, …, mi. In order to implement the above CDS approach, and can be calculated numerically. Since both terms involve an infinite summation and they do not have closed forms, we need to approximate these terms using finite sums with upper-bound restrictions. Due to the variance of a zero-truncated Poisson random variable being smaller than its expectation, we adopt a simple strategy that uses the upper bound of (Hwang et al., 2007).
Let ϕc,i denote ϕc(θ; Yi, Zi, Δi.) and notice that ϕc,i are independent over all i. This implies that (4) is a classical unbiased estimating equation, thus a sandwich variance estimate of can be applied. Specifically, the variance estimator is
| (5) |
where θ is evaluated at . Note that the above variance estimator does not account for the variability from estimating . Nevertheless, from our simulation study, we found that variability due to estimating is generally negligible, and it has almost no effect on the variance estimation.
For population size estimation, let where . We then mimic the from of and suggest to estimate N by
To estimate the variance of , by the delta method, we have
| (6) |
and this yields a variance estimator for when all parameters are evaluated at .
Remark 1: An alternative approach is to construct an estimating function based on . In fact, the use of could be more efficient, but it is valid only if mi are independent with other variables. Huang et al. (2011) called this method a naïve conditional score when mi are correlated with Yi (e.g., mi = Yi in the motivated example). They also proposed a Monte Carlo error augmentation conditional score method to solve this problem, but it is computational time consuming. Although not reported here, we found in simulation studies that the current proposed method performed equally well, and in some cases was more robust than the error augmentation conditional score.
4. Corrected score estimation
In generalized linear models with covariate measurement error, the corrected score approach of Nakamura (1990) also does not require specifying the underlying distribution of Xi. As shown in Stefanski (1989), a corrected score function with respect to the score function of a zero-truncated Poisson measurement error model does not exist. In the following, we consider an estimating function which is correctable when Xi are contaminated with measurement errors.
4.1. Parametric corrected score
Let and , then we have the identity
| (7) |
To show this, we note that the first identity is trivial since . For the second identity, we have
Hence,
which proves the identity above. According to (7), when the Xis are available, we may estimate θ through the estimating equation
| (8) |
Remark 2: The estimating function in (8) is also considered a “modified score function” since it is the score function of a weighted partial likelihood proposed by Hwang et al. (2019). Here, we provide a simple justification for this result. Hwang et al. (2019) showed that the resulting estimator of (8) performed well, as it generally retained high efficiency in comparison with the maximum likelihood estimator. As shown in Hwang et al. (2019), this loss in efficiency is quite small; the resulting estimator of (8) will attain efficiency of at least 98% in contrast to the maximum likelihood estimator.
The estimating function is biased if we solve where . Now, let Ki(s) be the cumulant generating function of , that is, whenever the expectation exists. Let be the first derivative of Ki(s). We follow Nakamura (1990) and propose a corrected estimating function of S(θ), that is,
| (9) |
where
It can be seen that . Therefore Φp(θ) = 0 is an unbiased estimating equation. A parametric corrected score (PCS) estimator can be obtained by assuming a known distribution for the measurement error (but with an unknown variance). A normality assumption is perhaps the most commonly used in practice. If we assume that , then , and , where can be estimated from the sample pooled variance of Wij, denoted here as . Alternatively, if for example we assume that Uij ~ Unif(−δ, δ), then where δ can be estimated by the moment method, that is, .
Denote the resulting PCS estimator by that solves Φp(θ) = 0 where Ki(β) and are substituted by their corresponding estimates. Like the CDS approach, Φp(θ) is a summation of unbiased estimating functions, so a sandwich variance estimator of can be obtained with the same form as (5) but we replace the estimating functions ϕc,i with ϕp,i where .
To estimate the population size, write and . We propose the following corrected population size estimator
| (10) |
Clearly, is reduced to when there is no measurement error and if .
For the population size estimator , by the delta method, we have
| (11) |
and this yields a variance estimator for when θ and λi are evaluated at as well as respectively.
4.2. Nonparametric corrected score
The PCS estimator is restricted by an assumption on the measurement error distribution. We now propose a method to estimate Ki(β) and in (9) and (10) nonparametrically, and it is referred to as the NPC method (Huang and Wang 2000, 2001). Here, the measurement errors Uij are assumed to be independent and symmetric identically distributed random variables with the existence of moment generating functions. For i ≤ n, let j, ℓ ≤ mi and j ≠ l, then . Let
We can then estimate by consistently. Similarly, the can be estimated by where
The NPC estimators can be obtained by substituting the resulting estimates of Ki(β) and into (9) and (10), respectively. The corresponding variance estimators are similar to those of PCS, as given at the end of Section 4.1.
5. Simulations
We conducted a simulation study to compare the naïve, conditional score (CDS), and the nonparametric corrected score (NPC) estimators. The simulation study initially included the parametric corrected score (PCS) approach where the measurement error was assumed to follow a normal distribution. However, we do not present the results of PCS here because it performed worse than NPC and CDS in essentially all cases. In particular, the PCS was not as good as CDS or NPC even when the measurement error Uij was normally distributed in which the assumption of PCS holds. A reason for this was that the numerical solution of PCS was less stable compared to NPC and CDS, such that the PCS estimator had a greater chance of yielding extremely large estimates. Moreover, the observed Fisher information matrix of PCS is unstable, which leads to large standard error and interval estimates. The numerical challenges of PCS were similarly observed in Huang (2014).
In the simulations below, we examined both normally and non-normally distributed measurement error structures with different magnitudes of measurement error variances. We also considered different distributions for the true covariates, different effect sizes, capture rates and population sizes.
We examined two cases for the true population size with N = 300 (moderate size), and N = 1000 (larger size), and considered two distributions for the true covariate Xi: the standard normal, and a location shifted, scaled chi-square distribution, where the latter is defined as , and χ2(3) is a chi-square distributed random variable with 3 degrees of freedom. The expected number of times an individual was captured was set to λi = exp(α + βXi), where α = −0.5 or 0.5 which corresponds to an average capture percentages 100 × (n/N) of approximately 45% and 80%, respectively, and β = 0.5 ln(2) ≈ 0.35 or ln(2) ≈ 0.69 (corresponding to moderate and large effect sizes). Thus, Yi ~ Poiss(λi) for all i = 1, …, N. Let mi = Yi be the number of measurements for the ith individual, and let Wij = Xi + Ui for all j = 1, …, mi be the error-contaminated covariate where the measurement error is Ui. We generated Ui from two distributions: N(0, ), and a symmetric uniform distribution with variance equal to .
For each combination of simulation settings, we generated 1,000 simulated data sets. Based on the 1,000 estimates, we calculated the “bias-%” as the median of relative bias (%), that is, we use ; the median absolute deviations, denoted by “MAD”; the median of the standard error estimates, denoted by “Med.SE”, and the root mean square error denoted by “RMSE”. We also included 95% Wald-type confidence interval coverage probabilities (CP) for each estimator. Note that, MAD and Med.SE are robust measures of the standard deviation and the average of standard error estimates, respectively. We used these robust measures because we encountered several extreme estimates for each estimator. In the simulation, an estimation procedure was considered as divergence if its numerical solution did not converge, or the method resulted in a value of . Whenever divergence of this type had occurred, we used the naïve estimate instead. Overall, the naïve method did not encounter divergence and the divergence rate was less than 3% for CDS and NPC in all scenarios. Divergence could occur when α = −0.5, or N = 300 but this had rarely occurred in all other cases.
In Tables 1–2, we report numerical summaries for estimates of β and N when . Specifically, in Table 1 we present results where X ~ N(0, 1) and U ~ N(0, 1), and in Table 2 we present results where X is non-normally distributed and U is uniformly distributed. As in many regression measurement error models, the naïve method showed an attenuation effect in estimating β in all cases, the relative bias was as large as −46% (Table 1) and −73% (Table 2). The naïve population size estimator also yielded a negative bias, as seen in Table 1. This agrees with our findings for the effects of measurement errors when X and U are normally distributed (see Section 2). Moreover, the downward bias of the naïve population size estimator was still apparent in Table 2 even though neither X nor U were normally distributed. Overall, the bias of the naïve population size estimator was not as severe as β where the worst case occurred around −13.1%.
Table 1:
Simulation study results for the naïve, CDS, and NPC methods with the population size N = 1000, the unobserved covariate X ~ N(0, 1) and the measurement error U ~ N(0, 1) for each considered parameter combination. We denote “bias-%” as the median of relative bias (%) – that is, 100 × {( − θ)/θ}, “MAD” as the median absolute deviations, “Med.SE” as the median of the standard error estimates, “RMSE” as the root mean square error, and “CP” as the 95% confidence interval coverage percentages.
|
β
|
N
|
|||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| bias-% | MAD | Med.SE | RMSE | CP | bias-% | MAD | Med.SE | RMSE | CP | |
|
|
|
|||||||||
| α = −0.5, β = 0.5ln(2) |
||||||||||
| Naïve | −45.1 | 0.050 | 0.049 | 0.167 | 10.8 | −3.3 | 64.8 | 63.8 | 72.4 | 87.8 |
| CDS | 0.8 | 0.103 | 0.092 | 0.107 | 91.5 | 0.2 | 79.0 | 75.5 | 85.2 | 94.5 |
| NPC | 0.2 | 0.104 | 0.091 | 0.110 | 92.0 | 1.3 | 81.0 | 79.7 | 91.5 | 95.8 |
| α = −0.5, β = 0.5ln(2) |
||||||||||
| Naïve | −38.0 | 0.027 | 0.025 | 0.136 | 0.1 | −2.0 | 21.2 | 20.3 | 28.8 | 79.9 |
| CDS | −0.4 | 0.041 | 0.040 | 0.041 | 95.0 | −0.4 | 23.6 | 23.0 | 23.6 | 94.2 |
| NPC | −0.0 | 0.042 | 0.040 | 0.042 | 93.7 | 0.1 | 24.4 | 23.8 | 24.2 | 94.7 |
| α = −0.5, β = ln(2) |
||||||||||
| Naïve | −45.2 | 0.049 | 0.048 | 0.315 | 0.0 | −12.8 | 57.0 | 54.6 | 136.1 | 39.7 |
| CDS | −0.2 | 0.092 | 0.089 | 0.096 | 94.0 | −1.3 | 92.4 | 90.4 | 104.2 | 93.6 |
| NPC | 0.2 | 0.103 | 0.091 | 0.117 | 92.3 | 0.5 | 106.3 | 98.4 | 129.0 | 95.6 |
| α = −0.5, β = ln(2) |
||||||||||
| Naïve | −34.5 | 0.030 | 0.031 | 0.240 | 0.0 | −5.3 | 25.0 | 23.9 | 57.7 | 42.1 |
| CDS | −0.1 | 0.041 | 0.041 | 0.041 | 95.4 | −0.4 | 31.0 | 31.2 | 32.6 | 93.5 |
| NPC | 0.1 | 0.044 | 0.041 | 0.046 | 91.7 | −0.1 | 34.4 | 32.3 | 36.7 | 93.1 |
Table 2:
Simulation study results for the naïve, CDS, and NPC methods with the population size N = 1000, the unobserved covariate X is a location-shifted, scaled chi-square distribution and the measurement error U is uniformly distributed with variance for each considered parameter combination. We denote “bias-%” as the median of relative bias (%) – that is, 100 × {( − θ)/θ}, “MAD” as the median absolute deviations, “Med.SE” as the median of the standard error estimates, “RMSE” as the root mean square error, and “CP” as the 95% confidence interval coverage percentages.
|
β
|
N
|
|||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| bias-% | MAD | Med.SE | RMSE | CP | bias-% | MAD | Med.SE | RMSE | CP | |
|
|
|
|||||||||
| α = −0.5, β = 0.5ln(2) |
||||||||||
| Naïve | −63.9 | 0.048 | 0.051 | 0.227 | 1.6 | −3.8 | 63.0 | 60.4 | 73.4 | 86.1 |
| CDS | 5.7 | 0.208 | 0.185 | 0.249 | 92.7 | 1.4 | 99.2 | 88.3 | 199.9 | 95.1 |
| NPC | −1.2 | 0.174 | 0.162 | 0.201 | 92.6 | 1.2 | 91.3 | 87.2 | 166.8 | 95.8 |
| α = 0.5, β = 0.5ln(2) |
||||||||||
| Naïve | −53.9 | 0.026 | 0.026 | 0.190 | 0.0 | −2.7 | 19.6 | 18.9 | 32.7 | 69.8 |
| CDS | 1.9 | 0.070 | 0.067 | 0.073 | 94.8 | −0.1 | 26.7 | 25.1 | 27.6 | 93.9 |
| NPC | −0.0 | 0.069 | 0.064 | 0.069 | 94.2 | 0.2 | 27.1 | 26.2 | 29.0 | 93.6 |
| α = −0.5, β = ln(2) |
||||||||||
| Naïve | −72.0 | 0.047 | 0.047 | 0.500 | 0.0 | −13.1 | 51.0 | 46.5 | 136.0 | 27.6 |
| CDS | 7.5 | 0.271 | 0.234 | 0.511 | 93.0 | 1.3 | 146.0 | 136.9 | 298.2 | 89.9 |
| NPC | −0.1 | 0.255 | 0.222 | 0.294 | 91.2 | 0.2 | 142.6 | 131.6 | 353.1 | 91.8 |
| α = 0.5, β = ln(2) |
||||||||||
| Naïve | −60.9 | 0.029 | 0.027 | 0.422 | 0.0 | −8.7 | 18.8 | 15.9 | 87.9 | 0.7 |
| CDS | 1.0 | 0.090 | 0.090 | 0.096 | 94.0 | −0.4 | 41.3 | 39.1 | 42.7 | 91.2 |
| NPC | 1.8 | 0.106 | 0.094 | 0.117 | 93.3 | 0.5 | 51.3 | 45.9 | 68.0 | 94.4 |
CDS performed very well for cases when U was normally distributed (see Table 1). However, when U was not normally distributed (Table 2), CDS may be biased for β whenever the capture probabilities were small (α = −0.5). Nevertheless, in most cases the CDS estimators for N were nearly unbiased. Moreover, CDS showed a certain degree of robustness in Table 2, this was most noticeable for α = 0.5 when the data was rich (larger n and ). Under many cases, the CDS estimator was better than the corrected score estimators in terms of RMSE. However, when the measurement error was uniformly distributed, it was slightly worse than NPC in terms of CP.
The nonparametric corrected score NPC is robust to distributional assumptions on U. Therefore, only NPC showed unbiasedness in all cases of Tables 1–2. Though not reported, NPC performed better than PCS, in particular, whenever Uij was normally distributed. In Table 2 with α = −0.5, the MAD of NPC was smaller than CDS but this phenomena was generally reversed otherwise. We note that since NPC was constructed under a modified estimating function, standard errors (and MADs) were, in general, slightly larger compared to the maximum likelihood estimator when (i.e., the reduced form of CDS), see Remark 2 in Section 4.
The proposed variance estimators of CDS and NPC worked reasonably well, as the resulting Med.SE were very close to MAD in most cases. We found that variability due to estimating (or K(β)) is generally negligible, and it has almost no effect on the variance estimation.
In Figure 1, we plotted the bias-% against the measurement error variance for the population size N where N = 1000. We have also included the results for the PCS method in these plots. For all cases, the naïve estimator for N exhibited a downward bias as , and the bias-% had significantly increased with increasing . The bias-% of CDS and NPC were generally quite low, with the exception of the case when α = 0.5, β = 0.69, and when X was non-normally distributed.
Figure 1:

Relative bias for N against increasing measurement error variances () when X ~ N(0, 1) (top) and X follows is a location-shifted, scaled chi-square distribution (bottom) for N = 1000. Sub-figures (a)–(d) correspond to the four effect size scenarios when U ~ N(0, ), and sub-figures (e)–(h) correspond to the four effect size scenarios when U follows a symmetric uniform distribution. We denote “Relative Bias” as the median of relative bias (%) – that is, 100 × {( − θ)/θ}.
In Figure 2 we plotted the CPs of N against for N = 1000. The CPs for the naïve estimator were very poor for increasing , in some cases this was reported to be as low as 0. The CPs for the other methods were reasonable even though some CPs slightly decreased as increased. Additional simulations with various sets of parameters are given in Web Tables 1–8 and Web Figures 1–8 of the Supplementary Materials.
Figure 2:

Coverage probabilities for N against increasing measurement error variances () when X ~ N(0, 1) (top) and X follows is a location-shifted, scaled chi-square distribution (bottom) for N = 1000. Sub-figures (a)–(d) correspond to the four effect size scenarios when U ~ N(0, ), and sub-figures (e)–(h) correspond to the four effect size scenarios when U follows a symmetric uniform distribution.
Overall, the proposed correction methods had reduced bias and yielded accurate interval estimates. However, we also observed that both corrected methods yielded large population size estimates, especially when capture probabilities were low, combined with high heterogeneity. Briefly, in contrast to the naïve method, the proposed estimators generally have smaller bias but larger variances. This is a bias-variance trade-off. In this simulation study of N = 300, the proposed population size estimators performed poorly (in terms of RMSE) compared to the naïve method, but when capture probabilities and/or the population size N had increased, the proposed methods showed superiority in RMSE, so long as the benefit of bias is greater than the loss of variance. For example, when the population size is set to 5000, the RMSEs of the correction methods are significantly smaller than that of the naïve method for all considered scenarios (these results are not reported here).
6. Analysis of yellow-bellied prinia data
We applied the methods to the real capture–recapture data collected on the yellow-bellied prinia (Prinia flaviventris) study described in Section 1. The data were collected at the Mai Po Bird Sanctuary in Hong Kong from January to April in 1993. The yellow-bellied prinia is a resident bird in that area, and the period of January to April is a non-breeding season so that the closure population assumption is reasonable. We excluded two birds from the data due a missing value in wing length and one in fat index. As a consequence, there were n = 163 uniquely observed individuals captured, and the total number of captures made was with an average capture frequency of . This recapture rate was very low. During the experiment, bird wing length measurements were recorded upon capture, which were known to be error-prone. The average wing length was with a sample variance of 1.576. The estimated measurement error variance for the wing length was . Hence, the proportion of measurement error variance was about 24% (i.e., ), which suggests that the measurement error in bird wing lengths should not be ignored in the data analysis. Note that bird wing lengths will increase over time in nature, however the sampling period for this study is very short, and all measurements were collected on adult birds. We therefore assume bird wing lengths are constant. For fat index, we combined levels 2–4 as a single “fat” class (recorded here as 1) and used level 1 as a “not fat” class (recorded here as 0). Of the 163 observed birds, 70 had “no fat” recorded and 93 with fat. Fat index is considered as a precisely recorded covariate in this analysis.
We assume that is the expected count of the i-th individual where Xi = true wing length and Zi = fat index. We fitted the naïve, CDS, PCS, and NPC as mentioned in Section 5. Model parameters and population size estimates (standard errors in parenthesis) are given in Table 3.
Table 3:
Regression coefficients and population size estimates (standard errors in parenthesis) for the Naïve, CDS, and NPC methods fitted to the Yellow-bellied Prinia capture–recapture data. The number of uniquely observed individuals was n = 163. Bird wing length measurements were used as a covariate where the estimated measurement error was which yields a measurement reliability ratio of 76.2%. Notice the distinct difference in parameter and population estimates between the naïve and measurement error models.
| Naïve | −18.713 (3.640) | 0.377 (0.080) | 1.071 (0.400) | 668.377 (157.907) |
| CDS | −24.690 (5.908) | 0.508 (0.130) | 1.063 (0.401) | 721.761 (182.982) |
| PCS | −23.311 (4.831) | 0.478 (0.106) | 1.069 (0.401) | 691.563 (173.311) |
| NPC | −22.109 (4.479) | 0.452 (0.098) | 1.074 (0.401) | 687.119 (171.708) |
From the result in Table 3, the naïve estimates for the intercept and slope were small compared with other methods which accounted for measurement errors. Specifically, the slope estimate of β was reduced by at least 16% compared with other methods. Hence the attenuation effect on the naïve method was quite remarkable. For population size estimation, the naïve method also yielded the smallest value, while the reduction was approximately 3% to 7%. This phenomena was similar to what we observed in the simulation study in Section 5. We found that PCS, and NPC produced comparable slope estimates and standard errors while CDS made a much larger correction for the measurement error. The estimate for the fat index effect was similar across all models (estimated at approximately 1.07), and was deemed significant for all models.
7. Discussion
In this study, we proposed three new zero-truncated Poisson regression methods that correct for uncertainty in the covariates that are collected in capture–recapture experiments. In particular, we introduced a corrected score method which has yet to be developed for zero-truncated regression and population size estimation. Through simulation studies and a real-life example, we observed that bias in the slope and population size estimates had significantly reduced when compared with naïve capture–recapture models. We also showed analytically, why the unadjusted population size estimate results in downward bias in the presence of measurement error.
Each of the proposed methods has its own advantages and disadvantages. For example, whilst both the conditional score and parametric corrected score methods are functional (that is, the distribution of the true covariate need not be specified), both methods still rely on normality or parametric assumptions on the measurement error. Furthermore, we found that the parametric corrected score method can be numerically unstable under some parameter settings.
Overall, we do not recommend the PCS method based on a normality measurement error assumption because it does not have a clear advantage in any aspect. However, the performance of PCS based on other parametric measurement error assumptions has not been fully understood. On a related research, the estimating function of PCS may be further modified (Huang, 2014) to improve the estimation efficiency. The naïve, CDS and NPC methods have their own advantages, we therefore provide some recommendations and guideline for their use when the aim is to estimate the population size:
Naïve method:
This estimator is typically biased, but can be used when the measurement error is small, the population size is small, or the capture-probabilities are low.
Conditional score:
This estimator usually has small bias when the measurement error is moderate or large. For moderate sample sizes, it works well even if the measurement error is not normally distributed. However, it may be biased with large sample sizes if there is violation of the normal error assumption.
Non-parametric corrected score:
This is the most robust estimator in terms of distributional assumptions. Compared with CDS, the NPC estimator is better when the measurement error distribution is not normally distributed. It usually works well with moderate sample sizes, and it performs the best when the population size is large.
We also assessed the performance of population size estimation on models that used “binned” error-contaminated covariates. Specifically, we evenly categorized the observed (error prone) continuous covariate into a fixed number of bins which were then used as dummy variables for the naïve model. We conducted several simulation studies (see Web Tables 9 and 10) where we considered 4 and 8 bins. We also fitted the binned covariate models to the yellow-bellied prinia bird capture–recapture data where we categorized the wing length covariate into 2, 3, 4 and 5 bins of equal frequency (see Web Table 11). In almost all the considered cases, we observed that the bias for the binned models was larger than our proposed estimators, and the coverage probabilities were below the nominal level. In summary, our simulation results showed that the method by binned error-contaminated covariates was not as good as our proposed estimators in general.
Several methodological problems remain to be addressed for future research. For the yellow-bellied prinia bird example, based on our analysis, the corrected score methods performed similarly in terms of parameter estimation but were substantially different compared to the fitted conditional score model. Thus, an unsolved and interesting problem remains in choosing the best capture–recapture model (or method) amongst a set of candidate models (that is, model selection in the presence of measurement error). Also, in many studies involving capture–recapture experiments, covariate values collected on captured individuals are missing at random; see Xi et al. (2009) and Stoklosa, Lee and Hwang (2019). A potential extension to our methods is to simultaneously account for missingness, in addition to measurement error in the covariates. We leave these research topics as future work.
Supplementary Material
Acknowledgements
The authors are grateful to an associate editor and two referees for reviewing the manuscript and providing valuable comments. This work was supported by the Ministry of Science and Technology of Taiwan (Hwang), US National Institutes of Health grants CA235122, CA239168, CA86368, and ORIP S10OD028685 (Wang).
Footnotes
Supplementary Materials
In the Web Appendix we give a proposition and proof for the effects of measurement errors, Web Tables 1–11 and Web Figures 1–8 consisting of simulation results.
References
- Alho JM, Mulry MH, Wurdeman K, and Kim J (1993). Estimating heterogeneity in the probabilities of enumeration for dual-system estimation. Journal of the American Statistical Association 88, 1130–1136. [PubMed] [Google Scholar]
- Carroll RJ, Ruppert D, Stefanski LA, and Crainiceanu CM (2006). Measurement Error in Nonlinear Models: A Modern Perspective. 2nd ed. London: Chapman and Hall. [Google Scholar]
- Chao A (2001). An overview of closed capture–recapture models. Journal of Agricultural, Biological, and Environmental Statistics 6, 158–175. [Google Scholar]
- Hansen MH, and Hurwitz WN (1943). On the theory of sampling from finite populations. The Annals of Mathematical Statistics 14, 333–362. [Google Scholar]
- Huang Y (2014). Corrected score with sizable covariate measurement error: Pathology and remedy. Statistica Sinica 24, 357–374. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huang Y and Wang CY (2000). Cox regression with accurate covariates unascertainable: A nonparametric-correction approach. Journal of the American Statistical Association 95, 1209–1219. [Google Scholar]
- Huang Y and Wang CY (2001). Consistent functional methods for logistic regression with errors in covariates. Journal of the American Statistical Association 96, 1469–1482. [Google Scholar]
- Huang YH, Hwang WH and Chen FY (2011). Differential measurement errors in zero-truncated regression models for count data. Biometrics 67, 1471–1480. [DOI] [PubMed] [Google Scholar]
- Huggins RM (1989). On the statistical analysis of capture experiments. Biometrika 76, 133–140. [Google Scholar]
- Huggins R and Hwang WH (2010). A measurement error model for heterogeneous capture probabilities in mark-recapture experiments: An estimating equation approach. Journal of Agricultural, Biological, and Environmental Statistics 15, 198–208. [Google Scholar]
- Hwang WH and Huang SYH (2003). Estimation in capture–recapture models when covariates are subject to measurement errors. Biometrics 59, 1113–1122. [DOI] [PubMed] [Google Scholar]
- Hwang WH and Huggins R (2005). An examination of the effect of heterogeneity on the estimation of population size using capture–recapture data, Biometrika 92, 229–233. [Google Scholar]
- Hwang WH and Huang SYH (2007). Measurement errors in continuous-time capture–recapture models. Journal of statistical planning and inference 137, 1888–1899. [Google Scholar]
- Hwang WH, Huang SYH and Wang CY (2007). Effects of measurement error and conditional score estimation in capture–recapture models. Statistica Sinica 17, 301–316. [Google Scholar]
- Hwang WH, Heinze D, and Stoklosa J (2019). A weighted partial likelihood approach for zero-truncated models. Biometrical Journal 61, 1073–1087. [DOI] [PubMed] [Google Scholar]
- Liu Y, Li P and Qin J (2017). Maximum empirical likelihood estimation for abundance in a closed population from capture–recapture data. Biometrika 104, 527–543. [Google Scholar]
- Liu Y, Liu Y, Li P and Qin J (2018). Full likelihood inference for abundance from continuous time capture–recapture data. Journal of the Royal Statistical Society, Series B 80, 995–1014. [Google Scholar]
- McCrea RS and Morgan BJ (2015). Analysis of Capture–Recapture Data. Chapman and Hall/CRC Press. [Google Scholar]
- Nakamura T (1990). Corrected score function for errors-in-variables models: Methodology and application to generalized linear models. Biometrika 77, 127–137. [Google Scholar]
- Pollock KH (2002). The use of auxiliary variables in capture–recapture modelling: an overview. Journal of Applied Statistics 29, 85–102. [Google Scholar]
- Schofield MR, Barker RJ, and Gelling N (2018). Continuous-time capture–recapture in closed populations. Biometrics 74, 626–635. [DOI] [PubMed] [Google Scholar]
- Stefanski LA (1989). Unbiased estimation of a nonlinear function a normal mean with application to measurement error models. Communications in Statistics, Series A - Theory and Methods 18, 4335–4358. [Google Scholar]
- Stefanski LA and Carroll RJ (1987). Conditional scores and optimal scores for generalized linear measurement-error models. Biometrika 74, 703–716. [Google Scholar]
- Stoklosa J, Hwang WH, Wu SH, and Huggins RM (2011). Heterogeneous capture–recapture models with covariates: A partial likelihood approach for closed populations. Biometrics 67, 1659–1665. [DOI] [PubMed] [Google Scholar]
- Stoklosa J, Lee SM, and Hwang WH (2019). Closed-population capture–recapture models with measurement error and missing observations in covariates. Statistica Sinica 29, 589–610. [Google Scholar]
- Van der Heijden PGM, Cruyff MJLF, and Van Houwelingen HC (2003). Estimating the size of a criminal population from police records using the truncated Poisson regression model. Statistica Neerlandica 57, 1–16. [Google Scholar]
- Wilson KR, and Anderson DR (1995). Continuous–time capture–recapture population estimation when capture probabilities vary over time. Environmental and Ecological Statistics 2, 55–69. [Google Scholar]
- Xi L, Watson R, Wang JP, and Yip PSF (2009). Estimation in capture–recapture models when covariates are subject to measurement errors and missing data. The Canadian Journal of Statistics 37, 645–658. [Google Scholar]
- Xu K and Ma Y (2014). Effective use of multiple error-prone covariate measurements in capture–recapture models. Statistica Sinica 24, 1529–1546. [Google Scholar]
- Yip PSF, Lin HZ, Xi L (2005). A semiparametric method for estimating population size for capture–recapture experiment with random covariates in continuous time. Biometrics 61, 1085–1092. [DOI] [PubMed] [Google Scholar]
- Zhang W and Bonner SJ (2020). On continuous-time capture–recapture in closed populations. Biometrics 76, 1028–1033. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
