Simultaneous treatment of unspecified heteroskedastic model error distribution and mismeasured covariates for restricted moment models

Tanya P Garcia; Yanyuan Ma

doi:10.1016/j.jeconom.2017.06.005

Author manuscript; available in PMC: 2018 Oct 1.

Published in final edited form as: J Econom. 2017 Jul 8;200(2):194–206. doi: 10.1016/j.jeconom.2017.06.005

PMCID: PMC5708600 NIHMSID: NIHMS891467 PMID: 29200600

Abstract

We develop consistent and efficient estimation of parameters in general regression models with mismeasured covariates. We assume the model error and covariate distributions are unspecified, and the measurement error distribution is a general parametric distribution with unknown variance-covariance. We construct root-n consistent, asymptotically normal and locally efficient estimators using the semiparametric efficient score. We do not estimate any unknown distribution or model error heteroskedasticity. Instead, we form the estimator under possibly incorrect working distribution models for the model error, error-prone covariate, or both. Empirical results demonstrate robustness to different incorrect working models in homoskedastic and heteroskedastic models with error-prone covariates.

Some Key Words: Influence function, Linear operator, Measurement error, Nuisance tangent space, Restricted moment model

1 Introduction

1.1 Motivating problem

Regression is arguably the most familiar topic in econometrics and statistics and has motivated a vast amount of literature. Many scientific phenomena can be modeled using a general regression model where a univariate response Y is related to covariates X ∈ ℝ^k and Z ∈ ℝ^s through

Y = m (X, Z; β) + ε .

(1)

Here, m is known up to the parameter β ∈ ℝ^p, and the model error ε is only required to satisfy E(ε|X,Z) = 0. With the conditional distribution of ε unspecified, this model is also known as a restricted moment model (RMM). A typical challenge with RMMs is that some covariates, say Z, are precisely measured, whereas others, say X, are mismeasured. In place of X_i, i = 1, … , n, one instead observes ℓ surrogate replicates

W_{i j} = X_{i} + U_{i j}, j = 1, \dots, ℓ,

(2)

where the U_ij’s are independent, mean zero random variables with unknown variance-covariance Ω_U ∈ ℝ^{k×k}. The surrogacy assumption implies that Y_i and the W_ij’s are conditionally independent given (X_i, Z_i). Lastly, we suppose the measurement error is classical so that X_i and U_ij are independent.
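To make the data structure in (1) and (2) concrete, the following minimal sketch draws observations from one hypothetical instance of the model. The linear mean function, the binary Z, the normal distributions, and the particular heteroskedastic standard deviation are all assumptions made for illustration only; none of them is required by the model.

```python
import math
import random

def simulate_rmm(n, n_reps, beta, sigma_u, seed=0):
    """Draw (Y_i, Z_i, [W_i1, ..., W_il]) from an illustrative instance of (1)-(2)."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        z = rng.choice([0.0, 1.0])      # precisely measured covariate Z
        x = rng.gauss(0.5 * z, 1.0)     # true covariate X (never observed)
        # Heteroskedastic model error: E(eps | X, Z) = 0, spread depends on X.
        eps = rng.gauss(0.0, 0.5 * math.sqrt(1.0 + x * x))
        # Assumed linear mean model m(X, Z; beta).
        y = beta[0] + beta[1] * x + beta[2] * z + eps
        # Classical measurement error: U_ij has mean zero and is independent of X.
        w = [x + rng.gauss(0.0, sigma_u) for _ in range(n_reps)]
        data.append((y, z, w))
    return data

sample = simulate_rmm(n=5, n_reps=2, beta=(1.0, 2.0, -1.0), sigma_u=0.7)
for y, z, w in sample:
    print(round(y, 3), z, [round(v, 3) for v in w])
```

Only Y, Z and the replicates W are returned, mirroring the fact that X is unavailable to the analyst.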

An example of this model is in the nutrition study of Flagg et al. (2000). There, a key interest is properly modeling the relationship between percent calories from fat (Y ), race (Z), and saturated fat intake (X). Saturated fat intake is not known exactly and only an approximate version via two repeated measurements, W_·1,W_·2, is available from food frequency questionnaires. To handle the measurement error in this example and in any model characterized by (1) and (2), the goal of this paper is to estimate the model parameters β and Ω_U under the following general assumptions:

Assumption (i): the mean model m(X,Z; β) is any linear or nonlinear function;
Assumption (ii): the model error ε may depend on (X,Z) (i.e., heteroskedasticity), and its conditional distribution p_ε|X,Z(ε|x, z) is unspecified;
Assumption (iii): the conditional distribution of X given Z, p_X|Z(x|z), and the distribution of Z, p_Z(z), exist but are completely unspecified. Thus we have a modern functional measurement error model (Carroll et al., 2006, chap. 7.2);
Assumption (iv): the measurement error is classical and U_ij, i = 1, … , n; j = 1, … , ℓ, has a general parametric distribution p_{U_ij} (u; Ω_U) with Ω_U unknown. This contrasts with the usual normality assumption for measurement error (Carroll et al., 1999, 2004).

1.2 Estimation challenges

Allowing p_ε|X,Z, p_X|Z, and p_Z to be unspecified provides more modeling flexibility and reduces the chance of model misspecification. However, it also raises serious challenges. The unknown distributions cannot be ignored, and arbitrarily adopting models for p_ε|X,Z or p_X|Z may cause bias. Estimating these distributions is also potentially difficult. For example, p_X|Z is a model of unobserved variables. Its estimation would involve an inverse operation such as deconvolution (Stefanski and Carroll, 1990), which results in a very slow rate (Carroll and Hall, 1988; Fan, 1991). The estimation of p_ε|X,Z(ε|x, z) is equally challenging because residuals are unobtainable in measurement error models even if model parameters were known. The unavailability of the residuals makes correctly estimating the model error’s variance-covariance difficult. This is especially problematic when the model error is heteroskedastic, and a proper variance-covariance is needed to yield consistent model parameter estimates. Although methods exist to estimate the unknown variance-covariance, they are either approximate (Carroll and Wang, 2008) or complex (Delaigle and Hall, 2011).

1.3 Competing methods and features of our approach

A nonlinear, classical measurement error model with replicates has been treated by several authors. Extensive research has focused on measurement error problems with specific forms of m(X,Z; β), ranging from polynomial regression (Chan and Mak, 1985; Cheng and Schneeweiss, 1998; Cheng et al., 2000; Huang and Huwang, 2001) to generalized linear mixed models (Liang, 2009; Li and Liqun, 2012). For our purposes, we consider m(X,Z; β) to be any linear or nonlinear form, which is a general assumption of many existing works. For example, Li (2002) used Kotlarski’s identification (Rao, 1992, p. 21) to identify and consistently estimate model parameters for a general m(X,Z; β) with two replicates W_i₁,W_i₂. Tsiatis and Ma (2004) developed a consistent, asymptotically normal estimator when p_ε|X,Z is known and parametric. Schennach (2004a) used properties of Fourier transforms, where the crux of her work lies in constructing moments of the unobserved X, and then forming estimators that can be written in terms of these moments. Schennach (2004b) developed an unbiased, Nadaraya-Watson based estimator to nonparametrically estimate m(X,Z) in (1). Lastly, Hu and Schennach (2008) and Schennach and Hu (2013) used a sieve maximum likelihood estimator (MLE) which yields consistency and the former successfully handles heteroskedastic measurement error (i.e., U in (2) depends on X). For an overview on measurement error models, see Fuller (1987) for earlier results in linear models and Carroll et al. (2006) for modern approaches in linear and nonlinear models. The developed methodologies have all positively impacted the literature of regression with classical measurement error. Still, some limitations linger and it is these limitations that motivated this work.

In this paper, we propose to overcome two key limitations of existing methods: the direct estimation or knowledge of p_ε|X,Z, and the inability to handle model error heteroskedasticity. In this regard, we develop a semiparametric estimator which avoids estimating p_X|Z and p_ε|X,Z. This is possible through deriving the semiparametric efficient score (Bickel et al., 1993; Tsiatis, 2006) which we reveal is robust to misspecification of the unknown distributions. Our approach involves adopting working parametric models for the unknown distributions. We show that if the working models are correct, then the estimator is semiparametric efficient; otherwise, the estimator is still root-n consistent and asymptotically normal. Lastly, our method does not require correctly estimating the model error’s variance-covariance.

Not having to directly estimate p_ε|X,Z differs from the semiparametric Tsiatis and Ma (2004) method and the sieve MLE (Shen, 1997; Schennach and Hu, 2013). Tsiatis and Ma (2004) assume p_ε|X,Z is a known, parametric form. Unfortunately, in our own numerical studies (Section 4), we found that such an assumption is sensitive to misspecification of the model error variance. With the sieve MLE, p_ε|X,Z and p_X|Z are represented by increasingly rich parametric representations such as a truncated series of basis functions. The parameters in the truncated series and regression are then jointly estimated via MLE subject to constraints that ensure the estimated p_ε|X,Z, p_X|Z are valid densities and that E(ε|X,Z) = 0. Sieve methods yield consistent estimators and are fairly straight-forward to implement, making the approach widely appreciated in the literature. However, compared to the sieve MLE, our approach bypasses the consistent estimation of p_ε|X,Z, p_X|Z. In doing so, our method eliminates a step in the aim of constructing a consistent estimator and, as described next, flexibly handles potential heteroskedasticity in the model error.

Model error heteroskedasticity is a challenging problem, especially in a measurement error setting where residuals are unavailable to aid the appropriate modeling of variance-covariance structures. In bypassing the correct estimation of p_ε|X,Z, our method implicitly handles misspecifications of the model error’s variance structure. That is, knowledge of the model error being heteroskedastic or homoskedastic is not needed. In our own explorations of existing estimators to handle model error heteroskedasticity, we found some shortcomings. The estimators of Li (2002) and Schennach and Hu (2013) both assume ε and (X,Z) are independent (i.e., homoskedastic model error). Consequently, relying on the homoskedastic assumption naturally results in bias when the model error is truly heteroskedastic; see numerical studies in Section 4 for bias of the sieve estimator from Schennach and Hu (2013). The bias persists even when the number of terms in the sieve representations increases. As an improvement, Hu and Schennach (2008) developed a different sieve estimator that successfully handles heteroskedastic measurement error (i.e., U in (2) depends on X). Unfortunately, when we extended their methodology to handle heteroskedastic model error, we encountered two difficulties. First, for the heteroskedastic sieve of p_ε|X,Z to be a valid density and have conditional mean zero, we must impose twelve constraints (see Section S.8, Supplementary Material). Second, from our numerical studies (Section 4), we found that the heteroskedastic sieve estimator yielded biased estimates for the RMM models considered here. Given that the sieve approach has been widely successful in various regressions with errors-in-covariates, we were initially surprised by these results. However, we now believe the bias is a consequence of the complex computation that attempts a constrained optimization subject to too many constraints.

Lastly, for a nonparametric regression with classical measurement error, Schennach (2004b) developed an unbiased, Nadaraya-Watson based estimator that can handle heteroskedastic model error. However, our situation is completely different in that we consider a semiparametric regression model (i.e., m(X,Z; β)), not a nonparametric one (i.e., m(X,Z)).

Thus, as far as we are aware, our semiparametric approach provides advantages over existing methods in that it bypasses estimating p_ε|X,Z and p_X|Z, and simultaneously handles unspecified heteroskedastic model error and mismeasured covariates. It is important to note that our method is developed under the specific assumptions in Section 1.1, which include the availability of multiple proxy variables and classical measurement error (Assumption (iv)). Under Assumption (iv), we may easily estimate Ω_U in the measurement error distribution (Section 2.1) and thus, more easily identify estimating equations for β (Theorem 1). When this assumption no longer holds, the estimation procedure is more difficult: a more general method is needed to simultaneously estimate Ω_U and β. Work in this area has been explored; see Hu and Schennach (2008) and Chen et al. (2009) for developments in non-classical measurement error, and Chen et al. (2009) for estimation without available replicates.

The rest of the paper is as follows. Section 2 establishes identifiability results for the model parameters. Section 3 describes the main results for the semiparametric estimator, including theoretical properties, robustness to misspecifications of working distributions and its numerical implementation. We show the satisfying performance of the estimator through a simulation study in Section 4 and a data example in Section 5. Section 6 concludes the paper with a brief discussion. Technical proofs and additional simulation results are provided in the Supplementary Material. All computer codes are available upon request.

2 Identification

2.1 Identification of Ω_U

The identification of Ω_U is facilitated by the observed replicates. If replicates are unavailable, then validation data (Lee and Sepanski, 1995) or instrumental variables (Carroll et al., 2004) can be used.

To identify Ω_U, we use the usual components of variance analysis (Carroll et al., 2006, chap. 4). Define $W_{i} = \sum_{j = 1}^{ℓ} W_{i j} / ℓ$ and $V_{i} = \sum_{j = 1}^{ℓ} (W_{i j} - W_{i}) {(W_{i j} - W_{i})}^{T}$. Then Ω_U = E(V_i)/(ℓ − 1), hence it is identifiable. In practice, we solve

\sum_{i = 1}^{n} (\frac{V_{i}}{ℓ - 1} - Ω_{U}) = 0

(3)

to obtain Ω̂_U.
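Because (3) is linear in Ω_U, its root is simply the average of V_i/(ℓ − 1) over subjects. The sketch below computes this average with plain Python lists; the function name `estimate_omega_u` and the simulated check with normal measurement errors are illustrative assumptions, not part of the original development.

```python
import random

def estimate_omega_u(replicates):
    """Root of (3): Omega_hat = sum_i V_i / {n (l - 1)}.

    replicates[i][j] is the k-vector W_ij, stored as a list of floats.
    """
    n = len(replicates)
    n_reps = len(replicates[0])
    k = len(replicates[0][0])
    total = [[0.0] * k for _ in range(k)]
    for reps in replicates:
        # Subject-level average of the replicates.
        w_bar = [sum(r[a] for r in reps) / n_reps for a in range(k)]
        for r in reps:
            d = [r[a] - w_bar[a] for a in range(k)]
            for a in range(k):
                for b in range(k):
                    total[a][b] += d[a] * d[b]   # accumulates V_i
    scale = n * (n_reps - 1)
    return [[total[a][b] / scale for b in range(k)] for a in range(k)]

# Illustrative check: k = 2, l = 3, independent errors with sd 0.5 and 1.0,
# so the true Omega_U is diag(0.25, 1.0).
rng = random.Random(1)
true_sd = (0.5, 1.0)
replicates = []
for _ in range(20000):
    x = [rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)]
    replicates.append(
        [[x[a] + rng.gauss(0.0, true_sd[a]) for a in range(2)] for _ in range(3)]
    )
omega_hat = estimate_omega_u(replicates)
print(omega_hat)
```

The unobserved X_i cancels in every difference W_ij − W_i, which is why no knowledge of the distribution of X is needed at this step.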

2.2 Identification of β

We demonstrate identifiability of β by casting the RMM with measurement error into a semiparametric framework. Let η₁(x, z) ≡ p_X|Z(x|z), η₂(ε, x, z) ≡ p_ε|X,Z(ε|x, z), and η₃(z) ≡ p_Z(z) denote infinite-dimensional nuisance parameters corresponding to the unknown distributions. Let W denote the average of the observed replicates and p_W|X,Z(w|x, z; α) denote its conditional distribution given (X,Z), with α = vech(Ω_U) (i.e., the vectorized version of the upper block of Ω_U including its diagonal). Then, the probability density function of (Y,W,Z) is

p_{Y, W, Z} (y, w, z; β, α, η_{1}, η_{2}, η_{3}) = \int η_{2} {y - m (x, z; β), x, z} p_{W ∣ X, Z} (w ∣ x, z; α) η_{1} (x, z) η_{3} (z) d μ (x),

(4)

where dμ(·) denotes the dominating measure, which is the Lebesgue measure for continuous variables and the counting measure for discrete variables. The density of (Y,W,Z) contains both finite and infinite-dimensional parameters, hence the RMM with measurement error is a semiparametric model.

The identifiability of β in the RMM with measurement error is closely linked to the identifiability of β in the RMM without measurement error. To see this, assume to the contrary that the RMM without measurement error is identifiable, but that β in the RMM with measurement error is not. Then, there exist β₀, η₁, η₂, η₃ and β^†, $η_{1}^{†}, η_{2}^{†}, η_{3}^{†}$ where β₀ ≠ β^†, but β₀, η₁, η₂, η₃ and β^†, $η_{1}^{†}, η_{2}^{†}, η_{3}^{†}$ yield the same data generation procedure:

\begin{array}{rcl} p_{Y, W, Z} (y, w, z; β, α, η_{1}, η_{2}, η_{3}) & = & \int η_{2} {y - m (x, z; β_{0}), x, z} η_{1} (x, z) η_{3} (z) p_{U} (w - x; α) d x \\ & = & \int η_{2}^{†} {y - m (x, z; β^{†}), x, z} η_{1}^{†} (x, z) η_{3}^{†} (z) p_{U} (w - x; α) d x . \end{array}

Here, p_U(u; α) denotes the measurement error distribution. Deconvolution then implies that for all (y, x, z), $η_{2} {y - m (x, z; β_{0}), x, z} η_{1} (x, z) η_{3} (z) = η_{2}^{†} {y - m (x, z; β^{†}), x, z} η_{1}^{†} (x, z) η_{3}^{†} (z)$. A similar argument applied to

p_{W, Z} (w, z; α, η_{1}, η_{3}) = \int p_{U} (w - x; α) η_{1} (x, z) η_{3} (z) d x = \int p_{U} (w - x; α) η_{1}^{†} (x, z) η_{3}^{†} (z) d x,

yields $η_{1} (x, z) η_{3} (z) = η_{1}^{†} (x, z) η_{3}^{†} (z)$ for all (x, z). Together, these results imply that on the support of the probability density of (x, z),

η_{2} {y - m (x, z; β_{0}), x, z} = η_{2}^{†} {y - m (x, z; β^{†}), x, z}

(5)

for all (y, x, z). Hence, (5) implies that the conditional model error distributions under β₀ and under β^† are identical, which makes the RMM without measurement error not identifiable. This contradicts our original assumption. Therefore, we have identifiability as long as we begin with an identifiable RMM without measurement error. Identifiability of the RMM without measurement error depends on the specific form of the mean model and is generally straightforward to establish.

3 Methodology

3.1 Estimation of Ω_U and β

Estimation of Ω_U, equivalently α = vech(Ω_U ), follows directly from the solution to (3). Estimation of β builds upon the semiparametric results for an RMM without measurement error. For this latter case, Tsiatis (2006) demonstrated that consistent estimators are the solutions to the linear estimating equation

\sum_{i = 1}^{n} A (X_{i}, Z_{i}) {Y_{i} - m (X_{i}, Z_{i}; β)} = 0.

Here, A(X,Z) ∈ ℝ^p is an arbitrary function that does not cause the above estimating equation to degenerate. If A(X,Z) = {∂m(X,Z; β)/∂β} E(ε²|X,Z)⁻¹, then the equation is named the optimal generalized estimating equation (optimal GEE; Liang and Zeger, 1986), and it yields the efficient estimator. See Section S.1 (Supplementary Material) for a brief overview of the semiparametric procedure and its application to the RMM without measurement error.
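As a small illustration of the error-free case, the sketch below solves the linear estimating equation for the assumed scalar model m(X; β) = βX, once with the simple choice A(X) = X and once with the optimal GEE choice A(X) = X / E(ε²|X). The helper `solve_linear_ee`, the normal data, and the variance function 0.25(1 + x²) are hypothetical and chosen only so that the root has a closed form.

```python
import random

def solve_linear_ee(x, y, weight):
    """Root in beta of sum_i A(x_i) (y_i - beta * x_i) = 0 with A(x) = x * weight(x)."""
    num = sum(weight(xi) * xi * yi for xi, yi in zip(x, y))
    den = sum(weight(xi) * xi * xi for xi in x)
    return num / den

rng = random.Random(3)
beta_true = 2.0
x = [rng.gauss(0.0, 1.0) for _ in range(20000)]
# Heteroskedastic error with var(eps | x) = 0.25 * (1 + x^2).
y = [beta_true * xi + rng.gauss(0.0, 0.5 * (1.0 + xi * xi) ** 0.5) for xi in x]

# A(X) = X: consistent but not efficient under heteroskedasticity.
beta_simple = solve_linear_ee(x, y, lambda xi: 1.0)
# A(X) = X / E(eps^2 | X): the optimal GEE weight, using the known variance function.
beta_optimal = solve_linear_ee(x, y, lambda xi: 1.0 / (0.25 * (1.0 + xi * xi)))
print(beta_simple, beta_optimal)
```

Both choices of A target the same β; they differ only in the variability of the resulting estimator. Note that the optimal weight uses X itself, which is exactly what is unavailable once X is mismeasured.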

Applying the semiparametric procedure to the RMM with measurement error, we establish in Theorem 1 the condition that any consistent estimator for β must satisfy. A detailed derivation is given in Section S.2 (Supplementary Material).

Theorem 1

For the RMM with measurement error, a consistent estimator for β is the solution to $\sum_{i = 1}^{n} f (Y_{i}, W_{i}, Z_{i}; β) = 0$ where f is a p-dimensional function in

Λ^{⊥} = [f (Y, W, Z) : E {f (Y, W, Z) ∣ Y, X, Z} = g (X, Z) ε] .

Here, g is an arbitrary function of (X,Z) with finite variance, and

E {f (Y, W, Z) ∣ Y, X, Z} = \int f (y, w, z) p_{W ∣ X, Z} (w ∣ x, z; α) d μ (w) .

(6)

Theorem 1 states that to determine if a function f(Y,W,Z; β) yields a consistent estimator for β, one must verify that f belongs to Λ^⊥. The verification involves computing the integral in (6) and checking that the result is of the form g(X,Z)ε for some function g(X,Z). Note that the integration in (6) does not involve the unknown distributions η₁ or η₂. Instead, it only involves the distribution p_W|X,Z(w|x, z; α) which is completely known once α is estimated from (3). This observation means that even without knowing η₁ and η₂, one can verify if a function f belongs to Λ^⊥, and thus use it to form a consistent estimator for β.
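As a worked illustration of this verification, consider the simplest hypothetical case of a scalar covariate with m(X; β) = βX and no Z, and write σ_U² for the variance of a single U_ij, so that the averaged surrogate W satisfies E(W | Y, X) = X and E(W² | Y, X) = X² + σ_U²/ℓ by surrogacy and the classical error assumption. The candidate function below is the familiar corrected score for linear regression; it is used here only as an example and is not the estimator developed in this paper.

```latex
\begin{aligned}
f(Y, W; \beta) &= W (Y - \beta W) + \beta \sigma_U^2 / \ell, \\
E\{f(Y, W; \beta) \mid Y, X\}
  &= Y \, E(W \mid Y, X) - \beta \, E(W^2 \mid Y, X) + \beta \sigma_U^2 / \ell \\
  &= X Y - \beta \left( X^2 + \sigma_U^2 / \ell \right) + \beta \sigma_U^2 / \ell \\
  &= X (Y - \beta X) = X \varepsilon .
\end{aligned}
```

The result has the form g(X)ε with g(X) = X, so this f belongs to Λ^⊥. The calculation uses only the first two moments of the measurement error and no information about η₁ or η₂.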

Unfortunately, finding f that belongs to Λ^⊥ is not a trivial task. It is equivalent to the challenge of finding a corrected score which is only resolved for generalized linear models (Nakamura, 1990). An approximate corrected score is possible using complex-variable computations and Monte Carlo averaging (Novick and Stefanski, 2002). In this work, we use a careful analytic derivation to construct f in Λ^⊥.
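For intuition about what a member of Λ^⊥ delivers in practice, the sketch below applies the classical corrected score for a linear model, f = (1, W)^T (Y − β₀ − β₁W) + (0, β₁σ_U²/ℓ)^T with W the replicate average, to simulated data with heteroskedastic model error. This is the standard linear-model corrected score, not the estimator constructed in this paper, and the data-generating choices and the function name `corrected_score_fit` are assumptions made for illustration.

```python
import math
import random

def corrected_score_fit(y, w_reps):
    """Closed-form root of the linear corrected score; also returns the naive slope."""
    n = len(y)
    n_reps = len(w_reps[0])
    w_bar = [sum(w) / n_reps for w in w_reps]
    # Scalar version of (3): pooled within-subject variance of the replicates.
    sigma2_u = sum(
        sum((wij - wb) ** 2 for wij in w) for w, wb in zip(w_reps, w_bar)
    ) / (n * (n_reps - 1))
    mean_w = sum(w_bar) / n
    mean_y = sum(y) / n
    s_ww = sum((wb - mean_w) ** 2 for wb in w_bar) / n
    s_wy = sum((wb - mean_w) * (yi - mean_y) for wb, yi in zip(w_bar, y)) / n
    # Subtracting var(U-bar) = sigma2_u / l removes the attenuation.
    beta1 = s_wy / (s_ww - sigma2_u / n_reps)
    beta0 = mean_y - beta1 * mean_w
    naive1 = s_wy / s_ww   # least squares slope that ignores measurement error
    return beta0, beta1, naive1

rng = random.Random(7)
y, w_reps = [], []
for _ in range(20000):
    x = rng.gauss(0.0, 1.0)
    eps = rng.gauss(0.0, 0.5 * math.sqrt(1.0 + x * x))   # heteroskedastic error
    y.append(1.0 + 2.0 * x + eps)
    w_reps.append([x + rng.gauss(0.0, 0.8) for _ in range(2)])

beta0_hat, beta1_hat, naive1 = corrected_score_fit(y, w_reps)
print(beta0_hat, beta1_hat, naive1)
```

In this setup the naive slope is attenuated toward zero, while the corrected-score root recovers the true slope without any model for the error variance. Such closed forms are not available for general nonlinear m, which is the difficulty addressed next.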

Let $η_{1}^{*} (x, z)$ and $η_{2}^{*} (ε, x, z)$ be working models of η₁ and η₂, respectively. The working models may be completely different from the true models, denoted as η₁₀, η₂₀, but we assume the support is the same. Throughout, let E_* (·) denote the expectation computed under $η_{1}^{*}, η_{2}^{*}$ , and E(·) denote the expectation computed under η₁₀, η₂₀. Define conjugate linear operators

\mathcal{K}_{1} {h (Y, X, Z)} = E_{*} {h (Y, X, Z) ∣ Y, W, Z}, \mathcal{K}_{2} {f (Y, W, Z)} = E {f (Y, W, Z) ∣ Y, X, Z} .

It is important to note that 𝒦₂ is independent of $η_{1}^{*}, η_{2}^{*}$ as evident from (6); hence its definition is asterisk-free.

Using the projection theorem (Rudin, 1987), we demonstrate in Section S.3 (Supplementary Material) that a function in Λ^⊥ is 𝒦₁{d^*(Y,X,Z)} where d^*(Y,X,Z) is a p-dimensional function that satisfies

ε E_{*} (d^{*} ε ∣ X, Z) + \mathcal{K}_{2} \circ \mathcal{K}_{1} (d^{*}) E_{*} (ε^{2} ∣ X, Z) - ε E_{*} {\mathcal{K}_{2} \circ \mathcal{K}_{1} (d^{*}) ε ∣ X, Z} = m_{β}^{'} (X, Z; β) ε .

(7)

Here, ∘ denotes the composite operation and $m_{β}^{'} (X, Z; β)$ is ∂m(X,Z; β)/∂β. To see that 𝒦₁{d^*(Y,X,Z)} indeed belongs to Λ^⊥, note that E[𝒦₁{d^*(Y,X,Z)}|Y,X,Z] = 𝒦₂ ∘ 𝒦₁(d^*), so re-arranging (7) gives E[𝒦₁{d^*(Y,X,Z)}|Y,X,Z] = g(X,Z)ε with $g (X, Z) = (m_{β}^{'} (X, Z; β) - E_{*} [{d^{*} - \mathcal{K}_{2} \circ \mathcal{K}_{1} (d^{*})} ε ∣ X, Z]) {E_{*} (ε^{2} ∣ X, Z)}^{- 1}$. It is worth noting that in the terminology of semiparametric theory, 𝒦₁{d^*(Y,X,Z)} is known as the locally efficient score vector

S_{eff}^{*} (Y, W, Z; β, α, η_{1}^{*}, η_{2}^{*}) \equiv \mathcal{K}_{1} {d^{*} (Y, X, Z)} .

A few remarks are in order. First, equation (7) may admit more than one solution d^*. However, by the projection theorem (Rudin, 1987), even if d^* is not unique, 𝒦₁{d^*(Y,X,Z)} is unique; see Section S.3 (Supplementary Material). Hence differences in numerical procedures for obtaining d^* will not affect the final estimating equation, which is formed using $S_{eff}^{*} (Y, W, Z; β, α, η_{1}^{*}, η_{2}^{*}) \equiv \mathcal{K}_{1} {d^{*} (Y, X, Z)}$. Second, to ensure that the parameter values are identified from the ensuing estimating equation, we require that $E {S_{eff}^{*} (Y_{i}, W_{i}, Z_{i}; β, α, η_{1}^{*}, η_{2}^{*})} = 0$ has a unique root. Third, even if the unique root property holds at the population level, the estimating equation may still have multiple roots at the sample level. As far as we are aware, selecting among the multiple roots in estimating equations is a thorny issue; empirical knowledge for root selection is usually needed in practice. Lastly, because 𝒦₁{d^*(Y,X,Z)} is constructed to be an element of Λ^⊥ and all elements of Λ^⊥ yield consistent estimators for β (Theorem 1), the choice of $η_{1}^{*}, η_{2}^{*}$ in forming 𝒦₁{d^*(Y,X,Z)} does not affect consistency. See Section 3.3 for a discussion of choosing $η_{1}^{*}, η_{2}^{*}$ in practice. To the best of our knowledge, this is the only existing root-n consistent estimator for the RMM with measurement error that does not require estimating the unknown η₁, η₂.

3.2 Algorithm for estimating Ω_U and β

The algorithm for estimating Ω_U and β in model (1) and (2) is as follows.

Step 1. Recall that $W_{i} = \sum_{j = 1}^{ℓ} W_{i j} / ℓ$ and $V_{i} = \sum_{j = 1}^{ℓ} (W_{i j} - W_{i}) {(W_{i j} - W_{i})}^{T}$. Solve for Ω̂_U as the root of (3) and form α̂_n = vech(Ω̂_U).
Step 2. Propose a working density model $η_{1}^{*}$ for η₁.
Step 3. Propose a working density model $η_{2}^{*}$ for η₂ that satisfies E_*(ε|X,Z) = 0.
Step 4. Perform 𝒦₁, 𝒦₂, E_*(·|X,Z) under p_W|X,Z(w|x, z; α̂_n), $η_{1}^{*}$, and $η_{2}^{*}$. Solve for d^*(Y,X,Z) from (7). When (7) admits more than one solution, pick one arbitrarily.
Step 5. Form the score vector $S_{eff}^{*} (Y, W, Z; β, {\hat{α}}_{n}, η_{1}^{*}, η_{2}^{*}) = \mathcal{K}_{1} (d^{*})$ by calculating 𝒦₁ under $η_{1}^{*}$ and p_W|X,Z(w|x, z; α̂_n). Even if (7) has multiple solutions, they will yield the same 𝒦₁(d^*) (Rudin, 1987).
Step 6. Solve the estimating equation $\sum_{i = 1}^{n} S_{eff}^{*} (Y_{i}, W_{i}, Z_{i}; β, {\hat{α}}_{n}, η_{1}^{*}, η_{2}^{*}) = 0$ for the estimator β̂_n.

In estimating β, we have treated α via a plug-in estimator obtained from Step 1. Alternatively, we can also augment α to β and simultaneously estimate both using the procedure from Step 2 on. That is, we may solve for ${\hat{θ}}_{n} = {({\hat{α}}_{n}^{T}, {\hat{β}}_{n}^{T})}^{T}$ as the root of

\sum_{i = 1}^{n} \mathcal{S} (Y_{i}, W_{i}, V_{i}, Z_{i}; θ, η_{1}, η_{2}) = 0.

(8)

Here, 𝒮 = (ϕ^T, f^T )^T with ϕ denoting the estimating equations in (3) corresponding to the α elements, and f ∈ Λ^⊥. In our algorithm, we set $f = S_{eff}^{*}$ , and $η_{1} = η_{1}^{*}, η_{2} = η_{2}^{*}$ . Solving for α̂_n and β̂_n simultaneously does not change the analysis.

The numerical implementation of the algorithm is given in Section 3.5. We now give some remarks regarding the algorithm.

3.3 Selection and impact of working models $η_{1}^{*}, η_{2}^{*}$

One flexibility of our algorithm is the ability to choose possibly incorrect, working models $η_{1}^{*}, η_{2}^{*}$ for η₁, η₂ (Steps 2 and 3 of the algorithm). We now discuss a practical approach for selecting these working models and its impact on the consistency and efficiency of β̂_n.

Remark 1

When either or both $η_{1}^{*}, η_{2}^{*}$ are misspecified and the measurement error distribution is estimated as p_W|X,Z(w|x, z; α̂_n), the algorithm still provides a consistent estimator.

To prove the consistency claim in Remark 1, we impose the following regularity conditions, stated using the general 𝒮 = (ϕ^T, f^T )^T and θ = (α^T, β^T )^T notation. We assume θ belongs to a domain of interest Θ which is a compact set.

(R1) The estimating equation in (8) and its expectation E{𝒮(Y,W, V,Z; θ, η₁, η₂)} are sufficiently smooth in (θ, η₁, η₂) in a neighborhood of (θ₀, η₁₀, η₂₀). This condition is needed so that the weak law of large numbers is valid.
(R2) The matrix E{∂𝒮(Y,W, V,Z; θ, η₁, η₂)/∂θ^T } is invertible, bounded and smooth in (θ, η₁, η₂) in a neighborhood of (θ₀, η₁₀, η₂₀). This assumption permits the re-arrangement of a Taylor expansion and hence, the applicability of the central limit theorem.
(R3) For $η_{1} = η_{1}^{*}, η_{2} = η_{2}^{*}$, the equation E{𝒮(Y_i,W_i, V_i, Z_i; θ, η₁, η₂)} = 0 has a unique solution and E{sup_{θ∈Θ}|𝒮(Y_i,W_i, V_i, Z_i; θ, η₁, η₂)|} < ∞ componentwise. The unique solution requirement is commonly needed in semiparametric estimation and in parametric estimation, except when the objective function guarantees the unique root property such as when it is convex. While a globally unique root property is somewhat restrictive, one can instead require a unique root in a region of interest, so long as it is justifiable to consider parameters only in that region.

We show in Section S.4 (Supplementary Material) that under these regularity conditions, θ̂_n is a consistent estimator even when $η_{1} = η_{1}^{*}, η_{2} = η_{2}^{*}$ are misspecified.

From Remark 1, in terms of obtaining a consistent estimator, we are free to choose any working model $η_{1}^{*}$ and $η_{2}^{*}$ . Thus, for computational ease, we suggest using Gaussian models due to their simplicity. We also recommend choosing the support of $η_{1}^{*}, η_{2}^{*}$ to be as large as that of the true distributions so as to maintain numerical stability. Of course, the true distributions are unknown, so the latter requirement may be achieved by choosing the support based on the observed data. For example, after centering the observed data (Y,W,Z), one may choose $η_{1}^{*}$ to be a normal distribution with mean zero and variance equal to the sample variance of W. Likewise, one may choose $η_{2}^{*}$ to be a normal distribution with mean zero and variance as estimated from the residual sum of squares after regressing Y on m(W, Z; β).
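The data-driven choice just described can be sketched as follows for the simplest hypothetical case of a scalar surrogate, no Z, and a linear mean m(W; β) = βW after centering. The function names, the tiny data set, and the use of ordinary least squares for the residual variance are illustrative assumptions; for a nonlinear m one would substitute the corresponding nonlinear least squares fit.

```python
import math

def gaussian_pdf(var):
    """Density of a mean-zero normal distribution with the given variance."""
    return lambda t: math.exp(-t * t / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)

def working_models(y, w_bar):
    """Gaussian working models: eta1* from var(W), eta2* from least squares residuals."""
    n = len(y)
    mean_y = sum(y) / n
    mean_w = sum(w_bar) / n
    yc = [v - mean_y for v in y]        # centered response
    wc = [v - mean_w for v in w_bar]    # centered surrogate
    var_w = sum(v * v for v in wc) / (n - 1)
    slope = sum(a * b for a, b in zip(wc, yc)) / sum(v * v for v in wc)
    rss = sum((a - slope * b) ** 2 for a, b in zip(yc, wc))
    var_eps = rss / (n - 2)             # two fitted quantities: mean and slope
    return gaussian_pdf(var_w), gaussian_pdf(var_eps), var_w, var_eps

# Tiny illustrative data set (replicate averages and responses).
w_bar = [0.0, 1.0, 2.0, 3.0, 4.0]
y = [1.0, 3.5, 4.5, 7.5, 9.0]
eta1_star, eta2_star, var_w, var_eps = working_models(y, w_bar)
print(var_w, var_eps, eta1_star(0.0), eta2_star(0.0))
```

By Remark 1, neither working density needs to be correct for consistency; the purpose of matching their scale to the observed data is numerical stability.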

Although any working models $η_{1}^{*}, η_{2}^{*}$ maintain consistency, they affect efficiency in theory as we now describe.

Remark 2

The choice of $η_{1}^{*}, η_{2}^{*}$ only affects β, so we characterize the efficiency for β only with α fixed at the truth. When the working models are correct, i.e., $η_{1}^{*} = η_{10}$ and $η_{2}^{*} = η_{20}$ , the algorithm gives the optimal estimator in that its estimation variance achieves the semiparametric efficiency bound (Tsiatis, 2006, chap. 4). Such results follow because, in this case, the resulting estimator solves the true efficient score estimating equation $\sum_{i = 1}^{n} S_{eff} (Y_{i}, W_{i}, Z_{i}; {\hat{β}}_{n}, α_{0}, η_{10}, η_{20}) = 0$ .

Justification of Remark 2 follows from the principles of semiparametric theory (Tsiatis, 2006, chap. 4). From Remark 2, if the working models $η_{1}^{*}, η_{2}^{*}$ are exactly the true models η₁₀, η₂₀, then the resulting estimator β̂_n is most efficient. Of course knowing the true models η₁₀, η₂₀ is rarely an option. Hence, some efficiency loss is expected since the working models will most likely differ from the truth. The incurring loss depends on the proposed working models, and can be theoretically characterized as follows.

Let $S_{eff}^{*}$ be as in Step 5 of the algorithm which is constructed under the possibly misspecified working models $η_{1}^{*}, η_{2}^{*}$ . Let $A_{*} = E {\partial S_{eff}^{*} (Y, W, Z; β_{0}, α_{0}, η_{1}^{*}, η_{2}^{*}) / \partial β}$ and $B_{*} = var {S_{eff}^{*} (Y, W, Z; β_{0}, α_{0}, η_{1}^{*}, η_{2}^{*})}$ , where β₀, α₀ denote the true parameter values. The asterisks in A_* and B_* are used to emphasize that $S_{eff}^{*}$ depends on the working models. Finally, let A,B and S_eff be defined analogously to A_*,B_*, and $S_{eff}^{*}$ , respectively, except with $η_{1}^{*} = η_{10}$ and $η_{2}^{*} = η_{20}$ .

In Theorem 2 (see Section 3.4), we demonstrate that under working models $η_{1}^{*}, η_{2}^{*}$, the estimator β̂_n is asymptotically normal with mean zero and variance-covariance $A_{*}^{- 1} B_{*} {(A_{*}^{- 1})}^{T}$. In comparison, under the true η₁₀, η₂₀, the asymptotic variance-covariance of β̂_n is A⁻¹B(A⁻¹)^T. Therefore, the theoretical efficiency loss of the estimator computed under misspecified working models, relative to the estimator computed under the truth, is the difference $A_{*}^{- 1} B_{*} {(A_{*}^{- 1})}^{T} - A^{- 1} B {(A^{- 1})}^{T}$. This difference is identical to $E {(A_{*}^{- 1} S_{eff}^{*} - A^{- 1} S_{eff}) {(A_{*}^{- 1} S_{eff}^{*} - A^{- 1} S_{eff})}^{T}}$ (see Section S.5 in Supplementary Material), which means that the efficiency loss is positive semidefinite. The precise efficiency loss can thus be evaluated in each case. In our limited empirical studies (see Section 4.3.2), it has been observed that the loss is generally small, and the estimation variance is quite insensitive to the choice of the working models.

In summary, our procedure allows flexible working models $η_{1}^{*}, η_{2}^{*}$ to construct consistent estimators and achieves local efficiency. This contrasts with existing methods in the literature, including that from Tsiatis and Ma (2004), which are highly sensitive to the variance misspecification of the model error. Moreover, in bypassing the estimation of η₁, η₂, our algorithm avoids unnecessary work in the process of estimating α and β.

3.4 Theoretical properties

We describe the theoretical properties of ${\hat{θ}}_{n} = {({\hat{α}}_{n}^{T}, {\hat{β}}_{n}^{T})}^{T}$ under working models $η_{1}^{*} (x, z; γ_{1})$ and $η_{2}^{*} (ε, x, z; γ_{2})$ where γ₁, γ₂ are finite-dimensional parameters. The parameters γ₁, γ₂ reflect the common practice of using parametric forms for the working models $η_{1}^{*}, η_{2}^{*}$ . The true forms η₁₀, η₂₀ may or may not belong to these working model families.

Let $γ = {(γ_{1}^{T}, γ_{2}^{T})}^{T}$ belong to a compact set 𝒢 and γ̂_n be an estimator of γ. We assume γ̂_n is root-n consistent in the proposed working models, so n^{1/2}(γ̂_n − γ^*) is bounded in probability for some constant γ^*. We now demonstrate that under $η_{1}^{*} (x, z; γ_{1})$ and $η_{2}^{*} (ε, x, z; γ_{2})$, the estimator θ̂_n is asymptotically normal, and its efficiency does not depend on how efficiently we estimate γ.

To establish these results, we further make the following assumptions:

(R4) The equation E{𝒮(Y_i,W_i, Z_i, V_i; θ, γ^*)} = 0 has a unique solution. In addition, E{sup_{θ∈Θ,γ^*∈ 𝒢}|𝒮(Y_i,W_i, Z_i, V_i; θ, γ^*)|} < ∞ componentwise, and the expectation of the squared ℓ₂ norm of 𝒮, i.e. E{||𝒮(Y_i,W_i, Z_i, V_i; θ₀, γ^*)||²}, is bounded. This condition is similar to condition (R3).
(R5) $n^{- 1} \sum_{i = 1}^{n} \partial \mathcal{S} (Y_{i}, W_{i}, V_{i}, Z_{i}; θ, γ) / \partial θ$ converges in probability to E{∂𝒮(Y_i,W_i, V_i, Z_i; θ, γ)/∂θ} uniformly in (θ, γ) in a neighborhood of (θ₀, γ^*).
(R6) $n^{- 1} \sum_{i = 1}^{n} \partial \mathcal{S} (Y_{i}, W_{i}, V_{i}, Z_{i}; θ, γ) / \partial γ$ converges in probability to E{∂𝒮(Y_i,W_i, V_i, Z_i; θ, γ)/∂γ} uniformly in (θ, γ) in a neighborhood of (θ₀, γ^*).

The last two conditions are very mild and are generally satisfied following the law of large numbers and equicontinuity conditions.

Our first theoretical result shows that θ̂_n is asymptotically normal whether $η_{1}^{*} (x, z; γ_{1})$ and $η_{2}^{*} (ε, x, z; γ_{2})$ contain the true η₁₀, η₂₀ or not.

Theorem 2

Let f be an arbitrary p-dimensional function belonging to Λ^⊥ in Theorem 1. Let $η_{1}^{*} (x, z; γ_{1})$ and $η_{2}^{*} (ε, x, z; γ_{2})$ be working parametric models for η₁, η₂. Let $γ = {(γ_{1}^{T}, γ_{2}^{T})}^{T}$ and γ̂_n be its estimate such that for some constant γ^*, n^1/2(γ̂_n−γ^*) is bounded in probability. Finally, let ${\hat{θ}}_{n} = {({\hat{α}}_{n}^{T}, {\hat{β}}_{n}^{T})}^{T}$ denote the estimator and $θ_{0} = {(α_{0}^{T}, β_{0}^{T})}^{T}$ denote the truth. Under regularity conditions (R1)–(R6), the root θ̂_n of $\sum_{i = 1}^{n} S (Y_{i}, W_{i}, V_{i}, Z_{i}; θ, {\hat{γ}}_{n}) = 0$ is consistent and

\sqrt{n} ({\hat{θ}}_{n} - θ_{0}) \to Normal (0, V_{*})

in distribution as n→∞. Here, $V_{*} = A_{*}^{- 1} B_{*} {(A_{*}^{- 1})}^{T}$ with 𝒜_* = E{∂𝒮(Y,W, V,Z; θ₀, γ^*)/∂θ^T } and ℬ_* = diag[var{ϕ(V ; α₀)}, var{f(Y,W,Z; θ₀, γ^*)}], a block diagonal matrix.

In Theorem 2, the arguments in $\sum_{i = 1}^{n} S (Y_{i}, W_{i}, V_{i}, Z_{i}; θ, {\hat{γ}}_{n}) = 0$ differ from those in (8) in that we have replaced $η_{1}^{*}, η_{2}^{*}$ with ${\hat{γ}}_{n} = {({\hat{γ}}_{n 1}^{T}, {\hat{γ}}_{n 2}^{T})}^{T}$ to emphasize our use of parametric working models for $η_{1}^{*}, η_{2}^{*}$ . The results in Theorem 2 hold because, by a Taylor expansion and the properties of Λ^⊥, we can express $\sqrt{n} ({\hat{θ}}_{n} - θ_{0})$ as a normalized sum of zero-mean random vectors. Consequently, by our regularity assumptions and the central limit theorem, this normalized sum converges in distribution to a multivariate normal with zero mean and variance-covariance 𝒱_*; see Section S.6 (Supplementary Material) for complete details. The result in Theorem 2 is also useful for performing inference on θ, where in practice 𝒱_* is estimated by the sandwich estimator ${\hat{A}}_{*}^{- 1} {\hat{B}}_{*} {({\hat{A}}_{*}^{- 1})}^{T}$ . Here,

\begin{array}{l} {\hat{A}}_{*} = n^{- 1} \sum_{i = 1}^{n} \partial S (Y_{i}, W_{i}, V_{i}, Z_{i}; {\hat{θ}}_{n}, {\hat{γ}}_{n}) / \partial θ^{T}, \\ {\hat{B}}_{*} = n^{- 1} \sum_{i = 1}^{n} S (Y_{i}, W_{i}, V_{i}, Z_{i}; {\hat{θ}}_{n}, {\hat{γ}}_{n}) S^{T} (Y_{i}, W_{i}, V_{i}, Z_{i}; {\hat{θ}}_{n}, {\hat{γ}}_{n}) . \end{array}
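The sandwich estimator above can be sketched numerically as follows. This is a minimal illustration, not the authors' software: the function name `sandwich_variance`, the callable `score` (returning one row of 𝒮 per observation, with γ̂_n held fixed inside it), and the use of central finite differences for the Jacobian are all assumptions made for the example.

```python
import numpy as np

def sandwich_variance(score, theta_hat, data, eps=1e-6):
    """Return A^{-1} B A^{-T}, the estimated asymptotic variance of
    sqrt(n) * (theta_hat - theta_0).

    score(data, theta) is a hypothetical user-supplied function that
    returns an (n, p) array whose i-th row is S(data_i; theta).
    """
    theta_hat = np.asarray(theta_hat, dtype=float)
    S = score(data, theta_hat)                    # (n, p) scores at the root
    n, p = S.shape

    # A_hat: sample average of dS/dtheta^T, by central finite differences.
    A = np.empty((p, p))
    for k in range(p):
        step = np.zeros(p)
        step[k] = eps
        diff = score(data, theta_hat + step) - score(data, theta_hat - step)
        A[:, k] = diff.mean(axis=0) / (2.0 * eps)

    # B_hat: sample average of the outer products S S^T.
    B = S.T @ S / n

    A_inv = np.linalg.inv(A)
    return A_inv @ B @ A_inv.T
```

Dividing the returned matrix by n gives an estimated variance for θ̂_n itself, from which Wald-type intervals can be formed.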

Remark 3

The result in Theorem 2 applies to any function f ∈ Λ^⊥. In Section 3.1, we argued that a particular function in Λ^⊥ is $S_{eff}^{*} = K_{1} (d^{*})$ . Thus, by Remark 1 and Theorem 2, when $S = {(ϕ^{T}, S_{eff}^{*^{T}})}^{T}$ , the resulting estimator θ̂_n from our proposed algorithm is consistent and asymptotically normal.

Our second theoretical result demonstrates that the asymptotic efficiency of θ̂_n does not depend on how efficiently we estimate γ in the working parametric models. Specifically, consider the case when θ̂_n solves the estimating equation $\sum_{i = 1}^{n} S (Y_{i}, W_{i}, V_{i}, Z_{i}; θ, {\hat{γ}}_{n}) = 0$ , and θ̌_n solves the estimating equation $\sum_{i = 1}^{n} S (Y_{i}, W_{i}, V_{i}, Z_{i}; θ, γ^{*}) = 0$ , where 𝒮 = (ϕ^T, f^T)^T and f belongs to Λ^⊥ in Theorem 1. Theorem 2 guarantees that θ̂_n and θ̌_n are root-n consistent and asymptotically normal. A stronger result, shown below, is that θ̂_n and θ̌_n also have the same asymptotic efficiency even though the former involves the estimated γ̂_n, and the latter only involves the constant γ^*. Thus, as long as γ̂_n is root-n consistent for γ^*, using either γ̂_n or γ^* in the working parametric models yields the same asymptotic efficiency.

Theorem 3

Let the p-dimensional function f belong to Λ^⊥ in Theorem 1. Assume γ̂_n is such that n^1/2(γ̂_n − γ^*) is bounded in probability. Then, under regularity conditions (R1)–(R6), the efficiency of the estimator θ̂_n obtained as the root of $\sum_{i = 1}^{n} S (Y_{i}, W_{i}, V_{i}, Z_{i}; θ, {\hat{γ}}_{n}) = 0$ is asymptotically equivalent to the efficiency of the estimator θ̌_n obtained as the root of $\sum_{i = 1}^{n} S (Y_{i}, W_{i}, V_{i}, Z_{i}; θ, γ^{*}) = 0$ . Namely, both n^1/2(θ̂_n − θ₀) and n^1/2(θ̌_n − θ₀) are asymptotically normal with mean zero and variance-covariance 𝒱_* as in Theorem 2.

The proof of Theorem 3 follows analogously to that of Theorem 2 in that $\sqrt{n} ({\hat{θ}}_{n} - θ_{0})$ and $\sqrt{n} ({\overset{ˇ}{θ}}_{n} - θ_{0})$ can be expressed as the same normalized sum of zero-mean random vectors via Taylor expansion; see Section S.7 (Supplementary Material). Thus, because the first order expansions of θ̂_n and θ̌_n are the same, it immediately follows from the regularity conditions and central limit theorem that both estimators are asymptotically normal with identical variance-covariance 𝒱_*.
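A heuristic sketch of why the two expansions coincide is given below; the complete argument is in the Supplementary Material. The sketch assumes that E{𝒮(Y,W,V,Z; θ₀, γ)} = 0 for every γ in 𝒢, which is what membership of f in Λ^⊥ under any working model provides, and that differentiation and expectation can be interchanged. Writing 𝒮_i(θ, γ) for 𝒮(Y_i,W_i,V_i,Z_i; θ, γ):

```latex
\begin{aligned}
0 &= n^{-1/2} \sum_{i=1}^{n} S_i(\hat{θ}_n, \hat{γ}_n) \\
  &= n^{-1/2} \sum_{i=1}^{n} S_i(θ_0, γ^{*})
     + A_{*} \, n^{1/2} (\hat{θ}_n - θ_0)
     + E\left\{ \frac{\partial S(θ_0, γ^{*})}{\partial γ^{T}} \right\} n^{1/2} (\hat{γ}_n - γ^{*})
     + o_p(1), \\
E\left\{ \frac{\partial S(θ_0, γ^{*})}{\partial γ^{T}} \right\}
  &= \frac{\partial}{\partial γ^{T}} E\{ S(θ_0, γ) \} \Big|_{γ = γ^{*}} = 0, \\
n^{1/2} (\hat{θ}_n - θ_0)
  &= - A_{*}^{-1} \, n^{-1/2} \sum_{i=1}^{n} S_i(θ_0, γ^{*}) + o_p(1).
\end{aligned}
```

The last line does not involve γ̂_n, so the same expansion holds with γ̂_n replaced by γ^*; conditions (R5) and (R6) are what justify replacing the sample derivative averages by their expectations.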

The results in Theorems 2 and 3 hold whether or not $η_{1}^{*} (x, z; γ_{1})$ and $η_{2}^{*} (ε, x, z; γ_{2})$ contain the true distributions η₁₀, η₂₀. However, when the working parametric models do contain the true distributions, the resulting estimator θ̂_n is actually semiparametric efficient as noted below.

Remark 4

A particularly interesting case is when f is the efficient score $S_{eff}^{*}$ as in our algorithm. Since $S_{eff}^{*} \in Λ^{⊥}$ , Theorem 3 tells us that if correct parametric models with parameters γ are used for η₁(x, z), η₂(ε, x, z), and root-n estimators can be found for the parameters γ, then it is as if η₁(x, z), η₂(ε, x, z) were known precisely. In this case, we achieve optimal semiparametric efficiency. This is a stronger statement than Remark 2.

In practice, a correct parametric model is certainly not easy to obtain. It requires good knowledge of η₁(x, z) and η₂(ε, x, z), both of which are “invisible” due to the unobservable X’s. Thus, if reducing estimation variability is important, one can propose a relatively large model for η₁ and η₂, and proceed with the locally efficient estimator. With richer models of η₁, η₂, the chance of achieving efficiency is increased.

3.5 Implementation of the algorithm

Steps 1–3 in our algorithm are easily handled by following the guidance in Section 3.3 for selecting $η_{1}^{*}, η_{2}^{*}$ . We thus focus on the details for executing Steps 4–6.

Step 4 requires solving for d^*(Y,X,Z) from the ill-posed problem in (7). Although this ill-posed problem may at first appear challenging, we benefit from two aspects. First, solving for d^* is a “good” ill-posed problem in the sense that the ill-posedness is only because more than one solution may satisfy (7). This is beneficial since our objective is to find any one of these solutions. Second, what we really need for estimation and inference is not d^* itself, but a smoothed version of d^*, namely 𝒦₁(d^*) = E(d^*|Y,W,Z), which is unique and hence no longer an ill-posed problem. We now demonstrate how (7) can be solved analytically in some cases and numerically otherwise.

3.5.1 Analytic d^*

For some mean models, d^* may be computed analytically such as for the simple, linear RMM with two replicates:

Y_{i} = β_{1} + β_{2} X_{i} + ε_{i}, W_{i j} = X_{i} + U_{i j}, E (ε_{i} ∣ X_{i}) = 0,

for i = 1, … , n, j = 1, 2. Here, U_ij is normally distributed with mean zero and unknown variance $2 σ_{U}^{2}$ .

Following our algorithm, solve for ${\hat{σ}}_{U}^{2}$ from (3) and let W_i = (W_i₁ + W_i₂)/2. With η₁ ≡ p_X(x) and η₂ ≡ p_ε|X(ε|x), we suppose that (Y_i,W_i) are standardized so that it is reasonable to posit $η_{1}^{*}, η_{2}^{*}$ as standard normals. Then, under $η_{1}^{*}, η_{2}^{*}$ , an analytic solution to (7) is $d^{*} = {(d_{1}^{*}, d_{2}^{*})}^{T}$ with

\begin{array}{l} d_{1}^{*} (Y, X) = Y - β_{1} - β_{2} X (1 + c_{1}^{- 1} {\hat{σ}}_{U}^{2}), \\ d_{2}^{*} (Y, X) = c_{2}^{- 1} β_{2} {\hat{σ}}_{U}^{2} \{c_{1} (1 - β_{1}^{2}) + {\hat{σ}}_{U}^{2} + 1 - c_{1} (Y - 2 β_{1}) Y\} + c_{2}^{- 1} (c_{1} + {\hat{σ}}_{U}^{2}) X \{(2 c_{1} - 1) (Y - β_{1}) - β_{2} (c_{1} + {\hat{σ}}_{U}^{2}) X\}, \end{array}

and $c_{1} = 1 + β_{2}^{2} {\hat{σ}}_{U}^{2}, c_{2} = c_{1} (1 + 2 {\hat{σ}}_{U}^{2}) - {\hat{σ}}_{U}^{2}$ .

Using the analytic d^* to form the score vector $S_{eff}^{*} (Y, W; β, {\hat{σ}}_{U}^{2}, η_{1}^{*}, η_{2}^{*}) = K_{1} (d^{*})$ then yields that β̂_n solves

0 = C n^{- 1} \sum_{i = 1}^{n} \left\{ \begin{pmatrix} 1 & W_{i} \\ W_{i} & W_{i}^{2} - {\hat{σ}}_{U}^{2} \end{pmatrix} \begin{pmatrix} β_{1} \\ β_{2} \end{pmatrix} - \begin{pmatrix} Y_{i} \\ Y_{i} W_{i} \end{pmatrix} \right\},

where $C = diag (c_{1}^{- 1}, - c_{2}^{- 1})$ . Because C is non-singular, the above estimating equation is exactly the same explicit form previously given in Hall and Ma (2007). In other words, the estimator in Hall and Ma (2007) is a special case of our solution family corresponding to a natural choice of standard normals for the working models $η_{1}^{*}, η_{2}^{*}$ .
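Because C is non-singular, the estimating equation reduces to a 2×2 linear system in (β₁, β₂), which the following sketch solves directly. The function name `corrected_linear_fit` is hypothetical, and the moment estimator of $σ_{U}^{2}$ based on V_i = (W_i₁ − W_i₂)/2 is an assumption standing in for equation (3), which is not reproduced in this section.

```python
import numpy as np

def corrected_linear_fit(y, w1, w2):
    """Solve the bias-corrected estimating equation for the linear RMM
    Y = beta1 + beta2 * X + eps with two replicates W_j = X + U_j.

    Returns (beta_hat, sigma2_u_hat).
    """
    w = (w1 + w2) / 2.0          # averaged surrogate W_i
    v = (w1 - w2) / 2.0          # has mean zero and variance sigma_U^2
    # Assumed moment estimator of sigma_U^2 (placeholder for equation (3)).
    sigma2_u = np.mean(v ** 2)

    # Average of the 2x2 matrices in the estimating equation.
    M = np.array([[1.0, w.mean()],
                  [w.mean(), np.mean(w ** 2) - sigma2_u]])
    # Average of the vectors (Y_i, Y_i W_i)^T.
    b = np.array([y.mean(), np.mean(y * w)])

    beta = np.linalg.solve(M, b)
    return beta, sigma2_u
```

Subtracting the estimated measurement error variance from the average of W_i² is what removes the attenuation that an ordinary least squares fit of Y on W would suffer.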

3.5.2 Numerical d^*

For general mean models, d^* is computed numerically. The implementation below is provided in software available on the first-author’s website. We propose solving for d^* by approximating it with a linear combination of basis functions. For ease of presentation, we demonstrate the procedure when X and Z are univariate; however, the method extends to the multivariate case.

In our approach, we approximate d^* by

d^{*} (Y, X, Z) = \sum_{j, k = 1}^{q} c_{j k} (Z) g_{k} (Y) h_{j} (X),

where c_jk(Z), j, k = 1, … , q, is a p-dimensional vector of unknown coefficients, and g_k(·), h_j(·) are sets of real-valued basis functions (e.g., Hermite polynomials, Chebyshev polynomials, Fourier series, B-splines, Legendre polynomials). The number of basis functions q is chosen to give an accurate approximation while permitting fast computation; the value required depends on the true d^*(Y,X,Z) function and on the type of basis functions. Empirically, we suggest starting from q = 4 and increasing it until the result stabilizes.
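To make the tensor-product form concrete, the sketch below evaluates one component of such an expansion at a fixed Z. It assumes probabilists' Hermite polynomials for both g_k and h_j and uses zero-based indices (He_0, …, He_{q−1}); the function name `eval_basis_expansion` and the layout of the coefficient array are illustrative choices, not part of the paper.

```python
import numpy as np
from numpy.polynomial.hermite_e import hermevander

def eval_basis_expansion(coef, y, x):
    """Evaluate sum_{j,k} coef[j, k] * h_j(x) * g_k(y) at paired points.

    coef is a (q, q) array for a single component of d* at a fixed Z.
    Both bases are the probabilists' Hermite polynomials He_0..He_{q-1}.
    """
    q = coef.shape[0]
    G = hermevander(np.atleast_1d(y), q - 1)   # (n, q); column k is He_k(y)
    H = hermevander(np.atleast_1d(x), q - 1)   # (n, q); column j is He_j(x)
    return np.einsum("nj,jk,nk->n", H, coef, G)
```

For a p-dimensional d^*, one such coefficient array would be needed per component and per observed value of Z.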

With d^* as above, the goal then is to form (7) and solve for the coefficients c_jk(Z), j, k = 1, . . . , q. To this end, (7) becomes

\begin{array}{l} \sum_{j, k = 1}^{q} c_{j k} (Z) ε h_{j} (X) E_{*} \{g_{k} (Y) ε ∣ X, Z\} \\ + \sum_{j, k = 1}^{q} c_{j k} (Z) g_{k} (Y) K_{2} \circ K_{1} \{h_{j} (X)\} E_{*} (ε^{2} ∣ X, Z) \\ - \sum_{j, k = 1}^{q} c_{j k} (Z) ε E_{*} [g_{k} (Y) K_{2} \circ K_{1} \{h_{j} (X)\} ε ∣ X, Z] = m_{β}^{'} (X, Z; β) ε . \end{array}

(9)

Under the working models $η_{1}^{*}$ and $η_{2}^{*}$ , we evaluate the expectations in (9) using discretization and quadrature integration (e.g., Hermite quadrature). Specifically, we discretize $η_{1}^{*} (x, z)$ at r points x₁, . . . , x_r across the support of X with weights given by $η_{1}^{*} (x, z) = \sum_{s = 1}^{r} p_{s} (z) I (x = x_{s})$ such that $\sum_{s = 1}^{r} p_{s} (z) = 1$ for all z in the support of Z. Under this discretization, the terms in (9) are computed using the formulas

\begin{array}{l} K_{1} \{f_{1} (Y, X, Z)\} = \frac{\sum_{s = 1}^{r} f_{1} (Y, x_{s}, Z) p_{W ∣ X, Z} (W ∣ x_{s}, Z; {\hat{α}}_{n}) η_{2}^{*} \{Y - m (x_{s}, Z; β), x_{s}, Z\} p_{s} (Z)}{\sum_{s = 1}^{r} p_{W ∣ X, Z} (W ∣ x_{s}, Z; {\hat{α}}_{n}) η_{2}^{*} \{Y - m (x_{s}, Z; β), x_{s}, Z\} p_{s} (Z)}, \\ K_{2} \{f_{2} (Y, W, Z)\} = \int f_{2} (Y, w, Z) p (w ∣ X, Z; {\hat{α}}_{n}) d μ (w), \\ E_{*} \{f_{1} (Y, X, Z) ∣ X, Z\} = \int f_{1} (y, X, Z) η_{2}^{*} \{y - m (X, Z; β), X, Z\} d μ (y), \end{array}

for appropriate functions f₁(Y,X,Z), f₂(Y,W,Z). Finally, the integrals in 𝒦₂ and E_*(·|X,Z) are evaluated using quadrature integration (Kress, 1999, chap. 12). It is important to note that our way of discretizing $η_{1}^{*} (x, z)$ reduces the computation of 𝒦₁ to a simple summation of functions evaluated at x₁, . . . , x_r. Doing so avoids the complex task of estimating the unknown distribution p_X|Y,W,Z(x|y,w, z). The number of discretization points r controls the integral approximation accuracy. Empirically, we suggest starting with r = 20 and increasing it until the results stabilize.
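The summation formula for 𝒦₁ can be illustrated with a short sketch. It makes several simplifying assumptions that are not in the paper: there is no covariate Z, both working models are normal, the measurement error is normal with known variance, and $η_{1}^{*}$ is discretized on an equally spaced grid with weights proportional to the standard normal density. All function names are hypothetical.

```python
import numpy as np

def normal_pdf(t, mean, var):
    """Density of Normal(mean, var) evaluated at t."""
    return np.exp(-0.5 * (t - mean) ** 2 / var) / np.sqrt(2.0 * np.pi * var)

def discretize_eta1(r=20, lo=-4.0, hi=4.0):
    """Discretize a standard normal working model eta_1^* at r points.

    Returns grid points x_s and weights p_s that sum to one.
    """
    xs = np.linspace(lo, hi, r)
    ps = normal_pdf(xs, 0.0, 1.0)
    return xs, ps / ps.sum()

def K1(f1, y, w, xs, ps, m, sigma2_u, sigma2_eps):
    """Approximate E{f1(Y, X) | Y = y, W = w} by the weighted sum over x_s.

    m is the mean function x -> m(x; beta); the working error model
    eta_2^* is taken to be Normal(0, sigma2_eps).
    """
    weights = (normal_pdf(w, xs, sigma2_u)             # p(W | x_s)
               * normal_pdf(y - m(xs), 0.0, sigma2_eps)  # eta_2^*{y - m(x_s)}
               * ps)                                    # p_s
    return np.sum(f1(y, xs) * weights) / np.sum(weights)
```

With a linear mean function and these normal working models, the conditional mean of X given (Y, W) is available in closed form, which offers a simple way to check how large r needs to be.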

In the last step of solving for the coefficients of d^*, each term in (9) is evaluated at q² grid-points (y_m, x_ℓ, Z) for m, ℓ = 1, . . . , q, typically chosen as quadrature points. Doing so leads to p linear systems of size q²×q², from which we may evaluate c_jk(Z) at each observed Z. After obtaining the coefficients, we then verify that (7) is really solved by plugging in the coefficients to d^*. The verification needs to be done only at the grid points (y_m, x_ℓ, Z) because d^* was only solved for at these grid points. By having to verify d^* only at these grid points rather than at all (Y,X,Z), we essentially bypass the functional nature of solving for d^*, which means solving for d^* is actually simpler than it appears.

After the coefficients of d^* are verified, we then do Step 5 and form

S_{eff}^{*} (Y, W, Z; β, {\hat{α}}_{n}, η_{1}^{*}, η_{2}^{*}) = K_{1} (d^{*}) = \sum_{j, k = 1}^{q} c_{j k} (Z) g_{k} (Y) K_{1} \{h_{j} (X)\}

to construct $\sum_{i = 1}^{n} S_{eff}^{*} (Y_{i}, W_{i}, Z_{i}; β, {\hat{α}}_{n}, η_{1}^{*}, η_{2}^{*}) = 0$ . In Step 6, the estimator for β is then the root of the constructed estimating equation.

One possible concern about our proposed implementation is that the different numerical approximations may ultimately affect the efficiency of the proposed estimator. However, this is not the case. If d^* is constructed so that (7) is indeed satisfied, then $S_{eff}^{*} = K_{1} (d^{*})$ belongs to Λ^⊥ as stated in Section 3.1. Elements in Λ^⊥ lead to consistent estimators for β (see Theorem 1 and Remark 1) with efficiency affected only by the choice of $η_{1}^{*}, η_{2}^{*}$ , not the approximation of d^* (see Remark 2 and the ensuing discussion). A critical step is therefore ensuring that the obtained d^* does indeed satisfy (7), which is exactly what we do. Solving for d^* is thus purely a computational issue, since no data are involved in the solution process. To ensure that (7) is properly solved, one may need to choose a rich class of basis functions, for example, combinations of polynomial bases, B-splines, or Fourier series. Methods for solving equations such as (7) are well studied in numerical analysis; a full discussion can be found in Kress (1999) and references therein.

4 Empirical Studies

We now demonstrate the performance of our method and compare its results to five competing methods.

4.1 Simulation design

We consider the RMM with measurement error

Y_{i} = β_{2} exp (- β_{1} X_{i}^{2}) + β_{3} Z_{i} + ε_{i}, W_{i j} = X_{i} + U_{i j}, U_{i j} ~ Normal (0, 2 σ_{U}^{2}),

for i = 1, . . . , n and j = 1, 2. The true model parameters are ${(σ_{U}^{2}, β_{1}, β_{2}, β_{3})}^{T} = {(0.05, 0.25, 0.7, 0.5)}^{T}$ . Results for other mean models are reported in the Supplementary Material (Section S.9).

The true distribution η₁₀ of X is uniform on [ $1.1 - \sqrt{0.9}, 1.1 + \sqrt{0.9}$ ], and the true distribution η₃₀ of Z is Bernoulli with parameter 0.5. To evaluate the robustness of our method, we set the model error distribution η₂₀ to be either a uniform or t-distribution with 5 degrees of freedom (i.e., t₅ distribution), and its variance either homoskedastic or heteroskedastic. Specifically, we consider

Setting 1 : Uniform distribution.
- Homoskedastic: η₂₀ is uniform on [−1, 1];
- Heteroskedastic: η₂₀ is (|X| + 1)𝒰 where 𝒰 is a uniform distribution on [−1, 1].
Setting 2 : t₅-distribution.
- Homoskedastic: η₂₀ is 0.4t₅;
- Heteroskedastic: η₂₀ is (0.4|X| + 0.5)t₅.
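The design above can be reproduced with a short data generator. This is a sketch under the stated design only; the function name `simulate`, its arguments, and the returned dictionary layout are illustrative choices rather than the authors' code.

```python
import numpy as np

def simulate(n, setting, heteroskedastic, rng,
             beta=(0.25, 0.7, 0.5), sigma2_u=0.05):
    """Generate one data set from the simulation design of Section 4.1.

    setting: 1 for uniform model errors, 2 for t5 model errors.
    Returns a dict with y, the two replicates w (n x 2), z, and the
    unobserved x and eps for checking purposes.
    """
    b1, b2, b3 = beta
    half_width = np.sqrt(0.9)
    x = rng.uniform(1.1 - half_width, 1.1 + half_width, n)
    z = rng.binomial(1, 0.5, n)

    if setting == 1:
        base = rng.uniform(-1.0, 1.0, n)
        scale = np.abs(x) + 1.0 if heteroskedastic else 1.0
    elif setting == 2:
        base = rng.standard_t(5, n)
        scale = 0.4 * np.abs(x) + 0.5 if heteroskedastic else 0.4
    else:
        raise ValueError("setting must be 1 or 2")
    eps = scale * base

    y = b2 * np.exp(-b1 * x ** 2) + b3 * z + eps
    # Each replicate has measurement error variance 2 * sigma_U^2.
    u = rng.normal(0.0, np.sqrt(2.0 * sigma2_u), size=(n, 2))
    w = x[:, None] + u
    return {"y": y, "w": w, "z": z, "x": x, "eps": eps}
```

For example, `simulate(500, setting=2, heteroskedastic=True, rng=np.random.default_rng(1))` draws one data set of the size used in the simulations.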

4.2 Methods evaluated

For all settings, we generated 1000 data sets with sample size n = 500. Parameters $σ_{U}^{2}$ , β₁, β₂, β₃ were estimated using six different methods.

4.2.1 Proposed method

We used our proposed method where we set working models $η_{1}^{*}, η_{2}^{*}$ different from the true η₁₀, η₂₀ both in terms of distributional form and variance structure. The differences are intended to demonstrate the robustness of our method when $η_{1}^{*}, η_{2}^{*}$ differ from the truth.

In Settings 1 and 2, we let the working model $η_{1}^{*}$ be Normal(1.1, 0.9/3.5²). In Setting 1, the working model $η_{2}^{*}$ was Normal(0, 0.9²), and in Setting 2, $η_{2}^{*}$ was Normal(0, 1.7²). While the working models have supports as large as the true distributions, the proposed $η_{2}^{*}$ in no way accounts for the possible heteroskedasticity in η₂₀. Our approach was implemented following the procedure in Section 3.5 where d^* was computed numerically with q = 7 Hermite bases and r = 20 discretization points; all integrals were computed using Hermite quadrature.

4.2.2 Homoskedastic and heteroskedastic sieve estimator

The second and third methods are sieve MLEs that assume either homoskedastic or heteroskedastic model error. Specifically, the sieve MLE is the solution to

arg {max}_{β} {sup}_{(f_{1}, f_{2})} \frac{1}{n} \sum_{i = 1}^{n} ln \int f_{1} \{y_{i} - m (x, z_{i}; β) ∣ x, z_{i}\} p_{U} (w_{i} - x; {\hat{α}}_{n}) f_{2} (x) d μ (x),

(10)

where f₁, f₂ are truncated series used to estimate the unknown distributions of p_ε|X,Z(ε|x, z) and p_X|Z(x|z), respectively. For our simulations, we have that p_X|Z(x|z) = p_X(x) since X and Z are generated independently of each other; thus, f₂ is set to represent p_X(x). Lastly, p_U corresponds to the normal distribution for W_i = (W_i₁ +W_i₂)/2 and α̂_n is the vectorized solution to (3).

We consider two different sieves for f₁. The first is a homoskedastic sieve where f₁ will estimate p_ε( ε) and thus ignore any dependence between ε and (X,Z). The second is a heteroskedastic sieve where f₁ will estimate p_ε|X,Z(ε|x, z) and thus account for any dependence between ε and (X,Z).

For the homoskedastic sieve, we use the work of Schennach and Hu (2013), and use

\sqrt{f_{1} (ε)} = \sum_{j = 0}^{κ_{ε}} ξ_{j}^{ε} t_{j} (ε),

where κ_ε is a smoothing parameter, $t_{j} (x) = {(\sqrt{\sqrt{π} j! 2^{j}})}^{- 1} H_{j} (x) exp (- x^{2} / 2)$ and H_j are Hermite polynomials. To ensure that f₁(ε) is a valid density and that E(ε) = 0, we require that $\sum_{j = 0}^{κ_{ε}} {(ξ_{j}^{ε})}^{2} = 1$ and $\sum_{j = 1}^{κ_{ε} - 1} \sqrt{2 (j + 1)} ξ_{j}^{ε} ξ_{j + 1}^{ε} = 0$ . We expect that this homoskedastic f₁ will perform well when ε is in fact homoskedastic, but we do expect bias when ε is in fact heteroskedastic.
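The Hermite-function representation of the density can be checked numerically with the sketch below. It rescales the coefficients so that their squares sum to one, which is a convenience of the example rather than how a constrained optimizer would impose the constraint, and it does not enforce the mean-zero constraint; the mean can instead be inspected numerically on a grid. Function names are hypothetical.

```python
import numpy as np
from math import factorial, pi, sqrt
from numpy.polynomial.hermite import hermval

def hermite_function(j, x):
    """Orthonormal Hermite function t_j(x) built from the physicists'
    Hermite polynomial H_j."""
    c = np.zeros(j + 1)
    c[j] = 1.0                                   # selects H_j in hermval
    norm = sqrt(sqrt(pi) * factorial(j) * 2.0 ** j)
    return hermval(x, c) * np.exp(-x ** 2 / 2.0) / norm

def sieve_density(xi, x):
    """Density f(x) = {sum_j xi_j t_j(x)}^2 after rescaling xi to unit norm."""
    xi = np.asarray(xi, dtype=float)
    xi = xi / np.linalg.norm(xi)                 # enforce sum of xi_j^2 = 1
    root = sum(xi[j] * hermite_function(j, x) for j in range(len(xi)))
    return root ** 2
```

Because the t_j are orthonormal, the unit-norm constraint on the coefficients is exactly what makes the squared series integrate to one, and squaring guarantees non-negativity.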

For the heteroskedastic sieve, we extended the work of Hu and Schennach (2008), and use

\begin{array}{l} \sqrt{f_{1} (ε ∣ x, z)} = [a_{00} + a_{01} cos \{\frac{π}{ℓ_{x}} m (x, z; β)\} + a_{02} cos \{\frac{2 π}{ℓ_{x}} m (x, z; β)\}] \\ + \sum_{k = 1}^{3} [a_{k 0} + a_{k 1} cos \{\frac{π}{ℓ_{x}} m (x, z; β)\} + a_{k 2} cos \{\frac{2 π}{ℓ_{x}} m (x, z; β)\}] cos (\frac{k π}{ℓ_{e}} ε) \\ + \sum_{k = 1}^{3} [b_{k 0} + b_{k 1} cos \{\frac{π}{ℓ_{x}} m (x, z; β)\} + b_{k 2} cos \{\frac{2 π}{ℓ_{x}} m (x, z; β)\}] sin (\frac{k π}{ℓ_{e}} ε) . \end{array}

By construction m(x, z; β) ∈ [0, ℓ_x] and we simulated data such that ε ∈ [−ℓ_e, ℓ_e] for an appropriate choice of ℓ_e, so as to align with the assumptions of Hu and Schennach (2008). Finally, to ensure that f₁(ε|x, z) is a valid density and that E(ε|X,Z) = 0, we impose twelve constraints given in Section S.8 (Supplementary Material). It is important to note that our heteroskedastic sieve above differs from that in Hu and Schennach (2008) in two ways. First, we use a sieve to estimate p_ε|X,Z rather than p_U|X,Z as in Hu and Schennach (2008), who considered heteroskedastic measurement error, not heteroskedastic model error. Second, we further require that f₁ is always non-negative, while Hu and Schennach (2008) did not impose that in their numerical studies. In terms of performance, we expect that the heteroskedastic f₁ will perform well whether ε is homoskedastic or heteroskedastic, since homoskedasticity is a special case of heteroskedasticity (i.e., σ_ε(x) ≡ σ_ε).

Lastly, regardless of the form for f₁, we let

\sqrt{f_{2} (x)} = \sum_{j = 0}^{κ_{x}} ξ_{j}^{x} t_{j} (x),

where κ_x is a smoothing parameter, and t_j(x) is the Hermite representation as defined for the homoskedastic f₁. To ensure that f₂ is a valid density we require that $\sum_{j = 0}^{κ_{x}} {(ξ_{j}^{x})}^{2} = 1$ .

The homoskedastic and heteroskedastic sieve MLEs are then the solutions to the optimization problem in (10) subject to all constraints stated above: three for the homoskedastic sieve MLE and thirteen for the heteroskedastic sieve MLE. The integral in (10) is evaluated using Hermite quadrature. We set the smoothing parameters κ_ε = 6 and κ_x = 6 as in Schennach and Hu (2013), but other values were considered and yielded similar results (not reported).

4.2.3 Homoskedastic and heteroskedastic Tsiatis-Ma estimator

The fourth and fifth methods are based on the work of Tsiatis and Ma (2004). The Tsiatis-Ma (TM) estimator also uses a working model $η_{1}^{*}$ , but requires $η_{2}^{*}$ to yield a correctly specified variance structure. To demonstrate this sensitivity, we applied the TM estimator assuming homoskedastic model errors (TM-Homoskedastic) and assuming heteroskedastic model errors (TM-Heteroskedastic).

For both TM-Homoskedastic and TM-Heteroskedastic estimators, we set the working model $η_{1}^{*}$ as Normal(1.1, 0.9/3.5²). For the TM-Homoskedastic estimator, we let $η_{2}^{*}$ be Normal(0, 1/3) in Setting 1 and Normal(0, 4/15) in Setting 2. The variances for $η_{2}^{*}$ correspond to the true variances of η₂₀ when η₂₀ is homoskedastic. For the TM-Heteroskedastic estimator, we let $η_{2}^{*}$ be Normal {0, (|x|+1)²/3} in Setting 1 and Normal {0, 5(0.4|x|+0.5)²/3} in Setting 2. The variances for $η_{2}^{*}$ correspond to the true variances of η₂₀ when η₂₀ is heteroskedastic.

4.2.4 Naive estimator

The last method is the naive least squares estimator which is the solution to

arg {min}_{β} \sum_{i = 1}^{n} \{y_{i} - m (w_{i}, z_{i}; β)\}^{2} .

The naive estimator ignores measurement error and falsely assumes X_i and W_i = (W_i₁ + W_i₂)/2 are the same.

4.3 Simulation results

4.3.1 Performance of methods compared

Results in Tables 1 and 2 show the bias, estimated variance, and estimated 95% coverage probabilities for the model parameter estimates based on all six methods. Overall, all estimators consistently estimated the measurement error variance $σ_{U}^{2}$ and β₃ associated with the non-mismeasured covariate Z. Performances differed, however, for parameters β₁, β₂ which were affected by the mismeasured covariate X.
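The summary measures reported in the tables can be computed from the simulation output along the following lines. The sketch assumes that bias is the Monte Carlo mean of the estimates minus the truth and that coverage refers to Wald-type 95% intervals built from the estimated variances; both are common conventions but are assumptions here, as is the function name `summarize`.

```python
import numpy as np

def summarize(estimates, est_vars, truth, z=1.959964):
    """Summarize Monte Carlo output.

    estimates, est_vars: (R, p) arrays of point estimates and their
    estimated variances over R simulated data sets.
    truth: length-p vector of true parameter values.
    Returns (bias, empirical variance, mean estimated variance, coverage).
    """
    estimates = np.asarray(estimates, dtype=float)
    est_vars = np.asarray(est_vars, dtype=float)
    truth = np.asarray(truth, dtype=float)

    bias = estimates.mean(axis=0) - truth
    emp_var = estimates.var(axis=0, ddof=1)      # sample variance across runs
    mean_est_var = est_vars.mean(axis=0)         # average of variance estimates
    half_width = z * np.sqrt(est_vars)
    covered = (estimates - half_width <= truth) & (truth <= estimates + half_width)
    return bias, emp_var, mean_est_var, covered.mean(axis=0)
```

Close agreement between the empirical variance and the averaged estimated variance, together with coverage near 0.95, indicates that the variance estimator is well calibrated.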

Table 1.

Bias, empirical sample variances (var), averaged estimated variances ( $\hat{var}$ ), and estimated 95% coverage probabilities (CI) for ${({\hat{σ}}_{U}^{2}, {\hat{β}}^{T})}^{T}$ based on our proposed method (Semipar), homoskedastic sieve MLE (Sieve-Hom), heteroskedastic sieve MLE (Sieve-Het), Tsiatis-Ma homoskedastic estimator (TM-Hom), Tsiatis-Ma heteroskedastic estimator (TM-Het), and the naive estimator. Results based on 1000 simulations when m(X,Z; β) = β₂ exp(−β₁X²) + β₃Z, and true parameter values ${(σ_{U, 0}^{2}, β_{0}^{T})}^{T} = {(0.05, 0.25, 0.7, 0.5)}^{T}$ .

η₂₀: Homoskedastic. Setting 1 (S1): η₂₀ ~ Uniform; Setting 2 (S2): η₂₀ ~ t₅.

| Method | Statistic | S1 β̂₁ | S1 β̂₂ | S1 β̂₃ | S1 ${\hat{σ}}_{U}^{2}$ | S2 β̂₁ | S2 β̂₂ | S2 β̂₃ | S2 ${\hat{σ}}_{U}^{2}$ |
|---|---|---|---|---|---|---|---|---|---|
| Semipar | bias | −0.0065 | −0.0080 | 0.0011 | 5.1664×10⁻⁵ | −0.0056 | −0.0059 | 0.0008 | −9.2372×10⁻⁵ |
| | var | 0.0030 | 0.0031 | 0.0026 | 1.0255×10⁻⁵ | 0.0024 | 0.0025 | 0.0022 | 1.0823×10⁻⁵ |
| | $\hat{var}$ | 0.0030 | | 0.0027 | 1.0255×10⁻⁵ | 0.0024 | | 0.0021 | 9.9839×10⁻⁶ |
| | CI | 0.9500 | 0.9390 | 0.9520 | 0.9490 | 0.9440 | 0.9370 | 0.9520 | 0.9320 |
| Sieve-Hom^* | bias | 0.0066 | 0.0008 | 0.0021 | 5.1664×10⁻⁵ | 0.0371 | 0.0235 | 0.0056 | −9.2372×10⁻⁵ |
| | var | 0.0033 | 0.0030 | 0.0022 | 1.0255×10⁻⁵ | 0.0046 | 0.0035 | 0.0023 | 1.0823×10⁻⁵ |
| | $\hat{var}$ | | | | | | | | |
| Sieve-Het^* | bias | 0.5022 | 0.8177 | 0.6900 | 9.6823×10⁻⁶ | 0.7261 | −0.1768 | 0.3083 | −5.0634×10⁻⁵ |
| | var | 0.0450 | 0.0458 | 0.0795 | 1.0916×10⁻⁵ | 0.0521 | 0.0581 | 0.0497 | 1.0109×10⁻⁵ |
| | $\hat{var}$ | | | | | | | | |
| TM-Hom | bias | 0.0019 | | −0.0000 | −0.0002 | 0.0012 | 0.0004 | −0.0013 | −7.9103×10⁻⁵ |
| | var | 0.0035 | 0.0033 | 0.0027 | 1.0396×10⁻⁵ | 0.0028 | 0.0026 | 0.0021 | 9.6008×10⁻⁶ |
| | $\hat{var}$ | 0.0035 | 0.0032 | 0.0027 | 9.8539×10⁻⁶ | 0.0028 | 0.0026 | 0.0022 | 9.941×10⁻⁶ |
| | CI | 0.9460 | 0.9440 | 0.9470 | 0.9490 | 0.9450 | 0.9540 | 0.9610 | 0.9470 |
| TM-Het | bias | −0.0144 | −0.0185 | 0.0001 | −0.0002 | −0.0203 | −0.0234 | −0.0013 | −7.9103×10⁻⁵ |
| | var | 0.0038 | 0.0034 | 0.0032 | 1.0396×10⁻⁵ | 0.0026 | 0.0024 | 0.0023 | 9.6008×10⁻⁶ |
| | $\hat{var}$ | 0.0037 | 0.0032 | 0.0031 | 9.8539×10⁻⁶ | 0.0026 | | 0.0023 | 9.941×10⁻⁶ |
| | CI | 0.9210 | 0.9300 | 0.9480 | 0.9490 | 0.9080 | 0.9160 | 0.9530 | 0.9470 |
| Naive | bias | −0.0269 | −0.0230 | 0.0027 | 5.1664×10⁻⁵ | −0.0255 | −0.0206 | 0.0023 | −9.2372×10⁻⁵ |
| | var | 0.0029 | 0.0030 | 0.0026 | 1.0255×10⁻⁵ | 0.0023 | 0.0024 | 0.0022 | 1.0823×10⁻⁵ |
| | $\hat{var}$ | 0.0030 | 0.0028 | 0.0027 | 1.0255×10⁻⁵ | 0.0024 | 0.0023 | 0.0022 | 9.9839×10⁻⁶ |
| | CI | 0.8930 | 0.9130 | 0.9570 | 0.9490 | 0.8830 | 0.9190 | 0.9530 | 0.9320 |

^* Estimated variances not available. The homoskedastic sieve MLE uses smoothing parameters κ_ε = κ_x = 6, except for the uniform heteroskedastic setting which uses κ_ε = 5, κ_x = 6. For the uniform heteroskedastic setting, the constrained optimization could not be solved for larger κ_ε values.

Table 2.

η₂₀: Heteroskedastic. Setting 1 (S1): η₂₀ ~ Uniform; Setting 2 (S2): η₂₀ ~ t₅.

| Method | Statistic | S1 β̂₁ | S1 β̂₂ | S1 β̂₃ | S1 ${\hat{σ}}_{U}^{2}$ | S2 β̂₁ | S2 β̂₂ | S2 β̂₃ | S2 ${\hat{σ}}_{U}^{2}$ |
|---|---|---|---|---|---|---|---|---|---|
| Semipar | bias | 0.0098 | −0.0055 | 0.0013 | 2.9761×10⁻⁵ | 0.0122 | −0.0009 | 0.0008 | −0.0001 |
| | var | 0.0188 | 0.0101 | 0.0119 | 1.0161×10⁻⁵ | 0.0160 | 0.0103 | 0.0124 | 1.0843×10⁻⁵ |
| | $\hat{var}$ | 0.0206 | 0.0100 | 0.0125 | 1.0022×10⁻⁵ | 0.0195 | 0.0102 | 0.0124 | 9.9708×10⁻⁵ |
| | CI | 0.9610 | 0.9480 | 0.9550 | 0.9510 | 0.9570 | | 0.9520 | 0.9320 |
| Sieve-Hom^* | bias | 0.1683 | 0.1008 | −0.0757 | −2.4913×10⁻⁵ | 0.2437 | 0.1595 | −0.0139 | −0.0001 |
| | var | 0.1686 | 0.2500 | 0.0518 | 9.9631×10⁻⁶ | 0.0601 | 0.0410 | 0.0247 | 1.0492×10⁻⁵ |
| | $\hat{var}$ | | | | | | | | |
| Sieve-Het^* | bias | 0.7334 | 0.6731 | 0.6527 | 9.6823×10⁻⁶ | 0.7868 | −0.2997 | 0.5669 | −5.0634×10⁻⁵ |
| | var | 0.0423 | 0.0385 | 0.0589 | 1.0916×10⁻⁵ | 0.0443 | 0.0844 | 0.0256 | 1.0109×10⁻⁵ |
| | $\hat{var}$ | | | | | | | | |
| TM-Hom | bias | 0.2931 | 0.2677 | −0.0227 | −0.0002 | 0.5032 | 0.5174 | −0.0349 | −0.0005 |
| | var | 0.2640 | 0.1392 | 0.0123 | 1.037×10⁻⁵ | 1.3260 | 1.6331 | 0.0119 | 8.9453×10⁻⁶ |
| | $\hat{var}$ | 0.1065 | 0.0692 | 0.0131 | 9.8513×10⁻⁶ | 0.3768 | 0.2236 | 0.0135 | 9.7925×10⁻⁶ |
| | CI | 0.8110 | 0.6800 | 0.9580 | 0.9480 | 0.8950 | 0.7890 | 0.9600 | 0.9550 |
| TM-Het | bias | 0.0225 | 0.0100 | 0.0005 | −0.0002 | 0.0134 | 0.0011 | −0.0027 | −9.3324×10⁻⁵ |
| | var | 0.0218 | 0.0096 | 0.0103 | 1.0409×10⁻⁵ | 0.0195 | 0.0088 | 0.0096 | 9.5564×10⁻⁶ |
| | $\hat{var}$ | 0.0258 | 0.0098 | 0.0104 | 9.8583×10⁻⁶ | 0.0215 | 0.0089 | 0.0099 | 9.9345×10⁻⁶ |
| | CI | 0.9500 | 0.9540 | 0.9490 | 0.9520 | 0.9500 | | 0.9590 | 0.9470 |
| Naive | bias | 0.0692 | −0.0185 | 0.0033 | 2.9761×10⁻⁵ | 0.0149 | −0.0123 | 0.0028 | −0.0001 |
| | var | 3.0261 | 0.0102 | 0.0125 | 1.0161×10⁻⁵ | 0.3106 | 0.0100 | 0.0127 | 1.0843×10⁻⁵ |
| | $\hat{var}$ | 1.5445 | 0.0102 | 0.0177 | 1.0022×10⁻⁵ | 0.9245 | 0.0387 | 0.0792 | 9.9708×10⁻⁵ |
| | CI | 0.9390 | 0.9480 | 0.9550 | 0.9510 | 0.9310 | | 0.9550 | 0.9320 |

In general, compared to the other estimators, our estimator had smaller bias, estimated variances better matching the sample variances, and estimated coverage probabilities closer to the nominal 95% level. This performance was similar regardless of the true model error distribution and its variance structure, thus reflecting the proposed estimator’s flexibility. The proposed estimator can yield valid estimates for an RMM with measurement error regardless of whether the true model error is homoskedastic or heteroskedastic. This is especially beneficial in practice since knowing the correct model error variance structure is almost impossible as residuals are not obtainable in measurement error models.

In comparison, the homoskedastic and heteroskedastic sieve MLEs were, in some cases, sensitive to the model error’s variance structure. When the model error was homoskedastic, the homoskedastic sieve MLE performed well and yielded unbiased estimates. Unfortunately, when applied to the heteroskedastic model error, this same estimator yielded biased estimates with bias up to 19 times larger than our proposed estimator. Increasing the smoothing parameters did not change the numerical results (a similar phenomenon was observed in Schennach and Hu (2013)), and the constrained optimization solver failed when they became too large. The observed bias was expected, however, because the homoskedastic sieve MLE is not designed to handle heteroskedasticity. Instead, a more flexible sieve such as the heteroskedastic sieve estimator should be employed. Unfortunately, in our numerical studies, the heteroskedastic sieve MLE yielded biased estimates both when the model error was homoskedastic and when it was heteroskedastic. We suspect the observed bias could be a result of the difficulty in solving a constrained optimization subject to too many constraints. When the model error is truly heteroskedastic, we further suspect that more specialized bases may be needed to properly account for the heteroskedasticity. Doing so, however, may be difficult as it would require estimating the model error’s heteroskedasticity and defining a truncated series that can capture its form. For an RMM with measurement error, correctly determining the model error’s variance-covariance is challenging, and it is a step that our proposed estimator bypasses.

The TM-Homoskedastic and TM-Heteroskedastic estimators also heavily relied on the correctness of the model error variance. When the model error variance structure was correctly specified, the TM estimators had little bias and nearly perfect nominal 95% coverage probabilities. In this case, the TM estimator has one less nonparametric term than our proposed method, and thus performed well. In contrast, when the variance structure was incorrect, the TM estimators performed poorly compared to our proposed estimator. The poor performance was most notable when the data was generated with heteroskedastic model errors, and we applied the TM-Homoskedastic estimator. In this case, the TM-Homoskedastic estimator yielded estimates with bias up to 40 times larger than our proposed estimator.

Finally, the naive estimator had large bias and coverage probabilities less than the nominal 95%, indicating that the measurement error was significant enough and could not be ignored.

These results demonstrate that measurement error cannot be ignored and that methods that rely on knowing the model error variance structure will, unfortunately, yield biased estimates when that structure is misspecified. Because our proposed estimator makes no assumptions about the model error’s variance structure, it offers more flexibility than existing methods, including the sieve MLE and Tsiatis-Ma method. Specifically, our proposed estimator provides consistent estimates even when the model error and covariate distributions are both misspecified. Similar results were observed for other mean models; see Supplementary Material (Section S.9).

4.3.2 Empirical impact of working models in proposed method

In Section 3.3, we discussed the theoretical impact of working models in our proposed method. We now evaluate the numerical impact. Specifically, we generated data as in Section 4.1, except with η₁₀ as Normal(0, 0.5²) and η₂₀ as Normal(0, 0.4²). We then evaluated our proposed method for four different cases of working models $η_{1}^{*}, η_{2}^{*}$ :

Case 1: $η_{1}^{*} = η_{10}, η_{2}^{*} = η_{20}$ .
Case 2: $η_{1}^{*} \neq η_{10}, η_{2}^{*} = η_{20}$ with $η_{1}^{*}$ a t-distribution with 4 degrees of freedom.
Case 3: $η_{1}^{*} = η_{10}, η_{2}^{*} \neq η_{20}$ with $η_{2}^{*}$ as Normal {0, (1 + |X|)²/3²}.
Case 4: $η_{1}^{*} \neq η_{10}, η_{2}^{*} \neq η_{20}$ with $η_{1}^{*}$ a t-distribution with 4 degrees of freedom, and $η_{2}^{*}$ as Normal {0, (1 + |X|)²/3²}.

Results in Table 3 show that in all cases, the proposed estimator yields consistent estimates. As we progress from Case 2 to Case 4, the efficiency loss only slightly increases; for example, the estimated variance for β̂₁ is 0.0065 in Case 4 compared to an estimated variance of 0.0044 in Case 1. Similar results were observed for other regression models; see Supplementary Material (Section S.9). This small loss in efficiency and insensitivity to the choice of the working models was similarly observed in simpler models (see Tsiatis and Ma, 2004, Ma and Carroll, 2006 and Wang et al., 2009). Hence, for flexible choices of working models, our method yields consistent estimates and small efficiency loss when using incorrect working models.

Table 3.

Evaluation of efficiency loss from proposed method when working models $η_{1}^{*}, η_{2}^{*}$ may differ from the true η₁₀, η₂₀. Bias, empirical sample variances (var), averaged estimated variances ( $\hat{var}$ ), and estimated 95% coverage probabilities (CI) for ${({\hat{σ}}_{U}^{2}, {\hat{β}}^{T})}^{T}$ with true parameter values ${(σ_{U, 0}^{2}, β_{0}^{T})}^{T} = {(0.05, 0.25, 0.7, 0.5)}^{T}$ and m(X,Z; β) = β₂ exp(−β₁X²) + β₃Z. Results based on 1000 simulations.

| Setting | | β̂₁ | β̂₂ | β̂₃ | $\hat{σ}_{U}^{2}$ |
|---|---|---|---|---|---|
| $η_{1}^{*} = η_{10}, η_{2}^{*} = η_{20}$ | bias | −0.0013 | 0.0009 | −0.0017 | −3.9892×10⁻⁵ |
| | var | 0.0043 | 0.0010 | 0.0013 | 1.0103×10⁻⁵ |
| | $\hat{var}$ | 0.0044 | 0.0010 | 0.0013 | 9.9257×10⁻⁶ |
| $η_{1}^{*} \neq η_{10}, η_{2}^{*} = η_{20}$ | bias | −0.0001 | 0.0012 | −0.0017 | −3.9892×10⁻⁵ |
| | var | 0.0046 | 0.0010 | 0.0013 | 1.0103×10⁻⁵ |
| | $\hat{var}$ | 0.0047 | 0.0010 | 0.0013 | 9.9257×10⁻⁶ |
| $η_{1}^{*} = η_{10}, η_{2}^{*} \neq η_{20}$ | bias | −0.0104 | 0.0007 | −0.0063 | −3.9892×10⁻⁵ |
| | var | 0.0051 | 0.0012 | 0.0020 | 1.0103×10⁻⁵ |
| | $\hat{var}$ | 0.0052 | 0.0012 | 0.0018 | 9.9257×10⁻⁶ |
| $η_{1}^{*} \neq η_{10}, η_{2}^{*} \neq η_{20}$ | bias | −0.0081 | 0.0011 | −0.0064 | −3.9892×10⁻⁵ |
| | var | 0.0064 | 0.0013 | 0.0021 | 1.0103×10⁻⁵ |
| | $\hat{var}$ | 0.0065 | 0.0013 | 0.0019 | 9.9257×10⁻⁶ |

CI entries by setting, in the order listed (the first, second, and fourth settings list fewer than four entries, so these values are not assigned to columns): Case 1: 0.9500, 0.9430. Case 2: 0.9480, 0.9500, 0.9430. Case 3: 0.9380, 0.9490, 0.9410, 0.9430. Case 4: 0.9410, 0.9470, 0.9430.

5 A case study

Flagg et al. (2000) performed a study to evaluate the validity of a Nutrition Survey conducted by the American Cancer Society in 1992–1993. In the study, n = 317 male participants completed four 24 hour dietary recall interviews given over a one-year period. Interest lies in understanding the impact of saturated fat intake on percent calories from fat for different races (white vs. non-white). Saturated fat intake, however, is not known exactly and only a mismeasured version via two repeated measurements is available.

Let Y denote the percent calories from fat, X denote the log transformation of the true (unobserved) saturated fat intake, and Z denote race (Z = 1 refers to white). We let W₁ and W₂ be the centered, log-transformed saturated fat measurements. Through a QQ-plot in Figure 1, we find that V = (W₁ − W₂)/2 is acceptably normally distributed with some unknown variance $σ_{U}^{2}$ . Normality was formally evaluated through a Pearson Chi-squared test where we used 10 to 20 bins for testing and obtained p-values of at least 0.63, which supports the normality assumption.
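A check of this kind can be sketched in Python as follows. This is an illustration under stated assumptions, not the authors' procedure: it uses equiprobable bins under a fitted Normal(0, s²), takes the degrees of freedom as the number of bins minus two (one for the constraint on counts, one for the estimated variance), and runs on synthetic data because the nutrition data are not reproduced here. The helper names `chi2_sf` and `pearson_normality_test` are hypothetical.

```python
import math
import random
from statistics import NormalDist

def chi2_sf(x, df):
    """Upper tail of a chi-squared distribution via the incomplete-gamma series."""
    if x <= 0:
        return 1.0
    a, z = df / 2.0, x / 2.0
    term = 1.0 / a
    total = term
    k = 1
    while term > 1e-15 * total:
        term *= z / (a + k)
        total += term
        k += 1
    lower = total * math.exp(-z + a * math.log(z) - math.lgamma(a))
    return max(0.0, 1.0 - lower)

def pearson_normality_test(v, bins):
    """Pearson chi-squared test of V ~ Normal(0, s2) using equiprobable bins."""
    n = len(v)
    s2 = sum(x * x for x in v) / n            # variance estimate with mean fixed at 0
    fitted = NormalDist(0.0, math.sqrt(s2))
    edges = [fitted.inv_cdf(k / bins) for k in range(1, bins)]
    counts = [0] * bins
    for x in v:
        counts[sum(x > e for e in edges)] += 1  # index of the bin containing x
    expected = n / bins
    stat = sum((c - expected) ** 2 / expected for c in counts)
    df = bins - 2                             # assumption: one estimated parameter
    return stat, chi2_sf(stat, df)

# Synthetic stand-in for V = (W1 - W2) / 2 with n = 317.
rng = random.Random(0)
w1 = [rng.gauss(0.0, 0.6) for _ in range(317)]
w2 = [rng.gauss(0.0, 0.6) for _ in range(317)]
v = [(a - b) / 2.0 for a, b in zip(w1, w2)]
for bins in (10, 15, 20):
    stat, p = pearson_normality_test(v, bins)
    print(bins, round(stat, 2), round(p, 3))
```

Repeating the test over a range of bin counts, as done in the text, guards against a conclusion that depends on one particular binning.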

Figure 1.

Nutrition study: quantile-quantile plots of the measurement error for the original first and third readings of the 24 hour recall surveys (top) and after the logarithm transform (bottom).

Because nutrition models usually assume percent calories from fat is related to saturated fat intake through a linear regression, we use the model

$$Y_{i} = β_{1} \exp(X_{i}) + β_{2} + β_{3} Z_{i} + ε_{i}, \quad W_{ij} = X_{i} + U_{ij}, \quad U_{ij} \sim \mathrm{Normal}(0, 2σ_{U}^{2})$$

for i = 1, …, n; j = 1, 2, and E(ε_i | X_i, Z_i) = 0.
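The factor 2 in the variance of U_ij is what makes the half-difference V have variance $σ_{U}^{2}$. As a short check, assuming U_i1 and U_i2 are independent of each other (an assumption stated here for the calculation):

$$V_{i} = \frac{W_{i1} - W_{i2}}{2} = \frac{U_{i1} - U_{i2}}{2}, \qquad \mathrm{var}(V_{i}) = \frac{\mathrm{var}(U_{i1}) + \mathrm{var}(U_{i2})}{4} = \frac{2σ_{U}^{2} + 2σ_{U}^{2}}{4} = σ_{U}^{2}.$$

The same calculation applied to the average (W_i1 + W_i2)/2 = X_i + (U_i1 + U_i2)/2 shows that the averaged measurement also carries error variance $σ_{U}^{2}$.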

To estimate the model parameters, we used five methods: (i) The proposed method with working models $η_{1}^{*}$ as Normal(0, 0.56²) and $η_{2}^{*}$ as Normal(0, 0.91²). The variance for $η_{1}^{*}$ is the sample variance of W, and the variance of $η_{2}^{*}$ is the residual sum of squares after regressing Y on exp(W) and Z. (ii) The homoskedastic sieve MLE with smoothing parameters κ_ε = κ_x = 6. (iii) The heteroskedastic sieve MLE with ℓ_x = ℓ_e = max_i |Y_i|. (iv) The Tsiatis-Ma Homoskedastic estimator with $η_{1}^{*}, η_{2}^{*}$ as in our proposed method. Unlike our method, the TM-Homoskedastic estimator assumes the specified $η_{2}^{*}$ is correct. We did not use the TM-Heteroskedastic estimator because it is difficult to specify a heteroskedastic variance structure for an RMM with measurement error. (v) The naive estimator.

Parameter estimates for all methods are in Table 4. All methods yielded similar inference conclusions: among the male population, saturated fat intake is statistically significant in relation to percent calories from fat (e.g., the proposed method yielded β̂₁ = 1.59, 95% CI: (1.23, 1.95)), whereas race is not (e.g., the proposed method yielded β̂₃ = −0.14, 95% CI: (−0.35, 0.08)). Though inference conclusions were similar, the methods yielded different magnitudes of the parameter effects. For example, the proposed method indicated that a one unit increase in saturated fat is associated with an estimated increase of 1.59 units in the mean of percent calories. This is more than twice the naive estimate (0.71), about 1.4 times the homoskedastic and heteroskedastic sieve estimates (1.14), and about 1.25 times the TM-Homoskedastic estimate (1.27). The contrast in these results indicates that measurement error cannot be ignored. Moreover, given that the Tsiatis-Ma and sieve MLE estimators exhibited sensitivity to misspecification of the model error variance, we would prefer to rely on the results from the proposed method, which is insensitive to such misspecification. Therefore, our method indicates that saturated fat intake affects a male’s percent calories from fat more than existing methods would indicate.
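The interval and the relative magnitudes quoted above can be recomputed from the Table 4 entries. The Python sketch below assumes the reported intervals are Wald intervals of the form estimate ± z·sqrt(estimated variance); the source does not state the interval construction, and the helper name `wald_ci` is illustrative.

```python
import math
from statistics import NormalDist

def wald_ci(est, var, level=0.95):
    """Normal-approximation interval: est +/- z * sqrt(var)."""
    z = NormalDist().inv_cdf(0.5 + level / 2.0)
    half = z * math.sqrt(var)
    return est - half, est + half

# beta1-hat estimates from Table 4.
beta1 = {
    "Semipar": 1.5926,
    "Sieve-Hom": 1.1397,
    "Sieve-Het": 1.1407,
    "TM-Hom": 1.2745,
    "Naive": 0.7110,
}

# Interval for the proposed method, using its estimated variance 0.0338.
lo, hi = wald_ci(beta1["Semipar"], 0.0338)
print(round(lo, 2), round(hi, 2))

# Proposed-method estimate relative to each competing estimate.
ratios = {k: round(beta1["Semipar"] / b, 2) for k, b in beta1.items() if k != "Semipar"}
print(ratios)
```

With the rounded table entries, the recomputed interval agrees with the reported (1.2322, 1.9530) to about three decimal places, which is consistent with the Wald assumption.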

Table 4.

Results from nutrition study when estimation is based on proposed method (Semipar), homoskedastic sieve MLE (Sieve-Hom), heteroskedastic sieve MLE (Sieve-Het), Tsiatis-Ma homoskedastic estimator (TM-Hom), and naive estimator. Parameter estimate (est), its estimated variance ( $\hat{var}$ ), and 95% confidence interval (CI).

| Method | | β̂₁ | β̂₂ | β̂₃ | $\hat{σ}_{U}^{2}$ |
|---|---|---|---|---|---|
| Semipar | est | 1.5926 | −1.6611 | −0.1364 | 0.1097 |
| | $\hat{var}$ | 0.0338 | 0.0544 | 0.0117 | 0.0001 |
| | CI | (1.2322, 1.9530) | (−2.1182, −1.2040) | (−0.3480, 0.0752) | (0.0923, 0.1271) |
| Sieve-Hom* | est | 1.1397 | −1.2449 | −0.0401 | 0.1097 |
| | $\hat{var}$ | | | | |
| Sieve-Het* | est | 1.1407 | 1.1628 | 0.2398 | 0.1097 |
| | $\hat{var}$ | | | | |
| TM-Hom† | est | 1.2745 | −1.3604 | −0.0734 | 0.1097 |
| | $\hat{var}$ | 0.0190 | 0.0242 | 0.0116 | 0.0001 |
| | CI | (1.0046, 1.5445) | (−1.6655, −1.0553) | (−0.2841, 0.1373) | (0.0923, 0.1271) |
| Naive | est | 0.7110 | −0.8044 | −0.0351 | 0.1097 |
| | $\hat{var}$ | 0.0065 | 0.0129 | 0.0104 | 0.0001 |
| | CI | (0.3506, 1.0714) | (−1.2615, −0.3473) | (−0.2467, 0.1766) | (0.0923, 0.1271) |

\* Estimated variances not available. The homoskedastic sieve MLE uses smoothing parameters κ_ε = κ_x = 6.

† We did not use the TM-Heteroskedastic estimator because it is difficult to specify a heteroskedastic variance structure for an RMM with measurement error.

6 Discussion

We have developed root-n consistent estimators and provided inference tools for an RMM with errors in covariates where both the mean model and the measurement error model are in their general form. We showed that our method’s consistency does not require independence between the covariates and the model error, nor require estimating the unobservable covariate distribution and model error distribution. This is advantageous over existing methods including the Tsiatis and Ma (2004) estimator and the sieve MLE which have shown numerical sensitivity to model error heteroskedasticity. The proposed estimator is derived via a semiparametric procedure different from that in Tsiatis and Ma (2004), and, to the best of our knowledge, the resulting root-n consistent estimator is the first known in its generality that is robust to various distribution misspecifications.

To identify and estimate Ω_U in Section 2.1, we used the average of repeated measures. An alternative is to directly use the repeated measures to perform estimation and inference. Based on our experience (Ma and Yin, 2008), there is generally not a definitive efficiency gain or loss with this approach relative to the average approach. However, more careful analysis will be needed to determine when one or the other is more efficient.

We assumed throughout that the measurement error distribution p_{U_ij} (u; Ω_U) is parametric with Ω_U unknown. We can relax this assumption to allow a nonparametric measurement error distribution. In this case, still assuming X_i and U_ij are independent, Kotlarski’s Theorem (Kotlarski, 1967) implies that the measurement error density is identifiable. From the repeated measures, a nonparametric kernel estimator p̂_{U_ij} of the measurement error density can be obtained, and operationally our estimation procedure can proceed with p_{U_ij} replaced by p̂_{U_ij}. For such a plug-in procedure, we provide the following summary. (i) The identifiability of β (Section 2.2) still holds. (ii) Theorem 1 and the estimation procedure for β (Section 3.2) remain valid since they only require a consistent estimator of the measurement error density. (iii) The root-n consistency and asymptotic normality in Theorems 2 and 3 still hold, although the asymptotic variance will change and the proofs will need to be redone to take into account the additional nonparametric estimation. See Hall and Ma (2007) for details on how to incorporate a nonparametrically estimated error distribution in a different model. (iv) The optimal efficiency bound in estimating β will decrease due to the nonparametric estimation of p_{U_ij}.

Lastly, another extension of our method is to a conditional moment model where

E {m (Y, X, Z; β) ∣ X, Z} = 0.

(11)

In this case, the proof of identification of β (Section 2.2) still holds since it does not require a particular form of the conditional density of Y given X, Z. Our remaining estimation procedure, asymptotic properties, and implementation (Section 3) also remain intact, except with ε replaced everywhere by m(Y,X,Z; β) and $m_{β}^{'} (X, Z; β)$ on the right-hand side of equation (7) changed to −E{∂m(Y,X,Z; β)/∂β|X,Z}. Consequently, even for general nonlinear and nonseparable regression models of the form Y = f(X,Z, ε, β₀), where the distribution of ε is unknown and may be subject to various restrictions, our general procedure is applicable as long as we can construct moment conditions, i.e., find m(Y,X,Z; β₀) such that (11) holds. This extension is particularly useful in empirical economics, where models can take conditional or nonseparable forms.
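As a consistency check, the restricted moment model (1) is recovered as a special case of (11). Taking the moment function to be the residual, m(Y,X,Z; β) = Y − m(X,Z; β), where the two functions are distinguished by their arguments as in the text, gives

$$E\{Y - m(X, Z; β) \mid X, Z\} = E(ε \mid X, Z) = 0, \qquad -E\left\{\frac{\partial}{\partial β}\{Y - m(X, Z; β)\} \,\Big|\, X, Z\right\} = m_{β}^{'}(X, Z; β),$$

so the replacement described for equation (7) returns the original term $m_{β}^{'} (X, Z; β)$ when the moment function is the regression residual.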

Acknowledgments

This work was supported by the National Institute of Neurological Disorders and Stroke of the National Institutes of Health under Award Number K01NS099343, the Huntington’s Disease Society of America Human Biology Project Fellowship, Texas A&M School of Public Health Research Enhancement and Development Initiative (REDI-23-202059-36000), and the National Science Foundation (DMS-1608540). We thank Yingyao Hu for providing code for the homoskedastic sieve estimator and for advising on the heteroskedastic sieve estimator. We thank Raymond J. Carroll for providing the nutrition data. We also thank the editor and two referees whose comments substantially improved the quality and presentation of the work.

Footnotes

JEL Classification: C1

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

Contributor Information

Tanya P. Garcia, Department of Epidemiology and Biostatistics, Texas A&M University.

Yanyuan Ma, Department of Statistics, Pennsylvania State University.

References

Bickel PJ, Klaassen CAJ, Ritov Y, Wellner JA. Efficient and Adaptive Estimation for Semiparametric Models. Baltimore: The Johns Hopkins University Press; 1993.
Carroll RJ, Hall P. Optimal rates of convergence for deconvoluting a density. Journal of the American Statistical Association. 1988;83:1184–1186.
Carroll RJ, Maca JD, Ruppert D. Nonparametric regression in the presence of measurement error. Biometrika. 1999;86:541–554.
Carroll RJ, Ruppert D, Crainiceanu CM, Tosteson TD, Karagas MR. Nonlinear and nonparametric regression and instrumental variables. Journal of the American Statistical Association. 2004;99:736–750.
Carroll RJ, Ruppert D, Stefanski LA, Crainiceanu C. Measurement Error in Nonlinear Models: A Modern Perspective. 2nd ed. London: CRC Press; 2006.
Carroll RJ, Wang Y. Nonparametric variance estimation in the analysis of microarray data: a measurement error approach. Biometrika. 2008;95:437–449. doi: 10.1093/biomet/asn017.
Chan LK, Mak TK. On the polynomial functional relationship. Journal of the Royal Statistical Society, Series B. 1985;47:510–518.
Chen X, Hu Y, Lewbel A. Nonparametric identification and estimation of nonclassical errors-in-variables models without additional information. Statistica Sinica. 2009;19:949–968.
Cheng CL, Schneeweiss H. Polynomial regression with errors in the variables. Journal of the Royal Statistical Society, Series B. 1998;60:189–199.
Cheng CL, Schneeweiss H, Thamerus M. A small sample estimator for a polynomial regression with errors in the variables. Journal of the Royal Statistical Society, Series B. 2000;62:699–709.
Delaigle A, Hall P. Estimation of observation-error variance in errors-in-variables regression. Statistica Sinica. 2011;21:1023–1063.
Fan J. On the optimal rates of convergence for nonparametric deconvolution problems. Annals of Statistics. 1991;19:1257–1272.
Flagg E, Coates R, Calle E, Potischman N, Thun M. Validation of the American Cancer Society Cancer Prevention Study II Nutrition Survey Cohort Food Frequency Questionnaire. Epidemiology. 2000;11:462–468. doi: 10.1097/00001648-200007000-00017.
Fuller WA. Measurement Error Models. New York: Wiley; 1987.
Hall P, Ma Y. Measurement Error Models with Unknown Error Structure. Journal of the Royal Statistical Society, Series B. 2007;69:429–446.
Huang S, Huwang L. On the polynomial structural relationship. The Canadian Journal of Statistics. 2001;29:495–512.
Hu Y, Schennach S. Instrumental variable treatment of nonclassical measurement error models. Econometrica. 2008;76:195–216.
Jennrich RI. Asymptotic Properties of Non-Linear Least Squares Estimators. Annals of Mathematical Statistics. 1969;40:633–643.
Kotlarski II. On characterizing the gamma and normal distribution. Pacific Journal of Mathematics. 1967;20:69–76.
Kress R. Linear integral equations. 2nd ed. Berlin: Springer; 1999.
Lee L, Sepanski J. Estimation of linear and nonlinear errors-in-variables models using validation data. Journal of the American Statistical Association. 1995;90:130–140.
Li H, Liqun W. Consistent estimation in generalized linear mixed models with measurement error. Journal of Biometrics and Biostatistics. 2012:S7:007. doi: 10.4172/2155-6180.S7-007.
Li T. Robust and consistent estimation of nonlinear errors-in-variables models. Journal of Econometrics. 2002;110:1–26.
Liang H. Generalized partially linear mixed effects models incorporating mismeasured covariates. Annals of the Institute of Statistical Mathematics. 2009;61:27–46. doi: 10.1007/s10463-007-0146-0.
Liang K, Zeger S. Longitudinal data analysis using generalized linear models. Biometrika. 1986;73:13–22.
Ma Y, Carroll RJ. Locally efficient estimators for semiparametric models with measurement error. Journal of the American Statistical Association. 2006;101:1465–1474.
Ma Y, Yin G. Cure Rate Model with Mismeasured Covariates under Transformation. Journal of the American Statistical Association. 2008;103:743–756.
Nakamura T. Corrected score function for errors-in-variables models: methodology and application to generalized linear models. Biometrika. 1990;77:127–137.
Newey W. Semiparametric efficiency bounds. Journal of Applied Econometrics. 1990;5:99–135.
Novick SJ, Stefanski LA. Corrected score estimation via complex variable simulation extrapolation. Journal of the American Statistical Association. 2002;97:472–481.
Rao CR. Linear Statistical Inference and Its Applications. New York: Wiley; 1973.
Rao P. Identifiability in Stochastic Models. New York: Academic Press; 1992.
Rudin W. Real and complex analysis. McGraw-Hill; 1987. Mathematics series.
Schennach S. Estimation of nonlinear models with measurement error. Econometrica. 2004;72:33–75.
Schennach S. Nonparametric regression in the presence of measurement error. Econometric Theory. 2004b;20:1046–1093.
Schennach S, Hu Y. Nonparametric identification and semiparametric estimation of classical measurement error models without side information. Journal of the American Statistical Association. 2013;108:177–186.
Shen X. On methods of sieves and penalization. Annals of Statistics. 1997;25:2555–2591.
Stefanski L, Carroll RJ. Deconvoluting kernel density estimators. Statistics. 1990;21:169–184.
Tsiatis A. Semiparametric Theory and Missing Data. New York: Springer; 2006.
Tsiatis A, Ma Y. Locally efficient semiparametric estimators for functional measurement error models. Biometrika. 2004;91:835–848.
Wang Y, Ma Y, Carroll RJ. Variance estimation in the analysis of microarray data. Journal of the Royal Statistical Society, Series B. 2009;71:725–745. doi: 10.1111/j.1467-9868.2008.00690.x.

