On Spatial Processes and Asymptotic Inference under Near-Epoch Dependence

Nazgul Jenish; Ingmar R Prucha

doi:10.1016/j.jeconom.2012.05.022

Author manuscript; available in PMC: 2013 Sep 1.

Published in final edited form as: J Econom. 2012 Sep;170(1):178–190. doi: 10.1016/j.jeconom.2012.05.022


PMCID: PMC3441186 NIHMSID: NIHMS390113 PMID: 22984323

Abstract

The development of a general inferential theory for nonlinear models with cross-sectionally or spatially dependent data has been hampered by a lack of appropriate limit theorems. To facilitate a general asymptotic inference theory relevant to economic applications, this paper first extends the notion of near-epoch dependent (NED) processes used in the time series literature to random fields. The class of processes that is NED on, say, an α-mixing process, is shown to be closed under infinite transformations, and thus accommodates models with spatial dynamics. This would generally not be the case for the smaller class of α-mixing processes. The paper then derives a central limit theorem and law of large numbers for NED random fields. These limit theorems allow for fairly general forms of heterogeneity including asymptotically unbounded moments, and accommodate arrays of random fields on unevenly spaced lattices. The limit theorems are employed to establish consistency and asymptotic normality of GMM estimators. These results provide a basis for inference in a wide range of models with spatial dependence.

Keywords: Random fields, near-epoch dependent processes, central limit theorem, law of large numbers, GMM estimator

1 Introduction

Models with spatially dependent data have recently attracted considerable attention in various fields of economics including labor and public economics, IO, political economy, international and urban economics. In these models, strategic interaction, neighborhood effects, shared resources and common shocks lead to interdependences in the dependent and/or explanatory variables, with the variables indexed by their location in some socioeconomic space.¹ Insofar as these locations are deterministic, observations can be modeled as a realization of a dependent heterogenous process indexed by a point in ℝ^d, d > 1, i.e., as a random field.

The aim of this paper is to define a class of random fields that is sufficiently general to accommodate many applications of interest, and to establish corresponding limit theorems that can be used for asymptotic inference. In particular, we apply these limit theorems to prove consistency and asymptotic normality of generalized method of moments (GMM) estimators for a general class of nonlinear spatial models.

To date, linear spatial autoregressive models, also known as Cliff Ord (1981) type models², have arguably been one of the most popular approaches to modeling spatial dependence in the econometrics literature. The asymptotic theory in these models is facilitated, loosely speaking, by imposing specific structural conditions on the data generating process, and by exploiting some underlying independence assumptions. Another popular approach to model dependence is through mixing conditions. Various mixing concepts developed for time series processes have been extended to random fields. However, the respective limit theorems for random fields have not been sufficiently general to accommodate many of the processes encountered in economics. This hampered the development of a general asymptotic inference theory for nonlinear models with cross-sectional dependence. Towards filling this gap, Jenish and Prucha (2009) have recently introduced a set of limit theorems (CLT, ULLN, LLN) for α-mixing random fields on unevenly spaced lattices that allow for nonstationary processes with trending moments.

However, some important classes of dependent processes are not necessarily mixing, including linear autoregressive (AR) and infinite moving average (MA(∞)) processes. Sufficient conditions for the α-mixing property of linear processes³ are fairly stringent, and involve three types of restrictions (i) smoothness of the density functions of the innovations, (ii) sufficiently fast rates of decay of the coefficients, and (iii) invertibility of the linear process. There are examples demonstrating that the mixing property can fail for any of these reasons. In particular, Andrews (1984) showed that a simple AR(1) process of independent Bernoulli innovations is not α-mixing. Similar examples have been constructed for random fields, see, e.g. Doukhan and Lang (2002). Thus, mixing may break down in the case of discrete innovations. Further, Gorodetskii (1977) showed that the strong mixing property may fail even in the case of continuously distributed (normal) innovations when the coefficients of the linear process do not decline sufficiently fast. As these examples suggest, the mixing property is generally not preserved under infinite transformations of mixing processes. Yet stochastic processes generated as functionals of some underlying process arise in a wide range of models, with autoregressive models being the leading example. Thus, it is important to develop an asymptotic theory for a generalized class of random fields that is “closed with respect to infinite transformations”.

To tackle this problem, the paper first extends the concept of near-epoch dependent (NED) processes used in the time series literature to spatial processes. The notion dates back to Ibragimov (1962), and Billingsley (1968). The NED concept, or variants thereof, have been used extensively in the time series literature by McLeish (1975), Bierens (1981), Wooldridge (1986), Gallant and White (1988), Andrews (1988), Pötscher and Prucha (1997), Davidson (1992, 1993, 1994) and de Jong (1997), among others. Doukhan and Louhichi (1999) introduced an alternative class of dependent processes called “θ-weakly dependent”.

In deriving our limit theorems we then only assume that the process is NED on a mixing input process, i.e., that the process can be approximated by a mixing input process in the NED sense, rather than assuming that the process itself is mixing. Of course, every mixing process is trivially also NED on itself, and thus the class of processes that are NED on a mixing input process includes the class of mixing processes. There are several advantages to working with the enlarged class of processes that are NED on a mixing process. First, linear processes with discrete innovations, which may fail to satisfy the strong mixing property, will still be NED on the input process of innovations, provided the latter are mixing. We note that, in particular, the NED property holds in both examples of Andrews (1984) and Gorodetskii (1977), by Proposition 1 of this paper. Second, as shown in this paper, nonlinear MA(∞) random fields are also NED under some mild conditions, while such conditions are not readily available for mixing. Third, the NED property is often easy to verify. For instance, the sufficient conditions for MA(∞) random fields involve only smoothness conditions on the functional form and absolute summability of the coefficients, which are not difficult to check, while verification of mixing is usually more difficult.

The paper derives a CLT and an LLN for spatial processes that are near-epoch dependent on an α-mixing input process. These limit theorems allow for fairly general forms of heterogeneity including asymptotically unbounded moments, and accommodate arrays of random fields on unevenly spaced lattices. The LLN can be combined with the generic ULLN in Jenish and Prucha (2009) to obtain a ULLN for NED spatial processes. In the time series literature, CLTs for NED processes were derived by Wooldridge (1986), Davidson (1992, 1993), and de Jong (1997). Interestingly, our CLT contains as a special case the CLT of Wooldridge (1986), Theorem 3.13 and Corollary 4.4.

In addition, we give conditions under which the NED property is preserved under transformations. These results play a key role in verifying the NED property in applications. Thus, the NED property is compatible with considerable heterogeneity and dependence, invariant under transformations, and leads to a CLT and LLN under fairly general conditions. All these features make it a convenient tool for modeling spatial dependence.

As an application, we establish consistency and asymptotic normality of spatial GMM estimators. These results provide a fundamental basis for constructing confidence intervals and testing hypotheses for GMM estimators in nonlinear spatial models. Our results also expand on Conley (1999), who established the asymptotic properties of GMM estimators assuming that the data generating process and the moment functions are stationary and α-mixing.⁴

The rest of the paper is organized as follows. Section 2 introduces the concept of NED spatial processes and gives examples of random fields satisfying this condition. Section 3 contains the LLN and CLT for NED spatial processes. Section 4 establishes the asymptotic properties of spatial GMM estimators. All proofs are relegated to the appendices.

2 NED Spatial Processes

Let D ⊂ ℝ^d, d ≥ 1, be a lattice of (possibly) unevenly placed locations in ℝ^d, and let $Z = \{Z_{i,n}, i \in D_n, n \geq 1\}$ and $ε = \{ε_{i,n}, i \in T_n, n \geq 1\}$ be triangular arrays of random fields defined on a probability space $(Ω, \mathfrak{F}, P)$ with $D_n \subseteq T_n \subseteq D$. The space ℝ^d is equipped with the metric $ρ(i, j) = \max_{1 \leq l \leq d} |j_l - i_l|$, where $i_l$ is the l-th component of i. The distance between any subsets U, V ⊆ D is defined as ρ(U, V) = inf {ρ(i, j): i ∈ U and j ∈ V}. Furthermore, let |U| denote the cardinality of a finite subset U ⊂ D.
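To make the metric concrete, the following is a minimal Python sketch of ρ(i, j) and of the set distance ρ(U, V). The function names `rho` and `rho_sets` and the sample coordinates are illustrative assumptions, not part of the paper; for finite sets the infimum is simply a minimum.

```python
from itertools import product

def rho(i, j):
    """Maximum-coordinate metric: max over components l of |j_l - i_l|."""
    return max(abs(b - a) for a, b in zip(i, j))

def rho_sets(U, V):
    """Distance between two finite location sets (inf reduces to min)."""
    return min(rho(i, j) for i, j in product(U, V))

# Hypothetical, unevenly placed locations in R^2.
U = [(0.0, 0.0), (1.5, 0.5)]
V = [(4.0, 1.0), (3.0, -2.5)]

point_distance = rho((0, 0), (3, -1))
set_distance = rho_sets(U, V)
```

Here the closest pair across the two sets is (1.5, 0.5) and (4.0, 1.0), which differ by 2.5 in the first coordinate and 0.5 in the second.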

The random variables $Z_{i,n}$ and $ε_{i,n}$ are possibly vector-valued, taking their values in $\mathbb{R}^{p_z}$ and $\mathbb{R}^{p_ε}$, respectively. We assume that $\mathbb{R}^{p_z}$ and $\mathbb{R}^{p_ε}$ are normed metric spaces equipped with the Euclidean norm, which we denote (in an obvious misuse of notation) as |·|. For any random vector Y, let $\|Y\|_p = [E|Y|^p]^{1/p}$, p ≥ 1, denote its $L_p$-norm. Finally, let $\mathfrak{F}_{i,n}(s) = σ(ε_{j,n}; j \in T_n: ρ(i, j) \leq s)$ be the σ-field generated by the random vectors $ε_{j,n}$ located in the s-neighborhood of location i.

Throughout the paper, we maintain these notational conventions and the following assumption concerning D.

Assumption 1

The lattice D ⊂ ℝ^d, d ≥ 1, is countably infinite. All elements in D are located at distances of at least ρ₀ > 0 from each other, i.e., for all i, j ∈ D with i ≠ j: ρ(i, j) ≥ ρ₀; w.l.o.g. we assume that ρ₀ > 1.

The assumption of a minimum distance has also been used by Conley (1999) and Jenish and Prucha (2009). It ensures the growth of the sample size as the sample regions D_n and T_n expand. The setup is thus geared towards what is referred to in the spatial literature as increasing domain asymptotics.

We now introduce the notion of near-epoch dependent (NED) random fields.

Definition 1

Let $Z = \{Z_{i,n}, i \in D_n, n \geq 1\}$ be a random field with $\|Z_{i,n}\|_p < \infty$, p ≥ 1, let $ε = \{ε_{i,n}, i \in T_n, n \geq 1\}$ be a random field, where $|T_n| \to \infty$ as n → ∞, and let $d = \{d_{i,n}, i \in D_n, n \geq 1\}$ be an array of finite positive constants. Then the random field Z is said to be $L_p(d)$-near-epoch dependent on the random field ε if

\| Z_{i,n} - E(Z_{i,n} \mid \mathfrak{F}_{i,n}(s)) \|_{p} \leq d_{i,n} ψ(s)

(1)

for some sequence ψ(s) ≥ 0 with $\lim_{s \to \infty} ψ(s) = 0$. The ψ(s), which are w.l.o.g. assumed to be non-increasing, are called the NED coefficients, and the $d_{i,n}$ are called NED scaling factors. Z is said to be $L_p$-NED on ε of size −λ if $ψ(s) = O(s^{-μ})$ for some μ > λ > 0. Furthermore, if $\sup_n \sup_{i \in D_n} d_{i,n} < \infty$, then Z is said to be uniformly $L_p$-NED on ε.

Recall that D_n ⊆ T_n. Typically, T_n will be an infinite subset of D, and often T_n = D. However, as discussed in more detail in Jenish and Prucha (2011), to cover Cliff Ord type processes T_n is allowed to depend on n and to be finite provided that it increases in size with n.

The role of the scaling factors $\{d_{i,n}\}$ is to allow for the possibility of “unbounded moments”, i.e., $\sup_n \sup_{i \in D_n} d_{i,n} = \infty$. Unbounded moments may reflect trends in the moments in certain directions, in which case we may also use, as in the time series literature, the terminology of “trending moments”. The NED property is thus compatible with a considerable amount of heterogeneity. In establishing limit theorems for NED processes, we will have to impose restrictions on the scaling factors $d_{i,n}$. In this respect, observe that

\| Z_{i,n} - E(Z_{i,n} \mid \mathfrak{F}_{i,n}(s)) \|_{p} \leq \| Z_{i,n} \|_{p} + \| E(Z_{i,n} \mid \mathfrak{F}_{i,n}(s)) \|_{p} \leq 2 \| Z_{i,n} \|_{p}

by the Minkowski and the conditional Jensen inequalities. Given this, we may choose $d_{i,n} \leq 2 \|Z_{i,n}\|_p$, and consequently w.l.o.g. 0 ≤ ψ(s) ≤ 1; see, e.g., Davidson (1994), p. 262, for a corresponding discussion within the context of time series processes. Note that by the Lyapunov inequality, if $Z_{i,n}$ is $L_p$-NED, then it is also $L_q$-NED with the same coefficients $\{d_{i,n}\}$ and {ψ(s)} for any q ≤ p.

Our definition of NED for spatial processes is adapted from the definition of NED for time series processes. In the time series literature, the NED concept first appeared in the works of Ibragimov (1962) and Billingsley (1968), although they did not use the present term. The concept of time series NED processes was later formalized by McLeish (1975), Wooldridge (1986), Gallant and White (1988). These authors considered only L₂-NED processes. Andrews (1988) generalized it to L_p-NED processes for p ≥ 1. Davidson (1992, 1993, 1994) and de Jong (1997) further extended it to allow for trending time series processes.

We note that aside from the NED condition, a number of different notions of dependence have been used in the time series literature. For instance, Pötscher and Prucha (1997) considered a more general dependence condition (called $L_p$-approximability). They use more general approximating functions than the conditional mean in Definition 1 to describe the dependence structure of a process. Similar conditions are also used by Lu (2001), Lu and Linton (2007), among others. These conditions allow for more general choices of approximating functions than the conditional expectation. One of the main results in this paper is a central limit theorem, which requires the existence of second moments. Since for p = 2 the conditional mean is the best approximator in the sense of minimizing the mean squared error, our use of the conditional mean as an approximating function is not restrictive. Still, in particular applications it may be convenient to work with some other $\mathfrak{F}_{i,n}(s)$-measurable approximating function, say $h_{i,s,n}$. Of course, if one can show that $\|Z_{i,n} - h_{i,s,n}\|_2 \leq d_{i,n} ψ(s)$, then this also establishes (1) for p = 2.

In the spatial literature, NED processes were considered in the special context of density estimation by Hallin, Lu and Tran (2001), and Hallin, Lu and Tran (2004), albeit they did not use this term. The first paper proves asymptotic normality of the kernel density estimator for linear random fields, the second paper shows L₁-consistency of the kernel density estimator for nonlinear functionals of i.i.d. random fields. We note that neither of these papers establishes a central limit theorem for nonlinear NED random fields.

As discussed earlier, an important motivation for considering NED processes is that mixing is generally not preserved under transformations involving infinitely many arguments. However, as illustrated below, the output process is generated as a function of infinitely many input variables in a wide range of models. In those situations, mixing of the input process does not necessarily carry over to the output process, and thus limit theorems for averages of the output process cannot simply be established from limit theorems for mixing processes. Nevertheless, as with time series processes, we show below that limit theorems can be extended to spatial processes that are NED on a mixing input process, provided the approximation error declines “sufficiently fast” as the conditioning set of input variables expands.

We now give examples of NED spatial processes. First, spatial Cliff and Ord (1981) type autoregressive processes are NED under some weak conditions on the spatial weight coefficients. These models have been used widely in applications. For recent contributions on estimation strategies for these models see, e.g., Robinson (2010, 2009), Kelejian and Prucha (2010, 2007, 2004), and Lee (2007, 2004). The second example is linear infinite moving average (MA(∞)) random fields. In preparation of the example, we first give a more general result, which shows that the NED property is satisfied by random fields generated from nonlinear Lipschitz type functionals of some ℝ^p_ε-valued random field ε = {ε_in, i ∈ D}:

Z_{i n} = H_{i n} ({(ε_{j n})}_{j \in D})

(2)

where $H_{in}: \mathcal{E}^{D} \to \mathbb{R}^{p_z}$, $\mathcal{E} \subseteq \mathbb{R}^{p_ε}$, are measurable functions satisfying for all $e, e' \in \mathcal{E}^{D}$

| H_{in}(e) - H_{in}(e') | \leq \sum_{j \in D} w_{ijn} | e_{j} - e_{j}' | with w_{ijn} \geq 0

(3)

with

\lim_{s \to \infty} \sup_{n, i \in D} \sum_{j \in D: ρ(i, j) > s} w_{ijn} = 0, and \| ε \|_{2} = \sup_{n, i \in D} \| ε_{in} \|_{2} < \infty.

(4)

Proposition 1

Under conditions (3)–(4), $Z = \{Z_{in}, i \in D_n, n \geq 1\}$ given by (2) is well-defined, and is $L_2$-NED on ε with $ψ(s) = \|ε\|_2 \sup_{n, i \in D} \sum_{j \in D: ρ(i, j) > s} w_{ijn}$.
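The NED coefficient in Proposition 1 is a worst-case tail sum of the weights, which can be evaluated directly on a finite set of locations. The sketch below is only an illustration under stated assumptions: the lattice is truncated to the 41 integer points {0, …, 40}, the weights $w_{ij} = 0.5^{ρ(i,j)}$ do not depend on n, and the norm of ε is set to 1. None of these choices come from the paper, and the helper name `ned_coefficient` is hypothetical.

```python
def rho(i, j):
    """Maximum-coordinate metric on R^d."""
    return max(abs(a - b) for a, b in zip(i, j))

def ned_coefficient(locations, weight, eps_norm, s):
    """psi(s) = ||eps||_2 * max_i sum_{j: rho(i,j) > s} w_ij on a finite set."""
    tails = []
    for i in locations:
        # Total weight placed on innovations farther than s from location i.
        tails.append(sum(weight(i, j) for j in locations if rho(i, j) > s))
    return eps_norm * max(tails)

# Assumed example: truncated 1-D integer lattice with geometric weights.
locations = [(k,) for k in range(41)]

def weight(i, j):
    return 0.5 ** rho(i, j)

psi = [ned_coefficient(locations, weight, 1.0, s) for s in range(6)]
```

With geometric weights the tail sums shrink geometrically in s, so the resulting ψ(s) is non-increasing, in line with the convention in Definition 1.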

We now use the above proposition to establish the NED property for linear MA(∞) random fields. Linear MA(∞) random fields may arise as solutions of autoregressive models. For any k ∈ ℕ and fixed vectors v_l ∈ ℤ^d, l = 1, …, k, consider the following autoregressive random field:

Z_{i} = \sum_{l = 1}^{k} a_{l} Z_{i - v_{l}} + ε_{i}

(5)

where $a = \sum_{l = 1}^{k} |a_{l}| < 1$, $\{ε_i, i \in \mathbb{Z}^d\}$ are i.i.d. with $\|ε_i\|_q < \infty$, q ≥ 1. Model (5) is also known as a k-nearest-neighbor or interaction model with the radius of interaction $r = \max_{1 \leq l \leq k} |v_l|$.

As shown by Doukhan and Lang (2002), there exists a stationary solution of (5) given by:

Z_{i} = \sum_{m = 0}^{\infty} \sum_{m_{1} + \dots + m_{k} = m} \frac{m!}{m_{1}! \dots m_{k}!} a_{1}^{m_{1}} \dots a_{k}^{m_{k}} ε_{i - (m_{1} v_{1} + \dots + m_{k} v_{k})}

with $m_i \in \mathbb{N}$. Thus, (5) can be represented as a linear random field $Z_i = \sum_{j \in \mathbb{Z}^d} w_j ε_{i-j}$, with

w_{j} = \sum_{m \geq |j| / r}^{\infty} \sum_{V(j, m)} \frac{m!}{m_{1}! \dots m_{k}!} a_{1}^{m_{1}} \dots a_{k}^{m_{k}},

where V(j, m) = {(m₁, …, m_k) ∈ ℕ^k: m₁ + … + m_k = m, m₁v₁ + … + m_kv_k = j}, observing that V(j, m) is empty if m < |j|/r. Observing further that

\sum_{m_{1} + \dots + m_{k} = m} \frac{m!}{m_{1}! \dots m_{k}!} ∣ a_{1}^{m_{1}} \dots a_{k}^{m_{k}} ∣ = a^{m}

the coefficients w_j can be bounded as

∣ w_{j} ∣ \leq \sum_{m \geq ∣ j ∣ / r}^{\infty} \sum_{m_{1} + \dots + m_{k} = m} \frac{m!}{m_{1}! \dots m_{k}!} ∣ a_{1}^{m_{1}} \dots a_{k}^{m_{k}} ∣ = \sum_{m \geq ∣ j ∣ / r}^{\infty} a^{m} = {(1 - a)}^{- 1} a^{∣ j ∣ / r} .

Rewriting the process $Z_i$ as $Z_i = \sum_{j \in \mathbb{Z}^d} w_{ij} ε_j$ with $w_{ij} = w_{i-j}$, it follows from Proposition 1 that the random field (5) is $L_p$-NED on ε with the NED coefficients $ψ(s) = \|ε\|_q (1 - a)^{-1} (1 - a^{1/r})^{-1} a^{s/r}$.
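The bound on the moving-average weights can be checked numerically in a small case. The sketch below assumes d = 1, k = 2, $v_1 = 1$, $v_2 = -1$ (so r = 1) and coefficients $a_1 = 0.3$, $a_2 = 0.2$; these values, the truncation level `m_max`, and the function name `ar_field_weights` are illustrative assumptions rather than anything specified in the paper.

```python
from math import comb

def ar_field_weights(a1, a2, j_max, m_max):
    """Truncated weights w_j of Z_i = a1*Z_{i-1} + a2*Z_{i+1} + eps_i.

    With v1 = 1 and v2 = -1, a pair (m1, m2) with m1 + m2 = m contributes
    the multinomial term C(m, m1) * a1**m1 * a2**m2 to w_j for j = m1 - m2.
    """
    w = {j: 0.0 for j in range(-j_max, j_max + 1)}
    for m in range(m_max + 1):
        for m1 in range(m + 1):
            m2 = m - m1
            j = m1 * 1 + m2 * (-1)
            if abs(j) <= j_max:
                w[j] += comb(m, m1) * a1 ** m1 * a2 ** m2
    return w

a1, a2 = 0.3, 0.2
a = abs(a1) + abs(a2)   # total coefficient mass, must be < 1
r = 1                   # radius of interaction max(|v1|, |v2|)

w = ar_field_weights(a1, a2, j_max=10, m_max=200)
# Geometric bound (1 - a)^(-1) * a^(|j| / r) from the text.
bound = {j: a ** (abs(j) / r) / (1 - a) for j in w}

# Multinomial identity used in the text, checked for m = 5.
identity_gap = abs(sum(comb(5, m1) * a1 ** m1 * a2 ** (5 - m1) for m1 in range(6)) - a ** 5)
```

In this configuration every computed weight lies below the geometric bound, which is what drives the exponential decay of the NED coefficients.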

The asymptotic theory of AR and MA(∞), satisfying the NED condition, can be useful in a variety of empirical applications where the data are cross-sectionally correlated. For instance, Pinkse et al.’s (2002) study of spatial price competition among firms that produce differentiated products is one example of an empirical application with cross-sectional dependence. They model the price charged by a firm at location i in the geographic (or product characteristic) space as a linear spatial autoregressive process. Another example is Fogli and Veldkamp (2011), who investigate spatial correlation in the female labor force participation (LFP). In particular, they consider a spatial autoregression of county i’s LFP rate on LFP rates of its neighbors. Dell (2010) examines the impact of mita, the forced mining labor system in colonial Peru and Bolivia, on household consumption and child growth across different regions. Although her model is not spatially autoregressive, the regressors and errors exhibit persistent spatial correlation, which can be modeled as a spatial NED process.

As discussed, an attractive feature of NED processes is that the NED property is preserved under transformations. Econometric estimators are usually defined either explicitly as functions of some underlying data generating processes or implicitly as optimizers of a function of the data generating process. Thus, if the data generating process is NED on some input process, the question arises under what conditions functions of random fields are also NED on the same input process.

Various conditions that ensure preservation of the NED property under transformations have been established in the time series literature by Gallant and White (1988), and Davidson (1994). In fact, these results extend to random fields. In particular, the NED property is preserved under summation and multiplication, and carries over from a random vector to its components and vice versa. For future reference, we now state some results for generalized classes of nonlinear functions. Their proofs are analogous to those in the time series literature, and therefore omitted.

Consider transformations of $Z_{i,n}$ given by a family of functions $g_{i,n}: \mathbb{R}^{p_z} \to \mathbb{R}$. The functions $g_{i,n}$ are assumed Borel-measurable for all n and i ∈ D. They are furthermore assumed to satisfy the following Lipschitz condition: For all $(z, z^{•}) \in \mathbb{R}^{p_z} \times \mathbb{R}^{p_z}$ and all $i \in D_n$ and n ≥ 1:

| g_{i,n}(z) - g_{i,n}(z^{•}) | \leq B_{i,n}(z, z^{•}) | z - z^{•} |

(6)

where $B_{i,n}(z, z^{•}): \mathbb{R}^{p_z} \times \mathbb{R}^{p_z} \to \mathbb{R}_{+}$ is Borel-measurable. Of course, this condition would be devoid of meaning without further restrictions on $B_{i,n}(z, z^{•})$, which are given in the next propositions.

Proposition 2

Suppose $g_{i,n}(·)$ satisfies Lipschitz condition (6) with $|B_{i,n}(z, z^{•})| \leq C < \infty$, for all $(z, z^{•}) \in \mathbb{R}^{p_z} \times \mathbb{R}^{p_z}$ and all i and n. If for p ≥ 1 the $\{Z_{i,n}\}$ are $L_p$-NED of size −λ on $\{ε_{i,n}\}$ with scaling factors $\{d_{i,n}\}$, then $g_{i,n}(Z_{i,n})$ is also $L_p$-NED of size −λ on $\{ε_{i,n}\}$ with scaling factors $\{2Cd_{i,n}\}$.⁵
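The paper omits the proof of this result. The following is a sketch of the standard argument behind the factor 2C, offered as an illustration rather than as the authors' own proof. It writes $\tilde{Z}^{s}_{i,n}$ for the conditional mean of $Z_{i,n}$ given the σ-field $\mathfrak{F}_{i,n}(s)$ generated by the input field in the s-neighborhood of i, and it assumes that $g_{i,n}(\tilde{Z}^{s}_{i,n})$ has a finite $L_p$-norm. Since $g_{i,n}(\tilde{Z}^{s}_{i,n})$ is measurable with respect to that σ-field, the Minkowski and conditional Jensen inequalities give:

```latex
\begin{align*}
\bigl\| g_{i,n}(Z_{i,n}) - E[g_{i,n}(Z_{i,n}) \mid \mathfrak{F}_{i,n}(s)] \bigr\|_p
 &\leq \bigl\| g_{i,n}(Z_{i,n}) - g_{i,n}(\tilde{Z}^{s}_{i,n}) \bigr\|_p
   + \bigl\| E[\, g_{i,n}(\tilde{Z}^{s}_{i,n}) - g_{i,n}(Z_{i,n}) \mid \mathfrak{F}_{i,n}(s)] \bigr\|_p \\
 &\leq 2 \bigl\| g_{i,n}(Z_{i,n}) - g_{i,n}(\tilde{Z}^{s}_{i,n}) \bigr\|_p \\
 &\leq 2 C \bigl\| Z_{i,n} - \tilde{Z}^{s}_{i,n} \bigr\|_p
  \leq 2 C \, d_{i,n} \, \psi(s).
\end{align*}
```

The NED coefficients ψ(s) are unchanged, which is why the size −λ carries over, and only the scaling factors pick up the constant 2C.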

Proposition 3

Suppose $g_{i,n}(·)$ satisfies Lipschitz condition (6) with

\sup_{s} \| B_{i,n}^{(s)} \|_{2} < \infty and \sup_{s} \| B_{i,n}^{(s)} | Z_{i,n} - \tilde{Z}_{i,n}^{s} | \|_{r} < \infty

(7)

for some r > 2, where $B_{i,n}^{(s)} = B_{i,n}(Z_{i,n}, \tilde{Z}_{i,n}^{s})$ and $\tilde{Z}_{i,n}^{s} = E[Z_{i,n} \mid \mathfrak{F}_{i,n}(s)]$. If $\|g_{i,n}(Z_{i,n})\|_2 < \infty$ and $Z_{i,n}$ is $L_2$-NED of size −λ on $\{ε_{i,n}\}$ with scaling factors $\{d_{i,n}\}$, then $g_{i,n}(Z_{i,n})$ is $L_2$-NED of size −λ(r − 2)/(2r − 2) on $\{ε_{i,n}\}$ with scaling factors

d_{i,n}' = d_{i,n}^{(r - 2) / (2r - 2)} \sup_{s} \| B_{i,n}^{(s)} \|_{2}^{(r - 2) / (2r - 2)} \| B_{i,n}^{(s)} | Z_{i,n} - \tilde{Z}_{i,n}^{s} | \|_{r}^{r / (2r - 2)}.

Thus, the NED property is hereditary under reasonably weak conditions. These conditions facilitate verification of the NED property in practical application. In particular, we will use them in the proof of asymptotic normality of spatial GMM estimators in Section 4.

3 Limit Theorems

3.1 Law of Large Numbers

In this section, we present an LLN for real valued random fields $Z = \{Z_{i,n}, i \in D_n, n \geq 1\}$ that are $L_1$-NED on some vector-valued α-mixing random field $ε = \{ε_{i,n}, i \in T_n, n \geq 1\}$ with the NED coefficients {ψ(s)} and scaling factors $\{d_{i,n}\}$, where $D_n \subseteq T_n \subseteq D$ and the lattice D satisfies Assumption 1. For ease of reference, we state below the definition of the α-mixing coefficients employed in the paper.

Definition 2

Let $\mathfrak{A}$ and $\mathfrak{B}$ be two sub-σ-algebras of $\mathfrak{F}$, and let

α(\mathfrak{A}, \mathfrak{B}) = \sup(| P(A \cap B) - P(A) P(B) |, A \in \mathfrak{A}, B \in \mathfrak{B}).

For $U \subseteq D_n$ and $V \subseteq D_n$, let $σ_n(U) = σ(ε_{i,n}; i \in U)$ and $α_n(U, V) = α(σ_n(U), σ_n(V))$. Then, the α-mixing coefficients for the random field ε are defined as:

\bar{α}(u, v, r) = \sup_{n} \sup_{U, V} (α_{n}(U, V), |U| \leq u, |V| \leq v, ρ(U, V) \geq r).

Dobrushin (1968) showed that weak dependence conditions based on the above mixing coefficients are satisfied by broad classes of random fields including Markov fields. In contrast to standard mixing numbers for time-series processes, the mixing coefficients for random fields depend not only on the distance between two datasets but also their sizes. To explicitly account for such dependence, it is furthermore assumed that

\bar{α} (u, v, r) \leq ϕ (u, v) \hat{α} (r)

(8)

where the function ϕ(u, v) is nondecreasing in each argument, and α̂(r) → 0 as r → ∞. The idea is to account separately for the two different aspects of dependence: (i) decay of dependence with the distance, and (ii) accumulation of dependence as the sample region expands. The two common choices of ϕ(u, v) in the random fields literature are

ϕ (u, v) = {(u + v)}^{τ}, τ \geq 0,

(9)

ϕ (u, v) = min {u, v} .

(10)

The above mixing conditions have been used extensively in the random fields literature including Takahata (1983), Nahapetian (1987), Bulinskii (1989), Bulinskii and Doukhan (1990), and Bradley (1993). They are satisfied by fairly large classes of random fields. Bradley (1993) provides examples of random fields satisfying conditions (8)–(9) with u = v and τ = 1. Furthermore, Bulinskii (1989) constructs moving average random fields satisfying the same conditions with τ = 1 for any given decay rate of coefficients α̂(r). Clearly, standard mixing coefficients in the time series literature are covered by conditions (8)–(9) when τ = 0.

Following the literature, we employ the above mixing conditions for the input random field, and impose further restrictions on the decay rates of the mixing coefficients.

Assumption 2

(a) There exist nonrandom positive constants $\{c_{i,n}, i \in D_n, n \geq 1\}$ such that $Z_{i,n}/c_{i,n}$ is uniformly $L_p$-bounded for some p > 1, i.e., $\sup_n \sup_{i \in D_n} E|Z_{i,n}/c_{i,n}|^p < \infty$.
(b) The α-mixing coefficients of the input field ε satisfy (8) for some function ϕ(u, v) which is nondecreasing in each argument, and some α̂(r) such that $\sum_{r = 1}^{\infty} r^{d - 1} \hat{α} (r) < \infty$.

Theorem 1

Let $\{D_n\}$ be a sequence of arbitrary finite subsets of D such that $|D_n| \to \infty$ as n → ∞, where D ⊂ ℝ^d, d ≥ 1, is as in Assumption 1, and let $T_n$ be a sequence of subsets of D such that $D_n \subseteq T_n$. Suppose further that $Z = \{Z_{i,n}, i \in D_n, n \geq 1\}$ is $L_1$-NED on $ε = \{ε_{i,n}, i \in T_n, n \geq 1\}$ with the scaling factors $d_{i,n}$. If Z and ε satisfy Assumption 2, then

\frac{1}{M_{n} |D_{n}|} \sum_{i \in D_{n}} (Z_{i,n} - E Z_{i,n}) \overset{L_{1}}{\to} 0,

where $M_n = \max_{i \in D_n} \max(c_{i,n}, d_{i,n})$.

This LLN can be used to establish uniform convergence of random functions by combining it with the generic ULLN given in Jenish and Prucha (2009), which transforms pointwise LLNs (at a given parameter value) into ULLNs.
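As an informal illustration of the LLN, the sketch below simulates a one-dimensional moving-average field with Bernoulli innovations, in the spirit of the discrete-innovation examples discussed in the Introduction, and evaluates the centered sample average. Everything here is an assumption made for illustration: the lattice is {0, …, n − 1}, the moving average is truncated at 30 lags, the coefficient is 0.5, moments are bounded so that $c_{i,n} = d_{i,n} = 1$, and the function names are hypothetical.

```python
import random

def simulate_ma_field(n, coef=0.5, lags=30, seed=0):
    """Z_i = sum_{k=0}^{lags} coef**k * eps_{i-k}, eps i.i.d. Bernoulli(1/2)."""
    rng = random.Random(seed)
    eps = [rng.randint(0, 1) for _ in range(n + lags)]
    weights = [coef ** k for k in range(lags + 1)]
    # eps is offset by `lags` so that every Z_i sees a full window of innovations.
    return [sum(w * eps[i + lags - k] for k, w in enumerate(weights)) for i in range(n)]

def centered_average(z, mean):
    """|D_n|^{-1} * sum_i (Z_i - E Z_i) with a common mean."""
    return sum(x - mean for x in z) / len(z)

lags, coef = 30, 0.5
true_mean = 0.5 * sum(coef ** k for k in range(lags + 1))
errors = {
    n: abs(centered_average(simulate_ma_field(n, coef, lags), true_mean))
    for n in (100, 20000)
}
```

For the larger sample the centered average should be close to zero, which is the behavior the theorem describes; a single simulated path is of course no substitute for the formal result.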

Assumption 2(a) is a standard moment condition employed in weak LLNs for dependent processes. It requires existence of moments of order slightly greater than 1. As in Theorem 2 below, $c_{i,n}$ and $d_{i,n}$ are the scaling factors that reflect the magnitudes of potentially trending moments. The case of variables with uniformly bounded moments is covered by setting $c_{i,n} = d_{i,n} = 1$. The LLN does not require any restrictions on the NED coefficients. In the time series literature, weak LLNs for NED processes have been obtained by Andrews (1988) and Davidson (1993), among others. Andrews (1988) derives an L₁-law for triangular arrays of L₁-mixingales. He then shows that NED processes are L₁-mixingales, and hence satisfy his LLN. Davidson (1993) extends the latter result to processes with trending moments.

3.2 Central Limit Theorem

In this section, we present a CLT for real valued random fields $Z = \{Z_{i,n}, i \in D_n, n \geq 1\}$ that are $L_2$-NED on some vector-valued α-mixing random field $ε = \{ε_{i,n}, i \in T_n, n \geq 1\}$ with the NED coefficients {ψ(s)} and scaling factors $\{d_{i,n}\}$, where $D_n \subseteq T_n \subseteq D$ and the lattice D satisfies Assumption 1. We will use the following notation:

S_{n} = \sum_{i \in D_{n}} Z_{i,n}; σ_{n}^{2} = var(S_{n}).

The CLT relies on the following assumptions.

Assumption 3

The α-mixing coefficients of ε satisfy (8) and (9) for some τ ≥ 0 and α̂(r), such that for some δ > 0

\sum_{r = 1}^{\infty} r^{d (τ_{*} + 1) - 1} {\hat{α}}^{\frac{δ}{2 (2 + δ)}} (r) < \infty,

(11)

where τ_* = δτ/(2 + δ).

Assumption 3 restricts the dependence structure of the input process ε. Note that if τ = 1 this assumption also covers the case where ϕ(u, v) is given by (10).
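As a worked illustration of condition (11), suppose (hypothetically; the paper does not impose this form) that the mixing coefficients decay polynomially, $\hat{α}(r) = r^{-β}$ for some β > 0. The summand in (11) is then a power of r, and the series converges exactly when that power is below −1:

```latex
\begin{align*}
d(\tau_* + 1) - 1 - \frac{\beta\delta}{2(2+\delta)} < -1
\quad\Longleftrightarrow\quad
\beta > \frac{2(2+\delta)\, d\, (\tau_* + 1)}{\delta}
      = \frac{2d\,(2 + \delta + \delta\tau)}{\delta},
\end{align*}
```

where the last equality uses $τ_* = δτ/(2 + δ)$. For example, with d = 2, τ = 1 and δ = 1 the requirement is β > 16, so the condition demands fairly fast decay when only slightly more than second moments are available.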

Assumption 4

(a) (Uniform $L_{2+δ}$ integrability) There exists an array of positive constants $\{c_{i,n}\}$ such that
$\lim_{k \to \infty} \sup_{n} \sup_{i \in D_{n}} E [|Z_{i,n} / c_{i,n}|^{2 + δ} 1(|Z_{i,n} / c_{i,n}| > k)] = 0,$

where 1(·) is the indicator function and δ > 0 is as in Assumption 3.
(b) $\inf_{n} |D_{n}|^{-1} M_{n}^{-2} σ_{n}^{2} > 0$, where $M_n = \max_{i \in D_n} c_{i,n}$.
(c) NED coefficients satisfy $\sum_{r = 1}^{\infty} r^{d - 1} ψ(r) < \infty$.
(d) NED scaling factors satisfy $\sup_{n} \sup_{i \in D_{n}} c_{i,n}^{-1} d_{i,n} \leq C < \infty$.

Assumptions 4(a),(b) are standard in the limit theory of mixing processes, e.g., Wooldridge (1986), Davidson (1992), de Jong (1997), and Jenish and Prucha (2009). Assumption 4(a) is satisfied if $Z_{i,n}/c_{i,n}$ are uniformly $L_p$-bounded for p > 2 + δ, i.e., $\sup_{n, i \in D_n} \|Z_{i,n}/c_{i,n}\|_p < \infty$.

Assumption 4(b) is an asymptotic negligibility condition that ensures that no single summand influences disproportionately the entire sum. In the case of uniformly $L_{2+δ}$-bounded fields, 4(b) reduces to $\liminf_{n \to \infty} |D_{n}|^{-1} σ_{n}^{2} > 0$, as is, e.g., maintained in Bolthausen (1982). Assumption 4(c) controls the size of the NED coefficients, which measure the error in the approximation of $Z_{i,n}$ by ε. Intuitively, the approximation errors have to decline sufficiently fast with each successive approximation. Assumption 4(c) is satisfied if $ψ(r) = O(r^{-d-γ})$ for some γ > 0, i.e., ψ(r) is of size −d. Finally, Assumption 4(d) is a technical condition, which ensures that the order of magnitude of the NED scaling factors does not exceed that of the 2 + δ moments. For instance, suppose the constant $c_{i,n}$ can be chosen as $c_{i,n} = \|Z_{i,n}\|_{2+δ}$, and the NED scaling numbers as $d_{i,n} \leq 2 \|Z_{i,n}\|_2$. Then Assumption 4(d) is satisfied, since by Lyapunov’s inequality, $\|Z_{i,n}\|_2 \leq \|Z_{i,n}\|_{2+δ}$. This condition has also been used by de Jong (1997) and Davidson (1992).

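Theorem 2

Suppose $\{D_n\}$ is a sequence of finite subsets such that $|D_n| \to \infty$ as n → ∞ and $\{T_n\}$ is a sequence of subsets such that $D_n \subseteq T_n \subseteq D$ of the lattice D satisfying Assumption 1. Let $Z = \{Z_{i,n}, i \in D_n, n \geq 1\}$ be a real valued zero-mean random field that is $L_2$-NED on a vector-valued α-mixing random field $ε = \{ε_{i,n}, i \in T_n, n \geq 1\}$. Suppose Assumptions 3 and 4 hold, then

σ_{n}^{-1} S_{n} \Rightarrow N(0, 1).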

Theorem 2 contains the CLT for α-mixing random fields given in Jenish and Prucha (2009) as a special case. It also contains as a special case the CLT for time series NED processes of Wooldridge (1986), see Theorem 3.13 and Corollary 4.4.

Theorem 2 can be easily extended to vector-valued fields using the standard Cramér-Wold device.

Corollary 1

Suppose $\{D_n\}$ is a sequence of finite subsets such that $|D_n| \to \infty$ as n → ∞ and $\{T_n\}$ is a sequence of subsets such that $D_n \subseteq T_n \subseteq D$ of the lattice D satisfying Assumption 1. Let $Z = \{Z_{i,n}, i \in D_n, n \geq 1\}$ with $Z_{i,n} \in \mathbb{R}^k$ be a zero-mean random field that is $L_2$-NED on a vector-valued α-mixing random field $ε = \{ε_{i,n}, i \in T_n, n \geq 1\}$. Suppose Assumptions 3 and 4 hold with $|Z_{i,n}|$ denoting the Euclidean norm of $Z_{i,n}$ and $σ_{n}^{2}$ replaced by $λ_{\min}(Σ_n)$, where $Σ_n = Var(S_n)$ and $λ_{\min}(·)$ is the smallest eigenvalue, then

Σ_{n}^{-1/2} S_{n} \Rightarrow N(0, I_{k}).

Furthermore, $\sup_n |D_n|^{-1} λ_{\max}(Σ_n) < \infty$, where $λ_{\max}(·)$ denotes the largest eigenvalue.

4 Large Sample Properties of Spatial GMM Estimators

We now apply the limit theorems of the previous section to establish the large sample properties of spatial GMM estimators under a reasonably general set of assumptions that should cover a wide range of empirical problems. More specifically, our consistency and asymptotic normality results (i) maintain only that the spatial data process is NED on an α-mixing basis process to accommodate spatial lags in the data process as discussed above, (ii) allow for unevenly placed locations, and (iii) allow for the data process to be non-stationary, which will frequently be the case in empirical applications. We also give our results under a set of primitive sufficient conditions for easier interpretation by the applied researcher.⁶

We continue with the basic set-up of Section 2. Consider the moment function q_i_,_n: ℝ^p_z × Θ → ℝ^p_q, where Θ denotes the parameter space, and let θ₀_n ∈ Θ denote the parameter vector of interest (which we allow to depend on n for reasons of generality). Suppose the following moment conditions hold

E q_{i, n} (Z_{i, n}, θ_{0 n}) = 0.

(12)

Then, the corresponding spatial GMM estimator is defined as

{\hat{θ}}_{n} = arg min_{θ \in Θ} Q_{n} (ω, θ),

(13)

where Q_n: Ω × Θ → ℝ,

Q_{n} (ω, θ) = R_{n} {(θ)}^{'} P_{n} R_{n} (θ),

with R_n(θ) = |D_n|⁻¹Σ_{i∈D_n}q_i,n(Z_{i,_n}, θ), and where the P_n are some positive semidefinite weighting matrices. To show consistency, consider the following non-stochastic analogue of Q_n, say

{\bar{Q}}_{n} (θ) = {[{E R}_{n} (θ)]}^{'} P [{E R}_{n} (θ)],

(14)

where P denotes the probability limit of P_n. Given the moment condition (12), E [R_n(θ₀_n)] = 0, the functions Q̄_n are minimized at θ₀_n. In proving consistency, we follow the classical approach; see, e.g., Gallant and White (1988) or Pötscher and Prucha (1997) for more recent expositions. In particular, given identifiable uniqueness of θ₀_n we establish, loosely speaking, convergence of the minimizers θ̂_n to the minimizers θ₀_n by establishing convergence of the objective function Q_n(ω, θ) to its non-stochastic analogue Q̄_n(θ) uniformly over the parameter space.
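The definitions in (12) and (13) can be sketched in a toy setting. The example below assumes a scalar parameter, i.i.d. data (so no spatial dependence), the illustrative moment function q(z, θ) = (z − θ, z² − θ² − 1), an identity weighting matrix, and a grid search over a compact parameter space. None of these choices come from the paper; they only show how R_n and Q_n fit together.

```python
import random

def sample_moments(data, theta):
    """R_n(theta): sample average of q(z, theta) = (z - theta, z^2 - theta^2 - 1)."""
    n = len(data)
    return [sum(z - theta for z in data) / n,
            sum(z * z - theta * theta - 1.0 for z in data) / n]

def objective(data, theta, weight):
    """Q_n(theta) = R_n(theta)' P_n R_n(theta)."""
    r = sample_moments(data, theta)
    return sum(r[a] * weight[a][b] * r[b] for a in range(2) for b in range(2))

def gmm_grid(data, weight, lo=-3.0, hi=3.0, steps=601):
    """Minimize Q_n over an evenly spaced grid on the compact set [lo, hi]."""
    grid = [lo + (hi - lo) * k / (steps - 1) for k in range(steps)]
    return min(grid, key=lambda th: objective(data, th, weight))

rng = random.Random(1)
data = [rng.gauss(1.0, 1.0) for _ in range(500)]   # true theta_0 = 1, unit variance
identity = [[1.0, 0.0], [0.0, 1.0]]
estimate = gmm_grid(data, identity)
print(estimate)
```

A grid search is used here only because it makes the role of the compact parameter space explicit; in practice a numerical optimizer would normally replace it.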

Throughout the sequel, we maintain the following assumptions regarding the parameter space, the GMM objective function and the unknown parameters θ₀_n.

Assumption 5

(a) The parameter space Θ is a compact metric space with metric ν.
(b) The functions q_i_,_n: ℝ^p_z × Θ → ℝ^p_q are Borel measurable for each θ ∈ Θ, and continuous on Θ for each z ∈ ℝ^p_z.
(c) The elements of the p_q × p_q real matrices P_n are measurable, and P_n is positive semidefinite. Furthermore, P = p lim P_n exists and P is positive definite.
(d) The minimizers θ₀_n are identifiably unique in the sense that for every ε > 0, lim inf_{n→∞} [inf_{θ∈Θ: ν(θ, θ₀_n)≥ε} [ER_n(θ)]′[ER_n(θ)]] > 0.

Compactness of the parameter space as maintained in Assumption 5(a) is typical for the GMM literature. Assumptions 5(b),(c) imply that Q_n(·, θ) is measurable for all θ ∈ Θ, and Q_n(ω, ·) is continuous on Θ. Given those assumptions, the existence of measurable functions θ̂_n that solve (13) follows, e.g., from Lemma A3 of Pötscher and Prucha (1997).

Since P is positive definite, it is readily seen that Assumption 5(d) implies that for every ε > 0:

lim inf_{n \to \infty} [inf_{θ \in Θ : ν (θ, θ_{0 n}) \geq ε} ∣ {\bar{Q}}_{n} (θ) - {\bar{Q}}_{n} (θ_{0 n}) ∣] > 0,

observing that Q̄_n(θ₀_n) = 0. Thus, under Assumption 5(d) the minimizers θ₀_n are identifiably unique; compare, e.g., Gallant and White (1988), p.19. For interpretation, consider the important special case where θ₀_n = θ₀, ER_n(θ) does not depend on n, and is continuous in θ. In this case, identifiable uniqueness of θ₀ is equivalent to the assumption that θ₀ is the unique solution of the moment conditions, i.e., E [R_n(θ)] ≠ 0 for all θ ≠ θ₀; compare, e.g., Pötscher and Prucha (1997), p. 16.

4.1 Consistency

Given that the minimizers θ₀_n are identifiably unique, θ̂_n is a consistent estimator for θ₀_n if Q_n converges uniformly to Q̄_n, i.e., if ${sup}_{θ \in Θ} ∣ Q_{n} (ω, θ) - {\bar{Q}}_{n} (θ) ∣ \overset{p}{\to} 0$ as n → ∞; this follows immediately from, e.g., Pötscher and Prucha (1997), Lemma 3.1.

We now proceed by giving a set of primitive domination and Lipschitz type conditions for the moment functions that ensure uniform convergence of Q_n to Q̄_n. The conditions are in line with those maintained in the general literature on M-estimation, e.g., Andrews (1987), Gallant and White (1988), and Pötscher and Prucha (1989,1994).

Definition 3

Let f_i_,_n: ℝ^p_z × Θ → ℝ^p_q be Borel measurable functions for each θ ∈ Θ, then:

(a) The random functions f_i_,_n(Z_i_,_n; θ) are said to be p-dominated on Θ for some p > 1 if sup_n sup_{i∈D_n} E sup_{θ∈Θ} |f_i_,_n(Z_i_,_n; θ)|^p < ∞.
(b) The random functions f_i_,_n(Z_i_,_n; θ) are said to be Lipschitz in the parameter θ on Θ if
$| f_{i, n} (Z_{i, n}, θ) - f_{i, n} (Z_{i, n}, θ^{•}) | \leq L_{i, n} (Z_{i, n}) h (ν (θ, θ^{•})) \text{ a.s.},$ (15)

for all θ, θ^• ∈ Θ and i ∈ D_n, n ≥ 1, where h is a nonrandom function with h(x) ↓ 0 as x ↓ 0, and L_i_,_n are random variables with $lim {sup}_{n \to \infty} {∣ D_{n} ∣}^{- 1} \sum_{i \in D_{n}} {E L}_{i, n}^{η} < \infty$ for some η > 0.

Towards establishing consistency of θ̂_n we furthermore maintain the following moment and mixing assumptions.

Assumption 6

The moment functions q_i_,_n(Z_i_,_n; θ) have the following properties:

(a) They are p-dominated on Θ for p = 2.
(b) They are uniformly L₁-NED on ε = {ε_i_,_n, i ∈ T_n, n ≥ 1}, where D_n ⊆ T_n ⊆ D, and ε is α-mixing with α-mixing coefficients satisfying the conditions stated in Assumption 2(b).
(c) They are Lipschitz in the parameter θ on Θ.

Assumption 6(a) implies that sup_{n,i∈D_n}E |q_i_,_n(Z_i_,_n; θ)|^p < ∞ for each θ ∈ Θ. Assumption 6(b) then allows us to apply the LLN given as Theorem 1 above to the sample moments R_n(θ) = |D_n|⁻¹Σ_{i∈D_n}q_i_,_n(Z_i_,_n, θ).

To verify Assumption 6(b) one can use either Proposition 2 or Proposition 3 to imply this condition from the lower level assumption that the data Z_i_,_n are L₁-NED. For example, the q_i_,_n are L₁-NED, if the Z_i_,_n are L₁-NED and satisfy the Lipschitz condition of Proposition 2. Note that no restrictions on the sizes of the NED coefficients are required.

Assumption 6(c) ensures stochastic equicontinuity of q_i_,_n w.r.t. θ. Stochastic equicontinuity jointly with Assumption 6(a) and the pointwise LLN enable us to invoke the ULLN of Jenish and Prucha (2009) to prove uniform convergence of the sample moments, which in turn is used to establish that Q_n converges uniformly to Q̄_n. A sufficient condition for Assumption 6(c) is existence of integrable partial derivatives of q_i_,_n w.r.t. θ if θ ∈ ℝ^k.

Our consistency result for the spatial GMM estimator given by (13) is summarized by the next theorem.

Theorem 3 (Consistency)

Suppose {D_n} is a sequence of finite sets of D such that |D_n| → ∞ as n → ∞, where D ⊂ ℝ^d, d ≥ 1 is as in Assumption 1. Suppose further that Assumptions 5 and 6 hold. Then

ν ({\hat{θ}}_{n}, θ_{0 n}) \overset{p}{\to} 0 \text{ as } n \to \infty,

and Q̄_n(θ) is uniformly equicontinuous on Θ.

4.2 Asymptotic Normality

We next establish that the spatial GMM estimator defined by (13) is asymptotically normally distributed. For that purpose, we need a stronger set of assumptions than for consistency, including differentiability of the moment functions in θ. It proves helpful to adopt the notation ∇_θ in place of ∂/∂θ.⁷

Assumption 7

(a) The minimizers θ₀_n lie uniformly in the interior of Θ with Θ ⊆ ℝ^k. Furthermore E [R_n(θ₀_n)] = 0.
(b) The functions q_i_,_n: ℝ^p_z × Θ → ℝ^p_q are continuously differentiable w.r.t. θ for each z ∈ ℝ^p_z.
(c) The functions q_i_,_n(Z_i_,_n; θ₀_n) are uniformly L₂-NED on ε of size −d, and for some δ′ > 0, sup_{n,i∈D_n}E |q_i_,_n(Z_i_,_n; θ₀_n)|^{2+δ′} < ∞. The functions ∇_θq_i_,_n(Z_i_,_n; θ) are uniformly L₁-NED on ε.
(d) The input process ε = {ε_i_,_n, i ∈ T_n, n ≥ 1}, where D_n ⊆ T_n ⊆ D, is α-mixing and the mixing coefficients satisfy Assumption 3 for some δ < δ′, where δ′ is the same as in Assumption 7(c).
(e) The functions ∇_θq_i_,_n are p-dominated on Θ for some p > 1.
(f) The functions ∇_θq_i_,_n are Lipschitz in θ on Θ.
(g) inf_n λ_min(|D_n|⁻¹ Σ_n) > 0 where Σ_n = Var [Σ_{i∈D_n}q_i_,_n(Z_i_,_n, θ₀_n)].
(h) inf_n λ_min [E∇_θR_n(θ₀_n)′ ∇_θR_n(θ₀_n)] > 0.

The first part of Assumption 7(a) is needed to ensure that the estimator θ̂_n lies in the interior of Θ with probability tending to one, and facilitates the application of the mean value theorem to R_n(θ̂_n) around θ₀_n. The second part states in essence that the moment conditions are correctly specified. Its violation will generally invalidate the limiting distribution result.

Assumptions 7(c),(d),(g) enable us to apply the CLT for vector-valued NED processes given above as Corollary 1 to R_n(θ₀_n). Some low level sufficient conditions for Assumption 7(c) are given below. To establish asymptotic normality, we also need uniform convergence of ∇_θR_n on Θ, which is implied via Assumptions 7(c),(d),(e),(f). Finally, Assumption 7(h) ensures positive definiteness of the variance-covariance matrix of the GMM estimator.

Given the above assumptions, we have the following asymptotic normality result for the spatial GMM estimator defined by (13).

Theorem 4

Suppose {D_n} is a sequence of finite sets of D such that |D_n| → ∞ as n → ∞, where D ⊂ ℝ^d, d ≥ 1 is as in Assumption 1. Suppose further that Assumptions 5–7 hold. Then

(A_{n}^{-1} B_{n} B_{n}^{'} A_{n}^{-1 '})^{-1/2} |D_{n}|^{1/2} (\hat{θ}_{n} - θ_{0 n}) \Rightarrow N (0, I_{k}),

where

A_{n} = [E \nabla_{θ} R_{n} (θ_{0 n})]^{'} P [E \nabla_{θ} R_{n} (θ_{0 n})] \text{ and } B_{n} = [E \nabla_{θ} R_{n} (θ_{0 n})]^{'} P [|D_{n}|^{-1} \Sigma_{n}]^{1/2} .

Moreover, $∣ A_{n} ∣ = O (1); ∣ A_{n}^{- 1} ∣ = O (1); ∣ B_{n} ∣ = O (1); | {(B_{n} B_{n}^{'})}^{- 1} | = O (1)$ and hence, θ̂_n is |D_n|^1/2-consistent for θ₀_n.
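The sandwich form A_n^{−1}B_nB_n′A_n^{−1}′ can be sketched for the simplest case of one parameter and two moment conditions. In the code below `g` stands in for E∇_θR_n(θ₀_n), `p` for P, and `v` for |D_n|^{−1}Σ_n; all numerical values and helper names are hypothetical. The comparison with the weighting P = V^{−1} reflects a standard GMM efficiency fact and is not a result stated in this paper.

```python
def quad(u, m, w):
    """u' M w for vectors u, w (lists) and a matrix M (list of rows)."""
    return sum(u[a] * m[a][b] * w[b] for a in range(len(u)) for b in range(len(w)))

def matmul(x, y):
    """Plain matrix product for lists of rows."""
    return [[sum(x[i][k] * y[k][j] for k in range(len(y))) for j in range(len(y[0]))]
            for i in range(len(x))]

def inverse_2x2(m):
    """Inverse of a nonsingular 2x2 matrix."""
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    return [[m[1][1] / det, -m[0][1] / det], [-m[1][0] / det, m[0][0] / det]]

def sandwich_variance(g, p, v):
    """Scalar case of A^{-1} B B' A^{-1}: A = g' P g and B B' = g' P V P g."""
    a = quad(g, p, g)
    pvp = matmul(matmul(p, v), p)
    return quad(g, pvp, g) / (a * a)

g = [1.0, 2.0]                      # hypothetical expected gradient of the moments
v = [[2.0, 0.5], [0.5, 1.0]]        # hypothetical normalized variance of the moments
identity = [[1.0, 0.0], [0.0, 1.0]]

var_identity = sandwich_variance(g, identity, v)
var_efficient = sandwich_variance(g, inverse_2x2(v), v)
print(var_identity, var_efficient)
```

With P = V^{−1} the sandwich collapses to (g′V^{−1}g)^{−1}, which in this example is no larger than the variance under the identity weighting.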

As remarked above, relative to the existing literature Theorem 4 allows for nonstationary processes and only assumes that q_i_,_n and ∇_θq_i_,_n are NED on an α-mixing input process, rather than postulating that q_i_,_n and ∇_θq_i_,_n are α-mixing. As such, Theorem 4 should provide a basis for constructing confidence intervals and hypothesis testing in a wider range of spatial models.

Using Proposition 3, we now give some sufficient conditions for Assumption 7(c).

Assumption 8

The process {Z_i_,_n, i ∈ D_n ⊂ T_n, n ≥ 1} is uniformly L₂-NED on {ε_i_,_n, i ∈ T_n, n ≥ 1} of size − 2d(r − 1)/(r − 2) for some r > 2.

Assumption 9

For every sequence {θ₀_n} on Θ, the functions q_i_,_n(Z_i_,_n; θ₀_n) and ∇_θq_i_,_n(Z_i_,_n; θ₀_n) satisfy Lipschitz condition (6) in z, that is, for g_i_,_n = q_i_,_n or ∇_θq_i_,_n:

∣ g_{i, n} (z; θ_{n}) - g_{i, n} (z^{•}; θ_{n}) ∣ \leq B_{i, n} (z, z^{•}) ∣ z - z^{•} ∣ .

Furthermore, for the r > 2 as specified in Assumption 8,

sup_{n, i \in D_{n}} sup_{s} {‖ B_{i, n}^{(s)} ‖}_{2} < \infty and sup_{n, i \in D_{n}} sup_{s} {‖ B_{i, n}^{(s)} | Z_{i, n} - {\tilde{Z}}_{i, n}^{s} | ‖}_{r} < \infty

where $B_{i, n}^{(s)} = B_{i, n} (Z_{i, n}, {\tilde{Z}}_{i, n}^{s})$ with ${\tilde{Z}}_{i, n}^{s} = E [Z_{i, n} ∣ F_{i, n} (s)]$ .

5 Conclusion

The paper develops an asymptotic inference theory for a class of dependent nonstationary random fields that could be used in a wide range of econometric models with spatial dependence. More specifically, the paper extends the notion of near-epoch dependent (NED) processes used in the time series literature to spatial processes. This allows us to accommodate larger classes of dependent processes than mixing random fields. The class of NED random fields is “closed with respect to infinite transformations” and thus should be sufficiently broad for many applications of interest. In particular, it covers autoregressive and infinite moving average random fields as well as nonlinear functionals of mixing processes. The NED property is also compatible with considerable heterogeneity and is preserved under transformations under fairly mild conditions. Furthermore, a CLT and an LLN are derived for spatial processes that are NED on an α-mixing process. Apart from covering a larger class of dependent processes, these limit theorems also allow for arrays of nonstationary random fields on unevenly spaced lattices. Building on these limit results, the paper develops an asymptotic theory of spatial GMM estimators, which provides a basis for inference in a broad range of models with cross-sectional or spatial dependence.

Much of the random fields literature assumes that the process resides on an equally spaced grid. In contrast, and as in Jenish and Prucha (2009), we allow for locations to be unequally spaced. The implicit assumption of fixed locations seems reasonable for a large class of applications, especially in the short run. Still, an important direction for future work would be to extend the asymptotic theory to spatial processes with endogenous locations, while maintaining a set of assumptions that are reasonably easy to interpret.⁸ One possible approach may be to augment the contributions of the present paper with theory from point processes.

Acknowledgments

We would like to thank the Editor P.M. Robinson, the Associate Editor, and three anonymous referees for their valuable comments, which led to substantial improvement of the paper. We thank the participants of the Cowles Foundation Conference, Yale, June 2009, and the seminar participants at Columbia University for helpful discussions. This research benefitted from a University of Maryland Ann G. Wylie Dissertation Fellowship for the first author, and from financial support from the National Institute of Health through SBIR grant 1 R43 AG027622 for the second author.

A Appendix: Proofs for Sections 2 and 3

Throughout, let F_i_,_n(s) = σ(ε_j_,_n; j ∈ T_n: ρ(i, j) ≤ s) be the σ-field generated by the random vectors ε_j_,_n located in the s-neighborhood of location i. Furthermore, C denotes a generic constant that does not depend on n and may be different from line to line.

Proof of Proposition 1

The proof is available online on the authors’ webpages.

Proof of Theorem 1

Define Y_i_,_n = Z_i_,_n/M_n, then to prove the theorem, it suffices to show that ${∣ D_{n} ∣}^{- 1} \sum_{i \in D_{n}} (Y_{i, n} - {E Y}_{i, n}) \overset{L_{1}}{\to} 0$ . We first establish moment and mixing conditions for Y_i_,_n from those for Z_i_,_n. Observe that in light of the definition of M_n and Assumption 2(a)

sup_{n, i \in D_{n}} E {∣ Y_{i, n} ∣}^{p} \leq sup_{n, i \in D_{n}} E {∣ Z_{i, n} / c_{i, n} ∣}^{p} < \infty .

(A.1)

Thus, Y_i_,_n is uniformly L_p-bounded for p > 1. Let F_i_,_n(s) = σ(ε_j_,_n; j ∈ T_n: ρ(i, j) ≤ s). Since Z_i_,_n is L₁-NED on ε = {ε_i_,_n, i ∈ T_n, n ≥ 1}:

sup_{n, i \in D_{n}} {| | Y_{i, n} - E (Y_{i, n} ∣ F_{i, n} (s)) | |}_{1} \leq sup_{n, i \in D_{n}} M_{n}^{- 1} d_{i, n} ψ (s) \leq ψ (s),

(A.2)

observing that M_n = max_{i∈D_n}max(c_i_,_n, d_i_,_n). Thus Y_i_,_n is also L₁-NED on ε.

Next we show that for each given s > 0, the conditional mean $V_{i, n}^{s} = E (Y_{i, n} ∣ F_{i, n} (s))$ satisfies the assumptions of the L₁-norm LLN of Jenish and Prucha (2009, Theorem 3). Using the Jensen and Lyapunov inequalities gives for all s > 0, i ∈ D_n, n ≥ 1:

E {∣ V_{i, n}^{s} ∣}^{p} \leq E {E ({∣ Y_{i, n} ∣}^{p} ∣ F_{i, n} (s))} \leq sup_{n, i \in D_{n}} E {∣ Y_{i, n} ∣}^{p} < \infty .

So, $V_{i, n}^{s}$ is uniformly L_p-bounded for p > 1 and hence uniformly integrable. For each fixed s, $V_{i, n}^{s}$ is a measurable function of {ε_j_,_n; j ∈ T_n: ρ(i, j) ≤ s}. Observe that under Assumption 1 there exists a finite constant C such that the cardinality of the set {j ∈ T_n: ρ(i, j) ≤ s} is bounded by Cs^d; compare Lemma A.1 in Jenish and Prucha (2009). Hence,

\bar{α}_{V^{s}} (1, 1, r) \leq \begin{cases} 1, & r \leq 2 s, \\ \bar{α} (C s^{d}, C s^{d}, r - 2 s), & r > 2 s, \end{cases}

and thus in light of Assumption 2(b)

\sum_{r = 1}^{\infty} r^{d - 1} {\bar{α}}_{V^{s}} (1, 1, r) \leq \sum_{r = 1}^{2 s} r^{d - 1} + ϕ ({C s}^{d}, {C s}^{d}) \sum_{r = 1}^{\infty} {(r + 2 s)}^{d - 1} \hat{α} (r) < \infty .

The above shows that indeed, for each fixed s, $V_{i, n}^{s}$ satisfies the assumptions of the L₁-norm LLN of Jenish and Prucha (2009, Theorem 3). Therefore, for each s, we have

{‖ {∣ D_{n} ∣}^{- 1} \sum_{i \in D_{n}} [E (Y_{i, n} ∣ F_{i, n} (s)) - {E Y}_{i, n}] ‖}_{1} \to 0 as n \to \infty .

(A.3)
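The cardinality bound Cs^d used earlier in this proof can be checked by brute force in a simple case. The sketch assumes the integer lattice ℤ^d and the maximum-coordinate metric ρ(i, j) = max_l |i_l − j_l|; both are illustrative assumptions, since Assumption 1 and the metric are not restated in this section. The function name is hypothetical.

```python
from itertools import product

def neighborhood_size(d, s):
    """Number of sites j in Z^d with max_l |j_l| <= s, i.e. the s-neighborhood of 0."""
    reach = int(s)
    return sum(1 for _ in product(range(-reach, reach + 1), repeat=d))

# For s >= 1 the count (2s + 1)^d is at most 3^d * s^d, so C = 3^d works here.
for d in (1, 2, 3):
    for s in range(1, 7):
        count = neighborhood_size(d, s)
        print(d, s, count, 3 ** d * s ** d)
```

On an unevenly spaced lattice the count would differ, but a minimum-distance condition still limits how many sites fit into a ball of radius s.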

Furthermore observe that from (A.2) and the Minkowski inequality

{‖ {∣ D_{n} ∣}^{- 1} \sum_{i \in D_{n}} (Y_{i, n} - E (Y_{i, n} ∣ F_{i, n} (s))) ‖}_{1} \leq ψ (s) .

(A.4)

Given (A.3) and (A.4), and observing that lim_s_→∞ ψ(s) = 0 it now follows that

\begin{array}{l} \lim_{n \to \infty} \| |D_{n}|^{-1} \sum_{i \in D_{n}} (Y_{i, n} - E Y_{i, n}) \|_{1} = \lim_{s \to \infty} \lim_{n \to \infty} \| |D_{n}|^{-1} \sum_{i \in D_{n}} (Y_{i, n} - E Y_{i, n}) \|_{1} \\ \leq \lim_{s \to \infty} \limsup_{n \to \infty} \| |D_{n}|^{-1} \sum_{i \in D_{n}} (Y_{i, n} - E (Y_{i, n} \mid F_{i, n} (s))) \|_{1} + \lim_{s \to \infty} \lim_{n \to \infty} \| |D_{n}|^{-1} \sum_{i \in D_{n}} (E (Y_{i, n} \mid F_{i, n} (s)) - E Y_{i, n}) \|_{1} = 0. \end{array}

This completes the proof of the LLN.

The proof of the CLT builds on Ibragimov and Linnik (1971), pp. 352–355, and makes use of the following lemmata:

Lemma A.1 (Brockwell and Davis (1991), Proposition 6.3.9)

Let Y_n, n = 1, 2, … and V_ns, s = 1, 2, …; n = 1, 2, …, be random vectors such that

(i) V_ns ⇒ V_s as n → ∞ for each s = 1, 2, …,
(ii) V_s ⇒ V as s → ∞, and
(iii) lim_{s→∞} lim sup_{n→∞} P(|Y_n − V_ns| > ε) = 0 for every ε > 0.

Then Y_n ⇒ V as n → ∞.

Lemma A.2 (Ibragimov and Linnik (1971))

Let L_p(F₁) and L_p(F₂) denote, respectively, the class of F₁-measurable and F₂-measurable random variables ξ satisfying ||ξ||_p < ∞. Let X ∈ L_p(F₁) and Y ∈ L_q(F₂). Then, for any 1 ≤ p, q, r < ∞ such that p⁻¹ + q⁻¹ + r⁻¹ = 1,

∣ Cov (X, Y) ∣ < 4 α^{1 / r} (F_{1}, F_{2}) {| | X | |}_{p} {| | Y | |}_{q}

where α(F₁, F₂) = sup_{A∈F₁, B∈F₂} |P(AB) − P(A)P(B)|.

To prove the CLT for NED random fields, we first establish some moment inequalities and a slightly modified version of the CLT for mixing fields developed in Jenish and Prucha (2009). It is helpful to introduce the following notation. Let X = {X_i_,_n, i ∈ D_n, n ≥ 1} be a random field, then ||X||_q: = sup_{n,i∈D_n}||X_i_,_n||_q for q ≥ 1.

Lemma A.3

Let {X_i_,_n} be uniformly L₂-NED on a random field {ε_i_,_n} with α-mixing coefficients ᾱ(u, v, r) ≤ (u + v)^τ α̂(r), τ ≥ 0. Let S_n = Σ_{i∈D_n}X_i_,_n and suppose that the NED coefficients of {X_i_,_n} satisfy $\sum_{r = 1}^{\infty} r^{d - 1} ψ (r) < \infty$ and ||X||_2+δ < ∞ for some δ > 0. Then,

(a) |Cov (X_i_,_n, X_j_,_n)| ≤ ||X||₂₊_δ {C₁||X||₂₊_δ [h/3]^{dτ_*} α̂^{δ/(2+δ)}([h/3]) + C₂ψ([h/3])}, where h = ρ(i, j) and τ_* = δτ/(2 + δ). If $\sum_{r = 1}^{\infty} r^{d (τ_{*} + 1) - 1} {\hat{α}}^{δ / (2 + δ)} (r) < \infty$ , then for some C < ∞, not depending on n,
$Var (S_{n}) \leq C ∣ D_{n} ∣ .$
(b) |Cov (X_i_,_n, X_j_,_n)| ≤ ||X||₂ {C₃||X||₂₊_δ [h/3]^{dτ^*} α̂^{δ/(4+2δ)}([h/3]) + C₄ψ([h/3])}, where h = ρ(i, j) and τ^* = δτ/(4 + 2δ). If $\sum_{r = 1}^{\infty} r^{d (τ^{*} + 1) - 1} {\hat{α}}^{δ / (4 + 2 δ)} (r) < \infty$ , then for some C < ∞, not depending on n,
$Var (S_{n}) \leq C {| | X | |}_{2} ∣ D_{n} ∣ .$

Proof of Lemma A.3

The proof is available online on the authors’ webpages.

Theorem A.1

Suppose {D_n} is a sequence of finite subsets of D, satisfying Assumption 1, with |D_n| → ∞ as n → ∞. Suppose further that {ε_i_,_n; i ∈ D_n, n ∈ ℕ} is an array of zero-mean random variables with α-coefficients ᾱ(u, v, r) ≤ C(u + v)^τ α̂(r) for some constants C < ∞ and τ ≥ 0. Suppose for some δ > 0 and γ > 0

lim_{k \to \infty} sup_{n, i \in D_{n}} E [{∣ ε_{i, n} ∣}^{2 + δ} 1 (∣ ε_{i, n} ∣ > k)] = 0

and

\hat{α} (r) = O (r^{- d (2 μ + 1) - γ})

with μ = max {τ, 1/δ}, and suppose $lim {inf}_{n \to \infty} {∣ D_{n} ∣}^{- 1} σ_{n}^{2} > 0$ , then

σ_{n}^{- 1} \sum_{i \in D_{n}} ε_{i, n} \Rightarrow N (0, 1) .

where $σ_{n}^{2} = Var (\sum_{i \in D_{n}} ε_{i, n})$ .

Proof of Theorem A.1

The proof is available online on the authors’ webpages.

The above CLT is in essence a variant of the CLT for α-mixing random fields given as Corollary 1 of Theorem 1 in Jenish and Prucha (2009), applied to mixing coefficients of the type ᾱ(u, v, r) ≤ C(u + v)^τα̂(r), τ ≥ 0.

Proof of Theorem 2

Since the proof is lengthy it is broken into steps.

1. Transition from Z_i_,_n to Y_i_,_n = Z_i_,_n/M_n

Let M_n = max_{i∈D_n}c_i_,_n and Y_i_,_n = Z_i_,_n/M_n. Also, let $σ_{Z, n}^{2} = Var [\sum Z_{i, n}]$ and $σ_{Y, n}^{2} = Var [\sum Y_{i, n}] = M_{n}^{- 2} σ_{Z, n}^{2}$ . Since

σ_{Y, n}^{- 1} \sum_{i \in D_{n}} Y_{i, n} = σ_{Z, n}^{- 1} \sum_{i \in D_{n}} Z_{i, n},

to prove the theorem, it suffices to show that $σ_{Y, n}^{- 1} \sum_{i \in D_{n}} Y_{i, n} \Rightarrow N (0, 1)$ . Therefore, it proves convenient to switch notation from the text and to define

S_{n} = \sum_{i \in D_{n}} Y_{i, n}, σ_{n}^{2} = Var (S_{n}) .

That is, in the following, S_n denotes Σ_{i∈D_n}Y_i_,_n rather than Σ_{i∈D_n}Z_i_,_n, and $σ_{n}^{2}$ denotes the variance of Σ_{i∈D_n}Y_i_,_n rather than of Σ_{i∈D_n}Z_i_,_n. We now establish moment and mixing conditions for Y_i_,_n from the assumptions of the theorem. Observe that by definition of M_n

1 (∣ Y_{i, n} ∣ > k) = 1 (∣ Z_{i, n} / M_{n} ∣ > k) \leq 1 (∣ Z_{i, n} / c_{i, n} ∣ > k),

and hence

E [{∣ Y_{i, n} ∣}^{2 + δ} 1 (∣ Y_{i, n} ∣ > k)] \leq E [{∣ Z_{i, n} / c_{i, n} ∣}^{2 + δ} 1 (∣ Z_{i, n} / c_{i, n} ∣ > k)]

so that Assumption 4(a) implies that

lim_{k \to \infty} sup_{n, i \in D_{n}} E [{∣ Y_{i, n} ∣}^{2 + δ} 1 (∣ Y_{i, n} ∣ > k)] = 0.

(A.5)

Hence, Y_i_,_n is also uniformly L₂₊_δ bounded. Let ||Y||₂₊_δ = sup_{n,i∈D_n}||Y_i_,_n||₂₊_δ. Further, note that

\begin{array}{l} {| | Y_{i, n} - E (Y_{i, n} ∣ F_{i, n} (s)) | |}_{2} = M_{n}^{- 1} {| | Z_{i, n} - E (Z_{i, n} ∣ F_{i, n} (s)) | |}_{2} \\ \leq c_{i, n}^{- 1} d_{i, n} ψ (s) \leq C ψ (s) \end{array}

(A.6)

since ${sup}_{n, i \in D_{n}} c_{i, n}^{- 1} d_{i, n} \leq C < \infty$ , by assumption. Thus, Y_i_,_n is uniformly L₂-NED on ε with the NED coefficients ψ(m). Finally, observe that by Assumption 4(b):

inf_{n} {∣ D_{n} ∣}^{- 1} σ_{n}^{2} > 0.

(A.7)

Hence, there exists 0 < B < ∞ such that for all n

B ∣ D_{n} ∣ \leq σ_{n}^{2} .

(A.8)

2. Decomposition of Y_i_,_n

For any fixed s > 0, decompose Y_i_,_n as

Y_{i, n} = ξ_{i, n}^{s} + η_{i, n}^{s}

where $ξ_{i, n}^{s} = E (Y_{i, n} ∣ F_{i, n} (s)), η_{i, n}^{s} = Y_{i, n} - ξ_{i, n}^{s}$ . Let

\begin{array}{l} S_{n, s} = \sum_{i \in D_{n}} ξ_{i, n}^{s}; {\tilde{S}}_{n, s} = \sum_{i \in D_{n}} η_{i, n}^{s} \\ σ_{n, s}^{2} = Var [S_{n, s}], {\tilde{σ}}_{n, s}^{2} = Var [{\tilde{S}}_{n, s}] \end{array}

Repeated use of the Minkowski inequality yields:

∣ σ_{n} - σ_{n, s} ∣ \leq {\tilde{σ}}_{n, s}, ∣ σ_{n} - {\tilde{σ}}_{n, s} ∣ \leq σ_{n, s} .

(A.9)
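For convenience, the two inequalities in (A.9) can be written out as follows. This is only a sketch of the standard triangle (Minkowski) inequality argument in L₂, using S_n = S_{n,s} + S̃_{n,s} and the fact that each σ is the L₂-norm of the corresponding centered sum.

```latex
% Sketch: sigma_n, sigma_{n,s}, tilde sigma_{n,s} are L_2 norms of centered sums.
\begin{align*}
\sigma_{n} &= \| (S_{n,s} - E S_{n,s}) + (\tilde{S}_{n,s} - E \tilde{S}_{n,s}) \|_{2}
  \leq \sigma_{n,s} + \tilde{\sigma}_{n,s}, \\
\sigma_{n,s} &= \| (S_{n} - E S_{n}) - (\tilde{S}_{n,s} - E \tilde{S}_{n,s}) \|_{2}
  \leq \sigma_{n} + \tilde{\sigma}_{n,s}, \\
\tilde{\sigma}_{n,s} &= \| (S_{n} - E S_{n}) - (S_{n,s} - E S_{n,s}) \|_{2}
  \leq \sigma_{n} + \sigma_{n,s}.
\end{align*}
% Lines 1 and 2 give |sigma_n - sigma_{n,s}| <= tilde sigma_{n,s};
% lines 1 and 3 give |sigma_n - tilde sigma_{n,s}| <= sigma_{n,s}.
```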

Observe that

E [E (Y_{i, n} \mid F_{i, n} (s)) \mid F_{i, n} (m)] = \begin{cases} E (Y_{i, n} \mid F_{i, n} (s)), & m \geq s, \\ E (Y_{i, n} \mid F_{i, n} (m)), & m < s, \end{cases}

and hence

\begin{array}{l} \| η_{i, n}^{s} - E (η_{i, n}^{s} \mid F_{i, n} (m)) \|_{2} = \| Y_{i, n} - E [Y_{i, n} \mid F_{i, n} (s)] - E [Y_{i, n} \mid F_{i, n} (m)] + E [E (Y_{i, n} \mid F_{i, n} (s)) \mid F_{i, n} (m)] \|_{2} \\ = \begin{cases} \| Y_{i, n} - E (Y_{i, n} \mid F_{i, n} (m)) \|_{2} \leq C ψ (m), & \text{if } m \geq s, \\ \| Y_{i, n} - E (Y_{i, n} \mid F_{i, n} (s)) \|_{2} \leq C ψ (s) \leq C ψ (m), & \text{if } m < s, \end{cases} \end{array}

since by definition the sequence ψ(m) is non-increasing. Thus, for any fixed s > 0, { $η_{i, n}^{s}$ } is uniformly L₂-NED on ε with the same NED coefficients ψ(m) as the random field {Y_i_,_n}. Furthermore, as shown in the proof of Lemma A.3, { $η_{i, n}^{s}$ } is also uniformly L₂₊_δ bounded.

3. Bounds for the Variances of ΣY_i_,_n and $\sum η_{i, n}^{s}$

First note that in light of Assumption 3, and observing that τ^* = δτ/(4 + 2δ) ≤ τ_* =δτ/(2 + δ) and ${\hat{α}}^{δ / (2 + δ)} (r) \leq {\hat{α}}^{\frac{δ}{2 (2 + δ)}} (r)$ we have

\begin{array}{l} \sum_{r = 1}^{\infty} r^{d (τ_{*} + 1) - 1} {\hat{α}}^{δ / (2 + δ)} (r) \leq \sum_{r = 1}^{\infty} r^{d (τ_{*} + 1) - 1} {\hat{α}}^{\frac{δ}{2 (2 + δ)}} (r) < \infty, \\ \sum_{r = 1}^{\infty} r^{d (τ^{*} + 1) - 1} {\hat{α}}^{δ / (4 + 2 δ)} (r) \leq \sum_{r = 1}^{\infty} r^{d (τ_{*} + 1) - 1} {\hat{α}}^{\frac{δ}{2 (2 + δ)}} (r) < \infty . \end{array}

Using part (a) of Lemma A.3 with X_i_,_n = Y_i_,_n and recalling (A.8), we have

B ∣ D_{n} ∣ \leq σ_{n}^{2} = Var (S_{n}) \leq C ∣ D_{n} ∣ .

for some B > 0. Using part (b) of Lemma A.3 with $X_{i, n} = η_{i, n}^{s}$ we have

{\tilde{σ}}_{n, s}^{2} = Var ({\tilde{S}}_{n, s}) \leq C ∣ D_{n} ∣ {| | η_{i, n}^{s} | |}_{2} = C ∣ D_{n} ∣ ψ (s)

(A.10)

in light of (A.6). Hence,

lim_{s \to \infty} lim sup_{n \to \infty} \frac{{\tilde{σ}}_{n, s}^{2}}{σ_{n}^{2}} \leq C lim_{s \to \infty} ψ (s) = 0.

(A.11)

Furthermore, by (A.9) we have

lim_{s \to \infty} lim sup_{n \to \infty} | 1 - \frac{σ_{n, s}}{σ_{n}} | \leq lim_{s \to \infty} lim sup_{n \to \infty} \frac{{\tilde{σ}}_{n, s}}{σ_{n}} = 0

(A.12)

and hence for all s ≥ 1 and n ≥ 1

\frac{σ_{n, s}}{σ_{n}} \leq C < \infty .

(A.13)

4. CLT for $\sum_{i \in D_{n}} ξ_{i, n}^{s}$

We now show that for any fixed s > 0, $ξ_{i, n}^{s}$ satisfies Theorem A.1.

First, since ${sup}_{n, i \in D_{n}} E [{∣ ξ_{i, n}^{s} ∣}^{2 + δ}] < \infty$ , the process { $ξ_{i, n}^{s}$ } is uniformly L₂₊_δ_′-integrable for δ′ = δ/2, i.e.,

lim_{k \to \infty} sup_{n, i \in D_{n}} E [{∣ ξ_{i, n}^{s} ∣}^{2 + δ / 2} 1 (∣ ξ_{i, n}^{s} ∣ > k)] = 0.

Second, since $ξ_{i, n}^{s}$ is a measurable function of ε_i_,_n for any u, v ∈ ℕ and r > 2s

{\bar{α}}_{ξ} (u, v, r) \leq \bar{α} ({uMs}^{d}, {vMs}^{d}, r - 2 s) \leq C {(u + v)}^{τ} \hat{α} (r - 2 s)

We next show that α̂(r) = O(r^{−d(2μ+1)−γ}) for μ = max {τ, 2/δ} and some γ > 0. By assumption,

\sum_{r = 1}^{\infty} r^{d (τ_{*} + 1) - 1} {\hat{α}}^{\frac{δ}{2 (2 + δ)}} (r) < \infty,

where τ_* = δτ/(2 + δ), which implies

\hat{α} (r) = o (r^{- 2 d (2 + δ) (τ_{*} + 1) / δ}) = o (r^{- d [2 (τ + 2 / δ) + 1] - d}) = o (r^{- d [2 μ + 1] - d})

since μ ≤ τ + 2/δ for μ = max {τ, 2/δ}. Thus, α̂(r) = O(r^{−d(2μ+1)−γ}) for γ = d.
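The exponent arithmetic in this step can be verified with exact rational numbers. The sketch below simply evaluates the three exponents appearing in the display for a few hypothetical values of d, δ and τ; the function names are illustrative.

```python
from fractions import Fraction

def implied_exponent(d, delta, tau):
    """2 d (2 + delta)(tau_* + 1) / delta with tau_* = delta tau / (2 + delta)."""
    tau_star = delta * tau / (2 + delta)
    return 2 * d * (2 + delta) * (tau_star + 1) / delta

def expanded_exponent(d, delta, tau):
    """d [2 (tau + 2 / delta) + 1] + d, the second form in the display."""
    return d * (2 * (tau + 2 / delta) + 1) + d

def required_exponent(d, delta, tau, gamma):
    """d (2 mu + 1) + gamma with mu = max(tau, 2 / delta), as in Theorem A.1."""
    mu = max(tau, 2 / delta)
    return d * (2 * mu + 1) + gamma

cases = [(1, Fraction(1, 2), Fraction(0)),
         (2, Fraction(1), Fraction(3, 2)),
         (3, Fraction(4), Fraction(5))]
for d, delta, tau in cases:
    lhs = implied_exponent(d, delta, tau)
    mid = expanded_exponent(d, delta, tau)
    req = required_exponent(d, delta, tau, gamma=d)
    print(d, delta, tau, lhs, mid, req)
```

In every case the first two exponents coincide and are at least as large as the exponent required with γ = d, which is what the step uses.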

We next show that for sufficiently large s,

0 < lim inf_{n \to \infty} {∣ D_{n} ∣}^{- 1} σ_{n, s}^{2} .

By (A.8),

B^{1 / 2} \leq \inf_{n} |D_{n}|^{- 1 / 2} σ_{n} .

Since lim_s_→∞ ψ(s) = 0, there exists s_* such that in light of (A.10) for all s ≥ s_*,

{∣ D_{n} ∣}^{- 1 / 2} {\tilde{σ}}_{n, s} \leq C ψ^{1 / 2} (s) \leq B^{1 / 2} / 2.

(A.14)

Hence by (A.9) for all s ≥ s_*, |D_n|^−1/2(σ_n− σ̃_n_,_s) ≤ |D_n|^−1/2σ_n_,_s, and thus inf_n |D_n|^−1/2σ_n_,_s ≥ inf_n |D_n|^−1/2σ_n − sup_n |D_n|^−1/2 σ̃_n_,_s. Using (A.7) and (A.14), we have

lim inf_{n \to \infty} {∣ D_{n} ∣}^{- 1 / 2} σ_{n, s} \geq B^{1 / 2} - \frac{B^{1 / 2}}{2} = \frac{B^{1 / 2}}{2} > 0

Thus, for all s ≥ s_*,

σ_{n, s}^{- 1} \sum_{i \in D_{n}} ξ_{i, n}^{s} \Rightarrow N (0, 1) as n \to \infty .

(A.15)

Since the first s_* terms do not affect the analysis below we take in the following s_* = 1.

5. CLT for $σ_{n}^{- 1} \sum_{i \in D_{n}} Y_{i, n}$

Finally, using Lemma A.1 we now show that, given the maintained NED assumption, the just established CLT in (A.15) for the approximators $ξ_{i, n}^{s}$ can be carried over to the Y_i_,_n. Define

W_{n} = σ_{n}^{- 1} \sum_{i \in D_{n}} Y_{i, n}, V_{n s} = σ_{n}^{- 1} \sum_{i \in D_{n}} ξ_{i, n}^{s}, W_{n} - V_{n s} = σ_{n}^{- 1} \sum_{i \in D_{n}} η_{i, n}^{s}

so that we can exploit Lemma A.1 to prove that

W_{n} = σ_{n}^{- 1} \sum_{i \in D_{n}} Y_{i, n} \Rightarrow V ~ N (0, 1) .

We first verify condition (iii) of Lemma A.1. By Markov’s inequality and (A.11), for every ε > 0 we have

\begin{array}{l} lim_{s \to \infty} lim sup_{n \to \infty} P (∣ W_{n} - V_{n s} ∣ > ε) = lim_{s \to \infty} lim sup_{n \to \infty} P (| σ_{n}^{- 1} \sum_{i \in D_{n}} η_{i, n}^{s} | > ε) \\ \leq lim_{s \to \infty} lim sup_{n \to \infty} \frac{{\tilde{σ}}_{n, s}^{2}}{ε^{2} σ_{n}^{2}} = 0. \end{array}

Next observe that $V_{n s} = \frac{σ_{n, s}}{σ_{n}} [σ_{n, s}^{- 1} \sum_{i \in D_{n}} ξ_{i, n}^{s}]$ . We proceed to show W_n ⇒ V by contradiction. For that purpose let 𝒫 be the set of all probability measures on (ℝ, ℬ), with ℬ the Borel σ-field, and observe that we can metrize 𝒫 by, e.g., the Prokhorov distance d(., .). Let μ_n and μ be the probability measures corresponding to W_n and V, respectively, then W_n ⇒ V, or μ_n ⇒ μ, iff d(μ_n, μ) → 0 as n → ∞. Now suppose μ_n does not converge to μ. Then for some ε > 0 there exists a subsequence {n(m)} such that d(μ_{n(m)}, μ) > ε for all n(m). By (A.13), we have 0 ≤ σ_{n,s}/σ_n ≤ C < ∞ for all s, n ≥ 1. Hence, 0 ≤ σ_{n(m),s}/σ_{n(m)} ≤ C < ∞ for all n(m). Consequently, for s = 1 there exists a subsubsequence {n(m(l₁))} such that σ_{n(m(l₁)),1}/σ_{n(m(l₁))} → p(1) as l₁ → ∞. For s = 2, there exists a subsubsubsequence {n(m(l₁(l₂)))} such that σ_{n(m(l₁(l₂))),2}/σ_{n(m(l₁(l₂)))} → p(2) as l₂ → ∞. The argument can be repeated for s = 3, 4, …. Now construct a subsequence {n_l} such that n₁ corresponds to the first element of {n(m(l₁))}, n₂ corresponds to the second element of {n(m(l₁(l₂)))}, and so on, then

lim_{l \to \infty} \frac{σ_{n_{l}, s}}{σ_{n_{l}}} = p (s)

(A.16)

for s = 1, 2, … Given (A.15), it follows that as l → ∞

V_{n_{l} s} \Rightarrow V_{s} ~ N (0, p^{2} (s)) .

Then, it follows from (A.12) that

lim_{s \to \infty} ∣ p (s) - 1 ∣ \leq lim_{s \to \infty} lim_{l \to \infty} | p (s) - \frac{σ_{n_{l}, s}}{σ_{n_{l}}} | + lim_{s \to \infty} sup_{n \geq 1} | \frac{σ_{n, s}}{σ_{n}} - 1 | = 0.

Thus V_s ⇒ V and hence, by Lemma A.1, W_{n_l} ⇒ V ~ N(0, 1) as l → ∞. Since {n_l} ⊆ {n(m)} this contradicts the assumption that d(μ_{n(m)}, μ) > ε for all n(m). This completes the proof of the CLT.

Proof of Corollary 1

The proof is available online on the authors’ webpages.

B Appendix: Proofs for Section 4

Proof of Theorem 3

We show that

sup_{θ \in Θ} ∣ Q_{n} (θ) - {\bar{Q}}_{n} (θ) ∣ \overset{p}{\to} 0

(B.1)

as n → ∞. As discussed in the text, given that the θ₀_n are identifiably unique it then follows immediately from, e.g., Pötscher and Prucha (1997), Lemma 3.1, that $ν ({\hat{θ}}_{n}, θ_{0 n}) \overset{p}{\to} 0$ as claimed.

We start by proving that

|D_{n}|^{- 1} \sum_{i \in D_{n}} [q_{i, n} (Z_{i, n}, θ) - E q_{i, n} (Z_{i, n}, θ)] \overset{p}{\to} 0

(B.2)

for each θ ∈ Θ, by applying the LLN given as Theorem 1 in the text to q_i_,_n(Z_i_,_n, θ). By Assumption 6(a), we have sup_{n,i∈D_n}E |q_i_,_n(Z_i_,_n, θ)|^p < ∞ for each θ ∈ Θ and p = 2, which verifies Assumption 2(a) for q_i_,_n(Z_i_,_n, θ) with c_i_,_n = 1. By Assumption 6(b), the q_i_,_n(Z_i_,_n, θ) are uniformly L₁-NED on ε, and hence w.l.o.g. we can take d_i_,_n = 1. Furthermore, by Assumption 6(b) the input process ε is α-mixing, and the α-mixing coefficients satisfy Assumption 2(b). Consequently (B.2) follows directly from Theorem 1 applied to q_i_,_n(Z_i_,_n, θ).

Next, by Proposition 1 of Jenish and Prucha (2009), Assumption 6(c) implies that q_i_,_n is L₀ stochastically equicontinuous on Θ, i.e., for every ε > 0

lim sup_{n \to \infty} \frac{1}{∣ D_{n} ∣} \sum_{i \in D_{n}} P (sup_{ν (θ, θ^{•}) \leq δ} ∣ q_{i, n} (Z_{i, n}, θ) - q_{i, n} (Z_{i, n}, θ^{•}) ∣ > ε) \to 0 as δ \to 0.

Furthermore, in light of Assumption 6(a) the q_i_,_n(Z_i_,_n, θ) clearly satisfy the domination condition postulated by the ULLN in Jenish and Prucha (2009), stated as Theorem 2 in that paper. Given that we have already verified the pointwise LLN in (B.2) it now follows directly from that theorem that

sup_{θ \in Θ} ∣ R_{n} (θ) - {E R}_{n} (θ) ∣ \overset{p}{\to} 0

(B.3)

with R_n(θ) = |D_n|⁻¹Σ_{i∈D_n}q_i,n(Z_i_,_n, θ), and that the ER_n(θ) are uniformly equicontinuous on Θ in the sense that

lim sup_{n \to \infty} sup_{θ^{•} \in Θ} sup_{ν (θ, θ^{•}) \leq δ} ∣ {E R}_{n} (θ) - {E R}_{n} (θ^{•}) ∣ \to 0 as δ \to 0.

To prove (B.1), observe that

\begin{array}{l} sup_{θ \in Θ} ∣ Q_{n} (θ) - {\bar{Q}}_{n} (θ) ∣ \leq sup_{θ \in Θ} ∣ R_{n} {(θ)}^{'} P R_{n} (θ) - E R_{n} {(θ)}^{'} P E R_{n} (θ) ∣ + sup_{θ \in Θ} ∣ R_{n} {(θ)}^{'} (P_{n} - P) R_{n} (θ) ∣ \\ \leq sup_{θ \in Θ} ∣ R_{n} {(θ)}^{'} P R_{n} (θ) - E R_{n} {(θ)}^{'} P E R_{n} (θ) ∣ + 2 sup_{θ \in Θ} {∣ R_{n} (θ) ∣}^{2} ∣ P_{n} - P ∣ . \end{array}

(B.4)
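The first inequality in (B.4) is an add-and-subtract step. A minimal sketch, assuming that the objective function (13) has the quadratic form Q_n(θ) = R_n(θ)′P_nR_n(θ) with nonstochastic analogue Q̄_n(θ) = [ER_n(θ)]′P[ER_n(θ)] (these forms are read off from the terms in (B.4), not restated from the text), and that |·| denotes a vector norm together with a compatible matrix norm:

```latex
\begin{aligned}
Q_n(\theta) - \bar{Q}_n(\theta)
  &= R_n(\theta)' P_n R_n(\theta) - [E R_n(\theta)]' P [E R_n(\theta)] \\
  &= \bigl\{ R_n(\theta)' P R_n(\theta) - [E R_n(\theta)]' P [E R_n(\theta)] \bigr\}
     + R_n(\theta)' (P_n - P) R_n(\theta), \\
\bigl| R_n(\theta)' (P_n - P) R_n(\theta) \bigr|
  &\le |R_n(\theta)|^{2} \, |P_n - P| .
\end{aligned}
```

Taking suprema over Θ and applying the triangle inequality then gives (B.4). Under the norm assumption made here the constant 2 is not needed for this step, but it does no harm.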

Furthermore, observe that by Assumption 6(a) we have E [sup_{θ∈Θ} |q_{i,n}(Z_{i,n}, θ)|] ≤ K and E [(sup_{θ∈Θ} |q_{i,n}(Z_{i,n}, θ)|)²] ≤ K for some finite constant K. Thus

sup_{θ \in Θ} E ∣ R_{n} (θ) ∣ \leq E sup_{θ \in Θ} ∣ R_{n} (θ) ∣ \leq {∣ D_{n} ∣}^{- 1} \sum_{i \in D_{n}} E sup_{θ \in Θ} ∣ q_{i, n} (Z_{i, n}, θ) ∣ \leq K

(B.5)

\begin{array}{l} E sup_{θ \in Θ} {∣ R_{n} (θ) ∣}^{2} \leq {∣ D_{n} ∣}^{- 2} \sum_{i, j \in D_{n}} E [sup_{θ \in Θ} ∣ q_{i, n} (Z_{i, n}, θ) ∣ sup_{θ \in Θ} ∣ q_{j, n} (Z_{j, n}, θ) ∣] \\ \leq {∣ D_{n} ∣}^{- 2} \sum_{i, j \in D_{n}} {[E {(sup_{θ \in Θ} ∣ q_{i, n} (Z_{i, n}, θ) ∣)}^{2}]}^{1 / 2} {[E {(sup_{θ \in Θ} ∣ q_{j, n} (Z_{j, n}, θ) ∣)}^{2}]}^{1 / 2} \leq K . \end{array}

(B.6)

Now consider the first term on the r.h.s. of the last inequality of (B.4). From (B.5) we see that |ER_n(θ)| ≤ E |R_n(θ)| ≤ K, so that the ER_n(θ) take on their values in a compact set. Given (B.3) it now follows immediately from part (a) of Lemma 3.3 of Pötscher and Prucha (1997) that

sup_{θ \in Θ} ∣ R_{n} {(θ)}^{'} P R_{n} (θ) - E R_{n} {(θ)}^{'} P E R_{n} (θ) ∣ \overset{p}{\to} 0.

(B.7)

Next we show that also the second term on the r.h.s. of the last inequality of (B.4) converges in probability to zero. To see that this is indeed the case observe that sup_{θ∈Θ} |R_n(θ)|² = O_p(1) in light of (B.6) and $∣ P_{n} - P ∣ \overset{p}{\to} 0$ by assumption. This completes the proof of (B.1).
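The O_p(1) claim and the resulting convergence can be spelled out with Markov's inequality. The following is a sketch in which M and η are arbitrary positive constants introduced only for illustration:

```latex
\begin{aligned}
P\Bigl( \sup_{\theta \in \Theta} |R_n(\theta)|^{2} > M \Bigr)
  &\le \frac{1}{M}\, E \sup_{\theta \in \Theta} |R_n(\theta)|^{2}
   \le \frac{K}{M} \quad \text{for all } n, \\
P\Bigl( \sup_{\theta \in \Theta} |R_n(\theta)|^{2} \, |P_n - P| > \eta \Bigr)
  &\le P\Bigl( \sup_{\theta \in \Theta} |R_n(\theta)|^{2} > M \Bigr)
     + P\Bigl( |P_n - P| > \frac{\eta}{M} \Bigr)
   \le \frac{K}{M} + o(1).
\end{aligned}
```

Letting first n → ∞ and then M → ∞ shows that the product converges to zero in probability.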

Having established that ER_n(θ) are uniformly equicontinuous on Θ, the uniform equicontinuity of Q̄_n(θ) on Θ follows immediately from Lemma 3.3(b) of Pötscher and Prucha (1997).

Proof of Theorem 4

Clearly by Theorem 3 we have θ̂_n − θ₀_n = o_p(1).

Step 1

The estimators θ̂_n corresponding to the objective function (13) satisfy the following first order conditions:

\nabla_{θ} R_{n} {({\hat{θ}}_{n})}^{'} P_{n} [∣ D_{n} ∣^{1 / 2} R_{n} ({\hat{θ}}_{n})] = o_{p} (1) .

(B.8)

The o_p(1) term on the r.h.s. reflects that the first order conditions may not hold if θ̂_n falls onto the boundary of Θ, and that the probability of that event goes to zero as n → ∞, since the θ₀_n are uniformly in the interior of Θ by Assumption 7(a). If θ̂_n is in the interior of Θ, then the l.h.s. of (B.8) is zero.

Taking the mean value expansion of R_n(θ̂_n) about θ₀_n yields

R_{n} ({\hat{θ}}_{n}) = R_{n} (θ_{0 n}) + \nabla_{θ} R_{n} ({\tilde{θ}}_{n}) ({\hat{θ}}_{n} - θ_{0 n})

(B.9)

where θ̃_n ∈ Θ is between θ̂_n and θ₀_n (component-by-component). Let

{\hat{A}}_{n} = \nabla_{θ} R_{n} {({\hat{θ}}_{n})}^{'} P_{n} \nabla_{θ} R_{n} ({\tilde{θ}}_{n}) and {\hat{B}}_{n} = \nabla_{θ} R_{n} {({\hat{θ}}_{n})}^{'} P_{n} {[{∣ D_{n} ∣}^{- 1} Σ_{n}]}^{1 / 2},

then combining (B.8) and (B.9) gives

\begin{array}{l} {∣ D_{n} ∣}^{1 / 2} ({\hat{θ}}_{n} - θ_{0 n}) = [I - {\hat{A}}_{n}^{+} {\hat{A}}_{n}] {∣ D_{n} ∣}^{1 / 2} ({\hat{θ}}_{n} - θ_{0 n}) - {\hat{A}}_{n}^{+} \nabla_{θ} R_{n} {({\hat{θ}}_{n})}^{'} P_{n} [{∣ D_{n} ∣}^{1 / 2} R_{n} (θ_{0 n})] + {\hat{A}}_{n}^{+} o_{p} (1) \\ = [I - {\hat{A}}_{n}^{+} {\hat{A}}_{n}] {∣ D_{n} ∣}^{1 / 2} ({\hat{θ}}_{n} - θ_{0 n}) - {\hat{A}}_{n}^{+} {\hat{B}}_{n} [Σ_{n}^{- 1 / 2} ∣ D_{n} ∣ R_{n} (θ_{0 n})] + {\hat{A}}_{n}^{+} o_{p} (1), \end{array}

(B.10)

where ${\hat{A}}_{n}^{+}$ denotes the generalized inverse of Â_n.
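For readers who want the intermediate algebra behind (B.10), the steps can be sketched as follows. The sketch assumes that Σ_n is nonsingular and that [·]^{1/2} denotes a symmetric square root, so that [|D_n|⁻¹Σ_n]^{1/2} Σ_n^{−1/2} |D_n| = |D_n|^{1/2} I:

```latex
\begin{aligned}
o_p(1)
  &= \nabla_\theta R_n(\hat\theta_n)' P_n
     \bigl[ |D_n|^{1/2} R_n(\hat\theta_n) \bigr] \\
  &= \nabla_\theta R_n(\hat\theta_n)' P_n
     \bigl[ |D_n|^{1/2} R_n(\theta_{0n}) \bigr]
     + \hat{A}_n \, |D_n|^{1/2} (\hat\theta_n - \theta_{0n}), \\
\hat{A}_n^{+} \hat{A}_n \, |D_n|^{1/2} (\hat\theta_n - \theta_{0n})
  &= - \hat{A}_n^{+} \nabla_\theta R_n(\hat\theta_n)' P_n
     \bigl[ |D_n|^{1/2} R_n(\theta_{0n}) \bigr]
     + \hat{A}_n^{+} o_p(1), \\
\nabla_\theta R_n(\hat\theta_n)' P_n \, |D_n|^{1/2} R_n(\theta_{0n})
  &= \nabla_\theta R_n(\hat\theta_n)' P_n
     \bigl[ |D_n|^{-1} \Sigma_n \bigr]^{1/2}
     \bigl[ \Sigma_n^{-1/2} |D_n| R_n(\theta_{0n}) \bigr]
   = \hat{B}_n \bigl[ \Sigma_n^{-1/2} |D_n| R_n(\theta_{0n}) \bigr].
\end{aligned}
```

The first two lines substitute (B.9) into (B.8); the third premultiplies by the generalized inverse. Adding [I − Â_n⁺Â_n] |D_n|^{1/2}(θ̂_n − θ₀_n) to both sides of the third line and using the last line yields the two expressions in (B.10).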

Step 2

By Assumption 7(c) the q_{i,n}(Z_{i,n}, θ₀_n) are uniformly L₂-NED and uniformly L_{2+δ}-integrable with c_{i,n} = 1. Given Assumptions 7(d),(g) it is now readily seen that the process {q_{i,n}(Z_{i,n}, θ₀_n), i ∈ D_n} satisfies all assumptions of the CLT for vector-valued NED processes, given as Corollary 1 in the text, with c_{i,n} = 1. (Note that Assumption 4(d) is satisfied automatically since the q_{i,n}(Z_{i,n}, θ₀_n) are uniformly L₂-NED.) Hence,

Σ_{n}^{- 1 / 2} ∣ D_{n} ∣ R_{n} (θ_{0 n}) = Σ_{n}^{- 1 / 2} \sum_{i \in D_{n}} q_{i, n} (Z_{i, n}, θ_{0 n}) \Rightarrow N (0, I_{p_{q}}),

(B.11)

with Σ_n = Var [Σ_{i∈D_n} q_{i,n}(Z_{i,n}, θ₀_n)] and sup_n λ_max [|D_n|⁻¹ Σ_n] < ∞.

Step 3

By Assumptions 7(c),(d),(e) the functions ∇_θ q_{i,n}(Z_{i,n}, θ) satisfy for each θ ∈ Θ the LLN given as Theorem 1 in the text with c_{i,n} = 1, observing that Assumption 2(b) is implied by Assumption 3. By an argument analogous to that used in the proof of consistency we have

{∣ D_{n} ∣}^{- 1} \sum_{i \in D_{n}} (\nabla_{θ} q_{i, n} (Z_{i, n}, θ) - E \nabla_{θ} q_{i, n} (Z_{i, n}, θ)) \overset{p}{\to} 0 .

By Proposition 1 of Jenish and Prucha (2009), Assumption 7(f) implies that the ∇_θ q_{i,n}(Z_{i,n}, θ) are uniformly L₀-equicontinuous on Θ. Given L₀-equicontinuity and Assumption 7(e), we have by the ULLN of Jenish and Prucha (2009):

sup_{θ \in Θ} ∣ \nabla_{θ} R_{n} (θ) - E \nabla_{θ} R_{n} (θ) ∣ \overset{p}{\to} 0.

(B.12)

and furthermore, the E∇_θR_n(θ) are uniformly equicontinuous on Θ in the sense:

lim sup_{n \to \infty} sup_{θ^{'} \in Θ} sup_{∣ θ - θ^{'} ∣ < δ} ∣ E \nabla_{θ} R_{n} (θ) - E \nabla_{θ} R_{n} (θ^{'}) ∣ \to 0

(B.13)

as δ → 0. In light of (B.12) and (B.13), and given that θ̂_n − θ₀_n = o_p(1) and hence θ̃_n − θ₀_n = o_p(1), it follows further that

\nabla_{θ} R_{n} ({\hat{θ}}_{n}) - E \nabla_{θ} R_{n} (θ_{0 n}) \overset{p}{\to} 0, and \nabla_{θ} R_{n} ({\tilde{θ}}_{n}) - E \nabla_{θ} R_{n} (θ_{0 n}) \overset{p}{\to} 0.
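The step from (B.12) and (B.13) to the last display is a triangle-inequality argument. A sketch for θ̂_n, writing Ḡ_n(θ) for the nonrandom function θ ↦ E∇_θR_n(θ) (a shorthand introduced here only for illustration):

```latex
\begin{aligned}
\bigl| \nabla_\theta R_n(\hat\theta_n) - \bar{G}_n(\theta_{0n}) \bigr|
  &\le \sup_{\theta \in \Theta}
       \bigl| \nabla_\theta R_n(\theta) - \bar{G}_n(\theta) \bigr|
     + \bigl| \bar{G}_n(\hat\theta_n) - \bar{G}_n(\theta_{0n}) \bigr| , \\
\bigl| \bar{G}_n(\hat\theta_n) - \bar{G}_n(\theta_{0n}) \bigr|
  &\le \sup_{\theta' \in \Theta} \, \sup_{|\theta - \theta'| < \delta}
       \bigl| \bar{G}_n(\theta) - \bar{G}_n(\theta') \bigr|
     \quad \text{on the event } \{ |\hat\theta_n - \theta_{0n}| < \delta \}.
\end{aligned}
```

The first term on the r.h.s. of the first line is o_p(1) by (B.12). By (B.13) the bound in the second line can be made arbitrarily small for large n by choosing δ small, and the event on which it holds has probability tending to one since θ̂_n − θ₀_n = o_p(1). The same argument applies with θ̃_n in place of θ̂_n.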

Hence,

{\hat{A}}_{n} - A_{n} \overset{p}{\to} 0 and {\hat{B}}_{n} - B_{n} \overset{p}{\to} 0,

(B.14)

where Â_n and B̂_n are as defined above, and

A_{n} = {[E \nabla_{θ} R_{n} (θ_{0 n})]}^{'} P [E \nabla_{θ} R_{n} (θ_{0 n})] and B_{n} = {[E \nabla_{θ} R_{n} (θ_{0 n})]}^{'} P {[{∣ D_{n} ∣}^{- 1} Σ_{n}]}^{1 / 2} .

Step 4

Given Assumptions 7(e),(f), and since P is positive definite, we have |A_n| = O(1) and $∣ A_{n}^{- 1} ∣ = O (1)$ , respectively. Hence by, e.g., Lemma F1 in Pötscher and Prucha (1997) we have Â_n = O_p(1), ${\hat{A}}_{n}^{+} = O_{p} (1)$ , Â_n is nonsingular with probability tending to one, and ${\hat{A}}_{n}^{+} - A_{n}^{- 1} \overset{p}{\to} 0$ . In light of the above it follows from (B.10) that

\begin{array}{l} {∣ D_{n} ∣}^{1 / 2} ({\hat{θ}}_{n} - θ_{0 n}) = - {\hat{A}}_{n}^{+} {\hat{B}}_{n} [Σ_{n}^{- 1 / 2} ∣ D_{n} ∣ R_{n} (θ_{0 n})] + o_{p} (1) \\ = - A_{n}^{- 1} B_{n} [Σ_{n}^{- 1 / 2} ∣ D_{n} ∣ R_{n} (θ_{0 n})] + o_{p} (1) \end{array}

Recalling that sup_n λ_max [|D_n|⁻¹ Σ_n] < ∞, Assumption 7(e) implies that |B_n| = O(1). In light of Assumptions 7(g),(h) $B_{n} B_{n}^{'}$ is invertible and furthermore ${(B_{n} B_{n}^{'})}^{- 1} = O (1)$ . Thus $| {(A_{n}^{- 1} B_{n} B_{n}^{'} {A_{n}^{- 1}}^{'})}^{- 1} | \leq {∣ A_{n} ∣}^{2} ∣ {(B_{n} B_{n}^{'})}^{- 1} ∣ = O (1)$ and therefore

{(A_{n}^{- 1} B_{n} B_{n}^{'} {A_{n}^{- 1}}^{'})}^{- 1 / 2} {∣ D_{n} ∣}^{1 / 2} ({\hat{θ}}_{n} - θ_{0 n}) = - {(A_{n}^{- 1} B_{n} B_{n}^{'} {A_{n}^{- 1}}^{'})}^{- 1 / 2} A_{n}^{- 1} B_{n} [Σ_{n}^{- 1 / 2} ∣ D_{n} ∣ R_{n} (θ_{0 n})] + o_{p} (1) .

The claim that ${(A_{n}^{- 1} B_{n} B_{n}^{'} {A_{n}^{- 1}}^{'})}^{- 1 / 2} {∣ D_{n} ∣}^{1 / 2} ({\hat{θ}}_{n} - θ_{0 n}) \Rightarrow N (0, I_{k})$ now follows, e.g., from Corollary F4(b) in Pötscher and Prucha (1997).
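To see why the normalization produces an identity limiting covariance, the following sketch may help. The symbols V_n, C_n, ξ_n and G_n are shorthand introduced here for illustration; the sketch assumes that P is symmetric and that all square roots are symmetric:

```latex
\begin{aligned}
V_n &:= A_n^{-1} B_n B_n' A_n^{-1\prime}, \qquad
C_n := - V_n^{-1/2} A_n^{-1} B_n, \qquad
\xi_n := \Sigma_n^{-1/2} |D_n| R_n(\theta_{0n}), \\
C_n C_n' &= V_n^{-1/2} \bigl( A_n^{-1} B_n B_n' A_n^{-1\prime} \bigr) V_n^{-1/2}
          = V_n^{-1/2} V_n V_n^{-1/2} = I_k, \\
V_n^{-1/2} |D_n|^{1/2} (\hat\theta_n - \theta_{0n})
   &= C_n \xi_n + o_p(1), \qquad \xi_n \Rightarrow N(0, I_{p_q}), \\
V_n &= (G_n' P G_n)^{-1} \, G_n' P \bigl[ |D_n|^{-1} \Sigma_n \bigr] P G_n \,
       (G_n' P G_n)^{-1}, \qquad G_n := E \nabla_\theta R_n(\theta_{0n}).
\end{aligned}
```

Since C_nC_n′ = I_k for every n, the linear combination C_nξ_n has identity covariance in the limit. The last line shows that V_n is the familiar GMM sandwich form built from the definitions of A_n and B_n.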

Footnotes

¹

The space and metric are not restricted to physical space and distance.

²

For recent contributions see, e.g., Robinson (2010, 2009), Yu, de Jong and Lee (2008), Kelejian and Prucha (2010, 2007, 2004), Lee (2007, 2004), and Chen and Conley (2001).

³

These conditions for linear processes with general independent innovations were first established by Gorodetskii (1977). Doukhan and Guyon (1991) generalized them to random fields.

⁴

This important early contribution employs Bolthausen’s (1982) CLT for stationary α-mixing random fields on the regular lattice ℤ². However, the mixing and stationarity assumptions may not hold in many applications. The present paper relaxes these critical assumptions.

⁵

The proof of the proposition shows that ${| | g_{i, n} (Z_{i, n}) - E [g_{i, n} (Z_{i, n}) ∣ F_{i, n} (s)] | |}_{p} \leq 2 C {‖ Z_{i, n} - {\tilde{Z}}_{i, n}^{s} ‖}_{p}$ , which explains the 2 in the scaling factor for g_{i,n}(Z_{i,n}).

⁶

In an important contribution, Conley (1999) gives a first set of results regarding the asymptotic properties of GMM estimators under the assumption that the data process is stationary and α-mixing. Conley also maintains some high-level assumptions, such as first moment continuity of the moment function, which in turn immediately implies uniform convergence; see, e.g., Pötscher and Prucha (1989) for a discussion. Our results extend Conley (1999) in several important directions, as indicated above. We establish uniform convergence from primitive sufficient conditions via the generic uniform law of large numbers given in Jenish and Prucha (2009) and the law of large numbers given as Theorem 1 above.

⁷

To ensure that the derivatives are defined on the border of Θ, we assume in the following that the moment functions are defined on an open set containing Θ, and that the q_{i,n} and ∇_θ q_{i,n} are restrictions to Θ.

⁸

Pinkse et al. (2007) made an interesting contribution in this direction. Their catalogue of assumptions is at the level of Bernstein blocks. Without further sufficient conditions, verification of those assumptions would typically be challenging in practical situations.


Contributor Information

Nazgul Jenish, Email: nazgul.jenish@nyu.edu.

Ingmar R. Prucha, Email: prucha@econ.umd.edu.

References

1. Andrews DWK. Non-strong mixing autoregressive processes. Journal of Applied Probability. 1984;21:930–934.
2. Andrews DWK. Consistency in nonlinear econometric models: a generic uniform law of large numbers. Econometrica. 1987;55:1465–1471.
3. Andrews DWK. Laws of large numbers for dependent non-identically distributed variables. Econometric Theory. 1988;4:458–467.
4. Bierens HJ. Robust methods and asymptotic theory. Berlin: Springer Verlag; 1981.
5. Billingsley P. Convergence of probability measures. New York: John Wiley and Sons; 1968.
6. Bolthausen E. On the central limit theorem for stationary mixing random fields. Annals of Probability. 1982;10:1047–1050.
7. Bradley R. Some examples of mixing random fields. Rocky Mountain Journal of Mathematics. 1993;23(2):495–519.
8. Brockwell P, Davis R. Time series: theory and methods. Springer Verlag; 1991.
9. Bulinskii AV. Limit theorems under weak dependence conditions. Moscow University Press; 1989.
10. Bulinskii AV, Doukhan P. Vitesse de convergence dans le theorem de limite centrale pour des champs melangeants des hypotheses de moment faibles. C.R. Academie Sci Paris, Serie I. 1990:801–805.
11. Chen X, Conley T. A new semiparametric spatial model for panel time series. Journal of Econometrics. 2001;105:59–83.
12. Cliff A, Ord J. Spatial processes, models and applications. London: Pion; 1981.
13. Conley T. GMM estimation with cross sectional dependence. Journal of Econometrics. 1999;92:1–45.
14. Davidson J. A central limit theorem for globally nonstationary near-epoch dependent functions of mixing processes. Econometric Theory. 1992;8:313–329.
15. Davidson J. An L1 convergence theorem for heterogenous mixingale arrays with trending moments. Statistics and Probability Letters. 1993;8:313–329.
16. Davidson J. Stochastic limit theory. Oxford University Press; 1994.
17. De Jong RM. Central limit theorems for dependent heterogeneous random variables. Econometric Theory. 1997;13:353–367.
18. Dell M. The persistent effects of Peru’s mining mita. Econometrica. 2010;78:1863–1903.
19. Dobrushin R. The description of a random field by its conditional distribution and its regularity condition. Theory of Probability and its Applications. 1968;13:197–227.
20. Doukhan P, Guyon X. Mixing for linear random fields. C.R. Academie Sciences Paris, Serie 1. 1991;313:465–470.
21. Doukhan P, Lang G. Rates in the empirical central limit theorem for stationary weakly dependent random fields. Statistical Inference for Stochastic Processes. 2002;5:199–228.
22. Doukhan P, Louhichi S. A new weak dependence condition and applications to moment inequalities. Stochastic Processes and Their Applications. 1999;84:313–342.
23. Fogli A, Veldkamp L. Nature or nurture? Learning and the geography of female labor force participation. Econometrica. 2011;79:1103–1138.
24. Gallant AR, White H. A unified theory of estimation and inference for nonlinear dynamic models. New York: Basil Blackwell; 1988.
25. Gorodetskii VV. On the strong mixing property for linear sequences. Theory of Probability and Applications. 1977;22:411–413.
26. Hallin M, Lu Z, Tran LT. Density estimation for spatial linear processes. Bernoulli. 2001;7:657–668.
27. Hallin M, Lu Z, Tran LT. Kernel density estimation for spatial processes: the L1 theory. Journal of Multivariate Analysis. 2004;88:61–75.
28. Ibragimov IA. Some limit theorems for stationary processes. Theory of Probability and Applications. 1962;7:349–382.
29. Ibragimov IA, Linnik YV. Independent and stationary sequences of random variables. Wolters-Noordhoff; Groningen: 1971.
30. Jenish N, Prucha IR. Central limit theorems and uniform laws of large numbers for arrays of random fields. Journal of Econometrics. 2009;150:86–98. doi: 10.1016/j.jeconom.2009.02.009.
31. Jenish N, Prucha IR. On spatial processes and asymptotic inference under near epoch dependence. Working paper, version January 2011.
32. Kelejian HH, Prucha IR. Estimation of simultaneous systems of spatially interrelated cross sectional equations. Journal of Econometrics. 2004;118:27–50.
33. Kelejian HH, Prucha IR. HAC estimation in a spatial framework. Journal of Econometrics. 2007;140:131–154.
34. Kelejian HH, Prucha IR. Specification and estimation of spatial autoregressive models with autoregressive and heteroskedastic disturbances. Journal of Econometrics. 2010;157:53–67. doi: 10.1016/j.jeconom.2009.10.025.
35. Lee LF. Asymptotic distributions of quasi-maximum likelihood estimators for spatial autoregressive models. Econometrica. 2004;72:1899–1925.
36. Lee LF. GMM and 2SLS for mixed regressive, spatial autoregressive models. Journal of Econometrics. 2007;137:489–514.
37. Lu Z. Asymptotic normality of kernel density estimators under dependence. Annals of Institute of Statistical Mathematics. 2001;53:447–468.
38. Lu Z, Linton O. Local linear fitting under near epoch dependence. Econometric Theory. 2007;23:37–70.
39. McLeish DL. Invariance principles for dependent variables. Z. Wahrsch. verw. Gebiete. 1975;32:165–178.
40. Nahapetian B. An approach to central limit theorems for dependent random variables. Theory of Probability and its Applications. 1987;32:589–594.
41. Pinkse J, Shen L, Slade ME. A central limit theorem for endogenous locations and complex spatial interactions. Journal of Econometrics. 2007;140:215–225.
42. Pinkse J, Slade ME, Brett C. Spatial price competition: a semiparametric approach. Econometrica. 2002;70:1111–1153.
43. Pötscher BM, Prucha IR. A uniform law of large numbers for dependent and heterogeneous data processes. Econometrica. 1989;57:675–683.
44. Pötscher BM, Prucha IR. Generic uniform convergence and equicontinuity concepts for random functions. Journal of Econometrics. 1994;60:23–63.
45. Pötscher BM, Prucha IR. Dynamic nonlinear econometric models. Springer-Verlag; New York: 1997.
46. Robinson PM. Large-sample inference on spatial dependence. The Econometrics Journal, Tenth Anniversary Special Issue. 2009;12:S68–S82.
47. Robinson PM. Efficient estimation of the semiparametric spatial autoregressive model. Journal of Econometrics. 2010;157:6–17.
48. Takahata H. On the rates in the central limit theorem for weakly dependent random fields. Z. Wahrsch. verw. Gebiete. 1983;64:445–456.
49. Wooldridge J. Asymptotic properties of econometric estimators. PhD Dissertation. University of California, San Diego: Department of Economics; 1986.
50. Yu J, de Jong R, Lee LF. Quasi-maximum likelihood estimators for spatial dynamic panel data with fixed effects when both N and T are large. Journal of Econometrics. 2008;146:118–134.
