Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 Sep 1.
Published in final edited form as: J Econom. 2012 Sep;170(1):178–190. doi: 10.1016/j.jeconom.2012.05.022

On Spatial Processes and Asymptotic Inference under Near-Epoch Dependence

Nazgul Jenish 1, Ingmar R Prucha 2
PMCID: PMC3441186  NIHMSID: NIHMS390113  PMID: 22984323

Abstract

The development of a general inferential theory for nonlinear models with cross-sectionally or spatially dependent data has been hampered by a lack of appropriate limit theorems. To facilitate a general asymptotic inference theory relevant to economic applications, this paper first extends the notion of near-epoch dependent (NED) processes used in the time series literature to random fields. The class of processes that is NED on, say, an α-mixing process, is shown to be closed under infinite transformations, and thus accommodates models with spatial dynamics. This would generally not be the case for the smaller class of α-mixing processes. The paper then derives a central limit theorem and law of large numbers for NED random fields. These limit theorems allow for fairly general forms of heterogeneity including asymptotically unbounded moments, and accommodate arrays of random fields on unevenly spaced lattices. The limit theorems are employed to establish consistency and asymptotic normality of GMM estimators. These results provide a basis for inference in a wide range of models with spatial dependence.

Keywords: Random fields, near-epoch dependent processes, central limit theorem, law of large numbers, GMM estimator

1 Introduction

Models with spatially dependent data have recently attracted considerable attention in various fields of economics including labor and public economics, IO, political economy, international and urban economics. In these models, strategic interaction, neighborhood effects, shared resources and common shocks lead to interdependences in the dependent and/or explanatory variables, with the variables indexed by their location in some socioeconomic space.1 Insofar as these locations are deterministic, observations can be modeled as a realization of a dependent heterogenous process indexed by a point in ℝd, d > 1, i.e., as a random field.

The aim of this paper is to define a class of random fields that is sufficiently general to accommodate many applications of interest, and to establish corresponding limit theorems that can be used for asymptotic inference. In particular, we apply these limit theorems to prove consistency and asymptotic normality of generalized method of moments (GMM) estimators for a general class of nonlinear spatial models.

To date, linear spatial autoregressive models, also known as Cliff Ord (1981) type models2, have arguably been one of the most popular approaches to modeling spatial dependence in the econometrics literature. The asymptotic theory in these models is facilitated, loosely speaking, by imposing specific structural conditions on the data generating process, and by exploiting some underlying independence assumptions. Another popular approach to model dependence is through mixing conditions. Various mixing concepts developed for time series processes have been extended to random fields. However, the respective limit theorems for random fields have not been sufficiently general to accommodate many of the processes encountered in economics. This hampered the development of a general asymptotic inference theory for nonlinear models with cross-sectional dependence. Towards filling this gap, Jenish and Prucha (2009) have recently introduced a set of limit theorems (CLT, ULLN, LLN) for α-mixing random fields on unevenly spaced lattices that allow for nonstationary processes with trending moments.

However, some important classes of dependent processes are not necessarily mixing, including linear autoregressive (AR) and infinite moving average (MA(∞)) processes. Sufficient conditions for the α-mixing property of linear processes3 are fairly stringent, and involve three types of restrictions (i) smoothness of the density functions of the innovations, (ii) sufficiently fast rates of decay of the coefficients, and (iii) invertibility of the linear process. There are examples demonstrating that the mixing property can fail for any of these reasons. In particular, Andrews (1984) showed that a simple AR(1) process of independent Bernoulli innovations is not α-mixing. Similar examples have been constructed for random fields, see, e.g. Doukhan and Lang (2002). Thus, mixing may break down in the case of discrete innovations. Further, Gorodetskii (1977) showed that the strong mixing property may fail even in the case of continuously distributed (normal) innovations when the coefficients of the linear process do not decline sufficiently fast. As these examples suggest, the mixing property is generally not preserved under infinite transformations of mixing processes. Yet stochastic processes generated as functionals of some underlying process arise in a wide range of models, with autoregressive models being the leading example. Thus, it is important to develop an asymptotic theory for a generalized class of random fields that is “closed with respect to infinite transformations”.

To tackle this problem, the paper first extends the concept of near-epoch dependent (NED) processes used in the time series literature to spatial processes. The notion dates back to Ibragimov (1962), and Billingsley (1968). The NED concept, or variants thereof, have been used extensively in the time series literature by McLeish (1975), Bierens (1981), Wooldridge (1986), Gallant and White (1988), Andrews (1988), Pötscher and Prucha (1997), Davidson (1992, 1993, 1994) and de Jong (1997), among others. Doukhan and Louhichi (1999) introduced an alternative class of dependent processes called “θ-weakly dependent”.

In deriving our limit theorems we then only assume that the process is NED on a mixing input process, i.e., that the process can be approximated by a mixing input process in the NED sense, rather than to assume that the process itself is mixing. Of course, every mixing process is trivially also NED on itself, and thus the class of processes that are NED on a mixing input process includes the class of mixing processes. There are several advantages to working with the enlarged class of process that are NED on a mixing process. First, linear processes with discrete innovations, which results in the process to not satisfy the strong mixing property, will still be NED on the mixing input process of innovations, provided the latter are mixing. We note that, in particular, the NED property holds in both examples of Andrews (1984) and Gorodetskii (1977), by Proposition 1 of this paper. Second, as shown in this paper, nonlinear MA(∞) random fields are also NED under some mild conditions, while such conditions are not readily available for mixing. Third, the NED property is often easy to verify. For instance, the sufficient conditions for MA(∞) random fields involve only smoothness conditions on the functional form and absolute summability of the coefficients, which are not difficult to check, while verification of mixing is usually more difficult.

The paper derives a CLT and an LLN for spatial processes that are near epoch dependent on an α-mixing input process. These limit theorems allow for fairly general forms of heterogeneity including asymptotically unbounded moments, and accommodate arrays of random fields on unevenly spaced lattices. The LLN can be combined with the generic ULLN in Jenish and Prucha (2009) to obtain an ULLN for NED spatial processes. In the time series literature, CLTs for NED processes were derived by Wooldridge (1986), Davidson (1992, 1993), and de Jong (1997). Interestingly, our CLT contains as a special case the CLT of Wooldridge (1986), Theorem 3.13 and Corollary 4.4.

In addition, we give conditions under which the NED property is preserved under transformations. These results play a key role in verifying the NED property in applications. Thus, the NED property is compatible with considerable heterogeneity and dependence, invariant under transformations, and leads to a CLT and LLN under fairly general conditions. All these features make it a convenient tool for modeling spatial dependence.

As an application, we establish consistency and asymptotic normality of spatial GMM estimators. These results provide a fundamental basis for constructing confidence intervals and testing hypothesis for GMM estimators in nonlinear spatial models. Our results also expand on Conley (1999), who established the asymptotic properties of GMM estimators assuming that the data generating process and the moment functions are stationary and α-mixing.4

The rest of the paper is organized as follows. Section 2 introduces the concept of NED spatial processes and gives of examples random fields satisfying this condition. Section 3 contains the LLN and CLT for NED spatial processes. Section 4 establishes the asymptotic properties of spatial GMM estimators. All proofs are relegated to the appendices.

2 NED Spatial Processes

Let D ⊂ ℝd, d ≥ 1, be a lattice of (possibly) unevenly placed locations in ℝd, and let Z = {Zi,n, iDn, n ≥ 1} and ε = {εi,n, iTn, n ≥ 1} be triangular arrays of random fields defined on a probability space (Ω, Inline graphic, P) with DnTnD. The space ℝd is equipped with the metric ρ(i, j) = max1≤ld |jlil|, where il is the l-th component of i. The distance between any subsets U, VD is defined as ρ(U, V ) = inf {ρ(i, j): iU and jV }. Furthermore, let |U| denote the cardinality of a finite subset UD.

The random variables Zi,n and εi,n are possibly vector-valued taking their values in ℝpz and ℝpε, respectively. We assume that ℝpz and ℝpε are normed metric spaces equipped with the Euclidean norm, which we denote (in an obvious misuse of notation) as |.|. For any random vector Y, let ||Y||p = [E |Y|p]1/p, p ≥ 1, denote its Lp-norm. Finally, let Inline graphic(s) = σ(εj,n; jTn: ρ(i, j) ≤ s) be the σ-field generated by the random vectors εj,n located in the s-neighborhood of location i.

Throughout the paper, we maintain these notational conventions and the following assumption concerning D.

Assumption 1

The lattice D ⊂ ℝd, d ≥ 1, is infinitely countable. All elements in D are located at distances of at least ρ0 > 0 from each other, i.e., for all i, jD: ρ(i, j) ≥ ρ0; w.l.o.g. we assume that ρ0 > 1.

The assumption of a minimum distance has also been used by Conley (1999) and Jenish and Prucha (2009). It ensures the growth of the sample size as the sample regions Dn and Tn expand. The setup is thus geared towards what is referred to in the spatial literature as increasing domain asymptotics.

We now introduce the notion of near-epoch dependent (NED) random fields.

Definition 1

Let Z = {Zi,n, iDn, n ≥ 1} be a random field with ||Zi,n||p < ∞, p ≥ 1, let ε = {εi,n, iTn, n ≥ 1} be a random field, where |Tn| → ∞ as n → ∞, and let d = {di,n, iDn, n ≥ 1} be an array of finite positive constants. Then the random field Z is said to be Lp(d)-near-epoch dependent on the random field ε if

||Zi,n-E(Zi,nFi,n(s))||pdi,nψ(s) (1)

for some sequence ψ(s) ≥ 0 with lims→∞ ψ(s) = 0. The ψ(s), which are w.l.o.g. assumed to be non-increasing, are called the NED coefficients, and the di,n are called NED scaling factors. Z is said to be Lp-NED on ε of size −λ if ψ(s) = O(sμ) for some μ > λ > 0. Furthermore, if supn supiDndi,n < ∞, then Z is said to be uniformly Lp-NED on ε.

Recall that DnTn. Typically, Tn will be an infinite subset of D, and often Tn = D. However, as discussed in more detail in Jenish and Prucha (2011), to cover Cliff Ord type processes Tn is allowed to depend on n and to be finite provided that it increases in size with n.

The role of the scaling factors {di,n} is to allow for the the possibility of “unbounded moments”, i.e., supn supiDndi,n = ∞. Unbounded moments may reflect trends in the moments in certain directions, in which case we may also use, as in the time series literature, the terminology of “trending moments”. The NED property is thus compatible with a considerable amount of heterogeneity. In establishing limit theorems for NED processes, we will have to impose restrictions on the scaling factors di,n. In this respect, observe that

||Zi,n-E(Zi,nFi,n(s))||p||Zi,n||p+||E(Zi,nFi,n(s))||p2||Zi,n||p

by the Minkowski and the conditional Jensen inequalities. Given this, we may choose di,n ≤ 2 ||Zi,n||p, and consequently w.l.o.g. 0 ≤ ψ(s) ≤ 1; see, e.g., Davidson (1994), p. 262, for a corresponding discussion within the context of time series processes. Note that by the Lyapunov inequality, if Zi,n is Lp-NED, then it is also Lq-NED with the same coefficients {di,n} and {ψ(s)} for any qp.

Our definition of NED for spatial processes is adapted from the definition of NED for time series processes. In the time series literature, the NED concept first appeared in the works of Ibragimov (1962) and Billingsley (1968), although they did not use the present term. The concept of time series NED processes was later formalized by McLeish (1975), Wooldridge (1986), Gallant and White (1988). These authors considered only L2-NED processes. Andrews (1988) generalized it to Lp-NED processes for p ≥ 1. Davidson (1992, 1993, 1994) and de Jong (1997) further extended it to allow for trending time series processes.

We note that aside from the NED condition, a number of different notions of dependence have been used in the time series literature. For instance, Pötscher and Prucha (1997) considered a more general dependence condition (called Lp-approximability). They use more general approximating functions than the conditional mean in Definition 1 to describe the dependence structure of a process. Similar conditions are also used by Lu (2001), Lu and Linton (2007), among others. These conditions allow for more general choices of approximating functions than the conditional expectation. One of the main results in this paper is a central limit theorem, which requires the existence of second moments. Since for p = 2 the conditional mean is the best approximator in the sense of minimizing the mean squared error, our use the conditional mean as an approximating function is not restrictive. Still, in particular applications it may be convenient to work with some other Inline graphic(s)-measurable approximating function, say hi,s,n. Of course, if one can show that ||Zi,nhi,s,n||2di,nψ(s), then this also established (1) for p = 2.

In the spatial literature, NED processes were considered in the special context of density estimation by Hallin, Lu and Tran (2001), and Hallin, Lu and Tran (2004), albeit they did not use this term. The first paper proves asymptotic normality of the kernel density estimator for linear random fields, the second paper shows L1-consistency of the kernel density estimator for nonlinear functionals of i.i.d. random fields. We note that neither of these papers establishes a central limit theorem for nonlinear NED random fields.

As discussed earlier, an important motivation for considering NED processes is that mixing is generally not preserved under transformations involving infinitely many arguments. However, as illustrated below, the output process is generated as a function of infinitely many input variables in a wide range of models. In those situations, mixing of the input process does not necessarily carry over to the output process, and thus limit theorems for averages of the output process cannot simply be established from limit theorems for mixing processes. Nevertheless, as with time series processes, we show below that limit theorems can be extended to spatial processes that are NED on a mixing input process, provided the approximation error declines “sufficiently fast” as the conditioning set of input variables expands.

We now give examples of NED spatial processes. First, spatial Cliff and Ord (1981) type autoregressive processes are NED under some weak conditions on the spatial weight coefficients. These models have been used widely in applications. For recent contributions on estimation strategies for these models see, e.g., Robinson (2010, 2009), Kelejian and Prucha (2010, 2007, 2004), and Lee (2007, 2004). The second example is linear infinite moving average (MA(∞)) random fields. In preparation of the example, we first give a more general result, which shows that the NED property is satisfied by random fields generated from nonlinear Lipschitz type functionals of some ℝpε-valued random field ε = {εin, iD}:

Zin=Hin((εjn)jD) (2)

where Hin: Inline graphic → ℝpz, Inline graphic ⊆ ℝpε, are measurable functions satisfying for all e, e′ ∈ Inline graphic

Hin(e)-Hin(e)jDwijnej-ejwithwijn0 (3)

with

limssupn,iDjD:ρ(i,j)>swijn=0,and||ε||2=supn,iD||εin||2<. (4)

Proposition 1

Under conditions (3)–(4),Z = {Zin, iDn, n ≥ 1} given by (2) is well-defined, and is L2-NED on ε with ψ(s) = ||ε||2 supn,iDΣjD:ρ(i,j)>s wijn.

We now use the above proposition to establish the NED property for linear MA(∞) random fields. Linear MA(∞) random fields may arise as solutions of autoregressive models. For any k ∈ ℕ and fixed vectors vl ∈ ℤd, l = 1, …, k, consider the following autoregressive random field:

Zi=l=1kalZi-vl+εi (5)

where a=l=1kal<1, {εi, i ∈ ℤd} are i.i.d. with ||εi||q < ∞, q ≥ 1. Model (5) is also known as a k-nearest-neighbor or interaction model with the radius of interaction r = max1≤lk |vl|.

As shown by Doukhan and Lang (2002), there exists a stationary solution of (5) given by:

Zi=m=0m1++mk=mm!m1!mk!a1m1akmkεi-(m1v1++mkvk)

with mi ∈ ℕ. Thus, (5) can be represented as a linear random field Zi = Σj∈ℤdwjεij, with

wj=mj/rV(j,m)m!m1!mk!a1m1ammk,

where V (j, m) = {(m1, …, mk) ∈ ℕk: m1 + … + mk = m, m1v1 + … + mkvk = j}, observing that V (j, m) is empty if m < |j|/r. Observing further that

m1++mk=mm!m1!mk!a1m1akmk=am

the coefficients wj can be bounded as

wjmj/rm1++mk=mm!m1!mk!a1m1akmk=mj/ram=(1-a)-1aj/r.

Rewriting the process Zi as Zi = Σj∈ℤdwijεj with wij = wij it follows from Proposition 1 that the random field (5) is Lp-NED on ε with the NED coefficients ψ(s) = ||ε||q(1 − a)−1(1 − a1/r)−1as/r.

The asymptotic theory of AR and MA(∞), satisfying the NED condition, can be useful in a variety of empirical applications where the data are cross sectionally correlated. For instance, Pinkse et al.’s (2002) study of spatial price competition among firms that produce differentiated products in one example of an empirical application with cross sectional dependence. They model the price charged by firm at location i in the geographic (or product characteristic) space as a linear spatial autoregressive process. Another example is Fogli and Veldkamp (2011) who investigate spatial correlation in the female labor force participation (LFP). In particular, they consider a spatial autoregression of county i’s LFP rate on LFP rates of its neighbors. Dell (2010) examines the impact of mita, the forced mining labor system in colonial Peru and Bolivia, on household consumption and child growth across different regions. Although her model is not spatially autoregressive, the regressors and errors exhibit persistent spatial correlation, which can be modeled as a spatial NED process.

As discussed, an attractive feature of NED processes is that the NED property is preserved under transformations. Econometric estimators are usually defined either explicitly as functions of some underlying data generating processes or implicitly as optimizers of a function of the data generating process. Thus, if the data generating process is NED on some input process, the question arises under what conditions functions of random fields are also NED on the same input process.

Various conditions that ensure preservation of the NED property under transformations have been established in the time series literature by Gallant and White (1988), and Davidson (1994). In fact, these results extend to random fields. In particular, the NED property is preserved under summation and multiplication, and carries over from a random vector to its components and vice versa. For future reference, we now state some results for generalized classes of nonlinear functions. Their proofs are analogous to those in the time series literature, and therefore omitted.

Consider transformations of Zi,n given by a family of functions gi,n: ℝpz → ℝ. The functions gi,n are assumed Borel-measurable for all n and iD. They are furthermore assumed to satisfy the following Lipschitz condition: For all (z, z) ∈ ℝpz × ℝpz and all iDn and n ≥ 1:

gi,n(z)-gi,n(z)Bi,n(z,z)z-z (6)

where Bi,n(z, z): ℝpz × ℝpz → ℝ+ is Borel-measurable. Of course, this condition would be devoid of meaning without further restrictions on Bi,n(z, z), which are given in the next propositions.

Proposition 2

Suppose gi,n(·) satisfies Lipschitz condition (6) with |Bi,n(z, z)| ≤ C < ∞, for all (z, z) ∈ ℝpz × ℝpz and all i and n. If for p ≥ 1 the {Zi,n} are Lp-NED of size −λ on {εi,n} with scaling factors {di,n}, then gi,n(Zi,n) is also Lp-NED of size −λ on {εi,n} with scaling factors {2Cdi,n}.5

Proposition 3

Suppose gi,n(·) satisfies Lipschitz condition (6) with

supsBi,n(s)2<andsupsBi,n(s)|Zi,n-Zi,ns|r< (7)

for some r > 2, where Bi,n(s)=Bi,n(Zi,n,Zi,ns) and Zi,ns=E[Zi,n|Fi,n(s)]. If ||gi,n(Zi,n)||2 < ∞ and Zi,n is L2-NED of size −λ on {εi,n} with scaling factors {di,n}, then gi,n(Zi,n) is L2-NED of size −λ(r − 2)/(2r − 2) on {εi,n} with scaling factors

di,n=di,n(r-2)/(2r-2)supsBi,n(s)2(r-2)/(2r-2)Bi,n(s)|Zi,n-Zi,nsrr/(2r-2).

Thus, the NED property is hereditary under reasonably weak conditions. These conditions facilitate verification of the NED property in practical application. In particular, we will use them in the proof of asymptotic normality of spatial GMM estimators in Section 4.

3 Limit Theorems

3.1 Law of Large Numbers

In this section, we present a LLN for real valued random fields Z = {Zi,n, iDn, n ≥ 1} that are L1-NED on some vector-valued α-mixing random field ε = {εi,n, iTn, n ≥ 1} with the NED coefficients {ψ(s)} and scaling factors {di,n}, where DnTnD and the lattice D satisfies Assumption 1. For ease of reference, we state below the definition of the α-mixing coefficients employed in the paper.

Definition 2

Let Inline graphic and Inline graphic be two σ-algebras of Inline graphic, and let

α(A,B)=sup(P(AB)-P(A)P(B),AA,BB),

For UDn and VDn, let σn(U) = σ(εi,n; iU) and αn(U, V) = α(σn(U), σn(V)). Then, the α-mixing coefficients for the random field ε are defined as:

α¯(u,v,r)=supnsupU,V(αn(U,V),Uu,Vv,ρ(U,V)r).

Dobrushin (1968) showed that weak dependence conditions based on the above mixing coefficients are satisfied by broad classes of random fields including Markov fields. In contrast to standard mixing numbers for time-series processes, the mixing coefficients for random fields depend not only on the distance between two datasets but also their sizes. To explicitly account for such dependence, it is furthermore assumed that

α¯(u,v,r)ϕ(u,v)α^(r) (8)

where the function ϕ(u, v) is nondecreasing in each argument, and α̂(r) → 0 as r → ∞. The idea is to account separately for the two different aspects of dependence: (i) decay of dependence with the distance, and (ii) accumulation of dependence as the sample region expands. The two common choices of ϕ(u, v) in the random fields literature are

ϕ(u,v)=(u+v)τ,τ0, (9)
ϕ(u,v)=min{u,v}. (10)

The above mixing conditions have been used extensively in the random fields literature including Takahata (1983), Nahapetian (1987), Bulinskii (1989), Bulinskii and Doukhan (1990), and Bradley (1993). They are satisfied by fairly large classes of random fields. Bradley (1993) provides examples of random fields satisfying conditions (8)–(9) with u = v and τ = 1. Furthermore, Bulinskii (1989) constructs moving average random fields satisfying the same conditions with τ = 1 for any given decay rate of coefficients α̂(r). Clearly, standard mixing coefficients in the time series literature are covered by conditions (8)–(9) when τ = 0.

Following the literature, we employ the above mixing conditions for the input random field, and impose further restrictions on the decay rates of the mixing coefficients.

Assumption 2

  1. There exist nonrandom positive constants {ci,n, iDn, n ≥ 1} such that Zi,n/ci,n is uniformly Lp-bounded for some p > 1, i.e., supnsupiDnE|Zi,n/ci,n|p < ∞

  2. The α-mixing coefficients of the input field ε satisfy (8) for some function ϕ(u, v) which is nondecreasing in each argument, and some α̂(r) such that r=1rd-1α^(r)<.

Theorem 1

Let {Dn} be a sequence of arbitrary finite subsets of D such that |Dn| → ∞ as n → ∞, where D ⊂ ℝd, d ≥ 1 is as in Assumption 1, and let Tn be a sequence of subsets of D such that DnTn. Suppose further that Z = {Zi,n, iDn, n ≥ 1} is L1-NED on ε = {εi,n, iTn, n ≥ 1} with the scaling factors di,n. If Z and ε satisfy Assumption 2, then

1MnDniDn(Zi,n-EZi,n)L10,

where Mn = maxiDnmax(ci,n, di,n).

This LLN can be used to establish uniform convergence of random functions by combining it with the generic ULLN given in Jenish and Prucha (2009), which transforms pointwise LLNs (at a given parameter value) into ULLNs.

Assumption 2(a) is a standard moment condition employed in weak LLNs for dependent processes. It requires existence of moments of order slightly greater than 1. As in Theorem 2 below, ci,n and di,n are the scaling factors that reflect the magnitudes of potentially trending moments. The case of variables with uniformly bounded moments is covered by setting ci,n = di,n = 1. The LLN does not require any restrictions on the NED coefficients. In the time series literature, weak LLNs for NED processes have been obtained by Andrews (1988) and Davidson (1993), among others. Andrews (1988) derives an L1-law for triangular arrays of L1-mixingales. He then shows that NED processes are L1-mixingales, and hence, satisfy his LLN. Davidson (1993) extends the latter result to processes with trending moments.

3.2 Central Limit Theorem

In this section, we present a CLT for real valued random fields Z = {Zi,n, iDn, n ≥ 1} that are L2-NED on some vector-valued α-mixing random field ε = {εi,n, iTn, n ≥ 1} with the NED coefficients {ψ(s)} and scaling factors {di,n}, where DnTnD and the lattice D satisfies Assumption 1. In the following, we will use the following notation:

Sn=iDnZi,n;σn2=var(Sn).

The CLT relies on the following assumptions.

Assumption 3

The α-mixing coefficients of ε satisfy (8) and (9) for some τ ≥ 0 and α̂(r), such that for some δ > 0

r=1rd(τ+1)-1α^δ2(2+δ)(r)<, (11)

where τ* = δτ/(2 + δ).

Assumption 3 restricts the dependence structure of the input process ε. Note that if τ = 1 this assumption also covers the case where ϕ(u, v) is given by (10).

Assumption 4

  1. (Uniform L2+δ integrability) There exists an array of positive constants {ci,n} such that
    limksupnsupiDnE[Zi,n/ci,n2+δ1(Zi,n/ci,n>k)]=0,

    where 1 (·) is the indicator function and δ > 0 is as in Assumption 3.

  2. infnDn-1Mn-2σn2>0, where Mn = maxiDnci,n.

  3. NED coefficients satisfy r=1rd-1ψ(r)<.

  4. NED scaling factors satisfy supnsupiDnci,n-1di,nC<.

Assumptions 4(a),(b) are standard in the limit theory of mixing processes, e.g., Wooldridge (1986), Davidson (1992), de Jong (1997), and Jenish and Prucha (2009). Assumption 4(a) is satisfied if Zi,n/ci,n are uniformly Lp-bounded for p > 2+δ, i.e., supn,iDn||Zi,n/ci,n||p < ∞.

Assumption 4(b) is an asymptotic negligibility condition that ensures that no single summand influences disproportionately the entire sum. In the case of uniformly L2+δ-bounded fields, 4(b) reduces to liminfnDn-1σn2>0, as is, e.g., maintained in Bolthausen (1982). Assumption 4(c) controls the size of the NED coefficients which measure the error in the approximation of Zi,n by ε. Intuitively, the approximation errors have to decline sufficiently fast with each successive approximation. Assumption 4(c) is satisfied if ψ(r) = O(rdγ) for some γ > 0, i.e., ψ(r) is of size −d. Finally, Assumption 4(d) is a technical condition, which ensures that the order of magnitude of the NED scaling factors does not exceed that of the 2 + δ moments. For instance, suppose the constant ci,n can be chosen as ci,n = ||Zi,n||2+δ, and the NED scaling numbers as di,n ≤ 2 ||Zi,n||2. Then Assumption 4(d) is satisfied, since by Lyapunov’s inequality, ||Zi,n||2 ≤ ||Zi,n||2+δ. This condition has also been used by de Jong (1997) and Davidson (1992).

Theorem 2

Suppose {Dn} is a sequence of finite subsets such that |Dn| → ∞ as n → ∞ and {Tn} is a sequence of subsets such that DnTnD of the lattice D satisfying Assumption 1. Let Z = {Zi,n, iDn, n ≥ 1} be a real valued zero-mean random field that is L2-NED on a vector-valued α-mixing random field ε = {εi,n, iTn, n ≥ 1}. Suppose Assumptions 3 and 4 hold, then

σn-1SnN(0,1).

Theorem 2 contains the CLT for α-mixing random fields given in Jenish and Prucha (2009) as a special case. It also contains as a special case the CLT for time series NED processes of Wooldridge (1986), see Theorem 3.13 and Corollary 4.4.

Theorem 2 can be easily extended to vector-valued fields using the standard Cramér-Wold device.

Corollary 1

Suppose {Dn} is a sequence of finite subsets such that |Dn| → ∞ as n → ∞ and {Tn} is a sequence of subsets such that DnTnD of the lattice D satisfying Assumption 1. Let Z = {Zi,n, iDn, n ≥ 1} with Zi,n ∈ ℝk be a zero-mean random field that is L2-NED on a vector-valued α-mixing random field ε = {εi,n, iTn, n ≥ 1}. Suppose Assumptions 3 and 4 hold with |Zi,n| denoting the Euclidean norm of Zi,n and σn2 replaced by λminn), where Σn = Var(Sn) and λmin(.) is the smallest eigenvalue, then

n-1/2SnN(0,Ik).

Furthermore, supn |Dn|−1 λmaxn) < ∞, where λmax(.) denotes the largest eigenvalue.

4 Large Sample Properties of Spatial GMM Estimators

We now apply the limit theorems of the previous section to establish the large sample properties of spatial GMM estimators under a reasonably general set of assumptions that should cover a wide range of empirical problems. More specifically, our consistency and asymptotic normality results (i) maintain only that the spatial data process is NED on an α-mixing basis process to accommodate spatial lags in the data process as discussed above, (ii) allow for unevenly placed locations, and (iii) allow for the data process to be non-stationary, which will frequently be the case in empirical applications. We also give our results under a set of primitive sufficient conditions for easier interpretation by the applied researcher.6

We continue with the basic set-up of Section 2. Consider the moment function qi,n: ℝpz × Θ → ℝpq, where Θ denotes the parameter space, and let θ0n ∈ Θ denote the parameter vector of interest (which we allow to depend on n for reasons of generality). Suppose the following moment conditions hold

Eqi,n(Zi,n,θ0n)=0. (12)

Then, the corresponding spatial GMM estimator is defined as

θ^n=argminθΘQn(ω,θ), (13)

where Qn: Ω × Θ → ℝ,

Qn(ω,θ)=Rn(θ)PnRn(θ),

with Rn(θ) = |Dn|−1ΣiDnqi,n(Zi,n, θ), and where the Pn are some positive semidefinite weighting matrices. To show consistency, consider the following non-stochastic analogue of Qn, say

Q¯n(θ)=[ERn(θ)]P[ERn(θ)], (14)

where P denotes the probability limit of Pn. Given the moment condition (12), E [Rn(θ0n)] = 0, the functions n are minimized at θ0n. In proving consistency, we follow the classical approach; see, e.g., Gallant and White (1988) or Pötscher and Prucha (1997) for more recent expositions. In particular, given identifiable uniqueness of θ0n we establish, loosely speaking, convergence of the minimizers θ̂n to the minimizers θ0n by establishing convergence of the objective function Qn(ω, θ) to its non-stochastic analogue n(θ) uniformly over the parameter space.

Throughout the sequel, we maintain the following assumptions regarding the parameter space, the GMM objective function and the unknown parameters θ0n.

Assumption 5

  1. The parameter space Θ is a compact metric space with metric ν.

  2. The functions qi,n: ℝpz × Θ → ℝpq are Inline graphic/ Inline graphic-measurable for each θ ∈ Θ, and continuous on Θ for each z ∈ ℝpz.

  3. The elements of the pq × pq real matrices Pn are Inline graphic-measurable, and Pn is positive semidefinite. Furthermore P = p lim Pn exists and P is positive definite.

  4. The minimizers θ0n are identifiably unique in the sense that every ε > 0, lim infn→∞ [infθ∈Θ:ν(θ, θ0n)≥ε [ERn(θ)]′[ERn(θ)]] > 0.

Compactness of the parameter space as maintained in Assumption 5(a) is typical for the GMM literature. Assumptions 5(b),(c) imply that Qn(·, θ) is measurable for all θ ∈ Θ, and Qn(ω, ·) is continuous on Θ. Given those assumptions the existence of measurable functions θ̂n that solves (13) follows, e.g., from Lemma A3 of Pötscher and Prucha (1997).

Since P is positive definite, it is readily seen that Assumption 5(d) implies that for every ε > 0:

liminfn[infθΘ:ν(θ,θ0n)εQ¯n(θ)-Q¯n(θ0n)]>0,

observing that n(θ0n) = 0. Thus, under Assumption 5(d) the minimizers θ0n are identifiably unique; compare, e.g., Gallant and White (1988), p.19. For interpretation, consider the important special case where θ0n = θ0, ERn(θ) does not depend on n, and is continuous in θ. In this case, identifiable uniqueness of θ0 is equivalent to the assumption that θ0 is the unique solution of the moment conditions, i.e., E [Rn(θ)] ≠ 0 for all θθ0; compare, e.g., Pötscher and Prucha (1997), p. 16.

4.1 Consistency

Given the minimizers θ0n are identifiably unique, θ̂n is a consistent estimator for θ0n if Qn converges uniformly to n, i.e., if supθΘQn(ω,θ)-Q¯n(θ)p0 as n → ∞; this follows immediately from, e.g, Pötscher and Prucha (1997), Lemma 3.1.

We now proceed by giving a set of primitive domination and Lipschitz type conditions for the moment functions that ensure uniform convergence of Qn to n. The conditions are in line with those maintained in the general literature on M-estimation, e.g., Andrews (1987), Gallant and White (1988), and Pötscher and Prucha (1989,1994).

Definition 3

Let fi,n: ℝpz × Θ → ℝpq be Inline graphic/ Inline graphic-measurable functions for each θ ∈ Θ, then:

  1. The random functions fi,n(Zi,n; θ) are said to be p-dominated on Θ for some p > 1 if supn supiDnE supθ∈Θ|fi,n(Zi,n; θ)|p < ∞.

  2. The random functions fi,n(Zi,n; θ) are said to be Lipschitz in the parameter θ on Θ if
    fi,n(Zi,n,θ)-fi,n(Zi,n,θ)Li,n(Zi,n)h(ν(θ,θ))a.s., (15)

    for all θ, θ ∈ Θ and iDn, n ≥ 1, where h is a nonrandom function with h(x) ↓ 0 as x ↓ 0, and Li,n are random variables with limsupnDn-1iDnELi,nη< for some η > 0

Towards establishing consistency of θ̂n we furthermore maintain the following moment and mixing assumptions.

Assumption 6

The moment functions qi,n(Zi,n; θ) have the following properties:

  1. They are p-dominated on Θ for p = 2.

  2. They are uniformly L1-NED on ε = {εi,n, iTn, n ≥ 1}, where DnTnD, and ε is α-mixing with α-mixing coefficients the conditions stated in Assumption 2(b).

  3. They are Lipschitz in the parameter θ on Θ.

Assumption 6(a) implies that supn,iDnE |qi,n(Zi,n; θ)|p < ∞ for each θ ∈ Θ. Assumptions 6(b) then allow us to apply the LLN given as Theorem 1 above the sample moments Rn(θ) = |Dn|−1ΣiDnqi,n(Zi,n, θ).

To verify Assumption 6(b) one can use either Proposition 2 or Proposition 3 to imply this condition from the lower level assumption that the data Zi,n are L1-NED. For example, the qi,n are L1-NED, if the Zi,n are L1-NED and satisfy the Lipschitz condition of Proposition 2. Note that no restrictions on the sizes of the NED coefficients are required.

Assumption 6(c) ensures stochastic equicontinuity of qi,n w.r.t. θ. Stochastic equicontinuity jointly with Assumption 6(a) and the pointwise LLN enable us to invoke the ULLN of Jenish and Prucha (2009) to prove uniform convergence of the sample moments, which in turn is used to establish that Qn converges uniformly to n. A sufficient condition for Assumption 6(c) is existence of integrable partial derivatives of qi,n w.r.t. θ if θ ∈ ℝk.

Our consistency results for the spatial GMM estimator given by (13) is summarized by the next theorem.

Theorem 3 (Consistency)

Suppose {Dn} is a sequence of finite sets of D such that |Dn| → ∞ as n → ∞, where D ⊂ ℝd, d ≥ 1 is as in Assumption 1. Suppose further that Assumptions 5 and 6 hold. Then

ν(θ^n,θ0n)p0asn,

and n(θ) is uniformly equicontinuous on Θ.

4.2 Asymptotic Normality

We next establish that the spatial GMM estimators defined by (13) is asymptotically normally distributed. For that purpose, we need a stronger set of assumptions than for consistency, including differentiability of the moment functions in θ. It proofs helpful to adopt the notation ∇θ in place of /∂θ.7

Assumption 7

  1. The minimizers θ0n lie uniformly in the interior of Θ with Θ ⊆ ℝk. Furthermore E [Rn(θ0n)] = 0.

  2. The functions qi,n: ℝpz × Θ → ℝpq are continuously differentiable w.r.t. θ for each z ∈ ℝpz.

  3. The functions qi,n(Zi,n; θ0n) are uniformly L2-NED on ε of size −d, and for some δ′ > 0 supn,iDnE |qi,n(Zi,n; θ0n)|2+δ< ∞. The functions ∇θqi,n(Zi,n; θ) are uniformly L1-NED on ε.

  4. The input process ε = {εi,n, iTn, n ≥ 1}, where DnTnD, is α-mixing and the mixing coefficients satisfy Assumption 3 for some δ < δ′, where δ′ is the same as in Assumption 7(c).

  5. The functions ∇θqi,n are p-dominated on Θ for some p > 1.

  6. The functions ∇θqi,n are Lipschitz in θ on Θ.

  7. infn λmin(|Dn|−1 Σn) > 0 where Σn = VariDnqi,n(Zi,n, θ0n)].

  8. infn λmin [EθRn(θ0n)′ ∇θRn(θ0n)] > 0.

The first part of Assumption 7(a) is needed to ensure that the estimator θ̂n lies in the interior of Θ with probability tending to one, and facilitates the application of the mean value theorem to Rn(θ̂n) around θ0n. The second part states in essence that the moment conditions are correctly specified. Its violation will generally invalidate the limiting distribution result.

Assumptions 7(c),(d),(g) enable us to apply the CLT for vector-valued NED processes given above as Corollary 1 to Rn(θ0n). Some low level sufficient conditions for Assumption 7(c) are given below. To establish asymptotic normality, we also need uniform convergence of ∇θRn on Θ, which is implied via Assumptions 7(c),(d),(e),(f). Finally, Assumption 7(h) ensures positive-definitness of the variance-covariance matrix of the GMM estimator.

Given the above assumptions, we have the following asymptotic normality result for the spatial GMM estimator defined by (13).

Theorem 4

Suppose {Dn} is a sequence of finite sets of D such that |Dn| → ∞ as n → ∞, where D ⊂ ℝd, d ≥ 1 is as in Assumption 1. Suppose further that Assumptions 5–7 hold. Then

(An-1BnBnAn-1)-1/2Dn1/2(θ^n-θ0n)N(0,Ik),

where

An=[EθRn(θ0n)]P[EθRn(θ0n)]andBn=[EθRn(θ0n)]P[Dn-1n]1/2.

Moreover, An=O(1);An-1=O(1);Bn=O(1);|(BnBn)-1|=O(1) and hence, θ̂n is |Dn|1/2-consistent for θ0n.

As remarked above, relative to the existing literature Theorem 4 allows for nonstationary processes and only assumes that qi,n and ∇θqi,n are NED on an α-mixing input process, rather than postulating that qi,n and ∇θqi,n are α-mixing. As such, Theorem 4 should provide a basis for constructing confidence intervals and hypothesis testing in a wider range of spatial models.

Using Proposition 3, we now give some sufficient conditions for Assumption 7(c).

Assumption 8

The process {Zi,n, iDnTn, n ≥ 1} is uniformly L2-NED on {εi,n, iTn, n ≥ 1} of size − 2d(r − 1)/(r − 2) for some r > 2.

Assumption 9

For every sequence {θ0n} on Θ, the functions qi,n(Zi,n; θ0n) and ∇θqi,n(Zi,n; θ0n) satisfy Lipschitz condition (6) in z, that is, for gi,n = qi,n or ∇θqi,n:

gi,n(z;θn)-gi,n(z;θn)Bi,n(z,z)z-z.

Furthermore, for the r > 2 as specified in Assumption 8,

supn,iDnsupsBi,n(s)2<andsupn,iDnsupsBi,n(s)|Zi,n-Zi,ns|r<

where Bi,n(s)=Bi,n(Zi,n,Zi,ns) with Zi,ns=E[Zi,nFi,n(s)].

5 Conclusion

The paper develops an asymptotic inference theory for a class of dependent nonstationary random fields that could be used in a wide range of econometric models with spatial dependence. More specifically, the paper extends the notion of near-epoch dependent (NED) processes used in the time series literature to spatial processes. This allows to accommodate larger classes of dependent processes than mixing random fields. The class of NED random fields is “closed with respect to infinite transformations” and thus should be sufficiently broad for many applications of interest. In particular, it covers autoregressive and infinite moving average random fields as well as nonlinear functionals of mixing processes. The NED property is also compatible with considerable heterogeneity and preserved under transformations under fairly mild conditions. Furthermore, a CLT and an LLN are derived for spatial processes that are NED on an α-mixing process. Apart from covering a larger class of dependent processes, these limit theorems also allow for arrays of nonstationary random fields on unevenly spaced lattices. Building on these limit results, the paper develops an asymptotic theory of spatial GMM estimators, which provides a basis for inference in a broad range of models with cross-sectional or spatial dependence.

Much of the random fields literature assumes that the process resides on an equally spaced grid. In contrast, and as in Jenish and Prucha (2009), we allow for locations to be unequally spaced. The implicit assumption of fixed locations seems reasonable for a large class of applications, especially in the short run. Still, an important direction for future work would be to extend the asymptotic theory to spatial processes with endogenous locations, while maintaining a set of assumptions that are reasonably easy to interpret.8 One possible approach may be to augment the contributions of the present paper with theory from point processes.

Acknowledgments

We would like to thank the Editor P.M. Robinson, Associate Editor and three anonymous referees for their valuable comments that led to substantial improvement of the paper. We thank the participants of the Cowles Foundation Conference, Yale, June 2009, and the seminar participants at the Columbia University for helpful discussions. This research benefitted from a University of Maryland Ann G. Wylie Dissertation Fellowship for the first author, and from financial support from the National Institute of Health through SBIR grant 1 R43 AG027622 for the second author.

A Appendix: Proofs for Sections 2 and 3

Throughout, let Inline graphic(s) = σ(εj,n; jTn: ρ(i, j) ≤ s) be the σ-field generated by the random vectors εj,n located in the s-neighborhood of location i. Furthermore, C denotes a generic constant that does not depend on n and may be different from line to line.

Proof of Proposition 1

The proof is available online on the authors’ webpages.

Proof of Theorem 1

Define Yi,n = Zi,n/Mn, then to prove the theorem, it suffices to show that Dn-1iDn(Yi,n-EYi,n)L10. We first establish moment and mixing conditions for Yi,n from those for Zi,n. Observe that in light of the definition of Mn and Assumption 2(a)

supn,iDnEYi,npsupn,iDnEZi,n/ci,np<. (A.1)

Thus, Yi,n is uniformly Lp-bounded for p > 1. Let Inline graphic(s) = σ(εj,n; jTn: ρ(i, j) ≤ s). Since Zi,n is L1-NED on ε = {εi,n, iTn, n ≥ 1}:

supn,iDn||Yi,n-E(Yi,nFi,n(s))||1supn,iDnMn-1di,nψ(s)ψ(s), (A.2)

observing that Mn = maxiDnmax(ci,n, di,n). Thus Yi,n is also L1-NED on ε.

Next we show that for each given s > 0, the conditional mean Vi,ns=E(Yi,nFi,n(s)) satisfies the assumptions of the L1-norm LLN of Jenish and Prucha (2009, Theorem 3). Using the Jensen and Lyapunov inequalities gives for all s > 0, iDn, n ≥ 1:

EVi,nspE{E(Yi,npFi,n(s))}supn,iDnEYi,np<.

So, Vi,ns is uniformly Lp-bounded for p > 1 and hence uniformly integrable. For each fixed s, Vi,ns is a measurable function of {εj,n; jTn: ρ(i, j) ≤ s}. Observe that under Assumption 1 there exists a finite constant C such that the cardinality of the set {jTn: ρ(i, j) ≤ s} is bounded by Csd; compare Lemma A.1 in Jenish and Prucha (2009). Hence,

α¯Vs(1,1,r){1,r2sα¯(Csd,Csd,r-2s),r>2s

and thus in light of Assumption 2(b)

r=1rd-1α¯Vs(1,1,r)r=12srd-1+ϕ(Csd,Csd)r=1(r+2s)d-1α^(r)<.

The above shows that indeed, for each fixed s, Vi,ns satisfies the assumptions of the L1-norm LLN of Jenish and Prucha (2009, Theorem 3). Therefore, for each s, we have

Dn-1iDn[E(Yi,nFi,n(s))-EYi,n]10asn. (A.3)

Furthermore observe that from (A.2) and the Minkowski inequality

Dn-1iDn(Yi,n-E(Yi,nFi,n(s)))1ψ(s). (A.4)

Given (A.3) and (A.4), and observing that lims→∞ ψ(s) = 0 it now follows that

limnDn-1iDn(Yi,n-EYi,n)1=limslimnDn-1iDn(Yi,n-EYi,n)1limslimsupnDn-1iDn(Yi,n-E(Yi,nFi,n(s)))1+limslimsDn-1iDn(E(Yi,nFi,n(s))-EYi,n)1=0.

This completes the proof of the LLN.

The proof of the CLT builds on Ibragimov and Linnik (1971), pp. 352–355, and makes use of the following lemmata:

Lemma A.1 (Brockwell and Davis (1991), Proposition 6.3.9)

Let Yn, n = 1, 2, … and Vns, s = 1, 2, …; n = 1, 2, …, be random vectors such that

  1. Vns ⇒ Vs as n → ∞ for each s = 1; 2; …

  2. Vs rr; V as s → ∞, and

  3. lims→∞ lim supn→∞ P(|YnVns| > ε) = 0 for every ε > 0.

    Then YnV as n → ∞.

Lemma A.2 (Ibragimov and Linnik (1971))

Let Lp( Inline graphic) and Lp( Inline graphic) denote, respectively, the class of Inline graphic-measurable and Inline graphic-measurable random variables ξ satisfying ||ξ||p < ∞. Let XLp( Inline graphic) and YLq( Inline graphic). Then, for any 1 ≤ p, q, r < ∞ such that p−1 + q−1 + r−1 = 1,

Cov(X,Y)<4α1/r(F1,F2)||X||p||Y||q

where α( Inline graphic; Inline graphic) = supAInline graphic,BInline graphic(|P(AB) − P(A)P(B)|).

To prove the CLT for NED random fields, we first establish some moment inequalities and a slightly modified version of the CLT for mixing fields developed in Jenish and Prucha (2009). It is helpful to introduce the following notation. Let X = {Xi,n, iDn, n ≥ 1} be a random field, then ||X||q: = supn,iDn||Xi,n||q for q ≥ 1.

Lemma A.3

Let {Xi,n} be uniformly L2-NED on a random field {εi,n} with α-mixing coefficients (u, v, r) ≤ (u + v)τ α̂(r), τ ≥ 0. Let Sn = ΣiDnXi,n and suppose that the NED coefficients of {Xi,n} satisfy r=1rd-1ψ(r)< and ||X||2+δ < ∞ for some δ > 0. Then,

  1. |Cov (Xi,nXj,n)| ≤ ||X||2+δ {C1||X||2+δ [h/3]*α̂δ/(2+δ) ([h/3]) + C2ψ([h/3])}, where h = ρ(i, j) and τ* = δτ/(2 + δ). If, r=1rd(τ+1)-1α^δ/(2+δ)(r)<, then for some C < ∞, not depending on n
    Var(Sn)CDn.
  2. |Cov (Xi,nXj,n)| ≤ ||X||2 {C3||X||2+δ [h/3]* α̂δ/(4+2δ) ([h/3]) + C4ψ([h/3])}, where h = ρ(i, j) and τ* = δτ/(4 + 2δ). If, r=1rd(τ+1)-1α^δ/(4+2δ)(r)< where τ* = δτ/(4 + 2δ), then for some C < ∞, not depending on n
    Var(Sn)C||X||2Dn.

Proof of Lemma A.3

The proof is available online on the authors’ webpages.

Theorem A.1

Suppose {Dn} is a sequence of finite subsets of D, satisfying Assumption 1, with |Dn| → ∞ as n → ∞. Suppose further that {εi,n; iDn, n ∈ ℕ} is an array of zero-mean random variables with α-coefficients (u, v, r) ≤ C(u + v)τ α̂(r) for some constants C < ∞ and τ ≥ 0. Suppose for some δ > 0 and γ > 0

limksupn,iDnE[εi,n2+δ1(εi,n>k)]=0

and

α^(r)=O(r-d(2μ+1)-γ)

with μ = max {τ, 1/δ}, and suppose liminfnDn-1σn2>0, then

σn-1iDnεi,nN(0,1).

where σn2=Var(iDnεi,n).

Proof of Theorem A.1

The proof is available online on the authors’ webpages.

The above CLT is in essence a variant of CLT for α-mixing random fields given as Corollary 1 of Theorem 1 in Jenish and Prucha (2009), applied to mixing coefficients of the type (u, v, r) ≤ C(u + v)τα̂(r), τ ≥ 0.

Proof of Theorem 2

Since the proof is lengthy it is broken into steps.

1. Transition from Zi,n to Yi,n = Zi,n/Mn

Let Mn = maxiDnci,n and Yi,n = Zi,n/Mn. Also, let σZ,n2=Var[Zi,n] and σY,n2=Var[Yi,n]=Mn-2σZ,n2. Since

σY,n-1iDnYi,n=σZ,n-1iDnZi,n,

to prove the theorem, it suffices to show that σY,n-1iDnYi,nN(0,1). Therefore, it proves convenient to switch notation from the text and to define

Sn=iDnYi,n,σn2=Var(Sn).

That is, in the following, Sn denotes ΣiDnYi,n rather than ΣiDnZi,n, and σn2 denotes the variance of ΣiDnYi,n rather than of ΣiDnZi,n. We now establish moment and mixing conditions for Yi,n from the assumptions of the theorem. Observe that by definition of Mn

1(Yi,n>k)=1(Zi,n/Mn>k)1(Zi,n/ci,n>k),

and hence

E[Yi,n2+δ1(Yi,n>k)]E[Zi,n/ci,n2+δ1(Zi,n/ci,n>k)]

so that Assumption 4(a) implies that

limksupn,iDnE[Yi,n2+δ1(Yi,n>k)]=0. (A.5)

Hence, Yi,n is also uniformly L2+δ bounded. Let ||Y||2+δ = supn,iDn||Yi,n||2+δ. Further, note that

||Yi,n-E(Yi,nFi,n(s))||2=Mn-1||Zi,n-E(Zi,nFi,n(s))||2ci,n-1di,nψ(s)Cψ(s) (A.6)

since supn,iDnci,n-1di,nC<, by assumption. Thus, Yi,n is uniformly L2-NED on ε with the NED coefficients ψ(m). Finally, observe that by Assumption 4(b):

infnDn-1σn2>0. (A.7)

Hence, there exists 0 < B < ∞ such that for all n

BDnσn2. (A.8)
2. Decomposition of Yi,n

For any fixed s > 0, decompose Xi,n as

Yi,n=ξi,ns+ηi,ns

where ξi,ns=E(Yi,nFi,n(s)),ηi,ns=Yi,n-ξi,ns. Let

Sn,s=iDnξi,ns;Sn,s=iDnηi,nsσn,s2=Var[Sn,s],σn,s2=Var[Sn,s]

Repeated use of the Minkowski inequality yields:

σn-σn,sσn,s,σn-σn,sσn,s. (A.9)

Observe that

E[E(Yi,nFi,n(s))Fi,n(m))]={E(Yi,nFi,n(s)),ms,E(Yi,nFi,n(m)),m<s.

and hence

||ηi,ns-E(ηi,nsFi,n(m))||2=||Yi,n-E[Yi,nFi,n(s)]-E[Yi,nFi,n(m)]+E[(Yi,nFi,n(s))Fi,n(m)]||2={||Yi,n-E(Yi,nFi,n(m))||2Cψ(m),ifms,||Yi,n-E(Yi,nFi,n(s))||2Cψ(s)Cψ(m),ifm<s.

since by definition the sequence ψ(m) is non-increasing. Thus, for any fixed s > 0, { ηi,ns} is uniformly L2-NED on ε with the same NED coefficients ψ(m) as the random field {Yi,n}. Furthermore, as shown in the proof of Lemma A.3, { ηi,ns} is also uniformly L2+δ bounded.

3. Bounds for the Variances of ΣYi;n and ηi,ns

First note that in light of Assumption 3, and observing that τ* = δτ/(4 + 2δ) ≤ τ* =δτ/(2 + δ) and α^δ/(2+δ)(r)α^δ2(2+δ)(r) we have

r=1rd(τ+1)-1α^δ/(2+δ)(r)r=1rd(τ+1)-1α^δ2(2+δ)(r)<,r=1rd(τ+1)-1α^δ/(4+2δ)(r)r=1rd(τ+1)-1α^δ2(2+δ)(r)<.

Using part (a) of Lemma A.3 with Xi,n = Yi,n and recalling (A.8), we have

BDnσn2=Var(Sn)CDn.

for some B > 0. Using part (b) of Lemma A.3 with Xi,n=ηi,ns we have

σn,s2=Var(Sn,s)CDn||ηi,ns||2=CDnψ(s) (A.10)

in light of (A.6). Hence,

limslimsupnσn,s2σn2Climsψ(s)=0. (A.11)

Furthermore, by (A.9) we have

limslimsupn|1-σn,sσn|limslimsupnσn,sσn=0 (A.12)

and hence for all s ≥ 1 and n ≥ 1

σn,sσnC<. (A.13)
4. CLT for iDnξi,ns

We now show that for any fixed s > 0, ξi,ns satisfies Theorem A.1.

First, since supn,iDnE[ξi,ns2+δ]<, the process { ξi,ns} is uniformly L2+δ-integrable for δ′ = δ/2, i.e.,

limksupn,iDnE[ξi,ns2+δ/21(ξi,ns>k)]=0.

Second, since ξi,ns is a measurable function of εi,n for any u, v ∈ ℕ and r > 2s

α¯ξ(u,v,r)α¯(uMsd,vMsd,r-2s)C(u+v)τα^(r-2s)

We next to show that α̂(r) = O(rd(2μ+1) −γ) for μ = max {τ, 2/δ} and some γ > 0. By assumption,

r=1rd(τ+1)-1α^δ2(2+δ)(r)<,

where τ* = δτ/(2 + δ), which implies

α^(r)=o(r-2d(2+δ)(τ+1)/δ)=o(r-d[2(τ+2/δ)+1]-d)=o(r-d[2μ+1]-d)

since μτ + 2/δ for μ = max {τ, 2/δ}. Thus, α̂(r) = O(rd(2μ+1) −γ) for γ = d.

We next show that for sufficiently large s,

0<liminfnDn-1σn,s2.

By (A.8),

B1/2infDn-1/2σn

Since lims→∞ ψ(s) = 0, there exists s* such that in light of (A.10) for all ss*,

Dn-1/2σn,sCψ1/2(s)B1/2/2. (A.14)

Hence by (A.9) for all ss*, |Dn|−1/2(σnσ̃n,s) ≤ |Dn|−1/2σn,s, and thus infn |Dn|−1/2σn,s ≥ infn |Dn|−1/2σn − supn |Dn|−1/2 σ̃n,s. Using (A.7) and (A.14), we have

liminfnDn-1/2σn,sB1/2-B1/22=B1/22>0

Thus, for all ss*,

σn,s-1iDnξi,nsN(0,1)asn. (A.15)

Since the first s* terms do not affect the analysis below we take in the following s* = 1.

5. CLT for σn-1iDnYi,n

Finally, using Lemma A.1 we now show that, given the maintained NED assumption, the just established CLT in (A.15) for the approximators ξi,ns can be carried over to the the Yi,n. Define

Wn=σn-1iDnYi,n,Vns=σn-1iDnξi,ns,Wn-Vns=σn-1iDnηi,ns

so that we can exploit Lemma A.1 to prove that

Wn=σn-1iDnYi,nV~N(0,1).

We first verify condition (iii) of Lemma A.1. By Markov’s inequality and (A.11), for every ε > 0 we have

limslimsupnP(Wn-Vns>ε)=limslimsupnP(|σn-1iDnηi,ns|>ε)limslimsupnσn,s2ε2σn2=0.

Next observe that Vns=σn,sσn[σn,s-1iDnξi,ns]. We proceed to show WnV by contradiction. For that purpose let Inline graphic be the set of all probability measures on (ℝ; Inline graphic), and observe that we can metrize Inline graphic by, e.g., the Prokhorov distance d(., .). Let μn and μ be the probability measures corresponding to Wn and V, respectively, then WnV, or μnμ, iff d(μn, μ) → 0 as n → ∞. Now suppose μn does not converge to μ. Then for some ε > 0 there exists a subsequence {n(m)} such that d(μn(m), μ) > ε for all n(m). By (A.13), we have 0 ≤ σn,s/σnC < ∞ for all s, n ≥ 1. Hence, 0 ≤ σn(m),s/σn(m)C < ∞ for all n(m). Consequently, for s = 1 there exists a subsubsequence {n(m(l1))} such that σn(m(l1)),1/σn(m(l1))p(1) as l1 → ∞. For s = 2, there exists a subsubsubsequence {n(m(l1(l2)))} such that σn(m(l1(l2))),2/σn(m(l1 (l2)))p(2) as l2 → ∞. The argument can be repeated for s = 3, 4…. Now construct a subsequence {nl} such that n1 corresponds to the first element of {n(m(l1))}, n2 corresponds to the second element of {n(m(l1(l2)))}, and so on, then

limlσnl,sσnl=p(s) (A.16)

for s = 1, 2, … Given (A.15), it follows that as l → ∞

VnlsVs~N(0,p2(s)).

Then, it follows from (A.12) that

limsp(s)-1limsliml|p(s)-σnl,sσnl|+limssupn1|σn,sσn-1|=0.

Thus VsV and thus by Lemma A.1 WnlV ~ N(0, 1) as l → ∞. Since {nl} ⊆ {n(m)} this contradicts the assumption that d(μn(m), μ) > ε for all n(m). This completes the proof of the CLT.

Proof of Corollary 1

The proof is available online on the authors’ webpages.

B Appendix: Proofs for Section 4

Proof of Theorem 3

We show that

supθΘQn(θ)-Q¯n(θ)p0 (B.1)

as n → ∞. As discussed in the text, given that the θ0n are identifiably unique it then follows immediately from, e.g., Pötscher and Prucha (1997), Lemma 3.1, that ν(θ^n,θ0n)p0 as claimed.

We start by proving that

Dn-1iDn[qi,n(Zi,n,θ)-Eqi,n(Si,n,θ)]p0 (B.2)

for each θ ∈ Θ, by applying the LLN given as Theorem 1 in the text to qi,n(Zi,n, θ). By Assumption 6(a), we have supn,iDnE |qi,n(Zi,n, θ)|p < ∞ for each θ ∈ Θ and p = 2, which verifies Assumption 2(a) for qi,n(Zi,n, θ) with ci,n = 1. By Assumption 6(b), the qi,n(Zi,n, θ) are uniformly L1-NED on ε, and hence w.o.l.g. we can take di,n = 1. Furthermore, by Assumption 6(b) the input process ε is α-mixing, and the α-mixing coefficients satisfy Assumption 2(b). Consequently (B.2) follows directly from Theorem 1 applied to qi,n(Zi,n, θ).

Next, by Proposition 1 of Jenish and Prucha (2009), Assumption 6(c) implies that qi,n is L0 stochastically equicontinuous on Θ, i.e., for every ε > 0

limsupn1DniDnP(supν(θ,θ)δqi,n(Zi,n,θ)-qi,n(Zi,n,θ)>ε)0asδ0.

Furthermore, in light of Assumption 6(a) the qi,n(Zi,n, θ) clearly satisfy the domination condition postulated by the ULLN in Jenish and Prucha (2009), stated as Theorem 2 in that paper. Given that we have already verified the pointwise LLN in (B.2) it now follows directly from that theorem that

supθΘRn(θ)-ERn(θ)p0 (B.3)

with Rn(θ) = |Dn|−1ΣiDnqi,n(Zi,n, θ), and that the ERn(θ) are uniformly equicontinuous on Θ in the sense that

limsupnsupθΘsupν(θ,θ)δERn(θ)-ERn(θ)0asδ0.

To prove (B.1), observe that

supθΘQn(θ)-Q¯n(θ)supθΘRn(θ)PRn(θ)-ERn(θ)PERn(θ)+supθΘRn(θ)(Pn-P)Rn(θ)supθΘRn(θ)PRn(θ)-ERn(θ)PERn(θ)+2supθΘRn(θ)2Pn-P. (B.4)

Furthermore observe that Assumption 6(a) we have E [supθ∈Θ|qi,n(Zi,n, θ)|] ≤ K and E [supθ∈Θ|qi,n(Zi,n, θ)|]2K for some finite constant K. Thus

supθΘERn(θ)EsupθΘRn(θ)Dn-1iDnEsupθΘqi,n(Zi,n,θ)K (B.5)
EsupθΘRn(θ)2Dn-2i,jDnE[supθΘqi,n(Zi,n,θ)supθΘqj,n(Zj,n,θ)]Dn-2i,jDn[E(supθΘqi,n(Zi,n,θ))2]1/2[E(supθΘqj,n(Zj,n,θ))2]1/2K. (B.6)

Now consider the first terms on the r.h.s. of the last inequality of (B.4). From (B.5) we see that E |Rn(θ)| takes on its values in a compact set. Given (B.3) it now follows immediately from part (a) of Lemma 3.3 of Pötscher and Prucha (1997) that

supθΘRn(θ)PRn(θ)-ERn(θ)PERn(θ)P0. (B.7)

Next we show that also the second term on the r.h.s. of the last inequality of (B.4) converges in probability to zero. To see that this is indeed the case observe that supθ∈Θ |Rn(θ)|2 = Op(1) in light of (B.6) and Pn-PP0 by assumption. This completes the proof of (B.1).

Having established that ERn(θ) are uniformly equicontinuous on Θ, the uniform equicontinuity of n(θ) on Θ follows immediately from Lemma 3.3(b) of Pötscher and Prucha (1997).

Proof of Theorem 4

Clearly by Theorem 3 we have θ̂nθ0n = op(1).

Step 1

The estimators θ̂n corresponding to the objective function (13) satisfy the following first order conditions:

θRn(θ^n)Pn[Dn1/2Rn(θ^n)]=op(1). (B.8)

The op(1) term on the r.h.s. reflects that the first order conditions may not hold if θ̂n falls onto the boundary of Θ, and that the probability of that event goes to zero as n → ∞, since the θ0n are uniformly in the in the interior of Θ by Assumption 7(a). If θ̂n is in the interior of Θ, then the l.h.s. of (B.8) is zero.

Taking the mean value expansion of Rn(θ̂n) about θ0n yields

Rn(θ^n)=Rn(θ0n)+θRn(θn)(θ^n-θ0n) (B.9)

where θ̃n ∈ Θ is between θ̃n and θ0n (component-by-component). Let

A^n=θRn(θ^n)PnθRn(θn)andB^n=θRn(θ^n)Pn[Dn-1n]1/2,

then combining (B.8) and (B.9) gives

Dn1/2(θ^n-θ0n)=[I-A^n+A^n]Dn1/2(θ^n-θ0n)-A^n+θRn(θ^n)Pn[Dn1/2Rn(θon)]+A^n+op(1)=[I-A^n+A^n]Dn1/2(θ^n-θ0n)-A^n+B^n[n-1/2DnRn(θ0n)]+A^n+op(1), (B.10)

where A^n+ denotes the generalized inverse of Ân.

Step 2

By Assumptions 7(c) the qi,n(Zi,n, θ0n) are uniformly L2-NED and uniformly L2+δ-integrable with ci,n = 1. Given Assumptions 7(d),(g) it is now readily seen that the process {qi,n(Zi,n, θ0n), iDn} satisfies all assumptions of the CLT for vector-valued NED processes, given as Corollary 1 in the text, with cin = 1. (Note that Assumption 4(d) is satisfied automatically since the qi,n(Zi,n, θ0n) are uniformly L2-NED.) Hence,

n-1/2DnRn(θ0n)=n-1/2iDnqi,n(Zi,n,θ0n)N(0,Ipq), (B.11)

with Σn = VariDnqi,n(Zi,n, θ0n)] and supn λmax [|Dn|−1 Σn]< ∞.

Step 3

By Assumptions 7(c),(d),(e) the functions ∇θqi,n(Zi,n, θ) satisfy for each θ ∈ Θ the LLN given as Theorem 1 in the text with ci,n = 1, observing that Assumption 2(b) is implied by 3. By argumentation analogous as used in the proof of consistency we have

Dn-1iDn(θqi,n(Zi,n,θ)-Eθqi,n(Zi,n,θ))p0.

By Proposition 1 of Jenish and Prucha (2009), Assumption 7(f) implies that the ∇θqi,n(Zi,n; θ) are uniformly L0-equicontinuous on Θ. Given L0-equicontinuity and Assumption 7(e), we have by the ULLN of Jenish and Prucha (2009):

supθΘθRn(θ)-EθRn(θ)p0. (B.12)

and furthermore, the EθRn(θ) are uniformly equicontinuous on Θ in the sense:

limsupnsupθΘsupθ-θ<δEθRn(θ)-EθRn(θ)0 (B.13)

as δ → 0. In light of (B.12) and (B.13), and given that θ̂nθ0n = op(1) and hence θ̃nθ0n = op(1), if follows further that

θRn(θ^n)-EθRn(θ0n)p0,andθRn(θn)-EθRn(θ0n)p0.

Hence,

A^n-Anp0andB^n-Bnp0, (B.14)

where Ân and n are as defined above, and

An=[EθRn(θ0n)]P[EθRn(θ0n)]andBn=[EθRn(θ0n)]P[Dn-1n]1/2.
Step 4

Given Assumptions 7(e),(f), and since P is positive definite, we have |An| = O(1) and An-1=O(1), respectively. Hence by, e.g., Lemma F1 in Pötscher and Prucha (1997) we have Ân = Op(1), A^n+=Op(1), Ân is nonsingular with probability tending to one, and A^n+-An-1p0. In light of the above it follows from (B.10) that

Dn1/2(θ^n-θ0n)=-A^n+B^n[n-1/2DnRn(θ0n)]+op(1)=-An-1Bn[n-1/2DnRn(θ0n)]+op(1)

Recalling that supn λmax [|Dn|−1 Σn]< ∞, Assumptions 7(e) implies that |Bn| = Op(1). In light of Assumption 7(g),(h) BnBn is invertible and furthermore (BnBn)-1=O(1). Thus |(An-1BnBnAn-1)-1|An2(BnBn)-1=O(1) and therefore

(An-1BnBnAn-1)-1/2Dn1/2(θ^n-θ0n)=-(An-1BnBnAn-1)-1/2An-1Bn[n-1/2DnRn(θ0n)]+op(1).

The claim that (An-1BnBnAn-1)-1/2Dn1/2(θ^n-θ0n)N(0,Ik) now follows, e.g., from Corollary F4(b) in Pötscher and Prucha (1997).

Footnotes

1

The space and metric are not restricted to physical space and distance.

3

These conditions for linear processes with general independent innovations were first established by Gorodetskii (1977). Doukhan and Guyon (1991) generalized them to random fields.

4

This important early contribution employs Bolthausen’s (1982) CLT for stationary α-mixing random fields on the regular lattice ℤ2. However, the mixing and stationarity assumptions may not hold in many applications. The present paper relaxes these critical assumptions.

5

The proof of the proposition shows that ||gi,n(Zi,n)-E[gi,n(Zi,n)Fi,n(s)]||p2CZi,n-Zi,nsp, which explains the 2 in the scaling factor for gi,n(Zi,n).

6

In an important contribution, Conley (1999) gives a first set of results regarding the asymptotic properties of GMM estimators under the assumption that the data process is stationary and α-mixing. Conley also maintains some high level assumption such as first moment continuity of the moment function, which in turn immediately implies uniform convergence - see, e.g., Pötscher and Prucha (1989) for a discussion. Our results extend Conley (1999) in several important directions, as indicated above. We establish uniform convergence from primitive sufficient conditions via the generic uniform law of large numbers given in Jenish and Prucha (2009) and the law of large numbers given as Theorem 1 above.

7

To ensure that the derivatives are defined on the border of Θ, we assume in the following that the moment functions are defined on an open set containing Θ, and that the qi,n and ∇θqi,n are restrictions to Θ.

8

Pinkse et al. (2007) made an interesting contribution in this direction. Their catalogue of assumption is at the level of Bernstein blocks. Without further sufficient conditions, verification of those assumptions would typically be challenging in practical situations.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

Contributor Information

Nazgul Jenish, Email: nazgul.jenish@nyu.edu.

Ingmar R. Prucha, Email: prucha@econ.umd.edu.

References

  • 1.Andrews DWK. Non-strong mixing autoregressive processes. Journal of Applied Probability. 1984;21:930–934. [Google Scholar]
  • 2.Andrews DWK. Consistency in nonlinear econometric models: a generic uniform law of large numbers. Econometrica. 1987;55:1465–1471. [Google Scholar]
  • 3.Andrews DWK. Laws of large numbers for dependent non-identically distributed variables. Econometric Theory. 1988;4:458–467. [Google Scholar]
  • 4.Bierens HJ. Robust methods and asymptotic theory. Berlin: Springer Verlag; 1981. [Google Scholar]
  • 5.Billingsley P. Convergence of probability measures. New York: John Wiley and Sons; 1968. [Google Scholar]
  • 6.Bolthausen E. On the central limit theorem for stationary mixing random fields. Annals of Probability. 1982;10:1047–1050. [Google Scholar]
  • 7.Bradley R. Some examples of mixing random fields. Rocky Mountain Journal of Mathematics. 1993;23(2):495–519. [Google Scholar]
  • 8.Brockwell P, Davis R. Times series: theory and methods. Springer Verlag; 1991. [Google Scholar]
  • 9.Bulinskii AV. Limit theorems under weak dependence conditions. Moscow University Press; 1989. [Google Scholar]
  • 10.Bulinskii AV, Doukhan P. Vitesse de convergence dans le theorem de limite centrale pour des champs melangeants des hypotheses de moment faibles. C.R. Academie Sci Paris, Serie I. 1990:801–105. [Google Scholar]
  • 11.Chen X, Conley T. A new semiparametric spatial model for panel time series. Journal of Econometrics. 2001;105:59–83. [Google Scholar]
  • 12.Cliff A, Ord J. Spatial processes, models and applications. London: Pion; 1981. [Google Scholar]
  • 13.Conley T. GMM estimation with cross sectional dependence. Journal of Econometrics. 1999;92:1–45. [Google Scholar]
  • 14.Davidson J. A central limit theorem for globally nonstationary near-epoch dependent functions of mixing processes. Econometric Theory. 1992;8:313–329. [Google Scholar]
  • 15.Davidson J. An L1 convergence theorem for heterogenous mixingale arrays with trending moments. Statistics and Probability Letters. 1993;8:313–329. [Google Scholar]
  • 16.Davidson J. Stochastic limit theory. Oxford University Press; 1994. [Google Scholar]
  • 17.De Jong RM. Central limit theorems for dependent heterogeneous random variables. Econometric Theory. 1997;13:353–367. [Google Scholar]
  • 18.Dell M. The persistent effects of Peru’s mining mita. Econometrica. 2010;78:1863, 1903. [Google Scholar]
  • 19.Dobrushin R. The description of a random field by its conditional distribution and its regularity condition. Theory of Probability and its Applications. 1968;13:197–227. [Google Scholar]
  • 20.Doukhan P, Guyon X. Mixing for linear random fields, C.R. Academie Sciences Paris, Serie 1. 1991;313:465–470. [Google Scholar]
  • 21.Doukhan P, Lang G. Rates in the empirical central limit theorem for stationary weakly dependent random fields. Statistical Inference for Stochastic Processes. 2002;5:199–228. [Google Scholar]
  • 22.Doukhan P, Louhichi S. A new weak dependence condition and applications to moment inequalities. Stochastic Processes and Their Applications. 1999;84:313–342. [Google Scholar]
  • 23.Fogli A, Veldkamp L. Nature or Nurture? Learning and geography of female force participation. Econometrica. 2011;79:1103–1138. [Google Scholar]
  • 24.Gallant AR, White H. A unified theory of estimation and inference for nonlinear dynamic models. New York: Basil Blackwell; 1988. [Google Scholar]
  • 25.Gorodetskii VV. On the strong mixing property for linear sequences. Theory of Probability and Applications. 1977;22:411–413. [Google Scholar]
  • 26.Hallin M, Lu Z, Tran LT. Density estimation for spatial linear processes. Bernoulli. 2001;7:657–668. [Google Scholar]
  • 27.Hallin M, Lu Z, Tran LT. Kernel density estimation for spatial processes: the L1 theory. Multivariate Analysis. 2004;88:61–75. [Google Scholar]
  • 28.Ibragimov IA. Some limit theorems for stationary processes. Theory of Probability and Applications. 1962;7:349–382. [Google Scholar]
  • 29.Ibragimov IA, Linnik YV. Independent and stationary sequences of random variables. Wolters-Noordhoff; Groningen: 1971. [Google Scholar]
  • 30.Jenish N, I, Prucha R. Central limit theorems and uniform laws of large numbers for arrays of random fields. Journal of Econometrics. 2009;150:86–98. doi: 10.1016/j.jeconom.2009.02.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Jenish N, Prucha IR. Working paper. 2011. On spatial processes and asymptotic inference under near epoch dependence. Version January 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Kelejian HH, I, Prucha R. Estimation of simultaneous systems of spatially interrelated cross sectional equations. Journal of Econometrics. 2004;118:27–50. [Google Scholar]
  • 33.Kelejian HH, I, Prucha R. HAC estimation in a spatial framework. Journal of Econometrics. 2007;140:131–154. doi: 10.1016/j.jeconom.2009.10.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Kelejian HH, I, Prucha R. Specification and estimation of spatial autoregressive models with autoregressive and heteroskedastic disturbances. Journal of Econometrics. 2010;157:53–67. doi: 10.1016/j.jeconom.2009.10.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Lee LF. Asymptotic distributions of quasi-maximum likelihood estimators for spatial autoregressive models. Econometrica. 2004;72:1899–1925. [Google Scholar]
  • 36.Lee LF. GMM and 2SLS for mixed regressive, spatial autoregressive models. Journal of Econometrics. 2007;137:489–514. [Google Scholar]
  • 37.Lu Z. Asymptotic normality of kernel density estimators under dependence. Annals of Institute of Statistical Mathematics. 2001;53:447–468. [Google Scholar]
  • 38.Lu Z, Linton O. Local linear fitting under near epoch dependence. Econometric Theory. 2007;23:37–70. [Google Scholar]
  • 39.McLeish DL. Invariance principles for dependent variables. Z. Wahrsch. verw. Gebiete. 1975;32:165–78. [Google Scholar]
  • 40.Nahapetian B. An approach to central limit theorems for dependent random variables. Theory of Probability and its Applications. 1987;32:589–594. [Google Scholar]
  • 41.Pinkse J, Shen L, Slade ME. A central limit theorem for endogenous locations and complex spatial interactions. Journal of Econometrics. 2007;140:215–225. [Google Scholar]
  • 42.Pinkse J, Slade LME, Brett Spatial price competition: a semiparametric approach. Econometrica. 2002;70:1111–1153. [Google Scholar]
  • 43.Pötscher BM, I, Prucha R. A uniform law of large numbers for dependent and heterogeneous data processes. Econometrica. 1989;57:675–683. [Google Scholar]
  • 44.Pötscher BM, I, Prucha R. Generic uniform convergence and equicontinuity concepts for random functions. Journal of Econometrics. 1994;60:23–63. [Google Scholar]
  • 45.Pötscher BM, Prucha IR. Dynamic nonlinear econometric models. Springer-Verlag; New York: 1997. [Google Scholar]
  • 46.Robinson PM. Large-sample inference on spatial dependence. The Econometrics Journal, Tenth Anniversary Special Issue. 2009;12:S68–S82. [Google Scholar]
  • 47.Robinson PM. Efficient estimation of the semiparametric spatial autoregressive model. Journal of Econometrics. 2010;157:6–17. [Google Scholar]
  • 48.Takahata H. On the rates in the central limit theorem for weakly dependent random fields. Z. Wahrsch. verw. Gebiete. 1983;64:445–456. [Google Scholar]
  • 49.Wooldridge J. PhD Dissertation. University of California; San Diego: Department of Economics; 1986. Asymptotic properties of econometric estimators. [Google Scholar]
  • 50.Yu J, de Jong R, Lee LF. Quasi-maximum likelihood estimators for spatial dynamic panel data with fixed effects when both N and T are large. Journal of Econometrics. 2008;146:118–134. [Google Scholar]

RESOURCES