Quasi-least squares with mixed linear correlation structures

Jichun Xie; Justine Shults; Jon Peet; Dwight Stambolian; Mary Frances Cotch

doi:10.4310/sii.2010.v3.n2.a9

. Author manuscript; available in PMC: 2012 Apr 17.

Published in final edited form as: Stat Interface. 2010;3(2):223–234. doi: 10.4310/sii.2010.v3.n2.a9

Quasi-least squares with mixed linear correlation structures

Jichun Xie ¹, Justine Shults ^2,^✉, Jon Peet ³, Dwight Stambolian ⁴, Mary Frances Cotch ⁵

PMCID: PMC3328409 NIHMSID: NIHMS239795 PMID: 22518205

Abstract

Quasi-least squares (QLS) is a two-stage computational approach for estimation of the correlation parameters in the framework of generalized estimating equations. We prove two general results for the class of mixed linear correlation structures: namely, that the stage one QLS estimate of the correlation parameter always exists and is feasible (yields a positive definite estimated correlation matrix) for any correlation structure, while the stage two estimator exists and is unique (and therefore consistent) with probability one, for the class of mixed linear correlation structures. Our general results justify the implementation of QLS for particular members of the class of mixed linear correlation structures that are appropriate for analysis of data from families that may vary in size and composition. We describe the familial structures and implement them in an analysis of optical spherical values in the Old Order Amish (OOA). For the OOA analysis, we show that we would suffer a substantial loss in efficiency, if the familial structures were the true structures, but were misspecified as simpler approximate structures. To help bridge the interface between Statistics and Medicine, we also provide R software so that medical researchers can implement the familial structures in a QLS analysis of their own data.

Keywords and phrases: Quasi-least squares, Linear correlation structure, Mixed correlation structure, Familial data

1. INTRODUCTION

We consider a secondary analysis of optical spherical values in a study in the Old Order Amish (OOA) (Wojciechowski et al., 2009). The families in the OOA study varied in both size and composition, because some nuclear families contained only siblings, while other families included siblings and one or both parents. The goal of the OOA analysis was to relate the expected spherical values, measurements that reflect quality of vision, with gender and age, while also adjusting for the correlation among measurements within each family. Because the correlations in the OOA study were thought to vary according to familial relationship, it was important to allow the sibling-sibling, sibling-father, and sibling-mother correlations to vary in value.

To model the pattern of association amongst measurements in families in the OOA study, we implemented slight generalizations of familial correlation structures considered by Karlin, Cameron, and Williams (1981) and Gleseer (1992). Gleseer (1992) noted that it is computationally difficult to obtain maximum likelihood (ML) estimates of the correlation parameters for normal data, when family sizes are not constant. Gleseer (1992) therefore obtained ML estimates that were weighted averages of estimates obtained for sub-groups with families of equal size. One limitation of the approaches of both Karlin et al. (1981) and of Gleseer (1992) was that they assumed that the expected value of the outcome variable was constant between the siblings. However, it is important to note that Karlin et al. (1981) and Gleseer (1992) allowed the variance of the outcome variable to vary between parents and siblings, while we assume a constant standard deviation of spherical values for all subjects.

We implement the familial correlation structures for analysis of the OOA study with quasi-least squares (QLS). QLS is an approach based on GEE that estimates the correlation parameters in two stages. In the following summary, estimates of the correlation parameters are defined to be feasible if they yield positive definite correlation matrices. Chaganty (1997) considered balanced data and established feasibility of the stage one estimates for the first order autoregressive AR(1), exchangeable, and tri-diagonal structures. Shults (1996) and Shults and Chaganty (1998) proved feasibility for the afore-mentioned structures, in addition to the Markov structure, for unbalanced data. However, although the stage one estimates exist and are feasible, they are not consistent. Chaganty and Shults (1999) therefore introduced a second stage of QLS and established consistency of the stage two estimates for the AR(1), Markov, and tri-diagonal correlation structures. The second stage of QLS updates the stage one estimate of α by obtaining a solution to an estimating equation (stage two estimating equation for α) with an estimating function that only depends on α and the stage one estimate of α. Theorem (3.2) of Chaganty and Shults (1999) establishes that if there exists a unique solution to the stage two estimating equation for α that is a continuous and one to one function of the stage one estimate, then that solution will be consistent for α. Software for implementation of QLS is available in SAS (Kim and Shults, 2008), Stata (Shults, Ratcliffe, and Leonard, 2007), MATLAB (Ratcliffe and Shults, 2008), and R (Xie and Shults, 2009).

We implement the familial structures using QLS instead of other extensions of GEE that involve more complicated correlation structures or weighting matrices than the original formulation of the GEE approach. For example, Prentice (1988) developed GEE1 for binary outcomes by constructing a second estimating equation (equation (14) on p. 1039) that involves specifying a working correlation structure for the sample correlations. Zhao and Prentice (1990) developed GEE2 for discrete and continuous outcomes by constructing a joint estimating equation for the regression and correlation parameters that involves a working covariance structure that depends on the third and fourth moments of the outcome variable. Carey, Zeger, and Diggle (1993) developed alternating logistic regression (ALR) for binary outcomes by setting up a GEE based on conditional means of the outcome variable (equation (7) on p. 521) that corresponds to the logistic regression of one response on another and that also involves specification of a weighting matrix. When each of these methods is applied, perhaps because of the difficulty in specifying an appropriate patterned structure, the typical approach is to simplify the correlation (weighting) structures by setting some or all of their off-diagonal elements equal to zero. For example, Carey et al. define their weighting matrix ((7), p. 521, 1993) as a diagonal matrix and say in their discussion that this results in a reasonably optimal weighting. When GEE1 and GEE2 are applied in analyses, off-diagonal elements of the working structures are often set equal to zero. Recent approaches based on GEE1 and GEE2, for example methods for hierarchical data by Qu, Williams, Beck, and Medendrop (1992) and Qaqish and Liang (1992), face the same challenge as GEE1 and GEE2 with regard to specification of an appropriate working structure. Overall, implementation of the structures we consider would be potentially much more challenging for these other recent extensions of GEE than it was for QLS.

In this manuscript, we prove two general results for QLS that can be used to justify implementation of QLS for the familial structures we consider. First, we prove that the QLS stage one estimate of α will exist and is feasible with probability one, for any correlation structure. Next, for the class of mixed linear correlation structures, we prove the existence and uniqueness of the QLS stage two estimates, both of which are required for consistency of α̂. A benefit of our results is that not only do they justify implementation of QLS for the familial structures we consider in this manuscript, but they can also be used to justify QLS for other structures. For example, Shults, Mazurick, and Landis (2006) implemented QLS for a banded Toeplitz (BT) correlation structure, but did not provide proofs regarding the existence and uniqueness of solutions of the QLS estimating equations for α for this structure. The BT structure is a member of the class of linear correlation structures, so that the results provided in this paper establish the consistency of the QLS estimators of α for this structure. In general, our results for stage one are applicable to any correlation structure, while our results for stage two are applicable to any mixed linear correlation structure.

As an outline for our paper, in Section 2, we give some notation; describe the familial structures we consider; and define mixed linear correlation structures. In Section 3 we then extend QLS for mixed linear correlation structures by proving several results for these structures. Next, we demonstrate the benefit of fitting mixed linear correlation structures: in Section 4, we conduct asymptotic relative efficiency (ARE) comparisons to show that the loss in efficiency in estimation of the regression parameter could be substantial in a QLS (or GEE) analysis of the OOA study, if the true mixed linear correlation structures were misspecified as a simpler, approximate structure. In Section 5 we then present our analysis of the OOA study that demonstrates application of the mixed correlation structure with QLS. The proofs of our theorems and lemmas are provided in the appendices.

2. BACKGROUND

2.1 Notation

We assume that outcomes Y_i = (Y_i₁, …, Y_{in_i})^T and associated covariates X_ij = (X_ij₁, …, X_ijp)^T are collected on family i, for i = 1, …, m. The expected value and variance of measurement Y_ij can be expressed using a generalized linear model (GLM):

E (Y_{i j}) = g^{- 1} (X_{i j}^{T} β) = μ_{i j} and Var (Y_{i j}) = φ h (μ_{i j})

(1)

respectively, where g⁻¹(·) is the link function; h(·) is the variance function; and φ is a known or unknown scale parameter. We assume that observations from different families are independent. However, measurements within families are correlated, with a pattern of association that can be described with correlation structures for each family i, Cor(Y_i) = R_i(α), that depend on an s by 1 correlation parameter α. The covariance matrix of Y_i is then given by $Cov (Y_{i}) = φ A_{i}^{1 / 2} R_{i} (α) A_{i}^{1 / 2}$ , where A_i = diag(h(μ_i1), …, h(μ_{in_i})).

2.2 Familial structures in the class of linear and mixed correlation structures

Define e_i as the unit vector with only the ith entry equal to 1. We refer to a correlation matrix as linear if

R_{i} (α) = \sum_{j = 1}^{s} (R_{i} (e_{j}) - R_{i} (0)) α_{j} + R_{i} (0),

(2)

so that each element of the matrix can be expressed as a linear combination of α. In this case α is identifiable if and only if

\sum_{j = 1}^{s} (R_{i} (e_{j}) - R_{i} (0)) c_{j} α_{j} = 0 if and only if c = {(c_{1}, \dots, c_{s})}^{'} = 0.

(3)

Several linear correlation structures were considered for analysis of the OOA study, which included two-generation families that varied in both size and composition. We assumed that the father-mother, father-sibling, mother-sibling, sibling-sibling correlations were γ, ρ₁, ρ₂ and α, respectively. If family i included both parents and siblings, this resulted in an extended familial correlation structure R_i to describe the pattern of association among the n_i measurements on family i:

Cor (Y_{i}) = {(\begin{matrix} 1 & γ & ρ_{1} & ρ_{1} & \dots & ρ_{1} \\ γ & 1 & ρ_{2} & ρ_{2} & \dots & ρ_{2} \\ ρ_{1} & ρ_{2} & 1 & α & \dots & α \\ ρ_{1} & ρ_{2} & α & 1 & \dots & α \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ ρ_{1} & ρ_{2} & α & α & \dots & 1 \end{matrix})}_{n_{i} \times n_{i}} .

(4)

Sabo and Chaganty (2009) made an excellent comparison of (4) for several approaches, for continuous outcomes and for families of equal size and composition.

For a family with only a father and siblings, R_i would have a familial structure:

Cor (Y_{i}) = {(\begin{matrix} 1 & ρ_{1} & ρ_{1} & \dots & ρ_{1} \\ ρ_{1} & 1 & α & \dots & α \\ ρ_{1} & α & 1 & \dots & α \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ ρ_{1} & α & α & \dots & 1 \end{matrix})}_{n_{i} \times n_{i}} .

(5)

For a family with only a mother and siblings, R_i would still have a familial structure, but with ρ₁ replaced by ρ₂ in (5):

Cor (Y_{i}) = {(\begin{matrix} 1 & ρ_{2} & ρ_{2} & \dots & ρ_{2} \\ ρ_{2} & 1 & α & \dots & α \\ ρ_{2} & α & 1 & \dots & α \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ ρ_{2} & α & α & \dots & 1 \end{matrix})}_{n_{i} \times n_{i}} .

(6)

Finally, for families with only siblings, the correlation structure would be exchangeable:

Cor (Y_{i}) = {(\begin{matrix} 1 & α & \dots & α \\ α & 1 & \dots & α \\ ⋮ & ⋮ & ⋱ & ⋮ \\ α & α & \dots & 1 \end{matrix})}_{n_{i} \times n_{i}} .

(7)

In our analysis of the OOA data, different families were allowed to have different correlation structures. However, all the structures were mixed correlation structures (MCS), which we define as structures that may vary between families but share correlation parameters, so that the parameters for family i take value in {γ, ρ₁, ρ₂, α}. See Chaganty and Deng (2007) for a discussion of ranges of measures of association for binary outcomes with familial patterns of association.

3. EXTENSION OF QUASI-LEAST SQUARES FOR MIXED LINEAR CORRELATION STRUCTURES

3.1 Quasi-least squares

Here, we briefly describe the method of QLS. Stage one of QLS iterates between updating the regression parameter β via (i) solution of the GEE estimating equation for β (Liang and Zeger, 1986):

\sum_{i = 1}^{m} D_{i}^{T} A_{i}^{- 1 / 2} R_{i}^{- 1} (α) A_{i}^{- 1 / 2} (Y_{i} - U_{i} (β)) = 0,

(8)

where U_i(β) = E(Y_i) and $D_{i} = \frac{\partial U_{i}}{\partial β}$ ; and (ii) updating the correlation parameter α by minimizing the generalized error sum of squares

Q (β, R (α)) = \sum_{i = 1}^{m} z_{i}^{T} (β) R_{i}^{- 1} (α) z_{i} (β)

(9)

with respect α ∈ Ω ⊆ ℝ^s, where $z_{i} (β) = A_{i}^{- 1 / 2} (Y_{i} - U_{i}) = (z_{i 1}, \dots, z_{{i n}_{i}})$ are known as the Pearson residuals. In addition, Ω is defined as the feasible region for the correlation structure (R_i(α))_1,…,_m, so that ∀α ∈ Ω and ∀i ∈ {1, …, m}, R_i(α) is positive definite. Stage one of QLS therefore involves solving the stage one estimating equation

D_{G} = \frac{\partial}{\partial α} {\sum_{i = 1}^{m} z_{i}^{T} (β) R_{i}^{- 1} (α) z_{i} (β)} = 0.

(10)

In general, the solution of (10) is not necessarily the minimizer of (9). However, in Section 3.2 we prove that if the R_i(α) are linear for all i, the solution of (10) does indeed minimize the generalized error sum of squares (9); Furthermore, the solution will be unique and feasible almost surely.

The QLS stage one estimates β̂ for β and δ̂ for α are the solutions of (8) and the minimizer of (9), respectively. However, Chaganty (1997) proved that the stage one QLS estimate of α is not consistent. In order to correct the asymptotic bias for the QLS stage one estimates, after convergence in stage one, we next solve the stage two estimating equation that depends on the stage one estimates δ̂ (Chaganty and Shults, 1999), for α:

{\sum_{i = 1}^{m} tr [\frac{\partial R_{i}^{- 1} (δ)}{\partial δ} R_{i} (α)] |}_{δ = \hat{δ}} = 0.

(11)

Theorem 3.2 of Chaganty and Shults (1999) established that if there is a unique root α̂ for equation (11) that is a one to one and continuous function of δ̂ and the structure is correctly specified, then α̂ is consistent. We refer to α̂ as the stage two estimator of α, based on which, we obtain the final estimator β̂ for β by again solving the GEE estimating equation (8) for β, evaluated at the stage two estimates for α.

3.2 Results that justify application of quasi-least squares for mixed linear correlation structures

In this section we first provide general proofs regarding the existence and feasibility of the stage one QLS estimates in Section 3.2.1. Next, in Section 3.2.2 we prove the consistency of the stage two QLS estimates for the mixed linear correlation structures. The proofs for all results are provided in the appendices.

3.2.1 General proof of feasibility for stage one QLS estimates

We first provide a theorem that establishes the feasibility of the global minimizer for (9).

Theorem 3.1

If for each subject i, R_i(α) is a differentiable n_i × n_i matrix, then the global minimizer for (9) in Ω is an inner point of Ω, where Ω is the feasible region of (R_i(α))_1,…,m.

Although the stage one QLS estimator of α is not the final estimator, its existence and feasibility is very important because failure to yield feasible estimates in stage one of QLS could cause a breakdown in the first phase of the procedure. For example, Crowder (1995) described the potential for breakdown in iterative procedures such as GEE that can occur when the estimated correlation matrices are not positive definite. Theorem 3.1 ensures that this type of failure will not occur in stage one of QLS.

However, while Theorem 3.1 ensures the existence of solutions for the stage one QLS estimating equation (10), it does not guarantee that the root is unique. If (10) has multiple roots, it can be difficult to obtain all the roots. Furthermore, it might not be straightforward to find the global minimizer for (9), if the generalized error sum of squares has several local minimizers. However, for correlation structures that meet the condition in (12), this minimization problem is fairly straightforward, because under this fairly general condition, (9) is convex almost surely, so that there will exist a unique root for (10) almost surely.

Theorem 3.2

Suppose each cluster i ∈ {1, …, m} in the data under consideration has correlation structure R_i(α). If ∀ α ∈ Ω,

\sum_{j = 1}^{s} \frac{\partial R_{i} (α)}{\partial α_{j}} c_{j} = 0 i f and only i f c = 0,

(12)

then (10) has a unique solution in the feasible region Ω almost surely.

Corollary 3.3

Suppose for each cluster i ∈ {1, …, m} of the longitudinal data, we have a linear correlation structure R_i(α) of the form (2). Then if α is identifiable, (10) has a unique solution in the feasible region Ω almost surely.

Theorem 3.2 provides the criterion (12) that will ensure that the stage one estimating equation (10) has a unique solution; This requirement is fairly general, and is to be satisfied by several common structures, including the exchangeable, tri-diagonal (Chaganty and Shults, 1999), BT (Shults et al., 2006), and also by the familial structures implemented in this manuscript.

3.2.2 Consistency of the stage two QLS estimates for linear correlation structures

Here we first prove that for linear correlation structures, the stage two estimator exists and is unique, with probability one.

Theorem 3.4

If for each cluster i ∈ {1, …, m}, the within subject correlation R_i(α) has a linear correlation structure of form (2), then the stage two estimating equation (11) has a unique solution with probability one.

In the proof of Theorem 3.4 in Appendix A, we provide an explicit solution for the stage two estimating solution for linear correlation structures. Suppose we obtain the stage one estimator δ̂. Define $A_{i j} = R_{i}^{- 1} (\hat{δ}) (R_{i} (e_{j}) - R_{i} (0)), M_{j k} = \sum_{i = 1}^{m} tr (A_{i j} A_{i k})$ , and $w_{j} = - \sum_{i = 1}^{m} tr (A_{i j} R_{i}^{- 1} (\hat{δ}) R_{i} (0))$ . Suppose M = (M_jk)_s_×_s and w = (w₁, …, w_s)^T. We can then express the stage two estimator in a very simple form:

\hat{α} = M^{- 1} (\hat{δ}) w (\hat{δ}),

(13)

which is very helpful with respect to computation, especially when the dimension of α is high. Chaganty and Shults (1999) proved that if a unique solution α̂ exists to the stage two estimating equation for α that is a continuous and one to one function of the stage one estimate of α, then α̂ will be consistent. Therefore, we have proven that under correct specification of the mixed linear correlation structure, there will exist a unique solution of the stage two estimating equation that will be consistent for α.

4. ASYMPTOTIC RELATIVE EFFICIENCY CALCULATIONS

Here we assess the loss in efficiency that results from incorrectly specifying the mixed correlation structure in the OOA analysis. We assume here that the true structure for cluster i is the mixed structure R_i(α) described in Section 5, for α = (ρ₁, ρ₂, γ, α), while the working structure is the exchangeable structure W_i(γ) = (1−γ)I_{n_i×n_i} + γJ_{n_i×n_i}, where I_{n_i×n_i} is an n_i by n_i identity matrix and J_{n_i×n_i} is an n_i × n_i matrix of ones. We consider the exchangeable working structure because this is a popular structure for analysis of clustered data. In addition, we note that the true mixed familial structures include exchangeable structures for OOA families that contain only siblings. Our misidentification scenario therefore represents the situation in which we have correctly assumed that the sibling-sibling correlations are equal, but have incorrectly assumed that the sibling-sibling, sibling-father, sibling-mother, and father-mother correlations are identical.

The efficiencies are calculated using the same approach that was implemented and described in Shults and Morrow (2002) and in Shults et al. (2006). To briefly summarize, we first note that Chaganty (1997) proved that $\sqrt{m} (\hat{β} - β)$ is asymptotically normal with mean zero and covariance matrix

V_{w} = lim_{m \to \infty} m φ W_{t} {\sum_{i = 1}^{m} X_{i}^{'} A_{i}^{1 / 2} W_{i}^{- 1} R_{i} W_{i}^{- 1} A_{i}^{1 / 2} X_{i}} W_{t},

(14)

where

W_{t} = {\sum_{i = 1}^{m} X_{i}^{'} A_{i}^{1 / 2} W_{i}^{- 1} A_{i}^{1 / 2} X_{i}}^{- 1} .

(15)

If the correct structure was specified, so that W_i = R_i, then the covariance matrix V_w can be simplified as V_t = lim_m_→∞ mφW_t.

The efficiency for β̂_j was then evaluated as the j^th diagonal element of V_t divided by the j^th diagonal element of V_w. However, as noted by Sutradhar and Das (1999), γ̂ may fail to be consistent when the true structure is mis-specified, so that the efficiencies should be calculated at the limiting value of γ̂. We therefore evaluated the efficiencies at W_i(f(α)) and R_i(α), where f(α) is the limiting value of γ̂ when the mixed correlation structure is misspecified as exchangeable. An algorithm to obtain the limiting value f(α) as a function of the true correlation parameter α is provided in the Appendix. Because the efficiencies were calculated as the number of subjects m → ∞, we assumed that the covariate design for the OOA study was replicated as m increases.

In addition, because the asymptotic distribution for β̂ is identical for QLS and GEE, the approach for calculation of AREs described in this section also applies when GEE is applied for an exchangeable working structure, but the true structures are mixed familial structures. GEE implements the following moment estimate for the exchangeable structure that is a function of the Pearson residuals z_ij:

{\hat{α}}_{GEE} = \frac{\sum_{i = 1}^{m} \sum_{k \neq j} z_{i k} z_{i j}}{\sum_{i = 1}^{m} \sum_{k = 1}^{n_{i}} (n_{i} - 1) z_{i k}^{2}} .

(16)

It is straightforward to show (Wang and Carey, 2003) that the limiting value of α̂_GEE is given by

\frac{\sum_{i = 1}^{m} \sum_{k \neq j} Corr (y_{i j}, y_{i k})}{\sum_{i = 1}^{m} (n_{i} - 1) n_{i}} = \frac{\sum_{i = 1}^{m} (e_{i}^{'} R_{i} (α) e_{i} - n_{i})}{\sum_{i = 1}^{m} (n_{i} - 1) n_{i}},

(17)

where e_i is an n_i by 1 vector of ones. The limiting values were almost identical for QLS and GEE. As a result, the efficiencies were almost identical for the two approaches.

Table 1 displays the efficiencies for QLS. (An equivalent table for GEE, with almost identical results, is available on request.) Lines 1–3 in Table 1 assess the situation when the father-sibling and sibling-sibling correlations are negligible, but the mother-sibling correlations are non-negligible and get increasingly larger (in going from line 1 to line 3). Lines 4–6 assess the situation when the father-sibling and mother-sibling correlations are negligible, but the sibling-sibling correlations are non-negligible and get increasingly larger (in going from lines 4 to 6). Lines 7–9 assess the situation when the father-sibling and mother-sibling correlations are nonnegligible and similar in value, with sibling-sibling correlations that get increasingly larger (in going from lines 7–9). Table 1 indicates that, as we might anticipate, the loss in efficiency is negligible when the true correlations are small, so that the true structure is close to an identity structure, which is a special case of an exchangeable structure (with γ = 0). However, as the true correlations increase in value, the loss in efficiency can become substantial when the true mixed familial structures are misspecified as exchangeable. For example, as shown in line 6, when ρ₁ = 0.02, ρ₂ = 0.05, and α = 0.71, then the ARE for age is only 79 percent.

Table 1.

Percent efficiencies for the regression coefficients for the constant term, gender, and age, when the true mixed correlation structure is misspecified as exchangeable. True structure = mixed R_i(α) where α = (ρ₁, ρ₂, α); working structure = exchangeable with parameter γ. limit = f(α) is the limiting value of γ̂ when the true mixed structure is misspecified as exchangeable in the analysis of the OOA study

ρ₁	ρ₂	α	limit	constant	gender	age
0.02	0.11	0.05	0.0510	0.99	0.99	0.99
0.02	0.31	0.05	0.0604	0.97	0.98	0.97
0.02	0.41	0.05	0.0652	0.88	0.72	0.88

0.02	0.05	0.41	0.3582	0.94	0.96	0.95
0.02	0.05	0.51	0.4422	0.90	0.92	0.92
0.02	0.05	0.71	0.6092	0.81	0.79	0.79

0.30	0.20	0.50	0.4657	0.95	0.94	0.96
0.30	0.20	0.70	0.6345	0.87	0.83	0.86
0.30	0.20	0.90	0.8029	0.73	0.48	0.53

Open in a new tab

The results shown in Table 1 therefore indicate that incorrect application of the exchangeable structure (which is a popular structure in analysis of clustered data) for all families can result in a substantial loss in efficiency in estimation of β. The results in Table 1 are important because it is sometimes claimed that careful modeling of the correlation structure is not crucial, because even if the structure is misspecified, GEE (and QLS) will yield a consistent estimate of the regression parameter. However, our ARE calculations demonstrate that if the structure is misspecified, even though β̂ is consistent, we can suffer a substantial loss in efficiency in estimation of β.

5. ANALYSIS OF THE MOTIVATIONAL STUDY

Here we present our results of the OOA analysis, to demonstrate implementation of the familial structures considered in this manuscript. The OOA population is ideal for studying familial association because the OOA live within a structured and uniform society where most individuals share a common lifestyle. The data considered here represent information on 296 individuals organized from 60 families, of which 33 had both parents and some siblings; 1 had only a father and siblings; 4 had a mother and siblings; and 22 had only siblings. The mean number of siblings in a family was 3.8 (range = 1–11). The mean age was 37.6 (range = 18–85). Recruitment and data collection of the parent study which provided the data for our secondary analysis has been described elsewhere (Wojciechowski et al., 2009).

The main outcome measure used in this analysis was the spherical component of each subject’s refractive error. Briefly, refractive error relates to an individual’s spectacle prescription. Refractive error is a spherical correction which denotes the power of a spherical lens (a lens whose properties do not change based on orientation) placed in front of a subject’s eye to optimize their vision. For some subjects spherical correction alone is sufficient to correct their vision. Lens values for the spherical component of a subject’s refraction can have either a positive or negative value and are expressed in units of optical power called diopters. The outcome for our analysis was the spherical (correction) value, which measures the power of a lens placed in front of the eye that does not depend on orientation. We considered the spherical values of the left eye, right eye, and the average spherical value of both eyes. The covariates we considered included gender (gender = 1 for males and gender = 0 for females) and the age in years at which the eye exam was conducted.

Our primary objective was to relate the expected spherical values with gender and age. We assumed that the families with both parents and siblings had an extended familial correlation structure (4) with zero correlation between parents, so that γ = 0 in (4). Families with only a father and siblings, only a mother and siblings, or only siblings, were assumed to have correlation structures (5), (6), and (7), respectively.

Table 2 displays the estimates of the regression parameter estimators. (QLS and GEE share the same asymptotic distribution for β̂; The results shown here are based on application of a “sandwich based” estimate of the covariance matrix of β̂ for calculation of standard errors (Chaganty and Shults, 1999), and p-values for the tests that β_j = 0.) As shown in Table 2, the estimated constant was negative, while the regression coefficients for (male) gender and for age were positive. Although the regression coefficients for age and gender did not differ significantly from zero at a 0.05 level (perhaps as a result of limited power due to the modest number of OOA families studied), the coefficients did differ significantly from zero at a 0.10 level. These results suggest that male gender and higher age are associated with less myopia, where myopia is indicated by negative spherical values.

Table 2.

The regression parameter estimators for the OOA ophthalmology study. Gender = 1 for male and 0 for female. Age is in years

Outcome	intercept	gender	age
Outcome	est.(p-value)	est.(p-value)	est.(p value)
R. Sph.	−2.67(< .0001)	0.75(0.067)	0.016(0.102)
L. Sph.	−2.80(< .0001)	0.77(0.051)	0.015(0.098)
Av. Sph.	−2.77(< .0001)	0.76(0.055)	0.016(0.074)

Open in a new tab

Next, Table 3 displays the QLS estimates of the correlation parameters. Notice that the estimated correlations were similar for the right sphere, left sphere, and average sphere. The estimated correlations were greatest between father and siblings, and smallest between siblings. These findings are consistent with the method of family ascertainment.

Table 3.

The correlation parameter estimators for the OOA ophthalmology study. ρ̂₁ is the estimated correlation between father and siblings, ρ̂₂ is the estimated correlation between mother and siblings and α̂ is the estimated correlation within siblings

Outcome	ρ̂₁	ρ̂₂	α̂
Right Sphere	0.2932	0.2241	0.0234
Left Sphere	0.2740	0.1420	0.0130
Average Sphere	0.2880	0.1996	0.0177

Open in a new tab

6. DISCUSSION

In this paper, we considered QLS, a two-stage approach based on GEE that uses the same estimating equation for estimation of β, but that differs from GEE with respect to estimation of α. We proved that the stage one QLS estimates exist and are feasible, while the stage two QLS estimates will be consistent with probability one, for the class of mixed linear correlation structures.

Our results regarding the stage one QLS estimators did not require correct specification of the correlation structure, so that the working correlation structure need not equal the true structure in order for a feasible stage one QLS estimate to exist that minimizes the generalized error sum of squares evaluated at that working structure (Theorem 3.1). Furthermore, the stage one estimate can be obtained as the unique solution to the QLS stage one estimating equation as long as the working structure (not necessarily the true structure) has a linear correlation structure (Theorem 3.2 and Corollary 3.3). In other words, there exists a stage one QLS estimate for any working structure and this estimate will be straightforward to obtain if the working structure is linear.

However, unlike the first stage, stage two of QLS does require correct specification of the working structure because this second stage was developed to overcome a major flaw with stage one of the procedure, namely that it does not yield a consistent estimate of α, even when the working structure is correctly specified. Stage two of QLS provides a correction to the stage one estimate; if the working structure does not equal the true structure, the wrong correction will be applied and consistency in general will not be achieved. Our results regarding the stage two QLS estimate of α, including Theorem 3.4 and the results in Section 3.2.2, therefore require correct specification of the working correlation structure. Just as does GEE, QLS requires correct specification of the working correlation structure as a prerequisite for consistent estimation of α. However, just like the GEE estimate, the QLS estimate of β will be consistent even if the working structure does not equal the true structure.

We considered familial correlation structures that are members of the class of mixed linear correlation structures. Our general results justified implementation of QLS for the familial structures, in addition to different members of the class of mixed linear structures, e.g. the banded Toeplitz structure that was considered by Shults et al. (2006).

Our work was motivated by a study of spherical optical values in the Old Order Amish (OOA). For this analysis, we implemented QLS for mixed familial correlation structures, which allowed the father-sibling, mother-sibling and sibling-sibling correlations to vary in value. An important feature of the OOA study was that the families varied in size; Our implementation of QLS therefore relaxed the assumption of constant family size and composition that is sometimes made in analysis of familial data.

We also conducted efficiency calculations based on the covariate design of the OOA study, to demonstrate that if the mixed familial structures were the true structures, but were misspecified as exchangeable structures, then we could suffer a serious loss in efficiency in estimation of the regression parameter. Our analysis and efficiency calculations demonstrated that it can be important to carefully model the correlation structure of the data, in order to maximize the information from the data and improve efficiency in estimation of the regression parameter. To encourage the use of the mixed familial correlation structures in practice, we also provide R functions that extend our previous software for application of QLS in R (Xie and Shults, 2009) for implementation of these structures. The R functions, and an R script file that demonstrates their use, is available on request from the first and second authors.

Future research that builds on our methods would be helpful. In extending this exploratory analysis to a larger sample ascertained without regard to myopia status, it will be useful to develop a test that incorporates the gender of each type of family member and to test whether like gender relationships differ from mixed gender relationships within and between families. For example, are the father-son and father-daughter correlations equal in value and are they significantly different from the mother-son and mother-daughter correlations? In addition, in this manuscript we considered spherical values that were measured on the left eye and right eye of each subject, and that were computed as the average of measurements on both eyes. Future work might extend our approach to allow for simultaneous analysis of both eyes. For example, the approach of Shults and Morrow (2002) and Shults, Whitt, and Kumanyika (2004) might be applied to adjust for two sources of correlation: due to the potential similarity of spherical values that are measured on the same subject, or between two members of the same family.

APPENDIX A. PROOFS OF MAIN RESULTS

Proof of Theorem 3.1

To prove this theorem, we need the following lemma:

Lemma A.1

R(ρ) is a differentiable n × n correlation matrix. Ω₀ is the margin of the feasible region for R(ρ). Then we have

Prob (lim_{ρ \to Ω_{0}} z^{T} R^{- 1} (ρ) z = \infty ∣ z \in R^{n}) = 1.

(18)

We prove Lemma A.1 in Appendix B. Here, we directly use this lemma to prove Theorem 3.1. Suppose the feasible region for R_i(ρ) is Ω_i, and the margin of Ω_i is Ω_i₀. Then the overall feasible region is Ω = ∩Ω_i, and the margin is Ω₀ Inline graphic ∪Ω_i₀. Therefore,

Prob (lim_{ρ \to Ω_{0}} \sum_{i = 1}^{m} z_{i}^{T} R_{i}^{- 1} (ρ) z_{i} = \infty ∣ z_{i} \in R^{n}, \forall i) \geq Prob (lim_{ρ \to \cup Ω_{i 0}} \sum_{i = 1}^{m} z_{i}^{T} R_{i}^{- 1} (ρ) z_{i} = \infty ∣ z_{i} \in R^{n}, \forall i) \geq Prob (lim_{ρ \to Ω_{i^{'} 0}} z_{i^{'}}^{T} R_{i^{'}}^{- 1} (ρ) z_{i^{'}} = \infty ∣ z_{i^{'}} \in R^{n}) = 1.

(19)

Ω = ∪Ω_i is an open set. Because of (19), we know the minimized point of (9) is taken within Ω. And thus the stage one estimators exist and are feasible almost surely.

Proof of Theorem 3.2

We only need to show that (9) is convex when α ∈ Ω, and it is equivalent to show that

H = \frac{\partial^{2} Q (β, R (α))}{\partial α^{2}}

(20)

is positive definite for all α ∈ Ω.

Using the fact that

\frac{\partial R_{i}^{- 1} (α)}{\partial α} = - R_{i}^{- 1} (α) \frac{\partial R_{i} (α)}{\partial α} R_{i}^{- 1} (α),

(21)

we get

\begin{array}{l} H_{j k} = \frac{\partial^{2} Q (β, R (α))}{\partial α_{j} \partial α_{k}} \\ = \sum_{i = 1}^{m} z_{i}^{T} R_{i}^{- 1} (α) \frac{\partial R_{i} (α)}{\partial α_{j}} R_{i}^{- 1} (α) \frac{\partial R_{i} (α)}{\partial α_{k}} R_{i}^{- 1} (α) z_{i} \end{array}

(22)

Therefore, ∀ nonzero x = (x₁, …, x_s) ∈ ℝ^s,

x^{T} H x = \sum_{j, k} x_{j} H_{j k} x_{k}

(23)

= \sum_{i = 1}^{m} \sum_{j, k} x_{j} z_{i}^{T} R_{i}^{- 1} (α) \frac{\partial R_{i} (α)}{\partial α_{j}} \cdot R_{i}^{- 1} (α) \frac{\partial R_{i} (α)}{\partial α_{k}} R_{i}^{- 1} (α) z_{i} x_{k}

(24)

Define $γ_{j}^{(i)} = \frac{\partial R_{i} (α)}{\partial α_{j}} R_{i}^{- 1} (α) z_{i} x_{j}$ and $G^{(i)} = (β_{1}^{(i)}, \dots, β_{s}^{(i)})$ . Then,

x^{T} H x = \sum_{i = 1}^{m} 1^{T} G^{(i) T} R_{i}^{- 1} (α) G^{(i)} 1

(25)

For all α ∈ Ω, since R⁻¹ (α) is positive definite, to show x^THx > 0, we only need to show G⁽ⁱ⁾1 ≠ 0 when x ≠ 0.

\begin{array}{l} G^{(i)} 1 = \sum_{j = 1}^{s} γ_{j}^{(i)} \\ = [\sum_{j = 1}^{s} \frac{\partial R_{i} (α)}{\partial α_{j}} x_{j}] R_{i}^{- 1} (α) z_{i} \end{array}

(26)

By assumption, for all x ≠ 0,

[\sum_{j = 1}^{s} \frac{\partial R_{i} (α)}{\partial α_{j}} x_{j}] \neq 0.

(27)

Since z_i ∈ ℝ_{n_i}, $R_{i}^{- 1} (α) z_{i}$ does not lie in the solution space for (27) almost surely. And therefore, (26) does not equal to 0 almost surely.

Proof of Corollary 3.3

It is easy to show that for linear correlation structure, α is identifiable if and only if (12) is satisfied.

Proof of Theorem 3.4

If R_i(α) has the form as (2), then

{\frac{d R_{i}^{- 1} (δ)}{d δ_{j}} |}_{δ = \hat{δ}} = - R_{i}^{- 1} (\hat{δ}) (R_{i} (e_{j}) - R_{i} (0)) R_{i}^{- 1} (\hat{δ}) .

(28)

Plug (2) and (28) into (11) and define $A_{i j} = R_{i}^{- 1} (\hat{δ}) (R_{i} (e_{j}) - R_{i} (0))$ , we can rewrite the stage two estimating equation as

\sum_{i = 1}^{m} \sum_{k = 1}^{s} tr (A_{i j} A_{i k}) α_{k} = - tr (A_{i j} R_{i}^{- 1} (\hat{δ}) R_{i} (0)), \forall j = 1, \dots, s .

(29)

Let $M_{j k} = \sum_{i = 1}^{m} tr (A_{i j} A_{i k}), w_{j} = - \sum_{i = 1}^{m} tr (A_{i j} R_{i}^{- 1} (\hat{δ}) R_{i} (0))$ . Suppose M = (M_jk)_s_×_s and w = (w₁, …, w_s)^T, then (29) can be written as a linear form

M α = w .

(30)

We have the following lemma, which will be proved in Appendix B.

Lemma A.2

Let $A_{i j} = R_{i}^{- 1} (\hat{δ}) (R_{i} (e_{j}) - R_{i} (0)), M_{j k} = \sum_{i = 1}^{m} tr (A_{i j} A_{i k})$ . If for each cluster i ∈ {1, …, m}, R_i has the linear correlation structure form (2), then M = (M_jk)_s×s is positive definite.

Therefore, the stage two estimator α̂ = M⁻¹w always exists and is unique.

APPENDIX B. PROOFS OF OTHER RESULTS

Proof of Lemma A.1

Suppose eigenvalue, eigenvector pair of R(ρ) is

{(λ_{1} (ρ), v_{1} (ρ)), \dots, (λ_{n} (ρ), v_{n} (ρ))} .

Note that the corresponding eigenvalue and eigenvector pairs of R⁻¹(ρ) is

{(1 / λ_{1} (ρ), v_{1} (ρ)), \dots, (1 / λ_{n} (ρ), v_{n} (ρ))} .

Let Inline graphic (ρ) = span{v_i(ρ) : λ_i(ρ) = 0}. = ℝⁿ\.

Note that the feasible region Ω, which requires all the eigenvalues of R is positive definite, is an open region. It is obvious that on Ω, R⁻¹ is continuous and differentiable too, since R⁻¹ = det(R)R^*, where R^* is the companion matrix of R.

Forall z and M₁, let’s fix them temporarily. We take a point ρ₀ in the feasible region. ∀ρ₁ ∈ Ω₀, if R(ρ₁) = 0 (I will prove the other situation later), we choose 0 < ε < ||z||²M₁, ∃δ > 0, such that if ||ρ − ρ₁|| < δ, ||R(ρ) − R(ρ₁)||_F < ε. According to Hoffman-Wielandt Theorem, there exists a permutation π(1), π(2), …, π(n) of 1, 2, …, n, such that ∀ρ ∈ Θ₁,

{(\sum_{i = 1}^{n} {∣ λ {(ρ)}_{π (i)} - λ {(ρ_{1})}_{i} ∣}^{2})}^{\frac{1}{2}} < {| | R (ρ) - R (ρ_{1}) | |}_{F} < ε .

(31)

From (31), we know that ∀ρ ∈ Θ₁, λ(ρ) < ε, and therefore $\frac{1}{λ (ρ)} > \frac{1}{ε}$ . Thus, we have

z^{'} R^{- 1} (ρ) z > {| | z | |}^{2} \min (1 / λ ρ) > {| | z | |}^{2} / ε > M_{1} .

(32)

If R(ρ₁) ≠ 0, let’s suppose λ₁(ρ₁) = ···= λ_k(ρ₁) = 0, and 0 < λ_k₊₁(ρ₁) ≤ ··· λ_n(ρ₁). Since ρ₁ ∈ Ω₀ and R(ρ₁) ≠ 0, 1 ≤ k ≤ n − 1. Then Inline graphic ₁(ρ₁) = span{v₁(ρ), …, v_k(ρ)}. Obviously, ⊥ . Since (ρ₁) ≠ φ,

Prob {Proj (z ∣ X_{1}) = 0} = 1.

(33)

Therefore, with probability 1, M₂ = Proj(z | Inline graphic ) > 0.

Suppose M₃ = λ_k₊₁(ρ₁). ∀ 0 < ε < min{M₂M₃/4, M₂/(2M₁)}, ∃δ > 0, when ||ρ − ρ₁|| < δ, ||R(ρ) − R(ρ₁)||₂ < ε, and ||R(ρ) − R(ρ₁)||_F < ε. From Hoffman-Weilandt Inequality, we know that λ_i(ρ) < ε, ∀i = 1, …, k. (The induction is the same as (31)). According to Stewart Inequality, since ||R(ρ) − R(ρ₁)||₂ < ε, dist( Inline graphic (ρ), (ρ₁)) ≤ 2ε/M₃ = M₂/2. If Proj(z | (ρ₁)) = M₂ ≠ 0, then Proj(z | (X)₁(ρ)) > M₂/2 > 0. Thus,

z' R^{- 1} (ρ) z \geq {| | Proj (z ∣ X_{1} (ρ)) | |}^{2} / ε > M_{1} .

(34)

Therefore, we have

Prob {z^{'} R^{- 1} (ρ) z > M_{1}} = Prob {Proj (z ∣ X_{1} (ρ_{1}))} = 1, \forall | | ρ - ρ_{1} | | < δ .

(35)

Since Ω₀ is a close region, there exists finite round discs which can cover Ω₀. Within every disc, (35) stands. Therefore, within all the finite round discs, (35) stands. Thus, we proved Lemma A.1, and therefore demonstrated that the stage one estimates will have feasible solution with probability 1 for any correlation structure.

Proof of Lemma A.2

∀x ∈ ℝ^s, we will show x^TM x > 0. Suppose x = (x₁, …, x_s).

\begin{array}{l} x^{T} M x = \sum_{i = 1}^{m} \sum_{j = 1}^{s} \sum_{k = 1}^{s} x_{j} tr (A_{i j} A_{i k}) x_{k} \\ = \sum_{i = 1}^{m} \sum_{j = 1}^{s} \sum_{k = 1}^{s} tr (B_{i j} B_{i k}) \\ = \sum_{i = 1}^{m} tr (B_{i}^{2}), \end{array}

(36)

where

B_{i j} = x_{j} A_{i j} = R_{i}^{- 1} (\hat{δ}) (R_{i} (x_{j} e_{j}) - R_{i} (0)),

(37)

\begin{array}{l} B_{i} = \sum_{j = 1}^{k} B_{i j} = R_{i}^{- 1} (\hat{δ}) (\sum_{j = 1}^{s} (R_{i} (x_{j} e_{j}) - R_{i} (0))) \\ = R_{i}^{- 1} (\hat{δ}) (R_{i} (x) - R_{i} (0)) . \end{array}

(38)

Thus,

B_{i}^{2} = G_{i} H_{i},

(39)

where

G_{i} = R_{i}^{- 1} (\hat{δ})

(40)

H_{i} = (R_{i} (x) - R_{i} (0)) R_{i}^{- 1} (\hat{δ}) (R_{i} (x) - R_{i} (0)) .

(41)

Since δ̂ is the final stage one estimates for the correlation parameters, by Theorem 3.1 G_i is positive definite. ∀y ∈ ℝ^s, y^TH_iy = [(R_i(x) − R_i(0))y]^TG_i[(R_i(x) − R_i(0))y] ≥ 0, and thus H_i is semi-positive definite.

Kleinman and Athans (1968), in the context of design of suboptimal control systems, obtained that, for any two semi-positive definite matrix A and B,

λ_{n} (A) tr (B) \leq tr (A B) \leq λ_{1} (A) tr (B),

(42)

where λ_i(A) is the ith largest eigenvalue of A.

Because G_i is positive definite, λ_n(G_i) > 0; and because H_i is semi-positive definite and H_i ≠ 0, tr(H_i) > 0. Therefore,

tr (B_{i}^{2}) = tr (G_{i} H_{i}) \geq λ_{n} (G_{i}) tr (H_{i}) > 0.

(43)

As a result,

x^{T} M x = \sum_{i = 1}^{m} tr (B_{i}^{2}) > 0,

(44)

and thus M is positive definite.

APPENDIX C. THE LIMITING VALUE OF THE QLS ESTIMATE OF WHEN THE TRUE MIXED CORRELATION STRUCTURE IS MISSPECIFIED AS EXCHANGEABLE

Assume the true mixed correlation structures R_i(α) have been misspecified as exchangeable W_i(γ). Next, using arguments similar to those given in Theorem 3.2 of Chaganty and Shults (1999), we note that $E (Z_{i} (β) Z_{i}^{'} (β)) = φ R_{i} (α)$ . It is then easy to show that the solution to the stage one estimating equation (10) converges in probability to the solution (for γ) to the following estimating equation:

trace (\sum_{i = 1}^{m} \frac{\partial}{\partial α} W_{i}^{- 1} (γ) R_{i} (α)) = 0.

(45)

The inverse of an exchangeable structure W_i(γ) can be expressed as $W_{i}^{- 1} (γ) = \frac{1}{(1 - γ)} I_{n_{i}} - \frac{γ}{(1 - γ) (1 + (n_{i} - 1) γ)} e_{j} e_{j}^{'}$ , where I_{n_i} is the identity matrix and e_j is a n_i × 1 column vector of ones. Next, if we note that $trace (e_{j} e_{j}^{'} R_{i} (α)) = trace (e_{j}^{'} R_{i} (α) e_{j}) = e_{j}^{'} R_{i} (α) e$ , equation (45) can easily be simplified as follows:

\sum_{i = 1}^{m} n_{i} - \sum_{i = 1}^{m} \frac{1 + γ^{2} (n_{i} - 1)}{{(1 + γ (n_{i} - 1))}^{2}} e_{j}^{'} R_{i} (α) e_{j} = 0 .

(46)

In general, a solution g(α) (for γ) to (46) can be obtained using the bisection method. We next note that under an assumption of an exchangeable structure, the stage two estimate is obtained as the solution f (γ) to the stage two estimating equation (11) that is evaluated at γ̂ for exchangeable structures R_i(γ). Since $\hat{γ} \overset{p}{\to} g (α)$ , it then follows that the limiting value of the stage two estimate for γ converges in probability to f (g(α)), so that the limiting value of γ̂ can be obtained by solving (11) at δ̂ = g(α). The stage two estimating equation has a closed form solution for the exchangeable structure that is provided in (C.3) of Shults and Morrow (2002), for s_i = n_i and when (C.3) is calculated over all i, i.e. when g_i = 1 for all i, so that we only have one group of subjects.

An algorithm to obtain the limiting value can then be expressed as follows:

For assumed true values of α, use the bisection method to obtain a solution g(α) to (46).
Next, obtain the limiting value of γ̂ by evaluating (C.3) of Shults and Morrow (2002) at τ̂₁ = g(α), where s_i =n_i and g_i = 1 for all i.

Contributor Information

Jichun Xie, Email: jichun@mail.med.upenn.edu, Department of Biostatistics and Epidemiology, University of Pennsylvania School of Medicine, Tel: (215) 573-8950.

Justine Shults, Email: shults@mail.med.upenn.edu, Department of Biostatistics and Epidemiology, University of Pennsylvania School of Medicine, Tel: (215) 573-6526.

Jon Peet, Email: jonpeet@comcast.net, Department of Ophthalmology, University of Pennsylvania School of Medicine, Tel: (215) 662-8100.

Dwight Stambolian, Email: stamboli@mail.med.upenn.edu, Department of Ophthalmology, University of Pennsylvania School of Medicine, Tel: (215) 898-0305.

Mary Frances Cotch, Email: mfc@nei.nih.gov, National Eye Institute, Division of Epidemiology and Clinical Applications, National Institutes of Health, Tel: (301) 496-6583.

References

Carey VC, Zeger SL, Diggle PJ. Modelling multivariate binary data with alternating logistic regressions. Biometrika. 1993;80 (3):517–526. [Google Scholar]
Chaganty NR. An alternative approach to the analysis of longitudinal data via generalized estimating equations. Journal of Statistical Planning and Inference. 1997;63:39–54. MR1474184. [Google Scholar]
Chaganty NR, Deng Y. Ranges of measures of association for familial binary variables. Communications in Statistics –Theory and Methods. 2007;36(3):587–598. MR2391887. [Google Scholar]
Chaganty NR, Shults J. On eliminating the asymptotic bias in the quasi-least squares estimate of the correlation parameter. Journal of Statistical Planning and Inference. 1999;76:127–144. MR1673345. [Google Scholar]
Crowder M. On the use of a working correlation matrix in using generalised linear models for repeated measures. Biometrika. 1995;82 (2):407–410. [Google Scholar]
Gleseer LJ. A note on the analysis of familial data. Biometrika. 1992;79(2):412–415. MR1185143. [Google Scholar]
Karlin S, Cameron EC, Williams PT. Sibling and parent-offspring correlation estimation with variable family size. Proceedings of the National Academy of Sciences. 1981;78(5):2664–2668. doi: 10.1073/pnas.78.5.2664. MR0615035. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kim H, Shults J. QLS SAS macro: A SAS macro for analysis of longitudinal data using quasi-least squares. UPenn Biostatistics Working Papers, Working Paper. 2008:27. http://biostats.bepress.com/upennbiostat/papers/art27. This paper is also in press at the Journal of Statistical Software.
Liang KY, Zeger SL. Longitudinal data analysis using generalized linear models. Biometrika. 1986;73(1):13–22. MR0836430. [Google Scholar]
Prentice RL. Correlated binary regression with covariates specific to each binary observation. Biometrics. 1988;44(4):1033–1048. MR0980998. [PubMed] [Google Scholar]
Qaquish BF, Liang KY. Marginal models for correlated binary responses with multiple classes and multiple levels of nesting. Biometrics. 1992;48(3):939–950. [PubMed] [Google Scholar]
Qu Y, Williams GW, Beck GJ, Medendorp SV. Latent variable models for clustered dichotomous data with multiple subclusters. Biometrics. 1992;48(4):1095–1102. [Google Scholar]
Ratcliffe S, Shults J. GEEQBOX: A MATLAB tool-box for implementation of quasi-least squares and generalized estimating equations. Journal of Statistical Software. 2008;25(14):1–13. [Google Scholar]
Sabo RT, Chaganty NR. Adaptation of quasi-least squares to estimate correlations within a nuclear family. Communications in Statistics – Theory and Methods. 2009;38(16):3059–3076. MR2568204. [Google Scholar]
Shults J. PhD Thesis. Department of Mathematics and Statistics, Old Dominion University; Norfolk, Virginia: 1996. The analysis of unbalanced and unequally spaced longitudinal data using quasi-least squares. [Google Scholar]
Shults J, Chaganty NR. Analysis of serially correlated data using quasi-least squares. Biometrics. 1998;54(4):1622–1630. [Google Scholar]
Shults J, Mazurick C, Landis JR. Analysis of repeated bouts of measurements in the framework of generalized estimating equations. Statistics in Medicine. 2006;25:4114–4128. doi: 10.1002/sim.2515. MR2297655. [DOI] [PubMed] [Google Scholar]
Shults J, Morrow A. Use of quasi-least squares to adjust for two levels of correlation. Biometrics. 2002;58(3):521–530. doi: 10.1111/j.0006-341x.2002.00521.x. MR1925549. [DOI] [PubMed] [Google Scholar]
Shults J, Ratcliffe S, Leonard M. Improved generalized estimating equation analysis via xtqls for implementation of quasi-least squares in Stata. Stata Journal. 2007;7:147–166. [Google Scholar]
Shults J, Whitt CM, Kumanyika S. Analysis of data with multiple sources of correlation in the framework of generalized estimating equations. Statistics in Medicine. 2004;23(20):3209–3226. doi: 10.1002/sim.1887. [DOI] [PubMed] [Google Scholar]
Sutradhar BD, Das K. On the efficiency of regression estimators in generalised linear models for longitudinal data. Biometrika. 1999;86(2):459–465. MR1705378. [Google Scholar]
Wang YG, Carey VJ. Working correlation structure misspecification, estimation and covariate design: Implications for generalised estimating equations performance. Biometrika. 2003;90(1):29–41. MR1966548. [Google Scholar]
Wojciechowski R, Stambolian D, Ciner E, Ibay G, Holmes T, Bailey-Wilson J. Genomewide linkage scans for ocular refraction and meta-analysis of four populations in the Myopia Family Study. Investigative Ophthalmology and Visual Science. 2009;50:2024–2032. doi: 10.1167/iovs.08-2848. [DOI] [PMC free article] [PubMed] [Google Scholar]
Xie J, Shults S. Implementation of quasi-least squares with the R package qlspack. UPenn Biostatistics Working Papers, Working Paper. 2009:32. http://biostats.bepress.com/upennbiostat/papers/art32. This paper is under revision at the Journal of Statistical Software.
Zhao LP, Prentice RL. Correlated binary regression using a quadratic exponential model. Biometrika. 1990;77(3):642–648. MR1087856. [Google Scholar]

[R1] Carey VC, Zeger SL, Diggle PJ. Modelling multivariate binary data with alternating logistic regressions. Biometrika. 1993;80 (3):517–526. [Google Scholar]

[R2] Chaganty NR. An alternative approach to the analysis of longitudinal data via generalized estimating equations. Journal of Statistical Planning and Inference. 1997;63:39–54. MR1474184. [Google Scholar]

[R3] Chaganty NR, Deng Y. Ranges of measures of association for familial binary variables. Communications in Statistics –Theory and Methods. 2007;36(3):587–598. MR2391887. [Google Scholar]

[R4] Chaganty NR, Shults J. On eliminating the asymptotic bias in the quasi-least squares estimate of the correlation parameter. Journal of Statistical Planning and Inference. 1999;76:127–144. MR1673345. [Google Scholar]

[R5] Crowder M. On the use of a working correlation matrix in using generalised linear models for repeated measures. Biometrika. 1995;82 (2):407–410. [Google Scholar]

[R6] Gleseer LJ. A note on the analysis of familial data. Biometrika. 1992;79(2):412–415. MR1185143. [Google Scholar]

[R7] Karlin S, Cameron EC, Williams PT. Sibling and parent-offspring correlation estimation with variable family size. Proceedings of the National Academy of Sciences. 1981;78(5):2664–2668. doi: 10.1073/pnas.78.5.2664. MR0615035. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R8] Kim H, Shults J. QLS SAS macro: A SAS macro for analysis of longitudinal data using quasi-least squares. UPenn Biostatistics Working Papers, Working Paper. 2008:27. http://biostats.bepress.com/upennbiostat/papers/art27. This paper is also in press at the Journal of Statistical Software.

[R9] Liang KY, Zeger SL. Longitudinal data analysis using generalized linear models. Biometrika. 1986;73(1):13–22. MR0836430. [Google Scholar]

[R10] Prentice RL. Correlated binary regression with covariates specific to each binary observation. Biometrics. 1988;44(4):1033–1048. MR0980998. [PubMed] [Google Scholar]

[R11] Qaquish BF, Liang KY. Marginal models for correlated binary responses with multiple classes and multiple levels of nesting. Biometrics. 1992;48(3):939–950. [PubMed] [Google Scholar]

[R12] Qu Y, Williams GW, Beck GJ, Medendorp SV. Latent variable models for clustered dichotomous data with multiple subclusters. Biometrics. 1992;48(4):1095–1102. [Google Scholar]

[R13] Ratcliffe S, Shults J. GEEQBOX: A MATLAB tool-box for implementation of quasi-least squares and generalized estimating equations. Journal of Statistical Software. 2008;25(14):1–13. [Google Scholar]

[R14] Sabo RT, Chaganty NR. Adaptation of quasi-least squares to estimate correlations within a nuclear family. Communications in Statistics – Theory and Methods. 2009;38(16):3059–3076. MR2568204. [Google Scholar]

[R15] Shults J. PhD Thesis. Department of Mathematics and Statistics, Old Dominion University; Norfolk, Virginia: 1996. The analysis of unbalanced and unequally spaced longitudinal data using quasi-least squares. [Google Scholar]

[R16] Shults J, Chaganty NR. Analysis of serially correlated data using quasi-least squares. Biometrics. 1998;54(4):1622–1630. [Google Scholar]

[R17] Shults J, Mazurick C, Landis JR. Analysis of repeated bouts of measurements in the framework of generalized estimating equations. Statistics in Medicine. 2006;25:4114–4128. doi: 10.1002/sim.2515. MR2297655. [DOI] [PubMed] [Google Scholar]

[R18] Shults J, Morrow A. Use of quasi-least squares to adjust for two levels of correlation. Biometrics. 2002;58(3):521–530. doi: 10.1111/j.0006-341x.2002.00521.x. MR1925549. [DOI] [PubMed] [Google Scholar]

[R19] Shults J, Ratcliffe S, Leonard M. Improved generalized estimating equation analysis via xtqls for implementation of quasi-least squares in Stata. Stata Journal. 2007;7:147–166. [Google Scholar]

[R20] Shults J, Whitt CM, Kumanyika S. Analysis of data with multiple sources of correlation in the framework of generalized estimating equations. Statistics in Medicine. 2004;23(20):3209–3226. doi: 10.1002/sim.1887. [DOI] [PubMed] [Google Scholar]

[R21] Sutradhar BD, Das K. On the efficiency of regression estimators in generalised linear models for longitudinal data. Biometrika. 1999;86(2):459–465. MR1705378. [Google Scholar]

[R22] Wang YG, Carey VJ. Working correlation structure misspecification, estimation and covariate design: Implications for generalised estimating equations performance. Biometrika. 2003;90(1):29–41. MR1966548. [Google Scholar]

[R23] Wojciechowski R, Stambolian D, Ciner E, Ibay G, Holmes T, Bailey-Wilson J. Genomewide linkage scans for ocular refraction and meta-analysis of four populations in the Myopia Family Study. Investigative Ophthalmology and Visual Science. 2009;50:2024–2032. doi: 10.1167/iovs.08-2848. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R24] Xie J, Shults S. Implementation of quasi-least squares with the R package qlspack. UPenn Biostatistics Working Papers, Working Paper. 2009:32. http://biostats.bepress.com/upennbiostat/papers/art32. This paper is under revision at the Journal of Statistical Software.

[R25] Zhao LP, Prentice RL. Correlated binary regression using a quadratic exponential model. Biometrika. 1990;77(3):642–648. MR1087856. [Google Scholar]

PERMALINK

Quasi-least squares with mixed linear correlation structures

Jichun Xie

Justine Shults

Jon Peet

Dwight Stambolian

Mary Frances Cotch

Abstract

1. INTRODUCTION

2. BACKGROUND

2.1 Notation

2.2 Familial structures in the class of linear and mixed correlation structures

3. EXTENSION OF QUASI-LEAST SQUARES FOR MIXED LINEAR CORRELATION STRUCTURES

3.1 Quasi-least squares

3.2 Results that justify application of quasi-least squares for mixed linear correlation structures

3.2.1 General proof of feasibility for stage one QLS estimates

Theorem 3.1

Theorem 3.2

Corollary 3.3

3.2.2 Consistency of the stage two QLS estimates for linear correlation structures

Theorem 3.4

4. ASYMPTOTIC RELATIVE EFFICIENCY CALCULATIONS

Table 1.

5. ANALYSIS OF THE MOTIVATIONAL STUDY

Table 2.

Table 3.

6. DISCUSSION

APPENDIX A. PROOFS OF MAIN RESULTS

Proof of Theorem 3.1

Lemma A.1

Proof of Theorem 3.2

Proof of Corollary 3.3

Proof of Theorem 3.4

Lemma A.2

APPENDIX B. PROOFS OF OTHER RESULTS

Proof of Lemma A.1

Proof of Lemma A.2

APPENDIX C. THE LIMITING VALUE OF THE QLS ESTIMATE OF WHEN THE TRUE MIXED CORRELATION STRUCTURE IS MISSPECIFIED AS EXCHANGEABLE

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases