Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2012 Apr 17.
Published in final edited form as: Stat Interface. 2010;3(2):223–234. doi: 10.4310/sii.2010.v3.n2.a9

Quasi-least squares with mixed linear correlation structures

Jichun Xie 1, Justine Shults 2,, Jon Peet 3, Dwight Stambolian 4, Mary Frances Cotch 5
PMCID: PMC3328409  NIHMSID: NIHMS239795  PMID: 22518205

Abstract

Quasi-least squares (QLS) is a two-stage computational approach for estimation of the correlation parameters in the framework of generalized estimating equations. We prove two general results for the class of mixed linear correlation structures: namely, that the stage one QLS estimate of the correlation parameter always exists and is feasible (yields a positive definite estimated correlation matrix) for any correlation structure, while the stage two estimator exists and is unique (and therefore consistent) with probability one, for the class of mixed linear correlation structures. Our general results justify the implementation of QLS for particular members of the class of mixed linear correlation structures that are appropriate for analysis of data from families that may vary in size and composition. We describe the familial structures and implement them in an analysis of optical spherical values in the Old Order Amish (OOA). For the OOA analysis, we show that we would suffer a substantial loss in efficiency, if the familial structures were the true structures, but were misspecified as simpler approximate structures. To help bridge the interface between Statistics and Medicine, we also provide R software so that medical researchers can implement the familial structures in a QLS analysis of their own data.

Keywords and phrases: Quasi-least squares, Linear correlation structure, Mixed correlation structure, Familial data

1. INTRODUCTION

We consider a secondary analysis of optical spherical values in a study in the Old Order Amish (OOA) (Wojciechowski et al., 2009). The families in the OOA study varied in both size and composition, because some nuclear families contained only siblings, while other families included siblings and one or both parents. The goal of the OOA analysis was to relate the expected spherical values, measurements that reflect quality of vision, with gender and age, while also adjusting for the correlation among measurements within each family. Because the correlations in the OOA study were thought to vary according to familial relationship, it was important to allow the sibling-sibling, sibling-father, and sibling-mother correlations to vary in value.

To model the pattern of association amongst measurements in families in the OOA study, we implemented slight generalizations of familial correlation structures considered by Karlin, Cameron, and Williams (1981) and Gleseer (1992). Gleseer (1992) noted that it is computationally difficult to obtain maximum likelihood (ML) estimates of the correlation parameters for normal data, when family sizes are not constant. Gleseer (1992) therefore obtained ML estimates that were weighted averages of estimates obtained for sub-groups with families of equal size. One limitation of the approaches of both Karlin et al. (1981) and of Gleseer (1992) was that they assumed that the expected value of the outcome variable was constant between the siblings. However, it is important to note that Karlin et al. (1981) and Gleseer (1992) allowed the variance of the outcome variable to vary between parents and siblings, while we assume a constant standard deviation of spherical values for all subjects.

We implement the familial correlation structures for analysis of the OOA study with quasi-least squares (QLS). QLS is an approach based on GEE that estimates the correlation parameters in two stages. In the following summary, estimates of the correlation parameters are defined to be feasible if they yield positive definite correlation matrices. Chaganty (1997) considered balanced data and established feasibility of the stage one estimates for the first order autoregressive AR(1), exchangeable, and tri-diagonal structures. Shults (1996) and Shults and Chaganty (1998) proved feasibility for the afore-mentioned structures, in addition to the Markov structure, for unbalanced data. However, although the stage one estimates exist and are feasible, they are not consistent. Chaganty and Shults (1999) therefore introduced a second stage of QLS and established consistency of the stage two estimates for the AR(1), Markov, and tri-diagonal correlation structures. The second stage of QLS updates the stage one estimate of α by obtaining a solution to an estimating equation (stage two estimating equation for α) with an estimating function that only depends on α and the stage one estimate of α. Theorem (3.2) of Chaganty and Shults (1999) establishes that if there exists a unique solution to the stage two estimating equation for α that is a continuous and one to one function of the stage one estimate, then that solution will be consistent for α. Software for implementation of QLS is available in SAS (Kim and Shults, 2008), Stata (Shults, Ratcliffe, and Leonard, 2007), MATLAB (Ratcliffe and Shults, 2008), and R (Xie and Shults, 2009).

We implement the familial structures using QLS instead of other extensions of GEE that involve more complicated correlation structures or weighting matrices than the original formulation of the GEE approach. For example, Prentice (1988) developed GEE1 for binary outcomes by constructing a second estimating equation (equation (14) on p. 1039) that involves specifying a working correlation structure for the sample correlations. Zhao and Prentice (1990) developed GEE2 for discrete and continuous outcomes by constructing a joint estimating equation for the regression and correlation parameters that involves a working covariance structure that depends on the third and fourth moments of the outcome variable. Carey, Zeger, and Diggle (1993) developed alternating logistic regression (ALR) for binary outcomes by setting up a GEE based on conditional means of the outcome variable (equation (7) on p. 521) that corresponds to the logistic regression of one response on another and that also involves specification of a weighting matrix. When each of these methods is applied, perhaps because of the difficulty in specifying an appropriate patterned structure, the typical approach is to simplify the correlation (weighting) structures by setting some or all of their off-diagonal elements equal to zero. For example, Carey et al. define their weighting matrix ((7), p. 521, 1993) as a diagonal matrix and say in their discussion that this results in a reasonably optimal weighting. When GEE1 and GEE2 are applied in analyses, off-diagonal elements of the working structures are often set equal to zero. Recent approaches based on GEE1 and GEE2, for example methods for hierarchical data by Qu, Williams, Beck, and Medendrop (1992) and Qaqish and Liang (1992), face the same challenge as GEE1 and GEE2 with regard to specification of an appropriate working structure. Overall, implementation of the structures we consider would be potentially much more challenging for these other recent extensions of GEE than it was for QLS.

In this manuscript, we prove two general results for QLS that can be used to justify implementation of QLS for the familial structures we consider. First, we prove that the QLS stage one estimate of α will exist and is feasible with probability one, for any correlation structure. Next, for the class of mixed linear correlation structures, we prove the existence and uniqueness of the QLS stage two estimates, both of which are required for consistency of α̂. A benefit of our results is that not only do they justify implementation of QLS for the familial structures we consider in this manuscript, but they can also be used to justify QLS for other structures. For example, Shults, Mazurick, and Landis (2006) implemented QLS for a banded Toeplitz (BT) correlation structure, but did not provide proofs regarding the existence and uniqueness of solutions of the QLS estimating equations for α for this structure. The BT structure is a member of the class of linear correlation structures, so that the results provided in this paper establish the consistency of the QLS estimators of α for this structure. In general, our results for stage one are applicable to any correlation structure, while our results for stage two are applicable to any mixed linear correlation structure.

As an outline for our paper, in Section 2, we give some notation; describe the familial structures we consider; and define mixed linear correlation structures. In Section 3 we then extend QLS for mixed linear correlation structures by proving several results for these structures. Next, we demonstrate the benefit of fitting mixed linear correlation structures: in Section 4, we conduct asymptotic relative efficiency (ARE) comparisons to show that the loss in efficiency in estimation of the regression parameter could be substantial in a QLS (or GEE) analysis of the OOA study, if the true mixed linear correlation structures were misspecified as a simpler, approximate structure. In Section 5 we then present our analysis of the OOA study that demonstrates application of the mixed correlation structure with QLS. The proofs of our theorems and lemmas are provided in the appendices.

2. BACKGROUND

2.1 Notation

We assume that outcomes Yi = (Yi1, …, Yini)T and associated covariates Xij = (Xij1, …, Xijp)T are collected on family i, for i = 1, …, m. The expected value and variance of measurement Yij can be expressed using a generalized linear model (GLM):

E(Yij)=g1(XijTβ)=μijandVar(Yij)=φh(μij) (1)

respectively, where g−1(·) is the link function; h(·) is the variance function; and φ is a known or unknown scale parameter. We assume that observations from different families are independent. However, measurements within families are correlated, with a pattern of association that can be described with correlation structures for each family i, Cor(Yi) = Ri(α), that depend on an s by 1 correlation parameter α. The covariance matrix of Yi is then given by Cov(Yi)=φAi1/2Ri(α)Ai1/2, where Ai = diag(h(μi1), …, h(μini)).

2.2 Familial structures in the class of linear and mixed correlation structures

Define ei as the unit vector with only the ith entry equal to 1. We refer to a correlation matrix as linear if

Ri(α)=j=1s(Ri(ej)Ri(0))αj+Ri(0), (2)

so that each element of the matrix can be expressed as a linear combination of α. In this case α is identifiable if and only if

j=1s(Ri(ej)Ri(0))cjαj=0ifandonlyifc=(c1,,cs)=0. (3)

Several linear correlation structures were considered for analysis of the OOA study, which included two-generation families that varied in both size and composition. We assumed that the father-mother, father-sibling, mother-sibling, sibling-sibling correlations were γ, ρ1, ρ2 and α, respectively. If family i included both parents and siblings, this resulted in an extended familial correlation structure Ri to describe the pattern of association among the ni measurements on family i:

Cor(Yi)=(1γρ1ρ1ρ1γ1ρ2ρ2ρ2ρ1ρ21ααρ1ρ2α1αρ1ρ2αα1)ni×ni. (4)

Sabo and Chaganty (2009) made an excellent comparison of (4) for several approaches, for continuous outcomes and for families of equal size and composition.

For a family with only a father and siblings, Ri would have a familial structure:

Cor(Yi)=(1ρ1ρ1ρ1ρ11ααρ1α1αρ1αα1)ni×ni. (5)

For a family with only a mother and siblings, Ri would still have a familial structure, but with ρ1 replaced by ρ2 in (5):

Cor(Yi)=(1ρ2ρ2ρ2ρ21ααρ2α1αρ2αα1)ni×ni. (6)

Finally, for families with only siblings, the correlation structure would be exchangeable:

Cor(Yi)=(1ααα1ααα1)ni×ni. (7)

In our analysis of the OOA data, different families were allowed to have different correlation structures. However, all the structures were mixed correlation structures (MCS), which we define as structures that may vary between families but share correlation parameters, so that the parameters for family i take value in {γ, ρ1, ρ2, α}. See Chaganty and Deng (2007) for a discussion of ranges of measures of association for binary outcomes with familial patterns of association.

3. EXTENSION OF QUASI-LEAST SQUARES FOR MIXED LINEAR CORRELATION STRUCTURES

3.1 Quasi-least squares

Here, we briefly describe the method of QLS. Stage one of QLS iterates between updating the regression parameter β via (i) solution of the GEE estimating equation for β (Liang and Zeger, 1986):

i=1mDiTAi1/2Ri1(α)Ai1/2(YiUi(β))=0, (8)

where Ui(β) = E(Yi) and Di=Uiβ; and (ii) updating the correlation parameter α by minimizing the generalized error sum of squares

Q(β,R(α))=i=1mziT(β)Ri1(α)zi(β) (9)

with respect α ∈ Ω ⊆ ℝs, where zi(β)=Ai1/2(YiUi)=(zi1,,zini) are known as the Pearson residuals. In addition, Ω is defined as the feasible region for the correlation structure (Ri(α))1,…,m, so that ∀α ∈ Ω and ∀i ∈ {1, …, m}, Ri(α) is positive definite. Stage one of QLS therefore involves solving the stage one estimating equation

DG=α{i=1mziT(β)Ri1(α)zi(β)}=0. (10)

In general, the solution of (10) is not necessarily the minimizer of (9). However, in Section 3.2 we prove that if the Ri(α) are linear for all i, the solution of (10) does indeed minimize the generalized error sum of squares (9); Furthermore, the solution will be unique and feasible almost surely.

The QLS stage one estimates β̂ for β and δ̂ for α are the solutions of (8) and the minimizer of (9), respectively. However, Chaganty (1997) proved that the stage one QLS estimate of α is not consistent. In order to correct the asymptotic bias for the QLS stage one estimates, after convergence in stage one, we next solve the stage two estimating equation that depends on the stage one estimates δ̂ (Chaganty and Shults, 1999), for α:

i=1mtr[Ri1(δ)δRi(α)]|δ=δ^=0. (11)

Theorem 3.2 of Chaganty and Shults (1999) established that if there is a unique root α̂ for equation (11) that is a one to one and continuous function of δ̂ and the structure is correctly specified, then α̂ is consistent. We refer to α̂ as the stage two estimator of α, based on which, we obtain the final estimator β̂ for β by again solving the GEE estimating equation (8) for β, evaluated at the stage two estimates for α.

3.2 Results that justify application of quasi-least squares for mixed linear correlation structures

In this section we first provide general proofs regarding the existence and feasibility of the stage one QLS estimates in Section 3.2.1. Next, in Section 3.2.2 we prove the consistency of the stage two QLS estimates for the mixed linear correlation structures. The proofs for all results are provided in the appendices.

3.2.1 General proof of feasibility for stage one QLS estimates

We first provide a theorem that establishes the feasibility of the global minimizer for (9).

Theorem 3.1

If for each subject i, Ri(α) is a differentiable ni × ni matrix, then the global minimizer for (9) in Ω is an inner point of Ω, where Ω is the feasible region of (Ri(α))1,…,m.

Although the stage one QLS estimator of α is not the final estimator, its existence and feasibility is very important because failure to yield feasible estimates in stage one of QLS could cause a breakdown in the first phase of the procedure. For example, Crowder (1995) described the potential for breakdown in iterative procedures such as GEE that can occur when the estimated correlation matrices are not positive definite. Theorem 3.1 ensures that this type of failure will not occur in stage one of QLS.

However, while Theorem 3.1 ensures the existence of solutions for the stage one QLS estimating equation (10), it does not guarantee that the root is unique. If (10) has multiple roots, it can be difficult to obtain all the roots. Furthermore, it might not be straightforward to find the global minimizer for (9), if the generalized error sum of squares has several local minimizers. However, for correlation structures that meet the condition in (12), this minimization problem is fairly straightforward, because under this fairly general condition, (9) is convex almost surely, so that there will exist a unique root for (10) almost surely.

Theorem 3.2

Suppose each cluster i ∈ {1, …, m} in the data under consideration has correlation structure Ri(α). If ∀ α ∈ Ω,

j=1sRi(α)αjcj=0ifandonlyifc=0, (12)

then (10) has a unique solution in the feasible region Ω almost surely.

Corollary 3.3

Suppose for each cluster i ∈ {1, …, m} of the longitudinal data, we have a linear correlation structure Ri(α) of the form (2). Then if α is identifiable, (10) has a unique solution in the feasible region Ω almost surely.

Theorem 3.2 provides the criterion (12) that will ensure that the stage one estimating equation (10) has a unique solution; This requirement is fairly general, and is to be satisfied by several common structures, including the exchangeable, tri-diagonal (Chaganty and Shults, 1999), BT (Shults et al., 2006), and also by the familial structures implemented in this manuscript.

3.2.2 Consistency of the stage two QLS estimates for linear correlation structures

Here we first prove that for linear correlation structures, the stage two estimator exists and is unique, with probability one.

Theorem 3.4

If for each cluster i ∈ {1, …, m}, the within subject correlation Ri(α) has a linear correlation structure of form (2), then the stage two estimating equation (11) has a unique solution with probability one.

In the proof of Theorem 3.4 in Appendix A, we provide an explicit solution for the stage two estimating solution for linear correlation structures. Suppose we obtain the stage one estimator δ̂. Define Aij=Ri1(δ^)(Ri(ej)Ri(0)),Mjk=i=1mtr(AijAik), and wj=i=1mtr(AijRi1(δ^)Ri(0)). Suppose M = (Mjk)s×s and w = (w1, …, ws)T. We can then express the stage two estimator in a very simple form:

α^=M1(δ^)w(δ^), (13)

which is very helpful with respect to computation, especially when the dimension of α is high. Chaganty and Shults (1999) proved that if a unique solution α̂ exists to the stage two estimating equation for α that is a continuous and one to one function of the stage one estimate of α, then α̂ will be consistent. Therefore, we have proven that under correct specification of the mixed linear correlation structure, there will exist a unique solution of the stage two estimating equation that will be consistent for α.

4. ASYMPTOTIC RELATIVE EFFICIENCY CALCULATIONS

Here we assess the loss in efficiency that results from incorrectly specifying the mixed correlation structure in the OOA analysis. We assume here that the true structure for cluster i is the mixed structure Ri(α) described in Section 5, for α = (ρ1, ρ2, γ, α), while the working structure is the exchangeable structure Wi(γ) = (1−γ)Ini×ni + γJni×ni, where Ini×ni is an ni by ni identity matrix and Jni×ni is an ni × ni matrix of ones. We consider the exchangeable working structure because this is a popular structure for analysis of clustered data. In addition, we note that the true mixed familial structures include exchangeable structures for OOA families that contain only siblings. Our misidentification scenario therefore represents the situation in which we have correctly assumed that the sibling-sibling correlations are equal, but have incorrectly assumed that the sibling-sibling, sibling-father, sibling-mother, and father-mother correlations are identical.

The efficiencies are calculated using the same approach that was implemented and described in Shults and Morrow (2002) and in Shults et al. (2006). To briefly summarize, we first note that Chaganty (1997) proved that m(β^β) is asymptotically normal with mean zero and covariance matrix

Vw=limmmφWt{i=1mXiAi1/2Wi1RiWi1Ai1/2Xi}Wt, (14)

where

Wt={i=1mXiAi1/2Wi1Ai1/2Xi}1. (15)

If the correct structure was specified, so that Wi = Ri, then the covariance matrix Vw can be simplified as Vt = limm→∞ mφWt.

The efficiency for β̂j was then evaluated as the jth diagonal element of Vt divided by the jth diagonal element of Vw. However, as noted by Sutradhar and Das (1999), γ̂ may fail to be consistent when the true structure is mis-specified, so that the efficiencies should be calculated at the limiting value of γ̂. We therefore evaluated the efficiencies at Wi(f(α)) and Ri(α), where f(α) is the limiting value of γ̂ when the mixed correlation structure is misspecified as exchangeable. An algorithm to obtain the limiting value f(α) as a function of the true correlation parameter α is provided in the Appendix. Because the efficiencies were calculated as the number of subjects m → ∞, we assumed that the covariate design for the OOA study was replicated as m increases.

In addition, because the asymptotic distribution for β̂ is identical for QLS and GEE, the approach for calculation of AREs described in this section also applies when GEE is applied for an exchangeable working structure, but the true structures are mixed familial structures. GEE implements the following moment estimate for the exchangeable structure that is a function of the Pearson residuals zij:

α^GEE=i=1mkjzikziji=1mk=1ni(ni1)zik2. (16)

It is straightforward to show (Wang and Carey, 2003) that the limiting value of α̂GEE is given by

i=1mkjCorr(yij,yik)i=1m(ni1)ni=i=1m(eiRi(α)eini)i=1m(ni1)ni, (17)

where ei is an ni by 1 vector of ones. The limiting values were almost identical for QLS and GEE. As a result, the efficiencies were almost identical for the two approaches.

Table 1 displays the efficiencies for QLS. (An equivalent table for GEE, with almost identical results, is available on request.) Lines 1–3 in Table 1 assess the situation when the father-sibling and sibling-sibling correlations are negligible, but the mother-sibling correlations are non-negligible and get increasingly larger (in going from line 1 to line 3). Lines 4–6 assess the situation when the father-sibling and mother-sibling correlations are negligible, but the sibling-sibling correlations are non-negligible and get increasingly larger (in going from lines 4 to 6). Lines 7–9 assess the situation when the father-sibling and mother-sibling correlations are nonnegligible and similar in value, with sibling-sibling correlations that get increasingly larger (in going from lines 7–9). Table 1 indicates that, as we might anticipate, the loss in efficiency is negligible when the true correlations are small, so that the true structure is close to an identity structure, which is a special case of an exchangeable structure (with γ = 0). However, as the true correlations increase in value, the loss in efficiency can become substantial when the true mixed familial structures are misspecified as exchangeable. For example, as shown in line 6, when ρ1 = 0.02, ρ2 = 0.05, and α = 0.71, then the ARE for age is only 79 percent.

Table 1.

Percent efficiencies for the regression coefficients for the constant term, gender, and age, when the true mixed correlation structure is misspecified as exchangeable. True structure = mixed Ri(α) where α = (ρ1, ρ2, α); working structure = exchangeable with parameter γ. limit = f(α) is the limiting value of γ̂ when the true mixed structure is misspecified as exchangeable in the analysis of the OOA study

ρ1 ρ2 α limit constant gender age
0.02 0.11 0.05 0.0510 0.99 0.99 0.99
0.02 0.31 0.05 0.0604 0.97 0.98 0.97
0.02 0.41 0.05 0.0652 0.88 0.72 0.88

0.02 0.05 0.41 0.3582 0.94 0.96 0.95
0.02 0.05 0.51 0.4422 0.90 0.92 0.92
0.02 0.05 0.71 0.6092 0.81 0.79 0.79

0.30 0.20 0.50 0.4657 0.95 0.94 0.96
0.30 0.20 0.70 0.6345 0.87 0.83 0.86
0.30 0.20 0.90 0.8029 0.73 0.48 0.53

The results shown in Table 1 therefore indicate that incorrect application of the exchangeable structure (which is a popular structure in analysis of clustered data) for all families can result in a substantial loss in efficiency in estimation of β. The results in Table 1 are important because it is sometimes claimed that careful modeling of the correlation structure is not crucial, because even if the structure is misspecified, GEE (and QLS) will yield a consistent estimate of the regression parameter. However, our ARE calculations demonstrate that if the structure is misspecified, even though β̂ is consistent, we can suffer a substantial loss in efficiency in estimation of β.

5. ANALYSIS OF THE MOTIVATIONAL STUDY

Here we present our results of the OOA analysis, to demonstrate implementation of the familial structures considered in this manuscript. The OOA population is ideal for studying familial association because the OOA live within a structured and uniform society where most individuals share a common lifestyle. The data considered here represent information on 296 individuals organized from 60 families, of which 33 had both parents and some siblings; 1 had only a father and siblings; 4 had a mother and siblings; and 22 had only siblings. The mean number of siblings in a family was 3.8 (range = 1–11). The mean age was 37.6 (range = 18–85). Recruitment and data collection of the parent study which provided the data for our secondary analysis has been described elsewhere (Wojciechowski et al., 2009).

The main outcome measure used in this analysis was the spherical component of each subject’s refractive error. Briefly, refractive error relates to an individual’s spectacle prescription. Refractive error is a spherical correction which denotes the power of a spherical lens (a lens whose properties do not change based on orientation) placed in front of a subject’s eye to optimize their vision. For some subjects spherical correction alone is sufficient to correct their vision. Lens values for the spherical component of a subject’s refraction can have either a positive or negative value and are expressed in units of optical power called diopters. The outcome for our analysis was the spherical (correction) value, which measures the power of a lens placed in front of the eye that does not depend on orientation. We considered the spherical values of the left eye, right eye, and the average spherical value of both eyes. The covariates we considered included gender (gender = 1 for males and gender = 0 for females) and the age in years at which the eye exam was conducted.

Our primary objective was to relate the expected spherical values with gender and age. We assumed that the families with both parents and siblings had an extended familial correlation structure (4) with zero correlation between parents, so that γ = 0 in (4). Families with only a father and siblings, only a mother and siblings, or only siblings, were assumed to have correlation structures (5), (6), and (7), respectively.

Table 2 displays the estimates of the regression parameter estimators. (QLS and GEE share the same asymptotic distribution for β̂; The results shown here are based on application of a “sandwich based” estimate of the covariance matrix of β̂ for calculation of standard errors (Chaganty and Shults, 1999), and p-values for the tests that βj = 0.) As shown in Table 2, the estimated constant was negative, while the regression coefficients for (male) gender and for age were positive. Although the regression coefficients for age and gender did not differ significantly from zero at a 0.05 level (perhaps as a result of limited power due to the modest number of OOA families studied), the coefficients did differ significantly from zero at a 0.10 level. These results suggest that male gender and higher age are associated with less myopia, where myopia is indicated by negative spherical values.

Table 2.

The regression parameter estimators for the OOA ophthalmology study. Gender = 1 for male and 0 for female. Age is in years

Outcome intercept gender age
est.(p-value) est.(p-value) est.(p value)
R. Sph. −2.67(< .0001) 0.75(0.067) 0.016(0.102)
L. Sph. −2.80(< .0001) 0.77(0.051) 0.015(0.098)
Av. Sph. −2.77(< .0001) 0.76(0.055) 0.016(0.074)

Next, Table 3 displays the QLS estimates of the correlation parameters. Notice that the estimated correlations were similar for the right sphere, left sphere, and average sphere. The estimated correlations were greatest between father and siblings, and smallest between siblings. These findings are consistent with the method of family ascertainment.

Table 3.

The correlation parameter estimators for the OOA ophthalmology study. ρ̂1 is the estimated correlation between father and siblings, ρ̂2 is the estimated correlation between mother and siblings and α̂ is the estimated correlation within siblings

Outcome ρ̂1 ρ̂2 α̂
Right Sphere 0.2932 0.2241 0.0234
Left Sphere 0.2740 0.1420 0.0130
Average Sphere 0.2880 0.1996 0.0177

6. DISCUSSION

In this paper, we considered QLS, a two-stage approach based on GEE that uses the same estimating equation for estimation of β, but that differs from GEE with respect to estimation of α. We proved that the stage one QLS estimates exist and are feasible, while the stage two QLS estimates will be consistent with probability one, for the class of mixed linear correlation structures.

Our results regarding the stage one QLS estimators did not require correct specification of the correlation structure, so that the working correlation structure need not equal the true structure in order for a feasible stage one QLS estimate to exist that minimizes the generalized error sum of squares evaluated at that working structure (Theorem 3.1). Furthermore, the stage one estimate can be obtained as the unique solution to the QLS stage one estimating equation as long as the working structure (not necessarily the true structure) has a linear correlation structure (Theorem 3.2 and Corollary 3.3). In other words, there exists a stage one QLS estimate for any working structure and this estimate will be straightforward to obtain if the working structure is linear.

However, unlike the first stage, stage two of QLS does require correct specification of the working structure because this second stage was developed to overcome a major flaw with stage one of the procedure, namely that it does not yield a consistent estimate of α, even when the working structure is correctly specified. Stage two of QLS provides a correction to the stage one estimate; if the working structure does not equal the true structure, the wrong correction will be applied and consistency in general will not be achieved. Our results regarding the stage two QLS estimate of α, including Theorem 3.4 and the results in Section 3.2.2, therefore require correct specification of the working correlation structure. Just as does GEE, QLS requires correct specification of the working correlation structure as a prerequisite for consistent estimation of α. However, just like the GEE estimate, the QLS estimate of β will be consistent even if the working structure does not equal the true structure.

We considered familial correlation structures that are members of the class of mixed linear correlation structures. Our general results justified implementation of QLS for the familial structures, in addition to different members of the class of mixed linear structures, e.g. the banded Toeplitz structure that was considered by Shults et al. (2006).

Our work was motivated by a study of spherical optical values in the Old Order Amish (OOA). For this analysis, we implemented QLS for mixed familial correlation structures, which allowed the father-sibling, mother-sibling and sibling-sibling correlations to vary in value. An important feature of the OOA study was that the families varied in size; Our implementation of QLS therefore relaxed the assumption of constant family size and composition that is sometimes made in analysis of familial data.

We also conducted efficiency calculations based on the covariate design of the OOA study, to demonstrate that if the mixed familial structures were the true structures, but were misspecified as exchangeable structures, then we could suffer a serious loss in efficiency in estimation of the regression parameter. Our analysis and efficiency calculations demonstrated that it can be important to carefully model the correlation structure of the data, in order to maximize the information from the data and improve efficiency in estimation of the regression parameter. To encourage the use of the mixed familial correlation structures in practice, we also provide R functions that extend our previous software for application of QLS in R (Xie and Shults, 2009) for implementation of these structures. The R functions, and an R script file that demonstrates their use, is available on request from the first and second authors.

Future research that builds on our methods would be helpful. In extending this exploratory analysis to a larger sample ascertained without regard to myopia status, it will be useful to develop a test that incorporates the gender of each type of family member and to test whether like gender relationships differ from mixed gender relationships within and between families. For example, are the father-son and father-daughter correlations equal in value and are they significantly different from the mother-son and mother-daughter correlations? In addition, in this manuscript we considered spherical values that were measured on the left eye and right eye of each subject, and that were computed as the average of measurements on both eyes. Future work might extend our approach to allow for simultaneous analysis of both eyes. For example, the approach of Shults and Morrow (2002) and Shults, Whitt, and Kumanyika (2004) might be applied to adjust for two sources of correlation: due to the potential similarity of spherical values that are measured on the same subject, or between two members of the same family.

APPENDIX A. PROOFS OF MAIN RESULTS

Proof of Theorem 3.1

To prove this theorem, we need the following lemma:

Lemma A.1

R(ρ) is a differentiable n × n correlation matrix. Ω0 is the margin of the feasible region for R(ρ). Then we have

Prob(limρΩ0zTR1(ρ)z=zRn)=1. (18)

We prove Lemma A.1 in Appendix B. Here, we directly use this lemma to prove Theorem 3.1. Suppose the feasible region for Ri(ρ) is Ωi, and the margin of Ωi is Ωi0. Then the overall feasible region is Ω = ∩Ωi, and the margin is Ω0 Inline graphic ∪Ωi0. Therefore,

Prob(limρΩ0i=1mziTRi1(ρ)zi=ziRn,i)Prob(limρΩi0i=1mziTRi1(ρ)zi=ziRn,i)Prob(limρΩi0ziTRi1(ρ)zi=ziRn)=1. (19)

Ω = ∪Ωi is an open set. Because of (19), we know the minimized point of (9) is taken within Ω. And thus the stage one estimators exist and are feasible almost surely.

Proof of Theorem 3.2

We only need to show that (9) is convex when α ∈ Ω, and it is equivalent to show that

H=2Q(β,R(α))α2 (20)

is positive definite for all α ∈ Ω.

Using the fact that

Ri1(α)α=Ri1(α)Ri(α)αRi1(α), (21)

we get

Hjk=2Q(β,R(α))αjαk=i=1mziTRi1(α)Ri(α)αjRi1(α)Ri(α)αkRi1(α)zi (22)

Therefore, ∀ nonzero x = (x1, …, xs) ∈ ℝs,

xTHx=j,kxjHjkxk (23)
=i=1mj,kxjziTRi1(α)Ri(α)αj·Ri1(α)Ri(α)αkRi1(α)zixk (24)

Define γj(i)=Ri(α)αjRi1(α)zixj and G(i)=(β1(i),,βs(i)). Then,

xTHx=i=1m1TG(i)TRi1(α)G(i)1 (25)

For all α ∈ Ω, since R−1 (α) is positive definite, to show xTHx > 0, we only need to show G(i)1 ≠ 0 when x ≠ 0.

G(i)1=j=1sγj(i)=[j=1sRi(α)αjxj]Ri1(α)zi (26)

By assumption, for all x ≠ 0,

[j=1sRi(α)αjxj]0. (27)

Since zi ∈ ℝni, Ri1(α)zi does not lie in the solution space for (27) almost surely. And therefore, (26) does not equal to 0 almost surely.

Proof of Corollary 3.3

It is easy to show that for linear correlation structure, α is identifiable if and only if (12) is satisfied.

Proof of Theorem 3.4

If Ri(α) has the form as (2), then

dRi1(δ)dδj|δ=δ^=Ri1(δ^)(Ri(ej)Ri(0))Ri1(δ^). (28)

Plug (2) and (28) into (11) and define Aij=Ri1(δ^)(Ri(ej)Ri(0)), we can rewrite the stage two estimating equation as

i=1mk=1str(AijAik)αk=tr(AijRi1(δ^)Ri(0)),j=1,,s. (29)

Let Mjk=i=1mtr(AijAik),wj=i=1mtr(AijRi1(δ^)Ri(0)). Suppose M = (Mjk)s×s and w = (w1, …, ws)T, then (29) can be written as a linear form

Mα=w. (30)

We have the following lemma, which will be proved in Appendix B.

Lemma A.2

Let Aij=Ri1(δ^)(Ri(ej)Ri(0)),Mjk=i=1mtr(AijAik). If for each cluster i ∈ {1, …, m}, Ri has the linear correlation structure form (2), then M = (Mjk)s×s is positive definite.

Therefore, the stage two estimator α̂ = M−1w always exists and is unique.

APPENDIX B. PROOFS OF OTHER RESULTS

Proof of Lemma A.1

Suppose eigenvalue, eigenvector pair of R(ρ) is

{(λ1(ρ),v1(ρ)),,(λn(ρ),vn(ρ))}.

Note that the corresponding eigenvalue and eigenvector pairs of R−1(ρ) is

{(1/λ1(ρ),v1(ρ)),,(1/λn(ρ),vn(ρ))}.

Let Inline graphic(ρ) = span{vi(ρ) : λi(ρ) = 0}. Inline graphic = ℝn\Inline graphic.

Note that the feasible region Ω, which requires all the eigenvalues of R is positive definite, is an open region. It is obvious that on Ω, R−1 is continuous and differentiable too, since R−1 = det(R)R*, where R* is the companion matrix of R.

Forall z and M1, let’s fix them temporarily. We take a point ρ0 in the feasible region. ∀ρ1 ∈ Ω0, if R(ρ1) = 0 (I will prove the other situation later), we choose 0 < ε < ||z||2M1, ∃δ > 0, such that if ||ρρ1|| < δ, ||R(ρ) − R(ρ1)||F < ε. According to Hoffman-Wielandt Theorem, there exists a permutation π(1), π(2), …, π(n) of 1, 2, …, n, such that ∀ρΘ1,

(i=1nλ(ρ)π(i)λ(ρ1)i2)12<||R(ρ)R(ρ1)||F<ε. (31)

From (31), we know that ∀ρΘ1, λ(ρ) < ε, and therefore 1λ(ρ)>1ε. Thus, we have

zR1(ρ)z>||z||2min(1/λρ)>||z||2/ε>M1. (32)

If R(ρ1) ≠ 0, let’s suppose λ1(ρ1) = ···= λk(ρ1) = 0, and 0 < λk+1(ρ1) ≤ ··· λn(ρ1). Since ρ1 ∈ Ω0 and R(ρ1) ≠ 0, 1 ≤ kn − 1. Then Inline graphic1(ρ1) = span{v1(ρ), …, vk(ρ)}. Obviously, Inline graphicInline graphic. Since Inline graphic(ρ1) ≠ φ,

Prob{Proj(zX1)=0}=1. (33)

Therefore, with probability 1, M2 = Proj(z | Inline graphic) > 0.

Suppose M3 = λk+1(ρ1). ∀ 0 < ε < min{M2M3/4, M2/(2M1)}, ∃δ > 0, when ||ρρ1|| < δ, ||R(ρ) − R(ρ1)||2 < ε, and ||R(ρ) − R(ρ1)||F < ε. From Hoffman-Weilandt Inequality, we know that λi(ρ) < ε, ∀i = 1, …, k. (The induction is the same as (31)). According to Stewart Inequality, since ||R(ρ) − R(ρ1)||2 < ε, dist(Inline graphic (ρ), Inline graphic (ρ1)) ≤ 2ε/M3 = M2/2. If Proj(z | Inline graphic (ρ1)) = M2 ≠ 0, then Proj(z | (X)1(ρ)) > M2/2 > 0. Thus,

z'R1(ρ)z||Proj(zX1(ρ))||2/ε>M1. (34)

Therefore, we have

Prob{zR1(ρ)z>M1}=Prob{Proj(zX1(ρ1))}=1,||ρρ1||<δ. (35)

Since Ω0 is a close region, there exists finite round discs which can cover Ω0. Within every disc, (35) stands. Therefore, within all the finite round discs, (35) stands. Thus, we proved Lemma A.1, and therefore demonstrated that the stage one estimates will have feasible solution with probability 1 for any correlation structure.

Proof of Lemma A.2

x ∈ ℝs, we will show xTM x > 0. Suppose x = (x1, …, xs).

xTMx=i=1mj=1sk=1sxjtr(AijAik)xk=i=1mj=1sk=1str(BijBik)=i=1mtr(Bi2), (36)

where

Bij=xjAij=Ri1(δ^)(Ri(xjej)Ri(0)), (37)
Bi=j=1kBij=Ri1(δ^)(j=1s(Ri(xjej)Ri(0)))=Ri1(δ^)(Ri(x)Ri(0)). (38)

Thus,

Bi2=GiHi, (39)

where

Gi=Ri1(δ^) (40)
Hi=(Ri(x)Ri(0))Ri1(δ^)(Ri(x)Ri(0)). (41)

Since δ̂ is the final stage one estimates for the correlation parameters, by Theorem 3.1 Gi is positive definite. ∀y ∈ ℝs, yTHiy = [(Ri(x) − Ri(0))y]TGi[(Ri(x) − Ri(0))y] ≥ 0, and thus Hi is semi-positive definite.

Kleinman and Athans (1968), in the context of design of suboptimal control systems, obtained that, for any two semi-positive definite matrix A and B,

λn(A)tr(B)tr(AB)λ1(A)tr(B), (42)

where λi(A) is the ith largest eigenvalue of A.

Because Gi is positive definite, λn(Gi) > 0; and because Hi is semi-positive definite and Hi ≠ 0, tr(Hi) > 0. Therefore,

tr(Bi2)=tr(GiHi)λn(Gi)tr(Hi)>0. (43)

As a result,

xTMx=i=1mtr(Bi2)>0, (44)

and thus M is positive definite.

APPENDIX C. THE LIMITING VALUE OF THE QLS ESTIMATE OF WHEN THE TRUE MIXED CORRELATION STRUCTURE IS MISSPECIFIED AS EXCHANGEABLE

Assume the true mixed correlation structures Ri(α) have been misspecified as exchangeable Wi(γ). Next, using arguments similar to those given in Theorem 3.2 of Chaganty and Shults (1999), we note that E(Zi(β)Zi(β))=φRi(α). It is then easy to show that the solution to the stage one estimating equation (10) converges in probability to the solution (for γ) to the following estimating equation:

trace(i=1mαWi1(γ)Ri(α))=0. (45)

The inverse of an exchangeable structure Wi(γ) can be expressed as Wi1(γ)=1(1γ)Iniγ(1γ)(1+(ni1)γ)ejej, where Ini is the identity matrix and ej is a ni × 1 column vector of ones. Next, if we note that trace(ejejRi(α))=trace(ejRi(α)ej)=ejRi(α)e, equation (45) can easily be simplified as follows:

i=1mnii=1m1+γ2(ni1)(1+γ(ni1))2ejRi(α)ej=0. (46)

In general, a solution g(α) (for γ) to (46) can be obtained using the bisection method. We next note that under an assumption of an exchangeable structure, the stage two estimate is obtained as the solution f (γ) to the stage two estimating equation (11) that is evaluated at γ̂ for exchangeable structures Ri(γ). Since γ^pg(α), it then follows that the limiting value of the stage two estimate for γ converges in probability to f (g(α)), so that the limiting value of γ̂ can be obtained by solving (11) at δ̂ = g(α). The stage two estimating equation has a closed form solution for the exchangeable structure that is provided in (C.3) of Shults and Morrow (2002), for si = ni and when (C.3) is calculated over all i, i.e. when gi = 1 for all i, so that we only have one group of subjects.

An algorithm to obtain the limiting value can then be expressed as follows:

  1. For assumed true values of α, use the bisection method to obtain a solution g(α) to (46).

  2. Next, obtain the limiting value of γ̂ by evaluating (C.3) of Shults and Morrow (2002) at τ̂1 = g(α), where si =ni and gi = 1 for all i.

Contributor Information

Jichun Xie, Email: jichun@mail.med.upenn.edu, Department of Biostatistics and Epidemiology, University of Pennsylvania School of Medicine, Tel: (215) 573-8950.

Justine Shults, Email: shults@mail.med.upenn.edu, Department of Biostatistics and Epidemiology, University of Pennsylvania School of Medicine, Tel: (215) 573-6526.

Jon Peet, Email: jonpeet@comcast.net, Department of Ophthalmology, University of Pennsylvania School of Medicine, Tel: (215) 662-8100.

Dwight Stambolian, Email: stamboli@mail.med.upenn.edu, Department of Ophthalmology, University of Pennsylvania School of Medicine, Tel: (215) 898-0305.

Mary Frances Cotch, Email: mfc@nei.nih.gov, National Eye Institute, Division of Epidemiology and Clinical Applications, National Institutes of Health, Tel: (301) 496-6583.

References

  1. Carey VC, Zeger SL, Diggle PJ. Modelling multivariate binary data with alternating logistic regressions. Biometrika. 1993;80 (3):517–526. [Google Scholar]
  2. Chaganty NR. An alternative approach to the analysis of longitudinal data via generalized estimating equations. Journal of Statistical Planning and Inference. 1997;63:39–54. MR1474184. [Google Scholar]
  3. Chaganty NR, Deng Y. Ranges of measures of association for familial binary variables. Communications in Statistics –Theory and Methods. 2007;36(3):587–598. MR2391887. [Google Scholar]
  4. Chaganty NR, Shults J. On eliminating the asymptotic bias in the quasi-least squares estimate of the correlation parameter. Journal of Statistical Planning and Inference. 1999;76:127–144. MR1673345. [Google Scholar]
  5. Crowder M. On the use of a working correlation matrix in using generalised linear models for repeated measures. Biometrika. 1995;82 (2):407–410. [Google Scholar]
  6. Gleseer LJ. A note on the analysis of familial data. Biometrika. 1992;79(2):412–415. MR1185143. [Google Scholar]
  7. Karlin S, Cameron EC, Williams PT. Sibling and parent-offspring correlation estimation with variable family size. Proceedings of the National Academy of Sciences. 1981;78(5):2664–2668. doi: 10.1073/pnas.78.5.2664. MR0615035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Kim H, Shults J. QLS SAS macro: A SAS macro for analysis of longitudinal data using quasi-least squares. UPenn Biostatistics Working Papers, Working Paper. 2008:27. http://biostats.bepress.com/upennbiostat/papers/art27. This paper is also in press at the Journal of Statistical Software.
  9. Liang KY, Zeger SL. Longitudinal data analysis using generalized linear models. Biometrika. 1986;73(1):13–22. MR0836430. [Google Scholar]
  10. Prentice RL. Correlated binary regression with covariates specific to each binary observation. Biometrics. 1988;44(4):1033–1048. MR0980998. [PubMed] [Google Scholar]
  11. Qaquish BF, Liang KY. Marginal models for correlated binary responses with multiple classes and multiple levels of nesting. Biometrics. 1992;48(3):939–950. [PubMed] [Google Scholar]
  12. Qu Y, Williams GW, Beck GJ, Medendorp SV. Latent variable models for clustered dichotomous data with multiple subclusters. Biometrics. 1992;48(4):1095–1102. [Google Scholar]
  13. Ratcliffe S, Shults J. GEEQBOX: A MATLAB tool-box for implementation of quasi-least squares and generalized estimating equations. Journal of Statistical Software. 2008;25(14):1–13. [Google Scholar]
  14. Sabo RT, Chaganty NR. Adaptation of quasi-least squares to estimate correlations within a nuclear family. Communications in Statistics – Theory and Methods. 2009;38(16):3059–3076. MR2568204. [Google Scholar]
  15. Shults J. PhD Thesis. Department of Mathematics and Statistics, Old Dominion University; Norfolk, Virginia: 1996. The analysis of unbalanced and unequally spaced longitudinal data using quasi-least squares. [Google Scholar]
  16. Shults J, Chaganty NR. Analysis of serially correlated data using quasi-least squares. Biometrics. 1998;54(4):1622–1630. [Google Scholar]
  17. Shults J, Mazurick C, Landis JR. Analysis of repeated bouts of measurements in the framework of generalized estimating equations. Statistics in Medicine. 2006;25:4114–4128. doi: 10.1002/sim.2515. MR2297655. [DOI] [PubMed] [Google Scholar]
  18. Shults J, Morrow A. Use of quasi-least squares to adjust for two levels of correlation. Biometrics. 2002;58(3):521–530. doi: 10.1111/j.0006-341x.2002.00521.x. MR1925549. [DOI] [PubMed] [Google Scholar]
  19. Shults J, Ratcliffe S, Leonard M. Improved generalized estimating equation analysis via xtqls for implementation of quasi-least squares in Stata. Stata Journal. 2007;7:147–166. [Google Scholar]
  20. Shults J, Whitt CM, Kumanyika S. Analysis of data with multiple sources of correlation in the framework of generalized estimating equations. Statistics in Medicine. 2004;23(20):3209–3226. doi: 10.1002/sim.1887. [DOI] [PubMed] [Google Scholar]
  21. Sutradhar BD, Das K. On the efficiency of regression estimators in generalised linear models for longitudinal data. Biometrika. 1999;86(2):459–465. MR1705378. [Google Scholar]
  22. Wang YG, Carey VJ. Working correlation structure misspecification, estimation and covariate design: Implications for generalised estimating equations performance. Biometrika. 2003;90(1):29–41. MR1966548. [Google Scholar]
  23. Wojciechowski R, Stambolian D, Ciner E, Ibay G, Holmes T, Bailey-Wilson J. Genomewide linkage scans for ocular refraction and meta-analysis of four populations in the Myopia Family Study. Investigative Ophthalmology and Visual Science. 2009;50:2024–2032. doi: 10.1167/iovs.08-2848. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Xie J, Shults S. Implementation of quasi-least squares with the R package qlspack. UPenn Biostatistics Working Papers, Working Paper. 2009:32. http://biostats.bepress.com/upennbiostat/papers/art32. This paper is under revision at the Journal of Statistical Software.
  25. Zhao LP, Prentice RL. Correlated binary regression using a quadratic exponential model. Biometrika. 1990;77(3):642–648. MR1087856. [Google Scholar]

RESOURCES