xtgeebcv: A command for bias-corrected sandwich variance estimation for GEE analyses of cluster randomized trials

John A Gallis; Fan Li; Elizabeth L Turner

doi:10.1177/1536867x20931001

Author manuscript; available in PMC: 2022 Mar 23.

Published in final edited form as: Stata J. 2020 Jun 19;20(2):363–381. doi: 10.1177/1536867x20931001


PMCID: PMC8942127 NIHMSID: NIHMS1740707 PMID: 35330784

Abstract

Cluster randomized trials, where clusters (for example, schools or clinics) are randomized to comparison arms but measurements are taken on individuals, are commonly used to evaluate interventions in public health, education, and the social sciences. Analysis is often conducted on individual-level outcomes, and such analysis methods must consider that outcomes for members of the same cluster tend to be more similar than outcomes for members of other clusters. A popular individual-level analysis technique is generalized estimating equations (GEE). However, it is common to randomize a small number of clusters (for example, 30 or fewer), and in this case, the GEE standard errors obtained from the sandwich variance estimator will be biased, leading to inflated type I errors. Some bias-corrected standard errors have been proposed and studied to account for this finite-sample bias, but none has yet been implemented in Stata. In this article, we describe several popular bias corrections to the robust sandwich variance. We then introduce our newly created command, xtgeebcv, which will allow Stata users to easily apply finite-sample corrections to standard errors obtained from GEE models. We then provide examples to demonstrate the use of xtgeebcv. Finally, we discuss suggestions about which finite-sample corrections to use in which situations and consider areas of future research that may improve xtgeebcv.

Keywords: st0599, xtgeebcv, cluster randomized trials, bias-corrected variances, sandwich variance, generalized estimating equations, finite-sample correction

1. Introduction

The cluster randomized trial (CRT) is a study design used in many fields of research. In a CRT, randomization to intervention arms is carried out at the cluster level (for example, schools or clinics) and outcomes are assessed for each member of each cluster. The cluster randomization design is typically chosen when there is a high chance of treatment spillover across study arms, when the intervention is group based, or when individual randomization is not feasible (Turner et al. 2017a). For example, a recent trial in Ghana is evaluating an intervention designed to help mothers of children under two years old become more resilient and manage daily stress more effectively (Baumgartner 2018). The trial adopts a cluster randomized design because the intervention is designed to be delivered to groups of women. As another example, in the Thinking Healthy Program Peer-Delivered Plus study, the researchers recruited depressed women in their third trimester of pregnancy from 40 villages in Pakistan, with each village then being randomized to receive either the intervention or enhanced usual care (Sikander et al. 2015; Turner et al. 2016). Because this was a public health intervention delivered by community health workers, the risk of contamination (that is, the intervention being transmitted to women in the control group) would be too high if individual women were randomized, given that many of the women within each village live relatively close to one another.

Randomizing clusters instead of individuals poses unique challenges for data analysis because the outcomes for members of the same cluster tend to be more similar than those for members of different clusters. The intraclass correlation coefficient (ICC) is a quantity that measures the degree of similarity for within-cluster observations and plays a central role in the design and analysis of CRTs (Murray 1998). Appropriate statistical methods used for trial analyses should properly reflect the within-cluster correlation and mainly include two classes of regression models: the cluster-specific (conditional) model and the population-averaged (marginal) model (Fitzmaurice, Laird, and Ware 2011). Although each modeling strategy has its own advantages, an important distinction between them is the difference in interpretation of the regression parameters (Preisser et al. 2003). A conditional model, such as the generalized linear mixed model, induces the within-cluster correlation through the latent random effects. Thus, the interpretation of the treatment effect is the average change in outcomes from control to intervention, conditional on the unobserved random effect. By contrast, marginal models separately specify a mean structure and a “working” correlation structure, and the interpretation of the corresponding treatment effect is the average change in outcomes due to intervention among the population defined by all participating clusters. Because CRTs are often conducted to evaluate public health interventions and inform policy decisions, the marginal model carries a straightforward population-averaged interpretation and may be preferred (Li, Turner, and Preisser 2018). Furthermore, the estimation and inference of marginal models are often conducted through generalized estimating equations (GEE) (Liang and Zeger 1986), a multivariate extension of quasilikelihood inference (Wedderburn 1974).

In addition to straightforward interpretation of estimated model parameters, GEE maintains a robustness property in that the treatment-effect estimates are consistent even if the working correlation model deviates from the true correlation model. In this case, the sandwich variance estimator (Liang and Zeger 1986) remains consistent for the true variance. However, the approximate unbiasedness of the sandwich variance holds only when there are many clusters (a rule of thumb is ≥ 30, although this rule is sometimes given as ≥ 40 or even ≥ 50), whereas a frequent practical limitation of CRTs is that few clusters are available because of resource constraints. In fact, a recent review by Fiero et al. (2016) found that, of the 86 studies included, about 50% randomized 24 or fewer clusters. In CRTs related to cancer published between 2002 and 2006, Murray et al. (2008) found similar results, with about 50% randomizing 24 or fewer clusters. Additionally, in their review of 300 CRTs published between 2000 and 2008, Ivers et al. (2011) found that, of the 285 studies reporting the number of clusters randomized, at least 50% randomized 21 or fewer clusters. Often, so few clusters are randomized because every cluster included in the study adds strain to limited financial and human resources. For example, in a study examining an intervention targeted at early childhood development among HIV-exposed children in Cameroon, only 10 total clusters were randomized because of resource and practical limitations (Baumgartner 2017).

When fewer than 30 to 40 clusters are randomized, the GEE sandwich variance estimator tends to be biased toward zero, leading to inflated type I error rates when testing for the intervention effect (Hayes and Moulton 2009). Proper analyses of CRTs should account for such finite-sample bias in variance estimation and adopt a bias-corrected variance estimator (Turner et al. 2017b). Several proposals for correcting such finite-sample bias have appeared in the statistical literature; see, for example, Mancl and DeRouen (2001), Kauermann and Carroll (2001), and Fay and Graubard (2001), among others. These proposals have existed for over 15 years, but to our knowledge none has yet been implemented in Stata. Introducing the bias-corrected variance estimators to Stata has significant practical implications because Stata is a popular software tool for CRT analysts. The availability of this routine will help promote better statistical practice by allowing future analysts to report appropriate p-values and confidence intervals.

The remainder of this article is organized into four sections. In section 2, we introduce the theory of bias-corrected sandwich variance estimators for GEE analyses of CRTs. In section 3, we present our newly created command, xtgeebcv, which computes parameter estimates and bias-corrected variance in GEE models. In section 4, we present two examples of its use. We conclude in section 5 with recommendations to xtgeebcv users and ideas for future additions to the functionality of the program.

2. Statistical methods

2.1. GEE

We consider a parallel-arm CRT consisting of $n$ clusters allocated into two intervention arms and note that the methods are generalizable to CRTs with more than two intervention arms. The outcome of each participant is typically measured at the end of the study and represented by $Y_{ij}$ ($i = 1, \dots, n$; $j = 1, \dots, m_i$), where $m_i$ is the number of individuals in cluster $i$. We denote the $p \times 1$ design vector by $X_{ij}$, which includes 1 (intercept), the cluster-level binary indicator for treatment assignment, and possibly additional $p - 2$ baseline covariates. Note that, for CRTs with more than two arms, one could include additional dummy variables in the design vector $X_{ij}$, and the following discussions remain unchanged. The marginal model parameterizes the marginal mean through a generalized linear model, $E(Y_{ij} \mid X_{ij}) = \mu_{ij} = g^{-1}(X_{ij}'\beta)$, where $g$ is the link function and $\beta$ is the $p$-vector of coefficients. The intervention effect is the component of $\beta$ that corresponds to the treatment indicator. To characterize the similarity between individual responses within each cluster, we often employ the exchangeable working correlation so that $\mathrm{corr}(Y_{ij}, Y_{ij'}) = \alpha$ for $j \neq j'$. The parameter $\alpha$ is interpreted as the ICC, a quantity that is vitally important for both the design and analysis of CRTs (Murray 1998). The exchangeable correlation structure is assumed for observations within the same cluster, while the observations from different clusters are assumed to be uncorrelated.

Let $Y_i = (Y_{i1}, \dots, Y_{im_i})'$ and $\mu_i = (\mu_{i1}, \dots, \mu_{im_i})'$ be the $m_i \times 1$ vector of outcomes and marginal means for cluster $i$, respectively, where $m_i$ is the $i$th cluster size. The GEE method is used to estimate the parameter $\beta$ from the marginal mean model with a specified working correlation matrix (Liang and Zeger 1986). We define $D_i = \partial\mu_i/\partial\beta'$ and let $V_i = A_i^{1/2} R_i A_i^{1/2}$ be a working covariance matrix for $Y_i$, where $A_i$ is the $m_i$-dimensional diagonal matrix with elements $\phi\nu(\mu_{ij})$, $\phi$ is the dispersion parameter, and $\nu$ is the variance function; $R_i(\alpha)$ is a working correlation matrix whose dimension may vary across clusters but is specified by the common parameter $\alpha$. With the exchangeable working correlation structure, we can succinctly write $R_i(\alpha) = (1 - \alpha) I_{m_i} + \alpha J_{m_i}$, where $I_{m_i}$ is the $m_i \times m_i$ identity matrix and $J_{m_i}$ is an $m_i \times m_i$ matrix of ones. From the results given in Li, Turner, and Preisser (2018) and Li et al. (2019), $R_i(\alpha)$ has two distinct eigenvalues, $\lambda_1 = 1 - \alpha$ and $\lambda_{i2} = 1 + (m_i - 1)\alpha$. Valid values of $\alpha$ guarantee a positive definite correlation matrix and can be easily determined from the set of linear constraints given by $\min\{\lambda_1, \lambda_{12}, \dots, \lambda_{n2}\} > 0$. In other words, the plausible range of ICC is provided by $-\left(\max_{i=1,\dots,n} m_i - 1\right)^{-1} < \alpha < 1 \;\; \forall\, m_i \geq 2$.
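
The eigenvalue constraints above translate directly into a check on candidate ICC values. The following Python sketch is illustrative only: it is not part of xtgeebcv, the function names are hypothetical, and the cluster sizes in the usage lines are made up. It assumes the exchangeable structure $R_i(\alpha) = (1 - \alpha) I_{m_i} + \alpha J_{m_i}$ and returns the two distinct eigenvalues and the admissible range of $\alpha$ for a given set of cluster sizes.

```python
def exchangeable_eigenvalues(m, alpha):
    """Return the two distinct eigenvalues of (1 - alpha) * I_m + alpha * J_m."""
    return 1.0 - alpha, 1.0 + (m - 1) * alpha


def icc_range(cluster_sizes):
    """Open interval (lower, upper) of alpha keeping every R_i(alpha) positive definite."""
    largest = max(cluster_sizes)
    if largest < 2:
        raise ValueError("at least one cluster must have two or more members")
    # The binding constraint comes from the largest cluster:
    # 1 + (m_max - 1) * alpha > 0.
    return -1.0 / (largest - 1), 1.0


def is_valid_icc(alpha, cluster_sizes):
    """Check min{lambda_1, lambda_12, ..., lambda_n2} > 0 for the given alpha."""
    return all(min(exchangeable_eigenvalues(m, alpha)) > 0 for m in cluster_sizes)


if __name__ == "__main__":
    sizes = [14] * 20                   # hypothetical trial: 20 clusters of 14
    print(icc_range(sizes))             # lower bound is -1/13
    print(is_valid_icc(0.05, sizes))    # positive ICC inside the range
    print(is_valid_icc(-0.10, sizes))   # fails because 1 + 13 * (-0.10) < 0
```

Negative ICC values are therefore admissible only in a narrow band that shrinks as the largest cluster grows.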

The GEE estimators $\hat{\beta}$, $\hat{\alpha}$, and $\hat{\phi}$ are jointly obtained by solving the set of estimating equations

$$\sum_{i=1}^{n} D_i' V_i^{-1} (Y_i - \mu_i) = 0$$

with a Newton-type algorithm implemented in the xtgee command. Furthermore, when the number of clusters is sufficiently large ($n \geq 30$), the variance–covariance of $\hat{\beta}$ can be consistently estimated by

$$\hat{\Sigma} = \hat{\Omega} \left( \sum_{i=1}^{n} D_i' V_i^{-1} r_i r_i' V_i^{-1} D_i \right) \hat{\Omega} \tag{1}$$

where $\hat{\Omega} = \left( \sum_{i=1}^{n} D_i' V_i^{-1} D_i \right)^{-1}$ is the model-based variance (what Stata terms the “conventional” variance) and $r_i = Y_i - \hat{\mu}_i$ is the residual vector of cluster $i$. Equation (1) is referred to as the robust sandwich variance. Under mild regularity conditions, the sandwich variance estimator is consistent even if the correlation structure is misspecified (Liang and Zeger 1986). In practice, the sandwich variance is often preferred over the model-based variance (whose consistency is dictated by the correct specification of the working correlation) because of this robustness property.

2.2. Bias-corrected sandwich variance estimators

A practical limitation of CRTs is that fewer than 30 to 40 clusters are often randomized, mainly because of availability or resource constraints (Ivers et al. 2011; Fiero et al. 2016). When the number of clusters is small, it is known that the residuals, $r_i$, tend to be too small, and therefore the sandwich variance tends to underestimate the true variability of $\hat{\beta}$ (Mancl and DeRouen 2001). One simple correction is known as the degrees-of-freedom (DF) correction, defined as $\hat{\Sigma}_{DF} = n\hat{\Sigma}/(n - p)$, where $n$ is the number of clusters and $p$ is the number of parameters. Such an ad hoc correction lacks theoretical motivation and does not provide satisfactory performance in empirical simulation studies designed to reflect characteristics expected in cluster randomized designs (Li and Redden 2015).¹ To improve finite-sample variance estimation, we consider four additional bias-corrected sandwich variance estimators that facilitate the implementation of the state-of-the-art recommendations for the analysis of CRTs (Li and Redden 2015; Ford and Westgate 2017).

Define the cluster leverage to be $H_i = D_i \hat{\Omega} D_i' V_i^{-1}$ (Preisser and Qaqish 1996). Kauermann and Carroll (2001) used the cluster-leverage-adjusted residuals to estimate the sandwich variance given by

$$\hat{\Sigma}_{KC} = \hat{\Omega} \left\{ \sum_{i=1}^{n} D_i' V_i^{-1} (I_{m_i} - H_i)^{-1/2} r_i r_i' (I_{m_i} - H_i')^{-1/2} V_i^{-1} D_i \right\} \hat{\Omega} \tag{2}$$

Because elements of $H_i$ are between zero and one, $\hat{\Sigma}_{KC}$ is expected to inflate the uncorrected sandwich variance $\hat{\Sigma}$. In practice, because the calculation of $(I - H_i)^{-1/2}$ tends to be unstable compared with $(I - H_i)^{-1}$, we approximate the summation within the curly brackets of (2) by

$$\left\{ \sum_{i=1}^{n} D_i' V_i^{-1} (I_{m_i} - H_i)^{-1} r_i r_i' V_i^{-1} D_i + \sum_{i=1}^{n} D_i' V_i^{-1} r_i r_i' (I_{m_i} - H_i')^{-1} V_i^{-1} D_i \right\} \Big/ 2$$

Mancl and DeRouen (2001) devised a similar bias correction by using

$$\hat{\Sigma}_{MD} = \hat{\Omega} \left\{ \sum_{i=1}^{n} D_i' V_i^{-1} (I_{m_i} - H_i)^{-1} r_i r_i' (I_{m_i} - H_i')^{-1} V_i^{-1} D_i \right\} \hat{\Omega} \tag{3}$$

Because elements of the cluster leverage $H_i$ are less than one, $\hat{\Sigma}_{MD}$ further inflates $\hat{\Sigma}_{KC}$. Fay and Graubard (2001) corrected the finite-sample bias in variance estimation by scaling the contribution from each cluster to the empirical variance

$$\hat{\Sigma}_{FG} = \hat{\Omega} \left( \sum_{i=1}^{n} C_i D_i' V_i^{-1} r_i r_i' V_i^{-1} D_i C_i \right) \hat{\Omega} \tag{4}$$

where $C_i = \mathrm{diag}\left( [1 - \min\{r, (Q_i)_{jj}\}]^{-1/2} \right)$ and $Q_i = D_i' V_i^{-1} D_i \hat{\Omega}$. The bound parameter $r < 1$ can be specified by the user but usually takes the default value 0.75 to avoid overcorrection of the bias. Finally, we implement the bias correction proposed by Morel, Bokossa, and Neerchal (2003). Their bias-corrected variance is given by

$$\hat{\Sigma}_{MBN} = \frac{(N - 1)n}{(N - p)(n - 1)} \hat{\Omega} \left( \sum_{i=1}^{n} D_i' V_i^{-1} r_i r_i' V_i^{-1} D_i \right) \hat{\Omega} + \delta_n \varphi \hat{\Omega} \tag{5}$$

where $N = \sum_{i=1}^{n} m_i$ is the total sample size, $\delta_n = \min\{0.5, p/(n - p)\}$ is the correction factor that converges to zero as $n$ increases to infinity, and

$$\varphi = \max\left[ 1, \mathrm{tr}\left\{ \left( \sum_{i=1}^{n} D_i' V_i^{-1} r_i r_i' V_i^{-1} D_i \right) \hat{\Omega} \right\} \Big/ p \right]$$

quantifies the design effect (Morel 1989). Of note, the additive bias correction (5) ensures a positive-definite covariance matrix, while the multiplicative bias corrections (2), (3), and (4) do not guarantee the positive definiteness of the estimated covariance (Morel, Bokossa, and Neerchal 2003), which was argued to be an additional benefit of (5). Once the variance estimator for the intervention effect is obtained using one of these bias-corrected variance formulas, we could conduct a test of no intervention effect by using the standard Wald z test or the Wald t test with DF n − p.
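
To make the relationship among these estimators concrete, consider a deliberately simple special case, which is an assumption of this illustration rather than a setting analyzed in the article: an intercept-only marginal mean ($p = 1$) with identity link and independence working correlation. Then $D_i$ is a vector of ones, $\hat{\Omega} = \hat{\phi}/N$, $H_i = J_{m_i}/N$, and $Q_i = m_i/N$, so the dispersion cancels and each of (1) through (4) reduces to a weighted sum of squared cluster residual totals. The Python sketch below uses a hypothetical function name and made-up data and is not the xtgeebcv implementation.

```python
def mean_variance_estimators(clusters, bound=0.75):
    """Variance of the overall mean under formulas (1)-(4), intercept-only case.

    clusters: list of lists of outcomes, one inner list per cluster.
    bound:    the Fay-Graubard bound parameter r.
    """
    N = sum(len(y) for y in clusters)
    mean = sum(sum(y) for y in clusters) / N      # GEE estimate of the intercept
    out = {"robust": 0.0, "kc": 0.0, "md": 0.0, "fg": 0.0}
    for y in clusters:
        total = sum(v - mean for v in y)          # 1' r_i, the cluster residual total
        base = total * total / N ** 2             # cluster contribution to (1)
        h = len(y) / N                            # leverage of the cluster, m_i / N
        out["robust"] += base
        out["kc"] += base / (1.0 - h)             # (2): one factor of (1 - h)^(-1)
        out["md"] += base / (1.0 - h) ** 2        # (3): two factors of (1 - h)^(-1)
        out["fg"] += base / (1.0 - min(bound, h)) # (4): C_i^2 with the bound applied
    return out


if __name__ == "__main__":
    data = [[1, 2, 3], [2, 4, 6], [0, 1, 2], [5, 5, 5]]   # made-up outcomes
    for name, value in mean_variance_estimators(data).items():
        print(name, round(value, 4))
```

With equal cluster sizes, the KC and MD variances in this special case are exactly $n/(n - 1)$ and $\{n/(n - 1)\}^2$ times the uncorrected sandwich variance, which illustrates the ordering $\hat{\Sigma} \leq \hat{\Sigma}_{KC} \leq \hat{\Sigma}_{MD}$; the FG variance coincides with KC whenever $m_i/N$ is below the bound $r$.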

2.3. Computations with large cluster sizes

When the cluster sizes m_i become large (greater than 1,000), calculation of the bias-corrected variance estimators may become computationally inefficient because of numerical inversion of large matrices. To alleviate such a concern, we first note that a closed-form expression is available for the inverse of the exchangeable correlation structure (Li, Turner, and Preisser 2018; Li et al. 2019) and is given by

$$R_i^{-1}(\alpha) = \frac{1}{1 - \alpha} I_{m_i} - \frac{\alpha}{(1 - \alpha)\{1 + (m_i - 1)\alpha\}} J_{m_i}$$
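
The closed form can be checked numerically by multiplying it against the exchangeable matrix itself. The following Python sketch is a verification aid under that single assumption (exchangeable structure); the helper names are hypothetical, and it uses plain nested lists rather than any matrix library.

```python
def exchangeable(m, alpha):
    """R(alpha) = (1 - alpha) * I_m + alpha * J_m as a nested list."""
    return [[1.0 if j == k else alpha for k in range(m)] for j in range(m)]


def exchangeable_inverse(m, alpha):
    """Closed-form inverse: a * I_m - b * J_m, with no numerical inversion."""
    a = 1.0 / (1.0 - alpha)
    b = alpha / ((1.0 - alpha) * (1.0 + (m - 1) * alpha))
    return [[a - b if j == k else -b for k in range(m)] for j in range(m)]


def max_identity_error(m, alpha):
    """Largest absolute entry of R(alpha) * R(alpha)^(-1) - I_m."""
    R = exchangeable(m, alpha)
    R_inv = exchangeable_inverse(m, alpha)
    worst = 0.0
    for j in range(m):
        for k in range(m):
            entry = sum(R[j][t] * R_inv[t][k] for t in range(m))
            worst = max(worst, abs(entry - (1.0 if j == k else 0.0)))
    return worst


if __name__ == "__main__":
    print(max_identity_error(5, 0.1))      # rounding error only
    print(max_identity_error(14, -0.05))   # a valid negative ICC also works
```

Because only the two scalars $a$ and $b$ are needed, the cost of forming $R_i^{-1}(\alpha)$ does not depend on a matrix factorization, which is what makes the closed form attractive for large $m_i$.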

Furthermore, Preisser, Qaqish, and Perin (2008) noted that inverting the asymmetric matrix $I_{m_i} - H_i$ is computationally demanding with large cluster sizes. Instead, they recommended working with its equivalent form $(V_i - D_i \hat{\Omega} D_i') V_i^{-1}$ and efficiently calculating the inverse of the symmetric matrix $V_i - D_i \hat{\Omega} D_i'$ by iteratively applying the Sherman–Morrison–Woodbury formula (Sherman and Morrison 1950; Henderson and Searle 1981). Preisser, Qaqish, and Perin (2008) demonstrated a substantial computational advantage of their algorithm over standard numeric inversion, and therefore we implement their algorithm to obtain the multiplicative bias-correction factor $(I_{m_i} - H_i)^{-1}$ for $\hat{\Sigma}_{KC}$ and $\hat{\Sigma}_{MD}$. See Preisser, Qaqish, and Perin (2008) for additional computational details.
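
As a rough illustration of the kind of update involved, the Python sketch below applies a single rank-one Sherman–Morrison step to invert $V - \omega d d'$, assuming a diagonal $V$ (independence working correlation) and a single column $d$ with scalar $\omega$ standing in for $D_i$ and $\hat{\Omega}$. These simplifications and the function names are assumptions of the sketch; it is not the algorithm of Preisser, Qaqish, and Perin (2008) or the code used by xtgeebcv.

```python
def sherman_morrison_inverse(v_diag, d, omega):
    """Inverse of diag(v_diag) - omega * d d' via one Sherman-Morrison step.

    Assumes every entry of v_diag is positive.
    """
    u = [di / vi for di, vi in zip(d, v_diag)]             # u = V^(-1) d
    denom = 1.0 - omega * sum(di * ui for di, ui in zip(d, u))
    if denom <= 0.0:
        raise ValueError("V - omega * d d' is not positive definite")
    m = len(d)
    # (V - omega d d')^(-1) = V^(-1) + omega * u u' / (1 - omega * d' u)
    return [
        [(1.0 / v_diag[j] if j == k else 0.0) + omega * u[j] * u[k] / denom
         for k in range(m)]
        for j in range(m)
    ]


def max_identity_error(v_diag, d, omega):
    """Largest absolute entry of (V - omega d d') * inverse - I."""
    m = len(d)
    inv = sherman_morrison_inverse(v_diag, d, omega)
    direct = [
        [(v_diag[j] if j == k else 0.0) - omega * d[j] * d[k] for k in range(m)]
        for j in range(m)
    ]
    worst = 0.0
    for j in range(m):
        for k in range(m):
            entry = sum(direct[j][t] * inv[t][k] for t in range(m))
            worst = max(worst, abs(entry - (1.0 if j == k else 0.0)))
    return worst


if __name__ == "__main__":
    print(max_identity_error([2.0, 4.0, 5.0], [1.0, 1.0, 1.0], 0.5))
```

The update needs only the already known $V^{-1}$ and a few inner products, so no $m_i \times m_i$ system has to be solved from scratch.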

3. The xtgeebcv command

The xtgeebcv command was created to provide easy computation of finite-sample bias-corrected variances (hence the “bcv” in xtgeebcv) in Stata. In this section, we explain the available options in detail and examine the inner workings of the command.

The user should first specify a variable list (varlist) with an outcome (dependent) variable followed by predictor (independent) variables, just as one would do with the xtgee command. The user must tell xtgeebcv what the outcome variable and cluster indicator variable are by using the options outcome() and cluster(), respectively. Options are also available to specify the distribution family, link function, and type of finite-sample correction, as described in section 3.2.

Inside the command, the user-supplied data are passed to the xtgee command, with the command running xtset on the variable provided in the cluster() option before running xtgee. The xtgee command is specified with the option nmp. The nmp option tells xtgee to divide the scale parameter by N − p, where N is the total number of observations and p is the number of coefficients estimated. Without the nmp option, Stata defaults to dividing only by N; however, N − p is the form of the divisor used in Liang and Zeger (1986), so we use this option by default for the first set of output produced by xtgee, which reports the conventional (model-based) standard errors.

xtgeebcv allows use of either the independence or exchangeable working correlation matrices using the corr() option. Exchangeable is usually the most appropriate correlation structure to characterize the similarity between individual responses within each cluster in a cluster randomized design.

The design matrix, coefficient estimates, and variance–covariance matrix of the parameters output by the xtgee command are then passed to a Mata routine, which is used to compute and output the desired finite-sample corrected standard errors of the parameter estimates. As described below, the option stderr() is used to specify which of five finite-sample bias-corrected standard errors ($\hat{\Sigma}_{DF}$, $\hat{\Sigma}_{MD}$, $\hat{\Sigma}_{FG}$, $\hat{\Sigma}_{KC}$, or $\hat{\Sigma}_{MBN}$) to use for the output of standard errors, confidence intervals, and p-values.

3.1. Syntax

xtgeebcv varlist, outcome(varname) cluster(varname) [family(string) link(string) stderr(string) statistic(string) corr(string) xtgee options]

varlist contains the regression specification: the dependent variable (outcome) followed by independent variables (predictors). Note that all categorical variables with more than two levels will need to be dummy coded by the user before supplying them to the command.

3.2. Options

outcome(varname) specifies the name of the outcome variable. outcome() is required.

cluster(varname) specifies the name of the cluster indicator variable. cluster() is required.

family(string) specifies the distributional family. The default is family(binomial).

link(string) specifies the link function. The default depends on the specification of family(): the defaults for the Gaussian, binomial, and Poisson families are link(identity), link(logit), and link(log), respectively. The following table lists the available family() and link() combinations.

family()	link()
binomial	logit
binomial	log
binomial	identity
poisson	log
poisson	identity
gaussian	identity

stderr(string) specifies the type of standard error to report. The available values are as follows:

string	Description
rb	Robust (sandwich) standard errors
df	DF correction
md	Mancl and DeRouen (2001) correction
fg	Fay and Graubard (2001) correction
kc	Kauermann and Carroll (2001) correction
mbn	Morel, Bokossa, and Neerchal (2003) correction

4. Illustrative examples

4.1. Equal-sized clusters

GEE population-averaged model		Number of obs	=	280
Group variable:	cluster	Number of groups	=	20
Link:	log	Obs per group:
Family:	binomial	min	=	14
Correlation:	exchangeable	avg	=	14.0
		max	=	14
		Wald chi2(1)	=	4.62
Scale parameter:	1	Prob > chi2	=	0.0316

yij	exp(b)	Std. Err.	z	P>|z|	[95% Conf. Interval]
treatment	1.460317	.257182	2.15	0.032	1.034044	2.062318
_cons	.45	.0663456	−5.42	0.000	.3370667	.6007713
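
The exp(b) column, its standard error, the z statistic, and the confidence limits in output of this kind are linked by standard transformations, which the Python sketch below reproduces for the treatment row above. It assumes, as is usual for exponentiated coefficients in Stata output, that the reported standard error is the delta-method value exp(b) × se(b) and that the interval is formed on the log scale and then exponentiated; the function name is hypothetical.

```python
from math import exp, log
from statistics import NormalDist


def eform_summary(ratio, se_ratio, level=0.95):
    """Recover z, the two-sided p-value, and the CI from an exponentiated estimate."""
    b = log(ratio)                  # coefficient on the log scale
    se_b = se_ratio / ratio         # undo the delta-method scaling of the SE
    z = b / se_b
    std_normal = NormalDist()
    p_value = 2.0 * (1.0 - std_normal.cdf(abs(z)))
    crit = std_normal.inv_cdf(0.5 + level / 2.0)
    return z, p_value, (exp(b - crit * se_b), exp(b + crit * se_b))


if __name__ == "__main__":
    # Values copied from the treatment row of the table above.
    z, p_value, (lower, upper) = eform_summary(1.460317, 0.257182)
    print(round(z, 2), round(p_value, 3), round(lower, 3), round(upper, 3))
```

The same transformation applies to the bias-corrected tables; only the standard error, and for the t statistic the reference distribution, changes.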

	exp(b)	Std. Err.	z	P>|z|	[95% Conf. Interval]
treatment	1.460317	.3027435	1.83	0.068	.9727063	2.192365
_cons	.45	.0840296	−4.28	0.000	.3120797	.6488726

	exp(b)	Std. Err.	z	P>|z|	[95% Conf. Interval]
treatment	1.460317	.2724691	2.03	0.042	1.013044	2.105069
_cons	.45	.0756266	−4.75	0.000	.3237131	.6255539

4.2. Unequal-sized clusters

GEE population-averaged model		Number of obs	=	4,100
Group variable:	community	Number of groups	=	20
Link:	logit	Obs per group:
Family:	binomial	min	=	169
Correlation:	exchangeable	avg	=	205.0
		max	=	257
		Wald chi2(4)	=	43.75
Scale parameter:	1	Prob > chi2	=	0.0000

know	Odds Ratio	Std. Err.	z	P>|z|	[95% Conf. Interval]
arm	2.286608	.338949	5.58	0.000	1.710079	3.057506
stratum2	1.051687	.1885727	0.28	0.779	.7400511	1.494552
stratum3	1.133454	.2181231	0.65	0.515	.7773161	1.652761
ethnicgp	.737854	.0648754	−3.46	0.001	.6210536	.8766209
_cons	.9892138	.1624665	−0.07	0.947	.7169527	1.364865

	exp(b)	Std. Err.	t	P>|t|	[95% Conf. Interval]
arm	2.286608	.3808982	4.97	0.000	1.603225	3.261287
stratum2	1.051687	.2259479	0.23	0.818	.6652899	1.662501
stratum3	1.133454	.2373591	0.60	0.559	.7253637	1.771136
ethnicgp	.737854	.0759666	−2.95	0.010	.5924699	.9189134
_cons	.9892138	.1957989	−0.05	0.957	.6487352	1.508387

	exp(b)	Std. Err.	t	P>|t|	[95% Conf. Interval]
arm	2.286608	.3406765	5.55	0.000	1.664475	3.141277
stratum2	1.051687	.2018406	0.26	0.796	.6986018	1.583227
stratum3	1.133454	.2094824	0.68	0.508	.7644029	1.680681
ethnicgp	.737854	.0728592	−3.08	0.008	.5978122	.9107016
_cons	.9892138	.1760339	−0.06	0.952	.6769598	1.445498

GEE population-averaged model		Number of obs	=	1,712
Group variable:	community	Number of groups	=	8
Link:	logit	Obs per group:
Family:	binomial	min	=	187
Correlation:	exchangeable	avg	=	214.0
		max	=	243
		Wald chi2(2)	=	18.07
Scale parameter:	1	Prob > chi2	=	0.0001

	c1	c2
r1	.04054132
r2	−.03236501	.03148758

	c1	c2
r1	.03868099
r2	−.0313821	.0313821

know	Odds Ratio	Std. Err.	z	P>|z|	[95% Conf. Interval]
arm	1.870034	.4094975	2.86	0.004	1.217459	2.872397
ethnicgp	.6190309	.0988164	−3.00	0.003	.4527249	.8464285
_cons	1.337813	.2828975	1.38	0.169	.8838899	2.02485

	exp(b)	Std. Err.	t	P>|t|	[95% Conf. Interval]
arm	1.870034	.4815623	2.43	0.059	.964633	3.625241
ethnicgp	.6190309	.1072287	−2.77	0.039	.3965803	.9662591
_cons	1.337813	.4277156	0.91	0.404	.588128	3.043121

	exp(b)	Std. Err.	t	P>|t|	[95% Conf. Interval]
arm	1.870034	.4214707	2.78	0.039	1.047698	3.337819
ethnicgp	.6190309	.0959661	−3.09	0.027	.4155684	.9221089
_cons	1.337813	.3798498	1.03	0.352	.6447855	2.77572
