Skip to main content
Wiley Open Access Collection logoLink to Wiley Open Access Collection
. 2014 Jan 30;70(2):449–456. doi: 10.1111/biom.12151

Methods for Observed-Cluster Inference When Cluster Size Is Informative: A Review and Clarifications

Shaun R Seaman 1,*, Menelaos Pavlou 2, Andrew J Copas 3
PMCID: PMC4312901  PMID: 24479899

Abstract

Clustered data commonly arise in epidemiology. We assume each cluster member has an outcome Y and covariates Inline graphic. When there are missing data in Y, the distribution of Y given Inline graphic in all cluster members (“complete clusters”) may be different from the distribution just in members with observed Y (“observed clusters”). Often the former is of interest, but when data are missing because in a fundamental sense Y does not exist (e.g., quality of life for a person who has died), the latter may be more meaningful (quality of life conditional on being alive). Weighted and doubly weighted generalized estimating equations and shared random-effects models have been proposed for observed-cluster inference when cluster size is informative, that is, the distribution of Y given Inline graphic in observed clusters depends on observed cluster size. We show these methods can be seen as actually giving inference for complete clusters and may not also give observed-cluster inference. This is true even if observed clusters are complete in themselves rather than being the observed part of larger complete clusters: here methods may describe imaginary complete clusters rather than the observed clusters. We show under which conditions shared random-effects models proposed for observed-cluster inference do actually describe members with observed Y. A psoriatic arthritis dataset is used to illustrate the danger of misinterpreting estimates from shared random-effects models.

Keywords: Bridge distribution, Immortal cohort inference, Informative missingness, Missing not at random, Mortal cohort inference, Semi-continuous data

1. Introduction

Clustered data are common in epidemiology. Repeated measures are clustered in individuals; teeth in patients; pups in litters. Suppose interest is in the association between outcome Y and covariates Inline graphic measured on members of the clusters. Often Y and Inline graphic are missing for some members of sampled clusters. For simplicity, we assume that a member's Inline graphic is observed whenever Y is observed. We call members with observed Y “observed members,” those with missing Y “missing members,” the original clusters “complete clusters,” and the subclusters that remain after discarding missing members “observed clusters.”

Missing data may arise because although a variable could, in principle, be measured, circumstances meant it was not, for example, because an individual missed a visit. We call such missing data “potentially observable.” When missing data are potentially observable, a model can be proposed for the distribution of Y given Inline graphic in all cluster members, and methods used that, under specified assumptions about the missingness (e.g., missing at random, MAR), give consistent estimates for this model. We call this “complete-cluster inference.”

Alternatively, missing data may arise because in a fundamental sense a variable does not exist. We call such missing data “unobservable.” Three examples of unobservable Y are measures of: (1) cognitive function of an individual after death; (2) degree of disablement of an individual who is not disabled; (3) health of a tooth that has been lost. Although missing Y could be set to zero when a patient is dead/not disabled/tooth is lost, in practice often a model is instead proposed for Y given Inline graphic in observed members only (so conditional on alive/disabled/tooth not lost). We call this “observed-cluster inference.” Sometimes observed-cluster inference may be of interest even when missing data are potentially observable. When missing data are unobservable “complete-cluster” inference is philosophically problematic: what does it mean to model cognitive function in dead people?

When the size M of complete clusters varies, it is usually assumed that Y is independent of M given Inline graphic. In observed clusters, however, Y and N may be conditionally dependent given Inline graphic, where N is size of observed cluster. For example, in a dental study, the fewer teeth a patient has, the worst their condition tends to be. This is called “informative cluster size” (ICS).

So far we have assumed observed clusters are generated from complete clusters by excluding missing members, but ICS can also arise where observed clusters are complete in themselves. For example, in toxicology, exposed dams who are more sensitive to a toxin may tend to have smaller litters and offspring with greater probability of deformation than less sensitive dams, so that Y (pup being deformed) and N (litter size) are dependent given X (exposure of dam).

We shall show that three of the methods proposed for observed-cluster inference under ICS, viz. weighted and doubly weighted generalized estimating equations (GEE) and shared random effects models, can be seen as actually giving inference for complete clusters. When the YInline graphic associations in complete and observed clusters are the same, the distinction is unimportant. However, ICS causes them to differ in general. So, it is important to understand when methods proposed for observed-cluster inference really do describe observed clusters. In the literature on modeling repeated measures in cohorts with high death rates (Dufouil et al., 2004; Kurland et al., 2009) a distinction has been made between complete-cluster (termed “immortal-cohort”) inference and observed-cluster (“mortal-cohort”) inference. However, conditions under which the two inferences are equivalent have not been set out, and in the wider literature the distinction seems to be less well recognized.

In Section 2011 we define notation and discuss methods for complete-cluster inference from observed data. Section 2004 defines ICS and discusses how ICS relates to missing-data mechanisms. Section 2011 relates two weighted GEE methods, one proposed for complete-cluster inference in the missing-data literature, and one for observed-cluster inference in the ICS literature. We also show that doubly weighted GEE, proposed for observed-cluster inference, actually give complete- rather than observed-cluster inference, and that, moreover, there is no single complete-cluster inference. Shared random-effects models give complete-cluster inference, but have also been used for observed-cluster inference. In Section 2011 we discuss when this is valid, and in Section 2011 we use a psoriatic arthritis dataset to illustrate that some parameters of such a model may be relevant to observed clusters but others not. In brief, we replicate an analysis of association between disability and covariates, with measurements clustered by patient. Our interest is in how sex affects degree of disability in the “observed clusters” of measurements where degree is greater than zero, that is, given disability. The analysis uses models for probability of disability and for degree of disability given disability which share a random intercept. Because probability of disability is higher in women than in men with the same intercept and other covariates, intercept and sex are not independent given disability and other covariates. Consequently, the effect of sex on degree of disability given disability is less than is suggested by the estimated parameter.

2. Notation and Complete-Cluster Inference

Let K be the number of complete clusters in the sample. When needed we use subscript i to index cluster, but usually omit this. Let M (known) be size of complete cluster. Let Inline graphic and Inline graphic (Inline graphic) be outcome and covariate vector, respectively, for member j of the complete cluster, and Inline graphic and Inline graphic. Let Inline graphic if Inline graphic is observed, Inline graphic if Inline graphic is missing, and Inline graphic. Inline graphic is always observed. Members with Inline graphic are “observed members”; those with Inline graphic are “missing members.” Let Inline graphic be size of observed cluster. Assume Inline graphic Inline graphic are i.i.d. For any value Inline graphic of Inline graphic, partition Inline graphic, where Inline graphic belongs to Inline graphic if Inline graphic and to Inline graphic if Inline graphic. For example, Inline graphic and Inline graphic. Partition Inline graphic likewise, except that if some elements of Inline graphic are observed even on missing members, these elements belong to Inline graphic.

Data are missing at random (MAR) if Inline graphic Inline graphic for some function Inline graphic (informally, Inline graphic) and missing completely at random (MCAR) if Inline graphic (Seaman et al., 2013) (note M is a function of Inline graphic, as Inline graphic has M columns). Otherwise they are missing not at random (MNAR). We say data are missing with equal probability (MWEP) if Inline graphic Inline graphic. MCAR means that which members are observed does not depend on Inline graphic or Y values in the cluster. This would be so if, for example, missing data had been lost by the researchers. MAR allows missingness to depend on data on observed members plus any observed data on missing members. For example in a longitudinal study individuals’ probability of dropout may depend on past health measurements but not on current health. If it also depends on current health, the data are MNAR. MWEP means the number N of observed members may depend on Inline graphic and Y but given this number all sets of N observed members are equally likely. This could be so if missingness depends only on cluster-level summaries of Inline graphic and Y.

The missingness process is monotone if Inline graphic Inline graphic. Inline graphic then defines Inline graphic and vice versa. If Inline graphic are exchangeable given M, we say “members of complete clusters are exchangeable.” Indices Inline graphic can then be assigned to observed members and Inline graphic to missing members. Missingness is then monotone.

To make “complete-cluster” inference, a model is specified for Inline graphic given Inline graphic. To fit this using observed data (Inline graphic), an assumption (e.g., MAR) is made about the missingness process and a method used that is valid under this assumption, for example, inverse probability weighting (IPW) or random-effect models (Albert and Follmann, 2009). We consider two approaches to complete-cluster inference that relate to methods proposed for observed-cluster inference. The first specifies a (marginal) model for Inline graphic and assumes

graphic file with name biom0070-0449-m64.jpg (1)

so that we can define Inline graphic. This model is fitted to observed clusters using GEE with IPW. The second approach uses a shared random-effects model. This gives cluster-specific inference, but random effects can be integrated out to get Inline graphic.

3. Informative Cluster Size

3.1. Semi-Parametric Marginal Models

For each cluster with Inline graphic, let H be the index of a randomly selected member of the observed cluster. So, Inline graphic. Marginal inference for the population of typical observed members and marginal inference for the population of all observed members mean estimating the parameters of a model for Inline graphic and for Inline graphic, respectively. Whereas Inline graphic is the expectation of Y given Inline graphic giving equal weight to each observed cluster, Inline graphic gives equal weight to each observed member. Clusters with Inline graphic play no role in Inline graphic or Inline graphic.

Hoffman et al. (2001), Williamson et al. (2003) and Benhin et al. (2005) define non-informative cluster size (NICS) as Inline graphic Inline graphic. Otherwise cluster size is informative (ICS). Under NICS, Inline graphic Inline graphic. Under ICS, Inline graphic in general. They advocate using Inline graphic. Use of Inline graphic has been proposed for mortal cohorts when missing data are due to death, and for modeling degree of disability or health of teeth when missing data are due to non-disabled patients or absent teeth (Dufouil et al., 2004; Kurland et al., 2009; Su et al., 2011; Li et al., 2011). Hoffman et al. (2001) gave an estimator for Inline graphic. Williamson et al. (2003) and Benhin et al. (2005) gave an asymptotically equivalent and computationally less intensive method: weighted independence estimating equations (WIEE) (see also Wang et al. (2011) for three-level data). The same equations without weighting (IEE) estimate Inline graphic. We describe WIEE and IEE in Section 2007.

3.2. Random-Effects Models

Dunson et al. (2003), Gueorguieva (2005), Chen et al. (2011), and Neuhaus and McCulloch (2011) consider cluster-specific inference using a linear or generalized linear mixed model (LMM/GLMM). They interpret NICS to mean the random effects Inline graphic in the mixed model are independent of N, and ICS to mean they are not. NICS in this sense implies NICS in the sense of Hoffman et al., but the converse is not true. To deal with ICS when fitting the LMM/GLMM, several authors have combined it with a model for N or Inline graphic, with the same or correlated random effect (Dunson et al., 2003; Gueorguieva, 2005; Chen et al., 2011; Su et al., 2009; Su et al., 2011; Li et al., 2011). We discuss this model in Section 2011.

3.3. Relating ICS to Missingness Mechanisms

Hoffman et al. (2001) wrote that ICS is “closely related” to violation of the MCAR condition. In fact, MCAR is not a sufficient condition for NICS. For example, suppose all complete clusters have size Inline graphic and have Inline graphic, there are no covariates, and Inline graphic. It is easy to show that Inline graphic but Inline graphic.

Proposition 1

Cluster size will be non-informative if data are MCAR and, moreover, either i) equation 2009 holds, or ii) Inline graphic and the data are MWEP.

Note 2009 is often assumed with GEEs, but Inline graphic is unlikely, as Inline graphic. Proofs of Propositions are in Web Appendices A and E. Just as both ICS and NICS can arise from MCAR mechanisms, so they can from MAR and MNAR (examples in Web Appendix B).

When 2009 holds, so Inline graphic is defined, a sufficient condition for Inline graphic is MWEP and Inline graphic, because the Y-X relation in a randomly chosen member of an observed cluster is then the same as in a random member of the corresponding complete cluster.

4. Weighted and Doubly Weighted GEE

4.1. Weighted GEE (WGEE)

Assume 2009 holds and Inline graphic, where g is a link function. If Inline graphic and Inline graphic were observed, Inline graphic could be estimated with GEE. With missing data, WGEE can be used. These weight member j by Inline graphic. Robins et al. (1995) proposed use of WGEE when M does not vary, missingness is monotone and MAR, and Inline graphic.

When data are MWEP and Inline graphic, weights Inline graphic can be used instead (proof in Web Appendix C). In this case, Inline graphic (Section 2001), so WGEE with weights Inline graphic also give observed-cluster inference. In fact, with independence working correlation they are the WIEE proposed by Williamson et al. (2003) for estimating Inline graphic in Inline graphic. So, WIEE have a dual interpretation: they estimate Inline graphic under any missingness mechanism; and Inline graphic when data are MWEP and Inline graphic.

WIEE without weights (IEE) estimate Inline graphic in a model Inline graphic (Dufouil et al., 2004).

4.2. Doubly Weighted GEE (DWGEE)

If there is ICS and the distribution of Inline graphic depends on N, interpretation of Inline graphic may be awkward, because the YInline graphic association is confounded by N (Williamson et al., 2003). For example, let X be binary and Inline graphic and Inline graphic be increasing functions of N. Then typical members with Inline graphic tend to come from larger clusters than typical members with Inline graphic, so Inline graphic even though X has no effect on Y within clusters.

Huang and Leroux (2011) proposed DWGEE1 and DWGEE2. DWGEE1 can be used when Inline graphic is categorical and every observed cluster contains at least one member with each of the possible values of Inline graphic. DWGEE1 are the same as WIEE except that member j is inversely weighted not by Inline graphic but by the total number of observed members in the same cluster who have Inline graphic. Thus the total weight of members with Inline graphic is the same for all possible Inline graphic. Rather than estimating Inline graphic, DWGEE1 estimate Inline graphic in the population formed by each cluster in the population contributing one member with each possible value of Inline graphic.

DWGEE2 was proposed for when not all observed clusters contain a member with each possible value of Inline graphic. In DWGEE2 observed member j is inversely weighted by the expected (rather than actual, as in DWGEE1) number of observed members with Inline graphic. In Web Appendix D we show that DWGEE2 estimates Inline graphic in a population of larger “complete” clusters in which each cluster contains at least one member with each possible value of Inline graphic. Each cluster in the dataset is considered to be the observed component of one of these larger clusters, with the rest being missing. The problem with this is that, unless observed clusters really do arise from larger clusters in which all values of Inline graphic are represented (which is not so in Huang and Leroux's example), the larger clusters are purely hypothetical and it is unclear why they should be of scientific interest. Further, as shown in Web Appendix D, the distribution of Y given Inline graphic in the hypothetical population of complete clusters depends on which predictors are included in the model for the expected number with Inline graphic, and there is no obvious reason to prefer one set of predictors to any other.

5. Random-Effect Models

5.1. LMM, GLMM, and Shared Random Effect Model

The general form of the LMM is (continuing to omit the subscript i for cluster)

graphic file with name biom0070-0449-m140.jpg (2)
graphic file with name biom0070-0449-m141.jpg (3)
graphic file with name biom0070-0449-m142.jpg (4)

where Inline graphic is a subvector of Inline graphic, and Inline graphic a cluster-specific latent variable. This is a model for YInline graphic association in complete clusters. Assumption Inline graphic means that Inline graphic and hence that size of complete clusters is non-informative. Elements of Inline graphic not in Inline graphic are said to have fixed effects; those in Inline graphic have random effects. It follows from 2005 and 2011 that Inline graphic. So, Inline graphic also has a marginal interpretation in complete clusters. LMMs are a special case of GLMMs. In GLMMs, Inline graphic is assumed to belong to the exponential family, 2005 is replaced by

graphic file with name biom0070-0449-m155.jpg (5)

where Inline graphic is the link function, and 2011 and 2004 are assumed to hold.

If Y is binary, Inline graphic and Inline graphic has a bridge distribution with rescaling parameter Inline graphic (Inline graphic), then Inline graphic and so Inline graphic (in combination with Inline graphic) has a marginal interpretation in complete clusters (Wang and Louis, 2003). More generally, Inline graphic does not have a marginal interpretation, though Inline graphic can be calculated as Inline graphic.

The MLE of Inline graphic from fitting the mixed model to observed clusters is consistent when data are MAR, but not, in general, when MNAR. However, Neuhaus and McCulloch (2011) showed that for LMMs, if (i) Inline graphic includes an intercept term, (ii) Inline graphic are i.i.d., (iii) Inline graphic, and (iv) the only random effect is an intercept (i.e., Inline graphic), then Inline graphic is consistently estimated except for the intercept. They found the same was approximately true of GLMMs. More generally, they say that if Inline graphic and Inline graphic are subvectors of Inline graphic and Inline graphic with Inline graphic and Inline graphic, then their results suggest that the MLE of elements of Inline graphic corresponding to Inline graphic will be approximately unbiased.

For MNAR data, a model for Inline graphic can be added to the LMM/GLMM. The result is a shared random-effects model (Albert and Follmann, 2009). When

graphic file with name biom0070-0449-m182.jpg (6)

for some function Inline graphic, the MLEs of Inline graphic and Inline graphic from this model are consistent. An indirect way (Su et al., 2009; Li et al., 2011; Su et al., 2011) to model Inline graphic is to introduce another random effect Inline graphic, assume Inline graphic, and specify models Inline graphic for the distribution of Inline graphic and Inline graphic for Inline graphic. We call the resulting model for Inline graphic “a correlated random-effects model.” It is a special case of the shared random-effects model, with Inline graphic and Inline graphic.

5.2. Interpretation of Inline graphic and Inline graphic in Complete Clusters

Partition Inline graphic and Inline graphic as Inline graphic and Inline graphic, where Inline graphic and Inline graphic are the lth elements of Inline graphic and Inline graphic, respectively. If Inline graphic has a random effect, partition Inline graphic as Inline graphic, where Inline graphic corresponds to Inline graphic, and partition Inline graphic similarly. If Inline graphic has a fixed effect, Inline graphic, Inline graphic and Inline graphic. Let Inline graphic denote a vector of the same length as Inline graphic, with lth element equal to one and all other elements equal to zero.

within-cluster effects

If Inline graphic is cluster varying with fixed effect, Inline graphic is its within-complete-cluster effect in clusters of size Inline graphic. That is, if two members of the same complete cluster have Inline graphic values that differ only by Inline graphic for some Inline graphic, then their expected Y values differ by Inline graphic for an LMM. In a GLMM, the expected value is transformed by link function g; for example, for logit link, Inline graphic is their log odds ratio. If Inline graphic is cluster varying with random effect, Inline graphic and Inline graphic are the mean and variance of the within-cluster effect.

between-cluster effects

Inline graphic and Inline graphic can be interpreted in terms of differences between expected Y in members of different complete clusters. That is, if for some Inline graphic, two complete clusters are randomly sampled conditional on one containing a member with Inline graphic and the other a member with Inline graphic, then the difference between the expected Y values of these two members is

graphic file with name biom0070-0449-m234.jpg (7)

This reduces to Inline graphic for the LMM and to Inline graphic for the GLMM with bridge distribution.

causal effects

If Inline graphic is manipulable, for example, treatment, Inline graphic may be interpretable as a causal effect in complete clusters. Let Inline graphic be the potential outcome of member j when Inline graphic is manipulated to equal x. We make the following “causal assumptions” (Vansteelandt, 2007). First, Inline graphic, that is, observed outcome equals outcome that would be seen if Inline graphic were set to its observed value. Second, manipulating Inline graphic does not affect Inline graphic or Inline graphic or Y values of other members. Third, Inline graphic, where Inline graphic is set of possible values of Inline graphic. With these assumptions, the conditional expected causal effect Inline graphic of Inline graphic given Inline graphic and Inline graphic is Inline graphic. For LMMs, Inline graphic reduces to Inline graphic. The conditional expected causal effect Inline graphic of Inline graphic given Inline graphic is Inline graphic, which reduces to Inline graphic for LMMs and to Inline graphic for GLMMs with bridge distribution.

5.3. Interpretation of Inline graphic and Inline graphic in Observed Clusters

Section 1995 discussed how Inline graphic and Inline graphic in the model defined by 20052004 or 20112003 describe the YInline graphic association in complete clusters. Now we discuss how the same Inline graphic and Inline graphic relate to associations in observed clusters.

within-cluster fixed effects

When 2005 holds and Inline graphic is cluster varying with fixed effect, Inline graphic is not only the within-complete-cluster effect of Inline graphic, it is also the within-observed-cluster effect, which is the same in all observed clusters of size Inline graphic. That is, if two members of the same observed cluster of size Inline graphic have Inline graphic values that differ only by Inline graphic for some Inline graphic, then their expected values (transformed by link function g in the case of the GLMM) of Y differ by Inline graphic.

When considering within-observed-cluster effects of covariates with random effects, between-observed-cluster effects and causal effects, we find it convenient to introduce the concept of the LMM/GLMM given by equations 20052004 or 20112003 “describing observed random subclusters.” For a cluster with Inline graphic, let Inline graphic denote the set of indices of a simple random sample of size n from the N observed members, and let Inline graphic. Note that Inline graphic is the same as what we denoted in Section 2004 by H. We say “the LMM given by 20052004 describes observed random subclusters of size n from observed clusters of size Inline graphic” (or, more concisely, “the LMM describes observed random subclusters of size n”) if

graphic file with name biom0070-0449-m283.jpg (8)
graphic file with name biom0070-0449-m284.jpg (9)
graphic file with name biom0070-0449-m285.jpg (10)
graphic file with name biom0070-0449-m286.jpg (11)

where Inline graphic and Inline graphic in 20112011 are the same parameters (i.e., have the same values) as in equations 20052004. Similarly, “the GLMM (given by 20112003) describes observed random subclusters of size n” if

graphic file with name biom0070-0449-m289.jpg (12)

and 20072011 hold. If 20112011 or 20072011 hold for one or more values of n, we have a basis for interpreting the estimates of Inline graphic and Inline graphic obtained by fitting the LMM/GLMM given by 20052003 (which describes complete clusters) in terms of effects in observed clusters. We give these interpretations below. Later (Proposition 2) we give sufficient conditions for the LMM/GLMM to describe observed random subclusters of size n and (Section 2008) show what can happen when these conditions are not satisfied. Note that the statement that LMM/GLMM describes random subclusters of size n is a statement about the YInline graphic relation only in observed members of clusters with Inline graphic; the association in missing members or in clusters with Inline graphic is not relevant. We shall focus on Inline graphic when discussing between-cluster effects, but for within-cluster effects we need Inline graphic, because within-cluster comparisons only make sense in clusters with at least two members. In most realistic settings, if the sufficient conditions (Proposition 2) are satisfied for n, they are also satisfied for Inline graphic.

within-cluster random effects

If the LMM/GLMM describes observed random subclusters of size n (with Inline graphic) and Inline graphic is a cluster-varying covariate with random effect, then Inline graphic and Inline graphic are the mean and variance of the within-observed-cluster effect of Inline graphic. That is, if an observed cluster is randomly sampled conditional on Inline graphic and on n members randomly chosen from it having Inline graphic values that differ only in Inline graphic, then the expected values (transformed by link function g) of Y of any pair of these n members differ by Inline graphic, where Inline graphic is the difference between their Inline graphic values, and the distribution of Inline graphic is given by Inline graphic.

between-cluster effects

If the LMM/GLMM describes observed random subclusters of size Inline graphic, Inline graphic are the between-observed-cluster effects of Inline graphic. That is, if two clusters each with Inline graphic are randomly sampled conditional on Inline graphic in one cluster and Inline graphic in the other, then the difference between the expectations of Inline graphic in the two clusters is

graphic file with name biom0070-0449-m318.jpg (13)

Since 1995 has the same form as 2001, between-cluster effects in observed and complete clusters are equal and Inline graphic and Inline graphic describe them both. As with 2001, 1995 reduces to Inline graphic for the LMM. When Inline graphic has fixed effect, this is true even if Inline graphic is not independent of N, so 2009 is not necessary for Inline graphic to be interpreted as a between-observed-cluster fixed effect in a LMM.

causal effects

Let Inline graphic be manipulable and the “causal assumptions” of Section 1995 hold. Let Inline graphic Inline graphic and Inline graphic. If the LMM/GLMM describes observed random subclusters of size n (Inline graphic) and Inline graphic, then Inline graphic and Inline graphic describe a causal effect of Inline graphic in observed random subclusters of size n. That is, the expected causal effect given Inline graphic and Inline graphic in the members whose indices belong to Inline graphic is equal to Inline graphic with Inline graphic, and the expected causal effect given Inline graphic is equal to Inline graphic. For the LMM when Inline graphic has fixed effect, Inline graphic reduces to Inline graphic even if 2009 does not hold. Note that if Inline graphic depends on Inline graphic, this causal interpretation is problematic because membership of observed clusters may change as Inline graphic is manipulated, that is, some observed members would not have been observed if their Inline graphic values had been otherwise, while some missing members would have been observed.

Proposition 2

The LMM/GLMM describes observed random subclusters of size n if (i) Inline graphic, where Inline graphic is a cluster-constant subvector of Inline graphic; either (iia) Inline graphic are exchangeable given M or (iib) Inline graphic whenever Inline graphic is a permutation of Inline graphic; and (iii) Inline graphic.

Note that (iii) holds if the minimum possible observed cluster size is Inline graphic, but is unlikely to hold otherwise; and if (iii) is replaced by the weaker condition Inline graphic, then 2011, 2007 and 2011 still hold, but 2009 may not.

5.4. Situations Where Complete- and Observed-Cluster Effects Differ

With the exceptions mentioned above (i.e., within-cluster fixed effects, and between-cluster and causal fixed effects in LMMs when 2007 holds), Inline graphic and Inline graphic may not be so interpretable in terms of effects in observed clusters if 2007 or 2009 do not hold.

Suppose that 2009 with Inline graphic does not hold and Inline graphic has a random effect. The between-observed-cluster effect of Inline graphic is given by 1995 with Inline graphic replaced by Inline graphic. In particular, it does not reduce to Inline graphic for the LMM unless Inline graphic. Similarly, the observed-cluster causal effect Inline graphic is, in general, not the same as the complete-cluster causal effect Inline graphic; and the within-observed-cluster effect will not, in general, have mean Inline graphic and variance implied by Inline graphic.

In the following example, 2007 does not hold for Inline graphic. Suppose clusters are old people in a cohort study of cognitive function Y. A LMM is used, with a random effect for time because rate of cognitive decline varies between people. Assume a fixed effect for the intercept. The only missing data are due to death: Inline graphic if person i is alive at time j; Inline graphic if dead. So, Inline graphic, Inline graphic, Inline graphic and missingness is monotone. Suppose people with more rapid decline (more negative Inline graphic) tend to die earlier. The within-complete-cluster effect of Inline graphic has mean Inline graphic and variance Inline graphic. The mean and variance of the within-observed-cluster effect are functions of Inline graphic: they both diminish as Inline graphic increases. This is because the subsample still alive at later times is enriched for high Inline graphic. In this setting “complete-cluster” inference has been called inference for a hypothetical immortal cohort, and it has been suggested that “observed-cluster” inference (describing the population still alive at each timepoint) is of more interest (Dufouil et al., 2004). See Section 2011 and Web Appendix F for examples of between-cluster or causal effects differing in complete and observed clusters.

5.5. Observed Clusters Without Complete Clusters

Dunson et al. (2003), Chen et al. (2011) and Gueorguieva (2005) wanted observed-cluster inference when “complete clusters” do not exist, for example, toxicology experiments where clusters are litters. Dunson et al. and Gueorguieva assumed cluster-constant Inline graphic, Inline graphic and Inline graphic. Chen et al. assumed Inline graphic was cluster constant or a function of j (e.g., Inline graphic), Inline graphic and Inline graphic. It can be seen that these methods give complete-cluster inference for a hypothetical population of complete clusters in which Inline graphic and from which the population of observed clusters would be generated by applying monotone missingness mechanism Inline graphic. However, they do not only provide complete-cluster inference. When, as in Dunson et al. and Gueorguieva, Inline graphic is cluster constant and Inline graphic, conditions (i), (iia) and (iii) of Proposition 2 hold with Inline graphic, so Inline graphic and Inline graphic are also between-cluster or causal effects in observed clusters. When, as in Chen et al., Inline graphic is cluster varying, Inline graphic and Inline graphic, non-intercept elements of Inline graphic are within-observed-cluster effects.

6. Example: Psoriatic Arthritis

This example shows a model that ostensibly describes observed clusters but some of whose parameters relate only to a population of complete clusters with no obvious meaning. Husted et al. (2007) analyzed a cohort of 382 psoriatic arthritis (PsA) patients. Physical function was measured by the health assessment questionnaire score (HAQ). HAQ is semi-continuous: it is zero (no disability) with positive probability and otherwise varies continuously up to 3 (severe disability). 31% of the 2107 HAQ scores were zero. They separately modeled Inline graphic (the “binary-part”) and HAQ given Inline graphic (the “continuous-part”), using, respectively, logistic regression with random intercept Inline graphic and linear regression with random intercept Inline graphic. Both parts had the same covariates (sex, time since onset, etc.), and all covariates had fixed effects. Among the conclusions was that being female predicted higher HAQ when Inline graphic, adjusting for other covariates.

Here, clusters are patients and “observed cluster” means a patient's set of non-zero scores. Su et al. (2009) noted that estimates for the continuous part might be biased because separate modeling of binary and continuous parts did not account for ICS caused by the model for the binary part determining the observed cluster size in the continuous part. So, they modified Husted et al.’s model by replacing Inline graphic by Inline graphic, where Inline graphic is unknown. They called this shared random-effect model the “latent-process model” (SAS code provided in Web Appendix G). They also used a correlated random effects model, but results were similar.

In the original (misspecified) model of Husted et al., the estimated sex effect in the continuous part was 0.181 (SE 0.051). In the latent-process model, it was 0.246 (SE 0.052) ( Table 1). We focus on the meaning of this latter estimate. We emphasize there is nothing intrinsically wrong with the latent-process model. It can validly be used to predict HAQ. What is important is not to misinterpret the parameters in the continuous part. As this is an LMM and sex is cluster-constant with fixed effect, the estimated sex effect, 0.246, describes the between-cluster effect in “complete clusters,” that is, in a hypothetical world in which all scores are somehow non-zero. The meaning and scientific interest of this hypothetical world, analogous to the world of “immortal cohorts,” is unclear.

Table 1.

Estimates for latent process model and marginal model fitted to psoriatic arthritis data

latent process model
marginal model
binary part
continuous part
Parameter estim SE estim SE estim SE
Intercept −0.9909 0.3556 0.1748 0.0555 0.263 0.0669
Age at onset 0.6392 0.1538 0.0984 0.0250 0.115 0.0267
Female 2.0037 0.3149 0.2461 0.0523 0.100 0.0580
PsA disease duration 0.0166 0.0220 0.0044 0.0032 0.004 0.0041
Actively inflamed joints 0.1380 0.0465 0.0243 0.0027 0.023 0.0045
Clinically deformed joints 0.0179 0.0238 0.0051 0.0031 0.007 0.0037
PASI score 0.1543 0.1017 0.0257 0.0134 −0.005 0.0237
Morning stiffness 1.5691 0.2018 0.1620 0.0262 0.273 0.0444
ESR 0.2971 0.1103 0.0374 0.0126 0.065 0.0232
Medication:
 NSAIDs 0.2960 0.2439 −0.0181 0.0280 −0.235 0.0467
 DMARDs 0.3138 0.2197 0.0226 0.0272 0.003 0.0442
 steroids 0.9927 0.4355 0.0481 0.0441 0.049 0.0553
Actively inflamed jointsInline graphicdisease duration 0.0003 0.0031 −0.0005 0.0002 0.0000 0.0002
Clinically deformed jointsInline graphicdisease duration 0.0018 0.0011 0.0003 0.0001 0.0000 0.0001
Var(u) 4.2641 0.9001
Inline graphic 0.2074 0.0210
Inline graphic 0.0779 0.0039

Su et al. (2009) do not comment on the meaning of their estimated sex effect, but suppose one wished to interpret it as an effect in observed clusters, as done in Husted et al. (2007). As all the covariates have fixed effects, estimates for cluster-varying covariates can be interpreted unproblematically as within-cluster effects in complete or observed clusters. However, sex is cluster-constant. To illustrate the problem with interpreting the estimated sex effect, 0.246, as a between-cluster effect in observed clusters, we obtained the empirical Bayes estimate of each patient's random intercept Inline graphic. While the means of Inline graphic were 0.005 and 0.016 for men and women, respectively, means of Inline graphic for observations on men and women when Inline graphic were 0.165 and 0.043. This difference arises because in the binary part of the model the estimated sex effect is 2.00 (SE 0.31), meaning that a woman was more likely to have Inline graphic than a man with the same values of other covariates. So, if we compare a man and woman who both have Inline graphic and have the same time since onset and other covariate values, we expect the woman's HAQ to be not 0.246 greater but only Inline graphic greater. Note that in Su et al.’s model, none of the conditions of Proposition 2 hold for any n.

We also used IEE to fit a model for Inline graphic, the conditional mean of HAQ given sex, time since onset, etc. and Inline graphic ( Table 1). The estimated sex effect is 0.100 (SE 0.031), which is close to the effect, 0.124, worked out above using empirical Bayes estimates.

In conclusion, the estimated sex effect in the continuous part of the latent-process model (and correlated random-effects model) describes the association between sex and HAQ in a hypothetical population of little scientific interest; for this dataset it overstates the size of the effect in the population of scientific interest. In further work, Su et al. (2011) found an association of genotype HLA-B27 with HAQ when Inline graphic. The same interpretation problem applies here: this association refers to the hypothetical “complete” clusters.

7. Discussion

We have shown that shared random-effect models do not always describe observed clusters, except for cluster-varying covariates with fixed effects or under the conditions of Proposition 2. The models of Dunson et al. (2003), Gueorguieva (2005) and Chen et al. (2011) are unnecessarily restrictive. They assume either cluster-constant Inline graphic or that N does not depend on Inline graphic. Proposition 2 shows Inline graphic can be cluster varying if N depends only on cluster-constant elements. The assumptions required do, however, remain restrictive. WIEE relate to IPW for missing data. DWGEE2 give inference for a hypothetical population of complete clusters that is, in general, neither unique nor of scientific interest.

For binary Y, Li et al. (2011) used a correlated random-intercepts model with bridge distributions, so that Inline graphic. For a single binary X, they compared the log odds ratios in complete and observed clusters. They found the difference was small when the variance of the random intercepts or the correlation between them was small. However, when random-intercept variances and/or correlation are small, cluster size is only weakly informative; when size is strongly informative, inferences for complete and observed clusters will differ more. We replicated Li et al's study and found the two log odds ratios could differ by as much as 25% when Inline graphic, and 56% when Inline graphic (see Web Appendix H).

We have assumed Y and Inline graphic are observed in all members for which we wish to make inference. Dufouil et al. (2004) and Shardell and Miller (2008) give methods for when this is not so.

Having illustrated the danger of misinterpreting estimates, we recommend careful thought about which inference is of scientific interest and which analysis method will give it.

8. Supplementary Materials

Web Appendices referenced in Sections 3–7 are available with this paper at the Biometrics website on Wiley Online Library.

Acknowledgments

SRS is funded by MRC grants U1052 60558 and MC_US_A030_0015, AJC and MP by MRC grant G0600657. We thank Brian Tom for helpful comments on a draft of this article, and Li Su for providing the PsA data and advising on the use of SAS.

Supporting Information

Additional Supporting Information may be found in the online version of this article.

Supporting Information.

biom0070-0449-sd1.pdf (127.6KB, pdf)

References

  1. Albert P, Follmann D, Fitzmaurice G, Davidian M, Verbeke G, Molenberghs G. Shared-parameter models. Longitudinal Data Analysis. 2009. pp. 433–452. Chapman & Hall/CRC, Boca Raton, Florida.
  2. Benhin E, Rao J, Scott A. Mean estimating equation approach to analysing cluster-correlateddata with nonignorable cluster sizes. Biometrika. 2005;92:435–450. [Google Scholar]
  3. Chen Z, Zhang B, Albert P. A joint modeling approach to data with informative cluster size:Robustness to the cluster size model. Statistics in Medicine. 2011;30:1825–1836. doi: 10.1002/sim.4239. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Dufouil C, Brayne C, Clayton D. Analysis of longitudinal studies with death and drop-out: A casestudy. Statistics in Medicine. 2004;23:2215–26. doi: 10.1002/sim.1821. [DOI] [PubMed] [Google Scholar]
  5. Dunson D, Chen Z, Harry J. A Bayesian approach for joint modeling of cluster size andsubunit-specific outcomes. Biometrics. 2003;59:521–530. doi: 10.1111/1541-0420.00062. [DOI] [PubMed] [Google Scholar]
  6. Gueorguieva R. Comments about joint modelling of cluster size and binary andcontinuous subunit-specific outcomes. Biometrics. 2005;61:862–867. doi: 10.1111/j.1541-020X.2005.00409_1.x. [DOI] [PubMed] [Google Scholar]
  7. Hoffman E, Sen P, Weinberg C. Within-cluster resampling. Biometrika. 2001;88:1121–1134. [Google Scholar]
  8. Huang Y, Leroux B. Informative cluster size for subcluster-level covariates and weightedgeneralized estimating equations. Biometrics. 2011;67:843–851. doi: 10.1111/j.1541-0420.2010.01542.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Husted J, Tom B, Farewell V, Schentag C, Gladman D. A longitudinal study of the effect of disease activity and clinicaldamage on physical function over the course of psoriatic arthritis: Does theeffect change over time. Arthritis and Rheumatism. 2007;56:840–849. doi: 10.1002/art.22443. [DOI] [PubMed] [Google Scholar]
  10. Kurland B, Johnson L, Egleston B, Diehr P. Longitudinal data with follow-up truncated by death: Match theanalysis method to research aims. Statistical Science. 2009;24:211–222. doi: 10.1214/09-STS293. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Li X, Bandyopadhyay D, Lipsitz S, Sinha D. Likelihood methods for binary responses of present components in acluster. Biometrics. 2011;67:629–635. doi: 10.1111/j.1541-0420.2010.01483.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Neuhaus J, McCulloch C. Estimation of covariate effects in generalized linear mixed modelswith informative cluster sizes. Biometrika. 2011;98:147–162. doi: 10.1093/biomet/asq066. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Robins J-M, Rotnitzky A, Zhao L-P. Analysis of semiparametric regression models for repeated outcomes inthe presence of missing data. Journal of the American Statistical Association. 1995;90:106–121. [Google Scholar]
  14. Seaman S, Galati J, Jackson D, Carlin J. What is meant by ‘missing at random’? Statistical Science. 2013;28:257–268. [Google Scholar]
  15. Shardell M, Miller R. Weighted estimated equations for longitudinal studies with death andnon-monotone missing time-dependent covariates and outcomes. Statistics in Medicine. 2008;27:1008–25. doi: 10.1002/sim.2964. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Su L, Tom B, Farewell V. Bias in 2-part mixed models for longitudinal semicontinuous data. Biostatistics. 2009;10:374–389. doi: 10.1093/biostatistics/kxn044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Su L, Tom B, Farewell V. A likelihood-based two-part marginal model for longitudinalsemi-continuous data. Statistical Methods in Medical Research. 2011 doi: 10.1177/0962280211414620. DOI: 10.1177/0962280211414620. Available at: http://smm.sagepub.com/content/early/2011/08/25/0962280211414620.abstract. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Vansteelandt S. On confounding, prediction and efficiency in the analysis oflongitudinal and cross-sectional clustered data. Scandinavian Journal of Statistics. 2007;34:478–498. [Google Scholar]
  19. Wang M, Kong M, Datta S. Inference for marginal linear models for clustered longitudinal datawith potentially informative cluster sizes. Statistical Methods in Medical Research. 2011;20:347–367. doi: 10.1177/0962280209347043. [DOI] [PubMed] [Google Scholar]
  20. Wang Z, Louis T. Matching conditional and marginal shapes in binary random interceptmodels with a bridge distribution function. Biometrika. 2003;90:765–775. [Google Scholar]
  21. Williamson J, Datta S, Satten G. Marginal analyses of clustered data when cluster size is informative. Biometrics. 2003;59:36–42. doi: 10.1111/1541-0420.00005. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Information.

biom0070-0449-sd1.pdf (127.6KB, pdf)

Articles from Biometrics are provided here courtesy of Wiley

RESOURCES