Introduction
Cluster randomized trials (CRTs) provide a powerful and flexible experimental design to evaluate interventions delivered at the group level.1 The clustered design, however, creates an analytical issue as the outcomes of participants in one cluster tend to be more similar to each other than those in different clusters, creating a positive within-cluster correlation that reflects extra variation attributable to each cluster. Power calculations for CRTs often account for such extra variability via the intracluster correlation coefficient (ICC) or the coefficient of variation (CV).
Motivated by a vector control trial with a count outcome (i.e. number of episodes of clinical malaria), Mwandigha et al.2 examined the impact of right truncation on the statistical power of CRTs. The authors found that several factors in this setting result in the attenuation of statistical power. For this reason, existing sample size formulae may severely underestimate the required sample size when outcomes are truncated. In addition, because closed-form sample size formulae for truncated counts are lacking, they proposed a simulation-based approach to calculate power.
The results of their study are important and of practical value for designing CRTs. In this commentary, we seek to expand on their findings. Specifically, their current results are restricted to the conditional model (generalized linear mixed model), which carries a cluster-specific interpretation of the intervention effect. In contrast, the marginal model, estimated by the generalized estimating equations (GEE), is a commonly used alternative where the corresponding intervention effect parameter carries a population-averaged interpretation. We discuss the connections and differences between a conditional and marginal analysis of clustered counts and explore the effect of right truncation on the marginal analysis of CRTs.
Conditional model
Let represent the observed number of malaria episodes recorded for individual () in village (. Mwandigha et al.2 assumed a random-effects model for the event rate
where is the binary intervention indicator and is the cluster random effect reflecting the extra variability. The observed count follows a Poisson distribution subject to right truncation at , with the probability mass function (PMF)
where we define . The expectation of the truncated distribution is , which is no larger than the expectation of the original distribution, .
In the absence of truncation such that , the above PMF corresponds to the standard Poisson because . In this case, is the baseline event rate conditional on the random effect . The actual baseline event rate in each cluster is , which differs from the overall baseline event rate 3,4 The intervention effect is the rate ratio (RR) conditional on the latent random effect . In the presence of right truncation, however, remains the conditional RR associated with the unobserved Poisson distribution had there been no truncation. Therefore, it is important not to mistakenly interpret the effect measure as the RR associated with the observed distribution.
Marginal model
In contrast, the marginal approach models the population-averaged RR corresponding to the observed distribution. The observed event rate is modelled by
where is the average baseline event rate, and is the population-averaged RR that corresponds to the observed distribution. Without truncation, it is known that the conditional model is collapsible such that and 3 In this case, the interpretation of treatment parameters from both models is equivalent. Right truncation, however, leads to non-collapsibility. Marginalizing over the random effect in the conditional model, we obtain the marginal expectation in the intervention group as ; the marginal expectation in the control group is similarly calculated by replacing in the expression for . Clearly, the marginal RR is , which approaches null as the truncation point moves towards one.
For clustered count outcomes, the Poisson GEE specifies the Poisson variance function and a working correlation model. While the variance estimator of the random-effects model is only valid when the likelihood is correctly specified, the Poisson GEE automatically accounts for misspecification through the sandwich variance method.4 In other words, correctly estimates the marginal RR for the observed distribution even when the Poisson variance and the correlation model are both misspecified. In particular, the Poisson GEE (or modified Poisson regression) is a viable alternative to the log-binomial regression for analysing clustered binary data.5 This scenario is identical to the extreme case of right truncation where .
Validity of marginal analysis
Since the marginal model is agnostic to the correct likelihood, a natural question is whether such a model is valid under right truncation. To empirically verify the validity of Poisson GEE, we replicate the simulations in Mwandigha et al.2 with a single cohort. We use their conditional model to generate correlated counts except that the intervention effect is set to be null (), and analyse each dataset with the Poisson GEE. In this simulation, both the variance and correlation model are misspecified, but the sandwich variance accounts for such misspecification. The statistical literature suggests that the performance of the GEE sandwich variance can be improved by finite-sample corrections.6 Our own research articles in CRTs also recommended the use of a -test coupled with the Kauermann and Carroll bias-corrected sandwich variance6 for better control of test size.7–10 The bias corrections have now been implemented in PROC GLIMMIX in SAS, the geesmv package in R, and our recent Stata package xtgeebcv.11 Because the empirical results are similar between the independence and the exchangeable working correlation model in the simulations, we only present the latter for brevity. From Table 1, it is evident that the Poisson GEE has adequate control of the type I error rate regardless of truncation. These empirical results validate the marginal analysis of CRTs with truncated counts, when appropriate finite-sample corrections are used for the sandwich variance.
Table 1.
Empirical type I error rates of the Poisson GEE analyses with the exchangeable working correlation model. The results are based on 1000 simulations. The -test is constructed based on the Kauermann and Carroll bias-corrected sandwich variance and degrees of freedom. The log-linear random effects model is used to generate outcome data with and without right truncation
Parameters for DGPa | Empirical Type I error ratesb % | ||||||
---|---|---|---|---|---|---|---|
T = ∞ | |||||||
(1.25, 1) | 0.05 | 30 | 15 | 5.3 | 5.6 | 4.7 | 6.0 |
0.10 | 30 | 45 | 5.3 | 5.2 | 5.3 | 5.4 | |
0.20 | 60 | 35 | 6.3 | 6.1 | 5.6 | 5.5 | |
0.30 | 90 | 40 | 6.0 | 5.9 | 4.8 | 5.9 | |
0.40 | 110 | 40 | 5.0 | 4.9 | 4.3 | 4.5 | |
(2.70, 1) | 0.05 | 25 | 10 | 5.6 | 5.0 | 5.2 | 4.4 |
0.10 | 30 | 30 | 6.9 | 6.9 | 6.3 | 5.9 | |
0.20 | 55 | 20 | 6.1 | 5.5 | 5.4 | 5.5 | |
0.30 | 80 | 30 | 6.1 | 5.5 | 4.8 | 5.2 | |
0.40 | 110 | 25 | 5.8 | 5.4 | 5.7 | 4.8 |
The first four columns are values used in the data generating process (DGP): the conditional baseline event rate , conditional RR , between cluster variance in the log scale , number of clusters and average cluster sizes .
The last four columns correspond to results under different truncation points.
Impact of truncation on marginal analysis
To inform the design of CRTs based on marginal models, we further investigate the impact of right truncation on marginal analysis, and compare with the patterns observed in Mwandigha et al.2 We maintain their data-generating process and set the true conditional RR as . Because the Poisson GEE models the marginal RR that corresponds to the observed distribution, the true effect size decreases as approaches 1. For example, when the conditional baseline event rate is 1.25, while the conditional RR corresponding to the unobserved Poisson distribution is always , the marginal RR corresponding to the observed counts becomes 0.72, 0.82 and 0.9, as equals 6, 3 and 1. Table 2 presents the empirical power of the Poisson GEE analysis with the bias-corrected variance. Only the results based on the exchangeable working correlation are presented, because those based on the independence working correlation are nearly identical.
Table 2.
Empirical power of the Poisson GEE analyses with the exchangeable working correlation model. The results are based on 1000 simulations. The -test is constructed based on the Kauermann and Carroll bias-corrected sandwich variance and degrees of freedom. The log-linear random effects model is used to generate outcome data with and without right truncation
Parameters for DGPa | Empirical powerb % | ||||||
---|---|---|---|---|---|---|---|
(1.25, 0.7) | 0.05 | 30 | 15 | 81.6 | 81.4 | 75.6 | 38.0 |
0.10 | 30 | 45 | 77.1 | 76.5 | 74.3 | 56.6 | |
0.20 | 60 | 35 | 79.3 | 79.6 | 78.8 | 66.6 | |
0.30 | 90 | 40 | 79.0 | 79.8 | 80.0 | 73.3 | |
0.40 | 110 | 40 | 73.7 | 76.8 | 78.9 | 73.3 | |
(2.70, 0.7) | 0.05 | 25 | 10 | 79.4 | 77.1 | 58.2 | 20.6 |
0.10 | 30 | 30 | 79.5 | 78.9 | 71.4 | 44.8 | |
0.20 | 55 | 20 | 76.0 | 76.1 | 71.1 | 47.5 | |
0.30 | 80 | 30 | 76.0 | 78.6 | 76.8 | 59.7 | |
0.40 | 110 | 25 | 74.3 | 78.6 | 76.5 | 65.2 |
The first four columns are values used in the data generating process (DGP): the conditional baseline event rate , conditional RR , between cluster variance in the log scale , number of clusters and average cluster sizes .
The last four columns correspond to results under different truncation points.
The following patterns are observed. First, the Poisson GEE has slightly lower power than the random-effects model. This is expected because the Poisson GEE is agnostic to the true data-generating process and is not likelihood-based. Second, right truncation can similarly attenuate the power of the marginal analysis, and the effect is more pronounced (i) when the number of clusters is not large, (ii) when the baseline event rate is higher and (iii) when the truncation point approaches one. That is to say, the patterns observed in Mwandigha et al. apply to marginal analysis, indicating the necessity to account for right truncation in the sample size calculation for marginal analysis. However, an interesting curvilinear relationship between the power and truncation point also emerges with marginal analysis when the number of clusters is at least 80. In those cases, the power of the Poisson GEE first increases as becomes smaller but then decreases when moves closer to 1, suggesting right truncation does not monotonically reduce the power. This result arises due to slight differences in operating characteristics between conditional and marginal analysis of truncated counts, and motivates us to more carefully extend the message in Mwandigha et al.2 to alternative models. Importantly, although we recommend accounting for truncation in sample size calculation for both conditional and marginal analyses, we realize that the effect of right truncation on power depends on the pre-specified analysis model and the number of clusters, as well as the truncation point.
Summary
Conditional models and marginal models are two commonly used approaches for the design and analysis of CRTs. We support the findings of Mwandigha et al.2 and interpret their findings in the context of marginal analysis of CRTs. We conclude that right truncation generally attenuates the power of both the conditional and marginal analyses, with a few exceptions when a marginal analysis is applied on a large number of clusters. The simulation-based approach enables researchers to explore the effect of truncation on alternative modelling strategies, and, as we demonstrate, can also be useful to provide accurate power estimates for marginal analysis of truncated counts.
Funding
F.L’s work is supported within the National Institutes of Health (NIH) Health Care Systems Research Collaboratory by the NIH Common Fund through cooperative agreement U24AT009676 from the Office of Strategic Coordination within the Office of the NIH Director and cooperative agreement UH3DA047003 from the National Institute on Drug Abuse. M.O.H’s work is supported by the United States National Institutes of Health, National Heart, Lung, and Blood Institute (R00 HL141678). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Conflict of Interest
None declared.
References
- 1. Turner EL, Li F, Gallis JA, Prague M, Murray DM.. Review of recent methodological developments in group-randomized trials: part 1—design. Am J Public Health 2017;107:907–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Mwandigha LM, Fraser KJ, Racine-Poon A, Mouksassi M-S, Ghani AC.. Power calculations for cluster randomised trials (CRTs) with right-truncated Poisson-distributed outcomes: a motivating example from a malaria vector control trial. Int J Epidemiol 2020;49(3):954–62. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Ritz J, Spiegelman D.. Equivalence of conditional and marginal regression models for clustered and longitudinal data. Stat Methods Med Res 2004;13(4):309–23. [Google Scholar]
- 4. Young ML, Preisser JS, Qaqish BF, Wolfson M.. Comparison of subject-specific and population averaged models for count data from cluster-unit intervention trials. Stat Methods Med Res 2007;16(2):167–84. [DOI] [PubMed] [Google Scholar]
- 5. Zou GY, Donner A.. Extension of the modified Poisson regression model to prospective studies with correlated binary data. Stat Methods Med Res 2013;22(6):661–70. [DOI] [PubMed] [Google Scholar]
- 6. Kauermann G, Carroll RJ.. A note on the efficiency of sandwich covariance matrix estimation. J Am Stat Assoc 2001;96(456):1387–96. [Google Scholar]
- 7. Li F, Turner EL, Preisser JS.. Sample size determination for GEE analyses of stepped wedge cluster randomized trials. Biometrics 2018;74(4):1450–58. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Li F, Forbes AB, Turner EL, Preisser JS.. Power and sample size requirements for GEE analyses of cluster randomized crossover trials. Stat Med 2019;38(4):636–49. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Li F. Design and analysis considerations for cohort stepped wedge cluster randomized trials with a decay correlation structure. Stat Med 2020;39(4):438–55. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Li F, Turner EL, Heagerty PJ, Murray DM, Vollmer WM, DeLong ER.. An evaluation of constrained randomization for the design and analysis of group-randomized trials with binary outcomes. Statistics in Medicine 2017;36:3791–806. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Gallis JA, Li F, Turner EL.. xtgeebcv: A command for bias-corrected sandwich variance estimation for GEE analyses of cluster randomized trials. Stata J. In press. [DOI] [PMC free article] [PubMed] [Google Scholar]