Skip to main content
Wiley Open Access Collection logoLink to Wiley Open Access Collection
. 2021 Oct 15;41(1):128–145. doi: 10.1002/sim.9226

Ratio estimators of intervention effects on event rates in cluster randomized trials

Xiangmei Ma 1, Paul Milligan 2, Kwok Fai Lam 1,3, Yin Bun Cheung 1,4,5,
PMCID: PMC9292872  PMID: 34655097

Abstract

We consider five asymptotically unbiased estimators of intervention effects on event rates in non‐matched and matched‐pair cluster randomized trials, including ratio of mean counts r1, ratio of mean cluster‐level event rates r2, ratio of event rates r3, double ratio of counts r4, and double ratio of event rates r5. In the absence of an indirect effect, they all estimate the direct effect of the intervention. Otherwise, r1, r2, and r3 estimate the total effect, which comprises the direct and indirect effects, whereas r4 and r5 estimate the direct effect only. We derive the conditions under which each estimator is more precise or powerful than its alternatives. To control bias in studies with a small number of clusters, we propose a set of approximately unbiased estimators. We evaluate their properties by simulation and apply the methods to a trial of seasonal malaria chemoprevention. The approximately unbiased estimators are practically unbiased and their confidence intervals usually have coverage probability close to the nominal level; the asymptotically unbiased estimators perform well when the number of clusters is approximately 32 or more per trial arm. Despite its simplicity, r1 performs comparably with r2 and r3 in trials with a large but realistic number of clusters. When the variability of baseline event rate is large and there is no indirect effect, r4 and r5 tend to offer higher power than r1, r2, and r3. We discuss the implications of these findings to the planning and analysis of cluster randomized trials.

Keywords: cluster randomized trial, event rate, incidence rate ratio, ratio estimator, relative incidence

1. INTRODUCTION

The cluster randomized trial (CRT) is an important study design in medical and health research. 1 , 2 , 3 Data on outcome events may be collected by passive surveillance or active surveillance. 4 Passive surveillance methods may or may not provide data at the individual level. That is, they may determine only the number of events in a cluster, without identifying which individual members of the cluster experienced the events. Furthermore, the denominators for standard practice of calculating event rates may not be available. 4 The advantage of passive surveillance is that the monetary and opportunity cost of data collection can be much reduced.

Broadly speaking, there are two approaches to the analysis of CRTs: individual‐level analysis and cluster‐level analysis. Methods for individual‐level analysis of CRTs include random‐effects models and generalized estimating equations. As compared to cluster‐level analysis, individual‐level analysis has the relative advantage of efficiency and ease in covariate adjustment. However, it has the relative disadvantage of being less robust, especially when the number of clusters is small. 1 Furthermore, data collection by passive surveillance may not be compatible with individual‐level analysis. In this manuscript we consider only cluster‐level analysis.

An estimator of the intervention effect in terms of incidence rate ratio, also called relative incidence, that only uses event data is a ratio of the arithmetic mean of the number of outcome events per cluster in the intervention arm to that in the control arm. We call this the “ratio of mean counts”, denoted by r1.

The denominator of event rates, that is, units of person‐time, in CRTs is usually variable across clusters. The person‐time for estimation of an event rate is sometimes approximated by the population size at some point of the study duration. In this article we use the phrases person‐time and population size interchangeably. Typical statistical practice makes comparison of event rates instead of mean number of events between trial arms. It requires extra resources in the collection of person‐time data. A demographic surveillance system, a population census, or rounds of community surveys may be required for this purpose. With both the number of events and person‐time collected for each cluster, one may calculate a cluster‐level event rate for each cluster, denoted by cij=yij/pij, where yij and pij are the number of events and person‐time in the jth cluster in the ith trial arm, respectively. Then, the arithmetic means of the cluster‐level event rates in the intervention and control arms are calculated, denoted by c1 and c0, respectively. The ratio of the two means, r2=c1/c0 is a popular estimator of the incidence rate ratio. 1 , 5 , 6 This estimator has been evaluated by simulations but not analytically. One simulation study considered scenarios of, approximately, CVcij = 0, 0.125, and 0.25, where CVcij is the coefficient of variation of the cluster‐level event rate in the ith trial arm and CVc0j is known as “k” in the literature. 5 It found little bias in r2. However, another simulation study considered a broader range of CVcij. 6 It showed that r2 was practically unbiased when CVcij = 0.05 and 0.15, but it was biased when CVcij = 0.4. Analytical investigation and simulation evaluation in a broad range of parameter values are warranted.

An alternative estimator of incidence rate ratio can be obtained by first calculating the event rate in each trial arm as the sum of the number of events divided by the sum of person‐time over the clusters, 7 , 8 and then calculate the ratio of these event rate estimates between the trial arms. We call this the “ratio of event rates”, denoted by r3. While ci (i = 0,1) is an unweighted average of cluster‐level event rates in the ith trial arm, the alternative estimator of event rate here can be seen as a weighted average of cluster‐level event rates, with the clusters' population sizes as weights.

In CRTs, interventions are often provided only to a specific group of the cluster members instead of all cluster members. For example, in studies of vaccines for pediatric infectious diseases, usually only young children in a specific age range are offered the interventions or its control comparators. Older children and adults are not. We refer to the two groups of cluster members as the target and non‐target groups. The outcome events may occur in both groups. In studies based on passive surveillance, the event data may be collected for the non‐target group in addition to the target group without much additional resources required, because the capital cost and infrastructure are already invested for the target group anyway. We consider an estimator that we call “double ratio of counts”, denoted by r4, by replacing the sums of person‐time in r3 by the sums of number of events in the non‐target groups. Note that this estimator is defined even if the number of events in the non‐target group is zero in some clusters, which is a realistic situation because usually the reason of it being a non‐target group is that the disease incidence is relatively low. The motivation for considering this estimator arises from not only concerns of feasibility and cost of data collection but also concerns of precision and power. From an epidemiological point of view, sometimes we anticipate that the event counts in the target and non‐target groups are highly correlated, because they are both the manifestation of the disease burden in the clusters. In particular, some events are highly localized, for example, infectious diseases occurring in small outbreaks. For such events, the correlation between number of events in the target and non‐target groups, say children inside and outside a vaccination age range, is likely to be much stronger than the correlation between the number of events and amount of person‐time in the target group. This advantage in correlation offers a potential for improved precision. Note that r4 and the three estimators aforementioned have different targets of estimation (estimands): r1, r2, and r3 estimate the total effect of the intervention whereas r4 estimates the direct effect. Details will be discussed in the next section.

We also propose a new estimator that we call “double ratio of event rates”, denoted by r5. It has the ratio of event rates between the target and non‐target groups in the intervention arm as the numerator and its counterpart in the control arm as the denominator. Details in statistical notations will be provided in the next section. We hypothesize that this estimator will out‐perform r4 in precision and power.

Donner and Klar pointed out that CRTs of binary outcomes may regard a proportion as a ratio and then an appropriate variance estimate can be obtained from sample survey theory. 9 They used the ratio of this estimated variance to the estimated binomial variance to adjust the Chi‐square statistics for hypothesis testing. In the context of toxicological experiments in which litters of animals were the experimental units and a binary outcome was observed for each animal, Rao and Scott proposed using the aforementioned approach to adjust the Chi‐square and Cochran‐Armitage statistics. 10 There has been some subsequent research on using ratio estimators for CRTs with event rate outcomes, including the two simulation studies of r2 aforementioned. 5 , 6 Furthermore, Dufault and Jewell proposed permutation tests of counts of events only, with or without adjustment for differential ascertainment. 4 All of them concerned only CRTs that randomize clusters individually and (implicitly) aim to estimate the total effect.

This study aims to (a) evaluate and compare the performance of the five estimators aforementioned and (b) develop, evaluate, and compare bias‐corrected version of them. In Section 2 we will analytically assess and develop the methods. In Section 3 we will evaluate the methods by simulation in a broad range of realistic scenarios. In Section 4 we will apply the methods to a study of seasonal malaria chemoprevention. Section 5 gives some concluding remarks.

For brevity, we will focus on CRTs that randomize clusters individually, that is, non‐matched CRTs. Where necessary we also provide the details for matched‐pair CRTs in which one cluster per matched pair is randomized to receive the intervention and the other serves as the control. Introduction to the two types of CRTs can be found in, for example, Hayes and Moulton 1 and Donner and Klar. 2

2. STATISTICAL METHODS

2.1. Intervention effects and event rates

An intervention may have a direct effect and an indirect effect, for example, via reducing disease transmission in the community. 3 , 11 Only the intervention's target group can benefit from the direct effect; both the target and non‐target groups may benefit from the indirect effect, if any. Assume that:

βT=βD+βI

where βT, βD, and βI are the total, direct, and indirect effects in terms of log incidence rate ratio. If there is no indirect effect, βI=0 and βT=βD. The presence of an indirect effect depends on various factors including the nature of the interventions and outcome events. For example, even though vaccines are often anticipated to generate some degree of indirect effect on efficacy endpoints, they are usually anticipated to have no indirect effect on safety endpoints.

Let Yijk and Pijk be the number of events and total person‐time in the kth group of the jth cluster in the ith trial arm in the population, respectively, where k = 1 and 0 represent the target and non‐target groups, respectively, i = 1 and 0 represent intervention and control trial arms, respectively. We consider a data generating process that is often used in epidemiologic modeling, that the expected value of Yijk given Pijk is:

EYijk|Pijk=expβD,ik+βI,i+αij+γkPijk, (1)

where βD,ik and βI,i represent direct and indirect effects (βD,11=βD; βI,1=βI; βD,10=βD,01=βD,00=βI,0=0), αij is a random cluster effect with standard deviation SDαij ≥ 0 that represents variation in event rates between clusters within each trial arm, and γk represents the difference in event rates between the target and non‐target group (γ1=γ; γ0=0). Note that SDαijCVcij, the coefficient of variation in cluster‐level event rate. 5 , 6 By randomization, the distributions of αij and Pijk are identical in expectation between the intervention and control arms. From Equation (1), CVYijk=CVPijk and corrYijk,Pijk=1 if SDαij=0. The difference CVYijkCVPijk increases and the correlation corrYijk,Pijk decreases as SDαij increases.

2.2. Asymptotically unbiased estimators

Given a sample dataset of yijk,pijk:j=1,2,,ni;i=0,1;k=0,1, with randomization and a large number of clusters per trial arm, the ratio of mean counts, r1, provides an asymptotically unbiased estimator of the total effect that compares the event rates in the intervention and control arms:

r1=y1y0=j=1n1y1j1/n1j=1n0y0j1/n0j=1n1expβD+βI+α1j+γp1j1/n1j=1n0expα0j+γp0j1/n0=expβD+βIj=1n1expα1j+γp1j1/n1j=1n0expα0j+γp0j1/n0expβT.

When n1 and n0 are small, j=1n1expα1j+γp1j1/n1/j=1n0expα0j+γp0j1/n01,

causing a small sample bias in the ratio estimator. 7 This and the bias in the other estimators will be discussed in Section 2.3.

Similarly, the ratio of mean cluster‐level event rates,

r2=c1c0=j=1n1c1j/n1j=1n0c0j/n0,

where cij=yij1/pij1, and ratio of event rates,

r3=R1R0=j=1n1y1j1/j=1n1p1j1j=1n0y0j1/j=1n0p0j1,

also provide asymptotically unbiased estimators of the total effect.

The variance of r1, r2, and r3 can be written as: 5 , 7 , 8

Varr1=r12CVy1j12n1+CVy0j12n0, (2)
Varr2=r22Varc1c12+Varc0c02, (3)

and

Varr3=r32VarR1R12+VarR0R02. (4)

Furthermore, in first‐order Taylor series expansion, Varci=VarRi, 8 and

VarRi=Ri2niCVyij12+CVpij122CVyij1CVpij1corryij1,pij1.

From Equation (1), CVyijkCVpijk. Let CVpijk=θi×CVyijk, where 0<θi1.

If r1 and r2 give the same sample estimate, a comparison of Varr1 and Varr2 boils down to an evaluation of whether

niVarcici2CVyij12niniVarRici2CVyij12ni=Rici2CVyij12+CVpij122CVyij1CVpij1corryij1,pij1CVyij12=Rici21+θi22θicorryij1,pij1CVyij12CVyij12<0.

Therefore, Varr2<Varr1 under the condition that in both trial arms:

corryij1,pij1>1+θi2ciRi22θi. (5)

Similarly, if r1 and r3 give the same estimate, Varr3<Varr1 under the condition that in both trial arms:

corryij1,pij1>θi/2. (6)

If r2 and r3 give the same estimate, Varr2<Varr3 under the condition that in both trial arms VarRi/ci2<VarRi/Ri2, that is,

ci/Ri>1. (7)

If corryij1,pij1=0, ci/Ri>1 by Jensen's inequality. As we will see in the case study in Section 4, it is possible that corryij1,pij1 approximately equals zero in real‐world situations.

In contrast, with large n1,n0 and randomization, the double ratio of counts (r4) provides an asymptotically unbiased estimator of the direct effect:

r4=R1*R0*=j=1n1y1j1/j=1n1y1j0j=1n0y0j1/j=1n0y0j0
j=1n1expβD+βI+α1j+γp1j1/j=1n1expβI+α1jp1j0j=1n0expα0j+γp0j1/j=1n0expα0jp0j0=expβD.

The variance of r4 is:

Varr4=r42VarR1*R1*2+VarR0*R0*2, (8)

where

VarRi*Ri*2=1niCVyij12+CVyij022CVyij1CVyij0corryij1,yij0.

Comparisons of the variances of the estimators r2 vs r4 and r3 vs r4 are meaningful only if the indirect effect is absent or trivial and the estimates r2r4 and r3r4. In Section A of Online Supplementary Material 1 we show that Varr4<Varr2 if in both trial arms:

corryij1,yij0Rici2θicorryij1,pij1>1Rici2×1+θi22. (9)

Similarly, Varr4<Varr3 if in both trial arms:

corryij1,yij0θicorryij1,pij1>1θi22. (10)

A strong correlation between number of events in the target and non‐target groups as compared to the correlation between the number of events and person‐time in the target group would favor r4 over r2 and r3 in terms of precision.

If a non‐trivial indirect effect is present, the absolute values of the test statistics

tr2=r21Varr2,
tr3=r31Varr3

and

tr4=r41Varr4

are comparable in the sense that they all indicate the probability of rejecting the null hypothesis of the target ratio being one. Let r2=ζr4, then tr2<tr4 if in both trial arms (details in Online Supplementary Material 1):

corryij1,yij0>1ζ2r41ζr412Rici21+θi22θicorryij1,pij1. (11)

Similarly, let r3=ξr4, then tr3<tr4 if in both trial arms:

corryij1,yij0>1ξ2r41ξr4121+θi22θicorryij1,pij1. (12)

In the special case that ζ=1 or ξ=1, Equations (11) or (12) reduce to Equations (9) and (10), respectively. Otherwise, assume that the estimates of direct and indirect effects are in the same direction, the closer ζ or ξ is to 1, the more favourable r4 is in terms of power.

Similar to r4, the ratio of event rates estimator, r5=R1/R0=R11/R10R01/R00, where Rik=jyijkjpijk, also provides an asymptotically unbiased estimator of the direct effect. The variance of r5 is:

Varr5=r52VarR1R12+VarR0R02 (13)

where

VarRi=VarRi1Ri0=Ri2VarRi1Ri12+VarRi0Ri022covRi1,Ri0Ri1Ri0,
VarRikRik2=1niCVyijk2+CVpijk22CVyijkCVpijkcorryijk,pijk,
covRi1,Ri0=1nipi1pi0covyij1,yij0+Ri1Ri0covpij1,pij0Ri0covyij1,pij0Ri1covyij0,pij1,

and pik=j=1nipijk/ni.

Furthermore,

VarRiRi2=VarRi1Ri12+VarRi0Ri022covRi1,Ri0Ri1Ri0
=1ni{CVyij12+CVpij122CVyij1CVpij1corryij1,pij1+CVyij02+CVpij022CVyij0CVpij0corryij0,pij02[CVyij1CVyij0corryij1,yij0+CVpij1CVpij0corrpij1,pij0CVyij1CVpij0corryij1,pij0CVyij0CVpij1corryij0,pij1]}.

Then, Varr5<Varr4 if in both trial arms (details in Online Supplementary Material 1):

corryij1,pij0+corryij0,pij1<corryij0,pij0+corryij1,pij1. (14)

It is natural to expect that the number of events is more strongly correlated with the person‐time in the same group than the other group. Therefore, we anticipate a high chance of Varr5<Varr4 in many studies.

For matched‐pair CRTs, n1=n0=n is the number of pairs of clusters. Within the jth pair of clusters, one cluster is randomized to receive intervention (i = 1) and the other is the control cluster (i = 0). The paired design version of the five estimators, rlpaired(l=1,2,3,4,5), and their variances are shown in Appendix Table A1.

2.3. Approximately unbiased estimators

The literature about bias in ratio estimators and the mitigation methods has very much focused on paired observations, mostly concerning an estimator in the form of r1paired. 7 , 12 Rao and Pereira considered a ratio‐of‐ratio estimator in the form of r3paired or r4paired. 13 These previous works showed that the estimators have a bias of order n1; bias‐reduction methods were proposed. Useful though they are, they do not deal with non‐matched CRTs and r2paired and r5paired.

One solution is to determine the expectation and therefore bias of a ratio estimator, and then subtract the bias from the estimator. See, for example, van Kempen and van Vliet 8 and Rao and Pereira. 13 Although it has only been considered in studies of paired observations, the concept is applicable to both non‐matched and matched‐pair CRTs. Following this approach, we propose a set of approximately unbiased estimators. The key results for non‐matched CRTs are shown below. Their matched‐pair counterparts and details of the derivations are available in Section B of Online Supplementary Material 1.

2.3.1. Ratio of mean counts in non‐matched CRTs

The expectation of the asymptotically unbiased estimator r1 and approximately unbiased estimator r1* of the ratio of means estimator are, respectively:

Er1Y11Y011+Vary0j1n0Y012,
r1*=r1Er1Y11Y01=r1Y11Y01×Vary0j1n0Y012r111n0CV2y0j1,

with the unknown population mean Y01 approximated by the sample mean y01 to form the sample CV.

2.3.2. Ratio of mean cluster‐level event rates in non‐matched CRTs

The expectation of the asymptotically unbiased estimator r2 and approximately unbiased estimator r2* are, respectively:

Er2C1C01+Varc0jn0C02,r2*=r2Er2C1C0r211n0CV2c0j, (15)

with the unknown population mean of cluster event rates C0 replaced by the sample estimate to form the sample CV.

2.3.3. Ratio of event rates in non‐matched CRTs

Er3Y11/P11Y01/P011covy1j1,p1j1n1Y11P11covy0j1,p0j1n0Y01P01+Vary0j1n0Y012+Varp1j1n1P112,r3*=r3Er3Y11/P11Y01/P01r31+1n1CVy1j1CVp1j1corry1j1,p1j1+1n0CVy0j1CVp0j1corry0j1,p0j11n0CV2y0j11n1CV2p1j1,

with the unknown population mean Yi1 and Pi1(i=0,1) replaced by their sample estimates to form the sample CVs.

2.3.4. Double ratio of counts in non‐matched CRTs

Er4 and r4* can be obtained by replacing r3, pij1 and Pi1 by r4, yij0, and Yi0 in the formula in the previous sub‐section on ratio of event rates.

2.3.5. Double ratio of event rates in non‐matched CRTs

Er5Y11P11/Y10P10Y01P01/Y00P00{1i=01covyij1,yij0niYi1Yi0+covpij1,pij0niPi1Pi0+k=01k=01(1)k+knicovyijk,pijkYikPik+Vary1j0n1Y102+Vary0j1n0Y012+Varp1j1n1P112+Varp0j0n0P002},
r5*r5{1+i=01[1niCVyij1CVyij0corryij1,yij0+CVpij1CVpij0corrpij1,pij0+k=01k=01(1)k+kniCVyijkCVpijkcorryijk,pijk]1n0CV2y0j11n0CV2p0j01n1CV2y1j01n1CV2p1j1}.

2.3.6. Variances and confidence intervals

The variance of r1* is:

Varr1*=Er1*Er1*2=Er1Y11Y01×Vary0j1n0Y012Y11Y012=Er1Er12=Varr1.

Following the same steps, it can be shown that Varrl*=Varrl for l= 2, 3, 4, and 5 as well.

The distribution of ratios is not normal. For calculation of confidence intervals, we calculate lnrl*l. Using the delta method and the result above, Varlnrl*=Varrl/rl*2l, where Varrl have been given in Equations (2), (3), (4), (8), and (13). Confidence intervals (CI) are calculated using the t‐distribution with n1+n02 degrees of freedom for non‐matched CRTs and n1 degrees of freedom for matched‐paired CRTs. 14 The CIs calculated are then exponentiated back to the original scale.

In Equation (4), the calculation of Varr3 involves an asymptotic variance estimator of VarRi=Varj=1niyij1/j=1nipij1. Similarly, in Equations (8) and (13), this variance estimator is involved in the calculation of VarRi* and VarRi, and then the solutions are plugged into the estimators of Varr4 and Varr5, respectively. Cochran showed that this variance estimator gave a considerable under‐estimation. 7 In contrast, he showed that the Jackknife method only mildly over‐estimated the variance and the over‐estimation vanished quickly as the number of observations increased. As such, an alternative method to statistical inference is to use the Jackknife method to estimate VarRi, VarRi*, and VarRi and then plug these values into the calculation of Varlnrl*, l= 3, 4, and 5 and the respective CIs. We will use rl(J)*, l= 3, 4, and 5, to denote the estimators when used together with this Jackknife‐based variance estimation method.

3. SIMULATION

3.1. Simulation setting

For non‐matched CRTs, we generated the number of events in the kth group in the jth cluster in the ith trial arm, conditional on the person‐time pijk, by using a Poisson distribution with expected value given by Equation (1). We considered three sets of intervention effects, representing (a) direct protection (with no indirect effect), (b) direct and indirect protection, and (c) no effect, respectively: (a) expβD=0.5 and expβI=1; (b) expβD=0.5 and expβI=0.75; (c) expβD=1 and expβI=1. We set γ=1 for the difference in event rate between the target and non‐target groups. We set person‐times pij1,pij0 as following a bivariate distribution with means 100, coefficient of variation CVpijk= 0.2, 0.4, or 0.6, skewness 1.5, kurtosis 4, and correlation 0.8. Our choice of CVpijk takes into account the findings from a recent systematic review of CRTs that the first quartile, median, and third quartile of CVpijk were 0.22, 0.41, and 0.52, respectively. 15 We used positively skewed distributions because that is implied by the sizeable CV(0.6) and positive values of person‐time. We used the rmvnonnormal macro in Stata for the non‐normal data generation. 16 Additionally, we simulated person‐time using a bivariate normal distribution with means 100 and CVpijk= 0.2 or 0.4, as symmetric distribution is possible under modest CV. A small number of observations (<1% in total) with person‐time either below 5 or above 350 were replaced by 5 or 350, respectively. This is because CRTs often exclude clusters that are very small in size and exclude or sub‐divide very large clusters due to operational and efficiency considerations. 5 , 6 We set the cluster effect, αij, as following a normal distribution with mean − 2 and SDαij= 0.05, 0.2, and 0.5. The three levels of SDαij correspond to approximately CVcij = 0.05, 0.2, and 0.5. 5 Hayes and Bennett noted that CVc0j is “often ≤ 0.25 and seldom exceeds 0.5 for most health outcomes”. 14

For matched‐pair CRTs, we generated the number of events in the ith trial arm in the jth paired cluster (j=1,2,,n) by using a Poisson distribution with expected value expβD,ik+βI,i+αij+γpij1 in the target group and expβI,i+αijpij0 in the non‐target group. We set the paired cluster effects α1j,α0j as following a bivariate normal distribution with means −2 and SDs 0.05, 0.2, or 0.5 and correlation 0.8. The person‐time parameter in the kth group in the jth pair of clusters pijk followed a multivariate non‐normal distribution with means 100, CVpijk= 0.2, 0.4, or 0.6, skewness 1.5, kurtosis 4, and correlation 0.8. Additionally, multivariate normal distribution was used in the case of CVpijk= 0.2 or 0.4. The other parameters in the matched‐pair CRTs were the same as those in the non‐matched CRTs.

In the literature, it has been suggested that non‐matched CRTs should include at least four clusters per trial arm and matched‐pair CRTs should include at least six pairs of clusters. 1 For non‐matched CRTs, we evaluated the properties of the estimators when the number of clusters per trial arm is 4, 6, 8, 12, 16, 32, and 64. For matched‐pair CRTs, we considered 6, 8, 12, 16, 32, and 64 pairs of clusters.

In each scenario, we conducted 10 000 replicates of data generation and in each of them calculated the five asymptotically unbiased estimators and the five approximately unbiased estimators and their variances. We report the relative bias of the mean estimates of the incidence rate ratio, root mean square error (RMSE), coverage probability (CP) of the 95% confidence intervals (CI) and power to reject the null hyperthesis of the respective ratio equals one (or type 1 error when the null hyperthesis is true). Calculation of CIs was based on log‐transformation and then exponentiate back to the original scale, which were used for statistical inference. Calculations of relative bias and RMSE were based on the ratios themselves without transformation.

3.2. Simulation results

In Figures 1, 2, 3 we show the simulation results of non‐matched CRTs that the intervention only had a direct effect, that is, expβD=0.5 and expβI=1, and CVpij=0.4, which was approximately the median level of variability in cluster size found by a systematic review. 15 To maintain visual clarity, we separately present the results on r1, r2, r3, r1*, r2*, and r3* (upper panel) and r4, r5, r4*, and r5* (lower panel).

FIGURE 1.

SIM-9226-FIG-0001-c

Relative bias of intervention effect estimators in relation to the number of clusters per trial arm for non‐matched CRTs, by three levels of SDαij; population size per cluster follows a skewed distribution with mean = 100 and CV = 0.4; intervention has a direct effect only expβD=0.5;expβI=1

FIGURE 2.

SIM-9226-FIG-0002-c

Root mean squared error (RMSE) of intervention effect estimators in relation to the number of clusters per trial arm for non‐matched CRTs, by three levels of SDαij; population size per cluster follows a skewed distribution with mean = 100 and CV = 0.4; intervention has a direct effect only expβD=0.5;expβI=1

FIGURE 3.

SIM-9226-FIG-0003-c

Coverage probability (CP) of 95% confidence interval (calculated on log‐scale and exponentiated back to the original scale) in relation to the number of clusters per trial arm for non‐matched CRTs, by three levels of SDαij; population size per cluster follows a skewed distribution with mean = 100 and CV = 0.4; intervention has a direct effect only expβD=0.5;expβI=1

Figure 1 shows the patterns of relative bias. All the asymptotically unbiased estimators showed positive relative bias that decreased as the number of clusters per arm (ni) increased. Furthermore, the bias of r1, r2, and r3 increased as SDαij increased. In scenarios with SDαij0.2, r2, and r3 only showed very mild bias and the two curves mostly overlapped. In contrast, the approximately unbiased estimators, r1*, r2*, and r3*, were practically unbiased under all situations considered. The relative bias of r4 and r5 was stable in relation to SDαij. The approximately unbiased estimators r4* and r5* were practically unbiased under all situations considered.

In Figure 2, r2* and r3* had smaller RMSE than r1*, but they converged as ni increased. r2* had slightly smaller RMSE than r3* when SDαij=0.5 and ni was small. Otherwise they were almost indistinguishable from each other. r5* had smaller RMSE than r4*. Their difference was stable across level of SDαij but they converged as ni increased. The asymptotically unbiased estimators had RMSE similar to or slightly larger than their respective approximately unbiased counterparts', but the difference vanished as ni increased.

The estimators tended to have coverage probability (CP) smaller than the nominal 95% level, especially when SDαij was large (Figure 3). The CP improved as ni increased. The asymptotically unbiased estimators tended to have similar or slightly lower CP than their respective approximately unbiased counterparts. In the upper panel, r2* had CP close to the nominal 95% across all levels of ni and SDαij. r1* and r3* had CP below 94% in some scenarios of small ni and large SDαij. Using the Jackknife‐based variance estimator described in Section 2 for r3*, denoted by r3(J)*, gave improved CP that was closer to the nominal 95% level than r3*. In the lower panel, r4, r5, r4*, and r5* had varying degree of under‐coverage in different scenarios. However, using the Jackknife‐based variance estimators for r4* and r5*, denoted by r4(J)* and r5(J)*, respectively, the CP was close to the nominal level in all situations.

Figure 4 shows type 1 error rates, that is, rejection of null hypothesis in scenarios with expβD=1 and expβI=1. Otherwise, the parameters are the same as those in Figures 1, 2, 3. There was a tendency for all estimators to have type 1 error rate that exceeded the 5% target level, especially with SDαij 0.2 and ni< 32. The inflation reduced as ni increased. In the upper panel, r2 and r2*, followed by r3(J)* and r1*, performed better than the others. In the lower panel, r4, r5, r4*, and r5* had varying level of inflation of type 1 error under different parameter settings. In contrast, r4(J)* and r5(J)* performed well. In no circumstances did they show more than 1% deviation from the 5% target.

FIGURE 4.

SIM-9226-FIG-0004-c

Type 1 error rate in relation to the number of clusters per trial arm for non‐matched CRTs, by three levels of SDαij; population size per cluster follows a skewed distribution with mean = 100 and CV = 0.4; intervention has no effect expβD=1;expβI=1

Figure 5 compares the power of selected estimators that use person‐time as denominators, r2* and r3(J)*, vs selected estimators that use event counts in the non‐target group as denominators, r4(J)* and r5(J)*. We focused on them because they performed well in terms of CP and type 1 error rate. The lower panel introduced an indirect effect, expβI=0.75, in addition to a direct effect, expβD=0.5. Otherwise, the parameters here are the same as those in Figures 1, 2, 3, 4. When there was direct effect only (upper panel), r2* and r3(J)* were more powerful than r4(J)* and r5(J)* for SDαij0.2. In contrast, r4(J)* and r5(J)* were more powerful when SDαij=0.5. Furthermore, r5(J)* was more powerful than r4(J)* in all the scenarios considered. With the addition of the indirect effect, r2* and r3(J)* were more powerful than r4(J)* at all levels of SDαij, but similar to r5(J)* at SDαij=0.5.

FIGURE 5.

SIM-9226-FIG-0005-c

Power of r2*,r3(J)*,r4(J)*,andr5(J)* in relation to the number of clusters per trial arm for non‐matched CRTs, by three levels of SDαij; population size per cluster follows a skewed distribution with mean = 100 and CV = 0.4. Upper panel: with direct effect only expβD=0.5;expβI=1; lower panel: with direct and indirect effects expβD=0.5;expβI=0.75

Further simulation results on non‐matched CRTs under other parameter settings and simulation results on matched‐pair CRTs are available in Online Supplementary Material 2. The findings are qualitatively similar to those reported above. Some relatively important additional information is as follows: First, in non‐matched CRTs, the relative bias of r1 and r4 increased substantially as CVpij1 increased except when the number of cluster was about 32 or above. In contrast, r2, r3, r5 and the approximately unbiased estimators were much less sensitive to the magnitude of CVpij1 (eg, Figures S1 and S2). Second, in non‐matched CRTs, the CP of r3(J)* reduced to about 93% and its type 1 error rate increased to about 7% when CVpij1 = 0.6, SDαij = 0.5 and the number of clusters was 6 or below (eg, Figures S6 and S26). However, in matched‐paired CRTs, r3(J)*,paired performed well in these aspects while r2*,paired had somewhat inflated type 1 error rate and below target CP (eg, Figures S41, S42, S62, and S63).

4. SEASONAL MALARIA CHEMOPREVENTION TRIAL

We use a subset of data from a published study of seasonal malaria chemoprevention (SMC) in Senegalese children to illustrate. 17 The trial set‐up is shown in Figure 6. The trial had a total of 54 clusters. It had the appearance of an “optimal design”, 18 , 19 with nine clusters on intervention (leftmost column) and nine clusters on control condition (rightmost column) for all three time periods (2008 to 2010) that resembled a non‐matched CRT, flanking a standard stepped‐wedge trial (middle columns). The middle columns represent 18 clusters that were randomized to receive SMC from 2009 and another 18 clusters randomized to receive SMC from 2010. However, the trial was not planned according to the optimal design. The original plan was that the trial would continue up to 4 years (2008 to 2011), and the nine clusters on the rightmost column were randomized to receive SMC in 2011. But the trial was terminated after the malaria transmission season in 2010 according to data monitoring and interim analysis results. Furthermore, in the first period (2008), children aged 3 to 59 months in the nine clusters on the leftmost column were given SMC as part of the preparation of the study logistics. In 2009 and 2010, children aged between 3 months and 9 years (inclusive) were given SMC.

FIGURE 6.

SIM-9226-FIG-0006-b

Design of seasonal malaria chemoprevention (SMC) trial

A passive surveillance system was implemented in health facilities to determine the number of clinical malaria episodes confirmed by rapid malaria test (primary endpoint) in each cluster, by four age groups (59 months or below; 5‐9; 10‐19; over 19 years). The data was at the cluster, not individual, level. Mortality data was collected for all age groups but only data for children aged 9 years or below was used in the previously published analysis 17 and available to the present analysis. Number of deaths (secondary endpoint) and population size of the clusters at mid‐September each year (approximately the beginning of the annual malaria transmission season), by age groups, was collected by a demographic surveillance system.

For the purpose of illustration, we used the 2008 data from the 18 clusters that resembled a non‐matched CRT. Children aged between 3 to 59 months were the target group. We considered children aged 5 to 9 years the non‐target group.

Table 1 shows the descriptive statistics by endpoints and trial arms. It also included a simple average of each statistics in the two trial arms as a summary. There was very large between‐cluster variability in malaria incidence rate in the target group, with CVcij1>1 in both arms. In the target group, the CVs of malaria episodes approximately doubled the CVs of the population size, with θ=CVpij1/CVyij10.60/1.37=0.44. The correlation coefficient between malaria episodes and population size of the target group, corryij1,pij1, was weak, with average across trial arms being only 0.03. The estimates of malaria incidence rate in the SMC and control arms were c1=0.0029>R1=0.0028 and c0=0.0031>R0=0.0020, respectively. In contrast, the correlation between malaria episodes in the target and non‐target group, corryij1,yij0, was strong, averaged at 0.76.

TABLE 1.

Descriptive summary of number of malaria episodes and deaths, population size and their correlations and coefficient of variation of cluster‐level event rates in 9 intervention clusters and 9 control clusters in a seasonal malaria chemoprevention (SMC) trial in 2008

Malaria Mortality
Statistics SMC Control Average SMC Control Average
yij1
8.00 3.56 5.78 7.11 2.89 5.00
yij0
15.44 5.89 10.67 1.00 0.56 0.78
pij1
2834 1741 2288 2834 1741 2288
pij0
1977 1187 1582 1977 1187 1582
CVcij1
1.21 1.75 1.48 0.89 0.68 0.79
CVyij1
1.24 1.49 1.37 1.05 0.95 1.00
CVyij0
1.15 1.08 1.12 1.58 1.31 1.45
CVpij1
0.46 0.73 0.60 0.46 0.73 0.60
CVpij0
0.49 0.65 0.57 0.49 0.65 0.57
corryij1,pij1
0.10 −0.05 0.03 0.48 0.69 0.59
corryij1,pij0
0.13 −0.05 0.04 0.57 0.68 0.63
corryij0,pij1
0.26 0.05 0.15 0.18 0.58 0.38
corryij0,pij0
0.34 0.01 0.17 0.26 0.57 0.41
corryij1,yij0
0.88 0.64 0.76 0.67 0.35 0.51
corrpij1,pij0
0.988 0.998 0.993 0.988 0.998 0.993

Table 2 shows the estimation results. The estimates based on the asymptotically unbiased estimators were all larger than those based on the approximately unbiased estimators. Given this number of clusters, we considered the latter more accurate. For l = 3, 4, and 5, the SE's based on rl(J)* were only slightly larger than rl*. All confidence intervals were quite wide. Although corryij1,pij1 was weak, the estimated r1 was much larger than r2 and r3, leading to standard error SEr1 larger than SEr2 and SEr3. Given ci>Ri in both trial arms, SEr2<SEr3. As expected from Equations (9) and (10), with strong corryij1,yij0 and weak corryij1,pij1, SEr4 was smaller than SEr2 and SEr3. The estimators for direct effects, r4* and r5*, gave very similar result. Since there was no practical difference between corryij1,pij1+corryij0,pij0 and corryij1,pij0+corryij0,pij1 in either trial arm, as indicated by Equation (14), Varr4Varr5.

TABLE 2.

Estimates, SE and 95% confidence intervals (CI; exponentiation of log‐transformed values) for malaria and mortality in seasonal malaria chemoprevention trial data in 2008

Endpoint Estimator Estimate SE 95% CI
Malaria
r1*
1.70 1.45 (0.28, 10.4)
r2*
0.62 0.67 (0.06, 6.07)
r3*
1.01 0.97 (0.13, 7.80)
r3(J)*
1.01 1.00 (0.12, 8.26)
r4*
0.74 0.37 (0.25, 2.14)
r4(J)*
0.74 0.40 (0.23, 2.32)
r5*
0.76 0.38 (0.26, 2.19)
r5(J)*
0.76 0.41 (0.24, 2.38)
r1
2.25 1.45 (0.57, 8.85)
r2
0.94 0.67 (0.21, 4.24)
r3
1.38 0.97 (0.31, 6.16)
r4
0.86 0.37 (0.34, 2.14)
r5
0.88 0.38 (0.35, 2.20)
Mortality
r1*
2.21 1.16 (0.73, 6.74)
r2*
1.16 0.46 (0.50, 2.67)
r3*
1.44 0.58 (0.61, 3.40)
r3(J)*
1.44 0.61 (0.59, 3.54)
r4*
1.08 0.81 (0.22, 5.26)
r4(J)*
1.08 0.88 (0.19, 6.02)
r5*
1.12 0.83 (0.23, 5.41)
r5(J)*
1.12 0.91 (0.20, 6.33)
r1
2.46 1.16 (0.90, 6.70)
r2
1.22 0.46 (0.55, 2.70)
r3
1.51 0.58 (0.67, 3.42)
r4
1.37 0.81 (0.39, 4.78)
r5
1.40 0.83 (0.40, 4.92)

Table 1 also shows that the descriptive statistics on mortality. There was less between‐cluster variability in mortality rate than malaria incidence, with CVcij1 in the two arms averaged at 0.79. Furthermore, in the target group, the CVs of death approximately doubled the CVs of the population size, with θ=CVpij1/CVyij10.60/1.00=0.60. Unlike malaria episodes, the correlation coefficient between deaths and population size of the target group was more substantial, with average across trial arms being 0.59. The estimates of mortality rate in the SMC and control arms were c1=0.0022<R1=0.0025 and c0=0.0018>R0=0.0017, respectively. The correlation between deaths in the target and non‐target group was moderate, with average corryij1,yij0 at 0.51; this was weaker than that for malaria episodes.

Estimation results on mortality are available in Table 2. Again, the asymptotically unbiased estimators tended to give larger estimates than their respective approximately unbiased estimators. However, the differences between the two set of estimates were smaller than those on malaria episodes, as expected from the smaller between‐cluster variability in mortality rate than malaria incidence rate. The SE's based on rl(J)* were only slightly larger than those based on rl*. All confidence intervals were quite wide. Since corryij1,pij1 was substantial and the estimated r1 was much larger than r2 and r3, SEr2 and SEr3 were smaller than SEr1. With fairly similar values of corryij1,yij0 and corryij1,pij1 and relatively large θ, SEr2 and SEr3 were smaller than SEr4. There was no clear difference between corryij1,pij1+corryij0,pij0 and corryij1,pij0+corryij0,pij1 in either trial arm, so Varr4Varr5.

5. DISCUSSION

The proposed approximately unbiased estimators successfully reduce the bias when the number of clusters is small. They also have advantages in terms of smaller RMSE and more accurate coverage probability than the asymptotically unbiased estimators. For studies with fewer than 60 clusters per arm, we recommend the use of the approximately unbiased estimators. Some CRTs do have a large number of clusters per trial arm, for example, a trial of influenza vaccination had over 400 nursing homes per arm 20 and a trial of mass drug administration had over 700 communities per arm. 21 For such studies, the choice between the asymptotically and approximately unbiased estimators is unimportant. Furthermore, with a large number of clusters, the simple estimator r1 has performance very similar to r2 and r3. At the study planning stage, investigators may take into account this finding when they consider the cost and benefit of collecting person‐time data.

Previous simulation studies evaluated the performance of the estimator r2. 5 , 6 We caution that the range of parameter values they considered were somewhat narrow. From our analytic solution, the estimator is only asymptotically unbiased. As seen in Equation (15), the bias in the estimator r2 is a non‐linear function of the between‐cluster variability of event rate. From simulation, the bias of r2 became obvious as the variability increased, especially when the number of clusters per trial arm was below 16 or so. In the SMC study of malaria episodes, where the variability was large, the r2 estimate was much larger than r2*. In those situations, the use of our proposed bias‐corrected estimator r2* is preferable over r2.

While r3* performs well in terms of bias and RMSE, its variance estimator under‐estimates the true variability as the CV of cluster size or event rate increases; it approaches the target CP as the number of clusters increases. The under‐coverage can be corrected by plugging the Jackknife estimates of VarRi into the estimator for Varr3. We found that r2* and r3(J)* have similar performance except when CVpijk = 0.6 or SDαij = 0.5. In those settings, r3(J)* had type 1 error rate up to about 2% higher than the nominal 5% level when the number of clusters was six or below, and r2* may be preferred. However, in matched‐pair CRTs, r2*,paired did not out‐perform r3(J)*,paired. Furthermore, previous studies had shown that the estimator of cluster‐level event rate in a trial arm, ci, is a biased estimator, and the level of bias does not reduce in relation to increase in number of clusters. 8 In contrast, Ri can be used to obtain an asymptotically unbiased estimate of disease incidence. 7 , 8 Even if r2* is used to estimate incidence rate ratio, Ri is preferable over ci as an estimator of the incidence rate in each trial arm.

Our consideration of r4 and r5 was in part motivated by an evaluation of a malaria vaccine where indirect effects were unlikely and the incidence of some outcomes such as meningitis (a safety outcome) could be highly variable between clusters, and where balanced randomization with respect to access to health care facilities (where passive surveillance took place) was difficult to ensure. 22 When SDαij is large, the number of events in the target group is likely to be more strongly correlated with the number of events in the non‐target group than with the person‐time in the target group. Furthermore, it is typical that vaccine studies anticipate no indirect effect on safety endpoints. So, these estimators are expected to estimate the same target as far as safety is concerned. In simulation studies of scenarios that the intervention only had a direct effect, we have seen that when SDαij was ≤0.2, r2* and r3(J)* tended to be more powerful than r4(J)* and r5(J)*. When SDαij increased to 0.5 and in the malaria data in the SMC trial, r4(J)* and r5(J)* had substantially smaller SE than r2* and r3(J)*. For situations like this, r4(J)* and r5(J)* are our estimators of choice. In various scenarios we evaluated, r5(J)* gave higher level of statistical power than r4(J)*, except that they had similar performance when CVpijk was small. In that case, the magnitude of the benefits may not justify the extra cost in collection of person‐time data in both the target and non‐target groups. Otherwise, r5(J)* tends to be preferable over r4(J)*.

We foresee that the estimators that use person‐time in the target group and estimators that use number of events in the non‐target groups as a denominator may be used in different parts of the same CRT, depending on the considerations aforementioned. For example, r2 or r3 or their extensions may be used in efficacy analysis while r4 and r5 or their extensions may be used in safety analysis.

Furthermore, r4 and r5 are generic quantities in the sense that yij0 may be replaced by quantities other than the number of events in non‐target groups to achieve other purposes. For example, there has been interest in the use of “negative control events” to remove the bias arising from differential ascertainment of outcome events in non‐blinded CRTs. 4 The proposed estimator in the literature is in the form of r4, with y1j0 and y0j0 replaced by the number of negative control events in the intervention and control arm, respectively. 4 The previous study did not consider the properties of the estimator in situations with small number of clusters. The results here apply directly. Another example of application is to the CRT with Before and after observations (CRT‐BA) design, 23 which collects data in a baseline period before launching the randomized intervention and control comparator. By replacing yij0 by the baseline event count, r4 becomes a baseline adjusted estimator of the total effect. It offers a robust alternative to the analysis of CRT‐BA trials.

The strengths of the present study include coverage of both non‐matched and matched‐pair CRTs, analytic evaluation of the bias of existing asymptotically unbiased estimators, proposal of estimators that capitalize on denominators other than cluster population size and their potential applications, development of a bias‐corrected version of these estimators for use in studies with a small number of clusters, and simulation evaluation of the estimators with a realistic range of variability in cluster size. A limitation is that the methods do not handle covariate adjustment. However, good use of restricted randomization may reduce the need for covariate adjustment in the analysis stage. 1

In non‐matched CRTs, one approach for controlling covariate effects is stratified analysis and pooling of stratum‐specific estimates using weights inversely proportional to variances. The limitation is that it is not practical to stratify for multiple covariates and it requires categorization of continuous covariates. Another approach is to apply ANCOVA to cluster‐level data. However, it only works for methods that generate a summary value per cluster, such as r1 and r2, whose calculation begins with getting an event count and event rate per cluster, respectively. This is different from the calculation of r3 to r5, which begins with generating an estimate for a trial arm. For example, in r3, the incidence rate in a trial arm is the sum of events over clusters divided by the sum of person‐time over clusters within the trial arm. There is not a summary value for every cluster. Furthermore, when the purpose is to estimate rate ratio instead of rate difference, the ANCOVA approach would need to analyze the log‐transformed values instead. The exponentiated intervention effect estimate is then interpreted as a ratio of geometric means, which is not the same as the widely used estimator of ratio of arithmetic means (including r1 and r2). In CRTs with a small number of clusters, the bias and bias‐correction for ratios of geometric means of event counts and event rates have yet to be investigated. Another approach is to use Poisson regression analysis with cluster‐level covariates (and individual‐level covariates if available) without using the intervention variable as predictors to obtain an expected number of events for each cluster. 5 Comparison of the deviations of the observed from the expected number of events between trial arms then offers a covariate adjusted estimate for the intervention effect. Following this idea, if yij0 in r4 or pijk in r5 are replaced by this expected number of events, they become covariate adjusted estimators of the total or direct effect, respectively. The bias‐correction method may then be applied to obtain covariate adjusted r4* or r5*. But the variance estimators may not work well as they do not account for the uncertainty in the prediction of the expected number of events. In short, while there are several candidate approaches available, challenges remain. Further research is needed to develop and evaluate these or other approaches to covariate adjustment.

The size of corryij1,pij1 is important in determining the relative strength of the different estimators. A large SDαij tends to dilute this correlation. As such, pilot data and careful consideration of the between‐cluster variability in event rate is important not only for sample size determination but also for choice of study design and statistical analysis procedures.

CONFLICT OF INTEREST

The authors declare no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Supporting information

Online Supplementary Material 1: Comparisons of asymptotically unbiased estimators and derivations of approximately unbiased estimators

Online Supplementary Material 2: Further simulation results, by number of clusters per trial arm and three levels of SD (α ij ); mean population size per cluster is 100

APPENDIX A.

TABLE A1.

Estimators of incidence rate ratio for matched‐pair cluster randomized trials a

Label Estimand b Estimator Variance c
Ratio of means
expβT
r1paired=y1y0=j=1ny1j1j=1ny0j1
y12ny02Vary1j1y12+Vary0j1y022covy1j1,y0j1y1y0
Ratio of mean cluster‐level event rates
expβT
r2paired=c1c0=j=1nc1jj=1nc0j
c12nc02Varc1jc12+Varc0jc022covc1j,c0jc1c0
Ratio of event rates
expβT
r3paired=R1R0=j=1ny1j1/j=1np1j1j=1ny0j1/j=1np0j1
R1R02VarR1R12+VarR0R022covR1,R0R1R0
Double ratio of counts
expβD
r4paired=R1*R0*=j=1ny1j1/j=1ny1j0j=1ny0j1/j=1ny0j0
R1*R0*2VarR1*R1*2+VarR0*R0*22covR1*,R0*R1*R0*
Double ratio of event rates
expβD
r5paired=R1R0=j=1ny1j1j=1np1j1/j=1ny1j0j=1np1j0j=1ny0j1j=1np0j1/j=1ny0j0j=1np0j0
R1R02VarR1R12+VarR0R022covR1,R0R1R0
a

yijk,pijk, and cij=yij1/pij1 are, respectively, the number of events, person‐time/population size and cluster‐level event rates in the cluster that is randomized to receive the ith trial arm (1 for intervention and 0 for control) in the jth pair of clusters and kth group (1 for target and 0 for non‐target group), and ci=j=1ncij/n.

b

βT and βD: Total effect and direct effect in terms of log incidence rate ratio.

c

Components of the variances of rlpaired(l=3,4,5) are: VarRi=1npi12Varyij1+Ri2Varpij12Ricovyij1,pij1,i=0,1; covR1,R0=1np11p01covy1j1,y0j1+R1R0covp1j1,p0j1R1covy0j1,p1j1R0covy1j1,p0j1, VarRi*=1nyi02Varyij1+Ri*2Varyij02Ri*covyij1,yij0,i=0,1; covR1*,R0*=1ny10y00covy1j1,y0j1+R1*R0*covy1j0,y0j0R1*covy0j1,y1j0R0*covy1j1,y0j0, VarRi=VarRi1Ri0=Ri1Ri02VarRi1Ri12+VarRi0Ri022covRi1,Ri0Ri1Ri0,i=0,1; covR1,R0=covR11R10,R01R00=1nR10R00covR11,R01+R1R0covR10,R00R1covR01,R10R0covR11,R00, VarRik=1npik2Varyijk+Rik2Varpijk2Rikcovyijk,pijk,i=0,1;k=0,1; covRik,Rik=1npikpikcovyijk,yijk+RikRikcovpijk,pijkRikcovyijk,pijkRikcovyijk,pijk,i,i=0,1;k,k=0,1; where Rik=j=1nyijkj=1npijkandyik=j=1nyijk/n;pik=j=1npijk/n,i=0,1;k=0,1.

Ma X, Milligan P, Lam KF, Cheung YB. Ratio estimators of intervention effects on event rates in cluster randomized trials. Statistics in Medicine. 2022;41(1):128–145. doi: 10.1002/sim.9226

Any opinions, findings and conclusions or recommendations expressed in this material are those of the authors and do not reflect the views of Ministry of Health/National Medical Research Council, Singapore.

Funding information National Medical Research Council, MOH‐000526

DATA AVAILABILITY STATEMENT

The data used in the case study are available at https://doi.org/10.17037/DATA.117. Requests for access will be reviewed by a Data Access Committee to ensure use of the data protect participant privacy according to the terms of participant consent and ethics committee approval. Stata codes for the simulations will be deposited to figshare by Wiley.

REFERENCES

  • 1. Hayes RJ, Moulton LH. Cluster Randomised Trials. Boca Raton, FL: CRC Press; 2009. [Google Scholar]
  • 2. Donner A, Klar N. Design and Analysis of Cluster Randomization Trials in Health Research. Chichester, UK: Wiley; 2010. [Google Scholar]
  • 3. Dron L, Taljaard M, Cheung YB, et al. Global health clinical trials: role and challenges of cluster randomised trials for global health. Lancet Global Health. 2021;9:e701‐e710. [DOI] [PubMed] [Google Scholar]
  • 4. Dufault SM, Jewell NP. Analysis of counts for cluster randomized trials: negative controls and test‐negative designs. Stat Med. 2020;39(10):1429‐1439. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Bennett S, Parpia T, Hayes R, Cousens S. Methods for the analysis of incidence rates in cluster randomized trials. Int J Epidemiol. 2002;31(4):839‐846. [DOI] [PubMed] [Google Scholar]
  • 6. Pacheco GD, Hattendorf J, Colford JM, Mausezahl D, Smith T. Performance of analytical methods for overdispersed counts in cluster randomized trials: sample size, degree of clustering and imbalance. Stat Med. 2009;28:2989‐3011. [DOI] [PubMed] [Google Scholar]
  • 7. Cochran WG. Sampling Techniques. 3rd ed. Chichester, NY: Wiley; 1977. [Google Scholar]
  • 8. van Kempen GMP, van Vliet LJ. Mean and variance of ratio estimators used in fluorescence ratio imaging. Cytometry. 2000;39:300‐305. [DOI] [PubMed] [Google Scholar]
  • 9. Donner A, Klar N. Methods for comparing event rates in intervention studies when the unit of allocation is a cluster. Am J Epidemiol. 1994;140:279‐289. [DOI] [PubMed] [Google Scholar]
  • 10. Rao JNK, Scott AJ. A simple method for the analysis of clustered binary data. Biometrics. 1992;48(2):577‐585. [PubMed] [Google Scholar]
  • 11. Halloran ME, Longini IM, Struchiner CJ. Design and Analysis of Vaccine Studies. New York, NY: Springer; 2010. [Google Scholar]
  • 12. Durbin J. A note on the application of Quenouille's method of bias reduction to the estimation of ratios. Biometrika. 1959;46:477‐480. [Google Scholar]
  • 13. Rao JNK, Pereira NP. On double ratio estimators. Sankhyā Ind J Stat Ser A. 1968;30:83‐90. [Google Scholar]
  • 14. Hayes RJ, Bennett S. Simple sample size calculation for cluster‐randomized trials. Int J Epidemiol. 1999;28(2):319‐326. [DOI] [PubMed] [Google Scholar]
  • 15. Kristunas C, Morris T, Gray L. Unequal cluster sizes in stepped‐wedge cluster randomised trials: a systematic review. BMJ Open. 2017;7:e017151. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Lee S. Generating univariate and multivariate nonnormal data. Stata J. 2015;15(1):95‐109. [Google Scholar]
  • 17. Cisse B, Ba EH, Sokhan C, et al. Effectiveness of seasonal malaria chemoprevention in children under ten years of age in Senegal: a stepped‐wedge cluster‐randomized trial. PLoS Med. 2016;13(11):e1002175. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Girling AJ, Hemming K. Statistical efficiency and optimal design for stepped cluster studies under linear mixed effects models. Stat Med. 2016;35:2149‐2166. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Thompson JA, Fielding K, Hargreaves J, Copas A. The optimal design of stepped wedge trials with equal allocation to sequences and a comparison to other trial designs. Clin Trials. 2016;14(6):639‐647. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Gravenstein S, Davidson HE, Taljaard M, et al. Comparative effectiveness of high‐dose versus standard dose influenza vaccination on numbers of US nursing home residents admitted to hospital: a cluster‐randomized trial. Lancet Respir Med. 2017;5(9):738‐746. [DOI] [PubMed] [Google Scholar]
  • 21. Keenan JD, Bailey RL, West SK, et al. Azithromycin to reduce childhood mortality in sub‐Saharan Africa. New Engl J Med. 2018;378(17):1583‐1592. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Milligan P. Statistical Analysis Plan for the Malaria Vaccine Pilot Evaluation, London, UK: London School of Hygiene & Tropical Medicine and World Health Organization; 2021. https://clinicaltrials.gov/ProvidedDocs/65/NCT03806465/SAP_001.pdf. Accessed July 11, 2021. [Google Scholar]
  • 23. Hemming K, Taljaard M. Sample size calculations for stepped wedge and cluster randomised trials: a unified approach. J Clin Epidemiol. 2016;69:137‐146. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Online Supplementary Material 1: Comparisons of asymptotically unbiased estimators and derivations of approximately unbiased estimators

Online Supplementary Material 2: Further simulation results, by number of clusters per trial arm and three levels of SD (α ij ); mean population size per cluster is 100

Data Availability Statement

The data used in the case study are available at https://doi.org/10.17037/DATA.117. Requests for access will be reviewed by a Data Access Committee to ensure use of the data protect participant privacy according to the terms of participant consent and ethics committee approval. Stata codes for the simulations will be deposited to figshare by Wiley.


Articles from Statistics in Medicine are provided here courtesy of Wiley

RESOURCES