Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2020 Sep 10.
Published in final edited form as: Stat Med. 2019 Jun 4;38(20):3733–3746. doi: 10.1002/sim.8153

Optimal designs in three-level cluster randomized trials with a binary outcome

Jingxia Liu a,b, Lei Liu b, Graham A Colditz a
PMCID: PMC6905388  NIHMSID: NIHMS1061732  PMID: 31162709

Abstract

Cluster randomized trials (CRTs) were originally proposed for use when randomization at the subject level is practically infeasible or may lead to a severe estimation bias of the treatment effect. However, recruiting an additional cluster costs more than enrolling an additional subject in an individually randomized trial. Under budget constraints, researchers have proposed the optimal sample sizes in two-level CRTs. CRTs may have a three-level structure, in which two levels of clustering should be considered. In this paper, we propose optimal designs in three-level CRTs with a binary outcome, assuming nested exchangeable correlation structure in generalized estimating equation models. We provide the variance of estimators of three commonly used measures: risk difference, risk ratio, and odds ratio. For a given sampling budget, we discuss how many clusters and how many subjects per cluster are necessary to minimize the variance of each measure estimator. For known association parameters, the locally optimal design (LOD) is proposed. When association parameters are unknown but within pre-determined ranges, the MaxiMin design (MMD) is proposed to maximize the minimum of relative efficiency over the possible ranges, that is, to minimize the risk of the worst scenario.

Keywords: Cluster randomized trial (CRT), dissemination and implementation (D&I), generalized estimating equation (GEE), intracluster correlation coefficient (ICC), nested correlation structure

1. Introduction

Cluster randomized trials (CRTs) were originally proposed for use when randomization at the subject level is practically infeasible or may possibly lead to a severe estimation bias of the treatment effect. In practice, implementation strategies play important roles in dissemination and implementation (D&I) research. Compared to subject-level randomized trials, CRT designs have appealing features for implementation science in public health and clinical medicine. Further, CRTs are greatly needed for effectiveness research from science to practice.1,2 Therefore, there has been growing interest in the design of CRTs.39 The unit of randomization might be hospitals, clinics, classrooms, etc. Subjects within a cluster are exposed to common factors and tend to share similar characteristics. The degree of such similarity is commonly quantified by the intracluster correlation coefficient (ICC). Recruiting an additional cluster costs more than enrolling an additional subject in an individually randomized trial; thus, researchers have proposed the optimal sample size as a function of sampling costs and the ICC in CRTs.1016 “Optimal” means the maximum power and precision for a given sampling budget, or the minimum sampling cost for a given power and precision. These approaches show that the optimal sample size depends strongly on the ICC. However, the ICC is usually unknown in CRTs. To overcome this shortcoming, Van Breukelen et al. considered a range of possible ICC values and presented MaxiMin designs (MMDs) based on relative efficiency (RE) under budget constraints.17 Wu et al. proposed the optimal group allocations for three measures (RD, RR, OR) in two-level CRTs with binary outcomes through the variances of the maximum likelihood estimators. 18

CRTs may have three-level structures. For example, subjects in a two-level CRT are measured at different time points. Measurements across the different time points are correlated within a subject, while subjects are correlated within a cluster. Another example is that interventions are randomly assigned to medical centers (“practices”), and health care professionals (“providers”) within the same practice are trained with the assigned intervention to provide care to participants. Participants could be correlated within a provider, while providers could be correlated within a practice. Hereafter we use a CRT with practice, provider, and participant levels as the three-level example. For simplicity, we consider the same provider size (number of participants from each provider) and equal practice sizes (number of providers per practice).

Generalized estimating equations (GEEs) proposed by Liang and Zeger19 have been commonly applied to analyze the correlated data in CRTs.2024 Liang and Zeger showed that the GEE approach still gives consistent estimates of the regression coefficients- provided that the marginal model is correctly specified- even if the working correlation matrix is incorrectly assumed.19 In this paper, we aim to propose an optimal design (OD) in three-level CRTs, in which “optimal” refers to the minimization of the variance of each measure estimator for a given sampling budget. We assume the nested exchangeable correlation structure25 throughout and utilize the GEE models in a three-level CRT with a binary outcome. The correlation structure includes correlation among participants within the same provider in the same practice, r, and correlation among participants with different providers in the same practice, ρ. Both r and ρ are assumed to be constant across all practices. Three different link functions in GEE models are considered, e.g. identity, log, and logit, where the corresponding regression coefficients are related to risk difference (RD), risk ratio (RR), and odds ratio (OR), respectively.

For known association parameters (r,ρ), we discuss how many practices m need to be enrolled and how many providers per practice n are sufficient to minimize the variance of each measure estimator under the budget constraints when the provider size K is a pre-determined value and K is not a fixed value but within a range (Kmin,Kmax), respectively. This is a locally optimal design (LOD) with corresponding numbers nLOD and mLOD. When the association parameters (r,ρ) are unknown, but we assume that ranges for r and ρ can be obtained from other literature and similar studies, we propose MMDs in the framework of relative efficiency (RE) to minimize the risk of the worst scenario. RE is defined as the ratio of the variance of each measure estimator for practice size nLOD to n,17 which is a function of n,r, and ρ. Our goal is to maximize the minimum of RE over the possible ranges of r and ρ.

The organization of this article is as follows. In Section 2, we briefly summarize the GEE method developed by Liang and Zeger in three-level CRTs,19 introduce the “nested exchangeable” correlation structure,25 and derive the variance of the estimator of the treatment for a binary outcome in a two-group comparison. Section 3 presents the LOD for known parameter values under the assumption of a “nested exchangeable” correlation structure. In Section 4, we define the RE and propose MMDs for unknown parameter values of r and ρ. We provide guidance on applying the methods and illustrate using a real CRT, followed by a discussion about the limitations of the proposed approach and directions for future research.

2. Statistical GEE models in three-level CRTs

Let Yijk be a response from participant k = 1,⋯,K, for provider j = 1,⋯,ni in practice i = 1,⋯,m. Let Xijk = (Xijk1,⋯,Xijkp)′ be a covariate vector and μijk = E(Yijk|Xijk) be a marginal mean response given Xijk. The marginal model is

gμijk=Xijk'β.

Let Yij = (Yij1,⋯,YijK), μij = (μij1, ⋯, μijK), and Xij = (Xij1,⋯,XijK) be the 1 × K response vector, 1 × K marginal mean response vector, p × K covariate matrix of provider j in practice i, respectively. Let Yi=(Yi1,,Yini)',μi=(μi1,,μini)', and Xi=Xi1,,Xini be the matrices of responses, marginal mean responses and covariate of the providers in practice i, respectively. The mean of Yi is denoted by μi = E(Yi) and the variance of Yi is var(Yi|Xi)=θAi1/2Ri0ω0Ai1/2, where Ai=diagγμi11,,γμi1K,,γμini1,,γμiniK, and a Kni × Kni correlation matrix Ri0(ω0) describes the correlation of measures within the ith practice with a vector of association parameters denoted by ω0. Both γ and θ are dependent on the distribution of responses. If Yijk is binary, γ(μijk) = μijk(1 − μijk) and θ = 1. Liang and Zeger19 showed that m(β^β) is asymptotically multivariate normal with a covariance matrix VR=limmmΣ11Σ0Σ11, where Σ1=i=1mDi'Vi1Di, Σ0=i=1mDi'Vi1covYiXiVi1Di, Di = ∂μi/∂β′, and Vi is a working covariance matrix of Yi. Let Riw(ω) be a Kni × Kni working correlation matrix with a vector of association parameters ω. The working covariance matrix is expressed as Vi=θAi1/2RiwωAi1/2 and is unequal to var (Yi|Xi) unless Riw(ω) = Ri0(ω0).

For three-level data, Teerenstra et al. proposed a “nested exchangeable” correlation structure 25:

  1. Correlation among participants within the same provider in the same practice, is constant, CorrYijk1,Yijk2=r for k1k2;

  2. Correlation among participants with different providers in the same practice, is constant, CorrYij1k1,Yij2k2=ρ for j1j2, and any k1, k2;

This three-level exchangeable working correlation structure was defined as

Riwr,ρ=ρ1Kni×Kni+rρBdiagni(1K×K)+(1r)IKni×Kni,

where 1i×i is a i × i matrix of 1’s, Bdiagi(A) is a block diagonal matrix with matrix element A replicated i times, and Ii×i is the i × i identity matrix. Here, Riw(r,ρ) must be positive definite (PD). Given a value of K and ni, PD can be determined if the constraints holds, min(λ1,λ2,λ3i) > 0, where λ1 = 1 − r, λ2 = 1 + (K − 1)r, λ3i = 1 + (K − 1)r + K(ni − 1)ρ are the distinct eigenvalues of Riw(r,ρ). The proof was provided by Web Appendix A of Li et al.26 Here, the constraints are equivalent to

1K1<r<1,1+K1rKni1<ρ<1+K1rK. (1)

We assume this “nested exchangeable” correlation structure in the following sections.

Suppose we are interested in testing the treatment effect for a two-group comparison: the treated vs. control. The treatment assignment is coded in the last column of the practice covariate matrices Xi' and the corresponding last parameter of β is βp. Let Vβ denote the (p, p)th element of VR. Thus, m(βp^βp) has an asymptotically normal distribution N(0,Vβ), or equivalently, Varβp^=Vβ/m. For simplicity, we take p = 2, i.e. coefficients β1 is the intercept and β2 is the treatment effect. The practice allocations of the treatment and control groups are, mtrt = and mcont = m(1 − π), respectively, where π is a pre-determined value, e.g, 50%. The hypotheses of interest are H0: β2 = 0 versus H1: β2 = β. For a binary outcome, let p0 and p1 be the success rates in the control and treated group. When the identity link function, g(μijk) = μijk, is specified, β2 = p1p0 is the risk difference (RD) between two groups; when the log link function, g(μijk) = ln(μijk), is specified, β2 = ln(p1/p0) is the difference between the natural logarithms of the proportions; and when the logit link function, gμijk=lnμijk/1μijk, is specified, β2=lnp1/(1p1)p0/(1p0) is the difference between the natural logarithms of the odds. When the log and logit link functions are used, taking the exponential of β2 refers to the risk ratio (RR) and the odds ratio (OR), respectively.

Given the “nested exchangeable” correlation structure, we use identity link function and have

Varβ2^=λ3Knmp11p1π+p01p01π, (2)

where nin and the eigenvalue λ3 = 1 + (K − 1)r + K(n − 1)ρ. If we consider the log link function, then

Varβ2^=λ3Knm1p1πp1+1p01πp0. (3)

Using logit link function in the GEE model for a binary outcome, we have

Varβ2^=λ3Knm1πp11p1+1(1π)p01p0. (4)

Please note that Equation (4) is the same as the formula in section 4.425 and reduces to Equation (8)27 when K = 1.

From the relationship between β2 and RD, the asymptotic variance of RD^ is

VarRD^=λ3Knmp11p1π+p01p01π. (5)

Applying the delta method, we obtain the asymptotic variances of RR^ and OR^ as

VarRR^=λ3Knm1p1πp1+1p0p01πexp2p1p0, (6)

and

VarOR^=λ3Knm1πp11p1+1(1π)p01p0exp2p1/(1p1)p0/(1p0). (7)

3. Local optimal design

Assume the study cost per practice is c currency units (e.g. $US), and each provider costs s currency units, e denotes each participant’s cost. The total budget B in a three-level trial is defined as

B=mc+sn+eKn. (8)

We aim to find the optimal design (OD) given the constraint in Equation (8). The term “optimal” refers to the variance of each measure estimator being minimized for a given sampling budget. 10,17,28,29

First, we assume that provider size K is a pre-determined value, same as B, c, s and e, for simplicity. The goal is to find the pair of m and n that minimizes the variance of each measure, which is equivalent to maximizing

L=Knmλ3 (9)

for all three measures (RD, RR, OR). Substituting m=Bc+s+eKn gives

L=KnBλ2c+λ2b+Kρcn+Kbρn2,

where b = s + eK. Taking the partial derivatives with respect to n gives

Lnλ2cKbρn2.

Since Riw(r,ρ) is PD, λ2 is positive. It can be shown that when

n=λ2cKbρ, (10)

where ρ should be positive, the derivatives equals 0 and L is maximized. The local optimal design (LOD) is reached for a known pair value (r,ρ) and n in equation (9) is denoted by nLOD. Let ϑ=λ2Kρ, the parameters in LOD are given by

nLOD=ϑcb,mLOD=Bϑbc+c. (11)

Please note ρ<1+K1rcKc+s+eK in order to be nLOD > 1. Thus, 0<ρ<min1+K1rK,1+K1rKcc+s+eK=1+K1rcKc+s+eK since Equation (1) also holds. For any measures (RD, RR, OR), the local optimal design is the same even if the variance of measure estimator is different. Obviously nLOD and mLOD may be non-integer. In reality we need to choose an integer value for practice size with either nup = int (nLOD) + 1 or ndown = int (nLOD), where “int” refers to an integer part of a number. We then calculate mup and mdown from m=Bc+s+eKn. Similarly, mup and mdown are most likely non-integers. In order to meet the limit of budget, the integer parts for mup and mdown are taken as the values of corresponding number of practices. Then we can calculate the corresponding L using Equation (9) and the proposed optimal practice size and number of practices is the one with the larger L.

Second, when the provider size K is not a fixed value but within a range (Kmin,Kmax) and Kmin ≥ 2, we find nLOD and mLOD for each value of K within this range and calculate the corresponding L in Equation (9). The design with the maximum of L within a range (Kmin,Kmax) is defined as LOD. Given

nLODmLOD=Bbb+cϑ,

it is easy to show that both KnLODmLOD and λ3 are increasing functions of K but KnLODmLODK and λ3K when K ≥ 3. That is, L decreases when K increases for K ≥ 3. Therefore, the LOD is reached at K = Kmin if Kmin ≥ 3 and K = 3 if Kmin = 2 for a known pair value (r,ρ).

Table 1 shows an example to determine LOD for r = 0.6 and ρ = 0.03, where 3 ≤ K ≤ 10, B = 300,000, c = 10,000, s = 100 and e = 10 are assumed. For each K, n and m are calculated from Equation (11). Both integers are chosen as discussed previously and the corresponding L is calculated from Equation (9). The design with K = 3, n = 43, and m = 18 is LOD since L is maximized at K = 3. Please note L is not monotone decreasing in Table 1 since the calculations are provided for (n,m) as integers only. The power estimates for RD, RR and OR are provided for p0 = 0.3 and p1 = 0.45. It definitely demonstrates that the power is maximized, equivalently the variance is minimized, when LOD is reached.

Table 1.

Local optimal design for B = 300000, c = 10000, s = 100 and e = 10 with known correlations r = 0.6 and ρ = 0.03

K Practice size n Number of practices m L Flag Power [1] Power [2] Power [3]
3 43 18 388.3 1 0.871 0.850 0.859
4 40 18 385.0 0.868 0.847 0.856
5 39 18 385.7 0.869 0.848 0.857
6 37 18 381.3 0.865 0.844 0.853
7 36 18 379.6 0.863 0.842 0.851
8 34 18 373.2 0.858 0.836 0.845
9 33 18 370.2 0.855 0.833 0.843
10 32 18 366.9 0.852 0.830 0.839

n and m are calculated from Equation (11).

L is calculated from Equation (9).

Flag=1 refers to LOD.

[1]

RD for p0 = 0.3 and p1 = 0.45.

[2]

RR for p0 = 0.3 and p1 = 0.45.

[3]

OR for p0 = 0.3 and p1 = 0.45.

4. MaxiMin optimal design

First, we still assume that the provider size K is a pre-determined value. Obviously nLOD in the Equation (11) depends on (r,ρ). In practice, the pair value of (r,ρ) could be unknown before a study starts. If the ranges, (rmin, rmax) and (ρmin, ρmax), can be obtained from previous studies or other literature, then we define them as the parameter space.30,31 The range of practice size based on the practical feasibility, (nmin, nmax), is defined as the design space.17,32,33 The objective is to identify OD within the parameter and design spaces.

Inserting (11) in (5)(7) gives the variance of each measure estimator for the optimal design. For example,

VarRD^=g(r,ρ)×1Bp11p1π+p01p01π, (12)

where gr,ρ=ρc+1+K1rKρKs+eK2.

Following the same definition of RE,17 the ratio of the variance of each measure estimator for practice size nLOD to n, we use Equations (5), (8) and (12) and then define RE for measure RD as a function of n, r, and ρ,

REn,r,ρ=g(r,ρ)1+K1r+Kn1ρ×Knc+s+eKn. (13)

It is easy to show that REs for both RR and OR measures are the same as Equation (13). Further, the maximal value of RE(n,r,ρ) is 1 and reached when n is nLOD. Figure 1 shows how RE changes across practice size n for a fixed r, and Figure 2 shows the trend of RE over practice size n for a fixed ρ, where K = 3, B = 300,000, c = 10,000, s = 100 and e = 10. Both figures demonstrate that RE increases until it reaches 1 and then decreases as practice size n increases. Among four REs with the different values of ρ in Figure 1, we observe that the practice size n at which RE equals 1 is the smallest when ρ = 0.7 and is the largest when ρ = 0.1. Similarly, we notice that the practice size n at which RE equals 1 is the smallest when r = 0.1 and is the largest when r = 0.7 in Figure 2.

Figure 1.

Figure 1

Relative efficiencies RE(n,r,ρ) as a function of n for K = 3, B = 300000, c = 10000, s = 100 and e = 10 with r = 0.8

Figure 2.

Figure 2

Relative efficiencies RE(n,r,ρ) as a function of n for K = 3, B = 300000, c = 10000, s = 100 and e = 10 with ρ = 0.05

MaxiMin design (MMD) is a design that maximizes some measure of performance (or minimize the risk) in the worst case scenario. 3134 Here, we use RE, quantified as Equation (13), as the measure of performance. Specifically, the MMD includes three steps. Step 1 defines the parameter and design spaces; Step 2 computes LOD for each pair value of (r,ρ) in the parameter space, and then computes the RE of each design in the design space; Step 3 finds its smallest RE value within the parameter space for each design in the design space and selects the design which maximizes the minimum RE among all designs in the design space. This MMD considers the worst case scenario and thus is robust against misspecification of the values of (r,ρ).

RE of any of the three measures, shown in Equation (13), is a function of n, r, and ρ given the costs c per practice, s per provider, e per participant, and the provider size K. First, Appendix 1 proves that RE(n, r, ρ) is minimized at one of the four points: (rmin, ρmin), (rmin, ρmax), (rmax, ρmin), (rmax, ρmax), i.e., the boundary of the parameter space (rmin, rmax) and (ρmin, ρmax). Figure 3 presents RE(n, rmin, ρmin), RE(n, rmin, ρmax), RE(n, rmax, ρmin) and RE(n, rmax, ρmax) as functions of n for K = 3, B = 300,000, c = 10,000, s = 100 and e = 10 with parameter space (rmin = 0.1, rmax = 0.9) and (ρmin = 0.01, ρmax = 0.05). Next, Appendix 2 shows that the minimum of RE(n,r,ρ) is maximized by the design satisfying RE(n, rmin, ρmax) = RE(n, rmax, ρmin). Let n^ be a solution of RE(n, rmin, ρmax) = RE(n, rmax, ρmin), and expressed as

1+K1rmaxKρmingrmin,ρmax1+K1rminKρmaxg(rmax,ρmin)Kρmaxgrmax,ρminρmingrmin,ρmax. (14)

As shown in Figure 3, the black vertical straight line indicates n^ and locally optimal designs for (rmin, ρmax) and (rmax, ρmin) are added as references. By dividing g(rmin, ρmax) and g(rmax, ρmin) by the study cost per practice c, we notice that MMD of practice sizes depends on (rmin, ρmax), (rmax, ρmin), and ratio (s + eK)/c. That is, the total budget B determines the number of practices m but not practice size n.

Figure 3.

Figure 3

Relative efficiencies RE(n, rmin, ρmin), RE(n, rmin, ρmax), RE(n, rmax, ρmin), and RE(n, rmax, ρmax) as a function of n and locally optimal designs LOD (rmin, ρmax) and LOD (rmax, ρmin) for K = 3, B = 300000, c = 10000, s = 100 and e = 10 with parameter space (rmin = 0.1, rmax = 0.9) and (ρmin = 0.01, ρmax = 0.05)

Now we provide a step by step approach to find an MMD for a two-arm three-level CRT with a binary outcome when the provider size K is a pre-determined value.

Step 1: Define the parameter space (rmin, rmax), (ρmin, ρmax) and design space (nmin, nmax), respectively.

Step 2: Calculate n^ using Equation (14).

  1. If it is within the range (nmin,nmax), then set nMMD=n^ and the corresponding mMMD=intBc+s+eKnMMD.

  2. If it is outside of (nmin,nmax), calculate RE(n, rmin, ρmin), RE(n, rmin, ρmax), RE(n, rmax, ρmin), and RE(n, rmax, ρmax) for each practice size n ∈ (nmin, nmax) and take their minimum. Choose the design of (n,m) that has the maximum of minimum RE within design space, where m=int(Bc+s+eKn).

Again, n^ may be non-integer. We use the same method in Section 3 to get the integer practice size and number of practices. Please note that Equation (13) is derived using Equation (8) as well. If the calculated mMMD from the above approach is infeasible, then the range (nmin,nmax) needs to be revised appropriately.

Table 2 shows an example to determine MMD, where the same setting as Figure 3 is assumed. We obtain n^=46.6 using Equation (13). If the design space is (11, 20), then the design of (n = 20, m = 23) is MMD under the budget constraints; on the other hand, if the design space is (41, 50), then the design of (n = 47, m = 18) is MMD under the budget constraints.

Table 2.

Maximin design for K = 3, B = 300000, c = 10000, s = 100 and e = 10 with parameter space (rmin = 0.1, rmax = 0.9) and (ρmin = 0.01, ρmax = 0.05)

Design space (nmin, nmax) Practice size n RE(n, rmin, ρmin) RE(n, rmin, ρmax) RE(n, rmax, ρmin) RE(n, rmax, ρmax) Min RE Number of practices m Flag
(11, 20) 11 0.5642 0.9059 0.4090 0.7346 0.4090 26
12 0.5966 0.9257 0.4369 0.8656 0.4369 25
13 0.6288 0.9421 0.4636 0.7935 0.4636 25
14 0.6550 0.9556 0.4892 0.8184 0.4892 25
15 0.6813 0.9667 0.5136 0.8408 0.5136 25
16 0.7059 0.9757 0.5369 0.8609 0.5369 24
17 0.7287 0.9829 0.5592 0.8788 0.5592 24
18 0.7501 0.9886 0.5806 0.8949 0.5806 24
19 0.7700 0.9929 0.6010 0.9093 0.6010 24
20 0.7886 0.9961 0.6205 0.9221 0.6205 23 1
(41, 50) 41 0.9799 0.9441 0.8809 0.9975 0.8809 19
42 0.9831 0.9394 0.8881 0.9963 0.8881 19
43 0.9859 0.9347 0.8950 0.9949 0.8950 19
44 0.9884 0.9299 0.9016 0.9932 0.9016 19
45 0.9907 0.9251 0.9079 0.9913 0.9079 18
46 0.9926 0.9202 0.9138 0.9893 0.9138 18
47 0.9943 0.9154 0.9195 0.9872 0.9154 18 1
48 0.9958 0.9105 0.9249 0.9849 0.9105 18
49 0.9970 0.9056 0.9301 0.9825 0.9056 18
50 0.9980 0.9008 0.9350 0.9799 0.9008 18

RE is calculated from Equation (13).

Flag=1 refers to MMD.

Second, when the provider size K is not a fixed value but within a range (Kmin,Kmax) and Kmin ≥ 2, we find nMMD and mMMD for each value of K within this range and calculate the corresponding RE in Equation (13). The design with the maximum of RE is defined as MMD.

Table 3 demonstrates how to find MMD with parameter space (rmin = 0.1, rmax = 0.9) and (ρmin = 0.01, ρmax = 0.05) where 3 ≤ K ≤ 10, B = 300,000, c = 10,000, s = 100 and e = 10 are assumed. For each K, nMMD and mMMD are calculated from the step by step approach and the corresponding RE is provided. If the design space is (11, 20), then the design of (K = 10, n = 20, m = 21) is MMD under the budget constraints; on the other hand, if the design space is (41, 50), then the design of (K = 3, n = 47, m = 18) is MMD under the budget constraints. SAS macros %OD_3Level_FixedK and %OD_3Level_RangeK are developed to find LOD and MMD when the corresponding parameters are provided.

Table 3.

MaxiMin design for B = 300000, c = 10000, s = 100 and e = 10 with parameter space (rmin = 0.1, rmax = 0.9) and (ρmin = 0.01, ρmax = 0.05)

Design space (nmin, nmax) K Practice size n Number of practices m RE Flag
(11, 20) 3 20 23 0.6205
4 20 23 0.6369
5 20 23 0.6517
6 20 22 0.6653
7 20 22 0.6781
8 20 22 0.6901
9 20 21 0.7014
10 20 21 0.7121 1
(41, 50) 3 47 18 0.9154 1
4 43 18 0.9032
5 41 18 0.8876
6 41 18 0.8638
7 41 17 0.8421
8 41 17 0.8222
9 41 16 0.8037
10 41 16 0.7866

n and m are calculated from step by step approach.

RE is calculated from Equation (13).

Flag=1 refers to MMD.

Last, we conduct a sensitivity analysis about the parameter space. The following eight different parameter spaces are considered: (rmin = 0.1, rmax = 0.9) and (ρmin = 0.01, ρmax = 0.05), (rmin = 0.1, rmax = 0.3) and (ρmin = 0.01, ρmax = 0.05), (rmin = 0.3, rmax = 0.6) and (ρmin = 0.01, ρmax = 0.05), (rmin = 0.6, rmax = 0.9) and (ρmin = 0.01, ρmax = 0.05), (rmin = 0.1, rmax = 0.9) and (ρmin = 0.01, ρmax = 0.02), (rmin = 0.1, rmax = 0.9) and (ρmin = 0.02, ρmax = 0.03), (rmin = 0.1, rmax = 0.9) and (ρmin = 0.02, ρmax = 0.05), (rmin = 0.1, rmax = 0.9) and (ρmin = 0.03, ρmax = 0.05). We still assume 3 ≤ K ≤ 10, B = 300,000, c = 10,000, s = 100 and e = 10. Table 4 shows the MMDs for two different design space (2, 20) and (2, 50). If nmax is relatively small, e.g. < n^, MMDs are the same (K = 10, n = 20, m = 21). They might be different otherwise. That is, MMDs are insensitive to the parameter space when the maximum of practice size is relatively small.

Table 4.

MaxiMin design for B = 300000, c = 10000, s = 100 and e = 10 with 3 ≤ K ≤ 10

Design space (nmin, nmax) rmin rmax ρmin ρmax K Practice size n Number of practices m RE
(2, 20) 0.1 0.9 0.01 0.05 10 20 21 0.7121
0.1 0.3 0.01 0.05 10 20 21 0.8717
0.3 0.6 0.01 0.05 10 20 21 0.7754
0.6 0.9 0.01 0.05 10 20 21 0.7121
0.1 0.9 0.01 0.02 10 20 21 0.7121
0.1 0.9 0.02 0.03 10 20 21 0.8365
0.1 0.9 0.02 0.05 10 20 21 0.8365
0.1 0.9 0.03 0.05 10 20 21 0.9031
(2, 50) 0.1 0.9 0.01 0.05 3 47 18 0.9154
0.1 0.3 0.01 0.05 3 41 19 0.9441
0.3 0.6 0.01 0.05 3 47 18 0.9446
0.6 0.9 0.01 0.05 4 50 17 0.9446
0.1 0.9 0.01 0.02 5 49 17 0.9466
0.1 0.9 0.02 0.03 5 43 19 0.9751
0.1 0.9 0.02 0.05 3 41 19 0.9441
0.1 0.9 0.03 0.05 3 41 19 0.9441

5. Example

Teerenstra et al. discussed the Helping Hands trial (Netherlands Organization for Health Research and Development ZonMw, grant number 80–007028-98–07101).25 This study aimed to change nurse behavior through two strategies and randomized the wards to either strategy. The two strategies included the state-of-the-art strategy, which is derived from literature regarding education, reminders, feedback, and targeting adequate products and facilities; and the extended strategy, which contains all elements of the state-of-the-art strategy plus activities aimed at influencing social influence in groups and enhancing leadership. The primary endpoint was adherence to hygiene guidelines (Yes vs. No) and multiple evaluations of nurses’ guideline adherence were observed. The researchers expected to improve the adherence from 60% in the state-of-the-art strategy to 70% in the extended strategy. Teerenstra et al. considered the constant behavior of nurse r = 0.6 and intra-ward coefficient correlation ρ = 0.03.25 We calculated the total number of wards m = 58 to obtain 80% power using the number of nurses per ward n = 15 and number of evaluations K = 3 under the same assumptions of (r, ρ)) using Equation (4). We assume c = 2,000, s = 50 and e = 10 in this study, then the total cost 58*(2000 + 50 * 15 + 10 * 3 * 15) = 185,600 will be needed.

We now apply LOD and MMD approaches to redesign this study with the same budget B = 185,600. We consider 3 ≤ K ≤ 6 and find that LOD is K = 3, n = 25 and m = 46. The power is 83.7% under this scenario. It is worth mentioning that our proposed method does not guarantee obtaining the desired power, e.g., 80%, but to have the highest power under the budget constraints. Researchers should increase the budget if our proposed method does not reach the desired power.

On the other hand, if the researchers have no clear pictures of these two associations, then the parameter space need to be specified. Campbell et al. showed the ICC interquartile range of implantation studies in secondary care from 0.017 to 0.221.3 As Teerenstra et al. mentioned, the behavior of an individual nurse with respect to hand hygiene is constant, the parameter space 0.5 ≤ r ≤ 0.9 is reasonably assumed. Now the parameter space lies within (rmin = 0.5, rmax = 0.9) and (ρmin = 0.017, ρmax = 0.221) and the design space is set as (3, 50), then number of evaluations K = 3, the number of nurses per ward n = 17 and the total number of wards m = 55 is our proposed MMD given the budget B = 185,600.

6. Discussion

In this paper, we presented optimal designs based on GEE models in three-level CRTs and proposed both LODs and MMDs under budget constraints. We employed a nested exchangeable correlation structure25 and derived the variance of the treatment effect under the assumption of an equal practice size and the same provider size. We derived the locally optimal design when the correlation among participants within the same provider in the same practice, r, and correlation among participants with different providers in the same practice, ρ, are known; the optimal design aims to minimize the variance of each measure estimator for a given sampling budget. If the correlation pair (r,ρ) is unknown bulongitudinal data setting with AR(1)t lies in a known range, we proposed MMDs for three-level CRTs for a range of r and ρ. We also developed SAS macros to find the LOD and MMD for practical use.

Our method can be extended in several directions. First, our proposed approach is based on the nested exchangeable correlation structure only. It is suitable when the lowest level units are exchangeable within the middle level units (‘providers’) and the middle level units are exchangeable within the highest level units (‘practices’).25 We will consider more sophisticated settings, e.g., a longitudinal data setting with AR(1) correlation among repeated measures over time in our future work. Second, we assume the same practice size and same provider size. If the practice size is different across the providers, the variances of estimator of treatment effects are more complicated than Equations (2)(4). The derivation of these formulae warrants further research. Third, when GEE models with identity or log link are used to analyze correlated binary data, the convergence issues may occur since the predicted probability is unconstrained. Fourth, the empirical sandwich estimator of the covariance matrix obtained from GEE is biased for a small number of clusters and thus can inflate type I error rates. The proposed LOD and MMD based on the asymptotic variance might be worthy of further investigation. Finally, it merits further consideration to extend treatment groups to more than two.

Supplementary Material

Supp info1
Supp info2

Acknowledgements

We thank the Alvin J. Siteman Cancer Center at Washington University School of Medicine and Barnes-Jewish Hospital in St. Louis, MO., for supporting this research (P30 CA91842). Lei Liu’s work was supported by the Washington University Institute of Clinical and Translational Sciences grant UL1TR000448 from the National Center for Advancing Translational Sciences (NCATS) of the National Institutes of Health (NIH). The content is solely the responsibility of the authors and does not necessarily represent the official view of the NIH.

Appendix

The proof consists of two steps. Appendix 1 shows that RE(n,r,ρ) is minimized at one of the four points: (rmin, ρmin), (rmin, ρmax), (rmax, ρmin), (rmax, ρmax), i.e., the boundary of the parameter space (rmin, rmax) and (ρmin, ρmax). Next, Appendix 2 shows that the minimum of RE(n,r,ρ) is maximized by the design satisfying RE(n, rmin, ρmax) = RE(n, rmax, ρmin).

Appendix 1

Proof that RE(n,r,ρ) is minimized at one of the four points: (rmin,ρmin), (rmin, ρmax), (rmax,ρmin), (rmax, ρmax) within the parameter space (rmin, rmax) and (ρmin, ρmax)

From Equations (5), (8) and (12), it follows that the RE for measure RD as a function of n, r, and ρ for any of the three measures.

REn,r,ρ=g(r,ρ)1+K1r+Kn1ρ×Knc+s+eKn.

Take the partial derivative with respect to ρ gives

REn,r,ρρcρKs+eK1+K1rKρ1+K1r+Kn1ρρc+1+K1rKρKs+eKKn1.

Setting the right hand to zero we obtain

ρ*=c1+K1rKc+s+eKn2.

Then take the partial derivative with respect to r gives

REn,r,ρr1+K1r+Kn1ρs+eKK1+K1rKρρc+1+K1rKρKs+eK.

Similarly, we set the right hand to zero and have

r*=Kρc+s+eKn2ccK1.

Both are actually same as nLOD in Equation (10). We can show that REn,r,ρρ>0 if ρminρ < ρ*, and REn,r,ρρ<0 if ρ* < ρρmax, so RE(n,r,ρ) is minimized at either ρ = ρmin or ρ = ρmax for a fixed r. If we assume the possible range for ρ is (0.1, 0.7) and r = 0.8, Appendix Figure 1 demonstrates 3-D RE plot as a function of n and ρ while Figure 1 shows RE plots for 4 paired values (r,ρ). As seen in Figure 1, RE(n,r,ρ) is minimized at ρ = 0.1 when n < 13 and at ρ = 0.7 when n > 13. Similarly, REn,r,ρr>0 if rminr < r*, and REn,r,ρr<0 if r* < rrmax, so RE(n,r,ρ) is minimized at either r = rmin or r = rmax for a fixed ρ. Appendix Figure 2 demonstrates 3-D RE plot as a function of n and r, r ∈ (0.1, 0.7) for a fixed ρ = 0.05. Shown in Figure 2 with ρ = 0.05, RE(n,r,ρ) is minimized at r = 0.7 when n < 28 and r = 0.1 when n < 28. When combining these characteristics, we conclude that RE(n,r,ρ) is minimized at (rmin,ρmin), or (rmin,ρmax), or (rmax,ρmin), or (rmax,ρmax) within the parameters space (rmin,rmax) and (ρmin, ρmax).

Appendix 2

Proof that the minimum of RE(n,r,ρ) is maximized by the design satisfying RE(n, rmin, ρmax) = RE(n, rmax, ρmin)

Inserting (rmin,ρmin) in Equation (12) and taking the partial derivative with respect to n gives

REn,rmin,ρminnacKbρn2.

Following the proof in Section 3, RE(n, rmin, ρmin) is a single-peaked function and maximized at

nrmin,ρmin=1+K1rminKρminKρmin×cb

with maximum of 1. Similarly, RE(n, rmin, ρmax) is maximized at

nrmin,ρmax=1+K1rminKρmaxKρmax×cb;

RE(n, rmax, ρmin) is maximized at

nrmax,ρmin=1+K1rmaxKρminKρmin×cb;

and RE(n, rmax, ρmax) is maximized at

nrmax,ρmax=1+K1rmaxKρmaxKρmax×cb.

Since ρmin < ρmax, it gives nrmin,ρmin>nrmin,ρmax and nrmax,ρmin>nrmax,ρmax. Further, rmin < rmax is followed by nrmin,ρmin<nrmax,ρmin and nrmin,ρmax<nrmax,ρmax. Thus, nrmin,ρmax is the smallest and nrmax,ρmin is the largest. All four REs have a maximum of 1 and the maximums are reached at nrmin,ρmin, nrmin,ρmax, nrmax,ρmin, and nrmax,ρmax, respectively.

Following the proof of Appendix in Breukelen et al,20, min RE between any two of RE(n, rmin, ρmin), RE(n, rmin, ρmax), RE(n, rmax, ρmin) and RE(n, rmax, ρmax) is maximized by the design satisfying these two REs are equal. For example, min RE for RE(n, rmin, ρmin), and RE(n, rmin, ρmax) is maximized by the design satisfying RE(n, rmin, ρmin) = (n, rmin, ρmax). For any two pair values, (r0,ρ0) and (r1,ρ1), the intersection means

REn,r0,ρ0=REn,r1,ρ1yieldsg(r0,ρ0)1+K1r0+Kn1ρ0=g(r1,ρ1)1+K1r1+Kn1ρ1.

Its only solution is

n=1+K1r1Kρ1gr0,ρ01+K1r0Kρ0g(r1,ρ1)Kρ0gr1,ρ1ρ1gr0,ρ0.

That is, there is only one intersection between any two RE(n,r,ρ)s in the function of n. Therefore, there are total six intersections across these four REs. For example, Figure 3 demonstrates REs at the four points and all six intersections. Given all facts that these four are single-peaked functions, nrmin,ρmax is the smallest and nrmax,ρmin is the largest, and the only one intersection between any two REs, it is obvious that the minimum of RE(n,r,ρ) for these six intersections is reached at the intersection of RE(n, rmin, ρmax) = RE(n, rmax, ρmin). In other words, the minimum of RE(n,r,ρ) is maximized by the design satisfying RE(n, rmin, ρmax) = RE(n, rmax, ρmin).

Figure 1.

Figure 1

3-d relative efficiencies RE(n,r,ρ) as a function of n and ρ for K = 3, B = 300000, c = 10000, s = 100 and e = 10 with r = 0.8

Figure 2.

Figure 2

3-d relative efficiencies RE(n,r,ρ) as a function of n and r for K = 3, B = 300000, c = 10000, s = 100 and e = 10 with ρ = 0.05

Footnotes

Conflict of Interest

The authors have declared no conflict of interest.

References

  • 1.Brownson RC, Colditz GA, Proctor EK. Dissemination and Implementation Research in Health: Translating Science to Practice. Oxford University Press; 2018. [Google Scholar]
  • 2.James AS, Richardson V, Wang JS, Proctor EK, Colditz GA. Systems intervention to promote colon cancer screening in safety net settings: protocol for a community-based participatory randomized controlled trial. Implementation Science : IS. 2013;8:58–58. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Campbell MK, Mollison J, Steen N, Grimshaw JM, Eccles M. Analysis of cluster randomized trials in primary care: a practical approach. Family Practice. 2000;17(2):192–196. [DOI] [PubMed] [Google Scholar]
  • 4.Gulliford MC, van Staa TP, McDermott L, McCann G, Charlton J, Dregan A. Cluster randomized trials utilizing primary care electronic health records: methodological issues in design, conduct, and analysis (eCRT Study). Trials. 2014;15:220. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Gravenstein S, Dahal R, Gozalo PL, et al. A cluster randomized controlled trial comparing relative effectiveness of two licensed influenza vaccines in US nursing homes: Design and rationale. Clinical Trials. 2016. [DOI] [PubMed] [Google Scholar]
  • 6.Kalfon P, Mimoz O, Loundou A, et al. Reduction of self-perceived discomforts in critically ill patients in French intensive care units: study protocol for a cluster-randomized controlled trial. Trials. 2016;17(1):87. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Mehring M, Haag M, Linde K, Wagenpfeil S, Schneider A. Effects of a Web-Based Intervention for Stress Reduction in Primary Care: A Cluster Randomized Controlled Trial. Journal of Medical Internet Research. 2016;18(2):e27. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Nagayama H, Tomori K, Ohno K, et al. Effectiveness and Cost-Effectiveness of Occupation-Based Occupational Therapy Using the Aid for Decision Making in Occupation Choice (ADOC) for Older Residents: Pilot Cluster Randomized Controlled Trial. PLoS One. 2016;11(3):e0150374. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Yamagata K, Makino H, Iseki K, et al. Effect of Behavior Modification on Outcome in Early- to Moderate-Stage Chronic Kidney Disease: A Cluster-Randomized Trial. PLoS One. 2016;11(3):e0151422. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Raudenbush S Statistical analysis and optimal design for cluster randomized trials. Psychol Methods. 1997;2:173–185. [DOI] [PubMed] [Google Scholar]
  • 11.Raudenbush S, Liu X. Statistical power and optimal design for multisite trials. Psychol Methods. 2000;5(2):199–213. [DOI] [PubMed] [Google Scholar]
  • 12.Moerbeek M, Van Breukelen G, Berger M. Optimal experimental design for multilevel logistic models. The Statistician. 2001;50(1):17–30. [Google Scholar]
  • 13.Moerbeek M, Van Breukelen G, Berger M. Optimal experimental designs for multilevel models with covariates. Commun Stat Theory Methods. 2001;30(12):2683–2697. [Google Scholar]
  • 14.Connelly L Balancing the number and size of sites: an economic approach to the optimal design of cluster samples. Control Clin Trials. 2003;24:544–559. [DOI] [PubMed] [Google Scholar]
  • 15.Headrick T, Zumbo B. On optimizing multi-level designs: power under budget constraints. Austr N Z J Stat. 2005;47(2):219–229. [Google Scholar]
  • 16.Liu X Statistical power and optimum sample allocation ratio for treatment and control having unequal costs per unit of randomization. J Educ Behav Stat. 2003;28(3):231–248. [Google Scholar]
  • 17.Van Breukelen G, Candel M. Efficient design of cluster randomized and multicentre trials with unknown intraclass correlation. Stat Methods Med Res. 2015;24(5):540–556. [DOI] [PubMed] [Google Scholar]
  • 18.Wu S, Wong WK, Crespi CM. Maximin Optimal Designs for Cluster Randomized Trials. Biometrics. 2017;73(3):916–926. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Liang K-Y, Zeger SL. Longitudinal Data Analysis Using Generalized Linear Models. Biometrika. 1986;73(1):13–22. [Google Scholar]
  • 20.Toriola AT, Liu J, Ganz PA, et al. Effect of weight loss on bone health in overweight/obese postmenopausal breast cancer survivors. Breast cancer research and treatment. 2015;152(3):637–643. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Lin CC, Bruinooge SS, Kirkwood MK, et al. Association Between Geographic Access to Cancer Care, Insurance, and Receipt of Chemotherapy: Geographic Distribution of Oncologists and Travel Distance. Journal of Clinical Oncology. 2015;33(28):3177–3185. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Park YH, Jung KH, Im SA, et al. Quality of life (QoL) in metastatic breast cancer patients with maintenance paclitaxel plus gemcitabine (PG) chemotherapy: results from phase III, multicenter, randomized trial of maintenance chemotherapy versus observation (KCSG-BR07–02). Breast cancer research and treatment. 2015;152(1):77–85. [DOI] [PubMed] [Google Scholar]
  • 23.Jeffe DB, Perez M, Cole EF, Liu Y, Schootman M. The Effects of Surgery Type and Chemotherapy on Early-Stage Breast Cancer Patients’ Quality of Life Over 2-Year Follow-up. Annals of surgical oncology. 2016;23(3):735–743. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Sanda MG, Dunn RL, Michalski J, et al. Quality of life and satisfaction with outcome among prostate-cancer survivors. New England Journal of Medicine. 2008;358(12):1250–1261. [DOI] [PubMed] [Google Scholar]
  • 25.Teerenstra S, Lu B, Preisser JS, van Achterberg T, Borm GF. Sample size considerations for GEE analyses of three-level cluster randomized trials. Biometrics. 2010;66(4):1230–1237. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Li F, Turner EL, Preisser JS. Sample size determination for GEE analyses of stepped wedge cluster randomized trials. Biometrics. 2018;Early View. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Shih WJ. Sample Size and Power Calculations for Periodontal and Other Studies with Clustered Samples Using the Method of Generalized Estimating Equations. Biometrical Journal. 1997;39(8):899–908. [Google Scholar]
  • 28.Moerbeek M, Van Breukelen G, Berger M. Design Issues for Experiments in Multilevel Populations. Journal of Educational and Behavioral Statistics. 2000;25(3):271–284. [Google Scholar]
  • 29.Liu J, Colditz GA. Optimal design of longitudinal data analysis using generalized estimating equation models. Biometrical journal Biometrische Zeitschrift. 2017;59(2):315–330. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Atkinson A, Donev A, Tobias R. Optimum experimental designs, with SAS. Oxford, UK: Oxford University Press,; 2007. [Google Scholar]
  • 31.Berger M, Wong W. An introduction to optimal designs for social and biomedical research. Chichester, UK: Wiley; 2009. [Google Scholar]
  • 32.Winkens B, Schouten HJ, van Breukelen GJ, Berger MP. Optimal designs for clinical trials with second-order polynomial treatment effects. Statistical methods in medical research. 2007;16(6):523–537. [DOI] [PubMed] [Google Scholar]
  • 33.Mario JNMO, Frans EST, Martijn PFB. Maximin D-Optimal Designs for Longitudinal Mixed Effects Models. Biometrics. 2002;58(4):735–741. [DOI] [PubMed] [Google Scholar]
  • 34.Maus B, van Breukelen GJ, Goebel R, Berger MP. Robustness of optimal design of fMRI experiments with application of a genetic algorithm. NeuroImage. 2010;49(3):2433–2443. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supp info1
Supp info2

RESOURCES