Abstract
Cluster randomized trials (CRTs) were originally proposed for use when randomization at the subject level is practically infeasible or may lead to a severe estimation bias of the treatment effect. However, recruiting an additional cluster costs more than enrolling an additional subject in an individually randomized trial. Under budget constraints, researchers have proposed the optimal sample sizes in two-level CRTs. CRTs may have a three-level structure, in which two levels of clustering should be considered. In this paper, we propose optimal designs in three-level CRTs with a binary outcome, assuming nested exchangeable correlation structure in generalized estimating equation models. We provide the variance of estimators of three commonly used measures: risk difference, risk ratio, and odds ratio. For a given sampling budget, we discuss how many clusters and how many subjects per cluster are necessary to minimize the variance of each measure estimator. For known association parameters, the locally optimal design (LOD) is proposed. When association parameters are unknown but within pre-determined ranges, the MaxiMin design (MMD) is proposed to maximize the minimum of relative efficiency over the possible ranges, that is, to minimize the risk of the worst scenario.
Keywords: Cluster randomized trial (CRT), dissemination and implementation (D&I), generalized estimating equation (GEE), intracluster correlation coefficient (ICC), nested correlation structure
1. Introduction
Cluster randomized trials (CRTs) were originally proposed for use when randomization at the subject level is practically infeasible or may possibly lead to a severe estimation bias of the treatment effect. In practice, implementation strategies play important roles in dissemination and implementation (D&I) research. Compared to subject-level randomized trials, CRT designs have appealing features for implementation science in public health and clinical medicine. Further, CRTs are greatly needed for effectiveness research from science to practice.1,2 Therefore, there has been growing interest in the design of CRTs.3–9 The unit of randomization might be hospitals, clinics, classrooms, etc. Subjects within a cluster are exposed to common factors and tend to share similar characteristics. The degree of such similarity is commonly quantified by the intracluster correlation coefficient (ICC). Recruiting an additional cluster costs more than enrolling an additional subject in an individually randomized trial; thus, researchers have proposed the optimal sample size as a function of sampling costs and the ICC in CRTs.10–16 “Optimal” means the maximum power and precision for a given sampling budget, or the minimum sampling cost for a given power and precision. These approaches show that the optimal sample size depends strongly on the ICC. However, the ICC is usually unknown in CRTs. To overcome this shortcoming, Van Breukelen et al. considered a range of possible ICC values and presented MaxiMin designs (MMDs) based on relative efficiency (RE) under budget constraints.17 Wu et al. proposed the optimal group allocations for three measures (RD, RR, OR) in two-level CRTs with binary outcomes through the variances of the maximum likelihood estimators. 18
CRTs may have three-level structures. For example, subjects in a two-level CRT are measured at different time points. Measurements across the different time points are correlated within a subject, while subjects are correlated within a cluster. Another example is that interventions are randomly assigned to medical centers (“practices”), and health care professionals (“providers”) within the same practice are trained with the assigned intervention to provide care to participants. Participants could be correlated within a provider, while providers could be correlated within a practice. Hereafter we use a CRT with practice, provider, and participant levels as the three-level example. For simplicity, we consider the same provider size (number of participants from each provider) and equal practice sizes (number of providers per practice).
Generalized estimating equations (GEEs) proposed by Liang and Zeger19 have been commonly applied to analyze the correlated data in CRTs.20–24 Liang and Zeger showed that the GEE approach still gives consistent estimates of the regression coefficients- provided that the marginal model is correctly specified- even if the working correlation matrix is incorrectly assumed.19 In this paper, we aim to propose an optimal design (OD) in three-level CRTs, in which “optimal” refers to the minimization of the variance of each measure estimator for a given sampling budget. We assume the nested exchangeable correlation structure25 throughout and utilize the GEE models in a three-level CRT with a binary outcome. The correlation structure includes correlation among participants within the same provider in the same practice, r, and correlation among participants with different providers in the same practice, ρ. Both r and ρ are assumed to be constant across all practices. Three different link functions in GEE models are considered, e.g. identity, log, and logit, where the corresponding regression coefficients are related to risk difference (RD), risk ratio (RR), and odds ratio (OR), respectively.
For known association parameters (r,ρ), we discuss how many practices m need to be enrolled and how many providers per practice n are sufficient to minimize the variance of each measure estimator under the budget constraints when the provider size K is a pre-determined value and K is not a fixed value but within a range (Kmin,Kmax), respectively. This is a locally optimal design (LOD) with corresponding numbers nLOD and mLOD. When the association parameters (r,ρ) are unknown, but we assume that ranges for r and ρ can be obtained from other literature and similar studies, we propose MMDs in the framework of relative efficiency (RE) to minimize the risk of the worst scenario. RE is defined as the ratio of the variance of each measure estimator for practice size nLOD to n,17 which is a function of n,r, and ρ. Our goal is to maximize the minimum of RE over the possible ranges of r and ρ.
The organization of this article is as follows. In Section 2, we briefly summarize the GEE method developed by Liang and Zeger in three-level CRTs,19 introduce the “nested exchangeable” correlation structure,25 and derive the variance of the estimator of the treatment for a binary outcome in a two-group comparison. Section 3 presents the LOD for known parameter values under the assumption of a “nested exchangeable” correlation structure. In Section 4, we define the RE and propose MMDs for unknown parameter values of r and ρ. We provide guidance on applying the methods and illustrate using a real CRT, followed by a discussion about the limitations of the proposed approach and directions for future research.
2. Statistical GEE models in three-level CRTs
Let Yijk be a response from participant k = 1,⋯,K, for provider j = 1,⋯,ni in practice i = 1,⋯,m. Let Xijk = (Xijk1,⋯,Xijkp)′ be a covariate vector and μijk = E(Yijk|Xijk) be a marginal mean response given Xijk. The marginal model is
Let Yij = (Yij1,⋯,YijK), μij = (μij1, ⋯, μijK), and Xij = (Xij1,⋯,XijK) be the 1 × K response vector, 1 × K marginal mean response vector, p × K covariate matrix of provider j in practice i, respectively. Let and be the matrices of responses, marginal mean responses and covariate of the providers in practice i, respectively. The mean of Yi is denoted by μi = E(Yi) and the variance of Yi is where and a Kni × Kni correlation matrix Ri0(ω0) describes the correlation of measures within the ith practice with a vector of association parameters denoted by ω0. Both γ and θ are dependent on the distribution of responses. If Yijk is binary, γ(μijk) = μijk(1 − μijk) and θ = 1. Liang and Zeger19 showed that is asymptotically multivariate normal with a covariance matrix where , Di = ∂μi/∂β′, and Vi is a working covariance matrix of Yi. Let Riw(ω) be a Kni × Kni working correlation matrix with a vector of association parameters ω. The working covariance matrix is expressed as and is unequal to var (Yi|Xi) unless Riw(ω) = Ri0(ω0).
For three-level data, Teerenstra et al. proposed a “nested exchangeable” correlation structure 25:
Correlation among participants within the same provider in the same practice, is constant, for k1 ≠ k2;
Correlation among participants with different providers in the same practice, is constant, for j1 ≠ j2, and any k1, k2;
This three-level exchangeable working correlation structure was defined as
where 1i×i is a i × i matrix of 1’s, Bdiagi(A) is a block diagonal matrix with matrix element A replicated i times, and Ii×i is the i × i identity matrix. Here, Riw(r,ρ) must be positive definite (PD). Given a value of K and ni, PD can be determined if the constraints holds, min(λ1,λ2,λ3i) > 0, where λ1 = 1 − r, λ2 = 1 + (K − 1)r − Kρ, λ3i = 1 + (K − 1)r + K(ni − 1)ρ are the distinct eigenvalues of Riw(r,ρ). The proof was provided by Web Appendix A of Li et al.26 Here, the constraints are equivalent to
| (1) |
We assume this “nested exchangeable” correlation structure in the following sections.
Suppose we are interested in testing the treatment effect for a two-group comparison: the treated vs. control. The treatment assignment is coded in the last column of the practice covariate matrices and the corresponding last parameter of β is βp. Let Vβ denote the (p, p)th element of VR. Thus, has an asymptotically normal distribution N(0,Vβ), or equivalently, For simplicity, we take p = 2, i.e. coefficients β1 is the intercept and β2 is the treatment effect. The practice allocations of the treatment and control groups are, mtrt = mπ and mcont = m(1 − π), respectively, where π is a pre-determined value, e.g, 50%. The hypotheses of interest are H0: β2 = 0 versus H1: β2 = β. For a binary outcome, let p0 and p1 be the success rates in the control and treated group. When the identity link function, g(μijk) = μijk, is specified, β2 = p1 − p0 is the risk difference (RD) between two groups; when the log link function, g(μijk) = ln(μijk), is specified, β2 = ln(p1/p0) is the difference between the natural logarithms of the proportions; and when the logit link function, is specified, is the difference between the natural logarithms of the odds. When the log and logit link functions are used, taking the exponential of β2 refers to the risk ratio (RR) and the odds ratio (OR), respectively.
Given the “nested exchangeable” correlation structure, we use identity link function and have
| (2) |
where ni ≡ n and the eigenvalue λ3 = 1 + (K − 1)r + K(n − 1)ρ. If we consider the log link function, then
| (3) |
Using logit link function in the GEE model for a binary outcome, we have
| (4) |
Please note that Equation (4) is the same as the formula in section 4.425 and reduces to Equation (8)27 when K = 1.
From the relationship between β2 and RD, the asymptotic variance of is
| (5) |
Applying the delta method, we obtain the asymptotic variances of and as
| (6) |
and
| (7) |
3. Local optimal design
Assume the study cost per practice is c currency units (e.g. $US), and each provider costs s currency units, e denotes each participant’s cost. The total budget B in a three-level trial is defined as
| (8) |
We aim to find the optimal design (OD) given the constraint in Equation (8). The term “optimal” refers to the variance of each measure estimator being minimized for a given sampling budget. 10,17,28,29
First, we assume that provider size K is a pre-determined value, same as B, c, s and e, for simplicity. The goal is to find the pair of m and n that minimizes the variance of each measure, which is equivalent to maximizing
| (9) |
for all three measures (RD, RR, OR). Substituting gives
where b = s + eK. Taking the partial derivatives with respect to n gives
Since Riw(r,ρ) is PD, λ2 is positive. It can be shown that when
| (10) |
where ρ should be positive, the derivatives equals 0 and L is maximized. The local optimal design (LOD) is reached for a known pair value (r,ρ) and n in equation (9) is denoted by nLOD. Let , the parameters in LOD are given by
| (11) |
Please note in order to be nLOD > 1. Thus, since Equation (1) also holds. For any measures (RD, RR, OR), the local optimal design is the same even if the variance of measure estimator is different. Obviously nLOD and mLOD may be non-integer. In reality we need to choose an integer value for practice size with either nup = int (nLOD) + 1 or ndown = int (nLOD), where “int” refers to an integer part of a number. We then calculate mup and mdown from Similarly, mup and mdown are most likely non-integers. In order to meet the limit of budget, the integer parts for mup and mdown are taken as the values of corresponding number of practices. Then we can calculate the corresponding L using Equation (9) and the proposed optimal practice size and number of practices is the one with the larger L.
Second, when the provider size K is not a fixed value but within a range (Kmin,Kmax) and Kmin ≥ 2, we find nLOD and mLOD for each value of K within this range and calculate the corresponding L in Equation (9). The design with the maximum of L within a range (Kmin,Kmax) is defined as LOD. Given
it is easy to show that both KnLODmLOD and λ3 are increasing functions of K but and λ3 ∝ K when K ≥ 3. That is, L decreases when K increases for K ≥ 3. Therefore, the LOD is reached at K = Kmin if Kmin ≥ 3 and K = 3 if Kmin = 2 for a known pair value (r,ρ).
Table 1 shows an example to determine LOD for r = 0.6 and ρ = 0.03, where 3 ≤ K ≤ 10, B = 300,000, c = 10,000, s = 100 and e = 10 are assumed. For each K, n and m are calculated from Equation (11). Both integers are chosen as discussed previously and the corresponding L is calculated from Equation (9). The design with K = 3, n = 43, and m = 18 is LOD since L is maximized at K = 3. Please note L is not monotone decreasing in Table 1 since the calculations are provided for (n,m) as integers only. The power estimates for RD, RR and OR are provided for p0 = 0.3 and p1 = 0.45. It definitely demonstrates that the power is maximized, equivalently the variance is minimized, when LOD is reached.
Table 1.
Local optimal design for B = 300000, c = 10000, s = 100 and e = 10 with known correlations r = 0.6 and ρ = 0.03
| K | Practice size n | Number of practices m | L | Flag | Power [1] | Power [2] | Power [3] |
|---|---|---|---|---|---|---|---|
| 3 | 43 | 18 | 388.3 | 1 | 0.871 | 0.850 | 0.859 |
| 4 | 40 | 18 | 385.0 | 0.868 | 0.847 | 0.856 | |
| 5 | 39 | 18 | 385.7 | 0.869 | 0.848 | 0.857 | |
| 6 | 37 | 18 | 381.3 | 0.865 | 0.844 | 0.853 | |
| 7 | 36 | 18 | 379.6 | 0.863 | 0.842 | 0.851 | |
| 8 | 34 | 18 | 373.2 | 0.858 | 0.836 | 0.845 | |
| 9 | 33 | 18 | 370.2 | 0.855 | 0.833 | 0.843 | |
| 10 | 32 | 18 | 366.9 | 0.852 | 0.830 | 0.839 |
n and m are calculated from Equation (11).
L is calculated from Equation (9).
Flag=1 refers to LOD.
RD for p0 = 0.3 and p1 = 0.45.
RR for p0 = 0.3 and p1 = 0.45.
OR for p0 = 0.3 and p1 = 0.45.
4. MaxiMin optimal design
First, we still assume that the provider size K is a pre-determined value. Obviously nLOD in the Equation (11) depends on (r,ρ). In practice, the pair value of (r,ρ) could be unknown before a study starts. If the ranges, (rmin, rmax) and (ρmin, ρmax), can be obtained from previous studies or other literature, then we define them as the parameter space.30,31 The range of practice size based on the practical feasibility, (nmin, nmax), is defined as the design space.17,32,33 The objective is to identify OD within the parameter and design spaces.
Inserting (11) in (5)–(7) gives the variance of each measure estimator for the optimal design. For example,
| (12) |
where
Following the same definition of RE,17 the ratio of the variance of each measure estimator for practice size nLOD to n, we use Equations (5), (8) and (12) and then define RE for measure RD as a function of n, r, and ρ,
| (13) |
It is easy to show that REs for both RR and OR measures are the same as Equation (13). Further, the maximal value of RE(n,r,ρ) is 1 and reached when n is nLOD. Figure 1 shows how RE changes across practice size n for a fixed r, and Figure 2 shows the trend of RE over practice size n for a fixed ρ, where K = 3, B = 300,000, c = 10,000, s = 100 and e = 10. Both figures demonstrate that RE increases until it reaches 1 and then decreases as practice size n increases. Among four REs with the different values of ρ in Figure 1, we observe that the practice size n at which RE equals 1 is the smallest when ρ = 0.7 and is the largest when ρ = 0.1. Similarly, we notice that the practice size n at which RE equals 1 is the smallest when r = 0.1 and is the largest when r = 0.7 in Figure 2.
Figure 1.
Relative efficiencies RE(n,r,ρ) as a function of n for K = 3, B = 300000, c = 10000, s = 100 and e = 10 with r = 0.8
Figure 2.
Relative efficiencies RE(n,r,ρ) as a function of n for K = 3, B = 300000, c = 10000, s = 100 and e = 10 with ρ = 0.05
MaxiMin design (MMD) is a design that maximizes some measure of performance (or minimize the risk) in the worst case scenario. 31–34 Here, we use RE, quantified as Equation (13), as the measure of performance. Specifically, the MMD includes three steps. Step 1 defines the parameter and design spaces; Step 2 computes LOD for each pair value of (r,ρ) in the parameter space, and then computes the RE of each design in the design space; Step 3 finds its smallest RE value within the parameter space for each design in the design space and selects the design which maximizes the minimum RE among all designs in the design space. This MMD considers the worst case scenario and thus is robust against misspecification of the values of (r,ρ).
RE of any of the three measures, shown in Equation (13), is a function of n, r, and ρ given the costs c per practice, s per provider, e per participant, and the provider size K. First, Appendix 1 proves that RE(n, r, ρ) is minimized at one of the four points: (rmin, ρmin), (rmin, ρmax), (rmax, ρmin), (rmax, ρmax), i.e., the boundary of the parameter space (rmin, rmax) and (ρmin, ρmax). Figure 3 presents RE(n, rmin, ρmin), RE(n, rmin, ρmax), RE(n, rmax, ρmin) and RE(n, rmax, ρmax) as functions of n for K = 3, B = 300,000, c = 10,000, s = 100 and e = 10 with parameter space (rmin = 0.1, rmax = 0.9) and (ρmin = 0.01, ρmax = 0.05). Next, Appendix 2 shows that the minimum of RE(n,r,ρ) is maximized by the design satisfying RE(n, rmin, ρmax) = RE(n, rmax, ρmin). Let be a solution of RE(n, rmin, ρmax) = RE(n, rmax, ρmin), and expressed as
| (14) |
As shown in Figure 3, the black vertical straight line indicates and locally optimal designs for (rmin, ρmax) and (rmax, ρmin) are added as references. By dividing g(rmin, ρmax) and g(rmax, ρmin) by the study cost per practice c, we notice that MMD of practice sizes depends on (rmin, ρmax), (rmax, ρmin), and ratio (s + eK)/c. That is, the total budget B determines the number of practices m but not practice size n.
Figure 3.
Relative efficiencies RE(n, rmin, ρmin), RE(n, rmin, ρmax), RE(n, rmax, ρmin), and RE(n, rmax, ρmax) as a function of n and locally optimal designs LOD (rmin, ρmax) and LOD (rmax, ρmin) for K = 3, B = 300000, c = 10000, s = 100 and e = 10 with parameter space (rmin = 0.1, rmax = 0.9) and (ρmin = 0.01, ρmax = 0.05)
Now we provide a step by step approach to find an MMD for a two-arm three-level CRT with a binary outcome when the provider size K is a pre-determined value.
Step 1: Define the parameter space (rmin, rmax), (ρmin, ρmax) and design space (nmin, nmax), respectively.
Step 2: Calculate using Equation (14).
If it is within the range (nmin,nmax), then set and the corresponding
If it is outside of (nmin,nmax), calculate RE(n, rmin, ρmin), RE(n, rmin, ρmax), RE(n, rmax, ρmin), and RE(n, rmax, ρmax) for each practice size n ∈ (nmin, nmax) and take their minimum. Choose the design of (n,m) that has the maximum of minimum RE within design space, where
Again, may be non-integer. We use the same method in Section 3 to get the integer practice size and number of practices. Please note that Equation (13) is derived using Equation (8) as well. If the calculated mMMD from the above approach is infeasible, then the range (nmin,nmax) needs to be revised appropriately.
Table 2 shows an example to determine MMD, where the same setting as Figure 3 is assumed. We obtain using Equation (13). If the design space is (11, 20), then the design of (n = 20, m = 23) is MMD under the budget constraints; on the other hand, if the design space is (41, 50), then the design of (n = 47, m = 18) is MMD under the budget constraints.
Table 2.
Maximin design for K = 3, B = 300000, c = 10000, s = 100 and e = 10 with parameter space (rmin = 0.1, rmax = 0.9) and (ρmin = 0.01, ρmax = 0.05)
| Design space (nmin, nmax) | Practice size n | RE(n, rmin, ρmin) | RE(n, rmin, ρmax) | RE(n, rmax, ρmin) | RE(n, rmax, ρmax) | Min RE | Number of practices m | Flag |
|---|---|---|---|---|---|---|---|---|
| (11, 20) | 11 | 0.5642 | 0.9059 | 0.4090 | 0.7346 | 0.4090 | 26 | |
| 12 | 0.5966 | 0.9257 | 0.4369 | 0.8656 | 0.4369 | 25 | ||
| 13 | 0.6288 | 0.9421 | 0.4636 | 0.7935 | 0.4636 | 25 | ||
| 14 | 0.6550 | 0.9556 | 0.4892 | 0.8184 | 0.4892 | 25 | ||
| 15 | 0.6813 | 0.9667 | 0.5136 | 0.8408 | 0.5136 | 25 | ||
| 16 | 0.7059 | 0.9757 | 0.5369 | 0.8609 | 0.5369 | 24 | ||
| 17 | 0.7287 | 0.9829 | 0.5592 | 0.8788 | 0.5592 | 24 | ||
| 18 | 0.7501 | 0.9886 | 0.5806 | 0.8949 | 0.5806 | 24 | ||
| 19 | 0.7700 | 0.9929 | 0.6010 | 0.9093 | 0.6010 | 24 | ||
| 20 | 0.7886 | 0.9961 | 0.6205 | 0.9221 | 0.6205 | 23 | 1 | |
| (41, 50) | 41 | 0.9799 | 0.9441 | 0.8809 | 0.9975 | 0.8809 | 19 | |
| 42 | 0.9831 | 0.9394 | 0.8881 | 0.9963 | 0.8881 | 19 | ||
| 43 | 0.9859 | 0.9347 | 0.8950 | 0.9949 | 0.8950 | 19 | ||
| 44 | 0.9884 | 0.9299 | 0.9016 | 0.9932 | 0.9016 | 19 | ||
| 45 | 0.9907 | 0.9251 | 0.9079 | 0.9913 | 0.9079 | 18 | ||
| 46 | 0.9926 | 0.9202 | 0.9138 | 0.9893 | 0.9138 | 18 | ||
| 47 | 0.9943 | 0.9154 | 0.9195 | 0.9872 | 0.9154 | 18 | 1 | |
| 48 | 0.9958 | 0.9105 | 0.9249 | 0.9849 | 0.9105 | 18 | ||
| 49 | 0.9970 | 0.9056 | 0.9301 | 0.9825 | 0.9056 | 18 | ||
| 50 | 0.9980 | 0.9008 | 0.9350 | 0.9799 | 0.9008 | 18 |
RE is calculated from Equation (13).
Flag=1 refers to MMD.
Second, when the provider size K is not a fixed value but within a range (Kmin,Kmax) and Kmin ≥ 2, we find nMMD and mMMD for each value of K within this range and calculate the corresponding RE in Equation (13). The design with the maximum of RE is defined as MMD.
Table 3 demonstrates how to find MMD with parameter space (rmin = 0.1, rmax = 0.9) and (ρmin = 0.01, ρmax = 0.05) where 3 ≤ K ≤ 10, B = 300,000, c = 10,000, s = 100 and e = 10 are assumed. For each K, nMMD and mMMD are calculated from the step by step approach and the corresponding RE is provided. If the design space is (11, 20), then the design of (K = 10, n = 20, m = 21) is MMD under the budget constraints; on the other hand, if the design space is (41, 50), then the design of (K = 3, n = 47, m = 18) is MMD under the budget constraints. SAS macros %OD_3Level_FixedK and %OD_3Level_RangeK are developed to find LOD and MMD when the corresponding parameters are provided.
Table 3.
MaxiMin design for B = 300000, c = 10000, s = 100 and e = 10 with parameter space (rmin = 0.1, rmax = 0.9) and (ρmin = 0.01, ρmax = 0.05)
| Design space (nmin, nmax) | K | Practice size n | Number of practices m | RE | Flag |
|---|---|---|---|---|---|
| (11, 20) | 3 | 20 | 23 | 0.6205 | |
| 4 | 20 | 23 | 0.6369 | ||
| 5 | 20 | 23 | 0.6517 | ||
| 6 | 20 | 22 | 0.6653 | ||
| 7 | 20 | 22 | 0.6781 | ||
| 8 | 20 | 22 | 0.6901 | ||
| 9 | 20 | 21 | 0.7014 | ||
| 10 | 20 | 21 | 0.7121 | 1 | |
| (41, 50) | 3 | 47 | 18 | 0.9154 | 1 |
| 4 | 43 | 18 | 0.9032 | ||
| 5 | 41 | 18 | 0.8876 | ||
| 6 | 41 | 18 | 0.8638 | ||
| 7 | 41 | 17 | 0.8421 | ||
| 8 | 41 | 17 | 0.8222 | ||
| 9 | 41 | 16 | 0.8037 | ||
| 10 | 41 | 16 | 0.7866 |
n and m are calculated from step by step approach.
RE is calculated from Equation (13).
Flag=1 refers to MMD.
Last, we conduct a sensitivity analysis about the parameter space. The following eight different parameter spaces are considered: (rmin = 0.1, rmax = 0.9) and (ρmin = 0.01, ρmax = 0.05), (rmin = 0.1, rmax = 0.3) and (ρmin = 0.01, ρmax = 0.05), (rmin = 0.3, rmax = 0.6) and (ρmin = 0.01, ρmax = 0.05), (rmin = 0.6, rmax = 0.9) and (ρmin = 0.01, ρmax = 0.05), (rmin = 0.1, rmax = 0.9) and (ρmin = 0.01, ρmax = 0.02), (rmin = 0.1, rmax = 0.9) and (ρmin = 0.02, ρmax = 0.03), (rmin = 0.1, rmax = 0.9) and (ρmin = 0.02, ρmax = 0.05), (rmin = 0.1, rmax = 0.9) and (ρmin = 0.03, ρmax = 0.05). We still assume 3 ≤ K ≤ 10, B = 300,000, c = 10,000, s = 100 and e = 10. Table 4 shows the MMDs for two different design space (2, 20) and (2, 50). If nmax is relatively small, e.g. < MMDs are the same (K = 10, n = 20, m = 21). They might be different otherwise. That is, MMDs are insensitive to the parameter space when the maximum of practice size is relatively small.
Table 4.
MaxiMin design for B = 300000, c = 10000, s = 100 and e = 10 with 3 ≤ K ≤ 10
| Design space (nmin, nmax) | rmin | rmax | ρmin | ρmax | K | Practice size n | Number of practices m | RE |
|---|---|---|---|---|---|---|---|---|
| (2, 20) | 0.1 | 0.9 | 0.01 | 0.05 | 10 | 20 | 21 | 0.7121 |
| 0.1 | 0.3 | 0.01 | 0.05 | 10 | 20 | 21 | 0.8717 | |
| 0.3 | 0.6 | 0.01 | 0.05 | 10 | 20 | 21 | 0.7754 | |
| 0.6 | 0.9 | 0.01 | 0.05 | 10 | 20 | 21 | 0.7121 | |
| 0.1 | 0.9 | 0.01 | 0.02 | 10 | 20 | 21 | 0.7121 | |
| 0.1 | 0.9 | 0.02 | 0.03 | 10 | 20 | 21 | 0.8365 | |
| 0.1 | 0.9 | 0.02 | 0.05 | 10 | 20 | 21 | 0.8365 | |
| 0.1 | 0.9 | 0.03 | 0.05 | 10 | 20 | 21 | 0.9031 | |
| (2, 50) | 0.1 | 0.9 | 0.01 | 0.05 | 3 | 47 | 18 | 0.9154 |
| 0.1 | 0.3 | 0.01 | 0.05 | 3 | 41 | 19 | 0.9441 | |
| 0.3 | 0.6 | 0.01 | 0.05 | 3 | 47 | 18 | 0.9446 | |
| 0.6 | 0.9 | 0.01 | 0.05 | 4 | 50 | 17 | 0.9446 | |
| 0.1 | 0.9 | 0.01 | 0.02 | 5 | 49 | 17 | 0.9466 | |
| 0.1 | 0.9 | 0.02 | 0.03 | 5 | 43 | 19 | 0.9751 | |
| 0.1 | 0.9 | 0.02 | 0.05 | 3 | 41 | 19 | 0.9441 | |
| 0.1 | 0.9 | 0.03 | 0.05 | 3 | 41 | 19 | 0.9441 |
5. Example
Teerenstra et al. discussed the Helping Hands trial (Netherlands Organization for Health Research and Development ZonMw, grant number 80–007028-98–07101).25 This study aimed to change nurse behavior through two strategies and randomized the wards to either strategy. The two strategies included the state-of-the-art strategy, which is derived from literature regarding education, reminders, feedback, and targeting adequate products and facilities; and the extended strategy, which contains all elements of the state-of-the-art strategy plus activities aimed at influencing social influence in groups and enhancing leadership. The primary endpoint was adherence to hygiene guidelines (Yes vs. No) and multiple evaluations of nurses’ guideline adherence were observed. The researchers expected to improve the adherence from 60% in the state-of-the-art strategy to 70% in the extended strategy. Teerenstra et al. considered the constant behavior of nurse r = 0.6 and intra-ward coefficient correlation ρ = 0.03.25 We calculated the total number of wards m = 58 to obtain 80% power using the number of nurses per ward n = 15 and number of evaluations K = 3 under the same assumptions of (r, ρ)) using Equation (4). We assume c = 2,000, s = 50 and e = 10 in this study, then the total cost 58*(2000 + 50 * 15 + 10 * 3 * 15) = 185,600 will be needed.
We now apply LOD and MMD approaches to redesign this study with the same budget B = 185,600. We consider 3 ≤ K ≤ 6 and find that LOD is K = 3, n = 25 and m = 46. The power is 83.7% under this scenario. It is worth mentioning that our proposed method does not guarantee obtaining the desired power, e.g., 80%, but to have the highest power under the budget constraints. Researchers should increase the budget if our proposed method does not reach the desired power.
On the other hand, if the researchers have no clear pictures of these two associations, then the parameter space need to be specified. Campbell et al. showed the ICC interquartile range of implantation studies in secondary care from 0.017 to 0.221.3 As Teerenstra et al. mentioned, the behavior of an individual nurse with respect to hand hygiene is constant, the parameter space 0.5 ≤ r ≤ 0.9 is reasonably assumed. Now the parameter space lies within (rmin = 0.5, rmax = 0.9) and (ρmin = 0.017, ρmax = 0.221) and the design space is set as (3, 50), then number of evaluations K = 3, the number of nurses per ward n = 17 and the total number of wards m = 55 is our proposed MMD given the budget B = 185,600.
6. Discussion
In this paper, we presented optimal designs based on GEE models in three-level CRTs and proposed both LODs and MMDs under budget constraints. We employed a nested exchangeable correlation structure25 and derived the variance of the treatment effect under the assumption of an equal practice size and the same provider size. We derived the locally optimal design when the correlation among participants within the same provider in the same practice, r, and correlation among participants with different providers in the same practice, ρ, are known; the optimal design aims to minimize the variance of each measure estimator for a given sampling budget. If the correlation pair (r,ρ) is unknown bulongitudinal data setting with AR(1)t lies in a known range, we proposed MMDs for three-level CRTs for a range of r and ρ. We also developed SAS macros to find the LOD and MMD for practical use.
Our method can be extended in several directions. First, our proposed approach is based on the nested exchangeable correlation structure only. It is suitable when the lowest level units are exchangeable within the middle level units (‘providers’) and the middle level units are exchangeable within the highest level units (‘practices’).25 We will consider more sophisticated settings, e.g., a longitudinal data setting with AR(1) correlation among repeated measures over time in our future work. Second, we assume the same practice size and same provider size. If the practice size is different across the providers, the variances of estimator of treatment effects are more complicated than Equations (2)–(4). The derivation of these formulae warrants further research. Third, when GEE models with identity or log link are used to analyze correlated binary data, the convergence issues may occur since the predicted probability is unconstrained. Fourth, the empirical sandwich estimator of the covariance matrix obtained from GEE is biased for a small number of clusters and thus can inflate type I error rates. The proposed LOD and MMD based on the asymptotic variance might be worthy of further investigation. Finally, it merits further consideration to extend treatment groups to more than two.
Supplementary Material
Acknowledgements
We thank the Alvin J. Siteman Cancer Center at Washington University School of Medicine and Barnes-Jewish Hospital in St. Louis, MO., for supporting this research (P30 CA91842). Lei Liu’s work was supported by the Washington University Institute of Clinical and Translational Sciences grant UL1TR000448 from the National Center for Advancing Translational Sciences (NCATS) of the National Institutes of Health (NIH). The content is solely the responsibility of the authors and does not necessarily represent the official view of the NIH.
Appendix
The proof consists of two steps. Appendix 1 shows that RE(n,r,ρ) is minimized at one of the four points: (rmin, ρmin), (rmin, ρmax), (rmax, ρmin), (rmax, ρmax), i.e., the boundary of the parameter space (rmin, rmax) and (ρmin, ρmax). Next, Appendix 2 shows that the minimum of RE(n,r,ρ) is maximized by the design satisfying RE(n, rmin, ρmax) = RE(n, rmax, ρmin).
Appendix 1
Proof that RE(n,r,ρ) is minimized at one of the four points: (rmin,ρmin), (rmin, ρmax), (rmax,ρmin), (rmax, ρmax) within the parameter space (rmin, rmax) and (ρmin, ρmax)
From Equations (5), (8) and (12), it follows that the RE for measure RD as a function of n, r, and ρ for any of the three measures.
Take the partial derivative with respect to ρ gives
Setting the right hand to zero we obtain
Then take the partial derivative with respect to r gives
Similarly, we set the right hand to zero and have
Both are actually same as nLOD in Equation (10). We can show that if ρmin ≤ ρ < ρ*, and if ρ* < ρ ≤ ρmax, so RE(n,r,ρ) is minimized at either ρ = ρmin or ρ = ρmax for a fixed r. If we assume the possible range for ρ is (0.1, 0.7) and r = 0.8, Appendix Figure 1 demonstrates 3-D RE plot as a function of n and ρ while Figure 1 shows RE plots for 4 paired values (r,ρ). As seen in Figure 1, RE(n,r,ρ) is minimized at ρ = 0.1 when n < 13 and at ρ = 0.7 when n > 13. Similarly, if rmin ≤ r < r*, and if r* < r ≤ rmax, so RE(n,r,ρ) is minimized at either r = rmin or r = rmax for a fixed ρ. Appendix Figure 2 demonstrates 3-D RE plot as a function of n and r, r ∈ (0.1, 0.7) for a fixed ρ = 0.05. Shown in Figure 2 with ρ = 0.05, RE(n,r,ρ) is minimized at r = 0.7 when n < 28 and r = 0.1 when n < 28. When combining these characteristics, we conclude that RE(n,r,ρ) is minimized at (rmin,ρmin), or (rmin,ρmax), or (rmax,ρmin), or (rmax,ρmax) within the parameters space (rmin,rmax) and (ρmin, ρmax).
Appendix 2
Proof that the minimum of RE(n,r,ρ) is maximized by the design satisfying RE(n, rmin, ρmax) = RE(n, rmax, ρmin)
Inserting (rmin,ρmin) in Equation (12) and taking the partial derivative with respect to n gives
Following the proof in Section 3, RE(n, rmin, ρmin) is a single-peaked function and maximized at
with maximum of 1. Similarly, RE(n, rmin, ρmax) is maximized at
RE(n, rmax, ρmin) is maximized at
and RE(n, rmax, ρmax) is maximized at
Since ρmin < ρmax, it gives and Further, rmin < rmax is followed by and Thus, is the smallest and is the largest. All four REs have a maximum of 1 and the maximums are reached at , and , respectively.
Following the proof of Appendix in Breukelen et al,20, min RE between any two of RE(n, rmin, ρmin), RE(n, rmin, ρmax), RE(n, rmax, ρmin) and RE(n, rmax, ρmax) is maximized by the design satisfying these two REs are equal. For example, min RE for RE(n, rmin, ρmin), and RE(n, rmin, ρmax) is maximized by the design satisfying RE(n, rmin, ρmin) = (n, rmin, ρmax). For any two pair values, (r0,ρ0) and (r1,ρ1), the intersection means
Its only solution is
That is, there is only one intersection between any two RE(n,r,ρ)s in the function of n. Therefore, there are total six intersections across these four REs. For example, Figure 3 demonstrates REs at the four points and all six intersections. Given all facts that these four are single-peaked functions, is the smallest and is the largest, and the only one intersection between any two REs, it is obvious that the minimum of RE(n,r,ρ) for these six intersections is reached at the intersection of RE(n, rmin, ρmax) = RE(n, rmax, ρmin). In other words, the minimum of RE(n,r,ρ) is maximized by the design satisfying RE(n, rmin, ρmax) = RE(n, rmax, ρmin).
Figure 1.
3-d relative efficiencies RE(n,r,ρ) as a function of n and ρ for K = 3, B = 300000, c = 10000, s = 100 and e = 10 with r = 0.8
Figure 2.
3-d relative efficiencies RE(n,r,ρ) as a function of n and r for K = 3, B = 300000, c = 10000, s = 100 and e = 10 with ρ = 0.05
Footnotes
Conflict of Interest
The authors have declared no conflict of interest.
References
- 1.Brownson RC, Colditz GA, Proctor EK. Dissemination and Implementation Research in Health: Translating Science to Practice. Oxford University Press; 2018. [Google Scholar]
- 2.James AS, Richardson V, Wang JS, Proctor EK, Colditz GA. Systems intervention to promote colon cancer screening in safety net settings: protocol for a community-based participatory randomized controlled trial. Implementation Science : IS. 2013;8:58–58. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Campbell MK, Mollison J, Steen N, Grimshaw JM, Eccles M. Analysis of cluster randomized trials in primary care: a practical approach. Family Practice. 2000;17(2):192–196. [DOI] [PubMed] [Google Scholar]
- 4.Gulliford MC, van Staa TP, McDermott L, McCann G, Charlton J, Dregan A. Cluster randomized trials utilizing primary care electronic health records: methodological issues in design, conduct, and analysis (eCRT Study). Trials. 2014;15:220. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Gravenstein S, Dahal R, Gozalo PL, et al. A cluster randomized controlled trial comparing relative effectiveness of two licensed influenza vaccines in US nursing homes: Design and rationale. Clinical Trials. 2016. [DOI] [PubMed] [Google Scholar]
- 6.Kalfon P, Mimoz O, Loundou A, et al. Reduction of self-perceived discomforts in critically ill patients in French intensive care units: study protocol for a cluster-randomized controlled trial. Trials. 2016;17(1):87. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Mehring M, Haag M, Linde K, Wagenpfeil S, Schneider A. Effects of a Web-Based Intervention for Stress Reduction in Primary Care: A Cluster Randomized Controlled Trial. Journal of Medical Internet Research. 2016;18(2):e27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Nagayama H, Tomori K, Ohno K, et al. Effectiveness and Cost-Effectiveness of Occupation-Based Occupational Therapy Using the Aid for Decision Making in Occupation Choice (ADOC) for Older Residents: Pilot Cluster Randomized Controlled Trial. PLoS One. 2016;11(3):e0150374. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Yamagata K, Makino H, Iseki K, et al. Effect of Behavior Modification on Outcome in Early- to Moderate-Stage Chronic Kidney Disease: A Cluster-Randomized Trial. PLoS One. 2016;11(3):e0151422. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Raudenbush S Statistical analysis and optimal design for cluster randomized trials. Psychol Methods. 1997;2:173–185. [DOI] [PubMed] [Google Scholar]
- 11.Raudenbush S, Liu X. Statistical power and optimal design for multisite trials. Psychol Methods. 2000;5(2):199–213. [DOI] [PubMed] [Google Scholar]
- 12.Moerbeek M, Van Breukelen G, Berger M. Optimal experimental design for multilevel logistic models. The Statistician. 2001;50(1):17–30. [Google Scholar]
- 13.Moerbeek M, Van Breukelen G, Berger M. Optimal experimental designs for multilevel models with covariates. Commun Stat Theory Methods. 2001;30(12):2683–2697. [Google Scholar]
- 14.Connelly L Balancing the number and size of sites: an economic approach to the optimal design of cluster samples. Control Clin Trials. 2003;24:544–559. [DOI] [PubMed] [Google Scholar]
- 15.Headrick T, Zumbo B. On optimizing multi-level designs: power under budget constraints. Austr N Z J Stat. 2005;47(2):219–229. [Google Scholar]
- 16.Liu X Statistical power and optimum sample allocation ratio for treatment and control having unequal costs per unit of randomization. J Educ Behav Stat. 2003;28(3):231–248. [Google Scholar]
- 17.Van Breukelen G, Candel M. Efficient design of cluster randomized and multicentre trials with unknown intraclass correlation. Stat Methods Med Res. 2015;24(5):540–556. [DOI] [PubMed] [Google Scholar]
- 18.Wu S, Wong WK, Crespi CM. Maximin Optimal Designs for Cluster Randomized Trials. Biometrics. 2017;73(3):916–926. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Liang K-Y, Zeger SL. Longitudinal Data Analysis Using Generalized Linear Models. Biometrika. 1986;73(1):13–22. [Google Scholar]
- 20.Toriola AT, Liu J, Ganz PA, et al. Effect of weight loss on bone health in overweight/obese postmenopausal breast cancer survivors. Breast cancer research and treatment. 2015;152(3):637–643. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Lin CC, Bruinooge SS, Kirkwood MK, et al. Association Between Geographic Access to Cancer Care, Insurance, and Receipt of Chemotherapy: Geographic Distribution of Oncologists and Travel Distance. Journal of Clinical Oncology. 2015;33(28):3177–3185. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Park YH, Jung KH, Im SA, et al. Quality of life (QoL) in metastatic breast cancer patients with maintenance paclitaxel plus gemcitabine (PG) chemotherapy: results from phase III, multicenter, randomized trial of maintenance chemotherapy versus observation (KCSG-BR07–02). Breast cancer research and treatment. 2015;152(1):77–85. [DOI] [PubMed] [Google Scholar]
- 23.Jeffe DB, Perez M, Cole EF, Liu Y, Schootman M. The Effects of Surgery Type and Chemotherapy on Early-Stage Breast Cancer Patients’ Quality of Life Over 2-Year Follow-up. Annals of surgical oncology. 2016;23(3):735–743. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Sanda MG, Dunn RL, Michalski J, et al. Quality of life and satisfaction with outcome among prostate-cancer survivors. New England Journal of Medicine. 2008;358(12):1250–1261. [DOI] [PubMed] [Google Scholar]
- 25.Teerenstra S, Lu B, Preisser JS, van Achterberg T, Borm GF. Sample size considerations for GEE analyses of three-level cluster randomized trials. Biometrics. 2010;66(4):1230–1237. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Li F, Turner EL, Preisser JS. Sample size determination for GEE analyses of stepped wedge cluster randomized trials. Biometrics. 2018;Early View. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Shih WJ. Sample Size and Power Calculations for Periodontal and Other Studies with Clustered Samples Using the Method of Generalized Estimating Equations. Biometrical Journal. 1997;39(8):899–908. [Google Scholar]
- 28.Moerbeek M, Van Breukelen G, Berger M. Design Issues for Experiments in Multilevel Populations. Journal of Educational and Behavioral Statistics. 2000;25(3):271–284. [Google Scholar]
- 29.Liu J, Colditz GA. Optimal design of longitudinal data analysis using generalized estimating equation models. Biometrical journal Biometrische Zeitschrift. 2017;59(2):315–330. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Atkinson A, Donev A, Tobias R. Optimum experimental designs, with SAS. Oxford, UK: Oxford University Press,; 2007. [Google Scholar]
- 31.Berger M, Wong W. An introduction to optimal designs for social and biomedical research. Chichester, UK: Wiley; 2009. [Google Scholar]
- 32.Winkens B, Schouten HJ, van Breukelen GJ, Berger MP. Optimal designs for clinical trials with second-order polynomial treatment effects. Statistical methods in medical research. 2007;16(6):523–537. [DOI] [PubMed] [Google Scholar]
- 33.Mario JNMO, Frans EST, Martijn PFB. Maximin D-Optimal Designs for Longitudinal Mixed Effects Models. Biometrics. 2002;58(4):735–741. [DOI] [PubMed] [Google Scholar]
- 34.Maus B, van Breukelen GJ, Goebel R, Berger MP. Robustness of optimal design of fMRI experiments with application of a genetic algorithm. NeuroImage. 2010;49(3):2433–2443. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.





