Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2016 Jul 27.
Published in final edited form as: Int J Stat Med Res. 2016 Jan 8;5(1):2–7. doi: 10.6000/1929-6029.2016.05.01.1

The Method of Randomization for Cluster-Randomized Trials: Challenges of Including Patients with Multiple Chronic Conditions

Denise Esserman 1, Heather G Allore 1,2, Thomas G Travison 3,4,*
PMCID: PMC4963011  NIHMSID: NIHMS777064  PMID: 27478520

Abstract

Cluster-randomized clinical trials (CRT) are trials in which the unit of randomization is not a participant but a group (e.g. healthcare systems or community centers). They are suitable when the intervention applies naturally to the cluster (e.g. healthcare policy); when lack of independence among participants may occur (e.g. nursing home hygiene); or when it is most ethical to apply an intervention to all within a group (e.g. school-level immunization). Because participants in the same cluster receive the same intervention, CRT may approximate clinical practice, and may produce generalizable findings. However, when not properly designed or interpreted, CRT may induce biased results.

CRT designs have features that add complexity to statistical estimation and inference. Chief among these is the cluster-level correlation in response measurements induced by the randomization. A critical consideration is the experimental unit of inference; often it is desirable to consider intervention effects at the level of the individual rather than the cluster. Finally, given that the number of clusters available may be limited, simple forms of randomization may not achieve balance between intervention and control arms at either the cluster- or participant-level.

In non-clustered clinical trials, balance of key factors may be easier to achieve because the sample can be homogenous by exclusion of participants with multiple chronic conditions (MCC). CRTs, which are often pragmatic, may eschew such restrictions. Failure to account for imbalance may induce bias and reducing validity. This article focuses on the complexities of randomization in the design of CRTs, such as the inclusion of patients with MCC, and imbalances in covariate factors across clusters.

Keywords: Experimental Design, Randomization, Cluster Randomized Trials, Multiple Chronic Conditions

INTRODUCTION

The US Department of Health and Human Services has made addressing clinical trials of people with multiple chronic conditions (MCC) a priority [1]. People with MCC take multiple medications, which are rarely tested together in a randomized clinical trial. In fact, these individuals are often excluded from trials as a matter of course in order to decrease potential sources of variation and bias.

Thus, trials in this area may not be generalizable to the most important clinical populations. This is a major concern; for results to be applicable to clinical practice, it is essential that an intervention is effective in the true target population, not only in idealized samples. Accordingly, recent guidance has emphasized [2] “the FDA’s interest in encouraging a broad population sample in the development of new drugs.”

A key first step in trial design is determining how best to randomize participants. The balance among arms of observed and unobserved factors is a goal of randomization for unbiased estimation of intervention effects. Cluster randomized trials (CRT) – in which the unit of randomization is a group of participants [3,4], or “cluster” – are relevant for interventions applied at the level of the group, and may offer some advantages in trials enrolling participants with MCC. There are substantial difficulties introduced by clustered sampling of participants, stemming largely from the correlation between individuals enrolled within a cluster [5]. Here we provide a brief overview of randomization in CRTs, discuss the pros and cons of these designs for complex patient populations, and propose a direction for future methodological development in this area.

THE ROLE OF RANDOMIZATION

We rely upon randomization to achieve comparable treatment and control arms, balanced on both measured and unmeasured factors, so that the difference between them can be given a causal interpretation [6,7]. While the benefits of simple randomization follow readily when the number of randomized units is large (e.g. the number of participants in a large, non-clustered trial), they may not hold when the number of randomized units is small. In a CRT, the unit of randomization is the cluster, and these may be few in number. In this situation, there is a much greater probability of not achieving balance between trial arms under simple randomization schemes, compromising the validity of the trial results. Therefore, refinements on more simple methods of randomization are often necessary. There has been substantial attention to the problem of obtaining balance on covariates at the time of randomization, but numerous difficulties persist [8].

BALANCING CLUSTERS IN RANDOMIZED DESIGNS

When deciding on the best way to randomize clusters to ensure balance across treatment arms, we must first decide whether the unit of inference will be the cluster or rather the participant [9]. In the former case, simple methods may be used to compare, for instance, the mean rate of change on some outcome in treatment clusters versus control clusters, and for each cluster the outcomes data are reduced to the average rate of change for that cluster. Covariates are likewise applicable to the cluster itself and are taken to apply to it as a whole. Under this paradigm, comparability of treatment and control units randomized is equivalent to comparability of cluster-level factors; these should in principle be balanced by even simple randomization of clusters, though in practice the number of clusters is often too few to insure this.

When the unit of inference is the individual, by contrast, it is necessary that one be able to estimate the difference between comparable individuals assigned to treatment and to control. Comparability at the level of the individual, however, is not directly addressed by simple randomization of clusters to treatment and control. Imbalance among important characteristics may cause confounding of treatment and prognostic factors and adversely affect the interpretation and finding of the trial [10]. This may be further complicated by design elements – for instance, in situations where one is interested in a subset of individuals within each cluster, due to trial exclusions or limits on enrollment. If not all patients from a cluster are enrolled and those patients enrolled do not reflect the aggregate characteristics of the cluster, there could be severe imbalances at the level of the individual. This is especially applicable for trials with patients with MCC, who may often be excluded from participation and/or be unevenly distributed among geographic regions, hence across randomized clusters. Typically, trial designers do not have the necessary composite cluster-level characteristics or individual-level characteristics available for study until after participant enrollment and cluster randomization has occurred. Therefore, trial designers must rely on data from secondary sources. If these data are available, methods may be deployed to assist in balancing on these factors, but rely on the applicability of these data to the current trial population.

SIMULTANEOUS VS. SEQUENTIAL RANDOMIZATION SCHEMES

Randomization methods can broadly be divided into simultaneous and sequential schemes. A summary of some of the most common methods available is presented in Table 1; each has its own advantages and disadvantages. In simultaneous randomization, the full sampling frame is created and all clusters are randomized prior to enrollment of participants. This approach is easier to operationalize as the randomized units can begin preparations and participant enrollments at the same time. They have as a limitation, however, the fact that they cannot be modified after the fact. Several methods of restricted or constrained randomization, simultaneous approaches, have been proposed to achieve overall balance in trials, especially when a small number of units (i.e. participants or clusters) are being randomized. For example, in one trial of ten universities in North Carolina a matching process was used before randomization to achieve balance between the schools in the two arms [11].

Table 1.

Description of Competing Randomization Procedures for Cluster Designs

Randomization Procedure Description Advantages Disadvantages
Simple Randomization Unrestricted technique, based on single sequence random assignment. All allocations of units randomized are possible. Simple and easy to implement. Balances covariates with large sample sizes. Subjects enrolled may not have balance on covariates when the sample size is moderate or small.
Stratified Randomization Restricted technique: Create a stratum for each combination of covariates being considered. Units are then randomly assigned to treatment arms within each stratum. Reduces imbalance between treatment groups on important covariates. Able to control and balance covariates of importance. Limited number of factors can be stratified on, and need to be willing to categorize continuous variables. Number of strata needed increases rapidly as the number of covariates increases.
Matching Restricted technique: Select from a smaller set of all possible allocations, those fulfilling certain restrictions (i.e. meet the matching criteria), and then randomly allocate to the treatment arms within each match. Reduces imbalance between treatment groups on important covariates. Able to control and balance covariates of importance. Need to identify pairs of clusters that are well-matched on all of the risk factors, which is often not feasible, especially when subsets of people are enrolled in each cluster post-randomization. Need to set suitable balance criteria.
Covariate Constrained Randomization Restricted technique: Find the number of allocations meeting a set of balancing criteria for the covariates of interest. Ensure that overly constrained designs do not exist (e.g. same clusters always appearing in same group) – otherwise need to adjust balance criteria. Randomly select one allocation for the study. Can attain balance (or near balance) on covariates related to outcome resulting in a gain in efficiency. Do not need to categorize covariates. Need to set suitable balance criteria. If balance criteria are too restricted, it could result in biased or invalid design. Performed at the start of trial, so infeasible when need to add more clusters.
Minimal Sufficient Balance [29] Restricted technique: Distribution of covariates between treatment arms assessed using imbalance tests, and depending on results units are assigned treatment based on biased coin or simple random assignment Prevents serious imbalance on important covariates, while maintaining randomness of treatment allocation. Do not need to categorize covariates. Expected that units are being randomized sequentially. Could be deterministic. Need to set suitable balance criteria.
Minimization [10] Restricted technique: Sequentially assign units to treatment groups taking into account the balance on covariates and previous randomization assignments. Maintains balance among several covariates, while minimizing imbalance in the distribution of the treatment across whole trial and each stratification variable. Expectation is that units being randomized are available sequentially, which is usually not the case in a cluster-randomized trial. Could have imbalance in specific strata. Criticized for being too deterministic.
Dynamic Randomization [12] Restricted technique: For each level of a stratification hierarchy, a balance criteria is set, to keep imbalances from exceeding these limits. If imbalance is within limits for all levels, unit is randomly assigned, otherwise allocation is forced at stratification level where limits exceeded to reduce imbalance. Maintains balance on treatment assignments across the whole trial and within each strata. Most useful in unblinded trials. Need a centrally administered trial. Expected that units are being randomized sequentially.
Outcome Adaptive Randomization [30] Restricted technique: Class of methods including those proposed by Bather, [31] Thompson, [32] Zelen, [33] Sobel and Weiss, [34] and Berry and Fristedt, [35] in which treatment assignment is dependent on response of previous individuals. Objective is to maximize the number of overall successes, maximize effective treatment. Expected that units are being randomized sequentially. Need real time reporting of outcomes that can be measured shortly after treatment initiation, (e.g. pain relief for a treatment).

In sequential randomization, clusters are randomized over time as they are included in the study. This approach, though more difficult to implement, in principle allows for consideration of balance as an evolving state that can be monitored and controlled via randomization as the trial proceeds. Adaptive randomization – in which features of the design are changed or updated in response to the current state of enrollment – can be considered an important subset of sequential schemes, but few adaptive cluster randomization methods have been described in the literature.

As difficulty in enrolling enough participants is a key hurdle in randomized trials, an important challenge in maintaining the balance between arms after the initial randomization occurs when clusters need to be added, typically to meet recruitment goals. Randomization methods that allocate units in a sequential manner (e.g. baseline covariate adaptive randomization [10,12]) allow for the addition of participants later in the trial, but require flexibility on the part of investigators and trial centers. If investigators cannot stagger enrollment across centers or they do not have a large enough pool of clusters, these methods still may not be practical. In simultaneous schemes (e.g. covariate constrained randomization) [13,14], there is no defined method to include additional clusters once the initial randomization is completed. As noted above, these methods must rely on what limited data is available on key risk factors and/or only cluster level information is available at the time of study start. Given the desire to include participants with MCC and to conduct more pragmatic trials with diverse populations, there is a need to expand upon these methods to insure valid results.

MULTIFACTORIAL INTERVENTIONS

CRTs can be advantageous for testing multiple-component interventions tailored to an individual’s risk factors, designs that we refer to as standardly-tailored. For example, an intervention may combine protein and vitamin D supplementation, where only participants below a certain daily intake of protein are considered for randomization to the former, and only those whose circulating vitamin D levels are below a certain threshold are considered for randomization to the latter. The standardly-tailored intervention allows participants to be enrolled in the trial as long as they have at least one of the risk factors. Participants randomized to intervention receive those components relevant to their status (in the example above, either protein or vitamin D supplementation or both), while those not randomized to intervention receive a placebo, standard care, or some other control regime. These designs are thus highly applicable to trials enrolling individuals with MCC.

A CRT of a standardly-tailored multi-component intervention produces a distribution of interventions and components with marked similarities to clinical practice in that, within clusters randomized to interventions, participants receive those components appropriate to their health state. Thus, standardly-tailored designs are highly pragmatic and may produce more immediately applicable findings. They also present sizable methodological challenges. In addition to the issues related to randomization described above, analyses of these designs must acknowledge that individuals may not be eligible or need every component of the intervention given their risk factor profile. In CRTs, this treatment (and corresponding risk factor) heterogeneity may exacerbate the complex imbalances that can occurs between clusters, as well as inducing different treatment effects among subpopulations of participants, which may in turn be unevenly spread across clusters.

LIMITATIONS OF CLUSTER RANDOMIZED TRIAL DESIGNS

While CRTs are advantageous and may closely represent clinical practice, they also present substantial challenges. With the movement toward inclusion of more complex patient populations and to more pragmatic clinical trials, the limits of the currently methodology will be tested, and refinements will prove necessary. However, the shift from CRT to pragmatic trials that may use cluster randomization often include a generalizable population and potentially many clusters, with few if any techniques available for balancing covariates. Trial designers should weight the tradeoffs of enrolling few large clusters (often fewer than 100), or many smaller clusters, which may be protected by randomization.

To date there is no established method describing the addition or replacement of a cluster following the use of matching or any of the simultaneous randomization schemes, such as covariate constrained randomization. The flexibility needed for large scale CRTs to meet the realities of the healthcare systems, clinical practices, and patient populations in which they are used makes some randomization procedures infeasible, while casting doubt on the applicability of others. As the call for pragmatic trials strengthens, there may be more emphasis to use trials that are broadly inclusive of people with MCC and other complex populations.

CHALLENGES TO DEVELOPING CRT AMONG POPULATIONS WITH MCC

The overarching methodologic challenges inherent in CRT designs [9,1517] may be exacerbated by the demands of the specific patient population under study and inattention to best practices. Numerous authors have described the difficulties in developing and reporting CRT in primary care [1820], cardiovascular health [21], nutrition [22], pediatrics [23], pain management [24], and other fields [25,26]. Even identifying published reports of CRT in a specific area can be difficult [27], and the quality of reporting of critical information may be lacking. A recent review estimated that fewer than one quarter of publications describing CRT provide tabular depictions of covariate data; only 17% of trials reporting the use of covariates in randomization adjusted for all of those covariates in analysis [28]. For populations with MCC, the reporting of illnesses, multimorbidity and other potential confounders is an obvious concern. Though few adaptive CRT have been published, a recent report describes difficulties common to all trials but of particular relevance to cluster designs, including a need to make unplanned revisions to the randomization scheme due to practical limitations on enrollment and retention [24]; it seems likely that cluster trials among participants with MCC would face similar challenges. More rigorous methods and enhanced guidance addressing these difficulties are needed.

CONCLUSION

CRTs offer considerable benefits in that they allow for the testing of complex interventions intended to produce change at the group level, and the manner in which they deploy interventions is somewhat more reminiscent of clinical practice than is the case in conventional randomized trials. However, there are substantial methodological costs accrued in deploying cluster designs. The issue of imbalance on confounders is a critical concern that is not addressed in simple randomization schemes. Methods for insuring balance can be deployed under simultaneous or sequential matching schemes, but these rely heavily on existing or accumulated data applicable to the trial population. If the unit of analysis is the participant, these difficulties can be substantially exacerbated. The benefits and weaknesses of the various approaches described in Table 1 should be carefully taken into account during the design of a CRT. Additionally, with increasing emphasis on pragmatic trials, as well as the inclusion of individuals with complex disease states, future expansion and refinement of these methods is of urgent importance.

References

RESOURCES