We read with interest the work of Charehbili and colleagues1, which ‘aimed to investigate whether there is a superiority of chlorhexidine–alcohol over iodine–alcohol for preventing SSI’.
This cluster‐randomized crossover trial was conducted in five hospitals and 3665 patients were included. The authors found that the incidence of surgical‐site infection (SSI) was not different between the groups: 3·8 per cent among patients in the chlorhexidine–alcohol group versus 4·0 per cent in those in the iodine–alcohol group (odds ratio 0·96, 95 per cent c.i. 0·69 to 1·35).
We commend the authors for performing this interesting study, as these results are useful for the choice of the most appropriate preoperative antiseptic. However, we have several statistical suggestions and queries that we would like to communicate to the authors.
The authors concluded that ‘Preoperative skin disinfection with chlorhexidine–alcohol is similar to that for iodine–alcohol with respect to reducing the risk of developing an SSI’. This may be due to an underpowered study.
In fact, sample size was estimated by simulation. Although this approach is efficient, the authors do not provide enough details on the parameters they used. Thus it is not easy to replicate calculations. As mentioned by the CONSORT statement2: ‘the reports of cluster randomized trials should state the assumptions used when calculating the number of clusters and the cluster sample size’.
The authors mention R software for the simulations that led to the final estimation of sample size. But several R packages are available and one may suppose that a package such as clusterPower was used3. Not knowing which package was used does not permit the analysis to be replicated. Moreover, algorithms used may vary between packages and lead to different estimations of sample size.
In a cluster‐randomized crossover trial, a sequence of interventions is assigned to a cluster (group) of individuals. Each cluster receives each intervention in a separate period of time and this leads to ‘cluster periods'4. There is usually a correlation between patients in the same cluster. In addition, within a cluster, patients within the same period may be more similar to one another than to patients in other periods5.
In a cluster‐randomized crossover trial, the sample size estimated by not taking into account the above‐mentioned features must be multiplied by a defined inflation factor. The latter can be approximated by (1 + (n − 1)ρ) − η6, 7, where n is the average number of patients in a cluster during one of the periods, ρ is the intraclass correlation (ICC), and η the interperiod correlation. See, for example, Turner et al.8 or Moerbeek and Teerenstra9 (p. 94) for other approaches to estimate sample size in this context.
Parameters ρ and η can be retrieved from literature or estimated using assumptions or approximations10 (p. 203), for instance: the logarithm of the ICC can be approximated by the logarithm of the prevalence of disease (here, the SSI rate)11; the interclass correlation is intrinsically lower than the ICC12.
Data were analysed using a multilevel model, which is appropriate. Treatment period was considered as a fixed effect and hospitals as random effect. Treatment period could also be considered as a random effect. In their simulations, Morgan et al.5 actually demonstrated that ‘hierarchical models without random effects for period‐within‐cluster, which do not account for any extra within‐period correlation, performed poorly with greatly inflated Type I errors in many scenarios’.
The authors did not report variance components of outcomes: within‐ and between‐participant variance, the ICC, as recommended by some authors13.
In a cluster‐randomized crossover trial, three components of variation are available: variation in cluster mean response; variation in the cluster period mean response; and variation between individual responses within a cluster period4. Small changes in the specification of the within‐cluster–within‐period correlation, or the within‐cluster–between‐period correlation, can increase the required number of clusters4. Thus, as the above‐mentioned correlation parameters were not reported by Charehbili et al.1, the number of clusters required may be larger than that used in the study.
A simulation study showed an association between an increase in cluster size variability and a decrease in statistical power14. The authors did not address this point.
In summary, the results of this study are interesting, but readers should interpret them with caution, according to the statistical methods used for design and analysis of cluster‐randomized crossover trials.
Disclosure
The authors declare no conflict of interest.
DOI: 10.1002/bjs5.50177
The Editor‐in‐Chief welcomes topical correspondence from readers relating to articles published in BJS Open. Letters should be submitted via ScholarOne Manuscripts and, if accepted, will be published online.
References
- 1. Charehbili A, Koek MBG, de Mol van Otterloo JCA, Bronkhorst MWGA, van der Zwaal P, Thomassen B et al Cluster‐randomized crossover trial of chlorhexidine–alcohol versus iodine–alcohol for prevention of surgical‐site infection (SKINFECT trial). BJS Open 2019; 3: 617–622. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Campbell MK, Piaggio G, Elbourne DR, Altman DG, CONSORT Group . Consort 2010 statement: extension to cluster randomised trials. BMJ 2012; 345: e5661. [DOI] [PubMed] [Google Scholar]
- 3. Reich NG, Myers JA, Obeng D, Milstone AM, Perl TM. Empirical power and sample size calculations for cluster‐randomized and cluster‐randomized crossover studies. PLoS One 2012; 7: e35564. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Arnup SJ, McKenzie JE, Hemming K, Pilcher D, Forbes AB. Understanding the cluster randomised crossover design: a graphical illustration of the components of variation and a sample size tutorial. Trials 2017; 18: 381. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Morgan KE, Forbes AB, Keogh RH, Jairath V, Kahan BC. Choosing appropriate analysis methods for cluster randomised cross‐over trials with a binary outcome. Stat Med 2017; 36: 318–333. [DOI] [PubMed] [Google Scholar]
- 6. Donner A, Birkett N, Buck C. Randomization by cluster. Sample size requirements and analysis. Am J Epidemiol 1981; 114: 906–914. [DOI] [PubMed] [Google Scholar]
- 7. Giraudeau B, Ravaud P, Donner A. Sample size calculation for cluster randomized cross‐over trials. Stat Med 2008; 27: 5578–5585. [DOI] [PubMed] [Google Scholar]
- 8. Turner EL, Li F, Gallis JA, Prague M, Murray DM. Review of recent methodological developments in group‐randomized trials: part 1 – design. Am J Public Health 2017; 107: 907–915. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Moerbeek M, Teerenstra S. Power Analysis of Trials with Multilevel Data. CRC Press: Boca Raton, 2015. [Google Scholar]
- 10. Machin D, Campbell MJ, Tan SB, Tan SH. Sample Sizes for Clinical, Laboratory and Epidemiology Studies. John Wiley: Hoboken, 2018. [Google Scholar]
- 11. Gulliford MC, Adams G, Ukoumunne OC, Latinovic R, Chinn S, Campbell MJ. Intraclass correlation coefficient and outcome prevalence are associated in clustered binary data. J Clin Epidemiol 2005; 58: 246–251. [DOI] [PubMed] [Google Scholar]
- 12. Donner A, Klar N, Zou G. Methods for the statistical analysis of binary data in split‐cluster designs. Biometrics 2004; 60: 919–925. [DOI] [PubMed] [Google Scholar]
- 13. Arnup SJ, Forbes AB, Kahan BC, Morgan KE, McKenzie JE. The quality of reporting in cluster randomised crossover trials: proposal for reporting items and an assessment of reporting quality. Trials 2016; 17: 575. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Lauer SA, Kleinman KP, Reich NG. The effect of cluster size variability on statistical power in cluster‐randomized trials. PLoS One 2015; 10: e0119074. [DOI] [PMC free article] [PubMed] [Google Scholar]