Sample size calculation for stepped wedge and other longitudinal cluster randomised trials

The sample size required for a cluster randomised trial is inflated compared with an individually randomised trial because outcomes of participants from the same cluster are correlated. Sample size calculations for longitudinal cluster randomised trials (including stepped wedge trials) need to take account of at least two levels of clustering: the clusters themselves and times within clusters. We derive formulae for sample size for repeated cross‐section and closed cohort cluster randomised trials with normally distributed outcome measures, under a multilevel model allowing for variation between clusters and between times within clusters. Our formulae agree with those previously described for special cases such as crossover and analysis of covariance designs, although simulation suggests that the formulae could underestimate required sample size when the number of clusters is small. Whether using a formula or simulation, a sample size calculation requires estimates of nuisance parameters, which in our model include the intracluster correlation, cluster autocorrelation, and individual autocorrelation. A cluster autocorrelation less than 1 reflects a situation where individuals sampled from the same cluster at different times have less correlated outcomes than individuals sampled from the same cluster at the same time. Nuisance parameters could be estimated from time series obtained in similarly clustered settings with the same outcome measure, using analysis of variance to estimate variance components. Copyright © 2016 John Wiley & Sons, Ltd.
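The inflation described in the opening sentence can be illustrated with the familiar design effect for a parallel cluster randomised trial with a single cross-section, 1 + (m - 1)ρ, where m is the cluster size and ρ the intracluster correlation. The sketch below is not the longitudinal formula derived in the paper. The function names, the normal-approximation sample size for two groups, and the numerical inputs are assumptions chosen for illustration. The last function encodes the interpretation of cluster autocorrelation given above: two different individuals in the same cluster have correlation ρ in the same period, and ρ multiplied by the cluster autocorrelation in different periods.

```python
import math
from statistics import NormalDist


def n_per_arm_individual(delta, sd, alpha=0.05, power=0.80):
    """Normal-approximation sample size per arm for an individually
    randomised two-group comparison of means (illustrative assumption)."""
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_beta = NormalDist().inv_cdf(power)
    return 2 * (z_alpha + z_beta) ** 2 * sd ** 2 / delta ** 2


def design_effect(cluster_size, icc):
    """Standard design effect for a single cross-section parallel
    cluster randomised trial with equal cluster sizes."""
    return 1 + (cluster_size - 1) * icc


def between_individual_correlation(icc, cluster_autocorr, same_period):
    """Correlation between outcomes of two different individuals
    from the same cluster, in the same or in different periods."""
    return icc if same_period else icc * cluster_autocorr


if __name__ == "__main__":
    # Hypothetical inputs: standardised difference 0.5, ICC 0.05,
    # 20 participants per cluster, cluster autocorrelation 0.8.
    n_ind = n_per_arm_individual(delta=0.5, sd=1.0)
    deff = design_effect(cluster_size=20, icc=0.05)
    n_clustered = math.ceil(n_ind * deff)
    print("per-arm n, individual randomisation:", math.ceil(n_ind))
    print("design effect:", round(deff, 2))
    print("per-arm n, cluster randomisation:", n_clustered)
    print("clusters per arm:", math.ceil(n_clustered / 20))
    print("correlation across periods:",
          between_individual_correlation(0.05, 0.8, same_period=False))
```

With a cluster autocorrelation of 1 the two correlations coincide, which is the case where times within clusters add no extra level of clustering.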
