Methods for sample size determination in cluster randomized trials

Background: The use of cluster randomized trials (CRTs) is increasing, along with the variety in their design and analysis. The simplest approach for their sample size calculation is to calculate the sample size assuming individual randomization and inflate this by a design effect to account for randomization by cluster. The assumptions of a simple design effect may not always be met; alternative or more complicated approaches are required. Methods: We summarise a wide range of sample size methods available for cluster randomized trials. For those familiar with sample size calculations for individually randomized trials but with less experience in the clustered case, this manuscript provides formulae for a wide range of scenarios with associated explanation and recommendations. For those with more experience, comprehensive summaries are provided that allow quick identification of methods for a given design, outcome and analysis method. Results: We present first those methods applicable to the simplest two-arm, parallel group, completely randomized design followed by methods that incorporate deviations from this design such as: variability in cluster sizes; attrition; non-compliance; or the inclusion of baseline covariates or repeated measures. The paper concludes with methods for alternative designs. Conclusions: There is a large amount of methodology available for sample size calculations in CRTs. This paper gives the most comprehensive description of published methodology for sample size calculation and provides an important resource for those designing these trials.

[1]  S. Raudenbush Statistical analysis and optimal design for cluster randomized trials , 1997 .

[2]  M. Segal,et al.  Design effects for binary regression models fitted to dependent data. , 1993, Statistics in medicine.

[3]  K Y Liang,et al.  Sample size calculations for studies with correlated observations. , 1997, Biometrics.

[4]  Jessica A. Myers,et al.  Empirical Power and Sample Size Calculations for Cluster-Randomized and Cluster-Randomized Crossover Studies , 2012, PloS one.

[5]  Xiaonan Xue,et al.  Sample size requirement to detect an intervention effect at the end of follow‐up in a longitudinal cluster randomized trial , 2010, Statistics in medicine.

[6]  Mirjam Moerbeek,et al.  Sample size calculations for 3-level cluster randomized trials , 2008, Clinical trials.

[7]  A Donner,et al.  Sample size calculation for cluster randomized cross‐over trials , 2008, Statistics in medicine.

[8]  Tailiang Xie,et al.  Design and sample size estimation in clinical trials with clustered survival times as the primary endpoint , 2003, Statistics in medicine.

[9]  R. Byng,et al.  Exploratory cluster randomised controlled trial of shared care development for long-term mental illness. , 2004, The British journal of general practice : the journal of the Royal College of General Practitioners.

[10]  W. Shih,et al.  Sample Size and Power Calculations for Periodontal and Other Studies with Clustered Samples Using the Method of Generalized Estimating Equations , 1997 .

[11]  D R Jacobs,et al.  The worksite component of variance: design effects and the Healthy Worker Project. , 1993, Health education research.

[12]  Antje Jahn-Eimermacher,et al.  Sample size in cluster‐randomized trials with time to event as the primary endpoint , 2013, Statistics in medicine.

[13]  Steven Teerenstra,et al.  Sample Size Considerations for GEE Analyses of Three‐Level Cluster Randomized Trials , 2010, Biometrics.

[14]  S. Mukhopadhyay,et al.  Quantile dispersion graphs to compare the efficiencies of cluster randomized designs , 2009 .

[15]  Martijn P. F. Berger,et al.  OPTIMAL EXPERIMENTAL DESIGNS FOR MULTILEVEL MODELS WITH COVARIATES , 2001 .

[16]  Z Feng,et al.  Correlated binomial variates: properties of estimator of intraclass correlation and its effect on sample size calculation. , 1992, Statistics in medicine.

[17]  A Donner,et al.  Statistical considerations in the design and analysis of community intervention trials. , 1996, Journal of clinical epidemiology.

[18]  Spyros Konstantopoulos,et al.  Incorporating Cost in Power Analysis for Three-Level Cluster-Randomized Designs , 2009, Evaluation review.

[19]  David J Spiegelhalter,et al.  Prior distributions for the intracluster correlation coefficient, based on multiple previous estimates, and their application in cluster randomized trials , 2005, Clinical trials.

[20]  Michael R. Kosorok,et al.  Sample‐size formula for clustered survival data using weighted log‐rank statistics , 2004 .

[21]  Sally Kerry,et al.  A Practical Guide to Cluster Randomised Trials in Health Services Research , 2012 .

[22]  Ian Harvey,et al.  A pragmatic–explanatory continuum indicator summary (PRECIS): a tool to help trial designers , 2009, Canadian Medical Association Journal.

[23]  D M Murray,et al.  Planning for the appropriate analysis in school-based drug-use prevention studies. , 1990, Journal of consulting and clinical psychology.

[24]  Gerard J P van Breukelen,et al.  Relative efficiency of unequal versus equal cluster sizes in cluster randomized and multicentre trials , 2007, Statistics in medicine.

[25]  D. Ashby,et al.  Sample size for cluster randomized trials: effect of coefficient of variation of cluster size and analysis method. , 2006, International journal of epidemiology.

[26]  Subhash Aryal,et al.  Sample Size Determination for Hierarchical Longitudinal Designs with Differential Attrition Rates , 2007, Biometrics.

[27]  R. Glynn,et al.  Incorporation of Clustering Effects for the Wilcoxon Rank Sum Test: A Large‐Sample Approach , 2003, Biometrics.

[28]  P G Smith,et al.  Calculation of power for matched pair studies when randomization is by group. , 1989, International journal of epidemiology.

[29]  Moonseong Heo,et al.  Sample size requirements to detect an intervention by time interaction in longitudinal cluster randomized clinical trials , 2009, Statistics in medicine.

[30]  D. Jacobs,et al.  PARAMETERS TO AID IN THE DESIGN AND ANALYSIS OF COMMUNITY TRIALS: INTRACLASS CORRELATIONS FROM THE MINNESOTA HEART HEALTH PROGRAM , 1994, Epidemiology.

[31]  Neil Klar,et al.  Sample size re‐estimation in cluster randomization trials , 2002, Statistics in medicine.

[32]  S. Chinn,et al.  Components of variance and intraclass correlations for the design of community-based surveys and intervention studies: data from the Health Survey for England 1994. , 1999, American journal of epidemiology.

[33]  F B Hu,et al.  Intraclass correlation estimates in a school-based smoking prevention study. Outcome and mediating variables, by sex and ethnicity. , 1996, American journal of epidemiology.

[34]  Jerome Cornfield,et al.  SYMPOSIUM ON CHD PREVENTION TRIALS: DESIGN ISSUES IN TESTING LIFE STYLE INTERVENTIONRANDOMIZATION BY GROUP: A FORMAL ANALYSIS , 1978 .

[35]  Martijn P. F. Berger,et al.  Optimal experimental designs for multilevel logistic models , 2001 .

[36]  S M Kerry,et al.  Unequal cluster sizes for trials in English and Welsh general practice: implications for sample size calculations. , 2001, Statistics in medicine.

[37]  Kung-Jong Lui,et al.  Sample Size Determination for Testing Equality in a Cluster Randomized Trial with Noncompliance , 2010, Journal of biopharmaceutical statistics.

[38]  J. Williamson,et al.  Sample‐size calculations for studies with correlated ordinal outcomes , 2005, Statistics in medicine.

[39]  Guosheng Yin,et al.  Adaptive Design and Estimation in Randomized Clinical Trials with Correlated Observations , 2005, Biometrics.

[40]  F J Ingelfinger,et al.  International Journal of Epidemiology , 1973, The New England journal of medicine.

[41]  D. Firth,et al.  Estimating Intraclass Correlation for Binary Data , 1999, Biometrics.

[42]  D. Gerritsen,et al.  Stepped wedge designs could reduce the required sample size in cluster randomized trials. , 2013, Journal of clinical epidemiology.

[43]  D P Byar,et al.  The design of cancer prevention trials. , 1988, Recent results in cancer research. Fortschritte der Krebsforschung. Progres dans les recherches sur le cancer.

[44]  A Donner,et al.  Developments in cluster randomized trials and Statistics in Medicine , 2007, Statistics in medicine.

[45]  Sin-Ho Jung,et al.  Sample Size Calculation for Weighted Rank Tests Comparing Survival Distributions Under Cluster Randomization: A Simulation Method , 2007, Journal of biopharmaceutical statistics.

[46]  A K Manatunga,et al.  Sample Size Estimation for Survival Outcomes in Cluster‐Randomized Studies with Small Cluster Sizes , 2000, Biometrics.

[47]  G. W. Snedecor Statistical Methods , 1964 .

[48]  Beth A Reboussin,et al.  The Importance and Role of Intracluster Correlations in Planning Cluster Trials , 2007, Epidemiology.

[49]  S G Thompson,et al.  The design and analysis of paired cluster randomized trials: an application of meta-analysis techniques. , 1997, Statistics in medicine.

[50]  R J Carroll,et al.  On design considerations and randomization-based inference for community intervention trials. , 1996, Statistics in medicine.

[51]  D. Hoover,et al.  Power for T-test comparisons of unbalanced cluster exposure studies , 2002, Journal of Urban Health.

[52]  B. Short,et al.  Intraclass correlation among measures related to alcohol use by young adults: estimates, correlates and applications in intervention studies. , 1995, Journal of studies on alcohol.

[53]  M J Campbell,et al.  Cluster randomized trials in general (family) practice research , 2000, Statistical methods in medical research.

[54]  Sin-Ho Jung,et al.  Sample Size Calculation for Dichotomous Outcomes in Cluster Randomization Trials with Varying Cluster Size , 2003 .

[55]  David M. Murray,et al.  Design and Analysis of Group- Randomized Trials , 1998 .

[56]  D. Schoenfeld,et al.  Sample-size formula for the proportional-hazards regression model. , 1983, Biometrics.

[57]  E H Wagner,et al.  Data analysis and sample size issues in evaluations of community-based health promotion and disease prevention programs: a mixed-model analysis of variance approach. , 1991, Journal of clinical epidemiology.

[58]  A. Donner A Review of Inference Procedures for the Intraclass Correlation Coefficient in the One-Way Random Effects Model , 1986 .

[59]  A. Donner,et al.  Randomization by cluster. Sample size requirements and analysis. , 1981, American journal of epidemiology.

[60]  Mirjam Moerbeek,et al.  The Design of Cluster Randomized Crossover Trials , 2011 .

[61]  J. Hughes,et al.  Design and analysis of stepped wedge cluster randomized trials. , 2007, Contemporary clinical trials.

[62]  Allan Donner,et al.  Accounting for expected attrition in the planning of community intervention trials , 2007, Statistics in medicine.

[63]  S. Cousens,et al.  Issues in the design and interpretation of studies to evaluate the impact of community‐based interventions , 1997, Tropical medicine & international health : TM & IH.

[64]  R. Hayes,et al.  Simple sample size calculation for cluster-randomized trials. , 1999, International journal of epidemiology.

[65]  Diane J Catellier,et al.  School-level intraclass correlation for physical activity in adolescent girls. , 2004, Medicine and science in sports and exercise.

[66]  Luke B Connelly,et al.  Balancing the number and size of sites: an economic approach to the optimal design of cluster samples. , 2003, Controlled clinical trials.

[67]  D P Byar,et al.  Assessing the gain in efficiency due to matching in a community intervention study. , 1990, Statistics in medicine.

[68]  Mirjam Moerbeek,et al.  Power and money in cluster randomized trials: when is it worth measuring a covariate? , 2006, Statistics in medicine.

[69]  Kung-Jong Lui,et al.  Test Non-Inferiority and Sample Size Determination Based on the Odds Ratio Under a Cluster Randomized Trial with Noncompliance , 2010, Journal of biopharmaceutical statistics.

[70]  Robert J Glynn,et al.  Power and sample size estimation for the clustered wilcoxon test. , 2011, Biometrics.

[71]  Christina Pagel,et al.  Intracluster correlation coefficients and coefficients of variation for perinatal outcomes from five cluster-randomised controlled trials in low and middle-income countries: results and methodological implications , 2011, Trials.

[72]  Thomas M Braun,et al.  A mixed model formulation for designing cluster randomized trials with binary outcomes , 2003 .

[73]  A. Kristal,et al.  Selected methodological issues in evaluating community-based health promotion and disease prevention programs. , 1992, Annual review of public health.

[74]  Allan Donner,et al.  Design and Analysis of Cluster Randomization Trials in Health Research , 2001 .

[75]  Peter C Austin,et al.  A comparison of the statistical power of different methods for the analysis of cluster randomization trials with binary outcomes , 2007, Statistics in medicine.

[76]  Daniel J Zaccaro,et al.  An integrated population‐averaged approach to the design, analysis and sample size determination of cluster‐unit trials , 2003, Statistics in medicine.

[77]  S M McKinlay,et al.  Cost-efficient designs of cluster unit trials. , 1994, Preventive medicine.

[78]  Moonseong Heo,et al.  Statistical Power and Sample Size Requirements for Three Level Hierarchical Cluster Randomized Trials , 2008, Biometrics.

[79]  D J Spiegelhalter,et al.  Bayesian methods for cluster randomized trials with continuous responses. , 2001, Statistics in medicine.

[80]  A Donner,et al.  Sample size requirements for stratified cluster randomization designs. , 1992, Statistics in medicine.

[81]  Sandra Eldridge,et al.  Patterns of intra-cluster correlation from primary care research to inform study design and analysis. , 2004, Journal of clinical epidemiology.

[82]  Ziding Feng,et al.  Some design issues in a community intervention trial. , 2002, Controlled clinical trials.

[83]  Jonathan L. Blitstein,et al.  Design and analysis of group-randomized trials: a review of recent methodological developments. , 2004, American journal of public health.

[84]  Gerard J P Van Breukelen,et al.  Sample size adjustments for varying cluster sizes in cluster randomized trials with binary outcomes analyzed with second‐order PQL mixed logistic regression , 2010, Statistics in medicine.

[85]  W Pan,et al.  Sample size and power calculations with correlated binary data. , 2001, Controlled clinical trials.

[86]  P. Fayers,et al.  Determinants of the intracluster correlation coefficient in cluster randomized trials: the case of implementation research , 2005, Clinical trials.

[87]  John Gittins,et al.  A behavioural Bayes approach for sample size determination in cluster randomized clinical trials , 2010 .

[88]  Allan Donner,et al.  The merits of breaking the matches: a cautionary tale , 2007, Statistics in medicine.

[89]  David M Murray,et al.  Sizing a trial to alter the trajectory of health behaviours: methods, parameter estimates, and their application , 2007, Statistics in medicine.

[90]  Martijn P. F. Berger,et al.  Design Issues for Experiments in Multilevel Populations , 2000 .

[91]  Aiyi Liu,et al.  Sample size and power determination for clustered repeated measurements , 2002, Statistics in medicine.

[92]  J. Sterne,et al.  Methods for evaluating area-wide and organisation-based interventions in health and health care: a systematic review. , 1999, Health technology assessment.

[93]  Steven Teerenstra,et al.  A simple sample size formula for analysis of covariance in cluster randomized trials , 2012, Statistics in medicine.

[94]  X M Tu,et al.  Power analyses for longitudinal trials and other clustered designs , 2004, Statistics in medicine.

[95]  J M Bland,et al.  Trials which randomize practices II: sample size. , 1998, Family practice.

[96]  Anup Amatya,et al.  Sample size determination for clustered count data. , 2013, Statistics in medicine.

[97]  S A Hendricks,et al.  Power determination for geographically clustered data using generalized estimating equations. , 1996, Statistics in medicine.

[98]  X. Liu,et al.  Statistical Power and Optimum Sample Allocation Ratio for Treatment and Control Having Unequal Costs per Unit of Randomization , 2003 .

[99]  H A Feldman,et al.  Cohort versus cross-sectional design in large field trials: precision, sample size, and a unifying model. , 1994, Statistics in medicine.

[100]  Simon G Thompson,et al.  Allowing for imprecision of the intracluster correlation coefficient in the design of cluster randomized trials , 2004, Statistics in medicine.

[101]  David A Harrison,et al.  Sample Size and Power Calculations using the Noncentral t-distribution , 2004 .

[102]  Valérie Buthion,et al.  ColoNav: patient navigation for colorectal cancer screening in deprived areas – Study protocol , 1999, BMC Cancer.

[103]  Allan Donner,et al.  Intracluster correlation coefficients from the 2005 WHO Global Survey on Maternal and Perinatal Health: implications for implementation research. , 2008, Paediatric and perinatal epidemiology.

[104]  F. Hsieh,et al.  Sample size formulae for intervention studies with the cluster as unit of randomization. , 1988, Statistics in medicine.

[105]  Cora J. M. Maas,et al.  Optimal Experimental Designs for Multilevel Logistic Models with Two Binary Predictors , 2005 .

[106]  J. Whitehead Sample size calculations for ordered categorical data. , 1993, Statistics in medicine.

[107]  Michael J. Campbell,et al.  How to Design, Analyse and Report Cluster Randomised Trials in Medicine and Health Related Research , 2014 .

[108]  Amita K. Manatunga,et al.  Sample Size Estimation in Cluster Randomized Studies with Varying Cluster Size , 2001 .

[109]  Allan Donner,et al.  Some aspects of the design and analysis of cluster randomization trials , 2002 .

[110]  J Cornfield,et al.  Randomization by group: a formal analysis. , 1978, American journal of epidemiology.