Sample size considerations for stratified cluster randomization design with binary outcomes and varying cluster size

Stratified cluster randomization trials (CRTs) have been frequently employed in clinical and healthcare research. Comparing with simple randomized CRTs, stratified CRTs reduce the imbalance of baseline prognostic factors among different intervention groups. Due to the popularity, there has been a growing interest in methodological development on sample size estimation and power analysis for stratified CRTs; however, existing work mostly assumes equal cluster size within each stratum and uses multilevel models. Clusters are often naturally formed with random sizes in CRTs. With varying cluster size, commonly used ad hoc approaches ignore the variability in cluster size, which may underestimate (overestimate) the required number of clusters for each group per stratum and lead to underpowered (overpowered) clinical trials. We propose closed-form sample size formulas for estimating the required total number of subjects and for estimating the number of clusters for each group per stratum, based on Cochran-Mantel-Haenszel statistic for stratified cluster randomization design with binary outcomes, accounting for both clustering and varying cluster size. We investigate the impact of various design parameters on the relative change in the required number of clusters for each group per stratum due to varying cluster size. Simulation studies are conducted to evaluate the finite-sample performance of the proposed sample size method. A real application example of a pragmatic stratified CRT of a triad of chronic kidney disease, diabetes, and hypertension is presented for illustration.

[1]  W. G. Cochran Some Methods for Strengthening the Common χ 2 Tests , 1954 .

[2]  Mirjam Moerbeek,et al.  Power Analysis of Trials with Multilevel Data , 2015 .

[3]  S. Raudenbush Statistical analysis and optimal design for cluster randomized trials , 1997 .

[4]  J. Hanley,et al.  GEE analysis of negatively correlated binary responses: a caution. , 2000, Statistics in medicine.

[5]  Spyros Konstantopoulos,et al.  Computing Power of Tests of the Variance of Treatment Effects in Designs With Two Levels of Nesting , 2008, Multivariate behavioral research.

[6]  Peter Z. Schochet Statistical Power for Random Assignment Evaluations of Education Programs , 2005 .

[7]  D. Firth,et al.  Estimating Intraclass Correlation for Binary Data , 1999, Biometrics.

[8]  D. Torgerson,et al.  Cluster randomized controlled trials. , 2005, Journal of evaluation in clinical practice.

[9]  A Donner,et al.  Sample size requirements for stratified cluster randomization designs. , 1992, Statistics in medicine.

[10]  I. Boutron,et al.  Reporting of analyses from randomized controlled trials with multiple arms: a systematic review , 2013, BMC Medicine.

[11]  R Z Omar,et al.  Bayesian methods of analysis for cluster randomized trials with binary outcome data. , 2001, Statistics in medicine.

[12]  Peter Z. Schochet Statistical Power for School-Based RCTs With Binary Outcomes , 2013 .

[13]  Fan Li,et al.  Review of Recent Methodological Developments in Group-Randomized Trials: Part 1-Design. , 2017, American journal of public health.

[14]  Amita K. Manatunga,et al.  Sample Size Estimation in Cluster Randomized Studies with Varying Cluster Size , 2001 .

[15]  L. Pilotto,et al.  Cluster randomized controlled trials in primary care: An introduction , 2006, The European journal of general practice.

[16]  Martijn P. F. Berger,et al.  Optimal experimental designs for multilevel logistic models , 2001 .

[17]  D. Bonett Sample size requirements for estimating intraclass correlations with desired precision , 2002, Statistics in medicine.

[18]  R F Woolson,et al.  Sample size for case-control studies using Cochran's statistic. , 1986, Biometrics.

[19]  Allan Donner,et al.  Confidence Interval Estimation of the Intraclass Correlation Coefficient for Binary Outcome Data , 2004, Biometrics.

[20]  Song Zhang,et al.  Power analysis for stratified cluster randomisation trials with cluster size being the stratifying factor , 2017 .

[21]  Sin-Ho Jung,et al.  Sample Size Calculation for Dichotomous Outcomes in Cluster Randomization Trials with Varying Cluster Size , 2003 .

[22]  Ken P. Kleinman,et al.  The Effect of Cluster Size Variability on Statistical Power in Cluster-Randomized Trials , 2015, PloS one.

[23]  Kevin K Dobbin,et al.  Comparison of confidence interval methods for an intra-class correlation coefficient (ICC) , 2014, BMC Medical Research Methodology.

[24]  A Donner,et al.  Adjustments to the Mantel-Haenszel chi-square statistic and odds ratio variance estimator when the data are clustered. , 1987, Statistics in medicine.

[25]  Andrew Copas,et al.  Methods for sample size determination in cluster randomized trials , 2015, International journal of epidemiology.

[26]  Wendy Chan,et al.  The Relationship Among Design Parameters for Statistical Power Between Continuous and Binomial Outcomes in Cluster Randomized Trials , 2019, Psychological methods.

[27]  Jessaca Spybrook,et al.  Assessing the Precision of Multisite Trials for Estimating the Parameters of a Cross-Site Population Distribution of Program Effects , 2017 .

[28]  J. Lewsey,et al.  Comparing completely and stratified randomized designs in cluster randomized trials when the stratifying factor is cluster size: a simulation study , 2004, Statistics in medicine.