A simple sample size formula for analysis of covariance in cluster randomized trials

For cluster randomized trials with a continuous outcome, the sample size is often calculated as if an analysis of the outcomes at the end of the treatment period (follow-up scores) would be performed. However, often a baseline measurement of the outcome is available or feasible to obtain. An analysis of covariance (ANCOVA) using both the baseline and follow-up score of the outcome will then have more power. We calculate the efficiency of an ANCOVA analysis using the baseline scores compared with an analysis on follow-up scores only. The sample size for such an ANCOVA analysis is a factor r2 smaller, where r is the correlation of the cluster means between baseline and follow-up. This correlation can be expressed in clinically interpretable parameters: the correlation between baseline and follow-up of subjects (subject autocorrelation) and that of clusters (cluster autocorrelation). Because of this, subject matter knowledge can be used to provide (range of) plausible values for these correlations, when estimates from previous studies are lacking. Depending on how large the subject and cluster autocorrelations are, analysis of covariance can substantially reduce the number of clusters needed.

[1]  Jonathan L. Blitstein,et al.  Design and analysis of group-randomized trials: a review of recent methodological developments. , 2004, American journal of public health.

[2]  R. Hayes,et al.  Simple sample size calculation for cluster-randomized trials. , 1999, International journal of epidemiology.

[3]  F. Hsieh,et al.  Sample size formulae for intervention studies with the cluster as unit of randomization. , 1988, Statistics in medicine.

[4]  George F Borm,et al.  A simple sample size formula for analysis of covariance in randomized clinical trials. , 2007, Journal of clinical epidemiology.

[5]  David M. Murray,et al.  Methods To Reduce The Impact Of Intraclass Correlation In Group-Randomized Trials , 2003, Evaluation review.

[6]  Mirjam Moerbeek,et al.  Power and money in cluster randomized trials: when is it worth measuring a covariate? , 2006, Statistics in medicine.

[7]  Jeremy M. Grimshaw,et al.  Sample size calculator for cluster randomized trials , 2004, Comput. Biol. Medicine.

[8]  Gerard J P van Breukelen,et al.  Relative efficiency of unequal versus equal cluster sizes in cluster randomized and multicentre trials , 2007, Statistics in medicine.

[9]  Neil Klar,et al.  Methods for modelling change in cluster randomization trials , 2004, Statistics in medicine.

[10]  S. Raudenbush Statistical analysis and optimal design for cluster randomized trials , 1997 .

[11]  Daniel J Zaccaro,et al.  An integrated population‐averaged approach to the design, analysis and sample size determination of cluster‐unit trials , 2003, Statistics in medicine.

[12]  H. Bloom,et al.  Using Covariates to Improve Precision for Studies That Randomize Schools to Evaluate Educational Interventions , 2007 .

[13]  D. Ashby,et al.  Sample size for cluster randomized trials: effect of coefficient of variation of cluster size and analysis method. , 2006, International journal of epidemiology.

[14]  Simon G Thompson,et al.  Allowing for imprecision of the intracluster correlation coefficient in the design of cluster randomized trials , 2004, Statistics in medicine.

[15]  Jacob Cohen,et al.  A power primer. , 1992, Psychological bulletin.

[16]  H A Feldman,et al.  Cohort versus cross-sectional design in large field trials: precision, sample size, and a unifying model. , 1994, Statistics in medicine.

[17]  Devan V Mehrotra,et al.  Sample size determination for constrained longitudinal data analysis , 2009, Statistics in medicine.

[18]  Allan Donner,et al.  Sample Size Estimation in Cluster Randomized Educational Trials: An Empirical Bayes Approach , 2009 .