Optimal sample sizes in experimental designs with individuals nested within clusters

This article deals with optimal sample sizes for experimental studies with individuals nested within clusters. Examples are multicenter clinical trials with patients nested within clinics or family practices, studies on absence due to illness with employees nested within work sites, or school-based smoking prevention interventions with pupils nested within schools. Optimal sample sizes are given for models with continuous or binary outcomes and for 1 or 2 treatment factors. Because individuals are nested within clusters, not only the total number of individuals needed to obtain a specified power of the test of treatment effect has to be established but also the number of clusters and the number of individuals per cluster. We show that different outcome variables lead to different optimal sample sizes. Furthermore, the statistically optimal design is not always feasible in practice because of certain limitations, and we show how to calculate the optimal design given restrictions on the sample sizes at both...

[1]  Martijn P. F. Berger,et al.  Optimal experimental designs for multilevel logistic models , 2001 .

[2]  D. Hedeker,et al.  MIXREG: a computer program for mixed-effects regression analysis with autocorrelated errors. , 1996, Computer methods and programs in biomedicine.

[3]  Timothy J. Robinson,et al.  Multilevel Analysis: Techniques and Applications , 2002 .

[4]  Ian H. Langford,et al.  A User’s Guide to MLwiN, Version 2.10 , 2000 .

[5]  Martijn P. F. Berger,et al.  OPTIMAL EXPERIMENTAL DESIGNS FOR MULTILEVEL MODELS WITH COVARIATES , 2001 .

[6]  R. Littell SAS System for Mixed Models , 1996 .

[7]  Jacob Cohen,et al.  A power primer. , 1992, Psychological bulletin.

[8]  H. Schouten,et al.  Sample size formula with a continuous outcome for unequal group sizes and unequal variances. , 1999, Statistics in medicine.

[9]  D. Bloch,et al.  A simple method of sample size calculation for linear and logistic regression. , 1998, Statistics in medicine.

[10]  Anthony S. Bryk,et al.  Hierarchical Linear Models: Applications and Data Analysis Methods , 1992 .

[11]  F B Hu,et al.  Intraclass correlation estimates in a school-based smoking prevention study. Outcome and mediating variables, by sex and ethnicity. , 1996, American journal of epidemiology.

[12]  D. Hedeker,et al.  MIXOR: a computer program for mixed-effects ordinal regression analysis. , 1996, Computer methods and programs in biomedicine.

[13]  Roel Bosker,et al.  Standard Errors and Sample Sizes for Two-Level Research , 1993 .

[14]  A. Donner,et al.  Randomization by cluster. Sample size requirements and analysis. , 1981, American journal of epidemiology.

[15]  Ilse Mesters,et al.  Short-term effects of a randomized computer-based out-of-school smoking prevention trial aimed at elementary schoolchildren. , 2002, Preventive medicine.

[16]  Mirjam Moerbeek,et al.  A comparison between traditional methods and multilevel regression for the analysis of multicenter intervention studies. , 2003, Journal of clinical epidemiology.

[17]  Gary H. McClelland,et al.  Optimal design in psychological research. , 1997 .

[18]  S. Raudenbush Statistical analysis and optimal design for cluster randomized trials , 1997 .

[19]  F. Hsieh,et al.  Sample size formulae for intervention studies with the cluster as unit of randomization. , 1988, Statistics in medicine.

[20]  Martijn P. F. Berger,et al.  A Comparison of Estimation Methods for Multilevel Logistic Models , 2003, Comput. Stat..

[21]  N. Goldman,et al.  Improved estimation procedures for multilevel models with binary response: a case‐study , 2001 .

[22]  Martijn P. F. Berger,et al.  Design Issues for Experiments in Multilevel Populations , 2000 .

[23]  Amita K. Manatunga,et al.  Sample Size Estimation in Cluster Randomized Studies with Varying Cluster Size , 2001 .