The Power of the Test for Treatment Effects in Three-Level Cluster Randomized Designs

Abstract Experiments that involve nested structures often assign entire groups (such as schools) to treatment conditions. Key aspects of the design of such experiments include knowledge of the intraclass correlation structure and the sample sizes necessary to achieve adequate power to detect the treatment effect. This study provides methods for computing power in three-level cluster randomized balanced designs (with two levels of nesting), where, for example, students are nested within classrooms and classrooms are nested within schools and schools are assigned to treatments. The power computations take into account nesting effects at the second (classroom) and at the third (school) level, sample size effects (e.g., number of schools, classrooms, and individuals), and covariate effects (e.g., pretreatment measures). The methods are applicable to quasi-experimental studies that examine group differences in an outcome.

[1]  Roel Bosker,et al.  Standard Errors and Sample Sizes for Two-Level Research , 1993 .

[2]  V. Verma,et al.  An analysis of sampling errors for the Demographic Health Surveys. , 1996, International statistical review = Revue internationale de statistique.

[3]  S. Raudenbush,et al.  Effects of study duration, frequency of observation, and sample size on power in studies of group differences in polynomial change. , 2001, Psychological methods.

[4]  K. Murphy,et al.  Statistical Power Analysis: A Simple and General Model for Traditional and Modern Hypothesis Tests, Second Ediction , 1998 .

[5]  S. Raudenbush,et al.  Statistical power and optimal design for multisite randomized trials. , 2000, Psychological methods.

[6]  A. Winsor Sampling techniques. , 2000, Nursing times.

[7]  H. Goldstein Multilevel Statistical Models , 2006 .

[8]  S. Chinn,et al.  Components of variance and intraclass correlations for the design of community-based surveys and intervention studies: data from the Health Survey for England 1994. , 1999, American journal of epidemiology.

[9]  S. Raudenbush Statistical analysis and optimal design for cluster randomized trials , 1997 .

[10]  L. Hedges,et al.  Intraclass Correlation Values for Planning Group-Randomized Trials in Education , 2007 .

[11]  Roger E. Kirk,et al.  Experimental design: Procedures for the behavioral sciences (3rd ed.). , 1995 .

[12]  L. Hedges,et al.  How Large Are Teacher Effects? , 2004 .

[13]  David M. Murray,et al.  Design and Analysis of Group- Randomized Trials , 1998 .

[14]  Risto Lethonen Multilevel Statistical Models (3rd ed.) , 2005 .

[15]  Helena Chmura Kraemer,et al.  How Many Subjects? Statistical Power Analysis in Research , 1987 .

[16]  P. Lachenbruch Statistical Power Analysis for the Behavioral Sciences (2nd ed.) , 1989 .

[17]  David M Murray,et al.  Analysis strategies for a community trial to reduce adolescent ATOD use: a comparison of random coefficient and ANOVA/ANCOVA models. , 2006, Contemporary clinical trials.

[18]  Kosuke Imai,et al.  Survey Sampling , 1998, Nov/Dec 2017.

[19]  Jonathan L. Blitstein,et al.  Design and analysis of group-randomized trials: a review of recent methodological developments. , 2004, American journal of public health.

[20]  Patrick Leung,et al.  Power and Precision , 1999 .

[21]  Allan Donner,et al.  Design and Analysis of Cluster Randomization Trials in Health Research , 2001 .

[22]  Robert F. Boruch,et al.  Sensitivity, bias, and theory in impact evaluations. , 1977 .

[23]  Robert D. Tortora,et al.  Sampling: Design and Analysis , 2000 .

[24]  David M. Murray,et al.  Methods To Reduce The Impact Of Intraclass Correlation In Group-Randomized Trials , 2003, Evaluation review.

[25]  Jacob Cohen Statistical Power Analysis for the Behavioral Sciences , 1969, The SAGE Encyclopedia of Research Design.

[26]  Larry V. Hedges,et al.  The Effects of Small Classes on Academic Achievement: The Results of the Tennessee Class Size Experiment , 2000 .

[27]  Jessaca Spybrook,et al.  Optimal Design for Longitudinal and Multilevel Research: Documentation for the "Optimal Design" Software , 2006 .

[28]  J. Gill Hierarchical Linear Models , 2005 .

[29]  R. Kirk Experimental Design: Procedures for the Behavioral Sciences , 1970 .

[30]  Howard S. Bloom,et al.  Learning More from Social Experiments , 2006 .

[31]  Sharon L. Lohr,et al.  Sampling: Design and Analysis , 1999 .

[32]  Todd C. Headrick,et al.  ON OPTIMIZING MULTI‐LEVEL DESIGNS: POWER UNDER BUDGET CONSTRAINTS , 2005 .

[33]  D. Zucker Design and Analysis of Cluster Randomization Trials , 2003 .