Adjusting a Significance Test for Clustering in Designs With Two Levels of Nesting

A common mistake in the analysis of cluster randomized experiments is to ignore the effect of clustering and analyze the data as if each treatment group were a simple random sample. This typically overstates the precision of the results and leads to anticonservative conclusions about the statistical significance of treatment effects. This article gives a simple adjustment to the t statistic that would be computed if clustering were (incorrectly) ignored in an experiment with two levels of nesting (e.g., classrooms and schools) where treatment assignment is made at the highest (e.g., school) level. The adjustment is a multiplicative factor that depends on the number of clusters and subclusters, the cluster and subcluster sample sizes, and the cluster and subcluster intraclass correlations ρ_S and ρ_C. The adjusted t statistic has Student's t distribution with reduced degrees of freedom. The adjusted statistic reduces to the t statistic computed by ignoring clustering when ρ_S = ρ_C = 0, and to the t statistic computed using cluster means when ρ_S = 1. If ρ_S and ρ_C are between 0 and 1, the adjusted t statistic lies between these two extremes, and so do its degrees of freedom.
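To give a sense of how much precision can be overstated when clustering is ignored, the sketch below computes the standard variance inflation factor (design effect) for a treatment-group mean in a balanced design with two levels of nesting. It is not the article's adjustment factor, which also corrects the pooled variance estimate and the degrees of freedom. The names `design_effect`, `n` (students per classroom), and `p` (classrooms per school) are assumptions introduced for illustration, as is the reading of ρ_S and ρ_C as the shares of total variance lying between schools and between classrooms within schools.

```python
import math


def design_effect(n, p, rho_s, rho_c):
    """Variance inflation of a group mean relative to simple random sampling.

    Assumes a balanced design: p classrooms per school, n students per
    classroom, rho_s = school-level ICC, rho_c = classroom-level ICC
    (each a share of the total variance, so rho_s + rho_c <= 1).
    """
    if rho_s < 0 or rho_c < 0 or rho_s + rho_c > 1:
        raise ValueError("ICCs must be nonnegative and sum to at most 1")
    # Var(mean) = sigma_T^2 / (m*p*n) * [1 + (p*n - 1)*rho_s + (n - 1)*rho_c]
    return 1.0 + (p * n - 1) * rho_s + (n - 1) * rho_c


# Hypothetical values: 3 classrooms of 20 students per school.
deff = design_effect(n=20, p=3, rho_s=0.15, rho_c=0.05)
se_ratio = math.sqrt(deff)  # true SE of the mean / SE assuming independence
print(f"design effect = {deff:.2f}, SE understated by a factor of {se_ratio:.2f}")
```

With both intraclass correlations equal to zero the factor is 1, consistent with the statement that the adjusted statistic then coincides with the unadjusted one.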
