论文信息 - Modeling Certainty with Clustered Data: A Comparison of Methods

Modeling Certainty with Clustered Data: A Comparison of Methods

Political scientists often analyze data in which the observational units are clustered into politically or socially meaningful groups with an interest in estimating the effects that group-level factors have on individual-level behavior. Even in the presence of low levels of intracluster correlation, it is well known among statisticians that ignoring the clustered nature of such data overstates the precision estimates for group-level effects. Although a number of methods that account for clustering are available, their precision estimates are poorly understood, making it difficult for researchers to choose among approaches. In this paper, we explicate and compare commonly used methods (clustered robust standard errors (SEs), random effects, hierarchical linear model, and aggregated ordinary least squares) of estimating the SEs for group-level effects. We demonstrate analytically and with the help of empirical examples that under ideal conditions there is no meaningful difference in the SEs generated by these methods. We conclude with advice on the ways in which analysts can increase the efficiency of clustered designs.

Kevin Arceneaux | David W. Nickerson | Kevin Arceneaux

[1] Kosuke Imai,et al. Survey Sampling , 1998, Nov/Dec 2017.

[2] Jan E. Leighley,et al. Political Parties and Class Mobilization in Contemporary United States Elections , 1996 .

[3] Donald P. Green,et al. Analysis of Cluster-Randomized Experiments: A Comparison of Alternative Estimation Approaches , 2007, Political Analysis.

[4] R. Rumberger. Hierarchical linear models: Applications and data analysis methods: and. Newbury Park, CA: Sage, 1992. (ISBN 0-8039-4627-9), pp. xvi + 265. Price: U.S. $45.00 (cloth) , 1997 .

[5] Allan Donner,et al. Design and Analysis of Cluster Randomization Trials in Health Research , 2001 .

[6] Jan Kmenta,et al. Elements of Econometrics: Second Edition , 1997 .

[7] Jan E. Leighley,et al. Party Ideology, Organization, and Competitiveness as Mobilizing Forces in Gubernatorial Elections , 1993 .

[8] Bradford S. Jones,et al. Modeling Multilevel Data Structures , 2002 .

[9] Sophia Rabe-Hesketh,et al. Multilevel and Longitudinal Modeling Using Stata , 2005 .

[10] Jake Bowers,et al. Designing multi-level studies: sampling voters and electoral contexts , 2002 .

[11] D. Green,et al. Dirty Pool , 2001, International Organization.