Efficiency and Robustness of Alternative Estimators for Two- and Three-level Models: The Case of NAEP

This article investigates the efficiency and robustness of alternative estimators of regression coefficients for three-level data. To study student achievement, researchers might formulate a standard regression model or a hierarchical model with a two- or three-level structure. Having chosen the model, the researchers might employ either a model-based or a robust estimator of the standard errors. A simulation study showed that, as expected, the hierarchical model analyses produced more efficient point estimates than did analyses that ignored the covariance structure in the data, even when the normality assumption was violated. When samples were fairly large, the three-level analyses produced sound standard errors. In contrast, single-level analysis yielded seriously biased standard errors for coefficients defined at level 3 and level 2; and two-level analysis yielded biased standard errors for coefficients defined at level 2. These biases in standard error estimates were largely corrected by robust variance estimation. Implications of the results for analyzing NAEP and other large-scale surveys such as the Early Childhood Longitudinal Study (ECLS) and the Third International Mathematics and Science Study (TIMSS) are discussed.

[1]  L. Kish,et al.  Inference from Complex Samples , 1974 .

[2]  S. Raudenbush,et al.  Chapter 10: Methodological Advances in Analyzing the Effects of Schools and Classrooms on Student Learning , 1988 .

[3]  Stephen W. Raudenbush,et al.  Reexamining, Reaffirming, and Improving Application of Hierarchical Models , 1995 .

[4]  Anthony S. Bryk,et al.  Methodological Advances in Analyzing the Effects of Schools and Classrooms on Student Learning , 1988 .

[5]  Yeow Meng Thum,et al.  Hierarchical Linear Models for Multivariate Outcomes , 1997 .

[6]  E. Muraki,et al.  Chapter 3: Scaling Procedures in NAEP , 1992 .

[7]  Peter Schmidt,et al.  The Theory and Practice of Econometrics , 1985 .

[8]  A. Satorra,et al.  Complex Sample Data in Structural Equation Modeling , 1995 .

[9]  D. Rubin,et al.  Estimation in Covariance Components Models , 1981 .

[10]  N. Laird Empirical Bayes methods for two-way contingency tables , 1978 .

[11]  Eugene G. Johnson Technical Report of the NAEP 1992 Trial State Assessment Program in Mathematics. , 1993 .

[12]  Stephen W. Raudenbush,et al.  Synthesizing Results from the Trial State Assessment , 1999 .

[13]  Chris J. Skinner,et al.  Analysis of complex surveys , 1991 .

[14]  Jan de Leeuw,et al.  Review of Five Multilevel Analysis Programs: BMDP-5V, GENMOD, HLM, ML3, VARCL , 1994 .

[15]  Raghu N. Kackar,et al.  Unbiasedness of two-stage estimation and prediction procedures for mixed linear models , 1981 .

[16]  Stephen W. Raudenbush,et al.  Inequality of Access to Educational Resources: A National Report Card for Eighth-Grade Math , 1998 .

[17]  Edgar F. Borgatta,et al.  Sociological Methodology, 1969. , 1969 .

[18]  B. Leroux,et al.  Efficiency of regression estimates for clustered data. , 1996, Biometrics.

[19]  M. Seltzer,et al.  Sensitivity Analysis for Fixed Effects in the Hierarchical Model: A Gibbs Sampling Approach , 1993 .

[20]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[21]  A. Scott,et al.  The Effect of Two-Stage Sampling on Ordinary Least Squares Methods , 1982 .

[22]  J. Ware,et al.  Random-effects models for longitudinal data. , 1982, Biometrics.

[23]  P. Albert,et al.  Models for longitudinal data: a generalized estimating equation approach. , 1988, Biometrics.

[24]  P. J. Huber The behavior of maximum likelihood estimates under nonstandard conditions , 1967 .

[25]  H. Goldstein Multilevel Statistical Models , 2006 .