Robustness issues in multilevel regression analysis

A multilevel problem concerns a population with a hierarchical structure. A sample from such a population can be described as a multistage sample. First, a sample of higher level units is drawn (e.g. schools or organizations), and next a sample of the sub-units from the available units (e.g. pupils in schools or employees in organizations). In such samples, the individual observations are in general not completely independent. Multilevel analysis software accounts for this dependence and in recent years these programs have been widely accepted. Two problems that occur in the practice of multilevel modeling will be discussed. The first problem is the choice of the sample sizes at the different levels. What are sufficient sample sizes for accurate estimation? The second problem is the normality assumption of the level-2 error distribution. When one wants to conduct tests of significance, the errors need to be normally distributed. What happens when this is not the case? In this paper, simulation studies are used to answer both questions. With respect to the first question, the results show that a small sample size at level two (meaning a sample of 50 or less) leads to biased estimates of the second-level standard errors. The answer to the second question is that only the standard errors for the random effects at the second level are highly inaccurate if the distributional assumptions concerning the level-2 errors are not fulfilled. Robust standard errors turn out to be more reliable than the asymptotic standard errors based on maximum likelihood.

[1]  Jan de Leeuw,et al.  Introducing Multilevel Modeling , 1998 .

[2]  Yuk Fai Cheong,et al.  HLM 6: Hierarchical Linear and Nonlinear Modeling , 2000 .

[3]  Cora J. M. Maas,et al.  The Accuracy of Multilevel Structural Equation Modeling With Pseudobalanced Groups and Small Samples , 2001 .

[4]  P. J. Huber The behavior of maximum likelihood estimates under nonstandard conditions , 1967 .

[5]  Joop J. Hox,et al.  Multilevel modeling: When and why , 1998 .

[6]  H. White Maximum Likelihood Estimation of Misspecified Models , 1982 .

[7]  G. A. Marcoulides,et al.  Multilevel Analysis Techniques and Applications , 2002 .

[8]  J. Gill Hierarchical Linear Models , 2005 .

[9]  R. Mathar,et al.  Classification, Data Analysis, and Data Highways , 1998 .

[10]  Jan de Leeuw,et al.  Random Coefficient Models for Multilevel Analysis , 1986 .

[11]  Jacob Cohen Statistical Power Analysis for the Behavioral Sciences , 1969, The SAGE Encyclopedia of Research Design.

[12]  Roel Bosker,et al.  Multilevel analysis : an introduction to basic and advanced multilevel modeling , 1999 .

[13]  Scott Eliason,et al.  Maximum Likelihood Estimation , 1993 .

[14]  Timothy J. Robinson,et al.  Multilevel Analysis: Techniques and Applications , 2002 .

[15]  Marno Verbeek,et al.  A Guide to Modern Econometrics , 2000 .

[16]  Risto Lehtonen,et al.  Multilevel Statistical Models , 2005 .

[17]  William J. Browne,et al.  Implementation and performance issues in the Bayesian and likelihood fitting of multilevel models , 2000, Comput. Stat..