Conclusions beyond support: overconfident estimates in mixed models

Mixed-effect models are frequently used to control for the nonindependence of data points, for example, when repeated measures from the same individuals are available. The aim of these models is often to estimate fixed effects and to test their significance. This is usually done by including random intercepts, that is, intercepts that are allowed to vary between individuals. The widespread belief is that this controls for all types of pseudoreplication within individuals. Here we show that this is not the case, if the aim is to estimate effects that vary within individuals and individuals differ in their response to these effects. In these cases, random intercept models give overconfident estimates leading to conclusions that are not supported by the data. By allowing individuals to differ in the slopes of their responses, it is possible to account for the nonindependence of data points that pseudoreplicate slope information. Such random slope models give appropriate standard errors and are easily implemented in standard statistical software. Because random slope models are not always used where they are essential, we suspect that many published findings have too narrow confidence intervals and a substantially inflated type I error rate. Besides reducing type I errors, random slope models have the potential to reduce residual variance by accounting for between-individual variation in slopes, which makes it easier to detect treatment effects that are applied between individuals, hence reducing type II errors as well.

[1]  William N. Venables,et al.  Modern Applied Statistics with S , 2010 .

[2]  Joseph Hilbe,et al.  Data Analysis Using Regression and Multilevel/Hierarchical Models , 2009 .

[3]  H. Schielzeth,et al.  Compensatory investment in zebra finches: females lay larger eggs when paired to sexually unattractive males , 2009, Proceedings of the Royal Society B: Biological Sciences.

[4]  Charles E. Heckler,et al.  Introduction to Mixed Modelling. Beyond Regression and Analysis of Variance , 2008, Technometrics.

[5]  A. Gelman Scaling regression inputs by dividing by two standard deviations , 2008, Statistics in medicine.

[6]  H. Schwabl,et al.  Sex-specific effects of yolk-androgens on growth of nestling American kestrels , 2008, Behavioral Ecology and Sociobiology.

[7]  D. Hasselquist,et al.  Transgenerational priming of immunity: maternal exposure to a bacterial antigen enhances offspring humoral immunity , 2006, Proceedings of the Royal Society B: Biological Sciences.

[8]  Julian J. Faraway,et al.  Extending the Linear Model with R , 2004 .

[9]  L. R. Schaeffer,et al.  Application of random regression models in animal breeding , 2004 .

[10]  J. Singer,et al.  Applied Longitudinal Data Analysis , 2003 .

[11]  Eric M. Blalock,et al.  Experimental Design and Data Analysis , 2003 .

[12]  G. Quinn,et al.  Experimental Design and Data Analysis for Biologists , 2002 .

[13]  V. Carey,et al.  Mixed-Effects Models in S and S-Plus , 2001 .

[14]  Roland P. Carpenter,et al.  Experimental Design and Data Analysis , 2000 .

[15]  Roel Bosker,et al.  Multilevel analysis : an introduction to basic and advanced multilevel modeling , 1999 .

[16]  Anthony S. Bryk,et al.  Hierarchical Linear Models: Applications and Data Analysis Methods , 1992 .

[17]  J. Ware,et al.  Random-effects models for longitudinal data. , 1982, Biometrics.