fitting linear mixed effects models using lme 4 arxiv

Maximum likelihood or restricted maximum likelihood (REML) estimates of the parameters in linear mixed-effects models can be determined using the lmer function in the lme4 package for R. As for most model-fitting functions in R, the model is described in an lmer call by a formula, in this case including both fixedand random-effects terms. The formula and data together determine a numerical representation of the model from which the profiled deviance or the profiled REML criterion can be evaluated as a function of some of the model parameters. The appropriate criterion is optimized, using one of the constrained optimization functions in R, to provide the parameter estimates. We describe the structure of the model, the steps in evaluating the profiled deviance or REML criterion, and the structure of classes or types that represents such a model. Sufficient detail is included to allow specialization of these structures by users who wish to write functions to fit specialized linear mixed models, such as models incorporating pedigrees or smoothing splines, that are not easily expressible in the formula language used by lmer.

[1]  F. E. Satterthwaite An approximate distribution of estimates of variance components. , 1946, Biometrics.

[2]  B. Efron,et al.  Stein's Paradox in Statistics , 1977 .

[3]  J. Ware,et al.  Random-effects models for longitudinal data. , 1982, Biometrics.

[4]  C. R. Henderson,et al.  Analysis of covariance in the mixed model: higher-level, nonhomogeneous, and random regressions. , 1982, Biometrics.

[5]  S. Weisberg,et al.  Residuals and Influence in Regression , 1982 .

[6]  G. Milliken Nonlinear Regression Analysis and Its Applications , 1990 .

[7]  R. Kass Nonlinear Regression Analysis and its Applications , 1990 .

[8]  Douglas M. Bates,et al.  Unconstrained parametrizations for variance-covariance matrices , 1996, Stat. Comput..

[9]  Anthony C. Davison,et al.  Bootstrap Methods and Their Application , 1998 .

[10]  M. Kenward,et al.  Small sample inference for fixed effects from restricted maximum likelihood. , 1997, Biometrics.

[11]  V. Carey,et al.  Mixed-Effects Models in S and S-Plus , 2001 .

[12]  D. Sengupta Linear models , 2003 .

[13]  Maria L. Thomas,et al.  Patterns of performance degradation and restoration during sleep restriction and subsequent recovery: a sleep dose‐response study , 2003, Journal of sleep research.

[14]  Douglas M. Bates,et al.  Linear mixed models and penalized least squares , 2004 .

[15]  F. Vaida,et al.  Conditional Akaike information for mixed-effects models , 2005 .

[16]  A. Gelman Analysis of variance: Why it is more important than ever? , 2005, math/0504499.

[17]  R. Horn,et al.  Basic Properties of the Schur Complement , 2005 .

[18]  Andrew Gelman,et al.  Data Analysis Using Regression and Multilevel/Hierarchical Models , 2006 .

[19]  Timothy A. Davis,et al.  Direct methods for sparse linear systems , 2006, Fundamentals of algorithms.

[20]  Douglas M. Bates,et al.  Estimating the Multilevel Rasch Model: With the lme4 Package , 2007 .

[21]  Gene H. Golub,et al.  The differentiation of pseudo-inverses and non-linear least squares problems whose variables separate , 1972, Milestones in Matrix Computation.

[22]  Gabor Grothendieck,et al.  Lattice: Multivariate Data Visualization with R , 2008 .

[23]  YANQING CHEN,et al.  Algorithm 8 xx : CHOLMOD , supernodal sparse Cholesky factorization and update / downdate ∗ , 2006 .

[24]  M. Powell The BOBYQA algorithm for bound constrained optimization without derivatives , 2009 .

[25]  M. Grotenhuis,et al.  Influence.ME: Tools for detecting influential data in mixed models. , 2009 .

[26]  John K Kruschke,et al.  Bayesian data analysis. , 2010, Wiley interdisciplinary reviews. Cognitive science.

[27]  John C. Nash,et al.  Unifying Optimization Algorithms to Aid Software System Users: optimx for R , 2011 .

[28]  Alan Edelman,et al.  Julia: A Fast Dynamic Language for Technical Computing , 2012, ArXiv.

[29]  Kyle Klein,et al.  Nelder-Mead Simplex Optimization Routine for Large-Scale Problems: A Distributed Memory Implementation , 2013 .

[30]  Sophia Rabe-Hesketh,et al.  A Nondegenerate Penalized Likelihood Estimator for Variance Parameters in Multilevel Models , 2013, Psychometrika.

[31]  John C. Nash,et al.  Strategies for fitting nonlinear ecological models in R, AD Model Builder, and BUGS , 2013 .

[32]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[33]  Adam Loy,et al.  HLMdiag: A Suite of Diagnostics for Hierarchical Linear Models in R , 2014 .

[34]  D. Bates,et al.  Linear Mixed-Effects Models using 'Eigen' and S4 , 2015 .

[35]  C. R. Rao,et al.  SOLUTIONS TO SOME FUNCTIONAL EQUATIONS AND THEIR APPLICATIONS TO CHARACTERIZATION OF PROBABILITY DISTRIBUTIONS , 2016 .