robustlmm : An R Package for Robust Estimation of Linear Mixed-Effects Models

As any real-life data, data modeled by linear mixed-effects models often contain outliers or other contamination. Even little contamination can drive the classic estimates far away from what they would be without the contamination. At the same time, datasets that require mixed-effects modeling are often complex and large. This makes it difficult to spot contamination. Robust estimation methods aim to solve both problems: to provide estimates where contamination has only little influence and to detect and flag contamination. We introduce an R package, robustlmm, to robustly fit linear mixed-effects models. The package's functions and methods are designed to closely equal those offered by lme4, the R package that implements classic linear mixed-effects model estimation in R. The robust estimation method in robustlmm is based on the random effects contamination model and the central contamination model. Contamination can be detected at all levels of the data. The estimation method does not make any assumption on the data's grouping structure except that the model parameters are estimable. robustlmm supports hierarchical and non-hierarchical (e.g., crossed) grouping structures. The robustness of the estimates and their asymptotic efficiency is fully controlled through the function interface. Individual parts (e.g., fixed effects and variance components) can be tuned independently. In this tutorial, we show how to fit robust linear mixed-effects models using robustlmm, how to assess the model fit, how to detect outliers, and how to compare different fits.

[1]  Werner A. Stahel,et al.  Sharpening Wald-type inference in robust regression for small samples , 2011, Comput. Stat. Data Anal..

[2]  Marco Geraci,et al.  Linear Quantile Mixed Models: The lqmm Package for Laplace Quantile Regression , 2014 .

[3]  Maria-Pia Victoria-Feser,et al.  High-Breakdown Inference for Mixed Linear Models , 2006 .

[4]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[5]  R. Greensmith,et al.  Statistical Methods in Research and Production , 1973 .

[6]  D. Bates,et al.  Fitting Linear Mixed-Effects Models Using lme4 , 2014, 1406.5823.

[7]  Stefan Van Aelst,et al.  Fast and robust bootstrap , 2008, Stat. Methods Appl..

[8]  Inna Chervoneva,et al.  Constrained S-estimators for linear mixed effects models with covariance components. , 2011, Statistics in medicine.

[9]  W. Stahel Estimation of a covariance matrix with location: Asymptotic formulas and optimal B -robust estimators , 1987 .

[10]  Manuel Koller,et al.  Robust Estimation of Linear Mixed Models , 2013 .

[11]  Martin Maechler CRAN Task View: Robust Statistical Methods , 2020 .

[12]  Douglas M. Bates,et al.  lme4: Mixed-e↵ects modeling with R , 2010 .

[13]  V. Yohai,et al.  Robust Statistics: Theory and Methods , 2006 .

[14]  E. Demidenko,et al.  Mixed Models: Theory and Applications (Wiley Series in Probability and Statistics) , 2004 .

[15]  Kesar Singh,et al.  Breakdown theory for bootstrap quantiles , 1998 .

[16]  D. Ruppert Robust Statistics: The Approach Based on Influence Functions , 1987 .

[17]  Andreas Papritz Robust Geostatistical Analysis of Spatial Data , 2013 .