A scalable and efficient covariate selection criterion for mixed effects regression models with unknown random effects structure

We propose a new model selection criterion for mixed effects regression models that is computable even when the structure and the distribution of the random effects are unknown. The criterion is most useful in the early stage of the model building process when one needs to decide which covariates should be included in a mixed effects regression model but has no knowledge of the random effect structure. The calculation of the criterion requires only the evaluation of cluster-level log-likelihoods and does not rely on heavy numerical integration. We provide theoretical and numerical arguments to justify the method and we illustrate its usefulness by analyzing data on the consumption of alcohol by young American Indians.

[1]  Jorma Rissanen,et al.  The Minimum Description Length Principle in Coding and Modeling , 1998, IEEE Trans. Inf. Theory.

[2]  R. Shibata Approximate efficiency of a selection procedure for the number of regression variables , 1984 .

[3]  Jorma Rissanen,et al.  Stochastic Complexity in Statistical Inquiry , 1989, World Scientific Series in Computer Science.

[4]  Clifford M. Hurvich,et al.  Regression and time series model selection in small samples , 1989 .

[5]  F. Vaida,et al.  Conditional Akaike information for mixed-effects models , 2005 .

[6]  S. Greven,et al.  On the behaviour of marginal and conditional AIC in linear mixed models , 2010 .

[7]  Calyampudi R. Rao,et al.  A strongly consistent procedure for model selection in a regression problem , 1989 .

[8]  A S Whittemore,et al.  Methods for analyzing panel studies of acute health effects of air pollution. , 1979, Biometrics.

[9]  Radu V. Craiu,et al.  Conditional Logistic Regression With Longitudinal Follow-up and Individual-Level Random Coefficients: A Stable and Efficient Two-Step Estimation Method , 2011 .

[10]  Walter Zucchini,et al.  Model Selection , 2011, International Encyclopedia of Statistical Science.

[11]  J. Nelder,et al.  Hierarchical Generalized Linear Models , 1996 .

[12]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[13]  J. Ware,et al.  Random-effects models for serial observations with binary response. , 1984, Biometrics.

[14]  Ana Ivelisse Avilés,et al.  Linear Mixed Models for Longitudinal Data , 2001, Technometrics.

[15]  H. Lian A note on conditional Akaike information for Poisson regression with random effects , 2008, 0810.2010.

[16]  H. Akaike Statistical predictor identification , 1970 .

[17]  S. Müller,et al.  Model Selection in Linear Mixed Models , 2013, 1306.2427.

[18]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[19]  Runze Li,et al.  VARIABLE SELECTION IN LINEAR MIXED EFFECTS MODELS. , 2012, Annals of statistics.

[20]  T. Hyslop,et al.  A General Approach for Two‐Stage Analysis of Multilevel Clustered Non‐Gaussian Data , 2006, Biometrics.

[21]  Ronald L. Rivest,et al.  Inferring Decision Trees Using the Minimum Description Length Principle , 1989, Inf. Comput..

[22]  Anders Nielsen,et al.  TMB: Automatic Differentiation and Laplace Approximation , 2015, 1509.00660.

[23]  D. Goldstein Statistics and science : a Festschrift for Terry Speed , 2003 .

[24]  Maud Delattre,et al.  A note on BIC in mixed-effects models , 2014 .

[25]  Thomas C. M. Lee,et al.  An Introduction to Coding Theory and the Two‐Part Minimum Description Length Principle , 2001 .

[26]  S. Greven,et al.  A unifying approach to the estimation of the conditional Akaike information in generalized linear mixed models , 2014 .

[27]  Bin Yu,et al.  Minimum Description Length Model Selection Criteria for Generalized Linear Models , 2003 .

[28]  Thomas C.M. Lee A Minimum Description Length-Based Image Segmentation Procedure, and its Comparison with a Cross-Validation-Based Segmentation Procedure , 2000 .

[29]  Jiming Jiang Linear and Generalized Linear Mixed Models and Their Applications , 2007 .

[30]  Xinyu Zhang,et al.  Information based model selection criteria for generalized linear mixed models with unknown variance component parameters , 2013, J. Multivar. Anal..

[31]  J. S. Rao,et al.  Fence methods for mixed model selection , 2008, 0808.0985.

[32]  L. Fahrmeir,et al.  Correction: Consistency and Asymptotic Normality of the Maximum Likelihood Estimator in Generalized Linear Models , 1985 .

[33]  F. Yao,et al.  MINIMUM DESCRIPTION LENGTH PRINCIPLE FOR LINEAR MIXED EFFECTS MODELS , 2014 .

[34]  Bin Yu,et al.  Model Selection and the Principle of Minimum Description Length , 2001 .

[35]  Thomas C. M. Lee,et al.  Model Selection for the Competing-Risks Model With and Without Masking , 2005, Technometrics.

[36]  David R. Anderson,et al.  Model Selection and Inference: A Practical Information-Theoretic Approach , 2001 .

[37]  J. Cavanaugh Unifying the derivations for the Akaike and corrected Akaike information criteria , 1997 .

[38]  Geert Molenberghs,et al.  A pairwise likelihood approach to estimation in multilevel probit models , 2004, Comput. Stat. Data Anal..

[39]  D. Bates,et al.  Fitting Linear Mixed-Effects Models Using lme4 , 2014, 1406.5823.

[40]  Hua Liang,et al.  A Note on Conditional AIC for Linear Mixed-Effects Models. , 2008, Biometrika.

[41]  Geert Molenberghs,et al.  Pseudo-likelihood methodology for partitioned large and complex samples , 2011 .

[42]  R. Nishii Asymptotic Properties of Criteria for Selection of Variables in Multiple Regression , 1984 .

[43]  F. Vaida,et al.  Conditional Akaike information under generalized linear and proportional hazards mixed models. , 2011, Biometrika.

[44]  Bingqing Lin,et al.  Fixed and Random Effects Selection by REML and Pathwise Coordinate Optimization , 2013, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America.

[45]  J. Ibrahim,et al.  Fixed and Random Effects Selection in Mixed Effects Models , 2011, Biometrics.