Trimmed Constrained Mixed Effects Models: Formulations and Algorithms

Abstract Mixed effects (ME) models inform a vast array of problems in the physical and social sciences, and are pervasive in meta-analysis. We consider ME models where the random effects component is linear. We then develop an efficient approach for a broad problem class that allows nonlinear measurements, priors, and constraints, and finds robust estimates in all of these cases using trimming in the associated marginal likelihood. The software accompanying this article is disseminated as an open-source Python package called LimeTr. LimeTr is able to recover results more accurately in the presence of outliers compared to available packages for both standard longitudinal analysis and meta-analysis, and is also more computationally efficient than competing robust alternatives. Supplementary materials that reproduce the simulations, as well as run LimeTr and third party code are available online. We also present analyses of global health data, where we use advanced functionality of LimeTr, including constraints to impose monotonicity and concavity for dose–response relationships. Nonlinear observation models allow new analyses in place of classic approximations, such as log-linear models. Robust extensions in all analyses ensure that spurious data points do not drive our understanding of either mean relationships or between-study heterogeneity.

[1]  P. Rousseeuw Multivariate estimation with high breakdown point , 1985 .

[2]  Bruce D. Spencer,et al.  The Delta Method , 1980 .

[3]  Ying Nian Wu,et al.  Efficient Algorithms for Robust Estimation in Linear Mixed-Effects Models Using the Multivariate t Distribution , 2001 .

[4]  Aaron Christ,et al.  Mixed Effects Models and Extensions in Ecology with R , 2009 .

[5]  Daniel Gianola,et al.  Robust Linear Mixed Models with Normal/Independent Distributions and Bayesian MCMC Implementation , 2003 .

[6]  David A. Harville,et al.  Linear Models and the Relevant Distributions and Matrix Algebra , 2018 .

[7]  Benjamin Pfaff,et al.  Perturbation Analysis Of Optimization Problems , 2016 .

[8]  J. Freidman,et al.  Multivariate adaptive regression splines , 1991 .

[9]  Evan Mayo-Wilson,et al.  Vitamin A supplementation for preventing morbidity and mortality in children from six months to five years of age. , 2017, The Cochrane database of systematic reviews.

[10]  D. Bates,et al.  Fitting Linear Mixed-Effects Models Using lme4 , 2014, 1406.5823.

[11]  N. Laird,et al.  Meta-analysis in clinical trials. , 1986, Controlled clinical trials.

[12]  Manuel Koller,et al.  robustlmm : An R Package for Robust Estimation of Linear Mixed-Effects Models , 2016 .

[13]  Peter Boyle,et al.  Tobacco smoking and cancer: A meta‐analysis , 2008, International journal of cancer.

[14]  Benar Fux Svaiter,et al.  Convergence of descent methods for semi-algebraic and tame problems: proximal algorithms, forward–backward splitting, and regularized Gauss–Seidel methods , 2013, Math. Program..

[15]  B. D. Craven,et al.  Non-Linear Parametric Optimization (B. Bank, J. Guddat, D. Klatte, B. Kummer and K. Tammer) , 1984 .

[16]  Gene H. Golub,et al.  The differentiation of pseudo-inverses and non-linear least squares problems whose variables separate , 1972, Milestones in Matrix Computation.

[17]  Christophe Croux,et al.  Sparse least trimmed squares regression for analyzing high-dimensional large data sets , 2013, 1304.4773.

[18]  Simon N. Wood,et al.  Shape constrained additive models , 2015, Stat. Comput..

[19]  J. Ware,et al.  Random-effects models for longitudinal data. , 1982, Biometrics.

[20]  Eunho Yang,et al.  High-Dimensional Trimmed Estimators: A General Framework for Robust Structured Estimation , 2016, 1605.08299.

[21]  J. Friedman Multivariate adaptive regression splines , 1990 .

[22]  Lorenz T. Biegler,et al.  On the implementation of an interior-point filter line-search algorithm for large-scale nonlinear programming , 2006, Math. Program..

[23]  Jill P Pell,et al.  Red and processed meat consumption and breast cancer: UK Biobank cohort study and meta-analysis. , 2018, European journal of cancer.

[24]  Aleksandr Y. Aravkin,et al.  Trimmed Statistical Estimation via Variance Reduction , 2020, Math. Oper. Res..

[25]  Christine H. Müller,et al.  Breakdown Point and Computation of Trimmed Likelihood Estimators in Generalized Linear Models , 2003 .

[26]  Eunho Yang,et al.  Robust Gaussian Graphical Modeling with the Trimmed Graphical Lasso , 2015, NIPS.

[27]  G. Golub,et al.  Separable nonlinear least squares: the variable projection method and its applications , 2003 .

[28]  Aleksandr Y. Aravkin,et al.  Estimating nuisance parameters in inverse problems , 2012, 1206.6532.

[29]  M. Kenward,et al.  An Introduction to the Bootstrap , 2007 .

[30]  Dmitriy Drusvyatskiy,et al.  Efficient Quadratic Penalization Through the Partial Minimization Technique , 2018, IEEE Transactions on Automatic Control.

[31]  B. Bank,et al.  Non-Linear Parametric Optimization , 1983 .

[32]  PETER J. ROUSSEEUW,et al.  Computing LTS Regression for Large Data Sets , 2005, Data Mining and Knowledge Discovery.

[33]  David M. Mount,et al.  On the Least Trimmed Squares Estimator , 2012, Algorithmica.

[34]  P. Rousseeuw,et al.  Alternatives to the Median Absolute Deviation , 1993 .

[35]  P. Lee,et al.  Systematic review with meta-analysis of the epidemiological evidence in the 1900s relating smoking to lung cancer , 2012, BMC Cancer.