Robust Analysis of Generalized Linear Mixed Models

The method of maximum likelihood (ML) is widely used for analyzing generalized linear mixed models (GLMM's). A full maximum likelihood analysis requires numerical integration techniques for calculation of the log-likelihood, and to avoid the computational problems involving irreducibly high-dimensional integrals, several maximum likelihood algorithms have been proposed in the literature to estimate the model parameters by approximating the log-likelihood function. Although these likelihood algorithms are useful for fitting the GLMM's efficiently under strict model assumptions, they can be highly influenced by the presence of unusual data points. In this article, the author develops a technique for finding robust maximum likelihood (RML) estimates of the model parameters in GLMM's, which appears to be useful in downweighting the influential data points when estimating the parameters. The asymptotic properties of the robust estimators are investigated under some regularity conditions. Small simulations are carried out to study the behavior of the robust estimates in the presence of outliers, and these estimates are also compared to the ordinary classical estimates. To avoid the computational problems involving high-dimensional integrals, the author proposes a robust Monte Carlo Newton–Raphson (RMCNR) algorithm for fitting GLMM's. The proposed robust method is illustrated in an analysis of data from a clinical experiment described in a biometrical journal.

[1]  D. Ruppert,et al.  Optimally bounded score functions for generalized linear models with applications to logistic regression , 1986 .

[2]  E. Ronchetti,et al.  Robust Inference for Generalized Linear Models , 2001 .

[3]  P. Albert,et al.  Models for longitudinal data: a generalized estimating equation approach. , 1988, Biometrics.

[4]  C. McCulloch Maximum Likelihood Variance Components Estimation for Binary Data , 1994 .

[5]  D. Pregibon Resistant fits for some commonly used logistic models with medical application. , 1982, Biometrics.

[6]  Martin Crowder,et al.  On the use of a working correlation matrix in using generalised linear models for repeated measures , 1995 .

[7]  H. White,et al.  Misspecified models with dependent observations , 1982 .

[8]  J. Ware,et al.  Random-effects models for serial observations with binary response. , 1984, Biometrics.

[9]  Kalyan Das,et al.  Miscellanea. On the efficiency of regression estimators in generalised linear models for longitudinal data , 1999 .

[10]  T. Wet,et al.  Mallows-Type Bounded-Influence-Regression Trimmed Means , 1988 .

[11]  R. Carroll,et al.  Conditionally Unbiased Bounded-Influence Estimation in General Regression Models, with Applications to Generalized Linear Models , 1989 .

[12]  M. Tanner Tools for statistical inference: methods for the exploration of posterior distributions and likeliho , 1994 .

[13]  Werner A. Stahel,et al.  Robust Statistics: The Approach Based on Influence Functions , 1987 .

[14]  Dibyen Majumdar,et al.  Conditional Second-Order Generalized Estimating Equations for Generalized Linear and Nonlinear Mixed-Effects Models , 2002 .

[15]  S. Morgenthaler Least-Absolute-Deviations Fits for Generalized Linear Models , 1992 .

[16]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[17]  R. Carroll,et al.  On Robustness in the Logistic Regression Model , 1993 .

[18]  P. Diggle Analysis of Longitudinal Data , 1995 .

[19]  J. Besag,et al.  Bayesian image restoration, with two applications in spatial statistics , 1991 .

[20]  Geert Molenberghs,et al.  Pseudo-likelihood inference , 2002 .

[21]  P. Rousseeuw,et al.  Unmasking Multivariate Outliers and Leverage Points , 1990 .

[22]  H. White Asymptotic theory for econometricians , 1985 .

[23]  M. A. Tanner,et al.  Tools for Statistical Inference: Methods for the Exploration of Posterior Distributions and Likelihood Functions, 3rd Edition , 1998 .

[24]  G. Fitzmaurice,et al.  A caveat concerning independence estimating equations with multivariate binary data. , 1995, Biometrics.

[25]  J S Preisser,et al.  Robust Regression for Clustered Data with Application to Binary Responses , 1999, Biometrics.

[26]  ON PSEUDO-LIKELIHOOD INFERENCE IN THE BINARY LONGITUDINAL MIXED MODEL , 2002 .

[27]  Scott L. Zeger,et al.  Generalized linear models with random e ects: a Gibbs sampling approach , 1991 .