A Hybrid Bayesian Laplacian Approach for Generalized Linear Mixed Models

The analytical intractability of generalized linear mixed models (GLMMs) has generated a lot of research in the past two decades. Applied statisticians routinely face the frustrating prospect of widely disparate results produced by the methods that are currently implemented in commercially available software. This article is motivated by this frustration and develops guidance as well as new methods that are computationally efficient and statistically reliable. Two main classes of approximations have been developed: likelihood-based methods and Bayesian methods. Likelihood-based methods such as the penalized quasi-likelihood approach of Breslow and Clayton (1993) have been shown to produce biased estimates especially for binary clustered data with small clusters sizes. More recent methods such as the adaptive Gaussian quadrature approach perform well but can be overwhelmed by problems with large numbers of random effects, and efficient algorithms to better handle these situations have not yet been integrated in standard statistical packages. Similarly, Bayesian methods, though they have good frequentist properties when the model is correct, are known to be computationally intensive and also require specialized code, limiting their use in practice. In this article we build on our previous method (Capanu and Begg 2010) and propose a hybrid approach that provides a bridge between the likelihood-based and Bayesian approaches by employing Bayesian estimation for the variance components followed by Laplacian estimation for the regression coefficients with the goal of obtaining good statistical properties, with relatively good computing speed, and using widely available software. The hybrid approach is shown to perform well against the other competitors considered. Another important finding of this research is the surprisingly good performance of the Laplacian approximation in the difficult case of binary clustered data with small clusters sizes. We apply the methods to a real study of head and neck squamous cell carcinoma and illustrate their properties using simulations based on a widely-analyzed salamander mating dataset and on another important dataset involving the Guatemalan Child Health survey. A hybrid Bayesian Laplacian approach for generalized linear mixed models MARINELA CAPANU, MITHAT G ONEN, COLIN B. BEGG Memorial Sloan-Kettering Cancer Center

[1]  N. Breslow,et al.  Bias Correction in Generalized Linear Mixed Models with Multiple Components of Dispersion , 1996 .

[2]  J. Booth,et al.  Maximizing generalized linear mixed model likelihoods with an automated Monte Carlo EM algorithm , 1999 .

[3]  Liang Zhu,et al.  On fitting generalized linear mixed‐effects models for binary responses using different statistical packages , 2011, Statistics in medicine.

[4]  R. Wolfinger,et al.  Generalized linear mixed models a pseudo-likelihood approach , 1993 .

[5]  Chuhsing Kate Hsiao,et al.  Computation of reference Bayesian inference for variance components in longitudinal studies , 2008, Comput. Stat..

[6]  Duncan C Thomas,et al.  The use of hierarchical models for estimating relative risks of individual genetic variants: An application to a study of melanoma , 2008, Statistics in medicine.

[7]  Mithat Gonen,et al.  Diagnostic and Prognostic Value of [18F]Fluorodeoxyglucose Positron Emission Tomography for Recurrent Head and Neck Squamous Cell Carcinoma , 2002 .

[8]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[9]  P. McCullagh,et al.  Generalized Linear Models , 1992 .

[10]  E. Demidenko,et al.  Mixed Models: Theory and Applications (Wiley Series in Probability and Statistics) , 2004 .

[11]  N. Breslow,et al.  Bias correction in generalised linear mixed models with a single component of dispersion , 1995 .

[12]  J. Pinheiro,et al.  Efficient Laplacian and Adaptive Gaussian Quadrature Algorithms for Multilevel Generalized Linear Mixed Models , 2006 .

[13]  M. Karim Generalized Linear Models With Random Effects , 1991 .

[14]  Scott L. Zeger,et al.  Generalized linear models with random e ects: a Gibbs sampling approach , 1991 .

[15]  Peter McCullagh,et al.  Laplace Approximation of High Dimensional Integrals , 1995 .

[16]  Dani Gamerman,et al.  Sampling from the posterior distribution in generalized linear mixed models , 1997, Stat. Comput..

[17]  Noreen Goldman,et al.  An assessment of estimation procedures for multilevel models with binary responses , 1995 .

[18]  D. Bates,et al.  Approximations to the Log-Likelihood Function in the Nonlinear Mixed-Effects Model , 1995 .

[19]  Anthony Y. C. Kuk,et al.  Monte Carlo approximation through Gibbs output in generalized linear mixed models , 2005 .

[20]  Colin B Begg,et al.  Hierarchical Modeling for Estimating Relative Risks of Rare Genetic Variants: Properties of the Pseudo‐Likelihood Method , 2011, Biometrics.

[21]  S. Raudenbush,et al.  Maximum Likelihood for Generalized Linear Models with Nested Random Effects via High-Order, Multivariate Laplace Approximation , 2000 .

[22]  H. Rue,et al.  Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations , 2009 .

[23]  P. McCullagh,et al.  Generalized Linear Models , 1984 .

[24]  S L Zeger,et al.  Generalized linear models with random effects; salamander mating revisited. , 1992, Biometrics.

[25]  Harry Joe,et al.  Accuracy of Laplace approximation for discrete response mixed models , 2008, Comput. Stat. Data Anal..

[26]  Donald Hedeker,et al.  Longitudinal Data Analysis , 2006 .