Generalized linear mixed models with Gaussian mixture random effects: Inference and application

We propose a new class of generalized linear mixed models with Gaussian mixture random effects for clustered data. To overcome the weak identifiability issues, we fit the model using a penalized Expectation Maximization (EM) algorithm, and develop sequential locally restricted likelihood ratio tests to determine the number of components in the Gaussian mixture. Our work is motivated by an application to nationwide kidney transplant center evaluation in the United States, where the patient-level post-surgery outcomes are repeated measures of the care quality of the transplant centers. By taking into account patient-level risk factors and modeling the center effects by a finite Gaussian mixture model, the proposed model provides a convenient framework to study the heterogeneity among the transplant centers and controls the false discovery rate when screening for transplant centers with non-standard performance.

[1]  T. N. Sriram,et al.  Robust Estimation of Mixture Complexity , 2006 .

[2]  F. Liang,et al.  Estimating the false discovery rate using the stochastic approximation algorithm , 2008 .

[3]  María José Lombardía,et al.  Mixed generalized Akaike information criterion for small area models , 2017 .

[4]  N. Breslow,et al.  Bias Correction in Generalized Linear Mixed Models with Multiple Components of Dispersion , 1996 .

[5]  Jiahua Chen,et al.  Inference on the Order of a Normal Mixture , 2012 .

[6]  G. Molenberghs,et al.  Type I and Type II Error Under Random‐Effects Misspecification in Generalized Linear Mixed Models , 2007, Biometrics.

[7]  Geert Molenberghs,et al.  Type I and Type II Error Under Random‐Effects Misspecification in Generalized Linear Mixed Models , 2007, Biometrics.

[8]  Marie Davidian,et al.  A Monte Carlo EM algorithm for generalized linear mixed models with flexible random effects distribution. , 2002, Biostatistics.

[9]  J. Kiefer,et al.  CONSISTENCY OF THE MAXIMUM LIKELIHOOD ESTIMATOR IN THE PRESENCE OF INFINITELY MANY INCIDENTAL PARAMETERS , 1956 .

[10]  Wenguang Sun,et al.  Oracle and Adaptive Compound Decision Rules for False Discovery Rate Control , 2007 .

[11]  Kevin He,et al.  Evaluating hospital readmission rates in dialysis facilities; adjusting for hospital effects , 2013, Lifetime data analysis.

[12]  Jianhua Guo,et al.  Likelihood Ratio Test for Multi-Sample Mixture Model and Its Application to Genetic Imprinting , 2015 .

[13]  R. Hathaway A Constrained Formulation of Maximum-Likelihood Estimation for Normal Mixture Distributions , 1985 .

[14]  W. Spector,et al.  National release of the nursing home quality report cards: implications of statistical methodology for risk adjustment. , 2009, Health services research.

[15]  P. Deb Finite Mixture Models , 2008 .

[16]  Thomas A. Louis,et al.  Statistical Issues in Assessing Hospital Performance , 2012 .

[17]  J. Goeman,et al.  The Sequential Rejection Principle of Familywise Error Control , 2010, 1211.3313.

[18]  Harrison H. Zhou,et al.  False Discovery Rate Control With Groups , 2010, Journal of the American Statistical Association.

[19]  Brian Caffo,et al.  Flexible random intercept models for binary outcomes using mixtures of normals , 2007, Comput. Stat. Data Anal..

[20]  Lancelot F. James,et al.  Bayesian Model Selection in Finite Mixtures by Marginal Density Decompositions , 2001 .

[21]  C. McCulloch,et al.  Misspecifying the Shape of a Random Effects Distribution: Why Getting It Wrong May Not Matter , 2011, 1201.1980.

[22]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[23]  Jiahua Chen,et al.  INFERENCE FOR NORMAL MIXTURES IN MEAN AND VARIANCE , 2008 .

[24]  Nhat Ho,et al.  Convergence rates of parameter estimation for some weakly identifiable finite mixtures , 2016 .

[25]  Jiahua Chen,et al.  Hypothesis test for normal mixture models: The EM approach , 2009, 0908.3428.

[26]  J. Booth,et al.  Maximizing generalized linear mixed model likelihoods with an automated Monte Carlo EM algorithm , 1999 .

[27]  Jiahua Chen Optimal Rate of Convergence for Finite Mixture Models , 1995 .

[28]  B. Efron Large-Scale Simultaneous Hypothesis Testing , 2004 .

[29]  John T. Ormerod,et al.  On generalized degrees of freedom with application in linear mixed models selection , 2016, Stat. Comput..

[30]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[31]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[32]  Yehua Li,et al.  Joint Modeling and Clustering Paired Generalized Longitudinal Trajectories With Application to Cocaine Abuse Treatment Data , 2014 .

[33]  H. Kasahara,et al.  Testing the Number of Components in Normal Mixture Regression Models , 2015 .

[34]  Wenguang Sun,et al.  False discovery control in large‐scale spatial multiple testing , 2015, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[35]  S. Normand,et al.  An Administrative Claims Model Suitable for Profiling Hospital Performance Based on 30-Day Mortality Rates Among Patients With an Acute Myocardial Infarction , 2006, Circulation.