Efficient direct sampling MCEM algorithm for latent variable models with binary responses

While latent variable models have been successfully applied in many fields and underpin various modeling techniques, their ability to incorporate categorical responses is hindered due to the lack of accurate and efficient estimation methods. Approximation procedures, such as penalized quasi-likelihood, are computationally efficient, but the resulting estimators can be seriously biased for binary responses. Gauss-Hermite quadrature and Markov Chain Monte Carlo (MCMC) integration based methods can yield more accurate estimation, but they are computationally much more intensive. Estimation methods that can achieve both computational efficiency and estimation accuracy are still under development. This paper proposes an efficient direct sampling based Monte Carlo EM algorithm (DSMCEM) for latent variable models with binary responses. Mixed effects and item factor analysis models with binary responses are used to illustrate this algorithm. Results from two simulation studies and a real data example suggest that, as compared with MCMC based EM, DSMCEM can significantly improve computational efficiency as well as produce equally accurate parameter estimates. Other aspects and extensions of the algorithm are discussed.

[1]  Xiao-Li Meng,et al.  Two slice-EM algorithms for fitting generalized linear mixed models with binary response , 2005 .

[2]  M. Knott,et al.  Generalized latent trait models , 2000 .

[3]  S. Rabe-Hesketh,et al.  Reliable Estimation of Generalized Linear Mixed Models using Adaptive Quadrature , 2002 .

[4]  D. V. Dyk NESTING EM ALGORITHMS FOR COMPUTATIONAL EFFICIENCY , 2000 .

[5]  S Y Lee,et al.  Latent variable models with mixed continuous and polytomous data , 2001, Biometrics.

[6]  Xiao-Li Meng,et al.  Fitting Full-Information Item Factor Models and an Empirical Investigation of Bridge Sampling , 1996 .

[7]  Sandip Sinharay,et al.  Experiences With Markov Chain Monte Carlo Convergence Assessment in Two Psychometric Examples , 2004 .

[8]  D. Bartholomew Latent Variable Models And Factor Analysis , 1987 .

[9]  Bradley P. Carlin,et al.  Markov Chain Monte Carlo conver-gence diagnostics: a comparative review , 1996 .

[10]  G. Verbeke,et al.  Statistical inference in generalized linear mixed models: a review. , 2006, The British journal of mathematical and statistical psychology.

[11]  Andrew Gelman,et al.  General methods for monitoring convergence of iterative simulations , 1998 .

[12]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[13]  R. D. Bock,et al.  Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm , 1981 .

[14]  A. Kuk,et al.  MAXIMUM LIKELIHOOD ESTIMATION FOR PROBIT-LINEAR MIXED MODELS WITH CORRELATED RANDOM EFFECTS , 1997 .

[15]  J. Ware,et al.  Random-effects models for serial observations with binary response. , 1984, Biometrics.

[16]  D. Rubin,et al.  Inference from Iterative Simulation Using Multiple Sequences , 1992 .

[17]  J. Booth,et al.  Maximizing generalized linear mixed model likelihoods with an automated Monte Carlo EM algorithm , 1999 .

[18]  R. D. Bock,et al.  High-dimensional maximum marginal likelihood item factor analysis by adaptive quadrature , 2005 .

[19]  D. Rubin,et al.  Parameter expansion to accelerate EM: The PX-EM algorithm , 1998 .

[20]  Kristopher J Preacher,et al.  Item factor analysis: current approaches and future directions. , 2007, Psychological methods.

[21]  Harvey Goldstein,et al.  Improved Approximations for Multilevel Models with Binary Responses , 1996 .

[22]  R. Schall Estimation in generalized linear models with random effects , 1991 .

[23]  Jian Qing Shi,et al.  Maximum Likelihood Estimation of Two‐Level Latent Variable Models with Mixed Continuous and Polytomous Data , 2001 .

[24]  M. Tan,et al.  An efficient MCEM algorithm for fitting generalized linear mixed models for correlated binary data , 2007 .

[25]  Xiao-Li Meng,et al.  Maximum likelihood estimation via the ECM algorithm: A general framework , 1993 .

[26]  N. Breslow,et al.  Bias Correction in Generalized Linear Mixed Models with Multiple Components of Dispersion , 1996 .

[27]  S. Rabe-Hesketh,et al.  Latent Variable Modelling: A Survey * , 2007 .

[28]  N. Breslow,et al.  Bias correction in generalised linear mixed models with a single component of dispersion , 1995 .

[29]  R. D. Bock,et al.  Marginal maximum likelihood estimation of item parameters , 1982 .

[30]  William N. Venables,et al.  Modern Applied Statistics with S , 2010 .

[31]  G. C. Wei,et al.  A Monte Carlo Implementation of the EM Algorithm and the Poor Man's Data Augmentation Algorithms , 1990 .

[32]  H. Goldstein Nonlinear multilevel models, with an application to discrete response data , 1991 .

[33]  Yi-Hau Chen,et al.  Computationally efficient Monte Carlo EM algorithms for generalized linear mixed models , 2006 .

[34]  C. McCulloch Maximum Likelihood Variance Components Estimation for Binary Data , 1994 .

[35]  C. McCulloch Maximum Likelihood Algorithms for Generalized Linear Mixed Models , 1997 .

[36]  E. Muraki,et al.  Full-Information Item Factor Analysis , 1988 .

[37]  Noreen Goldman,et al.  An assessment of estimation procedures for multilevel models with binary responses , 1995 .

[38]  Lang Wu,et al.  Mixed Effects Models for Complex Data , 2019 .