Bayesian Latent Class Analysis: Sample Size, Model Size, and Classification Precision

The current literature includes limited information on the classification precision of Bayes estimation for latent class analysis (BLCA). (1) Objectives: The present study compared BLCA with the robust maximum likelihood (MLR) procedure, which is the default procedure with the Mplus 8.0 software. (2) Method: Markov chain Monte Carlo simulations were used to estimate two-, three-, and four-class models measured by four binary observed indicators with samples of 1000, 750, 500, 250, 100, and 75 observations, respectively. With each sample, the number of replications was 500, and entropy and average latent class probabilities for most likely latent class membership were recorded. (3) Results: Bayes entropy values were more stable and ranged between 0.644 and 1. Bayes’ average latent class probabilities ranged between 0.528 and 1. MLR entropy values ranged between 0.552 and 0.958. and MLR average latent class probabilities ranged between 0.539 and 0.993. With the two-class model, BLCA outperformed MLR with all sample sizes. With the three-class model, BLCA had higher classification precision with the 75-sample size, whereas MLR performed slightly better with the 750- and 1000-sample sizes. With the 4-class model, BLCA underperformed MLR and had an increased number of unsuccessful computations, particularly with smaller samples.

[1]  Robert L. Wolpert,et al.  Statistical Inference , 2019, Encyclopedia of Social Network Analysis and Mining.

[2]  A. Y. Choi,et al.  Ten Frequently Asked Questions About Latent Class Analysis , 2018, Translational Issues in Psychological Science.

[3]  Paul A. Gagniuc,et al.  Markov Chains: From Theory to Implementation and Experimentation , 2017 .

[4]  Zoubin Ghahramani,et al.  Probabilistic machine learning and artificial intelligence , 2015, Nature.

[5]  Dirk P. Kroese,et al.  Why the Monte Carlo method is so important today , 2014 .

[6]  Franz J. Neyer,et al.  A Gentle Introduction to Bayesian Analysis: Applications to Developmental Research , 2013, Child development.

[7]  Bengt Muthén,et al.  Bayesian structural equation modeling: a more flexible representation of substantive theory. , 2012, Psychological methods.

[8]  Stephanie T. Lanza,et al.  Latent Class and Latent Transition Analysis: With Applications in the Social, Behavioral, and Health Sciences , 2009 .

[9]  Simon Jackman,et al.  Bayesian Analysis for the Social Sciences , 2009 .

[10]  Bradley P. Carlin,et al.  Bayesian Methods for Data Analysis , 2008 .

[11]  Ajay Jasra,et al.  Markov Chain Monte Carlo Methods and the Label Switching Problem in Bayesian Mixture Modeling , 2005 .

[12]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[13]  G. Celeux,et al.  An entropy criterion for assessing the number of clusters in a mixture model , 1996 .

[14]  A. Raftery,et al.  Model-based Gaussian and non-Gaussian clustering , 1993 .

[15]  S. Chib,et al.  Bayesian analysis of binary and polychotomous response data , 1993 .

[16]  W. DeSarbo,et al.  An Empirical Pooling Approach for Estimating Marketing Mix Elasticities with PIMS Data , 1993 .

[17]  R. Redner,et al.  Mixture densities, maximum likelihood, and the EM algorithm , 1984 .

[18]  Sik-Yum Lee A bayesian approach to confirmatory factor analysis , 1981 .

[19]  R. P. McDonald,et al.  Bayesian estimation in unrestricted factor analysis: A treatment for heywood cases , 1975 .

[20]  L. A. Goodman The Analysis of Systems of Qualitative Variables When Some of the Variables Are Unobservable. Part I-A Modified Latent Structure Approach , 1974, American Journal of Sociology.

[21]  David B. Dunson,et al.  Bayesian data analysis, third edition , 2013 .

[22]  I. Katz,et al.  Using a Bayesian latent growth curve model to identify trajectories of positive affect and negative events following myocardial infarction. , 2005, Biostatistics.

[23]  B. Muthén BEYOND SEM: GENERAL LATENT VARIABLE MODELING , 2002 .

[24]  M. Stephens Dealing with label switching in mixture models , 2000 .