A Bayesian analysis of mixture structural equation models with non‐ignorable missing responses and covariates

In behavioral, biomedical, and social-psychological sciences, it is common to encounter latent variables and heterogeneous data. Mixture structural equation models (SEMs) are very useful methods to analyze these kinds of data. Moreover, the presence of missing data, including both missing responses and missing covariates, is an important issue in practical research. However, limited work has been done on the analysis of mixture SEMs with non-ignorable missing responses and covariates. The main objective of this paper is to develop a Bayesian approach for analyzing mixture SEMs with an unknown number of components, in which a multinomial logit model is introduced to assess the influence of some covariates on the component probability. Results of our simulation study show that the Bayesian estimates obtained by the proposed method are accurate, and the model selection procedure via a modified DIC is useful in identifying the correct number of components and in selecting an appropriate missing mechanism in the proposed mixture SEMs. A real data set related to a longitudinal study of polydrug use is employed to illustrate the methodology.

[1]  C. Robert,et al.  Deviance information criteria for missing data models , 2006 .

[2]  Xin-Yuan Song,et al.  Evaluation of the Bayesian and Maximum Likelihood Approaches in Analyzing Structural Equation Models with Small Sample Sizes , 2004, Multivariate behavioral research.

[3]  D. Dunson,et al.  Bayesian latent variable models for clustered mixed outcomes , 2000 .

[4]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[5]  A. F. Smith,et al.  Statistical analysis of finite mixture distributions , 1986 .

[6]  W. K. Hastings,et al.  Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[7]  Sik-Yum Lee,et al.  Maximum Likelihood Estimation and Model Comparison for Mixtures of Structural Equation Models with Ignorable Missing Data , 2003, J. Classif..

[8]  Sik-Yum Lee,et al.  A Bayesian analysis of finite mixtures in the LISREL model , 2001 .

[9]  Yiu-Fai Yung,et al.  Finite mixtures in confirmatory factor-analysis models , 1997 .

[10]  P. Green,et al.  Corrigendum: On Bayesian analysis of mixtures with an unknown number of components , 1997 .

[11]  Joseph G. Ibrahim,et al.  A Weighted Estimating Equation for Missing Covariate Data with Properties Similar to Maximum Likelihood , 1999 .

[12]  Han L. J. van der Maas,et al.  Fitting multivariage normal finite mixtures subject to structural equation modeling , 1998 .

[13]  M. Stephens Dealing with label switching in mixture models , 2000 .

[14]  P. Green,et al.  On Bayesian Analysis of Mixtures with an Unknown Number of Components (with discussion) , 1997 .

[15]  A two-level structural equation model approach for analyzing multivariate longitudinal responses. , 2008, Statistics in medicine.

[16]  Sik-Yum Lee,et al.  Structural equation modelling: A Bayesian approach. , 2007 .

[17]  Gerhard Arminger,et al.  Mixtures of conditional mean- and covariance-structure models , 1999 .

[18]  Xin-Yuan Song,et al.  Bayesian model selection for mixtures of structural equation models with an unknown number of components. , 2003, The British journal of mathematical and statistical psychology.

[19]  Sik-Yum Lee,et al.  Bayesian analysis of latent variable models with non-ignorable missing outcomes from exponential family. , 2007, Statistics in medicine.

[20]  R. Scheines,et al.  Bayesian estimation and testing of structural equation models , 1999 .

[21]  Paul J. Rathouz,et al.  Semiparametric inference in matched case-control studies with missing covariate data , 2002 .

[22]  Yasuo Amemiya,et al.  Latent class regression on latent factors. , 2005, Biostatistics.

[23]  Mark L. Davison,et al.  Effects of Missing Data Methods in Structural Equation Modeling With Nonnormal Longitudinal Data , 2009 .

[24]  Joseph G. Ibrahim,et al.  Missing covariates in generalized linear models when the missing data mechanism is non‐ignorable , 1999 .

[25]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Joseph G Ibrahim,et al.  A weighted estimating equation for linear regression with missing covariate data , 2002, Statistics in medicine.

[27]  Sik-Yum Lee,et al.  Bayesian Analysis of Mixtures Structural Equation Models with Missing Data , 2007 .

[28]  Kamel Jedidi,et al.  STEMM: A General Finite Mixture Structural Equation Model , 1997 .

[29]  S. Lipsitz,et al.  Missing responses in generalised linear mixed models when the missing data mechanism is nonignorable , 2001 .

[30]  S. Frühwirth-Schnatter Markov chain Monte Carlo Estimation of Classical and Dynamic Switching and Mixture Models , 2001 .

[31]  Bradley P. Carlin,et al.  Bayesian measures of model complexity and fit , 2002 .

[32]  I. Katz,et al.  Using a Bayesian latent growth curve model to identify trajectories of positive affect and negative events following myocardial infarction. , 2005, Biostatistics.

[33]  B. Muthén,et al.  Finite Mixture Modeling with Mixture Outcomes Using the EM Algorithm , 1999, Biometrics.

[34]  Joseph G Ibrahim,et al.  Maximum Likelihood Methods for Nonignorable Missing Responses and Covariates in Random Effects Models , 2003, Biometrics.