Bayesian Methods and Model Selection for Latent Growth Curve Models with Missing Data

With an increase in complexity of latent growth curve models (LGCMs), comes an increase in problems estimating the models. This research first proposes new growth models to address the perennial problems of almost all longitudinal research, namely, missing data. Different non-ignorable missingness models are formulated. These models include the latent coefficient (intercept or slope)-dependent missingness, in which the missing data rates vary across different latent individual initial levels or slopes; and the potential outcome-dependent missingness, in which the missing data rates on each occasion depend on potential outcomes. Second, this study proposes a full Bayesian approach to estimate the proposed LGCMs with non-ignorable missing data through data augmentation algorithm and Gibbs sampling procedure. And third, model selecting criteria are proposed in a Bayesian context to identify the best-fit model.Simulation studies were conducted. Conclusions include the proposed method can accurately recover model parameters, the mis-specified missingness may result in severely misleading conclusions, and almost all the model selection criteria can correctly identify the true model with high certainty. The application of the model and the method are illustrated with a longitudinal data set showing growth in mathematical ability. Finally, related implications of the approach and future research directions are discussed.

[1]  H. Bozdogan Model selection and Akaike's Information Criterion (AIC): The general theory and its analytical extensions , 1987 .

[2]  T. Micceri The unicorn, the normal curve, and other improbable creatures. , 1989 .

[3]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data: Little/Statistical Analysis with Missing Data , 2002 .

[4]  Bradley P. Carlin,et al.  Bayesian measures of model complexity and fit , 2002 .

[5]  Howard Wainer,et al.  Drawing inferences from self-selected samples , 1986 .

[6]  Jonathan M. Keith,et al.  Model selection in Bayesian segmentation of multiple DNA alignments , 2011, Bioinform..

[7]  D. Bartholomew Latent Variable Models And Factor Analysis , 1987 .

[8]  P. Roth MISSING DATA: A CONCEPTUAL REVIEW FOR APPLIED PSYCHOLOGISTS , 1994 .

[9]  J. Q. Smith,et al.  1. Bayesian Statistics 4 , 1993 .

[10]  S. Sclove Application of model-selection criteria to some problems in multivariate analysis , 1987 .

[11]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[12]  J. Singer,et al.  Applied Longitudinal Data Analysis , 2003 .

[13]  P. Boeck,et al.  Confirmatory Analyses of Componential Test Structure Using Multidimensional Item Response Theory. , 1999, Multivariate behavioral research.

[14]  J. Ware,et al.  Applied Longitudinal Analysis , 2004 .

[15]  Bengt Muthén,et al.  Bayesian structural equation modeling: a more flexible representation of substantive theory. , 2012, Psychological methods.

[16]  Brenda White,et al.  DEPARTMENT OF LABOR , 2006 .

[17]  Craig K Enders,et al.  Missing not at random models for latent growth curve analyses. , 2011, Psychological methods.

[18]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Sik-Yum Lee Structural Equation Modeling: A Bayesian Approach , 2007 .

[20]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..

[21]  H. Akaike A new look at the statistical model identification , 1974 .

[22]  Richard M Lerner,et al.  Use of missing data methods in longitudinal studies: the persistence of bad practices in developmental psychology. , 2009, Developmental psychology.

[23]  Peter J. Huber,et al.  Robust Statistical Procedures: Second Edition , 1996 .

[24]  Andrew Gelman,et al.  R2WinBUGS: A Package for Running WinBUGS from R , 2005 .

[25]  Donald Hedeker,et al.  Longitudinal Data Analysis , 2006 .

[26]  Craig K. Enders,et al.  An introduction to modern missing data analyses. , 2010, Journal of school psychology.

[27]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[28]  John R. Nesselroade,et al.  Bayesian analysis of longitudinal data using growth curve models , 2007 .

[29]  Arthur P. Dempster,et al.  The direct use of likelihood for significance testing , 1997, Stat. Comput..

[30]  Sik-Yum Lee,et al.  Structural equation modelling: A Bayesian approach. , 2007 .

[31]  G. C. Tiao,et al.  Bayesian inference in statistical analysis , 1973 .

[32]  Kenneth A. Bollen,et al.  Latent curve models: A structural equation perspective , 2005 .

[33]  W. Wong,et al.  The calculation of posterior distributions by data augmentation , 1987 .

[34]  Zhiyong Zhang,et al.  Bayesian Inference for Growth Mixture Models with Latent Class Dependent Missing Data , 2011, Multivariate behavioral research.

[35]  C. Robert,et al.  Deviance information criteria for missing data models , 2006 .

[36]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[37]  Ke-Hai Yuan,et al.  SEM with Missing Data and Unknown Population Distributions Using Two-Stage ML: Theory and Its Application , 2008, Multivariate behavioral research.