Implementing continuous non-normal skewed distributions in latent growth mixture modeling: An assessment of specification errors and class enumeration

Abstract Recent advances have allowed for modeling mixture components within latent growth modeling using robust, skewed mixture distributions rather than normal distributions. This feature adds flexibility in handling non-normality in longitudinal data, through manifest or latent variables, by directly modeling skewed or heavy-tailed latent classes rather than assuming a mixture of normal distributions. The aim of this study was to assess through simulation the potential under- or over-extraction of latent classes in a growth mixture model when underlying data follow either normal, skewed-normal, or skewed-t distributions. In order to assess this, we implement skewed-t, skewed-normal, and conventional normal (i.e., not skewed) forms of the growth mixture model. The skewed-t and skewed-normal versions of this model have only recently been implemented, and relatively little is known about their performance. Model comparison, fit, and classification of correctly specified and mis-specified models were assessed through various indices. Findings suggest that the accuracy of model comparison and fit measures are dependent on the type of (mis)specification, as well as the amount of class separation between the latent classes. A secondary simulation exposed computation and accuracy difficulties under some skewed modeling contexts. Implications of findings, recommendations for applied researchers, and future directions are discussed; a motivating example is presented using education data.

[1]  L. E. Garrido,et al.  Class enumeration false positive in skew-t family of continuous growth mixture models , 2020, PloS one.

[2]  Sharon X. Lee,et al.  Robust mixtures of factor analysis models using the restricted multivariate skew-t distribution , 2018 .

[3]  David Kaplan,et al.  Methodological Advances in the Analysis of Individual Growth With Relevance to Education Policy , 2002, Peabody Journal of Education.

[4]  Camille Roth,et al.  Natural Scales in Geographical Patterns , 2017, Scientific Reports.

[5]  P. McNicholas,et al.  Extending Growth Mixture Models Using Continuous Non-Elliptical Distributions , 2017, 1703.08723.

[6]  Douglas Steinley,et al.  Local Optima in Mixture Modeling , 2016, Multivariate behavioral research.

[7]  D. Steinley,et al.  Extracting Spurious Latent Classes in Growth Mixture Modeling With Nonnormal Errors , 2016, Educational and psychological measurement.

[8]  Lawrence Hubert,et al.  The variance of the adjusted Rand index. , 2016, Psychological methods.

[9]  B. Muthén,et al.  Structural Equation Models and Mixture Models With Continuous Nonnormal Skewed Distributions , 2016 .

[10]  M. Cugmas,et al.  On comparing partitions , 2015 .

[11]  B. Muthén,et al.  Growth mixture modeling with non‐normal distributions , 2015, Statistics in medicine.

[12]  Joshua F. Wiley,et al.  Automating Mplus Model Estimation and Interpretation , 2014 .

[13]  Geoffrey J. McLachlan,et al.  Finite mixtures of multivariate skew t-distributions: some recent and new results , 2014, Stat. Comput..

[14]  Ryan P. Browne,et al.  Mixtures of skew-t factor analyzers , 2013, Comput. Stat. Data Anal..

[15]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[16]  C. Meade,et al.  Reductions in traumatic stress following a coping intervention were mediated by decreases in avoidant coping for people living with HIV/AIDS and childhood sexual abuse. , 2013, Journal of consulting and clinical psychology.

[17]  Sarah Depaoli,et al.  Mixture class recovery in GMM under varying degrees of class separation: frequentist versus Bayesian estimation. , 2013, Psychological methods.

[18]  Zhiyong Zhang,et al.  Bayesian Inference and Application of Robust Growth Curve Models Using Student's t Distribution , 2013 .

[19]  Geoffrey J. McLachlan,et al.  On mixtures of skew normal and skew $$t$$-distributions , 2012, Adv. Data Anal. Classif..

[20]  S. Depaoli,et al.  Two Studies of Specification Error in Models for Categorical Latent Variables , 2011 .

[21]  M. Brusco,et al.  Evaluating mixture modeling for clustering: recommendations and cautions. , 2011, Psychological methods.

[22]  M. Thase,et al.  Differential effects of treatments for chronic depression: a latent growth model reanalysis. , 2010, Journal of consulting and clinical psychology.

[23]  Stephen J Tueller,et al.  Evaluation of Structural Equation Mixture Models: Parameter Estimates and Correct Class Assignment , 2010, Structural equation modeling : a multidisciplinary journal.

[24]  David Kaplan,et al.  Structural Equation Modeling (2nd ed.): Foundations and Extensions , 2009 .

[25]  M. Genton,et al.  Robust Likelihood Methods Based on the Skew‐t and Related Distributions , 2008 .

[26]  Daniel J. Bauer Observations on the Use of Growth Mixture Models in Psychological Research , 2007 .

[27]  B. Muthén,et al.  Deciding on the Number of Classes in Latent Class Analysis and Growth Mixture Modeling: A Monte Carlo Simulation Study , 2007 .

[28]  Jack C. Lee,et al.  Robust mixture modeling using the skew t distribution , 2007, Stat. Comput..

[29]  S. Reise,et al.  Detecting Mixtures From Structural Model Differences Using Latent Variable Mixture Modeling: A Comparison of Relative Model Fit Statistics , 2007 .

[30]  Asko Tolvanen,et al.  Latent growth mixture modeling : a simulation study , 2007 .

[31]  Nancy C. Jordan,et al.  Number sense growth in kindergarten: a longitudinal investigation of children at risk for mathematics difficulties. , 2006, Child development.

[32]  Kenneth A. Bollen,et al.  Latent curve models: A structural equation perspective , 2005 .

[33]  M. Goldman,et al.  Variation in the drinking trajectories of freshmen college students. , 2005, Journal of consulting and clinical psychology.

[34]  B. Muthén,et al.  Investigating population heterogeneity with factor mixture models. , 2005, Psychological methods.

[35]  D. Steinley Properties of the Hubert-Arabie adjusted Rand index. , 2004, Psychological methods.

[36]  Daniel J. Bauer,et al.  Overextraction of latent trajectory classes: Much ado about nothing? Reply to Rindskopf (2003), Muthén (2003), and Cudeck and Henly (2003) , 2003 .

[37]  Daniel J Bauer,et al.  Distributional assumptions of growth mixture models: implications for overextraction of latent trajectory classes. , 2003, Psychological methods.

[38]  Bengt Muthén,et al.  Statistical and substantive checking in growth mixture modeling: comment on Bauer and Curran (2003). , 2003, Psychological methods.

[39]  Douglas Steinley,et al.  Local optima in K-means clustering: what you don't know may hurt you. , 2003, Psychological methods.

[40]  R. Cudeck,et al.  A realistic perspective on pattern representation in growth data: comment on Bauer and Curran (2003). , 2003, Psychological methods.

[41]  D. Rindskopf Mixture or homogeneous? Comment on Bauer and Curran (2003). , 2003, Psychological methods.

[42]  D. Rubin,et al.  Testing the number of components in a normal mixture , 2001 .

[43]  M. Windle,et al.  Depressive symptoms and cigarette smoking among middle adolescents: prospective associations and intrapersonal and interpersonal influences. , 2001, Journal of consulting and clinical psychology.

[44]  D. Kaplan Structural Equation Modeling: Foundations and Extensions , 2000 .

[45]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[46]  B. Muthén,et al.  Finite Mixture Modeling with Mixture Outcomes Using the EM Algorithm , 1999, Biometrics.

[47]  G. Celeux,et al.  An entropy criterion for assessing the number of clusters in a mixture model , 1996 .

[48]  W. DeSarbo,et al.  An Empirical Pooling Approach for Estimating Marketing Mix Elasticities with PIMS Data , 1993 .

[49]  F. Rombouts,et al.  Modeling of the Bacterial Growth Curve , 1990, Applied and environmental microbiology.

[50]  T. Micceri The unicorn, the normal curve, and other improbable creatures. , 1989 .

[51]  H. Akaike Factor analysis and AIC , 1987 .

[52]  R. Hathaway Another interpretation of the EM algorithm for mixture distributions , 1986 .

[53]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[54]  C. Burt Is Intelligence Distributed Normally , 1963 .

[55]  V L KOZLOWSKI,et al.  Depressive symptoms. , 1959, Medical times.