Latent class modeling with covariates : Two improved three-step approaches 1

Researchers using latent class (LC) analysis often proceed using the following three steps: (1) an LC model is built for a set of response variables, (2) subjects are assigned to LCs based on their posterior class membership probabilities, and (3) the association between the assigned class membership and external variables is investigated using simple cross-tabulations or multinomial logistic regression analysis. Bolck, Croon, and Hagenaars (2004) demonstrated that such a three-step approach underestimates the associations between covariates and class membership. They proposed resolving this problem by means of a specific correction method that involves modifying the third step. In this article, I extend the correction method of Bolck, Croon, and Hagenaars by showing that it involves maximizing a weighted log-likelihood function for clustered data. This conceptualization makes it possible to apply the method not only with categorical but also with continuous explanatory variables, to obtain correct tests using complex sampling variance estimation methods, and to implement it in standard software for logistic regression analysis. In addition, a new maximum likelihood (ML)—based correction method is proposed, which is more direct in the sense that it does not require analyzing weighted data. This new three-step ML method can be easily implemented in software for LC analysis. The reported simulation study shows that both correction methods perform very well in the sense that their parameter estimates and their SEs can be trusted, except for situations with very poorly separated classes. The main advantage of the ML method compared with the Bolck, Croon, and Hagenaars approach is that it is much more efficient and almost as efficient as one-step ML estimation.

[1]  Jonathan N. Katz,et al.  Reassessing the Link between Voter Heterogeneity and PoliticalAccountability: A Latent Class Regression Model of EconomicVoting , 2009 .

[2]  Jennifer L. Hill,et al.  Classification by Opinion-Changing Behavior: A Mixture Model Approach , 2001, Political Analysis.

[3]  Kazuo Yamaguchi,et al.  Multinomial Logit Latent‐Class Regression Models: An Analysis of the Predictors of Gender‐Role Attitudes among Japanese Women1 , 2000, American Journal of Sociology.

[4]  José G. Dias,et al.  A bootstrap-based aggregate classifier for model-based clustering , 2008, Comput. Stat..

[5]  Leo A. Goodman,et al.  1. On the Assignment of Individuals to Latent Classes , 2007 .

[6]  Chris J. Skinner,et al.  Analysis of complex surveys , 1991 .

[7]  Jeroen K. Vermunt,et al.  AVOIDING BOUNDARY ESTIMATES IN LATENT CLASS ANALYSIS BY BAYESIAN POSTERIOR MODE ESTIMATION , 2006 .

[8]  Blossom H. Patterson,et al.  Latent Class Analysis of Complex Sample Survey Data , 2002 .

[9]  J. Vermunt Mixed-Effects Logistic Regression Models for Indirectly Observed Discrete Outcome Variables , 2005, Multivariate behavioral research.

[10]  David E. Booth,et al.  Analysis of Incomplete Multivariate Data , 2000, Technometrics.

[11]  R. Breen Why Is Support for Extreme Parties Underestimated by Surveys? A Latent Class Analysis , 2000, British Journal of Political Science.

[12]  L. A. Goodman,et al.  Latent Structure Analysis of a Set of Multidimensional Contingency Tables , 1984 .

[13]  Petter Laake,et al.  Regression among factor scores , 2001 .

[14]  Paul F. Lazarsfeld,et al.  Latent Structure Analysis. , 1969 .

[15]  A. Agresti,et al.  Categorical Data Analysis , 1991, International Encyclopedia of Statistical Science.

[16]  Joseph L Schafer,et al.  Latent class logistic regression: application to marijuana use and attitudes among high school seniors , 2006 .

[17]  S. Haberman Analysis of qualitative data , 1978 .

[18]  Scott L. Zeger,et al.  Latent Variable Regression for Multiple Discrete Outcomes , 1997 .

[19]  Z. Gilula,et al.  5. An Extended Study into the Relationship between Correspondence Analysis and Latent Class Analysis , 1999 .

[20]  P. Deb Finite Mixture Models , 2008 .

[21]  L. A. Goodman Exploratory latent structure analysis using both identifiable and unidentifiable models , 1974 .

[22]  Jeroen K. Vermunt,et al.  Mixture models for multilevel data sets , 2010 .

[23]  M. Croon Using Predicted Latent Scores in General Latent Struc­ ture M odels , 2002 .

[24]  Michel Wedel,et al.  Concomitant Variable Latent Class Models for Conjoint Analysis , 1994 .

[25]  S. Zeger,et al.  Latent Class Model Diagnosis , 2000, Biometrics.

[26]  Scott Zeger,et al.  Methods for evaluating the performance of diagnostic tests in the absence of a gold standard: a latent class model approach , 2002, Statistics in medicine.

[27]  B. Graubard,et al.  Latent Class Analysis of Complex Sample Survey Data , 2002 .

[28]  Solon J. Simmons Ascriptive Justice: The Prevalence, Distribution, and Consequences of Political Correctness in the Academy , 2008 .

[29]  David Knoke,et al.  Analysis of Qualitative Data, Vol. 2: New Developments. , 1981 .

[30]  F. Krauss Latent Structure Analysis , 1980 .

[31]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[32]  J. Vermunt,et al.  Latent class and finite mixture models for multilevel data sets , 2008, Statistical methods in medical research.

[33]  Michel Wedel,et al.  Concomitant Variable Latent Class Models for the External Analysis of Choice Data , 1992 .

[34]  R. Dalton,et al.  Citizenship Norms and the Expansion of Political Participation , 2008 .

[35]  J. Vermunt,et al.  Tilburg University Mixed-effects logistic regression models for indirectly observed outcome variables , 2004 .

[36]  Jonas Edlund Trust in the Capability of the Welfare State and General Welfare State Support: Sweden 1997-2002 , 2006 .

[37]  Jeroen K. Vermunt,et al.  Heterogeneity in Post-materialist Value Priorities. Evidence from a Latent Class Discrete Choice Approach , 2007 .

[38]  J. Hagenaars Loglinear Models with Latent Variables , 1993 .

[39]  J. Vermunt,et al.  Latent Gold 4.0 User's Guide , 2005 .

[40]  David J. Bartholomew,et al.  New Developments in Latent Structure Analysis Applied to Social Attitudes , 1996 .

[41]  Jeroen K. Vermunt,et al.  Log-Linear Models for Event Histories , 1997 .

[42]  Irene R. R. Lu,et al.  Avoiding and Correcting Bias in Score-Based Latent Variable Regression With Discrete Manifest Items , 2008 .

[43]  Marcel Croon,et al.  Estimating Latent Structure Models with Categorical Variables: One-Step Versus Three-Step Estimators , 2004, Political Analysis.

[44]  F. V. D. Pol,et al.  MIXED MARKOV LATENT CLASS MODELS , 1990 .

[45]  Peter G. M. van der Heijden,et al.  The Analysis of Multivariate Misclassified Data With Special Attention to Randomized Response Data , 2004 .

[46]  Jay Magidson,et al.  Latent Class Factor and Cluster Models, Bi-Plots, and Related Graphical Displays , 2001 .

[47]  Neil Henry Latent structure analysis , 1969 .

[48]  Jeroen K. Vermunt,et al.  7. Multilevel Latent Class Models , 2003 .

[49]  Jay Magidson,et al.  Qualitative variance, entropy, and correlation ratios for nominal dependent variables , 1981 .

[50]  Drew A. Linzer,et al.  The Political Economy of Women's Support for Fundamentalist Islam , 2008 .

[51]  L. A. Goodman The Analysis of Systems of Qualitative Variables When Some of the Variables Are Unobservable. Part I-A Modified Latent Structure Approach , 1974, American Journal of Sociology.

[52]  Jay Magidson,et al.  LG-Syntax user's guide: Manual for Latent GOLD 4.5 Syntax module , 2008 .

[53]  A. McCutcheon,et al.  Latent Class Analysis , 2021, Encyclopedia of Autism Spectrum Disorders.

[54]  Allan L. McCutcheon,et al.  A Latent Class Analysis of Tolerance for Nonconformity in the American Public , 1985 .

[55]  Jennifer L. Hill An Extension and Test of Converse’s “Black-and-White” Model of Response Stability , 2001, American Political Science Review.

[56]  J. Vermunt,et al.  Discrete-Time Discrete-State Latent Markov Models with Time-Constant and Time-Varying Covariates , 1999 .

[57]  L. Feick LATENT CLASS ANALYSIS OF SURVEY QUESTIONS THAT INCLUDE DON'T KNOW RESPONSES , 1989 .

[58]  L. Collins,et al.  Latent Class Models for Stage-Sequential Dynamic Latent Variables , 1992 .

[59]  George B. Macready,et al.  Concomitant-Variable Latent-Class Models , 1988 .

[60]  P. V. D. van der Heijden,et al.  The Analysis of Multivariate Misclassified Data With Special Attention to Randomized Response Data , 2004 .