Relating Latent Class Assignments to External Variables: Standard Errors for Correct Inference

Latent class analysis is used in the political science literature in both substantive applications and as a tool to estimate measurement error. Many studies in the social and political sciences relate estimated class assignments from a latent class model to external variables. Although common, such a “three-step” procedure effectively ignores classification error in the class assignments; Vermunt (2010, “Latent class modeling with covariates: Two improved three-step approaches,” Political Analysis 18:450–69) showed that this leads to inconsistent parameter estimates and proposed a correction. Although this correction for bias is now implemented in standard software, inconsistency is not the only consequence of classification error. We demonstrate that the correction method introduces an additional source of variance in the estimates, so that standard errors and confidence intervals are overly optimistic when not taking this into account. We derive the asymptotic variance of the third-step estimates of interest, as well as several candidate-corrected sample estimators of the standard errors. These corrected standard error estimators are evaluated using a Monte Carlo study, and we provide practical advice to researchers as to which should be used so that valid inferences can be obtained when relating estimated class membership to external variables.

[1]  Jacques A. Hagenaars,et al.  Categorical Longitudinal Data. , 1991 .

[2]  Sophia Rabe-Hesketh,et al.  Maximum Likelihood Estimation of Generalized Linear Models with Covariate Measurement Error , 2003 .

[3]  Jennifer L. Hill,et al.  Classification by Opinion-Changing Behavior: A Mixture Model Approach , 2001, Political Analysis.

[4]  R. Carroll,et al.  A Note on the Efficiency of Sandwich Covariance Matrix Estimation , 2001 .

[5]  D. Alwin Margins of Error , 2007 .

[6]  Jeroen K. Vermunt,et al.  A Model-Based Approach to Goodness-of-Fit Evaluation in Item Response Theory , 2013 .

[7]  José G. Dias,et al.  A bootstrap-based aggregate classifier for model-based clustering , 2008, Comput. Stat..

[8]  Michel Wedel,et al.  Mixture Model Analysis of Complex Samples , 1998 .

[9]  Drew A. Linzer Reliable Inference in Highly Stratified Contingency Tables: Using Latent Class Models as Density Estimators , 2011, Political Analysis.

[10]  G. Oehlert A note on the delta method , 1992 .

[11]  W. Wong,et al.  The calculation of posterior distributions by data augmentation , 1987 .

[12]  Leonard S. Cahen,et al.  Educational Testing Service , 1970 .

[13]  A. Feingold,et al.  New approaches for examining associations with latent categorical variables: applications to substance abuse and aggression. , 2014, Psychology of addictive behaviors : journal of the Society of Psychologists in Addictive Behaviors.

[14]  George B. Macready,et al.  Concomitant-Variable Latent-Class Models , 1988 .

[15]  Joseph L Schafer,et al.  Analysis of Incomplete Multivariate Data , 1997 .

[16]  J. Goldthorpe,et al.  Social Stratification and Cultural Consumption: Music in England , 2006 .

[17]  Thomas Mustillo Modeling New Party Performance: A Conceptual and Methodological Approach for Volatile Party Systems , 2009, Political Analysis.

[18]  Jeroen K. Vermunt,et al.  Latent class modeling with covariates : Two improved three-step approaches 1 , 2012 .

[19]  William R. Parke Pseudo Maximum Likelihood Estimation: The Asymptotic Distribution , 1986 .

[20]  Margaret E. Roberts,et al.  How Robust Standard Errors Expose Methodological Problems They Do Not Fix, and What to Do About It , 2015, Political Analysis.

[21]  Wayne A. Fuller,et al.  Measurement Error Models , 1988 .

[22]  L. A. Goodman The Analysis of Systems of Qualitative Variables When Some of the Variables Are Unobservable. Part I-A Modified Latent Structure Approach , 1974, American Journal of Sociology.

[23]  Raymond J. Carroll,et al.  Measurement error in nonlinear models: a modern perspective , 2006 .

[24]  S. Sclove Application of model-selection criteria to some problems in multivariate analysis , 1987 .

[25]  Justin Grimmer,et al.  Appropriators not Position Takers: The Distorting Effects of Electoral Incentives on Congressional Representation , 2013 .

[26]  J. Hagenaars Loglinear Models with Latent Variables , 1993 .

[27]  Justin Grimmer,et al.  Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts , 2013, Political Analysis.

[28]  R. Breen Why Is Support for Extreme Parties Underestimated by Surveys? A Latent Class Analysis , 2000, British Journal of Political Science.

[29]  Ulrich Trautwein,et al.  Please Scroll down for Article Structural Equation Modeling: a Multidisciplinary Journal Classical Latent Profile Analysis of Academic Self-concept Dimensions: Synergy of Person-and Variable-centered Approaches to Theoretical Models of Self-concept , 2022 .

[30]  B. Muthén,et al.  Deciding on the Number of Classes in Latent Class Analysis and Growth Mixture Modeling: A Monte Carlo Simulation Study , 2007 .

[31]  Scott L. Zeger,et al.  Latent Variable Regression for Multiple Discrete Outcomes , 1997 .

[32]  Andrew W. Roddam,et al.  Measurement Error in Nonlinear Models: a Modern Perspective , 2008 .

[33]  D. Green,et al.  Principled Tolerance and the American Mass Public , 1989, British Journal of Political Science.

[34]  P. Lewinsohn,et al.  Latent trajectory classes of depressive and anxiety disorders from adolescence to adulthood: descriptions of classes and associations with risk factors. , 2010, Comprehensive psychiatry.

[35]  Karen Bandeen-Roche,et al.  Residual Diagnostics for Growth Mixture Models , 2005 .

[36]  Gail Gong,et al.  Pseudo Maximum Likelihood Estimation: Theory and Applications , 1981 .

[37]  Peter G. M. van der Heijden,et al.  A parametric bootstrap procedure to perform statistical tests in a LCA of anti-social behaviour , 1997 .

[38]  M. Centellas,et al.  The Democracy Cluster Classification Index , 2013, Political Analysis.

[39]  Jay Magidson,et al.  Technical Guide for Latent GOLD 5.1: Basic, Advanced, and Syntax 1 , 2016 .

[40]  Renee M. Clark,et al.  A new approach to hazardous materials transportation risk analysis: decision modeling to identify critical variables. , 2009, Risk analysis : an official publication of the Society for Risk Analysis.

[41]  J. Rost,et al.  Applications of Latent Trait and Latent Class Models in the Social Sciences , 1998 .

[42]  Stephanie T. Lanza,et al.  Latent Class Analysis With Distal Outcomes: A Flexible Model-Based Approach , 2013, Structural equation modeling : a multidisciplinary journal.

[43]  Chris J. Skinner,et al.  Analysis of complex surveys , 1991 .

[44]  Stephanie T. Lanza,et al.  Latent Class and Latent Transition Analysis: With Applications in the Social, Behavioral, and Health Sciences , 2009 .

[45]  Simon Jackman,et al.  Democracy as a Latent Variable , 2008 .

[46]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[47]  Eric Loken,et al.  Using Latent Class Analysis to Model Temperament Types , 2004, Multivariate behavioral research.

[48]  Jeroen K. Vermunt,et al.  Estimating the Association between Latent Class Membership and External Variables Using Bias-adjusted Three-step Approaches , 2013 .

[49]  Drew A. Linzer,et al.  The Political Economy of Women's Support for Fundamentalist Islam , 2008 .

[50]  V. Neuhaus,et al.  Latent Class Analysis , 2010 .

[51]  S. Stouffer,et al.  Communism, conformity, and civil liberties : a cross-section of the nation speaks its mind , 1955 .

[52]  T. König,et al.  Estimating Party Positions across Countries and Time—A Dynamic Latent Variable Model for Manifesto Data , 2013, Political Analysis.

[53]  Matt Golder,et al.  New Empirical Strategies for the Study of Parliamentary Government Formation , 2012, Political Analysis.

[54]  Bengt Muthén,et al.  Auxiliary Variables in Mixture Modeling: Three-Step Approaches Using Mplus , 2014 .

[55]  Anders Skrondal,et al.  Improved Regression Calibration , 2012 .

[56]  Marcel Croon,et al.  Estimating Latent Structure Models with Categorical Variables: One-Step Versus Three-Step Estimators , 2004, Political Analysis.

[57]  B. Muthén,et al.  Auxiliary Variables in Mixture Modeling: Three-Step Approaches Using Mplus , 2014 .

[58]  Allan L. McCutcheon,et al.  A Latent Class Analysis of Tolerance for Nonconformity in the American Public , 1985 .

[59]  Roger A. Sugden,et al.  Multiple Imputation for Nonresponse in Surveys , 1988 .

[60]  C. Fornell,et al.  Evaluating structural equation models with unobservable variables and measurement error. , 1981 .

[61]  H. White Maximum Likelihood Estimation of Misspecified Models , 1982 .

[62]  Duane F. Alwin Margins of Error: A Study of Reliability in Survey Measurement , 2007 .

[63]  L. Feick LATENT CLASS ANALYSIS OF SURVEY QUESTIONS THAT INCLUDE DON'T KNOW RESPONSES , 1989 .

[64]  Gary King,et al.  Multiple Overimputation: A Unified Approach to Measurement Error and Missing Data , 2010 .

[65]  Kathryn Roeder,et al.  Modeling Uncertainty in Latent Class Membership: A Case Study in Criminology , 1999 .

[66]  Albert Satorra,et al.  Measurement Error Models With Uncertainty About the Error Variance , 2013 .

[67]  Robert J. Mislevy,et al.  Randomization-based inference about latent variables from complex samples , 1991 .

[68]  John S. Ahlquist,et al.  Model-based Clustering and Typologies in the Social Sciences , 2012, Political Analysis.

[69]  M. Beissinger,et al.  The Semblance of Democratic Revolution: Coalitions in Ukraine's Orange Revolution , 2013, American Political Science Review.

[70]  Kevin M. Murphy,et al.  Estimation and Inference in Two-Step Econometric Models , 1985 .