A Monte Carlo Simulation Study to Assess The Appropriateness of Traditional and Newer Approaches to Test for Measurement Invariance

Several structural equation modeling (SEM) strategies were developed for assessing measurement invariance (MI) across groups relaxing the assumptions of strict MI to partial, approximate, and partial approximate MI. Nonetheless, applied researchers still do not know if and under what conditions these strategies might provide results that allow for valid comparisons across groups in large-scale comparative surveys. We perform a comprehensive Monte Carlo simulation study to assess the conditions under which various SEM methods are appropriate to estimate latent means and path coefficients and their differences across groups. We find that while SEM path coefficients are relatively robust to violations of full MI and can be rather effectively recovered, recovering latent means and their group rankings might be difficult. Our results suggest that, contrary to some previous recommendations, partial invariance may rather effectively recover both path coefficients and latent means even when the majority of items are noninvariant. Although it is more difficult to recover latent means using approximate and partial approximate MI methods, it is possible under specific conditions and using appropriate models. These models also have the advantage of providing accurate standard errors. Alignment is recommended for recovering latent means in cases where there are only a few noninvariant parameters across groups.

[1]  Bengt Muthén,et al.  Bayesian structural equation modeling: a more flexible representation of substantive theory. , 2012, Psychological methods.

[2]  Willem E. Saris,et al.  Testing Structural Equation Models or Detection of Misspecifications? , 2009 .

[3]  Eldad Davidov,et al.  The Comparability of Measures in the Ageism Module of the Fourth Round of the European Social Survey, 2008-2009 , 2020 .

[4]  B. Muthén,et al.  A comparison of some methodologies for the factor analysis of non‐normal Likert variables , 1985 .

[5]  S. Reise,et al.  Exploring the measurement invariance of psychological instruments: Applications in the substance use domain. , 1997 .

[6]  Bengt Muthén,et al.  Testing for the Equivalence of Factor Covariance and Mean Structures : The Issue of Partial Measurement In variance , 1989 .

[7]  Christine DiStefano,et al.  The Impact of Categorization With Confirmatory Factor Analysis , 2002 .

[8]  J. Steenkamp,et al.  Assessing Measurement Invariance in Cross-National Consumer Research , 1998 .

[9]  Bengt Muthén,et al.  IRT studies of many groups: the alignment method , 2014, Front. Psychol..

[10]  B. Muthén,et al.  How to Use a Monte Carlo Study to Decide on Sample Size and Determine Power , 2002 .

[11]  D. R. Johnson,et al.  Ordinal measures in multiple indicator models: A simulation study of categorization error. , 1983 .

[12]  B. Muthén,et al.  Deciding on the Number of Classes in Latent Class Analysis and Growth Mixture Modeling: A Monte Carlo Simulation Study , 2007 .

[13]  W. Meredith Measurement invariance, factor analysis and factorial invariance , 1993 .

[14]  R. Brennan,et al.  Test Equating, Scaling, and Linking , 2004 .

[15]  Todd D. Little,et al.  A Non-arbitrary Method of Identifying and Scaling Latent Variables in SEM and MACS Models , 2006 .

[16]  Bengt Muthén,et al.  Prior-Posterior Predictive P-values , 2017 .

[17]  A. Boomsma,et al.  Robustness Studies in Covariance Structure Modeling , 1998 .

[18]  R. Hambleton,et al.  Fundamentals of Item Response Theory , 1991 .

[19]  Eldad Davidov,et al.  Comparing results of an exact vs. an approximate (Bayesian) measurement invariance test: a cross-country illustration with a scale to measure 19 human values , 2014, Front. Psychol..

[20]  Gilbert Swinnen,et al.  Biased latent variable mean comparisons due to measurement noninvariance. A simulation study , 2018 .

[21]  Holger Steinmetz,et al.  Analyzing observed composite differences across groups: Is partial measurement invariance enough? , 2013 .

[22]  Conor V. Dolan,et al.  Factor analysis of variables with 2, 3, 5, and 7 response categories: A comparison of categorical variable estimators using simulated data , 1994 .

[23]  Bengt Muthén,et al.  Multiple-Group Factor Analysis Alignment , 2014 .

[24]  John W Graham,et al.  Planned missing data designs in psychological research. , 2006, Psychological methods.

[25]  D. Betsy McCoach,et al.  An Investigation of the Alignment Method With Polytomous Indicators Under Conditions of Partial Measurement Invariance , 2018 .

[26]  R. Vandenberg,et al.  A Review and Synthesis of the Measurement Invariance Literature: Suggestions, Practices, and Recommendations for Organizational Research , 2000 .

[27]  Anna Brown,et al.  The consequences of ignoring measurement invariance for path coefficients in structural equation models , 2014, Front. Psychol..

[28]  D Kaplan,et al.  The Impact of Specification Error on the Estimation, Testing, and Improvement of Structural Equation Models. , 1988, Multivariate behavioral research.

[29]  Lars Tummers,et al.  Facing off with Scylla and Charybdis: a comparison of scalar, partial, and the novel possibility of approximate measurement invariance , 2013, Front. Psychol..

[30]  S. Leung A Comparison of Psychometric Properties and Normality in 4-, 5-, 6-, and 11-Point Likert Scales , 2011 .

[31]  Lars-Erik Malmberg,et al.  An approximate measurement invariance approach to within-couple relationship quality , 2014, Front. Psychol..

[32]  Judith Torney-Purta,et al.  Measurement Invariance in Comparing Attitudes Toward Immigrants Among Youth Across Europe in 1999 and 2009 , 2018 .

[33]  B. Muthén,et al.  What to do When Scalar Invariance Fails: The Extended Alignment Method for Multi-Group Factor Analysis Comparison of Latent Means Across Many Groups , 2017, Psychological methods.

[34]  Adam W. Meade,et al.  A Monte-Carlo Study of Confirmatory Factor Analytic Tests of Measurement Equivalence/Invariance , 2004 .

[35]  Roger E. Millsap,et al.  Detecting Violations of Factorial Invariance Using Data-Based Specification Searches: A Monte Carlo Study , 2007 .

[36]  Holger Steinmetz,et al.  Estimation and Comparison of Latent Means Across Cultures , 2018 .

[37]  R. Brennan,et al.  Test Equating, Scaling, and Linking: Methods and Practices , 2004 .

[38]  André Beauducel,et al.  On the Performance of Maximum Likelihood Versus Means and Variance Adjusted Weighted Least Squares Estimation in CFA , 2006 .

[39]  Thomas R. Boucher,et al.  Test Equating, Scaling, and Linking: Methods and Practices , 2007 .

[40]  Keith F Widaman,et al.  Confirmatory factor analysis and item response theory: two approaches for exploring measurement invariance. , 1993, Psychological bulletin.

[41]  Bengt Muthén,et al.  Recent Methods for the Study of Measurement Invariance With Many Groups , 2018 .

[42]  Daniel A. Sass,et al.  Evaluating Model Fit With Ordered Categorical Data Within a Measurement Invariance Framework: A Comparison of Estimators , 2014 .

[43]  Yan Wang,et al.  Measurement Invariance Testing with Many Groups: A Comparison of Five Approaches , 2017 .

[44]  B. Byrne,et al.  Testing for the equivalence of factor covariance and mean structures: The issue of partial measurement invariance. , 1989 .

[45]  P. Schmidt,et al.  Measurement Equivalence in Cross-National Research , 2014 .

[46]  K. Jöreskog A General Method for Estimating a Linear Structural Equation System. , 1970 .

[47]  Phil Wood Confirmatory Factor Analysis for Applied Research , 2008 .

[48]  Eugenio Gonzalez,et al.  principles of multiple matrix booklet designs and parameter recovery in large-scale assessments , 2010 .

[49]  Artur Pokropek Missing by Design: Planned Missing-Data Designs in Social Science , 2011 .

[50]  Eldad Davidov,et al.  The comparability of the universalism value over time and across countries in the European Social Survey: exact vs. approximate measurement invariance , 2015, Front. Psychol..