When can categorical variables be treated as continuous? A comparison of robust continuous and categorical SEM estimation methods under suboptimal conditions.

A simulation study compared the performance of robust normal theory maximum likelihood (ML) and robust categorical least squares (cat-LS) methodology for estimating confirmatory factor analysis models with ordinal variables. Data were generated from 2 models with 2-7 categories, 4 sample sizes, 2 latent distributions, and 5 patterns of category thresholds. Results revealed that factor loadings and robust standard errors were generally most accurately estimated using cat-LS, especially with fewer than 5 categories; however, factor correlations and model fit were assessed equally well with ML. Cat-LS was found to be more sensitive to sample size and to violations of the assumption of normality of the underlying continuous variables. Normal theory ML was found to be more sensitive to asymmetric category thresholds and was especially biased when estimating large factor loadings. Accordingly, we recommend cat-LS for data sets containing variables with fewer than 5 categories and ML when there are 5 or more categories, sample size is small, and category thresholds are approximately symmetric. With 6-7 categories, results were similar across methods for many conditions; in these cases, either method is acceptable.

[1]  K. Pearson On the Theory of Contingency and Its Relation to Association and Normal Correlation , 2013 .

[2]  R. Kaul,et al.  Quality and quantity: mucosal CD4+ T cells and HIV susceptibility. , 2012, Current opinion in HIV and AIDS.

[3]  Li Cai,et al.  Comparing the Fit of Item Response Theory and Factor Analysis Models , 2011 .

[4]  K. Yuan,et al.  Ridge structural equation modelling with correlation matrices for ordinal and continuous data. , 2011, The British journal of mathematical and statistical psychology.

[5]  Ke-Hai Yuan,et al.  Positive Definiteness via Off-diagonal Scaling of a Symmetric Indefinite Matrix , 2010, Psychometrika.

[6]  K. Jöreskog,et al.  Confirmatory Factor Analysis of Ordinal Variables With Misspecified Models , 2010 .

[7]  Craig K. Enders,et al.  Applied Missing Data Analysis , 2010 .

[8]  Harry Joe,et al.  A General Family of Limited Information Goodness-of-Fit Statistics for Multinomial Data , 2010 .

[9]  Alberto Maydeu-Olivares,et al.  Factor Analysis with Ordinal Indicators: A Monte Carlo Study Comparing DWLS and ULS Estimation , 2009 .

[10]  Alberto Maydeu-Olivares,et al.  Estimation of IRT graded response models: limited versus full information methods. , 2009, Psychological methods.

[11]  Pui-Wa Lei,et al.  Evaluating estimation methods for ordinal data in structural equation modeling , 2009 .

[12]  Kristopher J Preacher,et al.  Item factor analysis: current approaches and future directions. , 2007, Psychological methods.

[13]  H. Marsh,et al.  Applications of latent-variable models in educational psychology: The need for methodological-substantive synergies. , 2007 .

[14]  Michael Eid,et al.  Analysing multitrait-multimethod data with structural equation models for ordinal variables applying the WLSMV estimator: what sample size is needed for valid results? , 2006, The British journal of mathematical and statistical psychology.

[15]  André Beauducel,et al.  On the Performance of Maximum Likelihood Versus Means and Variance Adjusted Weighted Least Squares Estimation in CFA , 2006 .

[16]  Albert Maydeu-Olivares,et al.  Limited- and Full-Information Estimation and Goodness-of-Fit Testing in 2n Contingency Tables , 2005 .

[17]  Daniel J Bauer,et al.  The role of nonlinear factor-to-indicator relationships in tests of measurement equivalence. , 2005, Psychological methods.

[18]  M. Martens The Use of Structural Equation Modeling in Counseling Psychology Research , 2005 .

[19]  Stan Lipovetsky,et al.  Generalized Latent Variable Modeling: Multilevel,Longitudinal, and Structural Equation Models , 2005, Technometrics.

[20]  Albert Maydeu-Olivares,et al.  Limited Information Goodness-of-fit Testing in Multidimensional Contingency Tables , 2005 .

[21]  H. Joe,et al.  Limited-and Full-Information Estimation and Goodness-ofFit Testing in 2 n Contingency Tables : A Unified Framework , 2005 .

[22]  D. Flora,et al.  An empirical evaluation of alternative methods of estimation for confirmatory factor analysis with ordinal data. , 2004, Psychological methods.

[23]  Roger E. Millsap,et al.  Assessing Factorial Invariance in Ordered-Categorical Measures , 2004 .

[24]  Daniel J Bauer,et al.  The integration of continuous and discrete latent variable models: potential problems and promising opportunities. , 2004, Psychological methods.

[25]  Craig K. Enders,et al.  Using an EM Covariance Matrix to Estimate Structural Equation Models With Missing Data: Choosing an Adjusted Sample Size to Improve the Accuracy of Inferences , 2004 .

[26]  Christine DiStefano,et al.  The Impact of Categorization With Confirmatory Factor Analysis , 2002 .

[27]  Bengt,et al.  Latent Variable Analysis With Categorical Outcomes : Multiple-Group And Growth Modeling In Mplus , 2002 .

[28]  J. Schafer,et al.  A comparison of inclusive and restrictive strategies in modern missing data procedures. , 2001, Psychological methods.

[29]  Albert Maydeu-Olivares,et al.  Limited information estimation and testing of Thurstonian models for paired comparison data under multiple judgment sampling , 2001 .

[30]  K. Bollen,et al.  Improper Solutions in Structural Equation Models , 2001 .

[31]  P M Bentler,et al.  Normal theory based test statistics in structural equation modelling. , 1998, The British journal of mathematical and statistical psychology.

[32]  A. Boomsma,et al.  Robustness Studies in Covariance Structure Modeling , 1998 .

[33]  S. Crowley,et al.  Structural equation modeling: basic concepts and applications in personality assessment research. , 1997, Journal of personality assessment.

[34]  B. Muthén,et al.  Robust inference using weighted least squares and quadratic estimating equations in latent variable modeling with categorical and continuous outcomes , 1997 .

[35]  Theresa M. Akey,et al.  Effect of the number of scale points on chi‐square fit indices in confirmatory factor analysis , 1997 .

[36]  S. West,et al.  The robustness of test statistics to nonnormality and specification error in confirmatory factor analysis. , 1996 .

[37]  P M Bentler,et al.  A two-stage estimation of structural equation models with continuous and polytomous variables. , 1995, The British journal of mathematical and statistical psychology.

[38]  T. Good,et al.  Contemporary Educational Psychology , 1995 .

[39]  Conor V. Dolan,et al.  Factor analysis of variables with 2, 3, 5, and 7 response categories: A comparison of categorical variable estimators using simulated data , 1994 .

[40]  Karl G. Jöreskog,et al.  On the estimation of polychoric correlations and their asymptotic covariance matrix , 1994 .

[41]  A. Satorra,et al.  Corrections to test statistics and standard errors in covariance structure analysis. , 1994 .

[42]  Margaret J. Potthast,et al.  Confirmatory factor analysis of ordered categorical variables with large models , 1993 .

[43]  F. Baker,et al.  Item response theory : parameter estimation techniques , 1993 .

[44]  J. S. Long,et al.  Testing Structural Equation Models , 1993 .

[45]  Y Kano,et al.  Can test statistics in covariance structure analysis be trusted? , 1992, Psychological bulletin.

[46]  Edward E. Rigdon,et al.  The Performance of the Polychoric Correlation Coefficient and Selected Fitting Functions in Confirmatory Factor Analysis with Ordinal Data , 1991 .

[47]  D. Knol,et al.  Empirical Comparison Between Factor Analysis and Multidimensional Item Response Models. , 1991, Multivariate behavioral research.

[48]  Peter M. Bentler,et al.  A three-stage estimation procedure for structural equation models with polytomous variables , 1990 .

[49]  Ira H. Bernstein,et al.  Factoring items and factoring scales are different: Spurious evidence for multidimensionality due to item categorization. , 1989 .

[50]  E. Muraki,et al.  Full-Information Item Factor Analysis , 1988 .

[51]  Bengt Muthén,et al.  Testing the assumptions underlying tetrachoric correlations , 1988 .

[52]  Jan de Leeuw,et al.  On the relationship between item response theory and factor analysis of discretized variables , 1987 .

[53]  Emin Babakus,et al.  The Sensitivity of Confirmatory Maximum Likelihood Factor Analysis to Violations of Measurement Scale and Distributional Assumptions , 1987 .

[54]  D. Bartholomew Latent Variable Models And Factor Analysis , 1987 .

[55]  B. Muthén,et al.  A comparison of some methodologies for the factor analysis of non‐normal Likert variables , 1985 .

[56]  M. Browne Asymptotically distribution-free methods for the analysis of covariance structures. , 1984, The British journal of mathematical and statistical psychology.

[57]  B. Muthén A general structural equation model with dichotomous, ordered categorical, and continuous latent variable indicators , 1984 .

[58]  C. D. Vale,et al.  Simulating multivariate nonnormal distributions , 1983 .

[59]  D. R. Johnson,et al.  Ordinal measures in multiple indicator models: A simulation study of categorization error. , 1983 .

[60]  R. D. Bock,et al.  Marginal maximum likelihood estimation of item parameters , 1982 .

[61]  R. D. Bock,et al.  Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm , 1981 .

[62]  K. Bollen,et al.  Pearson's R and Coarsely Categorized Measures , 1981 .

[63]  Ulf Olsson,et al.  Maximum likelihood estimation of the polychoric correlation coefficient , 1979 .

[64]  U. Olsson On The Robustness Of Factor Analysis Against Crude Classification Of The Observations. , 1979, Multivariate behavioral research.

[65]  Allen I. Fleishman A method for simulating non-normal distributions , 1978 .

[66]  B. Muthén Contributions to factor analysis of dichotomous variables , 1978 .

[67]  Anders Christoffersson,et al.  Factor analysis of dichotomized variables , 1975 .

[68]  R. Darrell Bock,et al.  Fitting a response model forn dichotomously scored items , 1970 .

[69]  G. A. Ferguson,et al.  The factorial interpretation of test difficulty , 1941 .

[70]  Karl Pearson,et al.  Mathematical contributions to the theory of evolution. VIII. On the correlation of characters not quantitatively measurable , 1900, Proceedings of the Royal Society of London.

[71]  HighWire Press Philosophical Transactions of the Royal Society of London , 1781, The London Medical Journal.