Adding Missing-Data-Relevant Variables to FIML-Based Structural Equation Models

Conventional wisdom in missing data research dictates adding variables to the missing data model when those variables are predictive of (a) missingness and (b) the variables containing missingness. However, it has recently been shown that adding variables that are correlated with variables containing missingness, whether or not they are related to missingness, can substantially improve estimation (bias and efficiency). Including large numbers of these "auxiliary" variables is straightforward for researchers who use multiple imputation. However, what is the researcher to do if 1 of the FIML/SEM procedures is the analysis of choice? This article suggests 2 models for SEM analysis with missing data, and presents simulation results to show that both models provide estimation that is clearly as good as analysis with the EM algorithm, and by extension, multiple imputation. One of these models, the saturated correlates model, also provides good estimates of model fit.

[1]  L. Tucker,et al.  A reliability coefficient for maximum likelihood factor analysis , 1973 .

[2]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[3]  P. Bentler,et al.  Significance Tests and Goodness of Fit in the Analysis of Covariance Structures , 1980 .

[4]  D. Rubin,et al.  Statistical Analysis with Missing Data , 1988 .

[5]  Karl G. Jöreskog,et al.  Lisrel 8: User's Reference Guide , 1997 .

[6]  P. Bentler,et al.  Comparative fit indexes in structural models. , 1990, Psychological bulletin.

[7]  M. Browne,et al.  Alternative Ways of Assessing Model Fit , 1992 .

[8]  D P MacKinnon,et al.  Maximizing the Usefulness of Data Obtained with Planned Missing Value Patterns: An Application of Maximum Likelihood Procedures. , 1996, Multivariate behavioral research.

[9]  John W. Graham,et al.  Analysis With Missing Data in Prevention Research , 1997 .

[10]  Michael Windle,et al.  The science of prevention: methodological advances from alcohol and substance abuse research. , 1997 .

[11]  S. L. Hyatt,et al.  Using latent transition analysis to examine the relationship between parental permissiveness and the onset of substance use , 2000 .

[12]  S. L. Hyatt,et al.  Latent transition analysis as a way of testing models of stage-sequential change in longitudinal data. , 2000 .

[13]  Jürgen Baumert,et al.  Modeling longitudinal and multilevel data: Practical issues, applied approaches, and specific examples. , 2000 .

[14]  S. L. Hyatt,et al.  LTA as a way of testing models of stage-sequential change in longitudinal data , 2000 .

[15]  J. Schafer,et al.  A comparison of inclusive and restrictive strategies in modern missing data procedures. , 2001, Psychological methods.

[16]  J. Schafer,et al.  Missing data: our view of the state of the art. , 2002, Psychological methods.