On structural equation modeling with data that are not missing completely at random

A general latent variable model is given which includes the specification of a missing data mechanism. This framework allows for an elucidating discussion of existing general multivariate theory bearing on maximum likelihood estimation with missing data. Here, missing completely at random is not a prerequisite for unbiased estimation in large samples, as when using the traditional listwise or pairwise present data approaches. The theory is connected with old and new results in the area of selection and factorial invariance. It is pointed out that in many applications, maximum likelihood estimation with missing data may be carried out by existing structural equation modeling software, such as LISREL and LISCOMP. Several sets of artifical data are generated within the general model framework. The proposed estimator is compared to the two traditional ones and found superior.

[1]  Karl G. Jöreskog,et al.  Simultaneous Analysis of Longitudinal Data From Several Cohorts , 1985 .

[2]  K. Jöreskog Simultaneous factor analysis in several populations , 1971 .

[3]  J. Heckman The Common Structure of Statistical Models of Truncation, Sample Selection and Limited Dependent Variables and a Simple Estimator for Such Models , 1976 .

[4]  H. Hartley Maximum Likelihood Estimation from Incomplete Data , 1958 .

[5]  W. Dixon,et al.  BMDP statistical software , 1983 .

[6]  C. Brown,et al.  Asymptotic comparison of missing data procedures for estimating factor loadings , 1983 .

[7]  R. R. Hocking,et al.  The analysis of incomplete data. , 1971 .

[8]  E. Beale,et al.  Missing Values in Multivariate Analysis , 1975 .

[9]  Donald B. Rubin,et al.  Maximum-Likelihood Estimation in Panel Studies with Missing Data , 1980 .

[10]  K. Jöreskog A general approach to confirmatory maximum likelihood factor analysis , 1969 .

[11]  R. D. Bock,et al.  Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm , 1981 .

[12]  R. Little,et al.  A note about models for selectivity bias. , 1985 .

[13]  C E Werts,et al.  Confirmatory Factor Analysis Applications: Missing Data Problems And Comparison Of Path Models Between Populations. , 1979, Multivariate behavioral research.

[14]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[15]  A. Boomsma On the robustness of LISREL (maximum likelihood estimation) against small sample size and non-normality. , 1984 .

[16]  T. W. Anderson Maximum Likelihood Estimates for a Multivariate Normal Distribution when Some Observations are Missing , 1957 .

[17]  Karl G. Jöreskog,et al.  Selectivity Problems in Quasi-Experimental Studies , 1983 .

[18]  B. Muthén A general structural equation model with dichotomous, ordered categorical, and continuous latent variable indicators , 1984 .

[19]  D. Lawley IV.—A Note on Karl Pearson's Selection Formulæ , 1944, Proceedings of the Royal Society of Edinburgh. Section A. Mathematical and Physical Sciences.

[20]  Donald B. Rubin,et al.  Characterizing the Estimation of Parameters in Incomplete-Data Problems , 1974 .

[21]  G. M. Tallis The Moment Generating Function of the Truncated Multi‐Normal Distribution , 1961 .

[22]  R. W. Wedderburn Quasi-likelihood functions, generalized linear models, and the Gauss-Newton method , 1974 .

[23]  Carl T. Finkbeiner Estimation for the multiple factor model when data are missing , 1979 .

[24]  K. Jöreskog,et al.  Analysis of linear structural relationships by maximum likelihood and least squares methods , 1983 .

[25]  B. Muthén,et al.  A comparison of some methodologies for the factor analysis of non‐normal Likert variables , 1985 .

[26]  R. Bargmann,et al.  MAXIMUM LIKELIHOOD ESTIMATION WITH INCOMPLETE MULTIVARIATE DATA , 1964 .

[27]  K. Pearson VII. On the General Theory of the Influence of Selection on Correlation and Variation , 2022 .

[28]  B. Muthén,et al.  Assessing Reliability and Stability in Panel Models , 1977 .

[29]  Jerry A. Hausman,et al.  Attrition Bias in Experimental and Panel Data: The Gary Income Maintenance Experiment , 1979 .

[30]  W. R. Buckland,et al.  Distributions in Statistics: Continuous Multivariate Distributions , 1973 .

[31]  R. Little Models for Nonresponse in Sample Surveys , 1982 .

[32]  S. Rosenbaum Moments of a Truncated Bivariate Normal Distribution , 1961 .

[33]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[34]  D. Rubin,et al.  Statistical Analysis with Missing Data , 1988 .

[35]  William Meredith,et al.  Notes on factorial invariance , 1964 .

[36]  R. D. Bock,et al.  Marginal maximum likelihood estimation of item parameters , 1982 .