An EM Algorithm for ML Factor Analysis with Missing Data

EM algorithm is a popular algorithm for obtaining maximum likelihood estimates. Here we propose an EM algorithm for the factor analysis model. This algorithm extends a previously proposed EM algorithm to handle problems with missing data. It is simple to implement and is the most storage efficient among its competitors. We apply our algorithm to three examples and discuss the results. For problems with reasonable amount of missing data, it converges in reasonable time. For problems with large amount of missing data EM algorithm is usually slow. For such cases we successfully apply two EM acceleration methods to our examples. Finally, we discuss different methods of obtaining standard errors and in particular we recommend a method based on center difference approximation to the derivative.

[1]  Leonard S. Cahen,et al.  Educational Testing Service , 1970 .

[2]  Xiao-Li Meng,et al.  Using EM to Obtain Asymptotic Variance-Covariance Matrices: The SEM Algorithm , 1991 .

[3]  B. Efron,et al.  Assessing the accuracy of the maximum likelihood estimator: Observed versus expected Fisher information , 1978 .

[4]  Peter M. Bentler,et al.  EQS : structural equations program manual , 1989 .

[5]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[6]  R. Jennrich,et al.  Conjugate Gradient Acceleration of the EM Algorithm , 1993 .

[7]  Karl G. Jöreskog,et al.  A COMPUTER PROGRAM FOR MINIMIZING A FUNCTION OF SEVERAL VARIABLES1 , 1970 .

[8]  T. Louis Finding the Observed Information Matrix When Using the EM Algorithm , 1982 .

[9]  K. Holzinger,et al.  A study in factor analysis : the stability of a bi-factor solution , 1939 .

[10]  P. Allison Estimation of Linear Models with Incomplete Data , 1987 .

[11]  Dorothy T. Thayer,et al.  EM algorithms for ML factor analysis , 1982 .

[12]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[13]  John E. Dennis,et al.  Numerical methods for unconstrained optimization and nonlinear equations , 1983, Prentice Hall series in computational mathematics.

[14]  Michael E. Sobel,et al.  Pseudo-Maximum Likelihood Estimation of Mean and Covariance Structures with Missing Data , 1990 .

[15]  Richard H. Byrd,et al.  A Theoretical and Experimental Study of the Symmetric Rank-One Update , 1993, SIAM J. Optim..

[16]  Rory A. Fisher,et al.  Theory of Statistical Estimation , 1925, Mathematical Proceedings of the Cambridge Philosophical Society.

[17]  Sik-Yum Lee,et al.  Estimation for structural equation models with missing data , 1986 .

[18]  Mortaza Jamshidian,et al.  Conjugate gradient methods in confirmatory factor analysis , 1994 .

[19]  Bengt Muthén,et al.  On structural equation modeling with data that are not missing completely at random , 1987 .

[20]  C. G. Broyden A Class of Methods for Solving Nonlinear Simultaneous Equations , 1965 .

[21]  Karl G. Jöreskog,et al.  LISREL 7: A guide to the program and applications , 1988 .

[22]  Carl T. Finkbeiner Estimation for the multiple factor model when data are missing , 1979 .