论文信息 - Simple and Globally Convergent Methods for Accelerating the Convergence of Any EM Algorithm

Simple and Globally Convergent Methods for Accelerating the Convergence of Any EM Algorithm

Abstract. The expectation‐maximization (EM) algorithm is a popular approach for obtaining maximum likelihood estimates in incomplete data problems because of its simplicity and stability (e.g. monotonic increase of likelihood). However, in many applications the stability of EM is attained at the expense of slow, linear convergence. We have developed a new class of iterative schemes, called squared iterative methods (SQUAREM), to accelerate EM, without compromising on simplicity and stability. SQUAREM generally achieves superlinear convergence in problems with a large fraction of missing information. Globally convergent schemes are easily obtained by viewing SQUAREM as a continuation of EM. SQUAREM is especially attractive in high‐dimensional problems, and in problems where model‐specific analytic insights are not available. SQUAREM can be readily implemented as an ‘off‐the‐shelf’ accelerator of any EM‐type algorithm, as it only requires the EM parameter updating. We present four examples to demonstrate the effectiveness of SQUAREM. A general‐purpose implementation (written in R) is available.

R. Varadhan | C. Roland

[1] V. Hasselblad. Finite mixtures of distributions from the exponential family , 1969 .

[2] Peter Lancaster,et al. The theory of matrices , 1969 .

[3] James M. Ortega,et al. Iterative solution of nonlinear equations in several variables , 2014, Computer science and applied mathematics.

[4] J. Ortega. Stability of Difference Equations and Convergence of Iterative Processes , 1973 .

[5] T. Louis. Finding the Observed Information Matrix When Using the EM Algorithm , 1982 .

[6] New York Dover,et al. ON THE CONVERGENCE PROPERTIES OF THE EM ALGORITHM , 1983 .

[7] John E. Dennis,et al. Numerical methods for unconstrained optimization and nonlinear equations , 1983, Prentice Hall series in computational mathematics.

[8] A. Sidi,et al. Extrapolation methods for vector sequences , 1987 .

[9] J. Borwein,et al. Two-Point Step Size Gradient Methods , 1988 .

[10] Y. Nievergelt. Aitken's and Steffensen's accelerations in several variables , 1991 .

[11] R. Jennrich,et al. Conjugate Gradient Acceleration of the EM Algorithm , 1993 .