A New Class of Incremental Gradient Methods for Least Squares Problems

The least mean squares (LMS) method for linear least squares problems differs from the steepest descent method in that it processes data blocks one-by-one, with intermediate adjustment of the parameter vector under optimization. This mode of operation often leads to faster convergence when far from the eventual limit and to slower (sublinear) convergence when close to the optimal solution. We embed both LMS and steepest descent, as well as other intermediate methods, within a one-parameter class of algorithms, and we propose a hybrid class of methods that combine the faster early convergence rate of LMS with the faster ultimate linear convergence rate of steepest descent. These methods are well suited for neural network training problems with large data sets. Furthermore, these methods allow the effective use of scaling based, for example, on diagonal or other approximations of the Hessian matrix.
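
For intuition, the sketch below contrasts the two extremes the abstract refers to on a synthetic linear least squares problem: a pure incremental (LMS-style) pass that updates the parameter vector after every data block, and batch steepest descent, which updates only after a full pass over the data. A naive hybrid that hard-switches from incremental to batch updates after a fixed number of epochs is included purely as an illustration of the trade-off; it does not reproduce the paper's one-parameter family, its blending of the two regimes, or its stepsize rules. The function names, stepsizes, and switching epoch are illustrative assumptions.

```python
# Illustrative sketch only: contrasts LMS-style incremental updates with
# batch steepest descent for min_x 0.5 * ||A x - b||^2, plus a naive hybrid
# that switches from incremental to batch updates after a fixed number of
# epochs.  The paper's one-parameter family is NOT reproduced here.
import numpy as np

rng = np.random.default_rng(0)
m, n = 200, 5                      # number of data blocks (rows) and parameters
A = rng.standard_normal((m, n))
b = A @ rng.standard_normal(n) + 0.1 * rng.standard_normal(m)


def lms_epoch(x, A, b, step):
    """One pass over the data, updating x after every block (LMS / incremental)."""
    for a_i, b_i in zip(A, b):
        x = x - step * (a_i @ x - b_i) * a_i
    return x


def steepest_descent_step(x, A, b, step):
    """One batch gradient step on f(x) = 0.5 * ||A x - b||^2."""
    return x - step * A.T @ (A @ x - b)


def hybrid(A, b, epochs=50, switch_epoch=10, inc_step=1e-2, batch_step=1e-3):
    """Incremental updates early (fast initial progress far from the solution),
    batch updates later (linear convergence near the solution).  The hard
    switch and the constant stepsizes are assumptions made for illustration."""
    x = np.zeros(A.shape[1])
    for k in range(epochs):
        if k < switch_epoch:
            x = lms_epoch(x, A, b, inc_step)
        else:
            x = steepest_descent_step(x, A, b, batch_step)
    return x


x_hat = hybrid(A, b)
x_star, *_ = np.linalg.lstsq(A, b, rcond=None)
print("distance to least-squares solution:", np.linalg.norm(x_hat - x_star))
```

In the methods proposed in the paper, the two regimes are combined through a single parameter and suitable stepsize choices rather than the hard switch used above; the switch is only meant to make the contrast between early incremental progress and ultimate linear convergence visible.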
