A Fast Incremental Learning Algorithm for Feed-forward Neural Networks Using Resilient Propagation

Fast learning under incremental learning environments is very important in real situations as data are generated rapidly over time. However, the input-output relationships that are trained before tend to be destroyed when new data are received. Therefore, the information on the previous data tends to be lost when the data are drawn from the different data distribution. This phenomenon is called “interference” or catastrophic forgetting. To solve this problem, Resource Allocating Network with Long Term Memory (RAN-LTM) is proposed by Kobayashi et al. in order to suppress the interference. In the original RAN-LTM, both new training data and memory items are trained based on the gradient descent method. However, gradient descent method usually leads to the slow learning even for simple problems. On the other hand, Resilient Back-propagation (R-prop) performs a direct adaptation for the step size of the weight update based on local gradient information. The principal idea of R-prop is that the signs of two consecutive partial derivatives are used to determine how connection weights are updated. When the signs of two consecutive partial derivatives coincide with each other, the update-value is increased in order to accelerate the learning in the shallow regions. In contrast, when the signs of two consecutive partial derivatives are changed, the update value is decreased by a decrease value. By conducting these procedures, the number of learning steps is significantly reduced compared to the original gradient descent method. Considering the advantages of R-prop, we propose a fast incremental learning algorithm for feed-forward neural network where the gradient descent method in RAN-LTM is accelerated based on R-prop. The performance of the proposed method is evaluated for several data sets and the results demonstrated that learning time of the extended version of RAN-LTM is greatly reduced compared to the original RAN-LTM.

[1]  Yasue Mitsukura,et al.  Fast Incremental Algorithm of Simple Principal Component Analysis (特集 若手研究者) -- (ソフトコンピューティング・学習) , 2009 .

[2]  John C. Platt A Resource-Allocating Network for Function Interpolation , 1991, Neural Computation.

[3]  Shaoning Pang,et al.  Incremental Learning of Chunk Data for Online Pattern Classification Systems , 2008, IEEE Transactions on Neural Networks.

[4]  Robi Polikar,et al.  Incremental Learning of Concept Drift in Nonstationary Environments , 2011, IEEE Transactions on Neural Networks.

[5]  Martin A. Riedmiller,et al.  A direct adaptive method for faster backpropagation learning: the RPROP algorithm , 1993, IEEE International Conference on Neural Networks.

[6]  Shigeo Abe,et al.  Reducing computations in incremental learning for feedforward neural network with long-term memory , 2001, IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222).

[7]  A. Asuncion,et al.  UCI Machine Learning Repository, University of California, Irvine, School of Information and Computer Sciences , 2007 .

[8]  F. Girosi,et al.  Networks for approximation and learning , 1990, Proc. IEEE.

[9]  Minho Lee,et al.  Extension of Incremental Linear Discriminant Analysis to Online Feature Extraction under Nonstationary Environments , 2012, ICONIP.

[10]  Haitao Zhao,et al.  A novel incremental principal component analysis and its application for face recognition , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[11]  Seiichi Ozawa,et al.  A robust incremental principal component analysis for feature extraction from stream data with missing values , 2013, The 2013 International Joint Conference on Neural Networks (IJCNN).

[12]  Xin Yao,et al.  Negative correlation in incremental learning , 2009, Natural Computing.

[13]  Minho Lee,et al.  A real-time personal authentication system based on incremental feature extraction and classification of audiovisual information , 2011, Evol. Syst..

[14]  Martin A. Riedmiller,et al.  Rprop - Description and Implementation Details , 1994 .

[15]  Martin A. Riedmiller,et al.  Advanced supervised learning in multi-layer perceptrons — From backpropagation to adaptive learning algorithms , 1994 .