The Effect of Adaptive Momentum in Improving the Accuracy of Gradient Descent Back Propagation Algorithm on Classification Problems

The traditional Gradient Descent Back-propagation Neural Network Algorithm is widely used in solving many practical applications around the globe. Despite providing successful solutions, it possesses a problem of slow convergence and sometimes getting stuck at local minima. Several modifications are suggested to improve the convergence rate of Gradient Descent Backpropagation algorithm such as careful selection of initial weights and biases, learning rate, momentum, network topology, activation function and ‘gain’ value in the activation function. In a certain variation, the previous researchers demonstrated that in “feed-forward algorithm”, the slope of activation function is directly influenced by ‘gain’ parameter. This research proposed an algorithm for improving the current working performance of Back-propagation algorithm by adaptively changing the momentum value and at the same time keeping the ‘gain’ parameter fixed for all nodes in the neural network. The performance of the proposed method known as ‘Gradient Descent Method with Adaptive Momentum (GDAM)’ is compared with the performances of ‘Gradient Descent Method with Adaptive Gain (GDM-AG)’ and ‘Gradient Descent with Simple Momentum (GDM)’. The learning rate is kept fixed while sigmoid activation function is used throughout the experiments. The efficiency of the proposed method is demonstrated by simulations on three classification problems. Results show that GDAM is far better than previous methods with an accuracy ratio of 1.0 for classification problems and can be used as an alternative approach of BPNN.

[1]  Kaspar Althoefer,et al.  Stability analysis of a three-term backpropagation algorithm , 2005, Neural Networks.

[2]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[3]  Paul Compton,et al.  Inductive knowledge acquisition: a case study , 1987 .

[4]  Shen Zhang,et al.  Improved BP Neural Network for Transformer Fault Diagnosis , 2007 .

[5]  Francesco Palmieri,et al.  A new algorithm for training multilayer perceptrons , 1989, Conference Proceedings., IEEE International Conference on Systems, Man and Cybernetics.

[6]  Rozaida Ghazali,et al.  The Effect of Adaptive Gain and Adaptive Momentum in Improving Training Time of Gradient Descent Back Propagation Algorithm on Classification Problems , 2011 .

[7]  Ben Coppin,et al.  Artificial Intelligence Illuminated , 2004 .

[8]  J. Ross Quinlan,et al.  Simplifying Decision Trees , 1987, Int. J. Man Mach. Stud..

[9]  M. A. Fkirin,et al.  Change detection using neural network in Toshka area , 2009, 2009 National Radio Science Conference.

[10]  Nazri Mohd Nawi,et al.  Noise-Induced Hearing Loss (NIHL) Prediction in Humans Using a Modified Back Propagation Neural Network , 2011 .

[11]  Paulo Cortez,et al.  Modeling wine preferences by data mining from physicochemical properties , 2009, Decis. Support Syst..

[12]  Nazri Mohd Nawi,et al.  An Improved Conjugate Gradient Based Learning Algorithm for Back Propagation Neural Networks , 2008 .

[13]  Wen-Chin Chen,et al.  Back-propagation neural network based importance-performance analysis for determining critical service attributes , 2008, Expert Syst. Appl..

[14]  Ehsan Mesbahi,et al.  Artificial neural networks: fundamentals , 2003 .

[15]  J. C. Schlimmer,et al.  Concept acquisition through representational adjustment , 1987 .

[16]  David E. Booth,et al.  A comparison of supervised and unsupervised neural networks in predicting bankruptcy of Korean firms , 2005, Expert Syst. Appl..

[17]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[18]  Bin-Da Liu,et al.  A backpropagation algorithm with adaptive learning rate and momentum coefficient , 2002, Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No.02CH37290).

[19]  O. Mangasarian,et al.  Multisurface method of pattern separation for medical diagnosis applied to breast cytology. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[20]  I A Basheer,et al.  Artificial neural networks: fundamentals, computing, design, and application. , 2000, Journal of microbiological methods.

[21]  Hongmei Shao,et al.  A New BP Algorithm with Adaptive Momentum for FNNs Training , 2009, 2009 WRI Global Congress on Intelligent Systems.

[22]  R.J. Mitchell,et al.  On Simple Adaptive Momentum , 2008, 2008 7th IEEE International Conference on Cybernetic Intelligent Systems.

[23]  Vladimir M. Krasnopolsky,et al.  Some neural network applications in environmental sciences. Part II: advancing computational efficiency of environmental numerical models , 2003, Neural Networks.

[24]  John Mark Bishop,et al.  Simple adaptive momentum: New algorithm for training multilayer perceptrons , 1994 .

[25]  Tsung-Lin Lee,et al.  Back-propagation neural network for the prediction of the short-term storm surge in Taichung harbor, Taiwan , 2008, Eng. Appl. Artif. Intell..

[26]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .