论文信息 - Comparison of performance of variants of single-layer perceptron algorithms on nonseparable data

Comparison of performance of variants of single-layer perceptron algorithms on nonseparable data

We present a detailed experimental comparison of the pocket algorithm thermal percep tron and barycentric correction procedure algorithms that most commonly used algorithms for training threshold logic units TLUs Each of these algorithms represent stable variants of the standard perceptron learning rule in that they guarantee convergence to zero classi cation errors on datasets that are linearly separable and attempt to classify as large a subset of the training patterns as possible for datasets that are not linearly separable For datasets involving patterns distributed amongM di erent categories M a group of M TLUs is trained one for each of the output classes These TLU s can be trained either independently or as a winner take all WTA group The latter mechanism accounts for the interactions among the di erent output classes and exploits the fact that a pattern can ideally belong to only one of the M output classes The extension of the pocket algorithm to the WTA output strategy is direct In this paper we present heuristic extensions of the thermal perceptron and the barycentric correction procedure to WTA groups and empirically verify their performance The performance of these algorithms was measured in a collection of carefully chosen benchmarks datasets We report the training and generalization accuracies of these algorithms on the di erent datasets along with the learning time in seconds In addition a comparison of the learning speeds of the al gorithms is indicated by means of learning curve plots on two datasets We identify and report some distinguishing traits of these algorithms which could possibly enable making an informed choice of the training algorithm combined with constructive learning algorithms when certain characteristics of the dataset are known

[1] Vasant Honavar. Machine learning: Principles and applications , 1999 .

[2] Vasant G Honavar,et al. MTiling A Constructive Neural Network Learning Algorithm for Multi Category Pattern Classi cation , 1996 .

[3] Jean-Pierre Nadal,et al. Study of a Growth Algorithm for a Feedforward Network , 1989, Int. J. Neural Syst..

[4] Marvin Minsky,et al. Perceptrons: An Introduction to Computational Geometry , 1969 .

[5] Geoffrey E. Hinton,et al. Learning internal representations by error propagation , 1986 .

[6] Jihoon Yang,et al. DistAl: an inter-pattern distance-based constructive learning algorithm , 1998, 1998 IEEE International Joint Conference on Neural Networks Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36227).

[7] Nils J. Nilsson,et al. The Mathematical Foundations of Learning Machines , 1990 .

[8] Vasant Honavar,et al. Generative learning structures for generalized connectionist networks , 1990 .

[9] M. Golea,et al. A Convergence Theorem for Sequential Learning in Two-Layer Perceptrons , 1990 .

[10] Rajesh Parekh,et al. Analysis of Decision Boundaries Generated by Constructive Neural Network Learning Algorithms , 1995 .

[11] John J. Shynk,et al. Performance surfaces of a single-layer perceptron , 1990, IEEE Trans. Neural Networks.

[12] Jihoon Yang,et al. MUpstart-a constructive neural network learning algorithm for multi-category pattern classification , 1997, Proceedings of International Conference on Neural Networks (ICNN'97).

[13] Stephen I. Gallant,et al. Perceptron-based learning algorithms , 1990, IEEE Trans. Neural Networks.

[14] Marcus R. Frean,et al. A "Thermal" Perceptron Learning Rule , 1992, Neural Computation.

[15] Neil Burgess,et al. A Constructive Algorithm that Converges for Real-Valued Input Patterns , 1994, Int. J. Neural Syst..

[16] Tomas Hrycej,et al. Modular Learning in Neural Networks: A Modularized Approach to Neural Network Classification , 1992 .

[17] Marcus Frean,et al. The Upstart Algorithm: A Method for Constructing and Training Feedforward Neural Networks , 1990, Neural Computation.

[18] F ROSENBLATT,et al. The perceptron: a probabilistic model for information storage and organization in the brain. , 1958, Psychological review.

[19] Stephen I. Gallant,et al. Neural network learning and expert systems , 1993 .

[20] Rajesh Parekh,et al. Constructive learning: inducing grammars and neural networks , 1998 .