Scaling and Generalization in Neural Networks: A Case Study

Scaling and generalization have emerged as key issues in current studies of supervised learning from examples in neural networks. Questions such as how many training patterns and training cycles are needed for a problem of a given size and difficulty, how to represent the input, and how to choose useful training exemplars are of considerable theoretical and practical importance. Several intuitive rules of thumb have been obtained from empirical studies, but as yet there are few rigorous results. In this paper we summarize a study of generalization in the simplest possible case: perceptron networks learning linearly separable functions. The task chosen was the majority function (i.e., return a 1 if a majority of the input units are on), a predicate with a number of useful properties. We find that many aspects of generalization in multilayer networks learning large, difficult tasks are reproduced in this simple domain, in which concrete numerical results and even some analytic understanding can be achieved.
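
As a point of reference for the setup described above, the following is a minimal illustrative sketch (not code from the paper) of a perceptron trained with the classic perceptron learning rule on the majority function; the input size, number of training exemplars, and number of epochs are arbitrary choices for demonstration only.

```python
# Hypothetical sketch: perceptron learning the majority function on n binary inputs.
# Parameter values (n, n_patterns, epochs) are illustrative, not taken from the paper.
import numpy as np

rng = np.random.default_rng(0)

n = 11            # number of binary input units (odd, so "majority" is unambiguous)
n_patterns = 200  # number of random training exemplars
epochs = 50

# Random binary training patterns and their majority-function labels.
X = rng.integers(0, 2, size=(n_patterns, n))
y = (X.sum(axis=1) > n / 2).astype(int)  # 1 iff a majority of input units are on

w = np.zeros(n)
b = 0.0

# Perceptron learning rule: update weights only on misclassified patterns.
for _ in range(epochs):
    for x_i, y_i in zip(X, y):
        pred = int(w @ x_i + b > 0)
        if pred != y_i:
            w += (y_i - pred) * x_i
            b += (y_i - pred)

# Generalization: accuracy on fresh random patterns not seen during training.
X_test = rng.integers(0, 2, size=(1000, n))
y_test = (X_test.sum(axis=1) > n / 2).astype(int)
acc = np.mean((X_test @ w + b > 0).astype(int) == y_test)
print(f"generalization accuracy on held-out patterns: {acc:.3f}")
```

Because the majority function is linearly separable, the perceptron convergence theorem guarantees that the learning rule finds a separating weight vector on the training set; the quantity of interest is how accuracy on unseen patterns scales with the number of training exemplars.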