Scaling Relationships in Back-Propagation Learning: Dependence on Training Set Size

We study the amount of time needed to learn a fixed training set in the "back-propagation" procedure for learning in multi-layer neural network models. The task chosen was 32-bit parity, a high-order function for which memorization of specific input-output pairs is necessary. For small training sets, the learning time is consistent with a ~-power law dependence on the number of patterns in the training set. For larger training sets, the learning time diverges at a critical training set size which appears to be related to the storage capacity of the network.
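The experimental setup described above can be sketched in miniature. The following is a minimal, hedged illustration (not the paper's code): a small multi-layer network trained by batch back-propagation on the parity function, recording the number of epochs needed to classify every training pattern correctly as the training set grows. All parameters here (4 input bits rather than 32, hidden-layer size, learning rate, convergence criterion) are illustrative assumptions, not values from the paper.

```python
import numpy as np

def parity_patterns(n_bits, n_patterns, rng):
    """Random binary input vectors labeled by their parity (sum mod 2)."""
    X = rng.integers(0, 2, size=(n_patterns, n_bits)).astype(float)
    y = (X.sum(axis=1) % 2).astype(float)
    return X, y

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def epochs_to_learn(X, y, hidden=8, lr=0.5, max_epochs=20000, seed=0):
    """Epochs of batch back-propagation until every training pattern is
    on the correct side of 0.5 (an illustrative convergence criterion)."""
    rng = np.random.default_rng(seed)
    n = X.shape[1]
    W1 = rng.normal(0.0, 0.5, (n, hidden)); b1 = np.zeros(hidden)
    W2 = rng.normal(0.0, 0.5, hidden);      b2 = 0.0
    for epoch in range(1, max_epochs + 1):
        h = sigmoid(X @ W1 + b1)          # hidden activations
        out = sigmoid(h @ W2 + b2)        # network output
        if np.all((out > 0.5) == (y > 0.5)):
            return epoch                  # training set memorized
        # Gradients of the summed squared error, propagated backward.
        d_out = (out - y) * out * (1 - out)
        d_h = np.outer(d_out, W2) * h * (1 - h)
        W2 -= lr * (h.T @ d_out); b2 -= lr * d_out.sum()
        W1 -= lr * (X.T @ d_h);   b1 -= lr * d_h.sum(axis=0)
    return max_epochs                     # failed to converge in budget

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Learning time versus training set size, as in the scaling study.
    for n_patterns in (2, 4, 8):
        X, y = parity_patterns(4, n_patterns, rng)
        print(n_patterns, epochs_to_learn(X, y))
```

With a fixed network size, sweeping `n_patterns` toward the full pattern space is the kind of measurement behind the reported divergence: past some critical set size, runs stop converging within any fixed epoch budget.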