Backpropagation without weight transport

In backpropagation, connection weights are used both to compute node activations and to compute the error gradients for hidden units. Grossberg (1987) has argued that this dual use of the same synaptic connections (weight transport) requires a bidirectional flow of information through synapses, which is biologically implausible. In this paper we formally and empirically demonstrate the feasibility of an architecture equivalent to backpropagation, but without the assumption of weight transport. Through coordinated training with weight decay, a reciprocal layer of weights evolves into a copy of the forward connections and acts as the conduit for backward-flowing corrective information. Examination of the networks trained with dual weights suggests that functional synchronization, not weight synchronization, is crucial to the operation of backpropagation methods.
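
The following is a minimal NumPy sketch, not the paper's experiments, of the mechanism the abstract describes: a separate backward weight matrix (here called B2, an illustrative name) carries the error signal in place of the transpose of the forward weights, and both matrices receive the same weight change plus weight decay, so their difference shrinks over training. The toy regression task, layer sizes, and learning-rate values are assumptions for illustration only.

    import numpy as np

    rng = np.random.default_rng(0)

    # Toy two-layer network x -> h -> y on a random regression task.
    n_in, n_hid, n_out = 8, 16, 4
    W1 = rng.normal(scale=0.1, size=(n_hid, n_in))   # forward weights, layer 1
    W2 = rng.normal(scale=0.1, size=(n_out, n_hid))  # forward weights, layer 2
    B2 = rng.normal(scale=0.1, size=(n_out, n_hid))  # separate backward weights
                                                     # (stand-in for W2.T)

    lr, decay = 0.05, 0.01
    X = rng.normal(size=(128, n_in))
    T = rng.normal(size=(128, n_out))

    for epoch in range(500):
        # Forward pass: tanh hidden layer, linear output.
        h = np.tanh(X @ W1.T)
        y = h @ W2.T
        e = y - T                                    # output error

        # Backward pass uses B2 rather than W2.T: no weight transport.
        delta_h = (e @ B2) * (1.0 - h**2)

        # The reciprocal pair W2/B2 shares the same weight change and the
        # same decay term, so the two matrices converge toward each other.
        dW2 = e.T @ h / len(X)
        W1 -= lr * (delta_h.T @ X / len(X) + decay * W1)
        W2 -= lr * (dW2 + decay * W2)
        B2 -= lr * (dW2 + decay * B2)

    # After training, forward and backward weights should be nearly aligned.
    print("W2/B2 alignment:",
          np.sum(W2 * B2) / (np.linalg.norm(W2) * np.linalg.norm(B2)))

Because both matrices receive the identical update plus decay, their difference is multiplied by (1 - lr * decay) at every step and decays toward zero, which is the sense in which the reciprocal layer evolves into a copy of the forward connections.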