Assessing the Scalability of Biologically-Motivated Deep Learning Algorithms and Architectures

The backpropagation of error algorithm (BP) is impossible to implement in a real brain. The recent success of deep networks in machine learning and AI, however, has inspired proposals for understanding how the brain might learn across multiple layers, and hence how it might approximate BP. As of yet, none of these proposals have been rigorously evaluated on tasks where BP-guided deep learning has proved critical, or in architectures more structured than simple fully-connected networks. Here we present results on scaling up biologically motivated models of deep learning to datasets that require deep networks with appropriate architectures to achieve good performance. We evaluate variants of target propagation (TP) and feedback alignment (FA) on the MNIST, CIFAR-10, and ImageNet datasets, in both fully- and locally-connected architectures. We also introduce weight-transport-free variants of difference target propagation (DTP), modified to remove backpropagation from the penultimate layer. Many of these algorithms perform well on MNIST, but on CIFAR-10 and ImageNet we find that the TP and FA variants perform significantly worse than BP, especially in networks composed of locally connected units, raising questions about whether new architectures and algorithms are required to scale these approaches. Our results and implementation details help establish baselines for biologically motivated deep learning schemes going forward.
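
To make the weight-transport problem concrete: BP propagates errors backward through the transpose of the forward weights, which a real brain has no obvious way to access. Feedback alignment sidesteps this by propagating errors through a fixed random matrix instead. Below is a minimal numpy sketch of FA on a toy two-layer regression problem; it is our illustration, not the paper's implementation, and all dimensions, the learning rate, and the step count are arbitrary assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
in_dim, hidden_dim, out_dim, lr = 8, 16, 4, 0.05

W1 = rng.normal(0.0, 0.1, (hidden_dim, in_dim))   # forward weights, trained
W2 = rng.normal(0.0, 0.1, (out_dim, hidden_dim))  # forward weights, trained
B  = rng.normal(0.0, 0.1, (hidden_dim, out_dim))  # fixed random feedback matrix

x = rng.normal(size=in_dim)   # single toy input/target pair
y = rng.normal(size=out_dim)

for step in range(200):
    h = np.tanh(W1 @ x)                  # forward pass
    e = W2 @ h - y                       # output error: grad of 0.5*||W2 h - y||^2

    # BP would compute the hidden error as W2.T @ e ("weight transport").
    # Feedback alignment replaces that transpose with the fixed random B.
    delta_h = (B @ e) * (1.0 - h ** 2)   # elementwise tanh derivative

    W2 -= lr * np.outer(e, h)            # local, gradient-like updates
    W1 -= lr * np.outer(delta_h, x)

print("final loss:", 0.5 * float(np.sum((W2 @ np.tanh(W1 @ x) - y) ** 2)))
```

During training the forward weights come to align with the fixed feedback weights, so the random feedback delivers error signals that are, on average, useful for descent; this is the "alignment" the algorithm is named for.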

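Target propagation, by contrast, replaces backpropagated gradients with layer-wise activity targets pushed down through learned approximate inverses of each layer. The sketch below shows the DTP target computation; it assumes linear layers, fakes the learned inverse with a pseudo-inverse, and uses a gradient-free nudge at the top layer in the spirit of the weight-transport-free variants mentioned above, so it should be read as an illustration rather than the paper's method.

```python
import numpy as np

def dtp_targets(hs, top_target, inverses):
    """Difference target propagation of activity targets.

    hs         -- forward activations [h_1, ..., h_L]
    top_target -- desired activity for the top layer h_L
    inverses   -- inverses[l] approximately inverts the forward map
                  from hs[l] to hs[l + 1]

    The difference correction g(target) - g(h) cancels the reconstruction
    error of each imperfect inverse g.
    """
    targets = [None] * len(hs)
    targets[-1] = top_target
    for l in range(len(hs) - 2, -1, -1):
        g = inverses[l]
        targets[l] = hs[l] + g(targets[l + 1]) - g(hs[l + 1])
    return targets

# Toy usage: two layers; a pseudo-inverse stands in for a learned inverse.
rng = np.random.default_rng(1)
W1, W2 = rng.normal(size=(5, 4)), rng.normal(size=(3, 5))
x, y = rng.normal(size=4), rng.normal(size=3)

h1 = np.tanh(W1 @ x)
h2 = W2 @ h1
top_target = h2 - 0.1 * (h2 - y)   # nudge the top layer toward the label, no gradients
targets = dtp_targets([h1, h2], top_target, [lambda z: np.linalg.pinv(W2) @ z])
# Each layer would then be trained locally to move its output toward its target,
# e.g. by minimizing ||f_l(h_{l-1}) - targets[l]||^2 with a local delta rule.
```

Because W2 here is not invertible, the pseudo-inverse only approximately reconstructs h1; the difference correction is exactly what lets DTP tolerate such imperfect inverses.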