论文信息 - Streaming Normalization: Towards Simpler and More Biologically-plausible Normalizations for Online and Recurrent Learning - 字舞流文

Streaming Normalization: Towards Simpler and More Biologically-plausible Normalizations for Online and Recurrent Learning

This work was supported by the Center for Brains, Minds and Machines (CBMM), funded by NSF STC award CCF - 1231216.

Tomaso A. Poggio | Kenji Kawaguchi | Qianli Liao

[1] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[2] Andrea Vedaldi,et al. MatConvNet: Convolutional Neural Networks for MATLAB , 2014, ACM Multimedia.

[3] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4] Tomaso A. Poggio,et al. Learning Real and Boolean Functions: When Is Deep Better Than Shallow , 2016, ArXiv.

[5] Chong Wang,et al. Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin , 2015, ICML.

[6] S. Nelson,et al. Homeostatic plasticity in the developing nervous system , 2004, Nature Reviews Neuroscience.

[7] S. Ullman,et al. Adaptation and gain normalization , 1982, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[8] Tomaso Poggio,et al. Learning Functions: When Is Deep Better Than Shallow , 2016, 1603.00988.

[9] T. Poggio,et al. Deep vs. shallow networks : An approximation theory perspective , 2016, ArXiv.

[10] Ying Zhang,et al. Batch normalized recurrent neural networks , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[11] Tim Salimans,et al. Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks , 2016, NIPS.

[12] G. Turrigiano. The Self-Tuning Neuron: Synaptic Scaling of Excitatory Synapses , 2008, Cell.

[13] Aaron C. Courville,et al. Recurrent Batch Normalization , 2016, ICLR.

[14] Ruslan Salakhutdinov,et al. Path-SGD: Path-Normalized Optimization in Deep Neural Networks , 2015, NIPS.

[15] Yoshua Bengio,et al. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.

[16] Joel Z. Leibo,et al. How Important Is Weight Symmetry in Backpropagation? , 2015, AAAI.

[17] R. Malenka,et al. Synaptic scaling mediated by glial TNF-α , 2006, Nature.

[18] Kenji Kawaguchi,et al. Deep Learning without Poor Local Minima , 2016, NIPS.

[19] Geoffrey E. Hinton,et al. Layer Normalization , 2016, ArXiv.

[20] Tomaso A. Poggio,et al. Bridging the Gaps Between Residual Learning, Recurrent Neural Networks and Visual Cortex , 2016, ArXiv.