Slow, Decorrelated Features for Pretraining Complex Cell-like Networks

We introduce a new type of neural network activation function based on recent physiological rate models for complex cells in visual area V1. A single-hidden-layer neural network built from this kind of unit achieves 1.50% error on MNIST. We also introduce the use of an existing criterion for learning slow, decorrelated features as a pretraining strategy for image models. This pretraining strategy yields orientation-selective features, similar to the receptive fields of complex cells. With this pretraining, the same single-hidden-layer model achieves 1.34% error, even though the pretraining sample distribution is very different from the fine-tuning distribution. To implement this pretraining strategy, we derive a fast algorithm for online learning of decorrelated features in which each iteration runs in time linear in the number of features.
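
For a concrete picture of the two ingredients above, the numpy sketch below pairs an energy-model-style complex-cell activation (two linear filters per unit, pooled quadratically) with a slowness-plus-decorrelation penalty over a short sequence of frames. The filter shapes, the pooling nonlinearity, and the penalty weight are illustrative assumptions rather than the paper's exact formulation, and the decorrelation term here naively builds the full feature correlation matrix (quadratic in the number of features) instead of the paper's linear-time online update.

```python
# Minimal sketch (assumed forms, not the published model) of:
#  1) an energy-model-style "complex cell" activation, and
#  2) a slowness + decorrelation pretraining penalty on a sequence of frames.
import numpy as np

rng = np.random.default_rng(0)
n_inputs, n_hidden = 784, 100            # e.g. MNIST-sized inputs

# Two filter banks per hidden unit, pooled quadratically (classic energy model).
W1 = 0.01 * rng.standard_normal((n_inputs, n_hidden))
W2 = 0.01 * rng.standard_normal((n_inputs, n_hidden))

def complex_cell_features(X):
    """Pool two linear filter responses per unit into one energy-style output."""
    return np.sqrt((X @ W1) ** 2 + (X @ W2) ** 2 + 1e-8)

def slow_decorrelated_loss(H, lambda_decor=1.0):
    """Slowness + decorrelation penalty on a (time, features) activation matrix.

    Slowness: mean squared difference between consecutive frames.
    Decorrelation: squared off-diagonal entries of the feature correlation matrix
    (computed in full here, so quadratic in the number of features).
    """
    slowness = np.mean((H[1:] - H[:-1]) ** 2)
    Hc = (H - H.mean(axis=0)) / (H.std(axis=0) + 1e-8)
    C = (Hc.T @ Hc) / len(H)              # feature correlation matrix
    off_diag = C - np.diag(np.diag(C))
    decorrelation = np.mean(off_diag ** 2)
    return slowness + lambda_decor * decorrelation

# Toy "video": a random walk over frames, so consecutive frames are correlated.
frames = np.cumsum(0.1 * rng.standard_normal((20, n_inputs)), axis=0)
H = complex_cell_features(frames)
print(slow_decorrelated_loss(H))
```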
