Scaling in a hierarchical unsupervised network

A persistent worry with computational models of unsupervised learning is that learning will become more difficult as the problem is scaled up. We examine this issue in the context of a novel hierarchical generative model that can be viewed as a nonlinear generalisation of factor analysis and can be implemented in a neural network. The model performs perceptual inference in a probabilistically consistent manner by using top-down, bottom-up and lateral connections. These connections can be learned using simple rules that require only locally available information. We first demonstrate that the model can extract a sparse, distributed, hierarchical representation of global disparity from simplified random-dot stereograms. We then investigate some of the scaling properties of the algorithm on this problem and find that: 1) increasing the image size leads to faster and more reliable learning; 2) increasing the depth of the network from one to two hidden layers leads to better representations at the first hidden layer; and 3) once one part of the network has discovered how to represent disparity, it "supervises" other parts of the network, greatly speeding up their learning.
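
To make the training data concrete, the sketch below shows one way to generate simplified random-dot stereograms carrying a single global disparity. The abstract does not specify the stimulus format, so the details here are assumptions for illustration: each example is a pair of one-dimensional binary dot patterns in which the right image is a circular shift of the left image, with the shift (the disparity) hidden from the unsupervised learner.

```python
import numpy as np

def make_stereogram(n_pixels=16, disparities=(-1, 0, 1), dot_prob=0.5, rng=None):
    """Generate one simplified random-dot stereogram.

    Assumptions (not specified in the abstract): the stimulus is a pair of
    1-D binary images; the right image is the left image circularly shifted
    by a single global disparity drawn from `disparities`.
    """
    rng = np.random.default_rng() if rng is None else rng
    left = (rng.random(n_pixels) < dot_prob).astype(float)
    d = int(rng.choice(disparities))           # global disparity for this example
    right = np.roll(left, d)                   # the shift encodes the disparity
    return np.concatenate([left, right]), d    # network input and its hidden cause

# Example: a small batch of inputs; the disparities `d` are never shown
# to the learner, which must discover them as hidden causes.
X, disparities = zip(*[make_stereogram() for _ in range(100)])
X = np.stack(X)
```

Increasing `n_pixels` corresponds to the image-size scaling experiment described above, since a larger image provides more dot pairs that all share the same global disparity.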