论文信息 - Learning the 2-D Topology of Images

Learning the 2-D Topology of Images

We study the following question: is the two-dimensional structure of images a very strong prior or is it something that can be learned with a few examples of natural images? If someone gave us a learning task involving images for which the two-dimensional topology of pixels was not known, could we discover it automatically and exploit it? For example suppose that the pixels had been permuted in a fixed but unknown way, could we recover the relative two-dimensional location of pixels on images? The surprising result presented here is that not only the answer is yes, but that about as few as a thousand images are enough to approximately recover the relative locations of about a thousand pixels. This is achieved using a manifold learning algorithm applied to pixels associated with a measure of distributional similarity between pixel intensities. We compare different topology-extraction approaches and show how having the two-dimensional topology can be exploited.

[1] G LoweDavid,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[2] Jason Weston,et al. Large-scale kernel machines , 2007 .

[3] V. Vapnik. Estimation of Dependences Based on Empirical Data , 2006 .

[4] Lawrence D. Jackel,et al. Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[5] Michael S. Lewicki,et al. Unsupervised image classification, segmentation, and enhancement using ICA mixture models , 2002, IEEE Trans. Image Process..

[6] Jason Weston,et al. Scaling Learning Algorithms toward AI , 2007 .

[7] S T Roweis,et al. Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[8] Geoffrey E. Hinton,et al. Topographic Product Models Applied to Natural Scene Statistics , 2006, Neural Computation.

[9] J.S. Denker,et al. Natural Versus "universal" Probability, Complexity, And Entropy , 1992, Workshop on Physics and Computation.

[10] Yee Whye Teh,et al. A New View of ICA , 2001 .

[11] Kilian Q. Weinberger,et al. An Introduction to Nonlinear Dimensionality Reduction by Maximum Variance Unfolding , 2006, AAAI.

[12] Aapo Hyvärinen,et al. Topographic Independent Component Analysis , 2001, Neural Computation.

[13] Alexander H. Waibel,et al. Modular Construction of Time-Delay Neural Networks for Speech Recognition , 1989, Neural Computation.

[14] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[15] J. Tenenbaum,et al. A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[16] George Tzanetakis,et al. Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[17] Yoshua Bengio,et al. Scaling learning algorithms towards AI , 2007 .