论文信息 - A Model for Learning Topographically Organized Parts-Based Representations of Objects in Visual Cortex: Topographic Nonnegative Matrix Factorization

A Model for Learning Topographically Organized Parts-Based Representations of Objects in Visual Cortex: Topographic Nonnegative Matrix Factorization

Object representation in the inferior temporal cortex (IT), an area of visual cortex critical for object recognition in the primate, exhibits two prominent properties: (1) objects are represented by the combined activity of columnar clusters of neurons, with each cluster representing component features or parts of objects, and (2) closely related features are continuously represented along the tangential direction of individual columnar clusters. Here we propose a learning model that reflects these properties of parts-based representation and topographic organization in a unified framework. This model is based on a nonnegative matrix factorization (NMF) basis decomposition method. NMF alone provides a parts-based representation where nonnegative inputs are approximated by additive combinations of nonnegative basis functions. Our proposed model of topographic NMF (TNMF) incorporates neighborhood connections between NMF basis functions arranged on a topographic map and attains the topographic property without losing the parts-based property of the NMF. The TNMF represents an input by multiple activity peaks to describe diverse information, whereas conventional topographic models, such as the self-organizing map (SOM), represent an input by a single activity peak in a topographic map. We demonstrate the parts-based and topographic properties of the TNMF by constructing a hierarchical model for object recognition where the TNMF is at the top tier for learning high-level object features. The TNMF showed better generalization performance over NMF for a data set of continuous view change of an image and more robustly preserving the continuity of the view change in its object representation. Comparison of the outputs of our model with actual neural responses recorded in the IT indicates that the TNMF reconstructs the neuronal responses better than the SOM, giving plausibility to the parts-based learning of the model.

[1] D. Hubel,et al. Shape and arrangement of columns in cat's striate cortex , 1963, The Journal of physiology.

[2] A. Borst. Seeing smells: imaging olfactory learning in bees , 1999, Nature Neuroscience.

[3] T. Poggio,et al. Hierarchical models of object recognition in cortex , 1999, Nature Neuroscience.

[4] Alexandre Pouget,et al. Probabilistic Interpretation of Population Codes , 1996, Neural Computation.

[5] Tomaso Poggio,et al. Learning a dictionary of shape-components in visual cortex: comparison with neurons, humans and machines , 2006 .

[6] Minami Ito,et al. Columns for visual features of objects in monkey inferotemporal cortex , 1992, Nature.

[7] Patrik O. Hoyer,et al. Non-negative Matrix Factorization with Sparseness Constraints , 2004, J. Mach. Learn. Res..

[8] C. Gross,et al. How inferior temporal cortex became a visual area. , 1994, Cerebral cortex.

[9] Xin Liu,et al. Document clustering based on non-negative matrix factorization , 2003, SIGIR.

[10] M. Ito,et al. Processing of contrast polarity of visual images in inferotemporal cortex of the macaque monkey. , 1994, Cerebral cortex.

[11] I Fujita,et al. Intrinsic connections in the macaque inferior temporal cortex , 1996, The Journal of comparative neurology.

[12] Y. Yamane,et al. Complex objects are represented in macaque inferotemporal cortex by the combination of feature columns , 2001, Nature Neuroscience.

[13] Aapo Hyvärinen,et al. Topographic Independent Component Analysis , 2001, Neural Computation.

[14] I. Fujita,et al. Neuronal mechanisms of selectivity for object features revealed by blocking inhibition in inferotemporal cortex , 2000, Nature Neuroscience.

[15] Dezhe Z. Jin,et al. The Coordinated Mapping of Visual Space and Response Features in Visual Cortex , 2005, Neuron.

[16] C. Gross,et al. Neural ensemble coding in inferior temporal cortex. , 1994, Journal of neurophysiology.

[17] Masato Okada,et al. Self-organization of globally continuous and locally distributed information representation , 2004, Neural Networks.

[18] Stan Z. Li,et al. Learning spatially localized, parts-based representation , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[19] Ichiro Fujita,et al. Quantitative analysis of functional clustering of neurons in the macaque inferior temporal cortex , 2005, Neuroscience Research.

[20] Seungjin Choi,et al. Nonnegative features of spectro-temporal sounds for classification , 2005, Pattern Recognit. Lett..

[21] Peter Dayan,et al. Doubly Distributional Population Codes: Simultaneous Representation of Uncertainty and Multiplicity , 2003, Neural Computation.

[22] Teuvo Kohonen,et al. Self-Organization and Associative Memory , 1988 .

[23] James A. Reggia,et al. Mirror Symmetric Topographic Maps Can Arise from Activity-Dependent Synaptic Changes , 2005, Neural Computation.

[24] I. Fujita,et al. Organization of horizontal axons in the inferior temporal cortex and primary visual cortex of the macaque monkey. , 2005, Cerebral cortex.

[25] Dmitri B. Chklovskii,et al. Orientation Preference Patterns in Mammalian Visual Cortex A Wire Length Minimization Approach , 2001, Neuron.

[26] Leslie G. Ungerleider,et al. Object vision and spatial vision: two cortical pathways , 1983, Trends in Neurosciences.

[27] Christoph Schnörr,et al. Learning Sparse Representations by Non-Negative Matrix Factorization and Sequential Cone Programming , 2006, J. Mach. Learn. Res..

[28] H. Ritter,et al. A principle for the formation of the spatial structure of cortical feature maps. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[29] Nicholas V. Swindale,et al. Coverage and the design of striate cortex , 1991, Biological Cybernetics.

[30] Ichiro Fujita,et al. Presumed inhibitory neurons in the macaque inferior temporal cortex: visual response properties and functional interactions with adjacent neurons. , 2004, Journal of neurophysiology.

[31] Heiko Wersing,et al. Learning Optimized Features for Hierarchical Models of Invariant Object Recognition , 2003, Neural Computation.

[32] TJ Gawne,et al. How independent are the messages carried by adjacent inferior temporal cortical neurons? , 1993, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[33] H. Sebastian Seung,et al. Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[34] H. Sebastian Seung,et al. Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[35] Keiji Tanaka,et al. Coding visual images of objects in the inferotemporal cortex of the macaque monkey. , 1991, Journal of neurophysiology.

[36] Tomaso Poggio,et al. Trade-Off between Object Selectivity and Tolerance in Monkey Inferotemporal Cortex , 2007, The Journal of Neuroscience.

[37] Sven Behnke,et al. Topological features in locally connected RBMs , 2010, The 2010 International Joint Conference on Neural Networks (IJCNN).

[38] Teuvo Kohonen,et al. Self-organization and associative memory: 3rd edition , 1989 .

[39] Keiji Tanaka. Columns for complex visual object features in the inferotemporal cortex: clustering of cells with similar but slightly different stimulus selectivities. , 2003, Cerebral cortex.

[40] James A. Reggia,et al. Temporally Asymmetric Learning Supports Sequence Processing in Multi-Winner Self-Organizing Maps , 2004, Neural Computation.

[41] G. Buchsbaum,et al. Color categories revealed by non-negative matrix factorization of Munsell color spectra , 2002, Vision Research.

[42] Richard Durbin,et al. A dimension reduction framework for understanding cortical maps , 1990, Nature.

[43] Juan Humberto Sossa Azuela,et al. 3D Object Recognition Based on Some Aspects of the Infant Vision System and Associative Memory , 2009, Cognitive Computation.

[44] Keiji Tanaka,et al. Functional architecture in monkey inferotemporal cortex revealed by in vivo optical imaging , 1998, Neuroscience Research.

[45] I. Fujita. The inferior temporal cortex: Architecture, computation, and representation , 2002, Journal of neurocytology.

[46] Kunihiko Fukushima,et al. Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position , 1980, Biological Cybernetics.

[47] D. Hubel,et al. Receptive fields, binocular interaction and functional architecture in the cat's visual cortex , 1962, The Journal of physiology.

[48] C. Connor,et al. Population coding of shape in area V4 , 2002, Nature Neuroscience.

[49] Thomas Serre,et al. A Theory of Object Recognition: Computations and Circuits in the Feedforward Path of the Ventral Stream in Primate Visual Cortex , 2005 .