Representation is representation of similarities

Advanced perceptual systems are faced with the problem of securing a principled (ideally, veridical) relationship between the world and its internal representation. I propose a unified approach to visual representation, addressing the need for superordinate and basic-level categorization and for the identification of specific instances of familiar categories. According to the proposed theory, a shape is represented internally by the responses of a small number of tuned modules, each broadly selective for some reference shape and each measuring the similarity of the stimulus to that reference shape. This amounts to embedding the stimulus in a low-dimensional proximal shape space spanned by the outputs of the active modules. This shape space supports representations of distal shape similarities that are veridical in the sense of Shepard's (1968) second-order isomorphism (i.e., a correspondence between distal and proximal similarities among shapes, rather than between distal shapes and their proximal representations). Representation in terms of similarities to reference shapes supports processing (e.g., discrimination) of shapes that are radically different from the reference ones, without the need for the computationally problematic decomposition into parts required by other theories. Furthermore, a general expression for similarity between two stimuli, based on comparisons to reference shapes, can be used to derive models of perceived similarity ranging from continuous, symmetric, and hierarchical ones, as in multidimensional scaling (Shepard 1980), to discrete and nonhierarchical ones, as in the general contrast models (Shepard & Arabie 1979; Tversky 1977).
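The embedding described above can be illustrated with a minimal sketch. Everything specific in it is an assumption made for illustration and is not taken from the theory itself: shapes are stood in for by hypothetical 4-dimensional feature vectors, each module's broad tuning is modeled as a Gaussian of Euclidean distance to its reference shape, and the `width` value and the example data are arbitrary. The sketch embeds a few novel stimuli in the space spanned by three modules and then compares distal with proximal pairwise distances, which is one simple way to probe a second-order isomorphism.

```python
import math

def distance(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def module_response(stimulus, reference, width=1.0):
    """Graded similarity of a stimulus to one reference shape.

    The Gaussian profile is an assumption of this sketch; the text only
    requires that each module be broadly selective for its reference.
    """
    d = distance(stimulus, reference)
    return math.exp(-(d * d) / (2.0 * width * width))

def embed(stimulus, references, width=1.0):
    """Proximal representation: one coordinate per reference-shape module."""
    return [module_response(stimulus, r, width) for r in references]

def pairwise(points):
    """Distances between all unordered pairs of points, in a fixed order."""
    return [distance(points[i], points[j])
            for i in range(len(points))
            for j in range(i + 1, len(points))]

def pearson(xs, ys):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(vx * vy)

# Hypothetical reference shapes (one per module) and novel stimuli.
references = [(0.0, 0.0, 0.0, 0.0), (4.0, 0.0, 0.0, 0.0), (0.0, 4.0, 0.0, 0.0)]
stimuli = [(1.0, 1.0, 0.5, 0.0), (2.0, 1.0, 0.0, 0.5), (3.0, 1.0, 0.0, 0.0),
           (1.0, 3.0, 0.5, 0.5), (2.0, 2.0, 0.0, 0.0)]

# Embed each stimulus in the 3-dimensional proximal shape space.
proximal_points = [embed(s, references, width=3.0) for s in stimuli]

# Second-order comparison: do proximal distances track distal distances?
distal = pairwise(stimuli)
proximal = pairwise(proximal_points)
print("correlation of distal and proximal distances:", pearson(distal, proximal))
```

None of the stimuli coincides with a reference shape, yet each receives a graded three-coordinate description. A correlation close to 1 would indicate that the similarity structure among the stimuli is approximately preserved by the embedding; the value obtained depends on the assumed tuning width and on where the reference shapes lie.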

[1]  永福 智志 The Organization of Learning , 2005, Journal of Cognitive Neuroscience.

[2]  T. Poggio A theory of how the brain might work. , 1990, Cold Spring Harbor symposia on quantitative biology.

[3]  Shimon Edelman,et al.  Receptive field spaces and class-based generalization from a single view in face recognition , 1995 .

[4]  Nazir and O'Regan,et al.  Translation Invariance in Object Recognition, and Its Relation to Other Visual Transformations , 1997 .

[5]  Horace Barlow The Past, Present and Future of Feature Detectors , 1982 .

[6]  S. Harnad Categorical Perception: The Groundwork of Cognition , 1990 .

[7]  Roger N. Shepard,et al.  Connectionist Implementation of a Theory of Generalization , 1990, NIPS.

[8]  Stephen Grossberg,et al.  Fuzzy ARTMAP: A neural network architecture for incremental supervised learning of analog multidimensional maps , 1992, IEEE Trans. Neural Networks.

[9]  Robert L. Goldstone The role of similarity in categorization: providing a groundwork , 1994, Cognition.

[10]  O. Braddick Visual hyperacuity. , 1984, Nature.

[11]  F. Bookstein,et al.  Morphometric Tools for Landmark Data: Geometry and Biology , 1999 .

[12]  J. Brigham The Influence of Race on Face Recognition , 1986 .

[13]  Yann LeCun,et al.  Tangent Prop - A Formalism for Specifying Selected Invariances in an Adaptive Network , 1991, NIPS.

[14]  David J. Field,et al.  What Is the Goal of Sensory Coding? , 1994, Neural Computation.

[15]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[16]  D. Hubel,et al.  Receptive fields of single neurones in the cat's striate cortex , 1959, The Journal of physiology.

[17]  David Mumford,et al.  Mathematical theories of shape: do they model perception? , 1991, Optics & Photonics.

[18]  D. Marr A theory for cerebral neocortex , 1970, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[19]  Nathan Intrator,et al.  Combining Exploratory Projection Pursuit and Projection Pursuit Regression with Application to Neural Networks , 1993, Neural Computation.

[20]  Keiji Tanaka,et al.  Neuronal selectivities to complex object features in the ventral visual pathway of the macaque cerebral cortex. , 1994, Journal of neurophysiology.

[21]  Thomas D Albright Motion perception and the mind-body problem , 1991, Current Biology.

[22]  M. Tovée,et al.  Face Recognition: What are faces for? , 1995, Current Biology.

[23]  D Mumford,et al.  On the computational architecture of the neocortex. II. The role of cortico-cortical loops. , 1992, Biological cybernetics.

[24]  L. Maffei,et al.  Spatial Frequency Channels: Neural Mechanisms , 1978 .

[25]  Nathan Intrator,et al.  Objective function formulation of the BCM theory of visual cortical plasticity: Statistical connections, stability conditions , 1992, Neural Networks.

[26]  David Mumford,et al.  Neuronal Architectures for Pattern-theoretic Problems , 1995 .

[27]  S. Edelman,et al.  On Similarity to Prototypes in 3D Object Representation , 1995 .

[28]  D. Gentner,et al.  Respects for similarity , 1993 .

[29]  S. Sajami,et al.  Representation and reality , 1993 .

[30]  William T. Newsome,et al.  Cortical microstimulation influences perceptual judgements of motion direction , 1990, Nature.

[31]  I. Biederman Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.

[32]  N. Goodman,et al.  The Structure of Appearance. , 1953 .

[33]  Eric Saund,et al.  A Multiple Cause Mixture Model for Unsupervised Learning , 1995, Neural Computation.

[34]  P. Cavanagh Vision is Getting Easier Every Day , 1995, Perception.

[35]  K Sakai,et al.  Neuronal tuning and associative mechanisms in form representation. , 1994, Learning & memory.

[36]  Shimon Ullman,et al.  Sequence-Seeking and Counter Streams: A Model for Information Processing the Cortex , 1991 .

[37]  S. Grossberg,et al.  Fuzzy ART: an adaptive resonance algorithm for rapid, stable classification of analog patterns , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[38]  Horace Barlow,et al.  What is the computational goal of the neocortex , 1994 .

[39]  James R. Williamson,et al.  Gaussian ARTMAP: A Neural Network for Fast Incremental Learning of Noisy Multidimensional Maps , 1996, Neural Networks.

[40]  Christof Koch,et al.  Selecting One Among the Many: A Simple Network Implementing Shifts in Selective Visual Attention , 1984 .

[41]  Illusions and Hallucinations , 1897, Nature.

[42]  Amos Tversky,et al.  Studies of similarity , 1978 .

[43]  G. Brelstaff,et al.  Is the Richness of Our Visual World an Illusion? Transsaccadic Memory for Complex Scenes , 1995, Perception.

[44]  A. Tversky Features of Similarity , 1977 .

[45]  N. Littlestone Learning Quickly When Irrelevant Attributes Abound: A New Linear-Threshold Algorithm , 1987, 28th Annual Symposium on Foundations of Computer Science (sfcs 1987).

[46]  Daniel Reisberg,et al.  Can mental images be ambiguous , 1985 .

[47]  D. B. Bender,et al.  Visual properties of neurons in inferotemporal cortex of the Macaque. , 1972, Journal of neurophysiology.

[48]  R. Hamel,et al.  Sketching and creative discovery , 1998 .

[49]  S. Edelman,et al.  Explorations of Shape Space , 1995 .

[50]  J. Kruskal Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis , 1964 .

[51]  Jonathan Baxter The Canonical Metric For Vector Quantization , 1995 .

[52]  T Poggio,et al.  Regularization Algorithms for Learning That Are Equivalent to Multilayer Networks , 1990, Science.

[53]  R. Shepard,et al.  Toward a universal law of generalization for psychological science. , 1987, Science.

[54]  Mary Henle,et al.  Isomorphism: Setting the record straight , 1984 .

[55]  T. Poggio,et al.  A network that learns to recognize three-dimensional objects , 1990, Nature.

[56]  S. Hanson,et al.  Spherical Units as Dynamic Consequential Regions: Implications for Attention, Competition and Categorization , 1990, NIPS 1990.

[57]  R. Shepard The analysis of proximities: Multidimensional scaling with an unknown distance function. II , 1962 .

[58]  Peter Dayan,et al.  Neural Models for Part-Whole Hierarchies , 1996, NIPS.

[59]  Joel Michell,et al.  Maze's direct realism and the character of cognition , 1988 .

[60]  D. Sundararaman,et al.  Moduli, deformations, and classifications of compact complex manifolds , 1980 .

[61]  M. Martin White Queen Psychology and Other Essays for Alice , 1995 .

[62]  K Tanaka,et al.  Neuronal mechanisms of object recognition. , 1993, Science.

[63]  W. Newsome,et al.  A selective impairment of motion perception following lesions of the middle temporal visual area (MT) , 1988, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[64]  Tomaso A. Poggio,et al.  Extensions of a Theory of Networks for Approximation and Learning , 1990, NIPS.

[65]  Kimberly A. Jameson What Saunders and van Brakel chose to ignore in color and cognition research , 1997, Behavioral and Brain Sciences.

[66]  Tomaso Poggio,et al.  Image Representations for Visual Learning , 1996, Science.

[67]  D. Kendall,et al.  The Riemannian Structure of Euclidean Shape Spaces: A Novel Environment for Statistics , 1993 .

[68]  J. O'Regan,et al.  Solving the "real" mysteries of visual perception: the world as an outside memory. , 1992, Canadian journal of psychology.

[69]  R. Nosofsky Exemplar-Based Accounts of Relations Between Classification, Recognition, and Typicality , 1988 .

[70]  S. Ullman Aligning pictorial descriptions: An approach to object recognition , 1989, Cognition.

[71]  D. Kendall A Survey of the Statistical Theory of Shape , 1989 .

[72]  O. G. Selfridge,et al.  Pandemonium: a paradigm for learning , 1988 .

[73]  H. Le,et al.  On Geodesics in Euclidean Shape Spaces , 1991 .

[74]  R. Shepard,et al.  Second-order isomorphism of internal representations: Shapes of states , 1970 .

[75]  T Poggio,et al.  Fast perceptual learning in visual hyperacuity. , 1991, Science.

[76]  S. Edelman Representation of Similarity in 3D Object Discrimination , 1995 .

[77]  Harvey Cohn,et al.  Conformal Mappings on Riemann Surfaces. , 1983 .

[78]  R. Nosofsky Stimulus bias, asymmetric similarity, and classification , 1991, Cognitive Psychology.

[79]  Vladimir A. Zorich,et al.  The global homeomorphism theorem for space quasiconformal mappings, its development and related open problems , 1992 .

[80]  Minami Ito,et al.  Columns for visual features of objects in monkey inferotemporal cortex , 1992, Nature.

[81]  Keiji Tanaka,et al.  Coding visual images of objects in the inferotemporal cortex of the macaque monkey. , 1991, Journal of neurophysiology.

[82]  E. Adelson,et al.  The analysis of moving visual patterns , 1985 .

[83]  S. Ullman Against direct perception , 1980, Behavioral and Brain Sciences.

[84]  W. E. Collins,et al.  Integrating pictorial information across eye movements. , 1984, Journal of experimental psychology. General.

[85]  Jussi Väisälä,et al.  Lectures on n-Dimensional Quasiconformal Mappings , 1971 .

[86]  W. Pitts,et al.  What the Frog's Eye Tells the Frog's Brain , 1959, Proceedings of the IRE.

[87]  I. Borg Multidimensional similarity structure analysis , 1987 .

[88]  T J Sejnowski,et al.  Learning the higher-order structure of a natural sound. , 1996, Network.

[89]  D C Van Essen,et al.  Shifter circuits: a computational strategy for dynamic aspects of visual processing. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[90]  Roger N. Shepard,et al.  Additive clustering: Representation of similarities as combinations of discrete overlapping properties. , 1979 .

[91]  R N Shepard,et al.  Multidimensional Scaling, Tree-Fitting, and Clustering , 1980, Science.

[92]  B M Dow,et al.  The mapping of visual space onto foveal striate cortex in the macaque monkey , 1985, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[93]  Keiji Tanaka Inferotemporal cortex and higher visual functions , 1992, Current Opinion in Neurobiology.

[94]  Geoffrey E. Hinton,et al.  How neural networks learn from experience. , 1992, Scientific American.

[95]  R. Shepard,et al.  Perceptual-cognitive explorations of a toroidal set of free-form stimuli , 1973 .

[96]  D Marr,et al.  Early processing of visual information. , 1976, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[97]  P. Suppes,et al.  REPRESENTATIONS AND MODELS IN PSYCHOLOGY , 1994 .

[98]  A. Koriat,et al.  Memory metaphors and the real-life/laboratory controversy: Correspondence versus storehouse conceptions of memory , 1996, Behavioral and Brain Sciences.

[99]  P. H. Lindsay,et al.  Human Information Processing: An Introduction to Psychology , 1972 .

[100]  J. Bourgain On lipschitz embedding of finite metric spaces in Hilbert space , 1985 .

[101]  J. Grimes On the failure to detect changes in scenes across saccades. , 1996 .

[102]  Jussi Väisälä,et al.  Domains and maps , 1992 .

[103]  Wayne D. Gray,et al.  Basic objects in natural categories , 1976, Cognitive Psychology.