How are three-dimensional objects represented in the brain?

In this report we discuss a variety of psychophysical experiments that explore different aspects of the problem of object recognition and representation in human vision. In all experiments, subjects were presented with realistically rendered images of computer-generated 3D objects, with tight control over stimulus shape, surface properties, illumination, and viewpoint, as well as subjects' prior exposure to the stimulus objects. Contrary to the predictions of the paradigmatic theory of recognition, which holds that object representations are viewpoint invariant, performance in all experiments was consistently viewpoint dependent, was only partially aided by binocular stereo and other depth information, was specific to viewpoints that were familiar, and was systematically disrupted by rotation in depth more than by deforming the 2D images of the stimuli. The emerging concept of multiple-views representation supported by these results is consistent with recently advanced computational theories of recognition based on view interpolation. Moreover, in several simulated experiments employing the same stimuli used in experiments with human subjects, models based on multiple-views representations replicated many of the psychophysical results concerning the observed pattern of human performance.

[1]  G E Legge,et al.  Color improves object recognition in normal and low vision. , 1993, Journal of experimental psychology. Human perception and performance.

[2]  Shimon Edelman,et al.  Modeling human visual object recognition , 1992, [Proceedings 1992] IJCNN International Joint Conference on Neural Networks.

[3]  Kavitha Srinivas Perceptual specificity in nonverbal priming. , 1993 .

[4]  Stephen M. Kosslyn,et al.  Pictures and names: Making the connection , 1984, Cognitive Psychology.

[5]  S. Ullman Aligning pictorial descriptions: An approach to object recognition , 1989, Cognition.

[6]  E. G. Jones Cerebral Cortex , 1987, Cerebral Cortex.

[7]  I. Biederman,et al.  Recognizing depth-rotated objects: evidence and conditions for three-dimensional viewpoint invariance. , 1993, Journal of experimental psychology. Human perception and performance.

[9]  D. Schacter Implicit memory: History and current status. , 1987 .

[10]  P. Jolicoeur The time to name disoriented natural objects , 1985, Memory & cognition.

[11]  A. J. Mistlin,et al.  Visual neurones responsive to faces , 1987, Trends in Neurosciences.

[12]  M. Tarr,et al.  When does Human Object Recognition use a Viewer-Centered Reference Frame? , 1990 .

[13]  H. Barrow,et al.  RECOVERING INTRINSIC SCENE CHARACTERISTICS FROM IMAGES , 1978 .

[14]  M. Tarr,et al.  Mental rotation and orientation-dependence in shape recognition , 1989, Cognitive Psychology.

[15]  T Poggio,et al.  Regularization Algorithms for Learning That Are Equivalent to Multilayer Networks , 1990, Science.

[16]  H H Bülthoff,et al.  Psychophysical support for a two-dimensional view interpolation theory of object recognition. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[17]  I. Biederman,et al.  Dynamic binding in a neural network for shape recognition. , 1992, Psychological review.

[18]  Shimon Ullman,et al.  Aligning pictorial descriptions , 1991 .

[19]  Ronen Basri,et al.  Recognition by Linear Combinations of Models , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  T. Poggio,et al.  A network that learns to recognize three-dimensional objects , 1990, Nature.

[21]  Peter E. Hart,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[22]  D. W. Thompson,et al.  Three-dimensional model matching from an unconstrained viewpoint , 1987, Proceedings. 1987 IEEE International Conference on Robotics and Automation.

[23]  Heinrich H. Bülthoff,et al.  Generalization of object recognition across stimulus rotation and deformation , 1993 .

[24]  D. Marr,et al.  Representation and recognition of the spatial organization of three-dimensional shapes , 1978, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[25]  C. B. Cave,et al.  The Role of Parts and Spatial Relations in Object Identification , 1993, Perception.

[26]  T Poggio,et al.  View-based models of 3D object recognition: invariance to imaging transformations. , 1995, Cerebral cortex.

[27]  I. Rock,et al.  Can we imagine how objects look from other viewpoints? , 1989, Cognitive Psychology.

[28]  Wayne D. Gray,et al.  Basic objects in natural categories , 1976, Cognitive Psychology.

[29]  Sharon Lee Armstrong,et al.  What some concepts might not be , 1983, Cognition.

[30]  T. Poggio,et al.  The importance of symmetry and virtual views in three-dimensional object recognition , 1994, Current Biology.

[31]  M. G. Eley,et al.  Identifying rotated letter-like symbols , 1982, Memory & cognition.

[32]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[33]  David G. Lowe,et al.  Perceptual Organization and Visual Recognition , 2012 .

[34]  Dave Bartram,et al.  The role of visual and semantic codes in object naming , 1974 .

[35]  Michael Werman,et al.  Stability and Likelihood of Views of Three Dimensional Objects , 1994, Theoretical Foundations of Computer Vision.

[36]  I. Biederman Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.

[37]  H H Bülthoff,et al.  Integration of depth modules: stereo and shading. , 1988, Journal of the Optical Society of America. A, Optics and image science.

[38]  I. Rock,et al.  A case of viewer-centered object perception , 1987, Cognitive Psychology.

[39]  T. Poggio,et al.  Representations in high-level vision: reassessing the inverse optics paradigm , 1989 .

[40]  Daniel L. Schacter,et al.  Dissociations Between Structural and Episodic Representations of Visual Objects , 1992 .

[41]  Shimon Edelman A Network Model of Object Recognition in Human Vision , 1992 .

[42]  Nathan Intrator,et al.  Three-Dimensional Object Recognition Using an Unsupervised Neural Network: Understanding the Distinguishing Features , 1991 .

[43]  Shimon Edelman,et al.  Bringing the Grandmother back into the Picture: A Memory-Based View of Object Recognition , 1990, Int. J. Pattern Recognit. Artif. Intell..

[44]  G. Humphrey,et al.  Recognizing novel views of three-dimensional objects. , 1992, Canadian journal of psychology.

[45]  A Koriat,et al.  Mental rotation and visual familiarity , 1985, Perception & psychophysics.

[46]  Harry G. Barrow,et al.  Scene modeling: a structural basis for image description , 1980 .

[47]  Michael J. Tarr,et al.  Orientation dependence in three-dimensional object recognition , 1989 .

[48]  G W Humphreys,et al.  Varieties of Object Constancy , 1989, The Quarterly journal of experimental psychology. A, Human experimental psychology.

[49]  R. Shepard,et al.  Mental Images and Their Transformations , 1982 .

[50]  B. Tversky,et al.  Objects, parts, and categories. , 1984 .

[51]  S. Edelman,et al.  Generalization of object recognition in human vision across stimulus transformations and deformations , 1990 .