Psychophysical and physiological evidence for viewer-centered object representations in the primate.

A key question concerning the perception of 3D objects is the spatial reference frame used by the brain to represent them. The celerity of the recognition process could be explained by the visual system's ability to quickly transform stored models of familiar 3D objects, or by its ability to specify the relationship among viewpoint-invariant features or volumetric primitives that can be used to accomplish a structural description of an image. Alternatively, viewpoint-invariant recognition could be realized by a system endowed with the ability to perform an interpolation between a set of stored 2D templates, created for each experienced viewpoint. In the present study we set out to examine the nature of object representation in the primate in combined psychophysical-electrophysiological experiments. Monkeys were trained to recognize novel objects from a given viewpoint and subsequently were tested for their ability to generalize recognition for views generated by mathematically rotating the objects around any arbitrary axis. The perception of 3D novel objects was found to be a function of the object's retinal projection at the time of the recognition encounter. Recognition became increasingly difficult for the monkeys as the stimulus was rotated away from its familiar attitude. The generalization field for novel wire-like and spheroidal objects extended to about +/- 40 degrees around an experienced viewpoint. When the animals were trained with as few as three views of the object, 120 degrees apart, they could often interpolate recognition for all views resulting from rotations around the same axis. Recordings from inferotemporal cortex during the psychophysical testing showed a number of neurons with remarkable selectivity for individual views of those objects that the monkey had learned to recognize. Plotting the response of neurons as a function of rotation angle revealed systematic view-tuning curves for rotations in depth. A small percentage of the view-selective cells responded strongly for a particular view and its mirror-symmetrical view. For some of the tested objects, different neurons were found to be tuned to different views of the same object; the peaks of the view-tuning curves were 40-50 degrees apart. Neurons were also found that responded to the sight of unfamiliar objects or distractors. Such cells, however, gave nonspecific responses to a variety of other patterns presented while the monkey performed a simple fixation task.

[1]  S. T. Orton SPECIFIC READING DISABILITY— STREPHOSYMBOLIA , 1928 .

[2]  G. Bonin,et al.  The neocortex of Macaca mulatta , 1947 .

[3]  G. J. Romanes,et al.  The Neocortex of Macaca mulatta , 1948 .

[4]  B. Milner,et al.  Psychological defects produced by temporal lobe excision. , 1958, Research publications - Association for Research in Nervous and Mental Disease.

[5]  D. Robinson,et al.  A METHOD OF MEASURING EYE MOVEMENT USING A SCLERAL SEARCH COIL IN A MAGNETIC FIELD. , 1963, IEEE transactions on bio-medical engineering.

[6]  D. Kimura Right temporal-lobe damage. Perception of unfamiliar stimuli after damage. , 1963, Archives of neurology.

[7]  G L Gerstein,et al.  Single-unit activity in temporal association cortex of the monkey. , 1967, Journal of neurophysiology.

[8]  C. Gross,et al.  Inferotemporal evoked potentials during visual discrimination performance by monkeys. , 1968, Journal of comparative and physiological psychology.

[9]  M. Braunstein Motion and texture as sources of slant information. , 1968, Journal of experimental psychology.

[10]  H. Lansdell Effect of extent of temporal lobe ablations on two lateralized deficits , 1968 .

[11]  Brenda Milner,et al.  Visual recognition and recall after right temporal-lobe excision in man , 1968 .

[12]  D. B. Bender,et al.  Visual Receptive Fields of Neurons in Inferotemporal Cortex of the Monkey , 1969, Science.

[13]  M Mishkin,et al.  Further evidence on the locus of the visual area in the temporal lobe of the monkey. , 1969, Experimental neurology.

[14]  L B Taylor,et al.  Localisation of cerebral lesions by psychological testing. , 1969, Clinical neurosurgery.

[15]  D. B. Bender,et al.  Visual properties of neurons in inferotemporal cortex of the Macaque. , 1972, Journal of neurophysiology.

[16]  C. Gross Visual Functions of Inferotemporal Cortex , 1973 .

[17]  D. H. McLain,et al.  Drawing Contours from Arbitrary Data Points , 1974, Comput. J..

[18]  E. Rosch Cognitive Representations of Semantic Categories. , 1975 .

[19]  Wayne D. Gray,et al.  Basic objects in natural categories , 1976, Cognitive Psychology.

[20]  C. Gross 7 – The Neural Basis of Stimulus Equivalence Across Retinal Translation , 1977 .

[21]  S. Carey,et al.  From piecemeal to configurational representation of faces. , 1977, Science.

[22]  C. Gross,et al.  Left and Right in Science and Art , 1978 .

[23]  Charles G. Gross,et al.  Perceptual similarity of mirror images in infancy , 1978, Cognition.

[24]  R. Desimone,et al.  Visual areas in the temporal cortex of the macaque , 1979, Brain Research.

[25]  B Rogers,et al.  Motion Parallax as an Independent Cue for Depth Perception , 1979, Perception.

[26]  B. Richmond,et al.  Implantation of magnetic search coils for measurement of eye position: An improved method , 1980, Vision Research.

[27]  A. Mikami,et al.  Inferotemporal neuron activities and color discrimination with delay , 1980, Brain Research.

[28]  R. Desimone,et al.  Visual properties of neurons in a polysensory area in superior temporal sulcus of the macaque. , 1981, Journal of neurophysiology.

[29]  I. Rock,et al.  The effect on form perception of change of orientation in the third dimension. , 1981, Journal of experimental psychology. Human perception and performance.

[30]  B. Rogers,et al.  Similarities between motion parallax and stereopsis in human depth perception , 1982, Vision Research.

[31]  C. Bruce,et al.  Face recognition by monkeys: Absence of an inversion effect , 1982, Neuropsychologia.

[32]  B. Rogers,et al.  Anisotropies in the perception of three-dimensional surfaces. , 1983, Science.

[33]  R. Desimone,et al.  Shape recognition and inferior temporal neurons. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[34]  M. Corballis,et al.  Winding one's ps and qs: mental rotation and mirror-image discrimination. , 1984, Journal of experimental psychology. Human perception and performance.

[35]  R. Desimone,et al.  Stimulus-selective properties of inferior temporal neurons in the macaque , 1984, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[36]  Stephen M. Kosslyn,et al.  Pictures and names: Making the connection , 1984, Cognitive Psychology.

[37]  A. J. Mistlin,et al.  Visual cells in the temporal cortex sensitive to face view and gaze direction , 1985, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[38]  Maureen Graham,et al.  Motion Parallax and the Perception of Three-Dimensional Surfaces , 1985 .

[39]  S. Yamane,et al.  Responses of single neurons in the inferotemporal cortex of the awake monkey performing human face discrimination task , 1987 .

[40]  H. Spitzer,et al.  Temporal encoding of two-dimensional patterns by single units in primate inferior temporal cortex. I. Response characteristics. , 1987, Journal of neurophysiology.

[41]  I. Biederman Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.

[42]  I. Rock,et al.  A case of viewer-centered object perception , 1987, Cognitive Psychology.

[43]  A. J. Mistlin,et al.  Visual neurones responsive to faces , 1987, Trends in Neurosciences.

[44]  A. Damasio,et al.  Intact recognition of facial expression, gender, and age in patients with impaired recognition of face identity , 1988, Neurology.

[45]  R. M. Siegel,et al.  Perception of three-dimensional structure from motion in monkey and man , 1988, Nature.

[46]  S. Ullman Aligning pictorial descriptions: An approach to object recognition , 1989, Cognition.

[47]  M. Tarr,et al.  Mental rotation and orientation-dependence in shape recognition , 1989, Cognitive Psychology.

[48]  I. Rock,et al.  Can we imagine how objects look from other viewpoints? , 1989, Cognitive Psychology.

[49]  T Poggio,et al.  Regularization Algorithms for Learning That Are Equivalent to Multilayer Networks , 1990, Science.

[50]  T. Poggio,et al.  A network that learns to recognize three-dimensional objects , 1990, Nature.

[51]  M. Tarr,et al.  When does Human Object Recognition use a Viewer-Centered Reference Frame? , 1990 .

[52]  A. Damasio Category-related recognition defects as a clue to the neural substrates of knowledge , 1990, Trends in Neurosciences.

[53]  T. Poggio A theory of how the brain might work. , 1990, Cold Spring Harbor symposia on quantitative biology.

[54]  I. Biederman,et al.  Evidence for Complete Translational and Reflectional Invariance in Visual Object Priming , 1991, Perception.

[55]  Keiji Tanaka,et al.  Coding visual images of objects in the inferotemporal cortex of the macaque monkey. , 1991, Journal of neurophysiology.

[56]  Ronen Basri,et al.  Recognition by Linear Combinations of Models , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[57]  Minami Ito,et al.  Columns for visual features of objects in monkey inferotemporal cortex , 1992, Nature.

[58]  S Yamane,et al.  Color selectivity of neurons in the inferior temporal cortex of the awake macaque monkey , 1992, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[59]  S. Edelman,et al.  Orientation dependence in the recognition of familiar and novel views of three-dimensional objects , 1992, Vision Research.

[60]  Heinrich H. Bülthoff,et al.  Evidence for recognition based on interpolation among 2D views of objects in monkeys. , 1992 .

[61]  M. Young,et al.  Sparse population coding of faces in the inferotemporal cortex. , 1992, Science.

[62]  H H Bülthoff,et al.  Psychophysical support for a two-dimensional view interpolation theory of object recognition. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[63]  Heinrich H. Bülthoff,et al.  Responses of Inferotemporal (IT) Neurons to Novel Wire-Objects in Monkeys Trained in an Object Recognition Task , 1993 .

[64]  M. Young,et al.  An analysis at the population level of the processing of faces in the inferotemporal cortex. , 1993 .

[65]  C. Gross,et al.  Response properties of neurons in temporal cortical visual areas of infant monkeys. , 1993, Journal of neurophysiology.

[66]  G. Orban,et al.  Cue-invariant shape selectivity of macaque inferior temporal neurons. , 1993, Science.

[67]  N. Logothetis,et al.  View-dependent object recognition by monkeys , 1994, Current Biology.

[68]  T. Poggio,et al.  The importance of symmetry and virtual views in three-dimensional object recognition , 1994, Current Biology.

[69]  T Poggio,et al.  View-based models of 3D object recognition: invariance to imaging transformations. , 1995, Cerebral cortex.

[70]  Nikos K. Logothetis,et al.  View Invariant Representations in Monkey Temporal Cortex: Position, Scale, and Rotational Invariance. , 1996 .