Object classification for human and ideal observers

We describe a novel approach, based on ideal observer analysis, for measuring the ability of human observers to use image information for 3D object perception. We compute the statistical efficiency of subjects relative to an ideal observer for a 3D object classification task. After training to 11 different views of a randomly shaped thick wire object, subjects were asked which of a pair of noisy views of the object best matched the learned object. Efficiency relative to the actual information in the stimuli can be as high as 20%. Increases in object regularity (e.g. symmetry) lead to increases in the efficiency with which novel views of an object could be classified. Furthermore, such increases in regularity also lead to decreases in the effect of viewpoint on classification efficiency. Human statistical efficiencies relative to a 2D ideal observer exceeded 100%, thereby excluding all models which are sub-optimal relative to the 2D ideal.

[1]  Allan D. Jepson,et al.  What Makes a Good Feature , 1992 .

[2]  Shimon Ullman,et al.  An Approach to Object Recognition: Aligning Pictorial Descriptions , 1986 .

[3]  Heinrich H. Bülthoff,et al.  Psychophysical support for a 2D view interpolation theory of object recognition , 1991 .

[4]  T. Poggio,et al.  Recognition and Structure from one 2D Model View: Observations on Prototypes, Object Classes and Symmetries , 1992 .

[5]  Elie Bienenstock,et al.  A neural network for invariant pattern recognition. , 1987 .

[6]  T. Poggio,et al.  The importance of symmetry and virtual views in three-dimensional object recognition , 1994, Current Biology.

[7]  T Poggio,et al.  Regularization Algorithms for Learning That Are Equivalent to Multilayer Networks , 1990, Science.

[8]  H H Bülthoff,et al.  Psychophysical support for a two-dimensional view interpolation theory of object recognition. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Shimon Edelman,et al.  Bringing the Grandmother back into the Picture: A Memory-Based View of Object Recognition , 1990, Int. J. Pattern Recognit. Artif. Intell..

[10]  Colin Blakemore,et al.  Statistical limits to image understanding , 1991 .

[11]  A. Watson,et al.  Quest: A Bayesian adaptive psychometric method , 1983, Perception & psychophysics.

[12]  I. Biederman,et al.  Recognizing depth-rotated objects: evidence and conditions for three-dimensional viewpoint invariance. , 1993, Journal of experimental psychology. Human perception and performance.

[13]  Teuvo Kohonen,et al.  Associative memory. A system-theoretical approach , 1977 .

[14]  I. Biederman Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.

[15]  Stephen A. Ritz,et al.  Distinctive features, categorical perception, and probability learning: some applications of a neural model , 1977 .

[16]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[17]  M. Tarr,et al.  Mental rotation and orientation-dependence in shape recognition , 1989, Cognitive Psychology.

[18]  S. Edelman,et al.  Orientation dependence in the recognition of familiar and novel views of three-dimensional objects , 1992, Vision Research.

[19]  H. Barlow The absolute efficiency of perceptual decisions. , 1980, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[20]  Wayne D. Gray,et al.  Basic objects in natural categories , 1976, Cognitive Psychology.

[21]  Tomaso A. Poggio,et al.  Extensions of a Theory of Networks for Approximation and Learning , 1990, NIPS.

[22]  S. Ullman Aligning pictorial descriptions: An approach to object recognition , 1989, Cognition.

[23]  Ronen Basri,et al.  Recognition by Linear Combinations of Models , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[24]  I. Rock,et al.  A case of viewer-centered object perception , 1987, Cognitive Psychology.

[25]  Michael J. Tarr,et al.  Orientation dependence in three-dimensional object recognition , 1989 .

[26]  H. Barlow,et al.  The versatility and absolute efficiency of detecting mirror symmetry in random dot displays , 1979, Vision Research.

[27]  Colin Blakemore,et al.  Vision: Coding and Efficiency , 1991 .

[28]  T. Poggio,et al.  A network that learns to recognize three-dimensional objects , 1990, Nature.