Inferring Perceptual Saliency Fields from Viewpoint-Dependent Recognition Data

We present an algorithm for computing the relative perceptual saliencies of the features of a three-dimensional object using either goodness-of-view scores measured at several viewpoints or perceptual similarities among several object views. This technique addresses the inverse, illposed version of the direct problem of predicting goodness-of-view scores or view point similarities when the object features are known. On the basis of a linear model for the direct problem, we solve the inverse problem using the method of regularization. The critical assumption we make to regularize the solution is that perceptual salience varies slowly on the surface of the object. The salient regions derived using this assumption empirically indicate what object structures are important in human three-dimensional object perception, a domain where theories typically have been based on somewhat ad hoc features.

[1]  P. Hansen Rank-Deficient and Discrete Ill-Posed Problems: Numerical Aspects of Linear Inversion , 1987 .

[2]  J. Chang,et al.  Analysis of individual differences in multidimensional scaling via an n-way generalization of “Eckart-Young” decomposition , 1970 .

[3]  Joseph L. Zinnes,et al.  Theory and Methods of Scaling. , 1958 .

[4]  Grace Wahba,et al.  Spline Models for Observational Data , 1990 .

[5]  H. Engl,et al.  Regularization of Inverse Problems , 1996 .

[6]  S. Edelman Representation of Similarity in 3D Object Discrimination , 1995 .

[7]  M. Tarr,et al.  To What Extent Do Unique Parts Influence Recognition Across Changes in Viewpoint? , 1995 .

[8]  J. Marroquín Surface Reconstruction Preserving Discontinuities , 1984 .

[9]  Donald Geman,et al.  Constrained Restoration and the Recovery of Discontinuities , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  John Skilling,et al.  Maximum Entropy and Bayesian Methods , 1989 .

[11]  Donald Geman,et al.  Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images , 1984 .

[12]  M. Tarr,et al.  Testing conditions for viewpoint invariance in object recognition. , 1997, Journal of experimental psychology. Human perception and performance.

[13]  I. Biederman,et al.  Viewpoint-dependent mechanisms in visual object recognition: Reply to Tarr and Bülthoff (1995). , 1995 .

[14]  H H Bülthoff,et al.  Psychophysical support for a two-dimensional view interpolation theory of object recognition. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[15]  W. Gibbs,et al.  Finite element methods , 2017, Graduate Studies in Mathematics.

[16]  Shimon Edelman,et al.  Representation of Similarity in Three-Dimensional Object Discrimination , 1995, Neural Computation.

[17]  Teresa Reginska,et al.  A Regularization Parameter in Discrete Ill-Posed Problems , 1996, SIAM J. Sci. Comput..

[18]  Stan Z. Li,et al.  Markov Random Field Modeling in Computer Vision , 1995, Computer Science Workbench.

[19]  Wayne D. Gray,et al.  Basic objects in natural categories , 1976, Cognitive Psychology.

[20]  G. Wahba Spline models for observational data , 1990 .

[21]  Michael J. Tarr,et al.  Representation of three-dimensional object similarity in human vision , 1997, Electronic Imaging.

[22]  M. Tarr Rotating objects to recognize them: A case study on the role of viewpoint dependency in the recognition of three-dimensional objects , 1995, Psychonomic bulletin & review.

[23]  B. Tversky,et al.  Objects, parts, and categories. , 1984 .

[24]  Michael J. Tarr Is human object recognition better described by geon structural description or by multiple views , 1995 .

[25]  Robert L. Goldstone,et al.  The development of features in object concepts , 1998, Behavioral and Brain Sciences.

[26]  M J Tarr,et al.  Is human object recognition better described by geon structural descriptions or by multiple views? Comment on Biederman and Gerhardstein (1993). , 1995, Journal of experimental psychology. Human perception and performance.

[27]  C. W. Groetsch,et al.  Inverse Problems in the Mathematical Sciences , 1993 .

[28]  Tomaso Poggio,et al.  Computational vision and regularization theory , 1985, Nature.

[29]  I. Biederman,et al.  Recognizing depth-rotated objects: Evidence and conditions for three-dimensional viewpoint invariance. , 1993 .

[30]  S. Edelman,et al.  Representation of object similarity in human vision: psychophysics and a computational model , 1998, Vision Research.

[31]  A. N. Tikhonov,et al.  Solutions of ill-posed problems , 1977 .

[32]  A. Tversky,et al.  Representations of perceptions of risks , 1984 .

[33]  I. Biederman Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.