The generic viewpoint assumption in a framework for visual perception

A VISUAL system makes assumptions in order to interpret visual data. The assumption of 'generic view'1–4 states that the observer is not in a special position relative to the scene. Researchers commonly use a binary decision of generic or accidental view to disqualify scene interpretations that assume accidental viewpoints5–10. Here we show how to use the generic view assumption, and others like it, to quantify the likelihood of a view, adding a new term to the probability of a given image interpretation. The resulting framework better models the visual world and reduces the reliance on other prior assumptions. It may lead to computer vision algorithms of greater power and accuracy, or to better models of human vision. We show applications to the problems of inferring shape, surface reflectance properties, and motion from images.

[1]  A. N. Tikhonov,et al.  Solutions of ill-posed problems , 1977 .

[2]  Robert L Cook,et al.  A reflectance model for computer graphics , 1981, SIGGRAPH '81.

[3]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[4]  Thomas O. Binford,et al.  Inferring Surfaces from Images , 1981, Artif. Intell..

[5]  Thomas O. Binford,et al.  The Recovery of Three-Dimensional Structure from Image Curves , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Demetri Terzopoulos,et al.  Regularization of Inverse Visual Problems Involving Discontinuities , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  I. Biederman Human image understanding: Recent research and a theory , 1985, Computer Vision Graphics and Image Processing.

[8]  Demetri Terzopoulos Regularization ofInverseVisualProblemsInvolving Discontinuities , 1986 .

[9]  Jan J. Koenderink,et al.  Inferring three-dimensional shapes from two-dimensional silhouettes , 1987 .

[10]  K. Nakayama,et al.  The aperture problem—I. Perception of nonrigidity and motion direction in translating sinusoidal lines , 1988, Vision Research.

[11]  J. Berger Statistical Decision Theory and Bayesian Analysis , 1988 .

[12]  Aaron F. Bobick,et al.  The direct computation of height from shading , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[13]  Tomaso Poggio,et al.  Computational vision and regularization theory , 1985, Nature.

[14]  David J. C. MacKay,et al.  Bayesian Interpolation , 1992, Neural Computation.

[15]  Allan D. Jepson,et al.  What Makes a Good Feature , 1992 .

[16]  Alex Pentland,et al.  A simple algorithm for shape from shading , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[17]  K Nakayama,et al.  Experiencing and perceiving visual surfaces. , 1992, Science.

[18]  William T. Freeman,et al.  Exploiting the generic view assumption to estimate scene parameters , 1993, 1993 (4th) International Conference on Computer Vision.

[19]  Michael Jenkin,et al.  Spatial vision in humans and robots , 1994 .

[20]  Michael Werman,et al.  Stability and Likelihood of Views of Three Dimensional Objects , 1994, Theoretical Foundations of Computer Vision.