A Bayesian Framework for the Integration of Visual Modules

The Bayesian approach to vision provides a fruitful theoretical framework both for modeling individual cues, such as stereo, shading, texture and occlusion, and for integrating their information. In this formalism we represent the viewed scene by one, or more, surfaces using prior assumptions about the surface shapes and material properties. On theoretical grounds, the less information available to the cues (and the less accurate it is) then the more important these assumptions become. This suggests that visual illusions, and biased perceptions, will arise for scenes for which the prior assumptions are not appropriate. We describe psychophysical experiments which are consistent with these ideas. Our Bayesian approach also has two important implications for coupling diierent visual cues. Firstly, diierent cues cannot in general be treated independently and then simply combined together at the end. There are dependencies between them that have to be incorporated into the models. Secondly, a single generic prior assumption is not suucient even if it does incorporate cue interactions because there are many diierent types of visual scenes and diierent models are appropriate for each. This leads to the concept of competitive priors where the visual system must choose the corect model depending on the stimulus.

[1]  R. Gregory The intelligent eye , 1970 .

[2]  B. Sakitt Counting every quantum , 1972, The Journal of physiology.

[3]  J. Gibson The Ecological Approach to Visual Perception , 1979 .

[4]  Robert J. Woodham,et al.  Analysing Images of Curved Surfaces , 1981, Artif. Intell..

[5]  Alex Pentland,et al.  Local Shading Analysis , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  J P Frisby,et al.  PMF: A Stereo Correspondence Algorithm Using a Disparity Gradient Limit , 1985, Perception.

[7]  Tomaso Poggio,et al.  Computational vision and regularization theory , 1985, Nature.

[8]  G. Sperling,et al.  Tradeoffs between stereopsis and proximity luminance covariance as determinants of perceived 3D structure , 1986, Vision Research.

[9]  Michael J. Brooks,et al.  The variational approach to shape from shading , 1986, Comput. Vis. Graph. Image Process..

[10]  H. Bülthoff,et al.  INTERACTION OF DIFFERENT MODULES IN DEPTH PERCEPTION. , 1987, ICCV 1987.

[11]  J. A. Anderson,et al.  Associative learning of scene parameters from images. , 1987, Applied optics.

[12]  W. Bialek,et al.  Physical limits to sensation and perception. , 1987, Annual review of biophysics and biophysical chemistry.

[13]  J. Berger Statistical Decision Theory and Bayesian Analysis , 1988 .

[14]  J. Cutting,et al.  Minimodularity and the perception of layout. , 1988, Journal of experimental psychology. General.

[15]  T Poggio,et al.  Parallel integration of vision modules. , 1988, Science.

[16]  Yiannis Aloimonos,et al.  Active vision , 2004, International Journal of Computer Vision.

[17]  Y. J. Tejwani,et al.  Robot vision , 1989, IEEE International Symposium on Circuits and Systems,.

[18]  Michael S. Landy,et al.  A Statistical Framework for Robust Fusion of Depth Information , 1989, Other Conferences.

[19]  Douglas B. Paul,et al.  Speech Recognition Using Hidden Markov Models , 1990 .

[20]  H. Bülthoff,et al.  Does the brain know the physics of specular reflection? , 1990, Nature.

[21]  Heinrich H. Bülthoff,et al.  Integration of Stereo, Shading and Texture , 1990 .

[22]  James J. Clark,et al.  Data Fusion for Sensory Information Processing Systems , 1990 .

[23]  K Nakayama,et al.  Toward a neural understanding of visual surface representation. , 1990, Cold Spring Harbor symposia on quantitative biology.

[24]  Ernst D. Dickmanns,et al.  An integrated spatio-temporal approach to automatic visual guidance of autonomous vehicles , 1990, IEEE Trans. Syst. Man Cybern..

[25]  T. Bayes An essay towards solving a problem in the doctrine of chances , 2003 .

[26]  A Blake,et al.  Shape from specularities: computation and psychophysics. , 1991, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[27]  D. Knill,et al.  Apparent surface curvature affects lightness perception , 1991, Nature.

[28]  J. Koenderink,et al.  Surface perception in pictures , 1992, Perception & psychophysics.

[29]  Bennett L. Schwartz,et al.  Interaction between Transparency and Structure from Motion , 1992, Neural Computation.

[30]  D Mumford,et al.  On the computational architecture of the neocortex. II. The role of cortico-cortical loops. , 1992, Biological cybernetics.

[31]  K Nakayama,et al.  Experiencing and perceiving visual surfaces. , 1992, Science.

[32]  William T. Freeman,et al.  Exploiting the generic view assumption to estimate scene parameters , 1993, 1993 (4th) International Conference on Computer Vision.

[33]  D. Knill,et al.  Moving Cast Shadows and the Perception of Relative Depth , 1994 .

[34]  James J. Clark,et al.  Bayesian models, deformable templates and competitive priors , 1994 .

[35]  Heinrich H. Bülthoff,et al.  General lighting can overcome accidental viewing , 1994 .

[36]  M. Landy,et al.  Measurement and modeling of depth cue combination: in defense of weak fusion , 1995, Vision Research.

[37]  A. Yuille,et al.  Bayesian decision theory and psychophysics , 1996 .

[38]  Edward H. Adelson,et al.  The perception of shading and reflectance , 1996 .

[39]  David C. Knill,et al.  Introduction: a Bayesian formulation of visual perception , 1996 .