论文信息 - Shape Recipes: Scene Representations that Refer to the Image

Shape Recipes: Scene Representations that Refer to the Image

The goal of low-level vision is to estimate an underlying scene, given an observed image. Real-world scenes (eg, albedos or shapes) can be very complex, conventionally requiring high dimensional representations which are hard to estimate and store. We propose a low-dimensional representation, called a scene recipe, that relies on the image itself to describe the complex scene configurations. Shape recipes are an example: these are the regression coefficients that predict the bandpassed shape from image data. We describe the benefits of this representation, and show two uses illustrating their properties: (1) we improve stereo shape estimates by learning shape recipes at low resolution and applying them at full resolution; (2) Shape recipes implicitly contain information about lighting and materials and we use them for material segmentation.

Antonio Torralba | William T. Freeman | A. Torralba | W. Freeman

[1] Berthold K. P. Horn,et al. Shape from shading , 1989 .

[2] Alex Pentland,et al. Generalized implicit functions for computer graphics , 1991, SIGGRAPH.

[3] Heekuck Oh,et al. Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[4] William T. Freeman,et al. The generic viewpoint assumption in a framework for visual perception , 1994, Nature.

[5] William T. Freeman,et al. Presented at: 2nd Annual IEEE International Conference on Image , 1995 .

[6] Eero P. Simoncelli. Statistical models for images: compression, restoration and synthesis , 1997, Conference Record of the Thirty-First Asilomar Conference on Signals, Systems and Computers (Cat. No.97CB36136).

[7] Jitendra Malik,et al. Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[8] Yair Weiss. Bayesian motion estimation and segmentation , 1998 .

[9] E. Adelson. Lightness Perception and Lightness Illusions , 1999 .

[10] Reinhard Koch,et al. A simple and efficient rectification method for general motion , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[11] A. Gilchrist,et al. An anchoring theory of lightness perception. , 1999 .

[12] Takeo Kanade,et al. A Cooperative Algorithm for Stereo Matching and Occlusion Detection , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[13] W. Freeman,et al. Learning local evidence for shading and reflectance , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[14] Antonio Torralba,et al. Properties and applications of shape recipes , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[15] Richard Szeliski,et al. Bayesian modeling of uncertainty in low-level vision , 2011, International Journal of Computer Vision.

[16] William T. Freeman,et al. Learning Low-Level Vision , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[17] Jitendra Malik,et al. Representing and Recognizing the Visual Appearance of Materials using Three-dimensional Textons , 2001, International Journal of Computer Vision.

[18] Zhengyou Zhang,et al. Determining the Epipolar Geometry and its Uncertainty: A Review , 1998, International Journal of Computer Vision.

[19] Alex Pentland. Linear shape from shading , 2004, International Journal of Computer Vision.