论文信息 - Intrinsic Scene Properties from a Single RGB-D Image

Intrinsic Scene Properties from a Single RGB-D Image

In this paper we extend the “shape, illumination and reflectance from shading” (SIRFS) model [3, 4], which recovers intrinsic scene properties from a single image. Though SIRFS performs well on images of segmented objects, it performs poorly on images of natural scenes, which contain occlusion and spatially-varying illumination. We therefore present Scene-SIRFS, a generalization of SIRFS in which we have a mixture of shapes and a mixture of illuminations, and those mixture components are embedded in a “soft” segmentation of the input image. We additionally use the noisy depth maps provided by RGB-D sensors (in this case, the Kinect) to improve shape estimation. Our model takes as input a single RGB-D image and produces as output an improved depth map, a set of surface normals, a reflectance image, a shading image, and a spatially varying model of illumination. The output of our model can be used for graphics applications, or for any application involving RGB-D images.

Jitendra Malik | Jonathan T. Barron

[1] H. Barrow,et al. RECOVERING INTRINSIC SCENE CHARACTERISTICS FROM IMAGES , 1978 .

[2] Derek Hoiem,et al. Indoor Segmentation and Support Inference from RGBD Images , 2012, ECCV.

[3] Jitendra Malik,et al. Color Constancy, Intrinsic Images, and Shape Estimation , 2012, ECCV.

[4] Stephen Lin,et al. Estimation of Intrinsic Image Sequences from Image+Depth Video , 2012, ECCV.

[5] Berthold K. P. Horn,et al. Determining lightness from an image , 1974, Comput. Graph. Image Process..

[6] Andrew Blake,et al. Surface descriptions from stereo and shading , 1986, Image Vis. Comput..

[7] Alexei A. Efros,et al. Recovering Surface Layout from an Image , 2007, International Journal of Computer Vision.

[8] Jitendra Malik,et al. Using contours to detect and localize junctions in natural images , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[9] Frédo Durand,et al. Understanding and evaluating blind deconvolution algorithms , 2009, CVPR.

[10] Jitendra Malik,et al. Shape, albedo, and illumination from a single image of an unknown object , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[11] Jitendra Malik,et al. High-frequency shape and albedo from shading using natural image statistics , 2011, CVPR 2011.

[12] Fan Chung,et al. Spectral Graph Theory , 1996 .

[13] Peter V. Gehler,et al. Recovering Intrinsic Images with a Global Sparsity Prior on Reflectance , 2011, NIPS.

[14] Jitendra Malik,et al. Normalized Cuts and Image Segmentation , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[15] Patrick Cavanagh,et al. Perceiving Illumination Inconsistencies in Scenes , 2005, Perception.

[16] Edward H. Adelson,et al. Ground truth dataset and baseline evaluations for intrinsic image algorithms , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[17] J J Koenderink,et al. What Does the Occluding Contour Tell Us about Solid Shape? , 1984, Perception.

[18] Paul Debevec,et al. Inverse global illumination: Recovering re?ectance models of real scenes from photographs , 1998 .

[19] Berthold K. P. Horn. SHAPE FROM SHADING: A METHOD FOR OBTAINING THE SHAPE OF A SMOOTH OPAQUE OBJECT FROM ONE VIEW , 1970 .

[20] Jitendra Malik,et al. Contour Continuity in Region Based Image Segmentation , 1998, ECCV.

[21] David A. Forsyth,et al. Variable-Source Shading Analysis , 2011, International Journal of Computer Vision.

[22] Mikhail Belkin,et al. Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[23] Charless C. Fowlkes,et al. Contour Detection and Hierarchical Image Segmentation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24] Nisheeth K. Vishnoi,et al. Biased normalized cuts , 2011, CVPR 2011.

[25] Edward H. Adelson,et al. Recovering intrinsic images from a single image , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26] Ashutosh Saxena,et al. Make3D: Learning 3D Scene Structure from a Single Still Image , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27] E. Land,et al. Lightness and retinex theory. , 1971, Journal of the Optical Society of America.

[28] David A. Forsyth,et al. Rendering synthetic objects into legacy photographs , 2011, ACM Trans. Graph..

[29] Pat Hanrahan,et al. An efficient representation for irradiance environment maps , 2001, SIGGRAPH.