Paper Information - Shape Estimation from Shading, Defocus, and Correspondence Using Light-Field Angular Coherence

Shape Estimation from Shading, Defocus, and Correspondence Using Light-Field Angular Coherence

Light-field cameras are quickly becoming commodity items, with consumer and industrial applications. They capture many nearby views simultaneously in a single image through a micro-lens array, thereby providing a wealth of cues for depth recovery: defocus, correspondence, and shading. In particular, apart from conventional image shading, one can refocus images after acquisition and shift one's viewpoint within the sub-apertures of the main lens, effectively obtaining multiple views. We present a principled algorithm for dense depth estimation that combines defocus and correspondence metrics. We then extend our analysis to the additional cue of shading, using it to refine fine details in the shape. By exploiting an all-in-focus image, in which pixels are expected to exhibit angular coherence, we define an optimization framework that integrates photo consistency, depth consistency, and shading consistency. We show that combining all three sources of information (defocus, correspondence, and shading) outperforms state-of-the-art light-field depth estimation algorithms in multiple scenarios.
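To make the idea of angular coherence concrete, the following is a minimal sketch on a toy 2D light field, not the paper's implementation. It assumes a light field stored as `lf[u, x]` (view index `u`, pixel index `x`), integer disparities, wrap-around shifts, and an unweighted sum of the two cues; the function names `shear`, `cue_costs`, and `estimate_disparity` are hypothetical, and the shading term and the optimization framework from the abstract are omitted. When the light field is sheared by the correct disparity, all views agree at each pixel, so both the refocused-versus-central difference (a defocus-style cue) and the variance across views (a correspondence-style cue) drop to zero.

```python
import numpy as np

def shear(lf, d):
    """Shift each view of a 2D light field lf[u, x] by d pixels per unit
    of angular offset from the central view (wrap-around at the borders)."""
    n_views = lf.shape[0]
    center = n_views // 2
    return np.stack([np.roll(lf[u], d * (u - center)) for u in range(n_views)])

def cue_costs(lf, d):
    """Per-pixel costs for one candidate disparity d (lower is better).
    These are simplified stand-ins for defocus and correspondence measures."""
    sheared = shear(lf, d)
    central = sheared[lf.shape[0] // 2]      # central (pinhole) view
    refocused = sheared.mean(axis=0)         # refocused image: average over views
    defocus = np.abs(refocused - central)    # refocused image vs. central view
    correspondence = sheared.var(axis=0)     # disagreement across views
    return defocus, correspondence

def estimate_disparity(lf, candidates):
    """Pick, per pixel, the candidate with the lowest summed cost.
    The equal weighting of the two cues is an assumption of this sketch."""
    costs = np.stack([sum(cue_costs(lf, d)) for d in candidates])
    return np.asarray(candidates)[np.argmin(costs, axis=0)]

# Synthetic fronto-parallel scene: every view is the same random texture,
# displaced by true_d pixels per unit of angular offset.
rng = np.random.default_rng(0)
texture = rng.random(64)
true_d, n_views = 2, 5
center = n_views // 2
lf = np.stack([np.roll(texture, -true_d * (u - center)) for u in range(n_views)])

estimated = estimate_disparity(lf, list(range(-3, 4)))
```

In this toy setup the shear with `d == true_d` undoes the per-view displacement exactly, so `estimated` equals `true_d` at every pixel. Real light fields would need sub-pixel interpolation, occlusion handling, and per-cue confidence weighting, none of which this sketch attempts.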
