Depth map estimation with 4D light fields using confocal stereo

We present a scene depth map generation method based on light field cameras. From the plenoptic function, the angular information about each image point under different sizes of aperture is extracted, which could be used for confocal stereo. Considering confocal constancy and gradient constancy, we take into account two constraints: (1) When a pixel is in focus, its relative intensities across aperture should match the variation predicted by the relative exitance of the lens; and (2) When a pixel is in focus, the gradient of the pixel should equal to that of the corresponding pixel in reference image. Based on these two constraints, we develop data term which measures the probability of each pixel in each depth. Considering the textureless area, we also develop smoothness term which helps to determine the depth of textureless area by its neighboring texture area. Finally, the depth map is estimated via multi-label optimization and weighted median filtering.

[1]  Frédo Durand,et al.  Image and depth from a conventional camera with a coded aperture , 2007, ACM Trans. Graph..

[2]  Nanning Zheng,et al.  Stereo Matching Using Belief Propagation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Sven Wanner,et al.  Variational Light Field Analysis for Disparity Estimation and Super-Resolution. , 2013, IEEE transactions on pattern analysis and machine intelligence.

[4]  Sven Wanner,et al.  Globally consistent depth labeling of 4D light fields , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Jitendra Malik,et al.  Depth from Combining Defocus and Correspondence Using Light-Field Cameras , 2013, 2013 IEEE International Conference on Computer Vision.

[6]  Daniel Cremers,et al.  Global Solutions of Variational Models with Convex Regularization , 2010, SIAM J. Imaging Sci..

[7]  In-So Kweon,et al.  Accurate depth map estimation from a lenslet light field camera , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Ramesh Raskar,et al.  Depth Map Estimation and Colorization of Anaglyph Images Using Local Color Prior and Reverse Intensity Distribution , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[9]  Andrew Lumsdaine,et al.  Reducing Plenoptic Camera Artifacts , 2010, Comput. Graph. Forum.

[10]  P. Hanrahan,et al.  Light Field Photography with a Hand-held Plenoptic Camera , 2005 .

[11]  Tom E. Bishop,et al.  The Light Field Camera: Extended Depth of Field, Aliasing, and Superresolution , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Sven Wanner,et al.  Variational Light Field Analysis for Disparity Estimation and Super-Resolution , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Enhua Wu,et al.  Constant Time Weighted Median Filtering for Stereo Matching and Beyond , 2013, 2013 IEEE International Conference on Computer Vision.

[14]  P. Hanrahan,et al.  Digital light field photography , 2006 .

[15]  Kiriakos N. Kutulakos,et al.  Confocal Stereo , 2006, International Journal of Computer Vision.