Confocal Stereo

We present confocal stereo, a new method for computing 3D shape by controlling the focus and aperture of a lens. The method is specifically designed for reconstructing scenes with high geometric complexity or fine-scale texture. To achieve this, we introduce the confocal constancy property, which states that as the lens aperture varies, the pixel intensity of a visible in-focus scene point will vary in a scene-independent way, that can be predicted by prior radiometric lens calibration. The only requirement is that incoming radiance within the cone subtended by the largest aperture is nearly constant. First, we develop a detailed lens model that factors out the distortions in high resolution SLR cameras (12MP or more) with large-aperture lenses (e.g., f1.2). This allows us to assemble an A×F aperture-focus image (AFI) for each pixel, that collects the undistorted measurements over all A apertures and F focus settings. In the AFI representation, confocal constancy reduces to color comparisons within regions of the AFI, and leads to focus metrics that can be evaluated separately for each pixel. We propose two such metrics and present initial reconstruction results for complex scenes, as well as for a scene with known ground-truth shape.

[1]  I. Spak [Technical innovations]. , 1966, Nordisk medicin.

[2]  Alex Pentland,et al.  A New Sense for Depth of Field , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Trevor Darrell,et al.  Pyramid based depth from focus , 1988, Proceedings CVPR '88: The Computer Society Conference on Computer Vision and Pattern Recognition.

[4]  Edward H. Adelson,et al.  Single Lens Stereo with a Plenoptic Camera , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  C. Fraser,et al.  Variation of distortion within the photographic field , 1992 .

[6]  H.N. Nair,et al.  Robust focus ranging , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[7]  Steven A. Shafer,et al.  What is the center of the image? , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Glenn Healey,et al.  Radiometric CCD camera calibration and noise estimation , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Reg G. Willson Modeling and calibration of automated zoom lenses , 1994, Other Conferences.

[10]  Shree K. Nayar,et al.  Real-time focus range sensor , 1995, Proceedings of IEEE International Conference on Computer Vision.

[11]  R. Webb Confocal optical microscopy , 1996 .

[12]  Marc Levoy,et al.  Light field rendering , 1996, SIGGRAPH.

[13]  Shree K. Nayar,et al.  Telecentric Optics for Focus Analysis , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Eero P. Simoncelli,et al.  Range estimation by optical differentiation. , 1998, Journal of the Optical Society of America. A, Optics, image science, and vision.

[15]  Naoki Asada,et al.  Seeing Behind the Scene: Analysis of Photometric Properties of Occluding Edges by the Reversed Projection Blurring Model , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Subhasis Chaudhuri,et al.  An MRF Model-Based Approach to Simultaneous Recovery of Depth and Restoration from Defocused Images , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Sing Bing Kang,et al.  Can We Calibrate a Camera Using an Image of a Flat, Textureless Lambertian Surface? , 2000, ECCV.

[18]  Guillermo Sapiro,et al.  Image inpainting , 2000, SIGGRAPH.

[19]  Leonard McMillan,et al.  Dynamically reparameterized light fields , 2000, SIGGRAPH.

[20]  Jean-Yves Bouguet,et al.  Camera calibration toolbox for matlab , 2001 .

[21]  Subhasis Chaudhuri,et al.  Depth From Defocus in Presence of Partial Self Occlusion , 2001, ICCV.

[22]  Hailin Jin,et al.  A Variational Approach to Shape from Defocus , 2002, ECCV.

[23]  Stefano Soatto,et al.  Learning Shape from Defocus , 2002, ECCV.

[24]  Stefano Soatto,et al.  Seeing beyond occlusions (and other marvels of a finite lens aperture) , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[25]  Moon Gi Kang,et al.  Super-resolution image reconstruction: a technical overview , 2003, IEEE Signal Process. Mag..

[26]  Stefano Soatto,et al.  3D shape from anisotropic diffusion , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[27]  Steven A. Shafer,et al.  Moment and Hypergeometric Filters for High Precision Computation of Focus, Stereo and Optical Flow , 1994, International Journal of Computer Vision.

[28]  Marc Levoy,et al.  Synthetic aperture confocal imaging , 2004, ACM Trans. Graph..

[29]  Sylvain Paris,et al.  Capture of hair geometry from multiple images , 2004, SIGGRAPH 2004.

[30]  Sylvain Paris,et al.  Capture of hair geometry from multiple images , 2004, ACM Trans. Graph..

[31]  Kiyoharu Aizawa,et al.  All-focused light field rendering , 2004, Rendering Techniques.

[32]  Simon Baker,et al.  Lucas-Kanade 20 Years On: A Unifying Framework , 2004, International Journal of Computer Vision.

[33]  Stefano Soatto,et al.  Observing Shape from Defocused Images , 2004, International Journal of Computer Vision.

[34]  Richard Szeliski,et al.  High-quality video view interpolation using a layered representation , 2004, SIGGRAPH 2004.

[35]  Shree K. Nayar,et al.  Modeling the space of camera response functions , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Naoki Asada,et al.  Edge and Depth from Focus , 2004, International Journal of Computer Vision.

[37]  Eric Krotkov,et al.  Focusing , 2004, International Journal of Computer Vision.

[38]  Shree K. Nayar,et al.  Rational Filters for Passive Depth from Defocus , 1998, International Journal of Computer Vision.

[39]  Yoav Y. Schechner,et al.  Depth from Defocus vs. Stereo: How Different Really Are They? , 2004, International Journal of Computer Vision.

[40]  Kiriakos N. Kutulakos,et al.  A Theory of Shape by Space Carving , 2000, International Journal of Computer Vision.

[41]  Steven M. Seitz,et al.  Example-based photometric stereo: shape reconstruction with general, varying BRDFs , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42]  Harry Shum,et al.  Modeling hair from multiple views , 2005, ACM Trans. Graph..

[43]  Frédo Durand,et al.  Defocus video matting , 2005, ACM Trans. Graph..

[44]  Murali Subbarao,et al.  Depth from defocus: A spatial domain approach , 1994, International Journal of Computer Vision.

[45]  Andrew W. Fitzgibbon,et al.  Image-Based Rendering Using Image-Based Priors , 2005, International Journal of Computer Vision.

[46]  Ren Ng Fourier slice photography , 2005, ACM Trans. Graph..

[47]  Stefano Soatto,et al.  A geometric approach to shape from defocus , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[48]  Kiriakos N. Kutulakos,et al.  Confocal Stereo , 2006, ECCV.

[49]  S. Nayar,et al.  Projection defocus analysis for scene capture and image display , 2006, ACM Trans. Graph..

[50]  Marc Levoy,et al.  Reconstructing Occluded Surfaces Using Synthetic Apertures: Stereo, Focus and Robust Measures , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[51]  Frédo Durand,et al.  Image and depth from a conventional camera with a coded aperture , 2007, ACM Trans. Graph..

[52]  Frédo Durand,et al.  Multi-aperture photography , 2007, ACM Trans. Graph..

[53]  Kiriakos N. Kutulakos,et al.  A Layer-Based Restoration Framework for Variable-Aperture Photography , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[54]  Ramesh Raskar,et al.  Dappled photography: mask enhanced cameras for heterodyned light fields and coded aperture refocusing , 2007, ACM Trans. Graph..

[55]  P. Belhumeur,et al.  Active refocusing of images and videos , 2007, ACM Trans. Graph..

[56]  Jitendra Malik,et al.  Recovering high dynamic range radiance maps from photographs , 1997, SIGGRAPH '08.