Finite Aperture Stereo: 3D Reconstruction of Macro-Scale Scenes

While the accuracy of multi-view stereo (MVS) has continued to advance, its performance reconstructing challenging scenes from images with a limited depth of field is generally poor. Typical implementations assume a pinhole camera model, and therefore treat defocused regions as a source of outlier. In this paper, we address these limitations by instead modelling the camera as a thick lens. Doing so allows us to exploit the complementary nature of stereo and defocus information, and overcome constraints imposed by traditional MVS methods. Using our novel reconstruction framework, we recover complete 3D models of complex macro-scale scenes. Our approach demonstrates robustness to view-dependent materials, and outperforms state-of-the-art MVS and depth from defocus across a range of real and synthetic datasets.

[1]  Lei Zhang,et al.  Detail-Preserving and Content-Aware Variational Multi-View Stereo Reconstruction , 2015, IEEE Transactions on Image Processing.

[2]  Paolo Favaro,et al.  Shape from Focus and Defocus: Convexity, Quasiconvexity and Defocus-Invariant Textures , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[3]  Lauren A. Christopher,et al.  3-D Scene Reconstruction Using Depth from Defocus and Deep Learning , 2019, 2019 IEEE Applied Imagery Pattern Recognition Workshop (AIPR).

[4]  Alex Pentland,et al.  A New Sense for Depth of Field , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Paolo Favaro,et al.  Recovering thin structures via nonlocal-means regularization with application to depth from defocus , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6]  Kun Zhou,et al.  Bayesian Depth-from-Defocus with Shading Constraints , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Narendra Ahuja,et al.  DeepMVS: Learning Multi-view Stereopsis , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[8]  Roberto Cipolla,et al.  Multiview Stereo via Volumetric Graph-Cuts and Occlusion Robust Photo-Consistency , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  F. Fraundorfer,et al.  DeepC-MVS: Deep Confidence Prediction for Multi-View Stereo Reconstruction , 2019, 2020 International Conference on 3D Vision (3DV).

[10]  Kiriakos N. Kutulakos,et al.  Confocal Stereo , 2006, International Journal of Computer Vision.

[11]  Leif Kobbelt,et al.  Hierarchical Volumetric Multi-view Stereo Reconstruction of Manifold Surfaces based on Dual Graph Embedding , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[12]  Michael M. Kazhdan,et al.  Screened poisson surface reconstruction , 2013, TOGS.

[13]  Jean Ponce,et al.  Accurate, Dense, and Robust Multiview Stereopsis , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Joachim Weickert,et al.  Physically inspired depth-from-defocus , 2017, Image Vis. Comput..

[15]  Masako Kashiwagi,et al.  Deep Depth From Aberration Map , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[16]  Long Quan,et al.  Recurrent MVSNet for High-Resolution Multi-View Stereo Depth Inference , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Olga Veksler,et al.  Fast approximate energy minimization via graph cuts , 2001, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[18]  Stefano Soatto,et al.  This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE IEEE TRANSACTION OF PATTERN RECO , 2022 .

[19]  Lu Fang,et al.  SurfaceNet: An End-to-End 3D Neural Network for Multiview Stereopsis , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[20]  Fatih Porikli,et al.  Deblur and deep depth from single defocus image , 2021, Mach. Vis. Appl..

[21]  Ting-Chun Wang,et al.  Depth from Semi-Calibrated Stereo and Defocus , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  Vincent Lepetit,et al.  DAISY: An Efficient Dense Descriptor Applied to Wide-Baseline Stereo , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Pascal Fua,et al.  Efficient large-scale multi-view stereo for ultra high-resolution image sets , 2011, Machine Vision and Applications.

[24]  Pratul P. Srinivasan,et al.  NeRF , 2020, ECCV.

[25]  Hui Zhou,et al.  Blur-Aware Disparity Estimation from Defocus Stereo Images , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[26]  Jürgen Beyerer,et al.  A New Approach for Estimating Depth by Fusing Stereo and Defocus Information , 2007, GI Jahrestagung.

[27]  Jean-Yves Guillemaut,et al.  A Novel Depth from Defocus Framework Based on a Thick Lens Camera Model , 2020, 2020 International Conference on 3D Vision (3DV).

[28]  Frédéric Champagnat,et al.  Deep Depth from Defocus: how can defocus blur improve 3D estimation using dense neural networks? , 2018, ECCV Workshops.

[29]  Sunil Hadap,et al.  Regularized depth from defocus , 2008, 2008 15th IEEE International Conference on Image Processing.

[30]  Marc Pollefeys,et al.  Photometric Bundle Adjustment for Dense Multi-view 3D Modeling , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[31]  Qionghai Dai,et al.  Iterative Feedback Estimation of Depth and Radiance from Defocused Images , 2012, ACCV.

[32]  Jing Xu,et al.  Point-Based Multi-View Stereo Network , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[33]  Derek Bradley,et al.  Accurate multi-view reconstruction using robust binocular stereo and surface meshing , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[34]  Sergey Zagoruyko,et al.  Depth Camera Based on Color-Coded Aperture , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[35]  Jitendra Malik,et al.  Learning a Multi-View Stereo Machine , 2017, NIPS.

[36]  Yasuyuki Matsushita,et al.  High-quality shape from multi-view stereo and shading under general illumination , 2011, CVPR 2011.

[37]  Kyoung Mu Lee,et al.  Depth Estimation Network for Dual Defocused Images with Different Depth-of-Field , 2018, 2018 25th IEEE International Conference on Image Processing (ICIP).

[38]  David Lee,et al.  Dual Aperture Photography: Image and Depth from a Mobile Camera , 2015, 2015 IEEE International Conference on Computational Photography (ICCP).

[39]  Ayan Chakrabarti,et al.  Depth and Deblurring from a Spectrally-Varying Depth-of-Field , 2012, ECCV.

[40]  Long Quan,et al.  MVSNet: Depth Inference for Unstructured Multi-view Stereo , 2018, ECCV.

[41]  Martin S Banks,et al.  Focus information is used to interpret binocular images. , 2010, Journal of vision.

[42]  Zhang Chen,et al.  A Learning-based Framework for Hybrid Depth-from-Defocus and Stereo Matching , 2017, ArXiv.

[43]  Tao Guan,et al.  P-MVSNet: Learning Patch-Wise Matching Confidence Aggregation for Multi-View Stereo , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[44]  Silvio Savarese,et al.  3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction , 2016, ECCV.

[45]  Roberto Cipolla,et al.  A Differential Volumetric Approach to Multi-View Photometric Stereo , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[46]  Nikos Komodakis,et al.  Learning to compare image patches via convolutional neural networks , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[47]  Jitendra Malik,et al.  Shape Estimation from Shading, Defocus, and Correspondence Using Light-Field Angular Coherence , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[48]  Michael S. Langer,et al.  Optimal Camera Parameters for Depth from Defocus , 2015, 2015 International Conference on 3D Vision.

[49]  Jitendra Malik,et al.  Depth from Combining Defocus and Correspondence Using Light-Field Cameras , 2013, 2013 IEEE International Conference on Computer Vision.

[50]  Kiriakos N. Kutulakos,et al.  Depth from Defocus in the Wild , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[51]  C. Fraser,et al.  Accurate and occlusion-robust multi-view stereo , 2015 .

[52]  Steven W. Zucker,et al.  Differential Geometric Inference in Surface Stereo , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[53]  Qionghai Dai,et al.  A Point-Cloud-Based Multiview Stereo Algorithm for Free-Viewpoint Video , 2010, IEEE Transactions on Visualization and Computer Graphics.

[54]  Siyu Zhu,et al.  Cascade Cost Volume for High-Resolution Multi-View Stereo and Stereo Matching , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[55]  Can Chen,et al.  Depth Recovery from Light Field Using Focal Stack Symmetry , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[56]  Rami Ben-Ari,et al.  A Unified Approach for Registration and Depth in Depth from Defocus , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[57]  Shree K. Nayar,et al.  Rational Filters for Passive Depth from Defocus , 1998, International Journal of Computer Vision.

[58]  Konrad Schindler,et al.  Massively Parallel Multiview Stereopsis by Surface Normal Diffusion , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[59]  Richard Szeliski,et al.  A Comparative Study of Energy Minimization Methods for Markov Random Fields with Smoothness-Based Priors , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[60]  Jan-Michael Frahm,et al.  Pixelwise View Selection for Unstructured Multi-View Stereo , 2016, ECCV.

[61]  Truong Q. Nguyen,et al.  Depth estimation from focus and disparity , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[62]  Kosuke Sato,et al.  Fusing Depth from Defocus and Stereo with Coded Apertures , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[63]  Stefano Soatto,et al.  A geometric approach to shape from defocus , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[64]  N. Kiryati,et al.  Depth from Defocus vs. Stereo: How Different Really Are They? , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[65]  Subhasis Chaudhuri,et al.  Depth estimation and image restoration using defocused stereo pairs , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[66]  A. N. Rajagopalan,et al.  Towards Unrestrained Depth Inference with Coherent Occlusion Filling , 2012, International Journal of Computer Vision.

[67]  Feng Li,et al.  Dual-focus stereo imaging , 2010, J. Electronic Imaging.

[68]  Carl Olsson,et al.  In Defense of 3D-Label Stereo , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[69]  Michael Möller,et al.  Variational Depth From Focus Reconstruction , 2014, IEEE Transactions on Image Processing.