A semi-local method for iterative depth-map refinement

Building a photorealistic, 3D model of an object or a complete scene from image-based methods is a fundamental problem in computer vision, and has many applications in robotic perception, navigation, exploration and mapping. In this paper, we extend current state-of-the-art in the computation of depth maps by presenting an accurate and computationally efficient iterative hierarchical algorithm for multi-view stereo. The algorithm is designed to utilise all available contextual information to compute highly-accurate and robust depth maps by iteratively examining different image resolutions in an image-pyramid. The novelty in our approach is that we are able to incrementally improve the depth fidelity as the algorithm progresses through the image pyramid by utilising a local method. This is achieved in a computationally efficient manner by simultaneously enforcing the consistency of the depth-map by continual comparison with neighbouring depth-maps. We present a detailed description of the algorithm, and describe how each step is carried out. The proposed technique is used to analyse multi-view stereo data from two well-known, standard datasets, and presented results show a significant decrease in computation time, as well as an increase in overall accuracy of the computed depth maps.

[1]  Michael M. Kazhdan,et al.  Poisson surface reconstruction , 2006, SGP '06.

[2]  Lutz Falkenhagen Hierarchical Block-Based Disparity Estimation Considering Neighbourhood Constraints , 1997 .

[3]  Slobodan Ilic,et al.  Probabilistic Disparity Fusion for Real-Time Motion-Stereo , 2010 .

[4]  Jean Ponce,et al.  Accurate, Dense, and Robust Multiview Stereopsis , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[6]  H. Hirschmüller Accurate and Efficient Stereo Processing by Semi-Global Matching and Mutual Information , 2005, CVPR.

[7]  Heiko Hirschmüller,et al.  Evaluation of Cost Functions for Stereo Matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Jan-Michael Frahm,et al.  Real-Time Visibility-Based Fusion of Depth Maps , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[9]  Ben Upcroft,et al.  Towards automated and in-situ, near-real time 3-D reconstruction of coral reef environments , 2011, OCEANS'11 MTS/IEEE KONA.

[10]  Luc Van Gool,et al.  Real-time connectivity constrained depth map computation using programmable graphics hardware , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[11]  Takeo Kanade,et al.  A Cooperative Algorithm for Stereo Matching and Occlusion Detection , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Zhengyou Zhang,et al.  A Progressive Scheme for Stereo Matching , 2000, SMILE.

[13]  Andrew J. Davison,et al.  Live dense reconstruction with a single moving camera , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[14]  Pascal Fua,et al.  On benchmarking camera calibration and multi-view stereo for high resolution imagery , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Hujun Bao,et al.  Consistent Depth Maps Recovery from a Video Sequence , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Ruigang Yang,et al.  Multi-resolution real-time stereo on commodity graphics hardware , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[17]  Roberto Cipolla,et al.  Using Multiple Hypotheses to Improve Depth-Maps for Multi-View Stereo , 2008, ECCV.