论文信息 - Plane Completion and Filtering for Multi-View Stereo Reconstruction

Plane Completion and Filtering for Multi-View Stereo Reconstruction

Multi-View Stereo (MVS)-based 3D reconstruction is a major topic in computer vision for which a vast number of methods have been proposed over the last decades showing impressive visual results. Long-since, benchmarks like Middlebury [32] numerically rank the individual methods considering accuracy and completeness as quality attributes. While the Middlebury benchmark provides low-resolution images only, the recently published ETH3D [31] and Tanks and Temples [19] benchmarks allow for an evaluation of high-resolution and large-scale MVS from natural camera configurations. This benchmarking reveals that still only few methods can be used for the reconstruction of large-scale models. We present an effective pipeline for large-scale 3D reconstruction which extends existing methods in several ways: (i) We introduce an outlier filtering considering the MVS geometry. (ii) To avoid incomplete models from local matching methods we propose a plane completion method based on growing superpixels allowing a generic generation of high-quality 3D models. (iii) Finally, we use deep learning for a subsequent filtering of outliers in segmented sky areas. We give experimental evidence on benchmarks that our contributions improve the quality of the 3D model and our method is state-of-the-art in high-quality 3D reconstruction from high-resolution images or large image sets.

[1] M. Goesele,et al. Floating scale surface reconstruction , 2014, ACM Trans. Graph..

[2] ARNO KNAPITSCH,et al. Tanks and temples , 2017, ACM Trans. Graph..

[3] Matteo Matteucci,et al. TAPA-MVS: Textureless-Aware PAtchMatch Multi-View Stereo , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[4] Pascal Fua,et al. On benchmarking camera calibration and multi-view stereo for high resolution imagery , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[5] Heiko Hirschmüller,et al. Evaluation of Stereo Matching Costs on Images with Radiometric Differences , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6] Jean-Philippe Pons,et al. High Accuracy and Visibility-Consistent Dense Multiview Stereo , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7] Richard Szeliski,et al. A Comparison and Evaluation of Multi-View Stereo Reconstruction Algorithms , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[8] Heiko Hirschmüller,et al. A TV Prior for High-Quality Local Multi-view Stereo Reconstruction , 2014, 2014 2nd International Conference on 3D Vision.

[9] George Papandreou,et al. Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation , 2018, ECCV.

[10] Bolei Zhou,et al. Scene Parsing through ADE20K Dataset , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11] Jan-Michael Frahm,et al. Pixelwise View Selection for Unstructured Multi-View Stereo , 2016, ECCV.

[12] Michael Brady,et al. Practical Structure and Motion from Stereo When Motion is Unconstrained , 2000, International Journal of Computer Vision.

[13] Tomás Pajdla,et al. Multi-view reconstruction preserving weakly-supported surfaces , 2011, CVPR 2011.

[14] Jan-Michael Frahm,et al. Structure-from-Motion Revisited , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15] Simon Fuhrmann,et al. MVE - A Multi-View Reconstruction Environment , 2014, GCH.

[16] Long Quan,et al. MVSNet: Depth Inference for Unstructured Multi-view Stereo , 2018, ECCV.

[17] Horst Bischof,et al. Scalable Surface Reconstruction from Point Clouds with Extreme Scale and Density Diversity , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18] Heiko Hirschmüller,et al. A TV Prior for High-Quality Scalable Multi-View Stereo Reconstruction , 2017, International Journal of Computer Vision.

[19] Narendra Ahuja,et al. DeepMVS: Learning Multi-view Stereopsis , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[20] Konrad Schindler,et al. Massively Parallel Multiview Stereopsis by Surface Normal Diffusion , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[21] Luc Van Gool,et al. SEEDS: Superpixels Extracted Via Energy-Driven Sampling , 2012, International Journal of Computer Vision.

[22] Anders Bjorholm Dahl,et al. Large-Scale Data for Multiple-View Stereopsis , 2016, International Journal of Computer Vision.

[23] Jan-Michael Frahm,et al. PatchMatch Based Joint View Selection and Depthmap Estimation , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[24] Michael M. Kazhdan,et al. Screened poisson surface reconstruction , 2013, TOGS.

[25] Jan-Michael Frahm,et al. Improvement of Extrinsic Parameters from a Single Stereo Pair , 2018, 2018 IEEE Winter Conference on Applications of Computer Vision (WACV).

[26] Sebastian Ramos,et al. The Cityscapes Dataset for Semantic Urban Scene Understanding , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27] Carsten Rother,et al. PatchMatch Stereo - Stereo Matching with Slanted Support Windows , 2011, BMVC.

[28] Heiko Hirschmüller,et al. Multi-Resolution Range Data Fusion for Multi-View Stereo Reconstruction , 2013, GCPR.

[29] Heiko Hirschmüller,et al. Stereo Processing by Semiglobal Matching and Mutual Information , 2008, IEEE Trans. Pattern Anal. Mach. Intell..

[30] Jean Ponce,et al. Accurate, Dense, and Robust Multiview Stereopsis , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31] Helmut Mayer,et al. Incremental Division of Very Large Point Clouds for Scalable 3D Surface Reconstruction , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[32] Torsten Sattler,et al. A Multi-view Stereo Benchmark with High-Resolution Images and Multi-camera Videos , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33] Michael Goesele,et al. Multi-View Stereo for Community Photo Collections , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[34] Wei-Ta Chu,et al. Camera as weather sensor: Estimating weather information from single images , 2017, J. Vis. Commun. Image Represent..

[35] Jean-Philippe Pons,et al. Robust and Efficient Surface Reconstruction From Range Data , 2009, Comput. Graph. Forum.

[36] Wenbing Tao,et al. Multi-Scale Geometric Consistency Guided Multi-View Stereo , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[37] François Chollet,et al. Xception: Deep Learning with Depthwise Separable Convolutions , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[38] Tomás Pajdla,et al. Hallucination-Free Multi-View Stereo , 2010, ECCV Workshops.

[39] Simon Fuhrmann,et al. MVE - An image-based reconstruction environment , 2015, Comput. Graph..