论文信息 - Energy-based multi-view piecewise planar stereo

Energy-based multi-view piecewise planar stereo

The piecewise planar model (PPM) is an effective means of approximating a complex scene by using planar patches to give a complete interpretation of the spatial points reconstructed from projected 2D images. The traditional piecewise planar stereo methods suffer from either a very restricted number of directions for plane detection or heavy reliance on the segmentation accuracy of superpixels. To address these issues, we propose a new multi-view piecewise planar stereo method in this paper. Our method formulates the problem of complete scene reconstruction as a multi-level energy minimization problem. To detect planes along principal directions, a novel energy formulation with pair-wise potentials is used to assign an optimal plane for each superpixel in an iterative manner, where reliable scene priors and geometric constraints are incorporated to enhance the modeling efficacy and inference efficiency. To detect non-principal-direction planes, we adopt a multi-direction plane sweeping with a restricted search space method to generate reliable candidate planes. To handle the multi-surface straddling problem of a single superpixel, a superpixel sub-segmenting scheme is proposed and a robust Pn Potts model-like higher-order potential is introduced to refine the resulting depth map. Our method is a natural integration of pixel- and superpixel-level multi-view stereos under a unified energy minimization framework. Experimental results for standard data sets and our own data sets show that our proposed method can satisfactorily handle many challenging factors (e.g., slanted surfaces and poorly textured regions) and can obtain accurate piecewise planar depth maps.创新点对于基于图像的三维场景重建,由于光照变化、透视畸变、弱纹理等干扰因素的存在,传统像素级与区域级的重建算法通常难以获得可靠的结果。为了解决此问题,本文提出一种新颖的基于能量的场景分段平面重建算法。根据场景分段平面假设,本文算法在MRF(Markov Random Field)能量最小化框架下将场景完整结构推断问题转换为沿场景主方向与非主方向的平面标记以及平面级的场景结构优化等问题进行求解,由于候选平面集与融合灰度一致性度量、空间几何与可见性约束、空间平面先验的能量函数的高可靠性,因而可以快速获取完整、准确的场景模型。实验结果表明,本文算法不但可以有效地解决场景中弱纹理、倾斜表面等区域的重建问题,而且可以克服传统相关算法依赖特定场景模型(如Manhattan 场景模型)、易受图像过分割精度的影响等缺点,整体上具有较高的可靠性与效率。

Wei Wang | Zhanyi Hu | Lihua Hu

[1] Yuri Boykov,et al. Energy-Based Geometric Multi-model Fitting , 2012, International Journal of Computer Vision.

[2] Richard Szeliski,et al. Multiple View Object Cosegmentation Using Appearance and Stereo Cues , 2012, ECCV.

[3] Jan-Michael Frahm,et al. Real-Time Plane-Sweeping Stereo with Multiple Sweeping Directions , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[4] Andreas Klaus,et al. Segment-Based Stereo Matching Using Belief Propagation and a Self-Adapting Dissimilarity Measure , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[5] Delbert Dueck,et al. Clustering by Passing Messages Between Data Points , 2007, Science.

[6] Jan-Michael Frahm,et al. Piecewise planar and non-planar stereo for urban scene reconstruction , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[7] Tat-Jun Chin,et al. The Random Cluster Model for robust geometric fitting , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[8] Richard Szeliski,et al. Manhattan-world stereo , 2009, CVPR.

[9] Pushmeet Kohli,et al. Robust Higher Order Potentials for Enforcing Label Consistency , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[10] Robert C. Bolles,et al. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[11] Jean Ponce,et al. Accurate, Dense, and Robust Multiview Stereopsis , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12] Jean-Philippe Pons,et al. Robust piecewise-planar 3D reconstruction and completion from large-scale unstructured point data , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[13] Li Hong,et al. Segment-based stereo matching using graph cuts , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[14] Dorin Comaniciu,et al. Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[15] Zheng Zhi. A Region Based Stereo Matching Algorithm Using Cooperative Optimization , 2009 .

[16] Olga Veksler,et al. Fast Approximate Energy Minimization via Graph Cuts , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[17] Sing Bing Kang,et al. Stereo for Image-Based Rendering using Image Over-Segmentation , 2007, International Journal of Computer Vision.

[18] Carsten Rother,et al. PatchMatch Stereo - Stereo Matching with Slanted Support Windows , 2011, BMVC.

[19] Richard Szeliski,et al. Piecewise planar stereo for image-based rendering , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[20] Hyojin Kim,et al. Piecewise Planar Scene Reconstruction and Optimization for Multi-view Stereo , 2012, ACCV.

[21] A. Aydin Alatan,et al. Segment-Based Stereo-Matching Via Plane and Angle Sweeping , 2007, 2007 3DTV Conference.

[22] Anton Osokin,et al. Fast Approximate Energy Minimization with Label Costs , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[23] Jana Kosecka,et al. Multi-view Superpixel Stereo in Urban Environments , 2010, International Journal of Computer Vision.

[24] Pascal Fua,et al. Efficient large-scale multi-view stereo for ultra high-resolution image sets , 2011, Machine Vision and Applications.

[25] András Bódis-Szomorú,et al. Fast, Approximate Piecewise-Planar Modeling Based on Sparse Structure-from-Motion and Superpixels , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[26] Stephen Gould,et al. Decomposing a scene into geometric and semantically consistent regions , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[27] Jana Kosecka,et al. Piecewise planar city 3D modeling from street view panoramic sequences , 2009, CVPR.