Paper information - Superpixel meshes for fast edge-preserving surface reconstruction

Multi-View Stereo (MVS) methods aim for the highest possible detail, but such detail is often not required. In this work, we propose a novel surface reconstruction method based on image edges, superpixels, and second-order smoothness constraints, producing meshes comparable in quality to classic MVS surfaces but orders of magnitude faster. Our method performs per-view dense depth optimization directly over sparse 3D Ground Control Points (GCPs), which removes the need for view pairing, image rectification, and stereo depth estimation, and allows full per-image parallelization. We use Structure-from-Motion (SfM) points as GCPs, but the method is not specific to these; LiDAR or RGB-D data, for example, can also be used. The resulting meshes are compact and inherently edge-aligned with image gradients, enabling good-quality, lightweight, per-face flat renderings. Our experiments on a variety of 3D datasets demonstrate superior speed and competitive surface quality.
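
The idea of optimizing dense depth over sparse GCPs under a second-order smoothness constraint can be illustrated with a toy example. The sketch below is not the authors' formulation: it assumes a 1D chain of nodes standing in for mesh vertices, a plain quadratic energy (squared data residuals at the GCPs plus squared second differences), and hypothetical names `interpolate_depth` and `solve_linear_system`. It only shows why a second-order penalty lets a few control points determine depth everywhere without flattening slanted surfaces.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Work on an augmented copy so the inputs are left untouched.
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    # Back substitution on the upper-triangular system.
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def interpolate_depth(num_nodes, gcps, smoothness=1.0):
    """Estimate one depth per node on a 1D chain (illustrative assumption).

    gcps maps a node index to an observed depth. The energy minimized is
        sum over GCPs of (d_i - g_i)^2
        + smoothness * sum over interior i of (d_{i-1} - 2 d_i + d_{i+1})^2
    At least two GCPs at distinct nodes are needed for a unique solution.
    """
    n = num_nodes
    a = [[0.0] * n for _ in range(n)]
    b = [0.0] * n
    # Data term: pulls the depth at each control point toward its observation.
    for i, g in gcps.items():
        a[i][i] += 1.0
        b[i] += g
    # Second-order smoothness: accumulate the normal equations of the
    # second-difference stencil (1, -2, 1) at every interior node.
    stencil = (1.0, -2.0, 1.0)
    for i in range(1, n - 1):
        idx = (i - 1, i, i + 1)
        for p in range(3):
            for q in range(3):
                a[idx[p]][idx[q]] += smoothness * stencil[p] * stencil[q]
    return solve_linear_system(a, b)


# Three sparse control points that happen to lie on the ramp d = 1 + 0.5 * i.
depth = interpolate_depth(10, {0: 1.0, 4: 3.0, 9: 5.5})
```

Because a second-difference penalty is zero for any linear ramp, the three collinear control points above are extended to a slanted line across all ten nodes. A first-order penalty would instead pull neighboring depths toward each other and bend the result toward constant depth between the control points.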
