3D Scene Reconstruction from Multiple Spherical Stereo Pairs

We propose a 3D environment modelling method using multiple pairs of high-resolution spherical images. Spherical images of a scene are captured using a rotating line scan camera. Reconstruction is based on stereo image pairs with a vertical displacement between camera views. A 3D mesh model for each pair of spherical images is reconstructed by stereo matching. For accurate surface reconstruction, we propose a PDE-based disparity estimation method which produces continuous depth fields with sharp depth discontinuities even in occluded and highly textured regions. A full environment model is constructed by fusion of partial reconstruction from spherical stereo pairs at multiple widely spaced locations. To avoid camera calibration steps for all camera locations, we calculate 3D rigid transforms between capture points using feature matching and register all meshes into a unified coordinate system. Finally a complete 3D model of the environment is generated by selecting the most reliable observations among overlapped surface measurements considering surface visibility, orientation and distance from the camera. We analyse the characteristics and behaviour of errors for spherical stereo imaging. Performance of the proposed algorithm is evaluated against ground-truth from the Middlebury stereo test bed and LIDAR scans. Results are also compared with conventional structure-from-motion algorithms. The final composite model is rendered from a wide range of viewpoints with high quality textures.

[1]  Jan-Michael Frahm,et al.  Towards Urban 3D Reconstruction from Video , 2006, Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT'06).

[2]  Robert B. Fisher,et al.  Special Issue on Registration and Fusion of Range Images , 2002, Comput. Vis. Image Underst..

[3]  Adrian Hilton,et al.  Implicit Surface-Based Geometric Fusion , 1998, Comput. Vis. Image Underst..

[4]  Adrian Hilton,et al.  Environment modelling using spherical stereo imaging , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[5]  Adrian Hilton,et al.  Scene modelling from sparse 3D data , 2005, Image Vis. Comput..

[6]  Joachim Weickert,et al.  Optic Flow Goes Stereo: A Variational Method for Estimating Discontinuity-Preserving Dense Disparity Maps , 2005, DAGM-Symposium.

[7]  Michael Bosse,et al.  Calibrated, Registered Images of an Extended Urban Area , 2003, International Journal of Computer Vision.

[8]  Mikhail Sizintsev Hierarchical Stereo with Thin Structures and Transparency , 2008, 2008 Canadian Conference on Computer and Robot Vision.

[9]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[10]  Michael M. Kazhdan,et al.  Poisson surface reconstruction , 2006, SGP '06.

[11]  Daphna Weinshall,et al.  Realtime IBR with omnidirectional crossed-slits projection , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[12]  P. Burt Fast filter transform for image processing , 1981 .

[13]  Andrew W. Fitzgibbon,et al.  Global stereo reconstruction under second order smoothness priors , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Gérard G. Medioni,et al.  Object modeling by registration of multiple range images , 1991, Proceedings. 1991 IEEE International Conference on Robotics and Automation.

[15]  Adrian Hilton,et al.  3D Modelling of Static Environments Using Multiple Spherical Stereo , 2010, ECCV Workshops.

[16]  Shree K. Nayar,et al.  360 x 360 Mosaics , 2000, Computer Vision and Pattern Recognition.

[17]  Kwanghoon Sohn,et al.  Cost Aggregation and Occlusion Handling With WLS in Stereo Matching , 2008, IEEE Transactions on Image Processing.

[18]  Luc Van Gool,et al.  3D Urban Scene Modeling Integrating Recognition and Reconstruction , 2008, International Journal of Computer Vision.

[19]  Rachid Deriche,et al.  Symmetrical Dense Optical Flow Estimation with Occlusions Detection , 2002, International Journal of Computer Vision.

[20]  Katsushi Ikeuchi,et al.  Disparity map refinement and 3D surface smoothing via Directed Anisotropic Diffusion , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[21]  D. Nistér,et al.  Stereo Matching with Color-Weighted Correlation, Hierarchical Belief Propagation, and Occlusion Handling , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Ryad Benosman,et al.  Panoramic stereovision sensor , 1998 .

[23]  Svetlana Lazebnik,et al.  Superparsing , 2010, International Journal of Computer Vision.

[24]  Rachid Deriche,et al.  Dense Disparity Map Estimation Respecting Image Discontinuities: A PDE and Scale-Space BasedApproach , 2002, MVA.

[25]  Pascal Fua,et al.  On benchmarking camera calibration and multi-view stereo for high resolution imagery , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[26]  Jean-Philippe Pons,et al.  Towards high-resolution large-scale multi-view stereo , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[27]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[28]  Norbert Haala,et al.  Panoramic Scenes for Texture Mapping of 3 D City Models , 2005 .

[29]  Jean Ponce,et al.  Accurate, Dense, and Robust Multiview Stereopsis , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Luc Van Gool,et al.  Wide-baseline stereo from multiple views: A probabilistic account , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[31]  Richard Szeliski,et al.  Towards Internet-scale multi-view stereo , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[32]  Naokazu Yokoya,et al.  3D modeling of outdoor environments by integrating omnidirectional range and color images , 2005, Fifth International Conference on 3-D Digital Imaging and Modeling (3DIM'05).

[33]  Mariette Yvinec,et al.  Surface Reconstruction from Multi-View Stereo , 2009 .

[34]  Jan-Michael Frahm,et al.  Real-Time Visibility-Based Fusion of Depth Maps , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[35]  P. J. Burt,et al.  Fast Filter Transforms for Image Processing , 1981 .

[36]  Andrew W. Fitzgibbon,et al.  Bundle Adjustment - A Modern Synthesis , 1999, Workshop on Vision Algorithms.

[37]  Katsushi Ikeuchi,et al.  Omnidirectional texturing based on robust 3D registration through Euclidean reconstruction from two spherical images , 2010, Comput. Vis. Image Underst..

[38]  Christian Früh,et al.  Google Street View: Capturing the World at Street Level , 2010, Computer.

[39]  Christopher Hunt,et al.  Notes on the OpenSURF Library , 2009 .

[40]  Frank Dellaert,et al.  Structure from motion without correspondence , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[41]  Luc Van Gool,et al.  Procedural 3D Building Reconstruction Using Shape Grammars and Detectors , 2011, 2011 International Conference on 3D Imaging, Modeling, Processing, Visualization and Transmission.

[42]  Kwanghoon Sohn,et al.  Hierarchical depth estimation for image synthesis in mixed reality , 2003, IS&T/SPIE Electronic Imaging.

[43]  Maarten Vergauwen,et al.  Web-based 3D Reconstruction Service , 2006, Machine Vision and Applications.

[44]  Richard Szeliski,et al.  3-D Scene Data Recovery Using Omnidirectional Multibaseline Stereo , 2004, International Journal of Computer Vision.

[45]  Richard Szeliski,et al.  Manhattan-world stereo , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[46]  Daphna Weinshall,et al.  Mosaicing New Views: The Crossed-Slits Projection , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[47]  Michael J. Black,et al.  Learning Optical Flow , 2008, ECCV.

[48]  T. Poggio,et al.  A generalized ordering constraint for stereo correspondence , 1984 .

[49]  Paul J. Besl,et al.  A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[50]  Tomas Pajdla,et al.  3D Metric Reconstruction from Uncalibrated Omnidirectional Images , 2004 .

[51]  Jana Kosecka,et al.  Piecewise planar city 3D modeling from street view panoramic sequences , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[52]  Steven M. Seitz,et al.  Photo tourism: exploring photo collections in 3D , 2006, ACM Trans. Graph..

[53]  Kwanghoon Sohn,et al.  Hierarchical disparity estimation with energy-based regularization , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[54]  Nir A. Sochen,et al.  Variational Stereo Vision with Sharp Discontinuities and Occlusion Handling , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[55]  Thomas Brox,et al.  High Accuracy Optical Flow Estimation Based on a Theory for Warping , 2004, ECCV.

[56]  Shenchang Eric Chen,et al.  QuickTime VR: an image-based approach to virtual environment navigation , 1995, SIGGRAPH.

[57]  Avinash C. Kak,et al.  Vision for Mobile Robot Navigation: A Survey , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[58]  D. Scharstein,et al.  A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, Proceedings IEEE Workshop on Stereo and Multi-Baseline Vision (SMBV 2001).

[59]  Hans-Peter Seidel,et al.  PDE-Based Anisotropic Disparity-Driven Stereo Vision , 2008, VMV.

[60]  Richard Szeliski,et al.  Building Rome in a day , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[61]  Richard Szeliski,et al.  Modeling the World from Internet Photo Collections , 2008, International Journal of Computer Vision.

[62]  John A. Williams,et al.  Simultaneous Registration of Multiple Corresponding Point Sets , 2001, Comput. Vis. Image Underst..

[63]  Janusz Konrad,et al.  Occlusion-Aware Optical Flow Estimation , 2008, IEEE Transactions on Image Processing.

[64]  Andreas Klaus,et al.  Segment-Based Stereo Matching Using Belief Propagation and a Self-Adapting Dissimilarity Measure , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[65]  Marc Levoy,et al.  Efficient variants of the ICP algorithm , 2001, Proceedings Third International Conference on 3-D Digital Imaging and Modeling.

[66]  Maxime Lhuillier,et al.  Automatic scene structure and camera motion using a catadioptric system , 2008, Comput. Vis. Image Underst..

[67]  Claes Johnson Numerical solution of partial differential equations by the finite element method , 1988 .

[68]  Shigang Li Real-Time Spherical Stereo , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[69]  Shree K. Nayar,et al.  360/spl times/360 mosaics , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[70]  Mariette Yvinec,et al.  Surface Reconstruction from MultiView Stereo ∗ , 2009 .

[71]  Xavier Pennec,et al.  Rigid Point-Surface Registration Using an EM Variant of ICP for Computer Guided Oral Implantology , 2001, MICCAI.

[72]  X. Pennec,et al.  Rigid Point-Surface Registration using Oriented Points and an EM Variant of ICP for Computer Guided , 2001 .

[73]  N. Mitra,et al.  4-points congruent sets for robust pairwise surface registration , 2008, SIGGRAPH 2008.

[74]  Denis Laurendeau,et al.  A General Surface Approach to the Integration of a Set of Range Views , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[75]  Nikos Paragios,et al.  Random Exploration of the Procedural Space for Single-View 3D Modeling of Buildings , 2011, International Journal of Computer Vision.

[76]  Jan-Michael Frahm,et al.  Building Rome on a Cloudless Day , 2010, ECCV.

[77]  Michael Goesele,et al.  Multi-View Stereo for Community Photo Collections , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[78]  Hans-Hellmut Nagel,et al.  An Investigation of Smoothness Constraints for the Estimation of Displacement Vector Fields from Image Sequences , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[79]  Joachim Weickert,et al.  A Review of Nonlinear Diffusion Filtering , 1997, Scale-Space.

[80]  Marc Levoy,et al.  Zippered polygon meshes from range images , 1994, SIGGRAPH.

[81]  Heiko Hirschmüller,et al.  Evaluation of Stereo Matching Costs on Images with Radiometric Differences , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[82]  Vladimir Kolmogorov,et al.  Computing visual correspondence with occlusions using graph cuts , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[83]  Gérard G. Medioni,et al.  Object modelling by registration of multiple range images , 1992, Image Vis. Comput..

[84]  Jan-Michael Frahm,et al.  Detailed Real-Time Urban 3D Reconstruction from Video , 2007, International Journal of Computer Vision.

[85]  Nanning Zheng,et al.  Stereo Matching Using Belief Propagation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[86]  Richard Szeliski,et al.  Sampling the disparity space image , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[87]  Pau Gargallo,et al.  Bayesian 3D modeling from images using multiple depth maps , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[88]  Reinhard Koch,et al.  Automated reconstruction of 3D scenes from sequences of images , 2000 .