Multi-View Video Representation Based on Fast Monte Carlo Surface Reconstruction

This paper provides an alternative solution to the costly representation of multi-view video data, which can be used for both rendering and scene analyses. Initially, a new efficient Monte Carlo discrete surface reconstruction method for foreground objects with static background is presented, which outperforms volumetric techniques and is suitable for GPU environments. Some extensions are also presented, which allow a speeding up of the reconstruction by exploiting multi-resolution and temporal correlations. Then, a fast meshing algorithm is applied, which allows interpolating a continuous surface from the discrete reconstructed points. As shown by the experimental results, the original video frames can be approximated with high accuracy by projecting the reconstructed foreground objects onto the original viewpoints. Furthermore, the reconstructed scene can be easily projected onto any desired virtual viewpoint, thus simplifying the design of free-viewpoint video applications. In our experimental results, we show that our techniques for reconstruction and meshing compare favorably with the state-of-the-art, and we also introduce a rule-of-thumb for effective application of the method with a good quality versus representation cost trade-off.

[1]  Montse Pardàs,et al.  Joint multi-view foreground segmentation and 3D reconstruction with tolerance loop , 2011, 2011 18th IEEE International Conference on Image Processing.

[2]  Hans-Peter Seidel,et al.  Performance capture from sparse multi-view video , 2008, ACM Trans. Graph..

[3]  Aljoscha Smolic,et al.  Image-space Free-viewpoint Video , 2005 .

[4]  W. K. Hastings,et al.  Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[5]  Jean-Yves Bouguet,et al.  Camera calibration toolbox for matlab , 2001 .

[6]  Leonardo Chiariglione Multimedia Standards: Interfaces to Innovation , 2012, Proceedings of the IEEE.

[7]  Montse Pardàs,et al.  Enhanced foreground segmentation and tracking combining Bayesian background, shadow and foreground modeling , 2012, Pattern Recognit. Lett..

[8]  Jean Ponce,et al.  Accurate, Dense, and Robust Multiview Stereopsis , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Kiriakos N. Kutulakos,et al.  Multi-View Scene Capture by Surfel Sampling: From Video Streams to Non-Rigid 3D Motion, Shape and Reflectance , 2002, International Journal of Computer Vision.

[10]  Jean-Yves Guillemaut,et al.  Wide-baseline multi-view video segmentation for 3D reconstruction , 2010, 3DVP '10.

[11]  Slobodan Ilic,et al.  Probabilistic Deformable Surface Tracking from Multiple Videos , 2010, ECCV.

[12]  Dan Svenstrup,et al.  Monte Carlo methods for dynamical systems , 2009 .

[13]  Jun S. Liu,et al.  Sequential Monte Carlo methods for dynamic systems , 1997 .

[14]  Jean-Yves Guillemaut,et al.  Joint Multi-Layer Segmentation and Reconstruction for Free-Viewpoint Video Applications , 2011, International Journal of Computer Vision.

[15]  Baining Guo,et al.  Feature-based light field morphing , 2002, ACM Trans. Graph..

[16]  Alla Sheffer,et al.  Mesh parameterization: theory and practice Video files associated with this course are available from the citation page , 2007, SIGGRAPH Courses.

[17]  Michael M. Kazhdan,et al.  Poisson surface reconstruction , 2006, SGP '06.

[18]  Richard Szeliski,et al.  Towards Internet-scale multi-view stereo , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[19]  Gabriel Taubin,et al.  The ball-pivoting algorithm for surface reconstruction , 1999, IEEE Transactions on Visualization and Computer Graphics.

[20]  Edmond Boyer,et al.  Efficient Polyhedral Modeling from Silhouettes , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Ramesh Raskar,et al.  Image-based visual hulls , 2000, SIGGRAPH.

[22]  Marcus A. Magnor,et al.  Virtual video camera: image-based viewpoint navigation through space and time , 2009, SIGGRAPH '09.

[23]  Hans-Peter Seidel,et al.  Performance capture from sparse multi-view video , 2008, SIGGRAPH 2008.

[24]  Paolo Cignoni,et al.  MeshLab: an Open-Source 3D Mesh Processing System , 2008, ERCIM News.

[25]  Bruno Lévy,et al.  Mesh parameterization: theory and practice , 2007, SIGGRAPH Courses.

[26]  Josep R. Casas,et al.  From silhouettes to 3D points to mesh: towards free viewpoint video , 2010, 3DVP '10.

[27]  Sang Uk Lee,et al.  GPU-friendly multi-view stereo reconstruction using surfel representation and graph cuts , 2011, Comput. Vis. Image Underst..

[28]  Luc Van Gool,et al.  Blue-c: a spatially immersive display and 3D video portal for telepresence , 2003, IPT/EGVE.

[29]  Richard Szeliski,et al.  A Comparison and Evaluation of Multi-View Stereo Reconstruction Algorithms , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[30]  Richard Szeliski,et al.  Modeling the World from Internet Photo Collections , 2008, International Journal of Computer Vision.

[31]  Simon J. Godsill,et al.  On sequential Monte Carlo sampling methods for Bayesian filtering , 2000, Stat. Comput..