Integration of multiple view plus depth data for free viewpoint 3D display

This paper proposes a method for constructing a reasonably scaled, end-to-end free-viewpoint video system that captures multiple view plus depth data, reconstructs three-dimensional polygon models of objects, and displays them in a virtual 3D CG space. The system consists of a desktop PC and four Kinect sensors. First, multiple view plus depth data are captured simultaneously at four viewpoints by the Kinect sensors. The captured data are then integrated into point cloud data using the camera parameters. The resulting point cloud is sampled into volume data consisting of voxels. Because volume data generated from point cloud data are sparse, they are made dense with a global optimization algorithm. The final step reconstructs surfaces from the dense volume data with the discrete marching cubes method. Since the accuracy of the depth maps affects the quality of the 3D polygon model, a simple inpainting method for improving the depth maps is also presented.
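The integration and sampling steps described above can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a pinhole camera model with intrinsic matrix K, extrinsics following the convention x_cam = R x_world + t, depth values in metres with 0 marking missing measurements, and a simple binary occupancy grid. The function names (`backproject`, `voxelize`, `integrate`) and all parameters are hypothetical. The densification by global optimization and the discrete marching cubes step are not shown.

```python
import numpy as np

def backproject(depth, K, R, t):
    """Back-project one depth map into world-space points.

    Assumptions (not from the paper): pinhole intrinsics K (3x3),
    extrinsics with x_cam = R @ x_world + t, depth 0 = no measurement.
    """
    h, w = depth.shape
    v, u = np.mgrid[0:h, 0:w]          # pixel row (v) and column (u) indices
    valid = depth > 0                  # skip pixels without a depth value
    z = depth[valid]
    x = (u[valid] - K[0, 2]) * z / K[0, 0]
    y = (v[valid] - K[1, 2]) * z / K[1, 1]
    cam = np.stack([x, y, z], axis=1)  # (N, 3) points in camera coordinates
    # Invert the extrinsics: x_world = R^T (x_cam - t), written for row vectors.
    return (cam - t) @ R

def voxelize(points, origin, voxel_size, dims):
    """Sample a point cloud into a binary occupancy volume of shape dims."""
    idx = np.floor((points - origin) / voxel_size).astype(int)
    inside = np.all((idx >= 0) & (idx < np.array(dims)), axis=1)
    grid = np.zeros(dims, dtype=bool)
    i = idx[inside]
    grid[i[:, 0], i[:, 1], i[:, 2]] = True  # mark voxels that contain a point
    return grid

def integrate(views, origin, voxel_size, dims):
    """Merge several (depth, K, R, t) views into one sparse volume."""
    pts = np.concatenate([backproject(d, K, R, t) for d, K, R, t in views], axis=0)
    return voxelize(pts, origin, voxel_size, dims)
```

With four sensors, `views` would hold four `(depth, K, R, t)` tuples. The resulting volume is sparse because only voxels that happen to contain a measured point are marked, which is why a densification step is needed before surface extraction.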
