Combined use of ToF sensors and standard cameras for 3D video acquisition

3D video applications usually require the availability of high resolution depth and color information. Depth information can be acquired at video rate by Time-of-Flight matrix sensors, but these devices usually have a limited resolution and image quality. A common solution to this issue is the combined use of ToF sensors and color cameras. This paper firstly presents a generalized multi-camera calibration technique that aims at calibrating together the ToF sensor with two synchronized cameras exploiting the color information from both type of sensors but also the depth measures of the ToF sensor. In the second part of the work we will present an ad-hoc interpolation technique to obtain high resolution depth information exploiting side information from the color camera together with the ToF measures and a novel surface prediction scheme. Finally we will show how the high resolution depth map obtained with the proposed approach can be used in order to warp the available video streams to arbitrary viewpoints in 3D video applications. Experimental results have shown how the proposed method allows to obtain a more accurate calibration and to improve the quality of the depth data and warped views if compared with standard approaches.

[1]  Pietro Zanuttigh,et al.  Compression of depth information for 3D rendering , 2009, 2009 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video.

[2]  Ruigang Yang,et al.  Spatial-Depth Super Resolution for Range Images , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Xiaoyi Jiang,et al.  Registration of depth and video data in depth image based rendering , 2009, 2009 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video.

[4]  Ruigang Yang,et al.  Fusion of time-of-flight depth and stereo for high accuracy depth maps , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Daniel P. Huttenlocher,et al.  Efficient Graph-Based Image Segmentation , 2004, International Journal of Computer Vision.

[6]  A. Frick,et al.  Generation of 3D-TV LDV-content with Time-Of-Flight Camera , 2009, 2009 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video.

[7]  Alexandru Tupan,et al.  Triangulation , 1997, Comput. Vis. Image Underst..

[8]  Janne Heikkilä,et al.  A four-step camera calibration procedure with implicit image correction , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[9]  Sebastian Thrun,et al.  An Application of Markov Random Fields to Range Sensing , 2005, NIPS.