Remote Dynamic Three-Dimensional Scene Reconstruction

Remote dynamic three-dimensional (3D) scene reconstruction renders the motion structure of a 3D scene remotely by means of both the color video and the corresponding depth maps. It has shown a great potential for telepresence applications like remote monitoring and remote medical imaging. Under this circumstance, video-rate and high resolution are two crucial characteristics for building a good depth map, which however mutually contradict during the depth sensor capturing. Therefore, recent works prefer to only transmit the high-resolution color video to the terminal side, and subsequently the scene depth is reconstructed by estimating the motion vectors from the video, typically using the propagation based methods towards a video-rate depth reconstruction. However, in most of the remote transmission systems, only the compressed color video stream is available. As a result, color video restored from the streams has quality losses, and thus the extracted motion vectors are inaccurate for depth reconstruction. In this paper, we propose a precise and robust scheme for dynamic 3D scene reconstruction by using the compressed color video stream and their inaccurate motion vectors. Our method rectifies the inaccurate motion vectors by analyzing and compensating their quality losses, motion vector absence in spatial prediction, and dislocation in near-boundary region. This rectification ensures the depth maps can be compensated in both video-rate and high resolution at the terminal side towards reducing the system consumption on both the compression and transmission. Our experiments validate that the proposed scheme is robust for depth map and dynamic scene reconstruction on long propagation distance, even with high compression ratio, outperforming the benchmark approaches with at least 3.3950 dB quality gains for remote applications.

[1]  Christoph Fehn,et al.  Depth-image-based rendering (DIBR), compression, and transmission for a new approach on 3D-TV , 2004, IS&T/SPIE Electronic Imaging.

[2]  Reinhard Koch,et al.  ToF-sensors: New dimensions for realism and interactivity , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[3]  Yue Gao,et al.  Dynamic 3D Scene Depth Reconstruction via Optical Flow Field Rectification , 2012, PloS one.

[4]  Aljoscha Smolic,et al.  View Synthesis for Advanced 3D Video Systems , 2008, EURASIP J. Image Video Process..

[5]  T. Fortoul,et al.  Visualization and 3D Reconstruction of Flame Cells of Taenia solium (Cestoda) , 2011, PloS one.

[6]  Boris Rubinsky,et al.  Distributed Network, Wireless and Cloud Computing Enabled 3-D Ultrasound; a New Medical Technology Paradigm , 2009, PloS one.

[7]  Yue Gao,et al.  3D model comparison using spatial structure circular descriptor , 2010, Pattern Recognit..

[8]  B. Barenbrug,et al.  Improved depth propagation for 2D to 3D video conversion using key-frames , 2007 .

[9]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[10]  Yo-Sung Ho,et al.  Dynamic 3D human actor generation method using a time-of-flight depth camera , 2008, IEEE Transactions on Consumer Electronics.

[11]  Boris Rubinsky,et al.  A New Concept for Medical Imaging Centered on Cellular Phone Technology , 2008, PloS one.

[12]  Yao Wang,et al.  Video Processing and Communications , 2001 .

[13]  Baining Guo,et al.  Kinect Identity: Technology and Experience , 2011, Computer.

[14]  C. Worringham,et al.  Development and Feasibility of a Smartphone, ECG and GPS Based System for Remotely Monitoring Exercise in Cardiac Rehabilitation , 2011, PloS one.

[15]  Qionghai Dai,et al.  Depth map generation for 2D-to-3D conversion by limited user inputs and depth propagation , 2011, 2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON).

[16]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[17]  Jean-Philippe Thiran,et al.  JULIDE: A Software Tool for 3D Reconstruction and Statistical Analysis of Autoradiographic Mouse Brain Sections , 2010, PloS one.

[18]  Yue Gao,et al.  Camera Constraint-Free View-Based 3-D Object Retrieval , 2012, IEEE Transactions on Image Processing.

[19]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[20]  Julie M. Harris,et al.  Learning to Use Illumination Gradients as an Unambiguous Cue to Three Dimensional Shape , 2012, PloS one.

[21]  Rolf Rossaint,et al.  Feasibility of Prehospital Teleconsultation in Acute Stroke – A Pilot Study in Clinical Routine , 2012, PloS one.

[22]  Dieter Fox,et al.  RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments , 2010, ISER.

[23]  Michiteru Kitazaki,et al.  Effect of Pictorial Depth Cues, Binocular Disparity Cues and Motion Parallax Depth Cues on Lightness Perception in Three-Dimensional Virtual Scenes , 2008, PloS one.

[24]  Yue Gao,et al.  3-D Object Retrieval and Recognition With Hypergraph Analysis , 2012, IEEE Transactions on Image Processing.

[25]  Michael T. Orchard,et al.  Overlapped block motion compensation: an estimation-theoretic approach , 1994, IEEE Trans. Image Process..

[26]  Gilbae Park,et al.  Reconstruction of three-dimensional occluded object using optical flow and triangular mesh reconstruction in integral imaging. , 2010, Optics express.

[27]  Masayuki Tanimoto,et al.  FTV: Free-viewpoint Television , 2006, Signal Process. Image Commun..

[28]  Thomas Wiegand,et al.  3D Video and Free Viewpoint Video - Technologies, Applications and MPEG Standards , 2006, 2006 IEEE International Conference on Multimedia and Expo.