Depth-Spatio-Temporal Joint Region-of-Interest Extraction and Tracking for 3D Video

Three-dimensional video (3DV) consists of multi-view video and multi-view depth video. It provides three-dimensional perception, which draws viewers' attention to regions of depth contrast and to pop-out regions. 3DV also exhibits high temporal and inter-view correlation. In this paper, we define a novel depth-perceptual region of interest (ROI) for 3DV and propose two joint extraction schemes according to the correlation types of 3DV. We then propose a depth-based ROI extraction method that jointly uses depth, motion, and texture information. Furthermore, we present a novel inter-view tracking method for 3DV, in which the inter-view correlation and the ROIs already extracted from neighboring views are used to facilitate ROI extraction in the remaining views. Experimental results show that the proposed ROI extraction and tracking algorithms maintain high extraction accuracy and low complexity.
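To make the idea of jointly using depth, motion, and texture cues more concrete, the following is a minimal illustrative sketch, not the algorithm proposed in the paper. Everything in it is an assumption made for illustration: the function names `roi_mask` and `shift_mask`, the thresholds, the convention that larger depth values mean closer to the camera, frame differencing as the motion cue, a simple forward difference as the texture cue, and a single global horizontal disparity for carrying a mask to a neighboring view.

```python
def roi_mask(depth, prev_frame, cur_frame,
             depth_thresh=128, motion_thresh=10, texture_thresh=10):
    """Toy ROI mask (hypothetical rule, not the paper's method).

    A pixel is marked as ROI when it is near the camera (assumed: large
    depth value) and shows either motion or texture.
    Inputs are equally sized 2D lists of integer sample values.
    """
    h, w = len(depth), len(depth[0])
    mask = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # Depth cue: assumed convention, larger value = closer.
            near = depth[y][x] >= depth_thresh
            # Motion cue: absolute temporal difference between frames.
            moving = abs(cur_frame[y][x] - prev_frame[y][x]) >= motion_thresh
            # Texture cue: forward differences, clamped at the border.
            gx = abs(cur_frame[y][min(x + 1, w - 1)] - cur_frame[y][x])
            gy = abs(cur_frame[min(y + 1, h - 1)][x] - cur_frame[y][x])
            textured = (gx + gy) >= texture_thresh
            mask[y][x] = 1 if near and (moving or textured) else 0
    return mask


def shift_mask(mask, disparity):
    """Carry an ROI mask to a neighboring view.

    Assumes one global horizontal disparity in pixels, which is a strong
    simplification; pixels shifted outside the image are dropped.
    """
    h, w = len(mask), len(mask[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            nx = x + disparity
            if mask[y][x] and 0 <= nx < w:
                out[y][nx] = 1
    return out


if __name__ == "__main__":
    depth = [[200, 200, 50], [200, 50, 50]]
    prev = [[0, 0, 0], [0, 0, 0]]
    cur = [[100, 0, 0], [0, 0, 0]]
    m = roi_mask(depth, prev, cur)
    print(m)                 # mask in the reference view
    print(shift_mask(m, 1))  # initial mask guess for a neighboring view
```

In this sketch the shifted mask only serves as a starting estimate for the neighboring view; a realistic system would use per-pixel disparity derived from the depth video and refine the result in the target view.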
