Object-Based Video Coding by Visual Saliency and Temporal Correlation

When a disaster occurs, video communication is an effective way to disseminate large quantities of important information. However, video coding standards such as High Efficiency Video Coding (HEVC) compress entire videos, whatever the contents are; at low bit rates, the quality of significant objects deteriorates. In this paper, an object-based video coding method is proposed to address this problem. The proposed method extracts objects on the basis of visual saliency and temporal correlation between frames. Subsequently, we execute pre-processing which degrades the background quality before encoding the video with HEVC. This method can reduce the bit rate while preserving target object quality. Experimental comparison with HEVC demonstrates the superior performance of the proposed method.

[1]  Wilson S. Geisler,et al.  Implementation of a foveated image coding system for image bandwidth reduction , 1996, Electronic Imaging.

[2]  Touradj Ebrahimi,et al.  Semantic video analysis for adaptive content delivery and automatic description , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  Edward J. Delp,et al.  Video coding using motion classification , 2008, 2008 15th IEEE International Conference on Image Processing.

[4]  Lihi Zelnik-Manor,et al.  Context-aware saliency detection , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[5]  Shengxi Li,et al.  Region-of-Interest Based Conversational HEVC Coding with Hierarchical Perception Model of Face , 2014, IEEE Journal of Selected Topics in Signal Processing.

[6]  Feng Wu,et al.  Object-based coding for Kinect depth and color videos , 2012, 2012 Visual Communications and Image Processing.

[7]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[8]  Paolo Napoletano,et al.  Bayesian Integration of Face and Low-Level Cues for Foveated Video Coding , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[9]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[10]  Anamitra Makur,et al.  Object-based Surveillance Video Compression using Foreground Motion Compensation , 2006, 2006 9th International Conference on Control, Automation, Robotics and Vision.

[11]  Gunnar Farnebäck,et al.  Two-Frame Motion Estimation Based on Polynomial Expansion , 2003, SCIA.

[12]  Shinichiro Omachi,et al.  Object-based Video Coding for Arbitrary Shape by Visual Saliency and Temporal Correlation , 2016 .

[13]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[14]  Yang Li,et al.  Hybrid model-and-object-based real-time conversational video coding , 2015, Signal Process. Image Commun..

[15]  Thomas Wiegand,et al.  A Content-Based Video Coding Approach for Rigid and Non-Rigid Textures , 2006, 2006 International Conference on Image Processing.

[16]  Ivan V. Bajic,et al.  Saliency-Aware Video Compression , 2014, IEEE Transactions on Image Processing.

[17]  Harry Shum,et al.  Object-Based Coding for Plenoptic Videos , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[18]  Andrew Blake,et al.  "GrabCut" , 2004, ACM Trans. Graph..

[19]  Yu-Chiang Frank Wang,et al.  Exploring Visual and Motion Saliency for Automatic Video Object Extraction , 2013, IEEE Transactions on Image Processing.