Compressed domain video saliency detection using global and local spatiotemporal features

We propose a compressed domain video saliency detection algorithm.The proposed algorithm extracts features from the partially decoded data.The proposed algorithm performs the detection in real-time with good performance. A compressed domain video saliency detection algorithm, which employs global and local spatiotemporal (GLST) features, is proposed in this work. We first conduct partial decoding of a compressed video bitstream to obtain motion vectors and DCT coefficients, from which GLST features are extracted. More specifically, we extract the spatial features of rarity, compactness, and center prior from DC coefficients by investigating the global color distribution in a frame. We also extract the spatial feature of texture contrast from AC coefficients to identify regions, whose local textures are distinct from those of neighboring regions. Moreover, we use the temporal features of motion intensity and motion contrast to detect visually important motions. Then, we generate spatial and temporal saliency maps, respectively, by linearly combining the spatial features and the temporal features. Finally, we fuse the two saliency maps into a spatiotemporal saliency map adaptively by comparing the robustness of the spatial features with that of the temporal features. Experimental results demonstrate that the proposed algorithm provides excellent saliency detection performance, while requiring low complexity and thus performing the detection in real-time.

[1]  Pietro Perona,et al.  Is bottom-up attention useful for object recognition? , 2004, CVPR 2004.

[2]  Peyman Milanfar,et al.  Static and space-time visual saliency detection by self-resemblance. , 2009, Journal of vision.

[3]  C. Lawson,et al.  Solving least squares problems , 1976, Classics in applied mathematics.

[4]  Chang-Su Kim,et al.  Spatiotemporal Saliency Detection for Video Sequences Based on Random Walk With Restart , 2015, IEEE Transactions on Image Processing.

[5]  Chang-Su Kim,et al.  Video saliency detection based on spatiotemporal feature learning , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[6]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[7]  O. Sukmarg,et al.  Fast object detection and segmentation in MPEG compressed domain , 2000, 2000 TENCON Proceedings. Intelligent Systems and Technologies for the New Millennium (Cat. No.00CH37119).

[8]  Weisi Lin,et al.  Objective Quality Assessment for Image Retargeting Based on Perceptual Geometric Distortion and Information Loss , 2014, IEEE Journal of Selected Topics in Signal Processing.

[9]  Weisi Lin,et al.  A Video Saliency Detection Model in Compressed Domain , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[10]  Yin Li,et al.  Visual Saliency Based on Conditional Entropy , 2009, ACCV.

[11]  Yael Pritch,et al.  Saliency filters: Contrast based filtering for salient region detection , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Yin Li,et al.  Incremental sparse saliency detection , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[13]  Chia-Wen Lin,et al.  Fast algorithms for DCT-domain video transcoding , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[14]  Frédo Durand,et al.  Learning to predict where humans look , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[15]  Roberto Manduchi,et al.  Bilateral filtering for gray and color images , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[16]  Huchuan Lu,et al.  Saliency Detection via Graph-Based Manifold Ranking , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Chang-Su Kim,et al.  Adaptive image and video retargeting technique based on Fourier analysis , 2009, CVPR.

[18]  Nanning Zheng,et al.  Learning to Detect a Salient Object , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Liming Zhang,et al.  A Novel Multiresolution Spatiotemporal Saliency Detection Model and Its Applications in Image and Video Compression , 2010, IEEE Transactions on Image Processing.

[20]  Tim K Marks,et al.  SUN: A Bayesian framework for saliency using natural statistics. , 2008, Journal of vision.

[21]  Faouzi Kossentini,et al.  H.264/AVC baseline profile decoder complexity analysis , 2003, IEEE Trans. Circuits Syst. Video Technol..

[22]  Shen Yan High Efficiency Video Coding , 2013 .

[23]  King Ngi Ngan,et al.  Unsupervised extraction of visual attention objects in color images , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[24]  Rainer Stiefelhagen,et al.  Quaternion-Based Spectral Saliency Detection for Eye Fixation Prediction , 2012, ECCV.

[25]  Neri Merhav,et al.  Fast algorithms for DCT-domain image downsampling and for inverse motion compensation , 1997, IEEE Trans. Circuits Syst. Video Technol..

[26]  Shi-Min Hu,et al.  Global contrast based salient region detection , 2011, CVPR 2011.

[27]  C.-C. Jay Kuo,et al.  Compressed-Domain Video Retargeting , 2014, IEEE Transactions on Image Processing.

[28]  Kebin Jia,et al.  Low-Complexity Saliency Detection Algorithm for Fast Perceptual Video Coding , 2013, TheScientificWorldJournal.

[29]  Daniel Cohen-Or,et al.  Non-homogeneous Content-driven Video-retargeting , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[30]  Liqing Zhang,et al.  Saliency Detection: A Spectral Residual Approach , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[31]  Peter König,et al.  Influence of disparity on fixation and saccades in free viewing of natural scenes. , 2009, Journal of vision.

[32]  Akisato Kimura,et al.  Fully Automatic Extraction of Salient Objects from Videos in Near Real Time , 2010, Comput. J..

[33]  Chen Chen,et al.  Transform-domain intra prediction for H.264 , 2005, 2005 IEEE International Symposium on Circuits and Systems.

[34]  Jian Sun,et al.  Saliency Optimization from Robust Background Detection , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[35]  L. Zhaoping Attention capture by eye of origin singletons even without awareness--a hallmark of a bottom-up saliency map in the primary visual cortex. , 2008, Journal of vision.

[36]  S. Süsstrunk,et al.  Frequency-tuned salient region detection , 2009, CVPR 2009.

[37]  Chang-Su Kim,et al.  Multiscale Saliency Detection Using Random Walk With Restart , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[38]  Ivan V. Bajic,et al.  Compressed-Domain Correlates of Fixations in Video , 2014, PIVP '14.

[39]  Mubarak Shah,et al.  Visual attention detection in video sequences using spatiotemporal cues , 2006, MM '06.

[40]  Weisi Lin,et al.  Saliency Detection in the Compressed Domain for Adaptive Image Retargeting , 2012, IEEE Transactions on Image Processing.

[41]  Sheng-Wen Shih,et al.  Dynamic visual saliency modeling based on spatiotemporal analysis , 2008, 2008 IEEE International Conference on Multimedia and Expo.

[42]  Pietro Perona,et al.  Graph-Based Visual Saliency , 2006, NIPS.