Saliency detection using tensor sparse reconstruction residual analysis

In this paper, a visual saliency detection model based on tensor sparse reconstruction for images is proposed. This algorithm measures saliency value of image regions by the reconstruction residual and performs better on color images than current sparse models. Current sparse models treat a color image as multiple independent channel images and vectorise the image patches, ignoring interrelationship between color channels and spatial correlation between neighbouring pixels. In contrast, the proposed tensor sparse model treats a color image as a 3D array, retaining the spatial color structures entirely during the sparse coding. The proposed saliency detection method is tested on ASD dataset and OSIE dataset and compared with traditional sparse reconstruction based models. The experimental results show that our model achieves higher AUC scores than traditional sparse reconstruction based models.

[1]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[2]  Guillermo Sapiro,et al.  Sparse Representation for Computer Vision and Pattern Recognition , 2010, Proceedings of the IEEE.

[3]  Andrzej Cichocki,et al.  Block sparse representations of tensors using Kronecker bases , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[4]  Ioannis Rigas,et al.  Low-Level Visual Saliency With Application on Aerial Imagery , 2013, IEEE Geoscience and Remote Sensing Letters.

[5]  Syed Zubair,et al.  Tensor dictionary learning with sparse TUCKER decomposition , 2013, 2013 18th International Conference on Digital Signal Processing (DSP).

[6]  Hao Zhu,et al.  Bottom-up saliency based on weighted sparse coding residual , 2011, ACM Multimedia.

[7]  Yi Yang,et al.  Decomposable Nonlocal Tensor Dictionary Learning for Multispectral Image Denoising , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  C. Koch,et al.  Computational modelling of visual attention , 2001, Nature Reviews Neuroscience.

[9]  No Value,et al.  IEEE International Conference on Image Processing , 2003 .

[10]  Yin Li,et al.  Incremental sparse saliency detection , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[11]  R. Bro PARAFAC. Tutorial and applications , 1997 .

[12]  Sabine Süsstrunk,et al.  Frequency-tuned salient region detection , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Reinhard Klein,et al.  BTF Compression via Sparse Tensor Decomposition , 2009, Comput. Graph. Forum.

[14]  Jason J.S. Barton,et al.  The latencies of prosaccades are prolonged by both executed and planned (but not executed) prior antisaccades , 2014 .

[15]  L. Tucker,et al.  Some mathematical notes on three-mode factor analysis , 1966, Psychometrika.

[16]  Pietro Perona,et al.  Is bottom-up attention useful for object recognition? , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[17]  Shuo Wang,et al.  Predicting human gaze beyond pixels. , 2014, Journal of vision.

[18]  Laurent Itti,et al.  An Integrated Model of Top-Down and Bottom-Up Attention for Optimizing Detection Speed , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[19]  Pietro Perona,et al.  Is bottom-up attention useful for object recognition? , 2004, CVPR 2004.