Multi-scale visual attention & saliency modelling with decision theory

Recently, an information-based saliency technique which is biologically plausible and computationally feasible called Discriminant Saliency (DIS) has been proposed. While DIS successfully defines discriminant saliency in the information theoretic sense, its implementation restraints the sampled features to a single fixed-size window and creates a bias towards objects with distinctive features fitted in the window size. This paper proposes a multi-scale discriminant saliency (MDIS) technique for visual attention which uses the wavelet transform for the multi-resolution framework. MDIS utilizes mutual information between classes and feature distribution to quantify classifying discriminant power as saliency value in multiple dyadic-scale structures. The paper will present simulations on Neil Bruce's image database with quantitative and qualitative results showing the advantages of MDIS over DIS. For quantitative comparisons, numerical tests AUC, NSS, LCC are measured and several plots are generated to visualized differences between simulation modes; meanwhile, qualitative evaluation is a visual examination of synthesized saliency maps of general natural scenes.

[1]  Norma Graham Neurophysiology and Psychophysics , 1989 .

[2]  Minh N. Do,et al.  Wavelet-based texture retrieval using generalized Gaussian density and Kullback-Leibler distance , 2002, IEEE Trans. Image Process..

[3]  Nuno Vasconcelos,et al.  Discriminant Interest Points are Stable , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  John K. Tsotsos,et al.  Visual Correlates of Fixation Selection: A Look at the Spatial Frequency Domain , 2007, 2007 IEEE International Conference on Image Processing.

[5]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[6]  Richard Baraniuk,et al.  Multiscale texture segmentation using wavelet-domain hidden Markov models , 1998, Conference Record of Thirty-Second Asilomar Conference on Signals, Systems and Computers (Cat. No.98CH36284).

[7]  Nuno Vasconcelos,et al.  Decision-Theoretic Saliency: Computational Principles, Biological Plausibility, and Implications for Neurophysiology and Psychophysics , 2009, Neural Computation.

[8]  Justin K. Romberg,et al.  Bayesian tree-structured image modeling using wavelet-domain hidden Markov models , 2001, IEEE Trans. Image Process..

[9]  Minh N. Do,et al.  Rotation invariant texture characterization and retrieval using steerable wavelet-domain hidden Markov models , 2002, IEEE Trans. Multim..

[10]  R.G. Baraniuk,et al.  Simplified wavelet-domain hidden Markov models using contexts , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[11]  John K. Tsotsos,et al.  Saliency, attention, and visual search: an information theoretic approach. , 2009, Journal of vision.

[12]  Richard G. Baraniuk,et al.  Multiscale image segmentation using wavelet-domain hidden Markov models , 2001, IEEE Trans. Image Process..

[13]  Kah Phooi Seng,et al.  Visual saliency based on fast nonparametric multidimensional entropy estimation , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[14]  Xiaodong Gu,et al.  An Information Theoretic Model of Spatiotemporal Visual Saliency , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[15]  Ali Borji,et al.  Quantitative Analysis of Human-Model Agreement in Visual Saliency Modeling: A Comparative Study , 2013, IEEE Transactions on Image Processing.

[16]  John K. Tsotsos,et al.  Saliency Based on Information Maximization , 2005, NIPS.

[17]  Edward H. Adelson,et al.  The Laplacian Pyramid as a Compact Image Code , 1983, IEEE Trans. Commun..

[18]  Liqing Zhang,et al.  Saliency Detection: A Spectral Residual Approach , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Pietro Perona,et al.  Graph-Based Visual Saliency , 2006, NIPS.

[20]  Nuno Vasconcelos,et al.  The discriminant center-surround hypothesis for bottom-up saliency , 2007, NIPS.

[21]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .