Multiscale Discriminant Saliency for Visual Attention

Bottom-up saliency, an early stage of human visual attention, can be treated as a binary classification problem between center and surround classes. The discriminant power of a feature for this classification is measured as the mutual information between the feature and the distribution of the two classes. Because the estimated discrepancy between the two feature classes depends strongly on the scale levels considered, multiscale structure and discriminant power are integrated by employing discrete wavelet features and a hidden Markov tree (HMT). From the wavelet coefficients and the HMT parameters, quad-tree-like label structures are constructed and used for maximum a posteriori (MAP) estimation of the hidden class variables at the corresponding dyadic sub-squares. The saliency value of each dyadic square at each scale level is then computed from the discriminant power principle and the MAP estimate. Finally, the saliency maps from multiple scales are integrated into the final saliency map by an information maximization rule. Standard quantitative measures (NSS, LCC, and AUC) and qualitative assessments are used to evaluate the proposed multiscale discriminant saliency method (MDIS) against the well-known information-based saliency method AIM on its Bruce database with eye-tracking data. Simulation results are presented and analyzed to verify the validity of MDIS and to point out its disadvantages as directions for further research.
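To make the discriminant power measure concrete, the following is a minimal sketch of the mutual information between a scalar feature and a binary center/surround label. It assumes a plain shared-histogram (plug-in) estimator rather than the wavelet-domain HMT model described above, and the function name `discriminant_power` and the `bins` parameter are hypothetical, chosen only for illustration.

```python
import numpy as np

def discriminant_power(center, surround, bins=16):
    """Plug-in estimate of I(X; C) in bits for a scalar feature X and a
    binary class C (row 0 = surround, row 1 = center).

    Illustrative assumption: a shared histogram stands in for the
    class-conditional densities; this is not the HMT-based estimator.
    """
    center = np.asarray(center, dtype=float).ravel()
    surround = np.asarray(surround, dtype=float).ravel()

    # Use common bin edges so both classes are quantized identically.
    edges = np.histogram_bin_edges(np.concatenate([center, surround]), bins=bins)

    # Joint probability table P(C, X): rows are classes, columns are bins.
    joint = np.stack([
        np.histogram(surround, bins=edges)[0],
        np.histogram(center, bins=edges)[0],
    ]).astype(float)
    joint /= joint.sum()

    p_c = joint.sum(axis=1, keepdims=True)   # class prior, shape (2, 1)
    p_x = joint.sum(axis=0, keepdims=True)   # feature marginal, shape (1, bins)

    # Sum only over non-empty cells to avoid log(0).
    nz = joint > 0
    return float(np.sum(joint[nz] * np.log2(joint[nz] / (p_c @ p_x)[nz])))
```

With equally sized classes the estimate lies between 0 bits (the feature does not separate center from surround) and 1 bit (the feature separates them perfectly), so a location would score as more salient when its center features are easier to tell apart from its surround.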
