Information-Based Scale Saliency Methods with Wavelet Sub-band Energy Density Descriptors

Pixel-based scale saliency (PSS) is based on information-theoretic estimation of data content and structure in multiscale analysis; its theoretical aspects and practical implementation are discussed by Kadir et al. [11]. The Scale Saliency framework [10] is not restricted to pixels; it also applies to other basis-projected descriptors. Although wavelet atoms, which are localized in both the time and frequency domains, are a possible alternative descriptor, no theoretical analysis or practical solution has yet been proposed. Our contribution is a mathematical model for using wavelet-based descriptors in a corresponding Wavelet-based Scale Saliency (WSS) method. WSS takes the wavelet sub-band energy density of two popular transforms, the discrete wavelet transform (DWT) and the dual-tree complex wavelet transform (DTCWT), as its basis descriptors for saliency map estimation, in place of pixel values. We then compare WSS quantitatively, using receiver operating characteristic (ROC) curves, area under the curve (AUC), and normalized scanpath saliency (NSS), against PSS and the state-of-the-art saliency methods ITT [9], SUN [18], and SRS [8] on N. Bruce's database [4], with human eye-tracking data as ground truth. Finally, we analyze the qualitative results, i.e. the different saliency maps, case by case for their strengths and weaknesses, paying particular attention to shortcomings in specific situations and to results that do not make sense to human perception.
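To make the notion of a sub-band energy density descriptor concrete, the following is a minimal sketch and not the WSS model itself. It assumes a single-level orthonormal Haar DWT (the simplest DWT), defines the energy density of a sub-band as its squared coefficients normalized to sum to one, and scores that density with Shannon entropy, the information measure underlying scale saliency. The function names `haar_dwt2`, `energy_density`, and `shannon_entropy` are hypothetical and chosen for illustration only.

```python
import math


def haar_dwt2(image):
    """One level of the orthonormal 2-D Haar DWT (illustrative assumption).

    `image` is a list of rows with even height and width.
    Returns a dict with the four sub-bands LL, LH, HL, HH.
    """
    bands = {"LL": [], "LH": [], "HL": [], "HH": []}
    for r in range(0, len(image), 2):
        out = {name: [] for name in bands}
        for c in range(0, len(image[0]), 2):
            a, b = image[r][c], image[r][c + 1]
            d, e = image[r + 1][c], image[r + 1][c + 1]
            out["LL"].append((a + b + d + e) / 2.0)  # local average
            out["LH"].append((a + b - d - e) / 2.0)  # change across rows
            out["HL"].append((a - b + d - e) / 2.0)  # change across columns
            out["HH"].append((a - b - d + e) / 2.0)  # diagonal change
        for name in bands:
            bands[name].append(out[name])
    return bands


def energy_density(band):
    """Squared coefficients normalized to sum to 1 (assumed definition).

    A sub-band with zero energy yields an all-zero list.
    """
    energy = [v * v for row in band for v in row]
    total = sum(energy)
    if total == 0:
        return [0.0] * len(energy)
    return [e / total for e in energy]


def shannon_entropy(density):
    """Shannon entropy in bits, skipping zero-probability entries."""
    return -sum(p * math.log2(p) for p in density if p > 0)


# Toy 4x4 image with alternating columns: all detail energy falls in HL.
image = [[0, 1, 0, 1] for _ in range(4)]
bands = haar_dwt2(image)
for name, band in bands.items():
    print(name, shannon_entropy(energy_density(band)))
```

In this sketch the entropy is computed over a whole sub-band; a scale saliency method would instead evaluate it in local neighborhoods and across scales, which is omitted here for brevity.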

[1] Ali Borji, et al. Quantitative Analysis of Human-Model Agreement in Visual Saliency Modeling: A Comparative Study, 2013, IEEE Transactions on Image Processing.

[2] Christof Koch, et al. A Model of Saliency-Based Visual Attention for Rapid Scene Analysis, 2009.

[3] Nuno Vasconcelos, et al. The discriminant center-surround hypothesis for bottom-up saliency, 2007, NIPS.

[4] Liqing Zhang, et al. Saliency Detection: A Spectral Residual Approach, 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[5] Laurie M. Wilcox, et al. The role of meaning in visual search, 2010.

[6] John K. Tsotsos, et al. Saliency Based on Information Maximization, 2005, NIPS.

[7] Nuno Vasconcelos, et al. On the plausibility of the discriminant center-surround hypothesis for visual saliency, 2008, Journal of Vision.

[8] Xiaodong Gu, et al. An Information Theoretic Model of Spatiotemporal Visual Saliency, 2007, 2007 IEEE International Conference on Multimedia and Expo.

[9] Pablo Suau, et al. A New Feasible Approach to Multi-dimensional Scale Saliency, 2009, ACIVS.

[10] Michael Brady, et al. Saliency, Scale and Image Description, 2001, International Journal of Computer Vision.

[11] John K. Tsotsos, et al. Saliency, attention, and visual search: an information theoretic approach, 2009, Journal of Vision.

[12] Richard G. Baraniuk, et al. Coherent Multiscale Image Processing Using Dual-Tree Quaternion Wavelets, 2008, IEEE Transactions on Image Processing.

[13] Tim K. Marks, et al. SUN: A Bayesian framework for saliency using natural statistics, 2008, Journal of Vision.

[14] Dan Stowell, et al. Fast Multidimensional Entropy Estimation by $k$-d Partitioning, 2009, IEEE Signal Processing Letters.

[15] Eli Brenner, et al. Reliable Identification by Color under Natural Conditions the Locations Baseline Measurement, 2022.

[16] Fionn Murtagh, et al. Multiscale entropy filtering, 1999, Signal Process.

[17] Richard Baraniuk, et al. The Dual-tree Complex Wavelet Transform, 2007.

[18] Pierre Baldi, et al. Of bits and wows: A Bayesian theory of surprise with applications to attention, 2010, Neural Networks.

[19] Kah Phooi Seng, et al. Visual saliency based on fast nonparametric multidimensional entropy estimation, 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).