Improved saliency toolbox/Itti model for region of interest extraction

The saliency toolbox (STB)/Itti model is an outstanding computational selective visual attention model. In this paper, we propose an improved STB/Itti model to overcome the drawback of STB/Itti-its output "saliency map" is not large enough for region of interest (ROI) extraction. First, we employ a simplified pulse coupled neural network (PCNN) with a special input image, and more importantly, the PCNN does not require iterations. Subsequently, the PCNN takes the place of the winner-take-all network in STB/Itti. Experimental results show that the improved STB/Itti model works well for ROI extraction, with the mean area under the curve value of 0.8306 and robustness against noise and geometric attacks. The proposed model can greatly enhance the performances of both STB/Itti and PCNN model in image processing.

[1]  Robert B. Fisher,et al.  Object-based visual attention for computer vision , 2003, Artif. Intell..

[2]  Christof Koch,et al.  Visual attention and target detection in cluttered natural scenes , 2001 .

[3]  F. Hamker The reentry hypothesis: the putative interaction of the frontal eye field, ventrolateral prefrontal cortex, and areas V4, IT for attention and eye movement. , 2005, Cerebral cortex.

[4]  C. Koch,et al.  Models of bottom-up and top-down visual attention , 2000 .

[5]  Xiaokang Yang,et al.  Improved error concealment of region of interest based on the H.264/AVC standard , 2010 .

[6]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[7]  Dirk Walther,et al.  Interactions of visual attention and object recognition : computational modeling, algorithms, and psychophysics. , 2006 .

[8]  C. Koch,et al.  Computational modelling of visual attention , 2001, Nature Reviews Neuroscience.

[9]  Liming Zhang,et al.  Spatio-temporal Saliency detection using phase spectrum of quaternion fourier transform , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Liu Qing,et al.  Automated image segmentation using improved PCNN model based on cross-entropy , 2004, Proceedings of 2004 International Symposium on Intelligent Multimedia, Video and Speech Processing, 2004..

[11]  Hong Huo,et al.  Color conspicuity map based on wavelet low-pass pyramid for popping out contours of salient objects , 2010 .

[12]  Iain D. Gilchrist,et al.  Visual correlates of fixation selection: effects of scale and time , 2005, Vision Research.

[13]  W. Singer,et al.  Oscillatory responses in cat visual cortex exhibit inter-columnar synchronization which reflects global stimulus properties , 1989, Nature.

[14]  James Ze Wang,et al.  SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture LIbraries , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  G. Deco,et al.  A hierarchical neural system with attentional top–down enhancement of the spatial resolution for object recognition , 2000, Vision Research.

[16]  Yide Ma,et al.  Review of pulse-coupled neural networks , 2010, Image Vis. Comput..

[17]  John K. Tsotsos,et al.  Saliency Based on Information Maximization , 2005, NIPS.

[18]  I. Rybak,et al.  A model of attention-guided visual perception and recognition , 1998, Vision Research.

[19]  Pietro Perona,et al.  Graph-Based Visual Saliency , 2006, NIPS.

[20]  John K. Tsotsos,et al.  Modeling Visual Attention via Selective Tuning , 1995, Artif. Intell..

[21]  Thierry Pun,et al.  Attentive mechanisms for dynamic and static scene analysis , 1995 .

[22]  Christof Koch,et al.  Modeling attention to salient proto-objects , 2006, Neural Networks.

[23]  Fred Henrik Hamker,et al.  The emergence of attention by population-based inference and its role in distributed processing and cognitive control of vision , 2005, Comput. Vis. Image Underst..

[24]  Fred Stentiford,et al.  Attention-based similarity , 2007, Pattern Recognit..

[25]  Ronald A. Rensink Seeing, sensing, and scrutinizing , 2000, Vision Research.