Efficient object detection based on selective attention

In this paper, we make use of biologically inspired selective attention to improve the efficiency and performance of object detection under clutter. At first, we propose a novel bottom-up attention model. We argue that heuristic feature selection based on bottom-up attention can stably select out invariant and discriminative features. With these selected features, performance of object detection can be improved apparently and stably. Then we propose a novel concept of saccade map based on bottom-up attention to simulate the saccade (eye movements) in vision. Sliding within saccade map to detect object can significantly reduce computational complexity and apparently improve performance because of the effective filtering for distracting information. With these ideas, we present a general framework for object detection through integrating bottom-up attention. Through evaluating on UIUC cars and Weizmann-Shotton horses we show state-of-the-art performance of our object detection model. (C) 2013 Elsevier Ltd. All rights reserved.

[1]  Zhiyong Xu,et al.  Bottom-up attention based on C1 features of HMAX model , 2012, Photonics Asia.

[2]  T. Poggio,et al.  Hierarchical models of object recognition in cortex , 1999, Nature Neuroscience.

[3]  Christof Koch,et al.  Feature combination strategies for saliency-based visual attention systems , 2001, J. Electronic Imaging.

[4]  Pietro Perona,et al.  Graph-Based Visual Saliency , 2006, NIPS.

[5]  David G. Lowe,et al.  University of British Columbia. , 1945, Canadian Medical Association journal.

[6]  Masoud Nikravesh,et al.  Feature Extraction - Foundations and Applications , 2006, Feature Extraction.

[7]  T. Poggio,et al.  What and where: A Bayesian inference theory of attention , 2010, Vision Research.

[8]  Dan Roth,et al.  Learning to detect objects in images via a sparse, part-based representation , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Tilke Judd,et al.  Understanding and predicting where people look in images , 2011 .

[10]  Bernhard Schölkopf,et al.  Learning with kernels , 2001 .

[11]  Dirk Walther,et al.  Interactions of visual attention and object recognition : computational modeling, algorithms, and psychophysics. , 2006 .

[12]  Thomas Serre,et al.  Robust Object Recognition with Cortex-Like Mechanisms , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  A. L. I︠A︡rbus Eye Movements and Vision , 1967 .

[14]  Marko Grobelnik,et al.  Feature selection using linear classifier weights: interaction with classification models , 2004, SIGIR '04.

[15]  Masoud Nikravesh,et al.  Feature Extraction: Foundations and Applications (Studies in Fuzziness and Soft Computing) , 2006 .

[16]  Laurent Itti,et al.  Neuromorphic algorithms for computer vision and attention , 2001, SPIE Optics + Photonics.

[17]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[18]  Dennis Gabor,et al.  Theory of communication , 1946 .

[19]  Andrew Blake,et al.  Contour-based learning for object detection , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[20]  Tomaso Poggio,et al.  CNS: a GPU-based framework for simulating cortically-organized networks , 2010 .

[21]  W. James,et al.  The Principles of Psychology. , 1983 .

[22]  J. P. Jones,et al.  An evaluation of the two-dimensional Gabor filter model of simple receptive fields in cat striate cortex. , 1987, Journal of neurophysiology.