Probabilistic Combination of Visual Context Based Attention and Object Detection

Visual context provides cues about an object’s presence, position and size within the observed scene, which are used to increase the performance of object detection techniques. However, state-of-theart methods for context aware object detection could decrease the initial performance. We discuss the reasons for failure and propose a concept that overcomes these limitations. Therefore, we introduce the prior probability function of an object detector, that maps the detector’s output to probabilities. Together, with an appropriate contextual weighting a probabilistic framework is established. In addition, we present an extension to state-of-the-art methods to learn scale-dependent visual context information and show how this increases the initial performance. The standard methods and our proposed extensions are compared on a novel demanding image data set.

[1]  A. Torralba,et al.  The role of context in object recognition , 2007, Trends in Cognitive Sciences.

[2]  Antonio Torralba,et al.  LabelMe: A Database and Web-Based Tool for Image Annotation , 2008, International Journal of Computer Vision.

[3]  Alexei A. Efros,et al.  Geometric context from a single image , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[4]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[5]  C. Koch,et al.  Computational modelling of visual attention , 2001, Nature Reviews Neuroscience.

[6]  Stanley M. Bileschi,et al.  Street Scenes: towards scene understanding in still images , 2006 .

[7]  Ales Leonardis,et al.  Context Driven Focus of Attention for Object Detection , 2008, WAPCV.

[8]  Alexei A. Efros,et al.  Putting Objects in Perspective , 2006, CVPR.

[9]  Jan-Olof Eklundh,et al.  An Attentional System Combining Top-Down and Bottom-Up Influences , 2008, WAPCV.

[10]  tephen E. Palmer The effects of contextual scenes on the identification of objects , 1975, Memory & cognition.

[11]  Antonio Torralba,et al.  Contextual Modulation of Target Saliency , 2001, NIPS.

[12]  Lior Wolf,et al.  A Critical View of Context , 2006, International Journal of Computer Vision.

[13]  Antonio Torralba,et al.  Contextual guidance of eye movements and attention in real-world scenes: the role of global features in object search. , 2006, Psychological review.

[14]  M. Bar Visual objects in context , 2004, Nature Reviews Neuroscience.

[15]  Bernt Schiele,et al.  Multi-Aspect Detection of Articulated Objects , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[16]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[17]  Antonio Torralba,et al.  Statistical Context Priming for Object Detection , 2001, ICCV.

[18]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[19]  Antonio Torralba,et al.  Contextual Priming for Object Detection , 2003, International Journal of Computer Vision.

[20]  Jitendra Malik,et al.  Blobworld: Image Segmentation Using Expectation-Maximization and Its Application to Image Querying , 2002, IEEE Trans. Pattern Anal. Mach. Intell..