Object Detection by Admissible Region Search

Efficient Subwindow Search(ESS) is an effective method for object detection and localization, which adopts a scheme of branch-and-bound to find the global optimum of a quality function from all the possible subimages. Since the number of possible subimage is $\emph{O}(\emph{n}^{4})$ for an images with $\emph{n}\times\emph{n}$ resolution, the time complexity of ESS ranges from $\emph{O}(\emph{n}^{2})$ to $\emph{O}(\emph{n}^{4})$ . In other words, ESS is equivalent to the exhaustive search in the worst case. In this paper, we propose a new method named Adimissible Region Search(ARS) for detecting and localizing the object with arbitrary shape in an image. Compared with the sliding window methods using ESS, ARS has two advantages: firstly, the time complexity is quadratic and stable so that it is more suitable to process large resolution images; secondly, the admissible region is adaptable to match the real shape of the target object and thus more suitable to represent the object. The experimental results on PASCAL VOC 2006 demonstrate that the proposed method is much faster than the ESS method on average.

[1]  Antti Oulasvirta,et al.  Computer Vision – ECCV 2006 , 2006, Lecture Notes in Computer Science.

[2]  Alok Aggarwal,et al.  Geometric applications of a matrix-searching algorithm , 1987, SCG '86.

[3]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[4]  Stefano Soatto,et al.  Localizing Objects with Smart Dictionaries , 2008, ECCV.

[5]  Frédéric Jurie,et al.  Groups of Adjacent Contour Segments for Object Detection , 2008, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[7]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[8]  Christoph H. Lampert,et al.  Efficient Subwindow Search: A Branch and Bound Framework for Object Localization , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Andrew Zisserman,et al.  A Boundary-Fragment-Model for Object Detection , 2006, ECCV.

[10]  Christoph H. Lampert,et al.  Beyond sliding windows: Object localization by efficient subwindow search , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Svetha Venkatesh,et al.  Efficient algorithms for subwindow search in object detection and localization , 2009, CVPR 2009.

[12]  Bernt Schiele,et al.  Decomposition, discovery and detection of visual categories using topic models , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Tetsuo Asano,et al.  Polynomial-time solutions to image segmentation , 1996, SODA '96.

[14]  Andrew J. Davison,et al.  Active Matching , 2008, ECCV.

[15]  Andrew Zisserman,et al.  An Exemplar Model for Learning Object Classes , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Takeo Kanade,et al.  Human Face Detection in Visual Scenes , 1995, NIPS.

[17]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[18]  Yasuhiko Morimoto,et al.  Data Mining with optimized two-dimensional association rules , 2001, TODS.

[19]  Andrew Zisserman,et al.  Representing shape with a spatial pyramid kernel , 2007, CIVR '07.