Accurate object detection with a discriminative shape model

Abstract Discriminative model over bag-of-visual-words representation significantly improves the accuracy of object detection under clutter. However, it encounters bottleneck because of completely ignoring geometric constraint between features. On the contrary, to detect object accurately explicit shape model heavily relies on geometric information of the object, which as a result lacks of discriminative power. In this paper, we present a discriminative shape model to make use of the advantages of the two models based on the insight that the two models are essentially complementary. Discriminative model provides discriminative power, while shape model encodes geometry. The cost function that we used to distinguish objects considers both the detection maps of the discriminative model and the result of shape matching. In this cost function, we adopt a novel way to deal with multi-scale detection maps. We show that this cost function has very strong discriminative power, which makes learning a discriminative threshold for full object detection possible. For shape model, we also present a scheme for learning a good shape model from noisy images. Experiments on UIUC Car and Weizmann–Shotton horses show state-of-the-art performance of our model.

[1]  T. Poggio,et al.  Hierarchical models of object recognition in cortex , 1999, Nature Neuroscience.

[2]  Nicolai Petkov,et al.  Contour detection based on nonclassical receptive field inhibition , 2003, IEEE Trans. Image Process..

[3]  Gökhan BakIr,et al.  Predicting Structured Data , 2008 .

[4]  Dan Roth,et al.  Learning to detect objects in images via a sparse, part-based representation , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Yu Cao,et al.  Free-shape subwindow search for object localization , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Anand Rangarajan,et al.  A new point matching algorithm for non-rigid registration , 2003, Comput. Vis. Image Underst..

[8]  Christoph H. Lampert,et al.  Beyond sliding windows: Object localization by efficient subwindow search , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Thomas Serre,et al.  Robust Object Recognition with Cortex-Like Mechanisms , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Bernhard Schölkopf,et al.  Learning with kernels , 2001 .

[11]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[12]  Cordelia Schmid,et al.  Bandit Algorithms for Tree Search , 2007, UAI.

[13]  Dennis Gabor,et al.  Theory of communication , 1946 .

[14]  Andrew Blake,et al.  Contour-based learning for object detection , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[15]  J. P. Jones,et al.  An evaluation of the two-dimensional Gabor filter model of simple receptive fields in cat striate cortex. , 1987, Journal of neurophysiology.

[16]  Jitendra Malik,et al.  Learning to detect natural image boundaries using local brightness, color, and texture cues , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Frédéric Jurie,et al.  Groups of Adjacent Contour Segments for Object Detection , 2008, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  David G. Lowe,et al.  University of British Columbia. , 1945, Canadian Medical Association journal.

[19]  Sven J. Dickinson,et al.  Superedge grouping for object localization by combining appearance and shape information , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Jun Wang,et al.  Salient closed boundary extraction with ratio contour , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.