A novel method for object localization in digital images

Here we consider generic object localization in digital images where the goal is to find a tight bounding box enclosing the instances of object of interest. Traditional object localization methods treat this problem as building a binary classification that distinguishes between the object class and the background. The trained classifier is usually turned into a detector by sliding it across the image at different scales and classifying each window. In this study we also use the sliding window approach, but as opposed to the traditional methods, we approximate object class by using a convex class model, and each window is assigned to the object class or background based on the distance to this convex model. Our experiments demonstrate that using such models in a cascade for object localization with linear Support Vector Machines significantly improves the real-time efficiency with maintaining high classification accuracies.

[1]  Bill Triggs,et al.  Feature Sets and Dimensionality Reduction for Visual Object Detection , 2010, BMVC.

[2]  PietikainenMatti,et al.  Face Description with Local Binary Patterns , 2006 .

[3]  Chandan Srivastava,et al.  Support Vector Data Description , 2011 .

[4]  David A. McAllester,et al.  A discriminatively trained, multiscale, deformable part model , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Matti Pietikäinen,et al.  Face Description with Local Binary Patterns: Application to Face Recognition , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[7]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[8]  Andrew Zisserman,et al.  Multiple kernels for object detection , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[9]  Cordelia Schmid,et al.  Combining efficient object localization and image classification , 2009, 2009 IEEE 12th International Conference on Computer Vision.