A Least Squares Approach to Region Selection

Region selection is able to boost the recognition performance for images with background clutter by discovering the object regions. In this paper, we propose a region selection method under the least squares framework. With the assumption that an object is a combination of several over-segmented regions, we impose a selection variable on each region, and employ a linear model to perform classification. The model parameter and the selection parameter are alternatively updated to minimize a sum-of-squares error function. During the iteration, the selection parameter can automatically pick the discriminant regions accounting for the object category, then fine tunes the linear model with the objects, independently of the background. As a result, the learnt model is able to distinguish object regions and non-object regions, which actually generates irregular-shape object localization. Our method performs significantly better than the baselines on two datasets, and the performance can be further improved when combining deep CNN features. Moreover, the algorithm is easy to implement and computationally efficient because of the merits inherited from the least squares.

[1]  Chenliang Xu,et al.  Streaming Hierarchical Video Segmentation , 2012, ECCV.

[2]  Paul A. Viola,et al.  Multiple Instance Boosting for Object Detection , 2005, NIPS.

[3]  Fei-Fei Li,et al.  Object-Centric Spatial Pooling for Image Classification , 2012, ECCV.

[4]  Carsten Rother,et al.  Weakly supervised discriminative localization and classification: a joint learning process , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[5]  Thomas Deselaers,et al.  Localizing Objects While Learning Their Appearance , 2010, ECCV.

[6]  Ming-Hsuan Yang,et al.  Hierarchical Convolutional Features for Visual Tracking , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[7]  Véronique Prinet,et al.  Towards Optimal Naive Bayes Nearest Neighbor , 2010, ECCV.

[8]  Camille Couprie,et al.  Learning Hierarchical Features for Scene Labeling , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Ivor W. Tsang,et al.  A Convex Method for Locating Regions of Interest with Multi-instance Learning , 2009, ECML/PKDD.

[10]  Jian Dong,et al.  Contextualizing Object Detection and Classification , 2015, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Kristen Grauman,et al.  Efficient region search for object detection , 2011, CVPR 2011.

[12]  Jitendra Malik,et al.  Hypercolumns for object segmentation and fine-grained localization , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Mohammad H. Poursaeidi,et al.  Robust support vector machines for multiple instance learning , 2012, Annals of Operations Research.

[14]  Charless C. Fowlkes,et al.  Contour Detection and Hierarchical Image Segmentation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Christoph H. Lampert,et al.  Beyond sliding windows: Object localization by efficient subwindow search , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Fei-Fei Li,et al.  Discriminative Segment Annotation in Weakly Labeled Video , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[18]  Deyu Meng,et al.  Instance Annotation via Optimal BoW for Weakly Supervised Object Localization , 2017, IEEE Transactions on Cybernetics.

[19]  Cordelia Schmid,et al.  Combining efficient object localization and image classification , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[20]  Svetlana Lazebnik,et al.  Scene recognition and weakly supervised object localization with deformable part-based models , 2011, 2011 International Conference on Computer Vision.

[21]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[22]  C. Schmid,et al.  Region-Based Image Classification with a Latent SVM Model , 2011 .

[23]  Ji Zhao,et al.  Feature and Region Selection for Visual Learning , 2014, IEEE Transactions on Image Processing.

[24]  Andrew Zisserman,et al.  Return of the Devil in the Details: Delving Deep into Convolutional Nets , 2014, BMVC.

[25]  Jiajun Wu,et al.  Deep multiple instance learning for image classification and auto-annotation , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Jitendra Malik,et al.  Simultaneous Detection and Segmentation , 2014, ECCV.