Discriminative Features for Image Classification and Retrieval

In this paper, we present a new method to improve the performance of current bag-of-word based image classification process. After feature extraction, we introduce a pair wise image matching scheme to select the discriminative features. Only the label information from the raining-sets is used to update the feature weights via an iterative matching processing. The selected features correspond to the foreground content of the images thus highlight the high level category knowledge of images. "Visual words" are constructed on these selected features. Our method can be used as a refinement step for current image classification and retrieval process. We prove the efficiency of our methods in three tasks: supervised image classification, semi-supervised image classification and image retrieval. In the experiment part, we use two canonical datasets Caltech 256 and MSRC-v2 to validate our method. The results show that the share of discriminative features increases significantly to 87% by applying our selection. In four group of contract tests, supervised classification accuracies are shown to improve up to 21% while semi supervised classification accuracies are 15%. Image retrieval precision is also remarkable enhanced by our method.

[1]  O. H. Brownlee,et al.  ACTIVITY ANALYSIS OF PRODUCTION AND ALLOCATION , 1952 .

[2]  David G. Stork,et al.  Pattern Classification , 1973 .

[3]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[4]  Cordelia Schmid,et al.  Selection of scale-invariant parts for object class recognition , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[5]  B. Schiele,et al.  Combined Object Categorization and Segmentation With an Implicit Shape Model , 2004 .

[6]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[7]  R. Sukthankar,et al.  PCA-SIFT: a more distinctive representation for local image descriptors , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[8]  Leonidas J. Guibas,et al.  The Earth Mover's Distance as a Metric for Image Retrieval , 2000, International Journal of Computer Vision.

[9]  Ronald Rosenfeld,et al.  Semi-supervised learning with graphs , 2005 .

[10]  Cordelia Schmid,et al.  A Comparison of Affine Region Detectors , 2005, International Journal of Computer Vision.

[11]  Pietro Perona,et al.  A Bayesian hierarchical model for learning natural scene categories , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[12]  Xiaojin Zhu,et al.  --1 CONTENTS , 2006 .

[13]  Andrew Blake,et al.  Cosegmentation of Image Pairs by Histogram Matching - Incorporating a Global Constraint into MRFs , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[14]  Cordelia Schmid,et al.  Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[15]  Frédéric Jurie,et al.  Learning Visual Similarity Measures for Comparing Never Seen Objects , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Eli Shechtman,et al.  Matching Local Self-Similarities across Images and Videos , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[18]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[19]  Yong Jae Lee,et al.  Foreground Focus: Unsupervised Learning from Partially Matching Images , 2009, International Journal of Computer Vision.

[20]  Edwin R. Hancock,et al.  A generative model for graph matching and embedding , 2009, Comput. Vis. Image Underst..