Paper information - Bag-of-Features Codebook Generation by Self-Organisation

Bag of features is a well-established technique for the visual categorisation of objects, object categories and textures. One of the most important parts of this technique is codebook generation, since the within-class and between-class discrimination power of the codebook is the main factor in categorisation accuracy. A codebook is generated from regions of interest extracted automatically from a set of labeled (supervised/semi-supervised) or unlabeled (unsupervised) images. The standard tool for codebook generation is the c-means clustering algorithm, and state-of-the-art results have been reported using generation schemes based on c-means. In this work, we challenge this mainstream approach by demonstrating that the competitive learning principle of the self-organising map (SOM) can provide similar and often superior results to c-means. We therefore claim that exploiting the self-organisation principle is an alternative research direction to the mainstream in visual object categorisation, and that its importance for the ultimate challenge, unsupervised visual object categorisation, needs to be investigated.
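As a rough illustration of the idea in the abstract, the sketch below trains a small SOM on local descriptors, uses its node weights as the codebook, and builds a bag-of-features histogram by nearest-codeword assignment. It is not the authors' implementation: the function names, the 4x4 grid, the linearly decaying learning rate and Gaussian neighbourhood, and the synthetic 8-dimensional descriptors (standing in for real region descriptors such as SIFT) are all assumptions made for this example.

```python
import numpy as np

def train_som_codebook(descriptors, grid=(4, 4), epochs=10, lr0=0.5, seed=0):
    """Train a small 2-D SOM; its node weight vectors serve as the codebook.

    Hypothetical helper: the schedule and neighbourhood are illustrative choices.
    Requires at least grid[0] * grid[1] descriptors for the initialisation.
    """
    rng = np.random.default_rng(seed)
    rows, cols = grid
    # Grid coordinates of every node, used by the neighbourhood function.
    coords = np.array([(r, c) for r in range(rows) for c in range(cols)], dtype=float)
    # Initialise the codewords from randomly chosen descriptors.
    idx = rng.choice(len(descriptors), size=rows * cols, replace=False)
    codebook = descriptors[idx].astype(float)
    sigma0 = max(rows, cols) / 2.0
    n_steps = epochs * len(descriptors)
    step = 0
    for _ in range(epochs):
        for x in descriptors[rng.permutation(len(descriptors))]:
            frac = step / n_steps
            lr = lr0 * (1.0 - frac)                  # decaying learning rate
            sigma = sigma0 * (1.0 - frac) + 1e-2     # shrinking neighbourhood width
            # Competitive step: the best-matching unit is the closest codeword.
            bmu = np.argmin(((codebook - x) ** 2).sum(axis=1))
            # Cooperative step: grid neighbours of the winner are updated too.
            grid_d2 = ((coords - coords[bmu]) ** 2).sum(axis=1)
            h = np.exp(-grid_d2 / (2.0 * sigma ** 2))
            codebook += lr * h[:, None] * (x - codebook)
            step += 1
    return codebook

def bof_histogram(descriptors, codebook):
    """Assign each descriptor to its nearest codeword and return a normalised histogram."""
    d2 = ((descriptors[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    words = d2.argmin(axis=1)
    hist = np.bincount(words, minlength=len(codebook)).astype(float)
    return hist / hist.sum()

# Synthetic stand-in for descriptors extracted from regions of interest.
demo_rng = np.random.default_rng(1)
descriptors = demo_rng.normal(size=(200, 8))
codebook = train_som_codebook(descriptors)
hist = bof_histogram(descriptors[:50], codebook)  # histogram for one "image"
```

Replacing `train_som_codebook` with a c-means (k-means) clusterer while keeping `bof_histogram` unchanged would give the conventional baseline that the abstract compares against; the resulting histograms would then be fed to whatever classifier is used for categorisation.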
