Paper information - Statistical approach for supervised codeword selection

Bag-of-words (BoW) is one of the most successful methods for object categorization. This paper proposes a statistical codeword selection algorithm in which the best subset is selected from the initial codewords based on their statistical characteristics. For this purpose, we define two types of codeword confidence: cross-category and within-category confidence. The cross-category confidence eliminates codewords that are indistinctive across categories, and the within-category confidence eliminates codewords that are inconsistent within each category. An informative subset of codewords is then selected based on these two confidences. Experimental evaluation on a scene categorization dataset and the Caltech-101 dataset shows that the proposed method improves categorization performance by up to 10% in terms of error rate reduction when combined with BoW, sparse coding (SC), and locality-constrained linear coding (LLC). Furthermore, the codeword size is reduced by 50%, leading to lower computational complexity.
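The abstract does not give the exact definitions of the two confidences, so the following is only a minimal sketch of the general idea under stated assumptions. Here the cross-category confidence is assumed to be the variance of a codeword's mean frequency across categories, the within-category confidence is assumed to be the inverse of its average within-category variance, and the two are combined by a simple product. The function name `select_codewords`, the parameter `keep_ratio`, and the scoring formula are all hypothetical and are not taken from the paper.

```python
import numpy as np

def select_codewords(hist, labels, keep_ratio=0.5, eps=1e-12):
    """Return the indices of the codewords to keep (hypothetical scoring).

    hist   : (n_images, n_codewords) array of BoW histograms
    labels : (n_images,) array of category labels
    """
    hist = np.asarray(hist, dtype=float)
    labels = np.asarray(labels)
    classes = np.unique(labels)

    # Per-category mean and variance of every codeword's frequency.
    class_means = np.stack([hist[labels == c].mean(axis=0) for c in classes])
    class_vars = np.stack([hist[labels == c].var(axis=0) for c in classes])

    # Assumed cross-category confidence: how much the category means differ.
    cross_conf = class_means.var(axis=0)
    # Assumed within-category confidence: high when a codeword's frequency
    # is stable inside each category.
    within_conf = 1.0 / (class_vars.mean(axis=0) + eps)

    # Assumed combination rule: simple product of the two confidences.
    score = cross_conf * within_conf

    n_keep = max(1, int(round(keep_ratio * hist.shape[1])))
    return np.sort(np.argsort(score)[::-1][:n_keep])

# Toy data: 4 images, 4 codewords, 2 categories.
# Codeword 1 is identical across categories and codeword 2 is noisy
# within each category, so both are dropped.
hist = [[5, 2, 0, 1],
        [5, 2, 9, 1],
        [1, 2, 0, 4],
        [1, 2, 9, 4]]
labels = [0, 0, 1, 1]
keep = select_codewords(hist, labels, keep_ratio=0.5)  # indices 0 and 3
```

With `keep_ratio=0.5` the sketch halves the codebook, mirroring the 50% reduction reported in the abstract. The reduced histograms `np.asarray(hist)[:, keep]` could then be passed to whatever classifier is used downstream.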

Kihong Park | Seungryong Kim | Kwanghoon Sohn | Seungchul Ryu
