Sparse Multiscale Patches (SMP) for Image Categorization

In this paper we address the task of image categorization using a new similarity measure on the space of Sparse Multiscale Patches (SMP ). SMP s are based on a multiscale transform of the image and provide a global representation of its content. At each scale, the probability density function (pdf ) of the SMP s is used as a description of the relevant information. The closeness between two images is defined as a combination of Kullback-Leibler divergences between the pdfs of their SMP s. In the context of image categorization, we represent semantic categories by prototype images, which are defined as the centroids of the training clusters. Therefore any unlabeled image is classified by giving it the same label as the nearest prototype. Results obtained on ten categories from the Corel collection show the categorization accuracy of the SMP method.

[1]  Wayne D. Gray,et al.  Basic objects in natural categories , 1976, Cognitive Psychology.

[2]  Florent Perronnin,et al.  A similarity measure between unordered vector sets with application to image categorization , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Yixin Chen,et al.  A sparse support vector machine approach to region-based image categorization , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[4]  Zhou Wang,et al.  Quality-aware images , 2006, IEEE Transactions on Image Processing.

[5]  Justin K. Romberg,et al.  Bayesian tree-structured image modeling using wavelet-domain hidden Markov models , 2001, IEEE Trans. Image Process..

[6]  Minh N. Do,et al.  Wavelet-based texture retrieval using generalized Gaussian density and Kullback-Leibler distance , 2002, IEEE Trans. Image Process..

[7]  Ibrahim A. Ahmad,et al.  A nonparametric estimation of the entropy for absolutely continuous distributions (Corresp.) , 1976, IEEE Trans. Inf. Theory.

[8]  Yixin Chen,et al.  Image Categorization by Learning and Reasoning with Regions , 2004, J. Mach. Learn. Res..

[9]  Joachim M. Buhmann,et al.  Empirical evaluation of dissimilarity measures for color and texture , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[10]  M. Hazelton Variable kernel density estimation , 2003 .

[11]  Michel Barlaud,et al.  Image retrieval via Kullback-Leibler divergence of patches of multiscale coefficients in the KNN framework , 2008, 2008 International Workshop on Content-Based Multimedia Indexing.

[12]  Anil K. Jain,et al.  Image classification for content-based indexing , 2001, IEEE Trans. Image Process..

[13]  Martin J. Wainwright,et al.  Image denoising using scale mixtures of Gaussians in the wavelet domain , 2003, IEEE Trans. Image Process..

[14]  Jitendra Malik,et al.  SVM-KNN: Discriminative Nearest Neighbor Classification for Visual Category Recognition , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[15]  Michel Barlaud,et al.  Fast k nearest neighbor search using GPU , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[16]  In-So Kweon,et al.  A semantic region descriptor for local feature based image categorization , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[17]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[18]  Michel Barlaud,et al.  Image restoration using a kNN-variant of the mean-shift , 2008, 2008 15th IEEE International Conference on Image Processing.

[19]  Elena Pierpaoli,et al.  Reconstructing Sunyaev-Zeldovich clusters in future CMB experiments , 2004 .

[20]  Princeton University,et al.  Reconstructing Sunyaev–Zel'dovich clusters in future cosmic microwave background experiments , 2005 .

[21]  Michel Barlaud,et al.  High-dimensional statistical distance for region-of-interest tracking: Application to combining a soft geometric constraint with radiometry , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[22]  Yuhua Zhu,et al.  Content-Based Image Categorization and Retrieval using Neural Networks , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[23]  Edward H. Adelson,et al.  The Laplacian Pyramid as a Compact Image Code , 1983, IEEE Trans. Commun..

[24]  Martin Szummer,et al.  Indoor-outdoor image classification , 1998, Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database.

[25]  James Ze Wang,et al.  Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[26]  C. Quesenberry,et al.  A nonparametric estimate of a multivariate density function , 1965 .