Image categorization by a classifier based on probabilistic topic model

With rapid increase of number of accessible images and videos, ability to recognize visual information is getting more and more important for content-based information retrieval. Recently, probabilistic topic models, which were originally developed for text analysis, have been used for image categorization successfully. Usually, ldquotopicsrdquo which represent contents of an image is detected based on the underlying probabilistic model, then image categorization is carried out using topic distribution as the input feature. Typical method is to use k-nearest neighbor classifier based on L2-distance after topic discovery. In the method, topic distribution is just treated as a feature point. In this paper, we propose a categorization method based on more natural use of the topic distribution, which is derived by using pLSA model. Categorization is carried out by estimating conditional probability p(category|data). We present two types of image categorization tasks, scene classification and document image segmentation, and show the proposed method performs very well. In addition, we also examine the performance of the proposed method under the situation where only the limited number of labeled examples are available. We show our method can perform quite well even in the circumstances.

[1]  Andrew Zisserman,et al.  Scene Classification Via pLSA , 2006, ECCV.

[2]  Peter Stanchev,et al.  Content-Based Image Retrieval Systems , 2001 .

[3]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[4]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[5]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[6]  Thomas Hofmann,et al.  Probabilistic Latent Semantic Analysis , 1999, UAI.

[7]  Pietro Perona,et al.  A Bayesian hierarchical model for learning natural scene categories , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[8]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[9]  Pietro Perona,et al.  Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[10]  Serge J. Belongie,et al.  Region-based image querying , 1997, 1997 Proceedings IEEE Workshop on Content-Based Access of Image and Video Libraries.

[11]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[12]  Tony Lindeberg,et al.  Shape-adapted smoothing in estimation of 3-D shape cues from affine deformations of local 2-D brightness structure , 1997, Image Vis. Comput..

[13]  Cordelia Schmid,et al.  Scale & Affine Invariant Interest Point Detectors , 2004, International Journal of Computer Vision.

[14]  Alexei A. Efros,et al.  Discovering object categories in image collections , 2005 .