Character Type Classification via Probabilistic Topic Model

In this paper, we propose a method for character type classification based on a probabilistic topic model. The topic model is originally developed for topic discovery in text analysis using bag-of-words representation. Recent studies have shown the model is also useful for image analysis. We adopt the probabilistic topic model for character type classification. In our method, character type classification is carried out by classifying image patches based on their topic proportions. Since the performance of the method depends on a visual vocabulary generated by image feature extraction, we compare several feature extraction and description methods, and examine the relations to classification performance. In addition, by extending the method, we propose a coarse-to-fine approach to achieve stable character type classification for a small image patch. For that purpose, firstly, we partition an image into several patches which contain enough information to estimate the model parameters via EM algorithm. Then, each patch is subdivided into smaller patches. Estimation on the small patch is carried out by MAP-technique with a prior reflecting topic proportion of its parent patch. Through the experiments, we show accurate character type classification is made possible by the probabilistic topic model.

[1]  Efstathios Stamatatos,et al.  Machine-printed from handwritten text discrimination , 2004, Ninth International Workshop on Frontiers in Handwriting Recognition.

[2]  Anil K. Jain,et al.  Document Representation and Its Application to Page Decomposition , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  David S. Doermann,et al.  Gabor filter based multi-class classifier for scanned document images , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[4]  David Salesin,et al.  Wavelets for computer graphics: a primer.1 , 1995, IEEE Computer Graphics and Applications.

[5]  Andrew Zisserman,et al.  Scene Classification Via pLSA , 2006, ECCV.

[6]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[7]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[8]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[9]  C. Schmid,et al.  Indexing based on scale invariant interest points , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[10]  Alexei A. Efros,et al.  Discovering objects and their location in images , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[11]  Lawrence O'Gorman,et al.  The Document Spectrum for Page Layout Analysis , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Thomas Hofmann,et al.  Unsupervised Learning by Probabilistic Latent Semantic Analysis , 2004, Machine Learning.

[13]  Pietro Perona,et al.  A Bayesian hierarchical model for learning natural scene categories , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[14]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[15]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[16]  Tony Lindeberg,et al.  Shape-adapted smoothing in estimation of 3-D shape cues from affine deformations of local 2-D brightness structure , 1997, Image Vis. Comput..

[17]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[18]  Thomas Hofmann,et al.  Probabilistic Latent Semantic Analysis , 1999, UAI.