Binarization Chinese Rubbing Images Using Gaussian Mixture Model

Rubbings are important components of ancient Chinese books, and are the main source for people to learn, study, and research history. Image segmentation plays a crucial role in extracting useful information and characteristics of Chinese character from the rubbing images. In this paper, binarization using a Gaussian Mixture Model (GMM) with 2 components for representation of background and foreground distribution in a Chinese rubbing image has been proposed. To model the likelihood of each pixel belonging to foreground or background, a foreground and background color model are learned from three color bands samples that using RGB color space. The standard Expectation-Maximisation (EM) algorithm had been used to estimate the GMM parameters. Experimental results on real rubbing images validate the effectiveness of the model when working with Chinese rubbing images.

[1]  Sang Uk Lee,et al.  A comparative performance study of several global thresholding techniques for segmentation , 1990, Comput. Vis. Graph. Image Process..

[2]  Rohit Kamal Chatterjee,et al.  Historical Handwritten Document Image Segmentation Using Morphology , 2014 .

[3]  Laurence Likforman-Sulem,et al.  Document Recognition and Retrieval XVII , 2007 .

[4]  Ophir Frieder,et al.  Degraded document image enhancement , 2007, Electronic Imaging.

[5]  Z. Saidane,et al.  Robust Binarization for Video Text Recognition , 2007 .

[6]  Utpal Garain,et al.  On foreground — background separation in low quality document images , 2005, International Journal of Document Analysis and Recognition (IJDAR).

[7]  C. V. Jawahar,et al.  An MRF Model for Binarization of Natural Scene Text , 2011, 2011 International Conference on Document Analysis and Recognition.

[8]  R. Redner,et al.  Mixture densities, maximum likelihood, and the EM algorithm , 1984 .

[9]  Kwok-Wing Chau,et al.  A new image thresholding method based on Gaussian mixture model , 2008, Appl. Math. Comput..

[10]  Wayne Niblack,et al.  An introduction to digital image processing , 1986 .

[11]  Josef Kittler,et al.  Threshold selection based on a simple image statistic , 1985, Comput. Vis. Graph. Image Process..

[12]  Li Linlin,et al.  Edge Based Binarization for Video Text Images , 2010, ICPR 2010.

[13]  Ioannis Pratikakis,et al.  Adaptive degraded document image binarization , 2006, Pattern Recognit..