Document Image Binarization Based on NFCM

In general, binarization methods of document image can be categorized into two classes [5]: local methods and global methods. The global threshold methods segment an entire image with a single threshold using the gray level histogram of image, while the local methods partition the given image into a number of sub-images (local windows) and select a threshold for each of sub-images. The most common methods of calculating the global threshold are Otsu’s algorithm [6] and the histogram shapebased threshold algorithms [6]. The global threshold method has a very good effect for the document image which has two obvious peaks in gray level histogram or is clearly between the foreground and the background. However, for the document image when the gray values between the foreground and the background become fuzzy or whose brightness is uneven, it can not obtain a good result. The most common methods of calculating local threshold are Bernsen algorithm [6] and Niblack algorithm [6]. The local threshold method is fit for dealing with the situation of the edge between character and the background being relatively vague and uneven illumination. In this case, the local threshold approach can extract character strokes from the background effectively. However, the local threshold methods also have some shortcomings. Firstly, when the whole local window is filled up with the background, the local binarization methods usually bring the ghost phenomenon (ghost is a pseudo stroke which is derived by binarization algorithm in the background region) [7]. In addition, the normal local threshold methods will also bring fracture phenomenon for some camera images (the strokes in the characters appears fracture phenomenon) [8]. Furthermore, some of the local threshold methods are relatively slower. Recently, some new binarization methods were proposed. [9] proposed a method which is based on mathematical morphology operation to enhance the blurred stroke information. [10] proposed a method which is a combination of Otsu and Bernsen. [11] proposed binarizing document image with K-means clustering algorithm. However, the K-means algorithm is a hard clustering algorithm. All these papers are not based on the combination of Niblack and Fuzzy C-Means algorithm (FCM), which is a soft clustering algorithm and has higher ability to process uncertain information [12], e.g. the fuzzy document image. Motivated by the aforementioned reason, we propose a new binarization algorithm for camera-based document image. The algorithm includes two thresholds: the first local threshold is calculated by Niblack algorithm; the second local threshold is calculated by FCM. We call this algorithm NFCM algorithm. This algorithm can not only preserve the character stroke details, but also eliminate ghost effectively.

[1]  Bashar Al-Shboul,et al.  A fast fuzzy clustering algorithm , 2007 .

[2]  Ioannis Pratikakis,et al.  Adaptive degraded document image binarization , 2006, Pattern Recognit..

[3]  David S. Doermann,et al.  Progress in camera-based document image analysis , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[4]  Zhen-Long Bai,et al.  An approach to extracting the target text line from a document image captured by a pen scanner , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[5]  Chi Cui An Improved Design and Implementation of Chinese Name Card Recognition System on Mobile Devices , 2006, 2006 International Conference on Intelligent Information Hiding and Multimedia.

[6]  GueeSang Lee,et al.  Binarization by Local K-means Clustering for Korean Text Extraction , 2008, 2008 IEEE International Symposium on Signal Processing and Information Technology.

[7]  Hao Wang,et al.  Scalability of Hybrid Fuzzy C-Means Algorithm Based on Quantum-Behaved PSO , 2007, Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007).

[8]  Hefei Ling,et al.  An Image Binarization Method Based on HVS , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[9]  In-Jung Kim,et al.  Multi-window binarization of camera image for document recognition , 2004, Ninth International Workshop on Frontiers in Handwriting Recognition.

[10]  Bülent Sankur,et al.  Survey over image thresholding techniques and quantitative performance evaluation , 2004, J. Electronic Imaging.

[11]  You Yang OCR Oriented Binarization Method of Document Image , 2008, 2008 Congress on Image and Signal Processing.

[12]  David S. Doermann,et al.  Geometric Rectification of Camera-Captured Document Images , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Ioannis Pratikakis,et al.  Improved document image binarization by using a combination of multiple binarization techniques and adapted edge information , 2008, 2008 19th International Conference on Pattern Recognition.