Using irregular pyramid for text segmentation and binarization of gray scale images

Compared to binary images that most text extraction methods work on, gray scale images provide much more information for the extraction task. On the other hand complication also arises in determining the subject textual content from its background region (i.e. thresholding) before the actual text extraction process can begin. Differing from the usual sequence of processes where document images are binarized before the actual text extraction, this paper proposes a new method by first segmenting individual subject areas with the help of an irregular pyramid to be followed by the binarization process. This permits the focus of attention only on the appropriate subject areas for the binarization process before text recognition. Our method overcomes the difficulty in global binarization to find a single value to fit all. It also avoids the common problem in most local thresholding technique of finding a suitable window size. As shown in our experimented result, our method performed well in both text segmentation and binarization by varying the sequence of processing.

[1]  Proceedings Seventh International Conference on Document Analysis and Recognition , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[2]  Chew Lim Tan,et al.  Detection of word groups based on irregular pyramid , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[3]  Seong-Whan Lee,et al.  A new methodology for gray-scale character segmentation and recognition , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[4]  Peter Meer,et al.  Stochastic image pyramids , 1989, Comput. Vis. Graph. Image Process..

[5]  Theodosios Pavlidis,et al.  Direct Gray-Scale Extraction of Features for Character Recognition , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Matti Pietikäinen,et al.  Adaptive document image binarization , 2000, Pattern Recognit..

[7]  Fu Chang Retrieving information from document images: problems and solutions , 2001, International Journal on Document Analysis and Recognition.