Gabor-based Kernel Self-optimization Fisher Discriminant for Optical Character Segmentation from Text-image-mixed Document

Abstract Recognizing optical character from document image of text mixed by figure has its wide applications such as document auto-reading. Segmenting the document region from text-mixed is a crucial step of this system. The segmentation procedure includes two stages, one is to extract the texture features of each block based on Gabor filter, and second is to classify the texture features for segmentation based kernel self-optimization Fisher classifier. Some experiments are implemented to testify the performance of the proposed method.

[1]  Wilson S. Geisler,et al.  Texture segmentation using Gabor modulation/demodulation , 1987, Pattern Recognit. Lett..

[2]  Alberto Del Bimbo,et al.  Semantics in Visual Information Retrieval , 1999, IEEE Multim..

[3]  Anil K. Jain,et al.  Text segmentation using gabor filters for automatic document processing , 1992, Machine Vision and Applications.

[4]  D. Sagi,et al.  Gabor filters as texture discriminator , 1989, Biological Cybernetics.

[5]  Ramesh C. Jain,et al.  A survey on the use of pattern recognition methods for abstraction, indexing and retrieval of images and video , 2002, Pattern Recognit..

[6]  Atsuo Yoshitaka,et al.  A Survey on Content-Based Retrieval for Multimedia Databases , 1999, IEEE Trans. Knowl. Data Eng..

[7]  Friedrich M. Wahl,et al.  Block segmentation and text extraction in mixed text/image documents , 1982, Comput. Graph. Image Process..

[8]  Lawrence O'Gorman,et al.  The Document Spectrum for Page Layout Analysis , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Sargur N. Srihari,et al.  Classification of newspaper image blocks using texture analysis , 1989, Comput. Vis. Graph. Image Process..

[10]  Friedrich M. Wahl,et al.  Document Analysis System , 1982, IBM J. Res. Dev..

[11]  John G. Daugman,et al.  Complete discrete 2-D Gabor transforms by neural networks for image analysis and compression , 1988, IEEE Trans. Acoust. Speech Signal Process..

[12]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  B. S. Manjunath,et al.  Texture Features for Browsing and Retrieval of Image Data , 1996, IEEE Trans. Pattern Anal. Mach. Intell..