Document segmentation using polynomial spline wavelets

Wavelet transforms have been widely used as effective tools in texture segmentation in the past decade. Segmentation of document images, which usually contain three types of texture information: text, picture and background, can be regarded as a special case of texture segmentation. B-spline wavelets possess some desirable properties such as being well localized in time and frequency, and being compactly supported, which make them an effective tool for texture analysis. Based on the observation that text textures provide fast-changed and relatively regular distributed edges in the wavelet transform domain, an efficient document segmentation algorithm is designed via cubic B-spline wavelets. Three-means or two-means classification is applied for classifying pixels with similar characteristics after feature estimation at the outputs of high frequency bands of spline wavelet transforms. We examine and evaluate the contributions of different factors to the segmentation results from the viewpoints of decomposition levels, frequency bands and wavelet functions. Further performance analysis reveals the advantages of the proposed method.

[1]  Jian Fan,et al.  Texture Classification by Wavelet Packet Signatures , 1993, MVA.

[2]  L. Schumaker Spline Functions: Basic Theory , 1981 .

[3]  Lawrence O'Gorman,et al.  Document Image Analysis , 1996 .

[4]  George Nagy,et al.  DOCUMENT ANALYSIS WITH AN EXPERT SYSTEM , 1986 .

[5]  Michael Unser,et al.  Texture classification and segmentation using wavelet frames , 1995, IEEE Trans. Image Process..

[6]  Sargur N. Srihari,et al.  Classification of newspaper image blocks using texture analysis , 1989, Comput. Vis. Graph. Image Process..

[7]  Stéphane Mallat,et al.  Multifrequency channel decompositions of images and wavelet models , 1989, IEEE Trans. Acoust. Speech Signal Process..

[8]  Robert M. Gray,et al.  Text and picture segmentation by the distribution analysis of wavelet coefficients , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[9]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Michael Unser,et al.  A family of polynomial spline wavelet transforms , 1993, Signal Process..

[11]  Cedric Nishan Canagarajah,et al.  A robust automatic clustering scheme for image segmentation using wavelets , 1996, IEEE Trans. Image Process..