Segmentation of document images

Several methods for segmentation of document images are explored. The authors pose the segmentation operation as a statistical classification task with two pattern classes: print and background. A number of classification strategies are available. All require some prior information about the distribution of gray levels for the two classes. Learning (either supervised or unsupervised) and automatic updating of the class-conditional densities are performed within image subregions to adapt global density estimates to the local area. After local densities have been obtained, each pixel within the window is classified; several techniques for this are considered. Results on four test images indicate that the commonly used contextual models are not suitable to all document images. >

[1]  Kanti V. Mardia,et al.  A Spatial Thresholding Method for Image Segmentation , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[3]  Azriel Rosenfeld,et al.  Scene Labeling by Relaxation Operations , 1976, IEEE Transactions on Systems, Man, and Cybernetics.

[4]  P.K Sahoo,et al.  A survey of thresholding techniques , 1988, Comput. Vis. Graph. Image Process..

[5]  John Haslett,et al.  Maximum likelihood discriminant analysis on the plane using a Markovian model of spatial context , 1985, Pattern Recognit..

[6]  J. Besag On the Statistical Analysis of Dirty Pictures , 1986 .

[7]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[8]  A. Owen A neighbourhood-based classifier for LANDSAT data , 1984 .

[9]  Azriel Rosenfeld,et al.  Some experiments on variable thresholding , 1979, Pattern Recognit..

[10]  Nils Lid Hjort,et al.  Automatic Training in Statistical Pattern Recognition , 1988 .