Multithresholding of mixed-type documents

Mixed type documents include text, drawings and graphics regions. It is obvious that a technique that can reduce the number of the gray-levels in accordance to the type of each document region could be important for many document applications, such as storage, transmission and recognition. To solve this problem this paper proposes a new method that is called the document multithresholding technique. The method is based on a Page Layout Analysis (PLA) technique and on a neural network multilevel threshold selection approach. In the final document the different block types are stored with the appropriate and limited number of gray-level values. In text and line-drawing blocks, only one threshold is determined whereas in the graphics blocks the optimal number of thresholds is first determined. The performance of the method was extensively tested on a variety of documents.

[1]  Ian H. Witten,et al.  Managing gigabytes , 1994 .

[2]  P. Laguna,et al.  Signal Processing , 2002, Yearbook of Medical Informatics.

[3]  Eberhard Mandler,et al.  Document analysis-from pixels to contents , 1992 .

[4]  Robert M. Haralick,et al.  Automatic multithreshold selection , 1984, Comput. Vis. Graph. Image Process..

[5]  P.K Sahoo,et al.  A survey of thresholding techniques , 1988, Comput. Vis. Graph. Image Process..

[6]  Andrew K. C. Wong,et al.  A new method for gray-level picture thresholding using the entropy of the histogram , 1985, Comput. Vis. Graph. Image Process..

[7]  Mark J. Carlotto,et al.  Histogram Analysis Using a Scale-Space Approach , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  C. Strouthopoulos,et al.  Identification of text-only areas in mixed-type documents , 1997 .

[9]  Ronald W. Schafer,et al.  Multilevel thresholding using edge matching , 1988, Comput. Vis. Graph. Image Process..

[10]  C. Strouthopoulos,et al.  PLA using RLSA and a neural network , 1999 .

[11]  Abhijit S. Pandya,et al.  Pattern Recognition with Neural Networks in C++ , 1995 .

[12]  Qin-Zhong Ye,et al.  On minimum error thresholding and its implementations , 1988, Pattern Recognit. Lett..

[13]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[14]  Qin Zhong,et al.  On minimum error thresholding and its implementations , 1988 .

[15]  Friedrich M. Wahl,et al.  Document Analysis System , 1982, IBM J. Res. Dev..

[16]  Nikos Papamarkos,et al.  A New Approach for Multilevel Threshold Selection , 1994, CVGIP Graph. Model. Image Process..

[17]  Yasuaki Nakano,et al.  Segmentation methods for character recognition: from segmentation to document structure analysis , 1992, Proc. IEEE.

[18]  S. Hyakin,et al.  Neural Networks: A Comprehensive Foundation , 1994 .

[19]  N. Papamarkos Multirational function approximation via linear programming , 1996 .

[20]  H. R. Keshavan,et al.  An optimal multiple threshold scheme for image segmentation , 1984, IEEE Transactions on Systems, Man, and Cybernetics.

[21]  R. Kohler A segmentation system based on thresholding , 1981 .

[22]  Anil K. Jain,et al.  Page segmentation using tecture analysis , 1996, Pattern Recognit..

[23]  Michael Spann,et al.  A quad-tree approach to image segmentation which combines statistical and spatial information , 1985, Pattern Recognit..

[24]  Josef Kittler,et al.  Minimum error thresholding , 1986, Pattern Recognit..

[25]  Charalambos Strouthopoulos,et al.  Text identification for document image analysis using a neural network , 1998, Image Vis. Comput..

[26]  José Manuel Rebordão,et al.  An amplitude segmentation method based on the distribution function of an image , 1984, Comput. Vis. Graph. Image Process..

[27]  Friedrich M. Wahl,et al.  Block segmentation and text extraction in mixed text/image documents , 1982, Comput. Graph. Image Process..