Document Binarization Based on Connected Operators

An original binarization method based on connected operators is proposed in this paper. Connected operators make it possible to filter and/or segment an image while preserving its contours. The proposed method extracts relevant document objects by means of the component-tree structure. It was compared with other binarization methods and behaved well in various contexts.
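
To illustrate what "filtering while preserving contours" means, the following is a minimal sketch of one well-known connected operator, an area opening computed directly on the threshold sets of a gray-level image. It is not the binarization method proposed in the paper, and it does not build an explicit component tree: it simply revisits the connected components of each threshold set, which are the nodes such a tree would store. The function names `components` and `area_opening`, the 4-connectivity, and the toy image are assumptions made for this example.

```python
from collections import deque


def components(mask):
    """Return the 4-connected components of a boolean grid as lists of (row, col)."""
    h, w = len(mask), len(mask[0])
    seen = [[False] * w for _ in range(h)]
    comps = []
    for y in range(h):
        for x in range(w):
            if mask[y][x] and not seen[y][x]:
                seen[y][x] = True
                queue, comp = deque([(y, x)]), []
                while queue:
                    cy, cx = queue.popleft()
                    comp.append((cy, cx))
                    for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                        if 0 <= ny < h and 0 <= nx < w and mask[ny][nx] and not seen[ny][nx]:
                            seen[ny][nx] = True
                            queue.append((ny, nx))
                comps.append(comp)
    return comps


def area_opening(image, min_area):
    """Keep, at every gray level, only the bright components with at least min_area pixels.

    Each output pixel receives the highest level t at which it still belongs to a
    sufficiently large component of the threshold set {image >= t}.
    """
    h, w = len(image), len(image[0])
    levels = sorted({v for row in image for v in row})
    out = [[levels[0]] * w for _ in range(h)]
    for t in levels[1:]:  # ascending, so higher levels overwrite lower ones
        mask = [[v >= t for v in row] for row in image]
        for comp in components(mask):
            if len(comp) >= min_area:
                for y, x in comp:
                    out[y][x] = t
    return out


# Toy image: a 3x3 bright object and an isolated one-pixel speck.
image = [
    [0, 0, 0, 0, 0, 0],
    [0, 9, 9, 9, 0, 0],
    [0, 9, 9, 9, 0, 7],
    [0, 9, 9, 9, 0, 0],
    [0, 0, 0, 0, 0, 0],
]
filtered = area_opening(image, min_area=4)
binary = [[1 if v > 0 else 0 for v in row] for row in filtered]
```

In this toy case the speck is removed entirely while the large object keeps its exact outline and gray level: a connected operator either keeps or discards whole components and never moves an edge. This sketch recomputes the components at every level for clarity; a component tree would store them once and let the filter be applied by pruning nodes.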
