A New Approach to Water Flow Algorithm for Text Line Segmentation

This paper proposes a new approach to water flow algorithm for the text line segmentation. Original method assumes hypothetical water flows under a few specified angles to the document image frame from left to right and vice versa. As a result, unwetted image frames are extracted. These areas are of major importance for text line segmentation. Method modifications mean extension values of water flow angle and unwetted image frames function enlargement. Results are encouraging due to text line segmentation improvement which is the most challenging process stage in document image processing.

[1]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[2]  Dana H. Ballard,et al.  Generalizing the Hough transform to detect arbitrary shapes , 1981, Pattern Recognit..

[3]  Michael Ian Shamos,et al.  Computational geometry: an introduction , 1985 .

[4]  Siu Cheung Hui,et al.  Cursive word reference line detection , 1997, Pattern Recognit..

[5]  Matti Pietikäinen,et al.  Adaptive document image binarization , 2000, Pattern Recognit..

[6]  Adnan Amin,et al.  A Document Skew Detection Method Using the Hough Transform , 2000, Pattern Analysis & Applications.

[7]  Venu Govindaraju,et al.  Line separation for complex document images using fuzzy runlength , 2004, First International Workshop on Document Image Analysis for Libraries, 2004. Proceedings..

[8]  Ken-ichi Iso,et al.  An HMM-based text segmentation method using variational Bayes approach and its application to LVCSR for broadcast news , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[9]  Adnan Amin,et al.  Robust skew detection in mixed text/graphics documents , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[10]  Laurence Likforman-Sulem,et al.  Text line segmentation of historical documents: a survey , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[11]  K. Koch Introduction to Bayesian Statistics , 2007 .

[12]  Subhadip Basu,et al.  Text line extraction from multi-skewed handwritten documents , 2007, Pattern Recognit..

[13]  Adnan Khashman,et al.  Document Image binarisation Using a Supervised Neural Network , 2008, Int. J. Neural Syst..

[14]  C.A.B. Mello,et al.  Text Line Segmentation in Images of Handwritten Historical Documents , 2008, 2008 First Workshops on Image Processing Theory, Tools and Applications.

[15]  Aura Conci,et al.  Automatic Discrimination between Printed and Handwritten Text in Documents , 2009, 2009 XXII Brazilian Symposium on Computer Graphics and Image Processing.

[16]  Syed Saqib Bukhari,et al.  Adaptive Binarization of Unconstrained Hand-Held Camera-Captured Document Images , 2009, J. Univers. Comput. Sci..

[17]  Darko Brodic,et al.  An Approach to Modification of Water Flow Algorithm for Segmentation and Text Parameters Extraction , 2010, DoCEIS.

[18]  Darko Brodić,et al.  Basic experiments set for the evaluation of the text line segmentation , 2010 .

[19]  Darko Brodic,et al.  Basic Test Framework for the Evaluation of Text Line Segmentation and Text Parameter Extraction , 2010, Sensors.

[20]  Ching Y. Suen,et al.  Arabic Handwritten Text Line Extraction by Applying an Adaptive Mask to Morphological Dilation , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.