A Robust Scheme for Extraction of Text Lines from Handwritten Documents

Considering the vast collection of handwritten documents in various archives, research studies for their automatic processing have major impact in the society. Line segmentation from images of such documents is a crucial step. The problem is more difficult for documents of major Indian scripts such as Bangla because a large number of its characters have either ascender or descender or both and the majority of its writers are accustomed in extremely cursive handwriting. In this article, we describe a novel strip based text line segmentation method for handwritten documents of Bangla. Moreover, the proposed method has been found to perform efficiently on English and Devanagari handwritten documents. We conducted extensive experimentations and its results show the robustness of the proposed approach on multiple scripts.

[1]  S. Banerjee,et al.  An efficient line segmentation approach for handwritten Bangla document image , 2015, 2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR).

[2]  David Doermann,et al.  A New Algorithm for Detecting Text Line in Handwritten Documents , 2006 .

[3]  Laurence Likforman-Sulem,et al.  Text line segmentation of historical documents: a survey , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[4]  Darko Brodic,et al.  Text line segmentation by adapted water flow algorithm , 2010, 10th Symposium on Neural Network Applications in Electrical Engineering.

[5]  Venu Govindaraju,et al.  2009 10th International Conference on Document Analysis and Recognition A Steerable Directional Local Profile Technique for Extraction of Handwritten Arabic Text Lines , 2022 .

[6]  Alireza Alaei,et al.  A new scheme for unconstrained handwritten text-line segmentation , 2011, Pattern Recognit..

[7]  Horst Bunke,et al.  Using Hidden Markov Models as a Tool for Handwritten Text Line Segmentation , 2007 .

[8]  Apostolos Antonacopoulos,et al.  Document image analysis for World War II personal records , 2004, First International Workshop on Document Image Analysis for Libraries, 2004. Proceedings..

[9]  Bidyut Baran Chaudhuri,et al.  A Global-to-Local Approach to Binarization of Degraded Document Images , 2014, 2014 22nd International Conference on Pattern Recognition.

[10]  Vassilis Katsouros,et al.  Handwritten document image segmentation into text lines and words , 2010, Pattern Recognit..

[11]  Georgios Louloudis,et al.  ICDAR 2009 Handwriting Segmentation Contest , 2009, ICDAR.

[12]  Ioannis Pratikakis,et al.  Text line and word segmentation of handwritten documents , 2009, Pattern Recognit..

[13]  Fei Yin,et al.  2009 10th International Conference on Document Analysis and Recognition A Variational Bayes Method for Handwritten Text Line Segmentation , 2022 .

[14]  Ioannis Pratikakis,et al.  A Block-Based Hough Transform Mapping for Text Line Detection in Handwritten Documents , 2006 .

[15]  Yi Li,et al.  Script-Independent Text Line Segmentation in Freestyle Handwritten Documents , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.