A Morphological Approach for Text-Line Segmentation in Handwritten Documents

Document image segmentation to text lines is a critical stage towards unconstrained handwritten document recognition. Although morphological operations proved to be effective in processing machine-printed documents for several issues, similar methods for unconstraint-handwritten documents lack accuracy. We propose an efficient method based on binary morphology for text-line segmentation in such documents. The basic steps of our approach are: a) sub sampling and binary rank order filtering to enhance the text-line structures and b) applying dilations and (p,q)-th generalized foreground rank openings successively to join close and horizontally overlapping regions while preventing a merge in the vertical direction. The method tested on the benchmarking dataset of the ICDAR07 handwriting segmentation contest and show remarkable results.

[1]  Dan S. Bloomberg Textured reductions for document image analysis , 1996, Electronic Imaging.

[2]  Petros Maragos,et al.  Generalized hit-miss operators , 1990, Optics & Photonics.

[3]  Vassilis Katsouros,et al.  Handwritten document image segmentation into text lines and words , 2010, Pattern Recognit..

[4]  Sargur N. Srihari,et al.  Control Structure for Interpreting Handwritten Addresses , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Jonathan J. Hull Document Image skew Detection: Survey and Annotated Bibliography , 1996, DAS.

[6]  Venu Govindaraju,et al.  Line separation for complex document images using fuzzy runlength , 2004, First International Workshop on Document Image Analysis for Libraries, 2004. Proceedings..

[7]  R. Lotufo,et al.  Morphological Image Processing , 2008 .

[8]  Ioannis Pratikakis,et al.  Text line detection in handwritten documents , 2008, Pattern Recognit..

[9]  Georgios Louloudis,et al.  ICDAR 2009 Handwriting Segmentation Contest , 2009, ICDAR.

[10]  R. Manmatha,et al.  A scale space approach for automatically segmenting words from historical handwritten documents , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Basilios Gatos,et al.  Handwriting Segmentation Contest , 2007, ICDAR.

[12]  Dan S. Bloomberg Image analysis using threshold reduction , 1991, Optics & Photonics.

[13]  Petros Maragos,et al.  Morphological filters-Part I: Their set-theoretic analysis and relations to linear shift-invariant filters , 1987, IEEE Trans. Acoust. Speech Signal Process..

[14]  William A. Barrett,et al.  Separating lines of text in free-form handwritten historical documents , 2006, Second International Conference on Document Image Analysis for Libraries (DIAL'06).

[15]  Lawrence O'Gorman,et al.  The Document Spectrum for Page Layout Analysis , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Matematiksel Morfoloji,et al.  MORPHOLOGICAL IMAGE PROCESSİNG WITH FUZZY LOGIC , 2006 .

[17]  Berrin A. Yanikoglu,et al.  Segmentation of off-line cursive handwriting using linear programming , 1998, Pattern Recognit..

[18]  George D. C. Cavalcanti,et al.  Text Line Segmentation Based on Morphology and Histogram Projection , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[19]  Fei Yin,et al.  2009 10th International Conference on Document Analysis and Recognition A Variational Bayes Method for Handwritten Text Line Segmentation , 2022 .

[20]  H. Damasio,et al.  IEEE Transactions on Pattern Analysis and Machine Intelligence: Special Issue on Perceptual Organization in Computer Vision , 1998 .

[21]  Sargur N. Srihari,et al.  A statistical approach to line segmentation in handwritten documents , 2007, Electronic Imaging.

[22]  Klaus D. Tönnies,et al.  Line detection and segmentation in historical church registers , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[23]  Yi Li,et al.  Script-Independent Text Line Segmentation in Freestyle Handwritten Documents , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Syed Saqib Bukhari,et al.  Script-Independent Handwritten Textlines Segmentation Using Active Contours , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[25]  Umapada Pal,et al.  Morphology Based Handwritten Line Segmentation Using Foreground and Background Information , 2008 .

[26]  George Nagy,et al.  Twenty Years of Document Image Analysis in PAMI , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  Petros Maragos,et al.  Morphological filters-Part II: Their relations to median, order-statistic, and stack filters , 1987, IEEE Trans. Acoust. Speech Signal Process..

[28]  Apostolos Antonacopoulos,et al.  Handwriting Segmentation Contest , 2007, ICDAR.