Methodology for the Evaluation of the Algorithms for Text Line Segmentation Based on Extended Binary Classification

Methodology for the Evaluation of the Algorithms for Text Line Segmentation Based on Extended Binary Classification Text line segmentation represents the key element in the optical character recognition process. Hence, testing of text line segmentation algorithms has substantial relevance. All previously proposed testing methods deal mainly with text database as a template. They are used for testing as well as for the evaluation of the text segmentation algorithm. In this manuscript, methodology for the evaluation of the algorithm for text segmentation based on extended binary classification is proposed. It is established on the various multiline text samples linked with text segmentation. Their results are distributed according to binary classification. Final result is obtained by comparative analysis of cross linked data. At the end, its suitability for different types of scripts represents its main advantage.

[1]  Ioannis Pratikakis,et al.  Text line and word segmentation of handwritten documents , 2009, Pattern Recognit..

[2]  Syed Saqib Bukhari,et al.  Adaptive Binarization of Unconstrained Hand-Held Camera-Captured Document Images , 2009, J. Univers. Comput. Sci..

[3]  Adnan Amin,et al.  Robust skew detection in mixed text/graphics documents , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[4]  Darko Brodic Advantages of the Extended Water Flow Algorithm for Handwritten Text Segmentation , 2011, PReMI.

[5]  Syed Saqib Bukhari,et al.  Script-Independent Handwritten Textlines Segmentation Using Active Contours , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[6]  Darko Brodic,et al.  Basic Test Framework for the Evaluation of Text Line Segmentation and Text Parameter Extraction , 2010, Sensors.

[7]  Darko Brodić,et al.  Optimization of the Gaussian Kernel Extended by Binary Morphology for Text Line Segmentation , 2010 .

[8]  Xueming Qian,et al.  Text detection, localization, and tracking in compressed video , 2007, Signal Process. Image Commun..

[9]  J A Swets,et al.  Measuring the accuracy of diagnostic systems. , 1988, Science.

[10]  Rolf Ingold,et al.  Optical Font Recognition from Projection Profiles , 1993, Electron. Publ..

[11]  Tien D. Bui,et al.  Text line segmentation in handwritten documents using Mumford-Shah model , 2009, Pattern Recognit..

[12]  Subhadip Basu,et al.  Text line extraction from multi-skewed handwritten documents , 2007, Pattern Recognit..

[13]  Darko Brodić,et al.  The Evaluation of the Initial Skew Rate for Printed Text , 2011 .

[14]  Adnan Khashman,et al.  Document Image binarisation Using a Supervised Neural Network , 2008, Int. J. Neural Syst..

[15]  Noorzaily Mohamed Noor Off-line Handwriting Text Line Segmentation : A Review , 2008 .

[16]  Horst Bunke,et al.  The IAM-database: an English sentence database for offline handwriting recognition , 2002, International Journal on Document Analysis and Recognition.

[17]  R. Manmatha,et al.  A scale space approach for automatically segmenting words from historical handwritten documents , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Laurence Likforman-Sulem,et al.  Text line segmentation of historical documents: a survey , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[19]  Darko Brodić,et al.  Basic experiments set for the evaluation of the text line segmentation , 2010 .

[20]  Yi Li,et al.  Script-Independent Text Line Segmentation in Freestyle Handwritten Documents , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Subhadip Basu,et al.  CMATERdb1: a database of unconstrained handwritten Bangla and Bangla–English mixed script document image , 2011, International Journal on Document Analysis and Recognition (IJDAR).

[22]  C.A.B. Mello,et al.  Text Line Segmentation in Images of Handwritten Historical Documents , 2008, 2008 First Workshops on Image Processing Theory, Tools and Applications.

[23]  Darko Brodic,et al.  Optimization of the Anisotropic Gaussian Kernel for Text Segmentation and Parameter Extraction , 2010, IFIP TCS.

[24]  Venu Govindaraju,et al.  Line separation for complex document images using fuzzy runlength , 2004, First International Workshop on Document Image Analysis for Libraries, 2004. Proceedings..

[25]  Basilios Gatos,et al.  Handwriting Segmentation Contest , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).