Analogic Preprocessing And Segmentation Algorithms For Offline Handwriting Recognition

This report describes analogic algorithms used in the preprocessing and segmentation phase of offline handwriting recognition tasks. A segmentation-based handwriting recognition approach is discussed, i.e., the system attempts to segment the words into their constituent letters. In order to improve their speed, the utilized CNN algorithms, whenever possible, use dynamic, wave front propagation-based methods instead of relying on morphologic operators were embedded into iterative algorithms. The system first locates the handwritten lines in the page image, then corrects their skew as necessary. It then searches for the words within the lines and corrects the skew at the word level as well. A novel trigger wave-based word segmentation algorithm is presented, which operates on the skeletons of words. Sample results of experiments conducted on a database of 25 handwritten pages along with suggestions for future development are presented.

[1]  Venu Govindaraju,et al.  Analysis of textual images using the Hough transform , 1989, Machine Vision and Applications.

[2]  Paul D. Gader,et al.  Generalized hidden Markov models. II. Application to handwritten word recognition , 2000, IEEE Trans. Fuzzy Syst..

[3]  Ehud Rivlin,et al.  Offline cursive script word recognition – a survey , 1999, International Journal on Document Analysis and Recognition.

[4]  Sargur N. Srihari,et al.  On-Line and Off-Line Handwriting Recognition: A Comprehensive Survey , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Gyeonghwan Kim,et al.  An architecture for handwritten text recognition systems , 1999, International Journal on Document Analysis and Recognition.

[6]  Anthony J. Robinson,et al.  An Off-Line Cursive Handwriting Recognition System , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Jack Sklansky,et al.  Finding the convex hull of a simple polygon , 1982, Pattern Recognit. Lett..

[8]  Vedat Tavsanoglu,et al.  Multiscale handwritten character recognition using CNN image filters , 2002, Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No.02CH37290).

[9]  M. E. Morita,et al.  Morphological approach of handwritten word skew correction , 1998, Proceedings SIBGRAPI'98. International Symposium on Computer Graphics, Image Processing, and Vision (Cat. No.98EX237).

[10]  Tamás Roska,et al.  The CNN universal machine: an analogic array computer , 1993 .

[11]  Kenneth M. Sayre,et al.  Machine recognition of handwritten words: A project report , 1973, Pattern Recognit..

[12]  Venu Govindaraju,et al.  The Role of Holistic Paradigms in Handwritten Word Recognition , 2009 .

[13]  Guy Lorette Handwriting recognition or reading? What is the situation at the dawn of the 3rd millenium? , 1999, International Journal on Document Analysis and Recognition.

[14]  Robert Sabourin,et al.  An HMM-Based Approach for Off-Line Unconstrained Handwritten Word Modeling and Recognition , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Leon O. Chua,et al.  The CNN paradigm , 1993 .

[16]  Lambert Schomaker,et al.  Finding features used in the human reading of cursive handwriting , 1999, International Journal on Document Analysis and Recognition.

[17]  Jhing-Fa Wang,et al.  Segmentation of Single- or Multiple-Touching Handwritten Numeral String Using Background and Foreground Analysis , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Jozsef Csicsvari,et al.  High-speed character recognition using a dual cellular neural network architecture (CNND) , 1993 .

[19]  W SeniorAndrew,et al.  An Off-Line Cursive Handwriting Recognition System , 1998 .

[20]  Ángel Rodríguez-Vázquez,et al.  ACE4k: An analog I/O 64×64 visual microprocessor chip with 7-bit analog accuracy , 2002, Int. J. Circuit Theory Appl..

[21]  Leon O. Chua,et al.  Computing with Front Propagation: Active Contour And Skeleton Models In Continuous-Time CNN , 1999, J. VLSI Signal Process..

[22]  Tamás Roska,et al.  CNN software library (Templates and algorithms.) Version 7.2 , 1998 .