Separating drawings, formula and text from free handwriting

This paper describes a method for separating on-line handwritten patterns into characters, figures, and formulas. Today, Tablet PCs and electronic whiteboards provide much larger writing area for pen interfaces unlike PDAs, so that users can easily input text, write mathematical formulas and draw figures on the screen. The fact that people can write these objects by a single pen (marker) without switching the device, mode, software or whatever else and without any writing restriction such as grids or boxes is one of the most important benefits of the pen interfaces but it requires difficult task of segmenting these objects. We apply a probabilistic model for this problem and employ stroke features, stroke crossings and stroke densities. Moreover, we partially apply the approach of segmentation by recognition. Although the current recognizer for formulas is not a true recognizer, we have achieved about 81% correct segmentation for all the strokes in our newly prepared database of the mixed patterns from these objects.

[1]  Anil K. Jain,et al.  Structure in on-line documents , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[2]  Masaki Nakagawa,et al.  On-line text/drawings segmentation of handwritten patterns , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[3]  Masaki Nakagawa,et al.  Online writing-box-free recognition of handwritten Japanese text considering character size variations , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[4]  Bidyut Baran Chaudhuri,et al.  Script line separation from Indian multi-script documents , 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).

[5]  Bidyut Baran Chaudhuri,et al.  Automatic identification of English, Chinese, Arabic, Devnagari and Bangla script line , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[6]  Rangachar Kasturi,et al.  A Robust Algorithm for Text String Separation from Mixed Text/Graphics Images , 1988, IEEE Trans. Pattern Anal. Mach. Intell..