Detection and segmentation of touching characters in mathematical expressions

A technique for the detection and the segmentation of touching characters in mathematical expressions is presented. In the detection stage, a connected component initially recognized into some category is judged as a candidate of touched characters if its feature values deviate from the standard feature values of the category. In the segmentation stage, two component characters of the candidate are decided by the comparison with touching character images synthesized from two single character images. Experimental results showed the effectiveness on the accuracy improvement of the recognition of mathematical expressions.

[1]  Gerhard O. Michler,et al.  Report on the retrodigitization project “Archiv der Mathematik” , 2001 .

[2]  Venu Govindaraju,et al.  Holistic recognition of handwritten character pairs , 2000, Pattern Recognit..

[3]  M. Suzuki,et al.  Automatic reference linking in distributed digital libraries , 2003, 2003 Conference on Computer Vision and Pattern Recognition Workshop.

[4]  Masayuki Okamoto,et al.  Segmentation of Touching Characters in Formulas , 1998, Document Analysis Systems.

[5]  Hsi-Jian Lee,et al.  Understanding mathematical expressions in a printed document , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[6]  Eric Lecolinet,et al.  A Survey of Methods and Strategies in Character Segmentation , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Proceedings Seventh International Conference on Document Analysis and Recognition , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[8]  Dit-Yan Yeung,et al.  Mathematical expression recognition: a survey , 2000, International Journal on Document Analysis and Recognition.

[9]  Yi Lu,et al.  Machine printed character segmentation --; An overview , 1995, Pattern Recognit..