Automatic Labeling of Handwritten Mathematical Symbols via Expression Matching

Mathematical expression recognition is one of the challenging problems in the field of handwritten recognition. Public datasets are often used to evaluate and compare different computer solutions for recognition problems in several domains of applications. However, existing public datasets for handwritten mathematical expressions and symbols are still scarce both in number and in variety. Such scarcity makes large scale assessment of the existing techniques a difficult task. This paper proposes a novel approach, based on expression matching, for generating ground-truthed exemplars of expressions (and, therefore, of symbols). Matching is formulated as a graph matching problem in which symbols of input instances of a manually labeled model expression are matched to the symbols in the model. Pairwise matching cost considers both local and global features of the expression. Experimental results show achievement of high accuracy for several types of expressions, written by different users.

[1]  Masakazu Suzuki,et al.  A ground-truthed mathematical character and symbol image database , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[2]  Salvatore Tabbone,et al.  Attributed Graph Matching Using Local Descriptions , 2009, ACIVS.

[3]  Harold Mouchère,et al.  Towards Handwritten Mathematical Expression Recognition , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[4]  H. Kuhn The Hungarian method for the assignment problem , 1955 .

[5]  Kaspar Riesen,et al.  Approximate graph edit distance computation by means of bipartite graph matching , 2009, Image Vis. Comput..

[6]  Dit-Yan Yeung,et al.  Mathematical expression recognition: a survey , 2000, International Journal on Document Analysis and Recognition.

[7]  George Labahn,et al.  Grammar-based techniques for creating ground-truthed sketch corpora , 2010, International Journal on Document Analysis and Recognition (IJDAR).

[8]  Joseph J. LaViola,et al.  A Practical Approach for Writer-Dependent Symbol Recognition Using a Writer-Independent Symbol Recognizer , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Horst Bunke,et al.  Handbook of Character Recognition and Document Image Analysis , 1997 .

[10]  Dorothea Blostein,et al.  RECOGNITION OF MATHEMATICAL NOTATION , 1997 .

[11]  Bidyut Baran Chaudhuri,et al.  Recognition of online handwritten mathematical expressions , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[12]  Raúl Rojas,et al.  Recognition of On-line Handwritten Mathematical Expressions Using a Minimum Spanning Tree Construction and Symbol Dominance , 2003, GREC.

[13]  Roberto Marcondes Cesar Junior,et al.  Structural matching of 2D electrophoresis gels using deformed graphs , 2011, Pattern Recognit. Lett..

[14]  Jitendra Malik,et al.  Shape matching and object recognition using shape contexts , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[15]  K. Bennell,et al.  Recent advances and perspectives , 2012 .