Evaluating structural pattern recognition for handwritten math via primitive label graphs

Currently, structural pattern recognizer evaluations compare graphs of detected structure to target structures (i.e. ground truth) using recognition rates, recall and precision for object segmentation, classification and relationships. In document recognition, these target objects (e.g. symbols) are frequently comprised of multiple primitives (e.g. connected components, or strokes for online handwritten data), but current metrics do not characterize errors at the primitive level, from which object-level structure is obtained. Primitive label graphs are directed graphs defined over primitives and primitive pairs. We define new metrics obtained by Hamming distances over label graphs, which allow classification, segmentation and parsing errors to be characterized separately, or using a single measure. Recall and precision for detected objects may also be computed directly from label graphs. We illustrate the new metrics by comparing a new primitive-level evaluation to the symbol-level evaluation performed for the CROHME 2012 handwritten math recognition competition. A Python-based set of utilities for evaluating, visualizing and translating label graphs is publicly available.

[1]  Utpal Garain,et al.  EMERS: a tree matching–based performance evaluation of mathematical expression recognition systems , 2010, International Journal on Document Analysis and Recognition (IJDAR).

[2]  Stephen V. Rice,et al.  An Algorithm for Matching OCR-Generated Text Strings , 1994, Int. J. Pattern Recognit. Artif. Intell..

[3]  Dit-Yan Yeung,et al.  Error detection, error correction and performance evaluation in on-line mathematical expression recognition , 2001, Pattern Recognit..

[4]  Jin Hyung Kim,et al.  ICFHR 2012 Competition on Recognition of On-Line Mathematical Expressions (CROHME 2012) , 2012, 2012 International Conference on Frontiers in Handwriting Recognition.

[5]  Richard Zanibbi,et al.  Recognition and retrieval of mathematical expressions , 2011, International Journal on Document Analysis and Recognition (IJDAR).

[6]  Ihsin T. Phillips,et al.  Empirical Performance Evaluation of Graphics Recognition Systems , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Robert M. Haralick,et al.  Document structure analysis and performance evaluation , 1999 .

[8]  Harold Mouchère,et al.  The Problem of Handwritten Mathematical Expression Recognition Evaluation , 2010, 2010 12th International Conference on Frontiers in Handwriting Recognition.

[9]  Bidyut Baran Chaudhuri,et al.  Recognition of online handwritten mathematical expressions , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[10]  Daniel P. Lopresti,et al.  Evaluating the performance of table processing algorithms , 2002, International Journal on Document Analysis and Recognition.

[11]  Ana Costa e Silva,et al.  Metrics for evaluating performance in document analysis: application to tables , 2011, International Journal on Document Analysis and Recognition (IJDAR).

[12]  Harold Mouchère,et al.  Stroke-Based Performance Metrics for Handwritten Mathematical Expressions , 2011, 2011 International Conference on Document Analysis and Recognition.

[13]  Richard Zanibbi,et al.  Layout-based substitution tree indexing and retrieval for mathematical expressions , 2012, Electronic Imaging.

[14]  Dorothea Blostein,et al.  Issues in Performance Evaluation: A Case Study of Math Recognition , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[15]  J. Cordy,et al.  A Survey of Table Recognition : Models , Observations , Transformations , and Inferences , 2003 .

[16]  Jin Hyung Kim,et al.  CROHME2011: Competition on Recognition of Online Handwritten Mathematical Expressions , 2011, 2011 International Conference on Document Analysis and Recognition.