A new approach for recognizing handwritten mathematics using relational grammars and fuzzy sets

We present a new approach for parsing two-dimensional input using relational grammars and fuzzy sets. A fast, incremental parsing algorithm is developed, motivated by the two-dimensional structure of written mathematics. The approach reports all identifiable parses of the input. The parses are represented as a fuzzy set, in which the membership grade of a parse measures the similarity between it and the handwritten input. To identify and report parses efficiently, we adapt and apply existing techniques such as rectangular partitions and shared parse forests, and introduce new ideas such as relational classes and interchangeability. We also present a correction mechanism that allows users to navigate parse results and choose the correct interpretation in case of recognition errors or ambiguity. Such corrections are incorporated into subsequent incremental recognition results. Finally, we include two empirical evaluations of our recognizer. One uses a novel user-oriented correction count metric, while the other replicates the CROHME 2011 math recognition contest. Both evaluations demonstrate the effectiveness of our proposed approach.
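As a rough illustration of the fuzzy-set representation described above, the following sketch ranks two candidate parses of an ambiguous input by membership grade. It is not the recognizer's actual implementation: the `Parse` class, the `combine` and `rank` helpers, the numeric grades, and the use of `min` as the fuzzy conjunction are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Parse:
    """One candidate interpretation of the handwritten input (hypothetical type)."""
    expression: str  # string rendering of the parse, for display only
    grade: float     # membership grade in [0, 1]; higher means closer to the input


def combine(*grades: float) -> float:
    """Combine component grades with min, the standard Zadeh fuzzy conjunction.

    This choice is an assumption for the sketch, not taken from the paper.
    """
    return min(grades)


def rank(parses):
    """Return all parses ordered from highest to lowest membership grade."""
    return sorted(parses, key=lambda p: p.grade, reverse=True)


# Made-up grades for an input that could be read as "x^2" or as "x2".
symbol_x = 0.9          # confidence that the first symbol is "x"
symbol_2 = 0.8          # confidence that the second symbol is "2"
superscript_rel = 0.7   # confidence that "2" is a superscript of "x"
inline_rel = 0.4        # confidence that "2" sits on the same baseline as "x"

candidates = [
    Parse("x2", combine(symbol_x, symbol_2, inline_rel)),
    Parse("x^2", combine(symbol_x, symbol_2, superscript_rel)),
]

# Every identifiable parse is kept; the ordering only reflects the grades.
ranked = rank(candidates)
for parse in ranked:
    print(f"{parse.expression}: {parse.grade}")
```

Keeping every candidate rather than only the best one is what would let a user step through alternatives and pick the intended interpretation when the top-ranked parse is wrong.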
