Recognizing Mathematical Expressions Using Tree Transformation

We describe a robust and efficient system for recognizing typeset and handwritten mathematical notation. Given a list of symbols with bounding boxes, the system analyzes an expression in three successive passes. The Layout Pass constructs a Baseline Structure Tree (BST) describing the two-dimensional arrangement of input symbols. Reading order and operator dominance are used to allow efficient recognition of symbol layout even when symbols deviate greatly from their ideal positions. Next, the Lexical Pass produces a Lexed BST from the initial BST by grouping tokens composed of multiple input symbols; these include decimal numbers, function names, and symbols composed of nonoverlapping primitives such as "=". The Lexical Pass also labels vertical structures such as fractions and accents. The Lexed BST is translated into LaTeX. Additional processing, necessary for producing output for symbolic algebra systems, is carried out in the Expression Analysis Pass, which translates the Lexed BST into an Operator Tree describing the order and scope of operations in the input expression. The tree manipulations used in each pass are represented compactly using tree transformations. The compiler-like architecture of the system allows robust handling of unexpected input, increases the scalability of the system, and provides the groundwork for handling dialects of mathematical notation.
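
The Lexical Pass and the LaTeX translation described above can be illustrated with a minimal sketch. It is not the system's implementation: the `Node` class, the region names `SUPER` and `SUB`, and the functions `lex_baseline` and `to_latex` are hypothetical names chosen for this example, and the sketch assumes the Layout Pass has already produced the baseline tree. It shows only one tree transformation, the grouping of adjacent digit symbols into a decimal number.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One symbol on a baseline (hypothetical structure for illustration).

    `regions` maps an assumed region name such as "SUPER" or "SUB"
    to a nested baseline, i.e. a list of Node objects.
    """
    label: str
    regions: dict = field(default_factory=dict)


def is_numeric(label):
    # Simplification: any non-empty run of digits and "." counts as numeric.
    return label != "" and all(c.isdigit() or c == "." for c in label)


def lex_baseline(nodes):
    """Return a new baseline in which adjacent numeric symbols are merged."""
    out = []
    for node in nodes:
        # Lex nested baselines first so the transformation applies at every depth.
        lexed = Node(node.label,
                     {r: lex_baseline(b) for r, b in node.regions.items()})
        prev = out[-1] if out else None
        if (prev is not None and is_numeric(prev.label)
                and is_numeric(lexed.label) and not prev.regions):
            # The previous token has no attached regions, so extend it.
            out[-1] = Node(prev.label + lexed.label, lexed.regions)
        else:
            out.append(lexed)
    return out


def to_latex(nodes):
    """Translate a lexed baseline into a LaTeX string (subset only)."""
    parts = []
    for n in nodes:
        s = n.label
        if "SUB" in n.regions:
            s += "_{" + to_latex(n.regions["SUB"]) + "}"
        if "SUPER" in n.regions:
            s += "^{" + to_latex(n.regions["SUPER"]) + "}"
        parts.append(s)
    return " ".join(parts)


# Input symbols for "x squared plus 3.14", one Node per recognized symbol.
expr = [Node("x", {"SUPER": [Node("2")]}), Node("+"),
        Node("3"), Node("."), Node("1"), Node("4")]
lexed = lex_baseline(expr)
print(to_latex(lexed))  # prints: x^{2} + 3.14
```

The transformation returns a new tree rather than mutating its input, so each pass can be checked in isolation, in the spirit of the compiler-like architecture the abstract describes.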
