A hierarchical and recursive model of mathematical expressions for automatic reading of mathematical documents

We propose an original model for the automatic reading of mathematical texts, founded on a recursive approach and a hierarchical view of the text. This hierarchy is introduced through the notion of a mathematical object, which we characterise by a set of definitions. A prior segmentation phase locates the mathematical objects in the text, using character matching techniques and propagating the labelling of mathematical objects around the "special characters" recognised initially. Complex, dynamic tree representations are proposed for the mathematical expressions detected in the text, and a dynamic set of priority rules guides this automatic process. Finally, recognising the elementary objects obtained makes it possible to encode a mathematical expression.
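
The idea of a recursive tree representation guided by priority rules can be illustrated with a minimal sketch. It is not the model described above: it assumes the expression has already been segmented into a flat list of symbols, and the `PRIORITY` table, the `Node` class, and the `build_tree` and `encode` functions are hypothetical names introduced only for illustration. The sketch splits the symbol list at its loosest-binding operator, recurses on both sides, and then encodes the resulting tree as a parenthesised string.

```python
from dataclasses import dataclass, field

# Hypothetical priority table (an assumption, not taken from the paper):
# a lower value means the operator binds more loosely and becomes a root earlier.
PRIORITY = {"=": 0, "+": 1, "-": 1, "*": 2, "/": 2}


@dataclass
class Node:
    """One node of the expression tree: an operator or an elementary symbol."""
    label: str
    children: list = field(default_factory=list)


def build_tree(tokens):
    """Recursively split a flat symbol list at its loosest-binding operator."""
    if len(tokens) == 1:
        return Node(tokens[0])
    best = None
    for i, tok in enumerate(tokens):
        # Only consider binary operators that have an operand on each side.
        if tok in PRIORITY and 0 < i < len(tokens) - 1:
            # "<=" keeps the rightmost candidate, which gives left associativity.
            if best is None or PRIORITY[tok] <= PRIORITY[tokens[best]]:
                best = i
    if best is None:
        raise ValueError(f"no operator found in {tokens!r}")
    left = build_tree(tokens[:best])
    right = build_tree(tokens[best + 1:])
    return Node(tokens[best], [left, right])


def encode(node):
    """Encode the tree as a fully parenthesised string."""
    if not node.children:
        return node.label
    left, right = (encode(child) for child in node.children)
    return f"({left} {node.label} {right})"


tree = build_tree(["y", "=", "a", "+", "b", "*", "c"])
print(encode(tree))
```

In this sketch the priority table is fixed, whereas the abstract speaks of a dynamic set of rules; a closer approximation would allow the table to change as objects are labelled.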
