Predicting Structure in Handwritten Algebra Data From Low Level Features

Background A goal in computer vision and natural language processing is to predict high level structure over handwritten algebra data. This is typically done by clustering or otherwise identifying symbols, then applying a known or learned grammar to generate dependency trees. However, these methods are often not generic and require domain knowledge. Aim The goal of this paper is to determine whether it is practical to predict dependency trees from low level features of handwritten algebra data without the use of symbol identification, and to determine which algorithms are best suited for this task. Data Our data consists of 400 pages of handwritten algebra data collected by one of our researchers. Methods We use transition-based parsing to generate dependency trees, and various no-regret online imitation learning algorithms to predict transitions. Results We find that a simple algorithm called greedy transition-based parsing, which has regretful learning and greedy decoding, gives similar or better results compared to the no-regret algorithms. This is surprising, as theory suggests noregret algorithms should outperform regretful algorithms. We discuss possible explanations for this discrepancy. Conclusions We conclude that it is practical to predict dependency trees over handwritten algebra data without the use of symbol identification.

[1]  Jean Paul Haton,et al.  A Syntactic Approach for Handwritten Mathematical Formula Recognition , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Tomoichi Takahashi,et al.  A study of symbol segmentation method for handwritten mathematical formula recognition using mathematical structure information , 2004, ICPR 2004.

[3]  Yann LeCun,et al.  The mnist database of handwritten digits , 2005 .

[4]  Ryo Yamamoto,et al.  Stroke-Based Stochastic Context-Free Grammar for On-line Handwritten Mathematical Expression Recognition , 2006 .

[5]  Frank K. Soong,et al.  A Unified Framework for Symbol Segmentation and Recognition of Handwritten Mathematical Expressions , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[6]  Frank K. Soong,et al.  Symbol graph based discriminative training and rescoring for improved math symbol recognition , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[7]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[8]  Harold Mouchère,et al.  Towards Handwritten Mathematical Expression Recognition , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[9]  Geoffrey J. Gordon,et al.  No-Regret Reductions for Imitation Learning and Structured Prediction , 2010, ArXiv.

[10]  J. Andrew Bagnell,et al.  Efficient Reductions for Imitation Learning , 2010, AISTATS.

[11]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[12]  Jin Hyung Kim,et al.  CROHME2011: Competition on Recognition of Online Handwritten Mathematical Expressions , 2011, 2011 International Conference on Document Analysis and Recognition.

[13]  Byron Boots,et al.  Hilbert Space Embeddings of Predictive State Representations , 2013, UAI.

[14]  Catherine Lu Recognition of Online Handwritten Mathematical Expressions , 2013 .

[15]  Joakim Nivre,et al.  Training Deterministic Parsers with Non-Deterministic Oracles , 2013, TACL.

[16]  Joan-Andreu Sánchez,et al.  Recognition of on-line handwritten mathematical expressions using 2D stochastic context-free grammars and hidden Markov models , 2014, Pattern Recognit. Lett..

[17]  Regina Barzilay,et al.  Greed is Good if Randomized: New Inference for Dependency Parsing , 2014, EMNLP.

[18]  Harold Mouchère,et al.  A global learning approach for an online handwritten mathematical expression recognition system , 2014, Pattern Recognit. Lett..