Mathematical Variable Detection based on Convolutional Neural Network and Support Vector Machine

Mathematical expression detection in scientific documents is a prerequisite step for developing a mathematical retrieval system that has attracted many researches recently. In the detecting process, a challenging issue is the detection of variable. The similar properties of variable and narrative text cause many errors in the detection in existing approaches. In the paper, pre-trained deep Convolutional Neural Networks (CNN) are employed and optimized to automatically extract visual features of images and Support Vector Machine (SVM) is used to improve the accuracy of the detection. The accuracy of 99.5% is achieved for the detection of variable in inline expressions in document images in public benchmark datasets. The performance comparison with traditional methods demonstrates the effectiveness of the proposed method.

[1]  Utpal Garain,et al.  Identification of Mathematical Expressions in Document Images , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[2]  Hsi-Jian Lee,et al.  Design of a mathematical expression understanding system , 1997, Pattern Recognit. Lett..

[3]  Bui Hai Phong,et al.  A new method for displayed mathematical expression detection based on FFT and SVM , 2017, 2017 4th NAFOSTED Conference on Information and Computer Science.

[4]  Chew Lim Tan,et al.  Italic font recognition using stroke pattern analysis on wavelet decomposed word images , 2004, ICPR 2004.

[5]  Yue Lu,et al.  Italic font recognition using stroke pattern analysis on wavelet decomposed word images , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[6]  Changming Sun,et al.  Skew and slant correction for document images using gradient direction , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[7]  Marcus Liwicki,et al.  Deepdocclassifier: Document classification with deep Convolutional Neural Network , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[8]  Volker Sorge,et al.  A Text Line Detection Method for Mathematical Formula Recognition , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[9]  Wei-Ta Chu,et al.  Mathematical Formula Detection in Heterogeneous Document Images , 2013, 2013 Conference on Technologies and Applications of Artificial Intelligence.

[10]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[11]  Han Hu,et al.  Context-aware mathematical expression recognition: An end-to-end framework and a benchmark , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[12]  Anil K. Jain,et al.  Document Structure and Layout Analysis , 2007 .

[13]  Zhi Tang,et al.  Performance Evaluation of Mathematical Formula Identification , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.

[14]  Ian T. Jolliffe,et al.  Principal Component Analysis , 2002, International Encyclopedia of Statistical Science.

[15]  Nikos Fakotakis,et al.  Handwritten character recognition based on structural characteristics , 2002, Object recognition supported by user interaction for service robots.