First experiments on a new online handwritten flowchart database

This paper presents a new online handwritten flowchart database and reports first experiments that establish a baseline benchmark on it. The collected database consists of 419 flowcharts labeled at the stroke and symbol levels. In addition, an isolated database of graphical and text symbols was extracted from the collected flowcharts. We then tackle online handwritten flowchart recognition from two points of view. First, assuming that the flowcharts are correctly segmented, we propose different classifiers for two tasks: text/non-text separation and graphical symbol recognition. Tested on the extracted isolated test database, these classifiers achieve up to 90% and 98% in text/non-text separation and up to 93.5% in graphical symbol recognition. Second, we propose a global approach that performs flowchart segmentation and recognition. For this approach, we adopt a global learning scheme and a recognition architecture that carries out segmentation and recognition simultaneously. The global architecture is trained and tested directly on flowcharts. The results show the value of such a global approach, but given the complexity of the flowchart segmentation problem, there is still a lot of room to improve the global learning and recognition methods.
