Arrow R-CNN for handwritten diagram recognition

We address the problem of offline handwritten diagram recognition. Recently, it has been shown that diagram symbols can be directly recognized with deep learning object detectors. However, object detectors are not able to recognize the diagram structure. We propose Arrow R-CNN, the first deep learning system for joint symbol and structure recognition in handwritten diagrams. Arrow R-CNN extends the Faster R-CNN object detector with an arrow head and tail keypoint predictor and a diagram-aware postprocessing method. We propose a network architecture and data augmentation methods targeted at small diagram datasets. Our diagram-aware postprocessing method addresses the insufficiencies of standard Faster R-CNN postprocessing. It reconstructs a diagram from a set of symbol detections and arrow keypoints. Arrow R-CNN improves state-of-the-art substantially: on a scanned flowchart dataset, we increase the rate of recognized diagrams from 37.7 to 78.6%.

[1]  K. C. Santosh,et al.  Document Image Analysis , 2018, Springer Singapore.

[2]  Horst Bunke,et al.  The IAM-database: an English sentence database for offline handwriting recognition , 2002, International Journal on Document Analysis and Recognition.

[3]  Michael S. Bernstein,et al.  Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations , 2016, International Journal of Computer Vision.

[4]  Laurent Wendling,et al.  Symbol recognition using spatial relations , 2012, Pattern Recognit. Lett..

[5]  Lei Sun,et al.  An anchor-free region proposal network for Faster R-CNN-based text detection approaches , 2018, International Journal on Document Analysis and Recognition (IJDAR).

[6]  C. V. Jawahar,et al.  HWNet v2: an efficient word image representation for handwritten documents , 2018, International Journal on Document Analysis and Recognition (IJDAR).

[7]  Laurent Wendling,et al.  Integrating vocabulary clustering with spatial relations for symbol recognition , 2013, International Journal on Document Analysis and Recognition (IJDAR).

[8]  Laurent Wendling,et al.  A Simple and Efficient Arrowhead Detection Technique in Biomedical Images , 2016, Int. J. Pattern Recognit. Artif. Intell..

[9]  Václav Hlavác,et al.  Online recognition of sketched arrow-connected diagrams , 2016, International Journal on Document Analysis and Recognition (IJDAR).

[10]  Laurent Wendling,et al.  Overlaid Arrow Detection for Labeling Regions of Interest in Biomedical Images , 2016, IEEE Intelligent Systems.

[11]  Harold Mouchère,et al.  Online flowchart understanding by combining max-margin Markov random field with grammatical analysis , 2017, International Journal on Document Analysis and Recognition (IJDAR).