Eager interpretation of on-line hand-drawn structured documents: The DALI methodology

On-line hand-drawn structured document interpretation is a complex problem of pattern recognition. This paper deals with eager strategies for such purpose, i.e. consisting in updating the analyzed document after each input stroke and providing a corresponding feedback to the user. We have designed a new class of visual grammars for the modeling of structured document composition: the context-driven constraint multiset grammars (CD-CMG). Their main originalities are to model the structural context in which a production can be reduced and to take into account the hand-drawn nature of the considered data. Its associated parser exploits the formalized knowledge for predictive purposes and couples bottom-up and top-down strategies. Their context-sensitiveness helps reducing significantly the combinatory associated to the analysis process. We use the fuzzy set framework to evaluate each possible interpretation on a qualitative way. Reject options are exploited to increase the decision making robustness and to detect the need for stroke segmentation. The parser is also able to wait for more information before making a decision by using a branch and bound algorithm. In this paper, we provide experimental results showing that the method is efficient enough to be used in real-time applications. We illustrate this point by focusing on a commercialized pen-based system that is based on the DALI method we present in this paper.

[1]  Andrew S. Forsberg,et al.  The music notepad , 1998, UIST '98.

[2]  Lotfi A. Zadeh,et al.  Soft computing and fuzzy logic , 1994, IEEE Software.

[3]  Sargur N. Srihari,et al.  On-Line and Off-Line Handwriting Recognition: A Comprehensive Survey , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Martin Odersky,et al.  Building visual language parsers , 1991, CHI '91.

[5]  Michele Risi,et al.  Sketch Grammars: a formalism for describing and recognizing diagrammatic sketch languages , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[6]  Tracy Anne Hammond,et al.  LADDER: a language to describe drawing, display, and editing in sketch recognition , 2003, IJCAI.

[7]  Levent Burak Kara,et al.  Hierarchical parsing and recognition of hand-sketched diagrams , 2004, UIST '04.

[8]  James A. Landay,et al.  Sketching Interfaces: Toward More Human Interface Design , 2001, Computer.

[9]  M. Cheriet,et al.  Online recognition of sketched electrical diagrams , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[10]  Alexander Hall,et al.  A combinatorial approach to multi-domain sketch recognition , 2007, SBIM '07.

[11]  Harold Mouchère,et al.  A Unified Strategy to Deal with Different Natures of Reject , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[12]  Christian Heide Damm,et al.  Tool support for cooperative object-oriented design: gesture based modelling on an electronic whiteboard , 2000, CHI.

[13]  Jiro Tanaka,et al.  VIC: CMG Input System Using Example Figures , 1999 .

[14]  L. Kara,et al.  Recognizing Multi-Stroke Symbols , 2002 .

[15]  Sébastien Macé,et al.  Fuzzy Relative Positioning for On-Line Handwritten Stroke Analysis , 2006 .

[16]  Kim Marriott Constraint multiset grammars , 1994, Proceedings of 1994 IEEE Symposium on Visual Languages.

[17]  Levent Burak Kara,et al.  Combining geometry and domain knowledge to interpret hand-drawn diagrams , 2005, Comput. Graph..

[18]  Bart Lamiroy,et al.  Automatic Adjacency Grammar Generation from User Drawn Sketches , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[19]  Thomas F. Stahovich,et al.  Interpreting Schematic Sketches Using Physical Reasoning , 2002 .

[20]  Joaquim A. Jorge,et al.  CALI: An Online Scribble Recognizer for Calligraphic Interfaces , 2002 .

[21]  Éric Anquetil,et al.  Integration of an on-line handwriting recognition system in a smart phone device , 2002, Object recognition supported by user interaction for service robots.

[22]  Stuart Russell,et al.  Statistical Visual Language Models for Ink Parsing , 2002 .

[23]  Sébastien Macé,et al.  Chapter XII Pen-Based Interaction for Intuitive Music Composition and Editing , 2008 .

[24]  Bin Cui,et al.  Intelligent Music Information Systems: Tools and Methodologies , 2007 .

[25]  Josep Lladós,et al.  An Adjacency Grammar to Recognize Symbols and Gestures in a Digital Pen Framework , 2005, IbPRIA.

[26]  Minoru Maruyama,et al.  An online handwritten music symbol recognition system , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[27]  Masaki Nakagawa,et al.  Lazy recognition as a principle of pen interfaces , 1993, INTERCHI Adjunct Proceedings.

[28]  J. Anstice,et al.  The design of a pen-based musical input system , 1996, Proceedings Sixth Australian Conference on Computer-Human Interaction.

[29]  Mark D. Gross,et al.  The Electronic Cocktail Napkin—a computational environment for working with design diagrams , 1996 .

[30]  Isabelle Bloch,et al.  Fuzzy Relative Position Between Objects in Image Processing: A Morphological Approach , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[31]  Michio Sugeno,et al.  Fuzzy identification of systems and its applications to modeling and control , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[32]  Ephraim P. Glinert,et al.  Online parsing of visual languages using adjacency grammars , 1995, Proceedings of Symposium on Visual Languages.

[33]  Kim Marriott,et al.  Automatic generation of intelligent diagram editors , 2003, TCHI.

[34]  Kim Marriott,et al.  A survey of visual language specification and recognition , 1998 .

[35]  Bernd Meyer,et al.  Theory of Visual Languages , 1998 .

[36]  Takayuki Dan Kimura,et al.  A Graphic Diagram Editor for Pen Computers , 1994, Softw. Concepts Tools.