Hierarchical parsing and recognition of hand-sketched diagrams

A long standing challenge in pen-based computer interaction is the ability to make sense of informal sketches. A main difficulty lies in reliably extracting and recognizing the intended set of visual objects from a continuous stream of pen strokes. Existing pen-based systems either avoid these issues altogether, thus resulting in the equivalent of a drawing program, or rely on algorithms that place unnatural constraints on the way the user draws. As one step toward alleviating these difficulties, we present an integrated sketch parsing and recognition approach designed to enable natural, fluid, sketch-based computer interaction. The techniques presented in this paper are oriented toward the domain of network diagrams. In the first step of our approach, the stream of pen strokes is examined to identify the arrows in the sketch. The identified arrows then anchor a spatial analysis which groups the uninterpreted strokes into distinct clusters, each representing a single object. Finally, a trainable shape recognizer, which is informed by the spatial analysis, is used to find the best interpretations of the clusters. Based on these concepts, we have built SimuSketch, a sketch-based interface for Matlab's Simulink software package. An evaluation of SimuSketch has indicated that even novice users can effectively utilize our system to solve real engineering problems without having to know much about the underlying recognition techniques.

[1]  Joaquim A. Jorge,et al.  CALI: An Online Scribble Recognizer for Calligraphic Interfaces , 2002 .

[2]  Thomas F. Stahovich,et al.  Interpreting Schematic Sketches Using Physical Reasoning , 2002 .

[3]  Takayuki Dan Kimura,et al.  Recognizing multistroke geometric shapes: an experimental evaluation , 1993, UIST '93.

[4]  Nicholas E. Matsakis Recognition of Handwritten Mathematical Expressions , 1999 .

[5]  Dean Rubine,et al.  Specifying gestures by example , 1991, SIGGRAPH.

[6]  Gregory D. Abowd,et al.  OOPS: a toolkit supporting mediation techniques for resolving ambiguity in recognition-based interfaces , 2000, Comput. Graph..

[7]  A. Enis Çetin,et al.  Vision-Based Single-Stroke Character Recognition for Wearable Computing , 2001, IEEE Intell. Syst..

[8]  Stuart Russell,et al.  Statistical Visual Language Models for Ink Parsing , 2002 .

[9]  W. Eric L. Grimson The Combinatorics of Heuristic Search Termination for Object Recognition in Cluttered Environments , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Victor R. Lesser,et al.  The Hearsay-II Speech-Understanding System: Integrating Knowledge to Resolve Uncertainty , 1980, CSUR.

[11]  Gennaro Costagliola,et al.  Visual Language Editors based on LR Parsing Techniques , 2003, IWPT.

[12]  Zhengwei Yang,et al.  Invariant matching and identification of curves using B-splines curve representation , 1995, IEEE Trans. Image Process..

[13]  T. F. Sezgin Genetic and HMM based ap-proaches to freehand sketch recognition , 2003 .

[14]  Harris Drucker,et al.  Comparison of learning algorithms for handwritten digit recognition , 1995 .

[15]  Robert W. Brodersen,et al.  Pen and speech recognition in the user interface for mobile multimedia terminals , 1996 .

[16]  James A. Landay,et al.  Sketching Interfaces: Toward More Human Interface Design , 2001, Computer.

[17]  James A. Landay,et al.  SATIN: a toolkit for informal ink-based applications , 2000, UIST '00.

[18]  Christine Alvarado Dynamically Constructed Bayesian Networks for Sketch Understanding , 2003 .

[19]  Jack D. Tubbs,et al.  A note on binary template matching , 1989, Pattern Recognit..

[20]  Christine Alvarado,et al.  Resolving ambiguities to create a natural computer-based sketching environment , 2007, SIGGRAPH '07.

[21]  Takayuki Dan Kimura,et al.  A Graphic Diagram Editor for Pen Computers , 1994, Softw. Concepts Tools.

[22]  Anil K. Jain,et al.  A modified Hausdorff distance for object matching , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[23]  William Rucklidge,et al.  Efficient Visual Recognition Using the Hausdorff Distance , 1996, Lecture Notes in Computer Science.

[24]  David W. Jacobs,et al.  The Use of Grouping in Visual Object Recognition , 1988 .

[25]  A. Newton,et al.  Sketched symbol recognition using Zernike moments , 2004, ICPR 2004.

[26]  David J. Fleet,et al.  Perceptual Organization as a Foundation for Intelfigent Sketch Editing , 2002 .

[27]  Ethem Alpaydin,et al.  Combining Multiple Representations for Pen-based Handwritten Digit Recognition , 2001 .

[28]  Joaquim A. Jorge,et al.  Using fuzzy logic to recognize geometric shapes interactively , 2000, Ninth IEEE International Conference on Fuzzy Systems. FUZZ- IEEE 2000 (Cat. No.00CH37063).

[29]  L. Kara,et al.  Recognizing Multi-Stroke Symbols , 2002 .

[30]  Christine J. Alvarado,et al.  A natural sketching environment : bringing the computer into early stages of mechanical design , 2000 .

[31]  Tatiana Baidyk,et al.  Improved method of handwritten digit recognition tested on MNIST database , 2004, Image Vis. Comput..

[32]  Mark W. Newman,et al.  DENIM: finding a tighter fit between tools and practice for Web site design , 2000, CHI.

[33]  K. Takahashi,et al.  A Discrete HMM for Online Handwriting Recognition , 2000, Int. J. Pattern Recognit. Artif. Intell..