Camera-Based Analysis of Whiteboard Notes

A domain where, even in the era of electronic document processing, handwriting is still widely used is note-taking on a whiteboard. Such documents are either captured by a pen-tracking device or – which is much more challenging – by a camera. In both cases the layout analysis of realistic whiteboard notes is an open research problem. In this paper we propose a camera-based three-stage approach for the automatic analysis of whiteboard documents. Assuming a reasonable foreground-background separation of the handwriting it starts with a locally adaptive binarization followed by connected component extraction. These are then automatically classified as representing either simple graphical elements of a mindmap or elementary text patches. In the final stage the text patches are subject to a clustering procedure in order to generate hypotheses for those image regions where textual annotations of the mindmap can be found. In order to demonstrate the effectiveness of the proposed approach we report results of an experimental evaluation on a data set of mindmap images created by several different writers without any constraints on writing or drawing style.

[1]  Stavros J. Perantonis,et al.  Automatic page analysis for the creation of a digital library from newspaper archives , 2000, International Journal on Digital Libraries.

[2]  Matti Pietikäinen,et al.  Adaptive document binarization , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[3]  Bidyut Baran Chaudhuri,et al.  On machine understanding of online handwritten mathematical expressions , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[4]  Bidyut Baran Chaudhuri,et al.  A system for Indian postal automation , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[5]  Hiromichi Fujisawa,et al.  Forty years of research in character and document recognition - an industrial perspective , 2008, Pattern Recognit..

[6]  S.M. Lucas,et al.  ICDAR 2005 text locating competition results , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[7]  Thomas M. Breuel,et al.  Example-Based Logical Labeling of Document Title Page Images , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[8]  Christine Reid,et al.  The Myth of the Paperless Office , 2003, J. Documentation.

[9]  Raúl Rojas,et al.  Recognition of on-line handwritten mathematical expressions in the E-Chalk system - an extension , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[10]  Bart Lamiroy,et al.  Pattern Recognition Methods for Querying and Browsing Technical Documentation , 2008, CIARP.

[11]  Masakazu Suzuki,et al.  Detection and segmentation of touching characters in mathematical expressions , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[12]  Chew Lim Tan,et al.  A system for understanding imaged infographics and its applications , 2007, DocEng '07.

[13]  Gernot A. Fink,et al.  Camera-based Whiteboard Reading: New Approaches to a Challenging Task , 2008, ICFHR 2008.

[14]  Marcus Liwicki,et al.  Handwriting Recognition of Whiteboard Notes , 2005 .

[15]  T. Shinogi,et al.  Keywords Recognition of Handwritten Character String on Whiteboard Using Word Dictionary for e-Learning , 2006, 2006 International Conference on Hybrid Information Technology.

[16]  B. Kapralos,et al.  I An Introduction to Digital Image Processing , 2022 .

[17]  Thomas Kieninger,et al.  Document Structure Analysis Based on Layout and Textual Features , 2000 .

[18]  Chew Lim Tan,et al.  A multi-level component grouping algorithm and its applications , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).