Camera-Based Whiteboard Reading for Understanding Mind Maps

Mind maps, i.e. the spatial organization of ideas and concepts around a central topic and the visualization of their relations, represent a very powerful and thus popular means to support creative thinking and problem solving processes. Typically created on traditional whiteboards, they represent an important technique for collaborative brainstorming sessions. We describe a camera-based system to analyze hand-drawn mind maps written on a whiteboard. The goal of the presented system is to produce digital representations of such mind maps, which would enable digital asset management, i.e. storage and retrieval of manually created documents. Our system is based on image acquisition by means of a camera, followed by the segmentation of the particular whiteboard image focusing on the extraction of written context, i.e. the ideas captured by the mind map. The spatial arrangement of these ideas is recovered using layout analysis based on unsupervised clustering, which results in graph representations of mind maps. Finally, handwriting recognition derives textual transcripts of the ideas captured by the mind map. We demonstrate the capabilities of our mind map reading system by means of an experimental evaluation, where we analyze images of mind maps that have been drawn on whiteboards, without any further constraints other than the underlying topic. In addition to the promising recognition results, we also discuss training strategies, which effectively allow for system bootstrapping using out-of-domain sample data. The latter is important when addressing creative thinking processes where domain-related training data are difficult to obtain as they focus on novelty by definition.

[1]  Gernot A. Fink,et al.  A Method for Camera-Based Interactive Whiteboard Reading , 2011, CBDAR.

[2]  Rong Huang,et al.  On the Possibility of Structure Learning-Based Scene Character Detector , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[3]  Palaiahnakote Shivakumara,et al.  Text detection in natural scenes using Gradient Vector Flow-Guided symmetry , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[4]  Marcus Liwicki,et al.  IAM-OnDB - an on-line English sentence database acquired from handwritten text on a whiteboard , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[5]  Umapada Pal,et al.  Seal Detection and Recognition: An Approach for Document Indexing , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[6]  Wayne Niblack,et al.  An introduction to digital image processing , 1986 .

[7]  B Fritzke,et al.  A growing neural gas network learns topologies. G. Tesauro, DS Touretzky, and TK Leen, editors , 1995, NIPS 1995.

[8]  Marcus Liwicki,et al.  Touch & Write: a multi-touch table with pen-input , 2010, DAS '10.

[9]  Eric Saund Bringing the Marks on a Whiteboard to Electronic Life , 1999, CoBuild.

[10]  G. A. Fink,et al.  Developing pattern recognition systems based on Markov models: The ESMERALDA framework , 2008, Pattern Recognition and Image Analysis.

[11]  Quentin Stafford-Fraser,et al.  BrightBoard: a video-augmented environment , 1996, CHI '96.

[12]  Anil K. Jain,et al.  Document Structure and Layout Analysis , 2007 .

[13]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[14]  Gernot A. Fink,et al.  Layout Analysis for Camera-Based Whiteboard Notes , 2009, J. Univers. Comput. Sci..

[15]  Sekhar Mandal,et al.  Segmentation of Text and Graphics from Document Images , 2007 .

[16]  Horst Bunke,et al.  Recognition of cursive Roman handwriting: past, present and future , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[17]  Thomas M. Breuel,et al.  Example-Based Logical Labeling of Document Title Page Images , 2007 .

[18]  Gernot A. Fink,et al.  On appearance-based feature extraction methods for writer-independent handwritten text recognition , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[19]  Chunheng Wang,et al.  Adaptive Scene Text Detection Based on Transferring Adaboost , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[20]  Jorma Laaksonen,et al.  SOM_PAK: The Self-Organizing Map Program Package , 1996 .

[21]  Horst Bunke,et al.  The IAM-database: an English sentence database for offline handwriting recognition , 2002, International Journal on Document Analysis and Recognition.

[22]  David S. Doermann,et al.  Camera-based analysis of text and documents: a survey , 2005, International Journal of Document Analysis and Recognition (IJDAR).

[23]  Horst Bunke,et al.  Handwritten sentence recognition , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[24]  Gernot A. Fink,et al.  Camera-based Whiteboard Reading: New Approaches to a Challenging Task , 2008, ICFHR 2008.

[25]  Hiromichi Fujisawa,et al.  Forty years of research in character and document recognition - an industrial perspective , 2008, Pattern Recognit..

[26]  Jiri Matas,et al.  Robust wide-baseline stereo from maximally stable extremal regions , 2004, Image Vis. Comput..

[27]  Bart Lamiroy,et al.  Text/Graphics Separation Revisited , 2002, Document Analysis Systems.

[28]  Xiangjian He,et al.  An Algorithm for Colour-Based Natural Scene Text Segmentation , 2011, CBDAR.

[29]  Gernot A. Fink,et al.  Toward automatic video-based whiteboard reading , 2004, International Journal of Document Analysis and Recognition (IJDAR).

[30]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[31]  Hans-Peter Kriegel,et al.  Density-Based Clustering in Spatial Databases: The Algorithm GDBSCAN and Its Applications , 1998, Data Mining and Knowledge Discovery.

[32]  Gernot A. Fink,et al.  Markov models for offline handwriting recognition: a survey , 2009, International Journal on Document Analysis and Recognition (IJDAR).

[33]  Gernot A. Fink,et al.  Video-based on-line handwriting recognition , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[34]  Gernot A. Fink,et al.  Markov Models for Handwriting Recognition , 2011, Springer Briefs in Computer Science.

[35]  David S. Doermann,et al.  Scene Text Detection via Integrated Discrimination of Component Appearance and Consensus , 2013, CBDAR.

[36]  Subhadip Basu,et al.  Text/Graphics Separation for Business Card Images for Mobile Devices , 2010, ArXiv.

[37]  Hiromichi Fujisawa Robustness Design of Industrial Strength Recognition Systems , 2007 .

[38]  Bart Lamiroy,et al.  Pattern Recognition Methods for Querying and Browsing Technical Documentation , 2008, CIARP.

[39]  S.M. Lucas,et al.  ICDAR 2005 text locating competition results , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[40]  Bernd Fritzke,et al.  A Growing Neural Gas Network Learns Topologies , 1994, NIPS.

[41]  Marcus Liwicki,et al.  Handwriting Recognition of Whiteboard Notes , 2005 .

[42]  Bidyut Baran Chaudhuri,et al.  Automation of Indian Postal Documents Written in Bangla and English , 2009, Int. J. Pattern Recognit. Artif. Intell..

[43]  Gernot A. Fink,et al.  Camera-Based Analysis of Whiteboard Notes , 2009 .

[44]  Sargur N. Srihari,et al.  On-Line and Off-Line Handwriting Recognition: A Comprehensive Survey , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[45]  Gyeonghwan Kim,et al.  Chaincode Contour Processing for Handwritten Word Recognition , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[46]  Dimosthenis Karatzas,et al.  Multi-script Text Extraction from Natural Scenes , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[47]  Matti Pietikäinen,et al.  Adaptive document binarization , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[48]  Marcus Liwicki,et al.  Combining On-Line and Off-Line Systems for Handwriting Recognition , 2007 .

[49]  Yaokai Feng,et al.  A Hierarchical Visual Saliency Model for Character Detection in Natural Scenes , 2013, CBDAR.

[50]  Rangachar Kasturi,et al.  A Robust Algorithm for Text String Separation from Mixed Text/Graphics Images , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[51]  Gernot A. Fink,et al.  Toward automatic video-based whiteboard reading , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[52]  Giovanni Soda,et al.  Self-Organizing Maps for Clustering in Document Image Analysis , 2008, Machine Learning in Document Analysis and Recognition.