Data Embedding into Characters

SUMMARY This paper reviews several trials of re-designing conventional communication medium, i.e., characters, for enriching their functions by using data-embedding techniques. For example, characters are redesigned to have better machine-readability even under various geometric distortions by embedding a geometric invariant into each character image to represent class label of the character. Another example is to embed various information into handwriting trajectory by using a new pen device, called a data-embedding pen. An experimental result showed that we can embed 32-bit information into a handwritten line of 5 cm length by using the pen device. In addition to those applications, we also discuss the relationship between data-embedding and pattern recognition in a theoretical point of view. Several theories tell that if we have appropriate supplementary information by data-embedding, we can enhance pattern recognition

[1]  Ching Y. Suen,et al.  Historical review of OCR research and development , 1992, Proc. IEEE.

[2]  Azriel Rosenfeld,et al.  Recovery of temporal information from static images of handwriting , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[3]  George Nagy,et al.  Optical character recognition: an illustrated guide to the frontier , 1999, Electronic Imaging.

[4]  Makoto Yasuhara,et al.  Recovery of Drawing Order from Single-Stroke Handwriting Images , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  David L. Hecht,et al.  Printed Embedded Data Graphical User Interfaces , 2001, Computer.

[6]  Kimberly Moravec A Grayscale Reader for Camera Images of Xerox DataGlyphs , 2002, BMVC.

[7]  Akihiro Okamoto,et al.  A digital watermark technique using morphological signal processing , 2003 .

[8]  David S. Doermann,et al.  Camera-based analysis of text and documents: a survey , 2005, International Journal of Document Analysis and Recognition (IJDAR).

[9]  Shinichiro Omachi,et al.  Recognition with Supplementary Information —How Many Bits Are Lacking for 100% Recognition?— , 2005 .

[10]  Shinichiro Omachi,et al.  Data Embedding for Camera-Based Character Recognition , 2005 .

[11]  Ben M. Herbst,et al.  Estimating the pen trajectories of static signatures using hidden Markov models , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Shinichiro Omachi,et al.  OCR Fonts Revisited for Camera-Based Character Recognition , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[13]  Shinichiro Omachi,et al.  Affine Invariant Information Embedment for Accurate Camera-Based Character Recognition , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[14]  Shinichiro Omachi,et al.  Pattern Recognition with Supplementary Information , 2007 .

[15]  Megumi Sakai,et al.  Extraction of Embedded Class Information from Universal Character Pattern , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[16]  Masakazu IWAMURA,et al.  A General Assignment of Supplementary Information , 2010 .

[17]  Marcus Liwicki,et al.  Embedding Meta-Information in Handwriting -- Reed-Solomon for Reliable Error Correction , 2010, 2010 12th International Conference on Frontiers in Handwriting Recognition.

[18]  Marcus Liwicki,et al.  Data-embedding pen: augmenting ink strokes with meta-information , 2010, DAS '10.

[19]  Marcus Liwicki,et al.  Reliable Online Stroke Recovery from Offline Data with the Data-Embedding Pen , 2011, 2011 International Conference on Document Analysis and Recognition.

[20]  Marcus Liwicki,et al.  Data-Embedding Pen , 2013 .

[21]  Jon Almazán,et al.  ICDAR 2013 Robust Reading Competition , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[22]  Seiichi Uchida,et al.  Text Localization and Recognition in Images and Video , 2014, Handbook of Document Image Processing and Recognition.

[23]  Marcus Liwicki,et al.  More than ink - Realization of a data-embedding pen , 2014, Pattern Recognit. Lett..