Print-scan invariant text image watermarking for hardcopy document authentication

This paper proposes a contour feature-based text image watermarking scheme that is robust to the print-scan process. A multiplicative transformation model is used to approximate a geometric invariant feature that survives a variety of attacks during printing and scanning and can therefore serve as a set of reference points for both watermark embedding and extraction. Based on this print-scan invariant, the boundary points of each character are flipped using Fourier descriptors that take visual perception characteristics into account, so that the watermark is embedded in visually nonsignificant points. When the print-scan invariant is computed, a designated text line serves as the benchmark line, so no additional characters need to be set aside for watermark adjustment, which greatly improves the hiding capacity. For detection, noise reduction and deskewing are performed first to compensate for the distortions introduced by the hardcopy. The watermark is then extracted by a parity check of the invariant feature of connected components for soft authentication. Experimental results show that the proposed approach is not limited to a particular language and achieves better robustness, watermark transparency, and hiding capacity than some existing methods.
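
As a rough illustration of why Fourier descriptors are a natural tool for contour features, the following minimal sketch computes the textbook magnitude descriptors of a closed boundary and shows that they are unchanged by translation, rotation, and uniform scaling. It is not the scheme described above: the function name `fourier_descriptors`, the parameter `n_coeffs`, and the synthetic test contour are assumptions made for the example, and real print-scan distortions (noise, blur, nonlinear gain) are not modeled.

```python
import numpy as np

def fourier_descriptors(contour, n_coeffs=8):
    """Magnitude Fourier descriptors of a closed contour (illustrative helper).

    contour: (N, 2) array of boundary points ordered along the boundary.
    """
    pts = np.asarray(contour, dtype=float)
    z = pts[:, 0] + 1j * pts[:, 1]   # treat the boundary as a complex signal
    coeffs = np.fft.fft(z)[1:]       # drop the DC term: translation invariance
    mags = np.abs(coeffs)            # drop the phase: rotation and start-point invariance
    mags = mags / mags[0]            # divide by the first harmonic: scale invariance
    return mags[:n_coeffs]

# Synthetic closed contour: an ellipse with a small third-harmonic ripple.
t = np.linspace(0.0, 2.0 * np.pi, 64, endpoint=False)
z = 2.0 * np.exp(1j * t) + np.exp(-1j * t) + 0.3 * np.exp(3j * t)
shape = np.column_stack([z.real, z.imag])

# Apply a similarity transform: scale by 1.7, rotate by 0.4 rad, then translate.
z_moved = 1.7 * np.exp(1j * 0.4) * z + (5.0 - 3.0j)
moved = np.column_stack([z_moved.real, z_moved.imag])

same = np.allclose(fourier_descriptors(shape), fourier_descriptors(moved))
print("descriptors match after similarity transform:", same)
```

Normalizing by the first harmonic assumes that harmonic is nonzero, which holds for ordinary character outlines traced in a consistent direction but should be checked before applying the helper to arbitrary contours.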
