A zero text watermarking algorithm based on non-vowel ASCII characters

The widespread use of the Internet and other communication technologies has made it easy to reproduce, disclose, and distribute digital content. Alongside the benefits of information exchange, the digital community faces authentication, forgery, and copyright protection issues. The amount of textual information on the Internet is growing alongside images, audio, and video, which makes copyright protection of plain text an important issue. In this paper, we propose a zero text watermarking algorithm based on the occurrence frequency of non-vowel ASCII characters and words for copyright protection of plain text documents. The embedding algorithm uses the occurrences of non-vowel ASCII characters in text partitions to form a key based on the watermark. The extraction algorithm extracts the watermark from the noisy text to identify the original copyright owner. Experimental results demonstrate the effectiveness of the proposed algorithm on text subjected to dispersed insertion and deletion attacks occurring randomly.
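The abstract does not give the details of the embedding and extraction steps, so the following is only a minimal sketch of the general zero-watermarking idea, not the authors' algorithm. It assumes that "non-vowel ASCII characters" means ASCII consonant letters (case-folded), that partitions are fixed-size groups of words, that each partition contributes its most frequent consonant as a feature, and that the key is formed by XOR-ing the watermark characters with those features. The function names and the `words_per_partition` parameter are hypothetical.

```python
from collections import Counter

VOWELS = set("aeiou")


def partition_features(text, words_per_partition=10):
    """Return one feature per partition: its most frequent non-vowel ASCII letter.

    Assumption: partitions are consecutive groups of `words_per_partition` words.
    """
    words = text.split()
    features = []
    for start in range(0, len(words), words_per_partition):
        chunk = "".join(words[start:start + words_per_partition]).lower()
        counts = Counter(
            ch for ch in chunk
            if ch.isascii() and ch.isalpha() and ch not in VOWELS
        )
        if counts:
            # Highest count wins; ties are broken alphabetically for determinism.
            features.append(min(counts, key=lambda ch: (-counts[ch], ch)))
        else:
            # Placeholder for a partition with no consonants (assumption).
            features.append("?")
    return features


def generate_key(text, watermark, words_per_partition=10):
    """Form a key from the text features and the watermark; the text is not modified."""
    features = partition_features(text, words_per_partition)
    if not features:
        raise ValueError("text must contain at least one word")
    # Assumption: XOR is used to combine watermark and features.
    return [
        ord(w) ^ ord(features[i % len(features)])
        for i, w in enumerate(watermark)
    ]


def extract_watermark(text, key, words_per_partition=10):
    """Recover the watermark from a (possibly altered) text and the stored key."""
    features = partition_features(text, words_per_partition)
    if not features:
        raise ValueError("text must contain at least one word")
    return "".join(
        chr(k ^ ord(features[i % len(features)]))
        for i, k in enumerate(key)
    )


if __name__ == "__main__":
    document = (
        "The widespread use of the Internet has made it easy to reproduce "
        "and distribute digital content without the consent of its owner."
    )
    key = generate_key(document, "ACME", words_per_partition=5)
    print(extract_watermark(document, key, words_per_partition=5))
```

Because the watermark lives only in the key, the document itself stays untouched, which is what makes the scheme "zero" watermarking. In this sketch, an insertion or deletion corrupts a recovered watermark character only if it changes the most frequent consonant of the partition that character depends on; how the actual algorithm handles shifted partitions is not stated in the abstract.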
