A text watermarking algorithm based on word classification and inter-word space statistics

Text documents can be watermarked by patterning theinter-word spaces. This paper proposes a textwatermarking algorithm that exploits the novel conceptsof word classification and inter-word space statistics. Thewords are classified using some features. Severaladjacent words are grouped into a segment, and thesegments are also classified using the word classinformation. The same amount of information is insertedinto each of the segment classes. The information isencoded by modifying some statistics of inter-word spacesof the segments belonging to the same class. Severaladvantages over the conventional word-shift algorithmsare discussed.

[1]  Bill N. Schilit,et al.  As We May Read: The Reading Appliance Revolution , 1999, Computer.

[2]  Frank Hartung,et al.  Multimedia watermarking techniques , 1999, Proc. IEEE.

[3]  Steven H. Low,et al.  Copyright protection for the electronic distribution of text documents , 1999, Proc. IEEE.

[4]  Daigo Misaki,et al.  A feature calibration method for watermarking of document images , 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).

[5]  Catherine C. Marshall,et al.  As We May Read The Reading Appliance Revolution , 1999 .

[6]  Hakan Ancin,et al.  Data embedding in text for a copier system , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[7]  Hong Yan,et al.  Interword distance changes represented by sine waves for watermarking text images , 2001, IEEE Trans. Circuits Syst. Video Technol..

[8]  Proceedings Seventh International Conference on Document Analysis and Recognition , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..