Paper information - Natural Language Watermarking and Tamperproofing

Natural Language Watermarking and Tamperproofing

This paper presents two main results in information hiding in natural language text. First, a semantically based scheme dramatically improves the information-hiding capacity of any text through two techniques: (i) modifying the granularity of meaning of individual sentences, whereas our own previous scheme kept the granularity fixed, and (ii) halving the number of sentences affected by the watermark. No longer a "long text, short watermark" approach, it makes it possible to watermark short texts such as wire agency reports. Second, combining this semantic marking scheme with our previous syntactically based method hides information in a way that reveals any non-trivial tampering with the text with probability $1 - 2^{-s(n+1)}$, where $n$ is the number of sentences in the text and $s$ is a small positive integer based on the extent of co-referencing. Re-formatting is not considered tampering; otherwise the problem would be solved trivially by hiding a hash of the text.
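To give a feel for how quickly the tamper-detection probability approaches 1, the following is a minimal sketch that evaluates the bound, reading it as 1 - 2^(-s(n+1)). The function name `tamper_detection_probability` and the sample values of `n` and `s` are illustrative assumptions, not taken from the paper.

```python
from fractions import Fraction


def tamper_detection_probability(n: int, s: int) -> Fraction:
    """Evaluate 1 - 2^(-s(n+1)) exactly.

    n: number of sentences in the text (assumed non-negative).
    s: small positive integer tied to the extent of co-referencing.
    Hypothetical helper for illustration only.
    """
    if n < 0 or s < 1:
        raise ValueError("n must be >= 0 and s must be >= 1")
    return 1 - Fraction(1, 2 ** (s * (n + 1)))


if __name__ == "__main__":
    # Illustrative values only: short texts with a small s.
    for n in (1, 5, 10):
        p = tamper_detection_probability(n, s=2)
        print(f"n={n:2d}, s=2 -> probability of missing tampering = {float(1 - p):.3e}")
```

Because the exponent grows linearly in the number of sentences, even a text of about ten sentences leaves only a very small chance that non-trivial tampering goes unnoticed under this reading of the bound.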

[1] Kalman Cinkler, et al. Very low bit-rate wavelet video coding, 1998, IEEE J. Sel. Areas Commun.

[2] Sergei Nirenburg, et al. Book Review: Ontological Semantics, by Sergei Nirenburg and Victor Raskin, 2004, CL.

[3] Ross J. Anderson. Stretching the Limits of Steganography, 1996, Information Hiding.

[4] Ross J. Anderson, et al. On the limits of steganography, 1998, IEEE J. Sel. Areas Commun.

[5] Martha James Hardman. Appendix I. SAMPLE TEXT, 1966.

[6] Steven H. Low, et al. Document identification for copyright protection using centroid detection, 1998, IEEE Trans. Commun.

[7] Peter Wayner, et al. Mimic Functions, 1992, Cryptologia.

[8] Sergei Nirenburg, et al. Natural language processing for information assurance and security: an overview and implementations, 2001, NSPW '00.

[9] Mark Chapman, et al. Hiding the Hidden: A software system for concealing ciphertext as innocuous text, 1997, ICICS.

[10] Nicholas F. Maxemchuk, et al. Electronic document distribution, 1994, AT&T Technical Journal.

[11] Peter Wayner. Strong Theoretical Steganography, 1995, Cryptologia.

[12] Lawrence O'Gorman, et al. Electronic marking and identification techniques to discourage document copying, 1994, Proceedings of INFOCOM '94 Conference on Computer Communications.

[13] Mikhail J. Atallah, et al. Natural Language Watermarking: Design, Analysis, and a Proof-of-Concept Implementation, 2001, Information Hiding.