Detection of Hidden Information in Tags of Webpage Based on Tag-Mismatch

Secret messages can be embedded in a webpage by switching letters in its tags between uppercase and lowercase. This paper presents a novel steganalytic approach, called Tag-Mismatch analysis, for detecting hidden information embedded in tags. To keep the false positive and false negative rates reasonable, the approach is first trained on a set of 4,100 webpages to determine the decision threshold. It is shown that the length of the embedded secret message can be estimated with relatively high precision, and that the resulting detection algorithm is simple and fast. Experimental results show that the detection rate exceeds 85% when the embedding rate exceeds 0.3%.
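To make the embedding idea concrete, the following is a minimal sketch of case-switching in tags and of one possible mismatch statistic. It is an illustration under stated assumptions, not the paper's algorithm: the bit encoding (uppercase = 1, lowercase = 0), the helper names `embed_bits` and `mismatch_ratio`, and the choice of "mixed-case tag names" as the mismatch measure are all hypothetical. The abstract does not define the Tag-Mismatch statistic or the threshold, so none is given here.

```python
import re

# Matches the start of an opening or closing tag and captures the tag name.
TAG_RE = re.compile(r"<(/?)([A-Za-z][A-Za-z0-9]*)")


def embed_bits(html, bits):
    """Hide bits in tag-name letters (assumed encoding: upper = 1, lower = 0)."""
    payload = iter(bits)

    def recase(match):
        out = []
        for ch in match.group(2):
            bit = next(payload, None) if ch.isalpha() else None
            if bit is None:
                out.append(ch)  # digit, or payload exhausted: leave as-is
            else:
                out.append(ch.upper() if bit else ch.lower())
        return "<" + match.group(1) + "".join(out)

    return TAG_RE.sub(recase, html)


def mismatch_ratio(html):
    """Fraction of tag names that mix uppercase and lowercase letters.

    This is a stand-in statistic chosen for illustration; hand-written or
    generated HTML rarely mixes case inside one tag name.
    """
    names = [m.group(2) for m in TAG_RE.finditer(html)]
    if not names:
        return 0.0
    mixed = sum(1 for n in names if not (n.islower() or n.isupper()))
    return mixed / len(names)


cover = "<html><body><p>hi</p></body></html>"
stego = embed_bits(cover, [1, 0, 0, 1, 0])
print(stego)
print(mismatch_ratio(cover), mismatch_ratio(stego))
```

Because HTML tag names are case-insensitive, the cover and stego pages render identically. A detector along these lines would compare the statistic against a threshold learned from a training set, as the abstract describes.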
