Steganalysis of Information Hidden in Webpage Based on Higher-order Statistics

Secret messages can be embedded into letters in tags of a Webpage in ways that are imperceptible to the human eye viewed with a browser. These messages, however, alter the inherent characteristic of the offset of a tag. This paper presents a new higher-order statistical steganalytic algorithm for detection of secret messages embedded in a Webpage. The offset is used to build the higher-order statistical models to detect whether secret messages hide in tags. 30 homepages are randomly downloaded from different Websites to test, and the results show the reliability and accuracy of statistical characteristics. The probability of missing secret messages decrease as the secret message increase, and it is zero, as 50% letters of tags are used to carry secret message.

[1]  Xingming Sun,et al.  Detection of Hidden Information in Webpage , 2007, Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007).

[2]  Xingming Sun,et al.  Detection of Hidden Information in Tags of Webpage Based on Tag-Mismatch , 2007, Third International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2007).

[3]  Sun Guang,et al.  An Algorithm of Webpage Information Hiding Based on Equal Tag , 2007 .

[4]  Zhao Qi-jun Web Page Watermarking for Tamper-Proof , 2005 .

[5]  Qijun Zhao,et al.  A PCA-based watermarking scheme for tamper-proof of web pages , 2005, Pattern Recognit..

[6]  Hui Luo,et al.  A Steganalysis Method Based on the Distribution of First Letters of Words , 2006, 2006 International Conference on Intelligent Information Hiding and Multimedia.

[7]  Hui Luo,et al.  A new steganography method based on hypertext , 2004, 2004 Asia-Pacific Radio Science Conference, 2004. Proceedings..

[8]  蒋晓华,et al.  Web Page Watermarking for Tamper-Proof , 2005 .