STBS: A Statistical Algorithm for Steganalysis of Translation-Based Steganography

Translation-Based Steganography is a secure text steganographic algorithm. In this paper, we present a novel statistical algorithm for steganalysis of Translation-Based Steganography (STBS). We first show that there are fewer high-frequency words in stegotexts than in normal texts. We then design a preprocessor to refine all the given texts to expand the frequency differences between normal texts and stegotexts. 12 dimensional feature vectors sensitive to frequency are derived from the refined texts. We finally use a SVM classifier to classify given texts to normal texts and stegotexts. A series of experiments is given to demonstrate the performance of STBS.

[1]  Mark Chapman,et al.  Hiding the Hidden: A software system for concealing ciphertext as innocuous text , 1997, ICICS.

[2]  Krista Bennett,et al.  LINGUISTIC STEGANOGRAPHY: SURVEY, ANALYSIS, AND ROBUSTNESS CONCERNS FOR HIDING INFORMATION IN TEXT , 2004 .

[3]  Philipp Koehn,et al.  Europarl: A Parallel Corpus for Statistical Machine Translation , 2005, MTSUMMIT.

[4]  Mikhail J. Atallah,et al.  Translation-based steganography , 2005, J. Comput. Secur..

[5]  Mikhail J. Atallah,et al.  Lost in just the translation , 2006, SAC.

[6]  Edward J. Delp,et al.  Attacks on lexical natural language steganography systems , 2006, Electronic Imaging.

[7]  Hisham M. Haddad Proceedings of the 2006 ACM symposium on Applied computing , 2006, SAC.

[8]  Xin-xin Zhao,et al.  Effective Linguistic Steganography Detection , 2008, 2008 IEEE 8th International Conference on Computer and Information Technology Workshops.

[9]  Huang Liusheng,et al.  A Statistical Algorithm for Linguistic Steganography Detection Based on Distribution of Words , 2008, ARES 2008.

[10]  Liusheng Huang,et al.  Linguistic Steganography Detection Using Statistical Characteristics of Correlations between Words , 2008, Information Hiding.

[11]  P. Wayner Disappearing Cryptography: Information Hiding: Steganography and Watermarking , 2008 .

[12]  Translation-based steganography , 2009, J. Comput. Secur..

[13]  Zhili Chen,et al.  Attacks on Translation Based Steganography , 2009, 2009 IEEE Youth Conference on Information, Computing and Telecommunication.

[14]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.