TS-RNN: Text Steganalysis Based on Recurrent Neural Networks

With the rapid development of natural language processing technologies, more and more text steganographic methods based on automatic text generation technology have appeared in recent years. These models use the powerful self-learning and feature extraction ability of the neural networks to learn the feature expression of massive normal texts. Then, they can automatically generate dense steganographic texts which conform to such statistical distribution based on the learned statistical patterns. In this letter, we observe that the conditional probability distribution of each word in the automatically generated steganographic texts will be distorted after embedded with hidden information. We use recurrent neural networks to extract these feature distribution differences and then classify those features into cover text and stego text categories. Experimental results show that the proposed model can achieve high detection accuracy. Besides, the proposed model can even make use of the subtle differences of the feature distribution of texts to estimate the amount of hidden information embedded in the generated steganographic text.

[1]  Yongfeng Huang,et al.  Clinical Assistant Diagnosis for Electronic Medical Record Based on Convolutional Neural Network , 2018, Scientific Reports.

[2]  Yongfeng Huang,et al.  TS-CNN: Text Steganalysis from Semantic Space Based on Convolutional Neural Network , 2018, ArXiv.

[3]  Yongfeng Huang,et al.  A Fast and Efficient Text Steganalysis Method , 2019, IEEE Signal Processing Letters.

[4]  Jessica Fridrich,et al.  Steganography in Digital Media: References , 2009 .

[5]  Peng Liu,et al.  A Novel Linguistic Steganography Based on Synonym Run-Length Encoding , 2017, IEICE Trans. Inf. Syst..

[6]  Yongfeng Huang,et al.  RITS: Real-Time Interactive Text Steganography Based on Automatic Dialogue Model , 2018, ICCCS.

[7]  Hu Zheng,et al.  Linguistic Steganography Detection Algorithm Using Statistical Language Model , 2009, 2009 International Conference on Information Technology and Computer Science.

[8]  Hang Zhou,et al.  Defining Cost Functions for Adaptive JPEG Steganography at the Microscale , 2019, IEEE Transactions on Information Forensics and Security.

[9]  Yongfeng Huang,et al.  Image Captioning with Object Detection and Localization , 2017, ICIG.

[10]  Hang Li,et al.  Neural Responding Machine for Short-Text Conversation , 2015, ACL.

[11]  Yongfeng Huang,et al.  A Sudoku Matrix-Based Method of Pitch Period Steganography in Low-Rate Speech Coding , 2017, SecureComm.

[12]  Yongfeng Huang,et al.  Text Steganography with High Embedding Rate: Using Recurrent Neural Networks to Generate Chinese Classic Poetry , 2017, IH&MMSec.

[13]  Gang Luo,et al.  Research on the coding strategy for synonym substitution-based steganography , 2014 .

[14]  Katerina J. Argyraki,et al.  Generating Steganographic Text with LSTMs , 2017, ACL.

[15]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[16]  Claude E. Shannon,et al.  Communication theory of secrecy systems , 1949, Bell Syst. Tech. J..

[17]  Jin Liu,et al.  Optimal matrix embedding for Voice-over-IP steganography , 2015, Signal Process..

[18]  Sepp Hochreiter,et al.  The Vanishing Gradient Problem During Learning Recurrent Neural Nets and Problem Solutions , 1998, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[19]  Yongfeng Huang,et al.  Text Steganography Based on Ci-poetry Generation Using Markov Chain Model , 2016, KSII Trans. Internet Inf. Syst..

[20]  Laurens van der Maaten,et al.  Accelerating t-SNE using tree-based algorithms , 2014, J. Mach. Learn. Res..

[21]  Goutam Sanyal,et al.  A real time text steganalysis by using statistical method , 2016, 2016 IEEE International Conference on Engineering and Technology (ICETECH).

[22]  Yongfeng Huang,et al.  Real-Time Steganalysis for Stream Media Based on Multi-channel Convolutional Sliding Windows , 2019, Knowl. Based Syst..

[23]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[24]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[25]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[26]  Yong-Feng Huang,et al.  RNN-Stega: Linguistic Steganography Based on Recurrent Neural Networks , 2019, IEEE Transactions on Information Forensics and Security.

[27]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[28]  Ping Zhong,et al.  Convolutional Neural Network Based Text Steganalysis , 2019, IEEE Signal Processing Letters.

[29]  Shafiz Affendi Mohd Yusof,et al.  Performance Analysis on Text Steganalysis Method Using A Computational Intelligence Approach , 2015 .

[30]  Xu Li,et al.  A linguistic steganography based on word indexing compression and candidate selection , 2018, Multimedia Tools and Applications.

[31]  Gustavus J. Simmons,et al.  The Prisoners' Problem and the Subliminal Channel , 1983, CRYPTO.

[32]  Yongfeng Huang,et al.  Automatically Generate Steganographic Text Based on Markov Model and Huffman Coding , 2018, ArXiv.