Paper Information - Light Multiscale Conventional Neural Network for MP3 Steganalysis

Light Multiscale Conventional Neural Network for MP3 Steganalysis

In this paper, we propose a light multiscale convolutional neural network for detecting adaptive MP3 steganography. It can attack both MP3 steganography based on Huffman code substitution and the method that modifies sign bits during MP3 encoding. In particular, we use convolution factorization to reduce the model size and graphics memory usage. At the same time, convolution kernels of different sizes are applied within one layer, which helps retain detection performance. Following the residual structure, a shortcut connection is added to the proposed network to further improve its performance. Experimental results show that the accuracy exceeds 90% when the payload rate is high, and that the model size is reduced by 70% compared with previous networks.
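The abstract does not say which factorization is used, so the following is only a rough sketch of why convolution factorization shrinks a model. It assumes an asymmetric factorization, where a k x k convolution is replaced by a 1 x k convolution followed by a k x 1 convolution, and it uses hypothetical channel widths. The function names are illustrative and do not come from the paper.

```python
def conv_params(in_ch, out_ch, kh, kw, bias=True):
    """Count the trainable parameters of a 2-D convolution layer."""
    return in_ch * out_ch * kh * kw + (out_ch if bias else 0)


def factorized_conv_params(in_ch, out_ch, k, bias=True):
    """Count parameters when a k x k convolution is replaced by a
    1 x k convolution followed by a k x 1 convolution (an assumed
    factorization, not necessarily the one used in the paper)."""
    first = conv_params(in_ch, out_ch, 1, k, bias)
    second = conv_params(out_ch, out_ch, k, 1, bias)
    return first + second


if __name__ == "__main__":
    in_ch, out_ch = 32, 32  # hypothetical channel widths
    for k in (3, 5):
        full = conv_params(in_ch, out_ch, k, k)
        fact = factorized_conv_params(in_ch, out_ch, k)
        saving = 1 - fact / full
        print(f"{k}x{k}: full={full}, factorized={fact}, saving={saving:.0%}")
```

The saving grows with the kernel size, which suggests why factorization pairs well with a multiscale layer that mixes small and large kernels. These numbers are illustrative only and are not meant to reproduce the 70% reduction reported in the abstract.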
