Methods for identifying traces of compression in audio

Compression history of an audio may reveal very useful information when traces of tampering has to be investigated or quality of an audio has to be evaluated. Motivated by this, we introduce two methods that can discriminate between single and double compressed audio and can identify compression codec and bit rate of an audio. The first method utilizes audio quality measures to realize this and operates on decoded audio. The second method, alternatively, works on coded audio, effectively the audio bit stream, and characterizes randomness and chaotic properties of the bit stream to achieve these tasks. Unlike the existing work in the literature, which are proposed mainly for MP3 encoded audio, both methods can be applied to all encoding formats. Extensive tests have been performed to test the performance of both methods under various settings. Results show that both methods can be very reliably used to obtain information on compression history of an audio.

[1]  Wonho Yang,et al.  Enhanced modified bark spectral distortion (embsd): an objective speech quality measure based on audible distortion and cognition model , 1999 .

[2]  Rui Yang,et al.  Detecting double compression of audio signal , 2010, Electronic Imaging.

[3]  Rui Yang,et al.  Defeating fake-quality MP3 , 2009, MM&Sec '09.

[4]  Stephen D. Voran,et al.  Objective estimation of perceived speech quality. I. Development of the measuring normalizing block technique , 1999, IEEE Trans. Speech Audio Process..

[5]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[6]  Erkam Uzun,et al.  Classification of audios containing speech and music , 2012, 2012 20th Signal Processing and Communications Applications Conference (SIU).

[7]  I. Johnstone,et al.  Ideal denoising in an orthonormal basis chosen from a library of bases , 1994 .

[8]  Andrew Sekey,et al.  An Objective Measure for Predicting Subjective Quality of Speech Coders , 1992, IEEE J. Sel. Areas Commun..

[9]  Qingzhong Liu,et al.  Detection of Double MP3 Compression , 2010, Cognitive Computation.

[10]  Husrev T. Sencar,et al.  Audio codec identification through payload sampling , 2011, 2011 IEEE International Workshop on Information Forensics and Security.

[11]  George Tzanetakis,et al.  Audio Analysis using the Discrete Wavelet Transform , 2001 .

[12]  Alex Waibel,et al.  Readings in speech recognition , 1990 .

[13]  Qingzhong Liu,et al.  Revealing real quality of double compressed MP3 audio , 2010, ACM Multimedia.

[14]  G. W. Hughes,et al.  Minimum Prediction Residual Principle Applied to Speech Recognition , 1975 .

[15]  Nasir D. Memon,et al.  Detection of audio covert channels using statistical footprints of hidden messages , 2006, Digit. Signal Process..

[16]  Brian A. Hanson,et al.  Spectral slope distance measures with linear prediction analysis for word recognition in noise , 1987, IEEE Trans. Acoust. Speech Signal Process..

[17]  Jiwu Huang,et al.  Detecting digital audio forgeries by checking frame offsets , 2008, MM&Sec '08.

[18]  Rui Yang,et al.  Compression history identification for digital audio signal , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).