Exposing MP3 audio forgeries using frame offsets

Audio recordings should be authenticated before they are used as evidence. Although audio watermarking and digital signatures are widely used for authentication, both techniques require access to the original audio before it is published. Passive authentication is therefore necessary for digital audio, especially for the most popular audio format, MP3. In this article, we propose a passive approach to detecting forgeries in MP3 audio. During MP3 encoding, the audio samples are divided into frames, so each frame has its own frame offset after encoding. Forgeries break this framing grid, which makes the frame offset a good indicator for locating them. The frame offset can be retrieved by identifying the quantization characteristic, so the doctored positions can be located automatically. Experimental results demonstrate that the proposed approach is effective in detecting some common forgeries, such as deletion, insertion, substitution, and splicing. Even when the bit rate is as low as 32 kbps, the detection rate is above 99%.
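
The idea that a forgery breaks the framing grid can be illustrated with a small simulation. The sketch below is not the detector described in the article: it skips the step of estimating offsets from the quantization characteristic and instead computes the offsets that an ideal estimator would report after a deletion. The frame length of 1152 samples (one MPEG-1 Layer III frame) and the names `grid_offsets_after_deletion` and `locate_breaks` are assumptions made for illustration.

```python
FRAME_LEN = 1152  # assumed frame length in samples (MPEG-1 Layer III)


def grid_offsets_after_deletion(total_len, cut_start, cut_len, hop=FRAME_LEN):
    """Simulate the frame offsets an ideal estimator would report.

    The original signal has frame boundaries at multiples of FRAME_LEN.
    Deleting `cut_len` samples starting at `cut_start` shifts the grid of
    everything that followed the cut. For each analysis window start `s`
    in the doctored signal, return the distance (mod FRAME_LEN) from `s`
    to the next original frame boundary.
    """
    doctored_len = total_len - cut_len
    offsets = []
    for s in range(0, doctored_len, hop):
        # Map the doctored position back to its position in the original.
        orig_pos = s if s < cut_start else s + cut_len
        offsets.append((-orig_pos) % FRAME_LEN)
    return offsets


def locate_breaks(offsets):
    """Return the window indices at which the frame offset changes."""
    return [i for i in range(1, len(offsets)) if offsets[i] != offsets[i - 1]]


if __name__ == "__main__":
    # Ten frames of audio, with 300 samples deleted 100 samples into frame 4.
    offsets = grid_offsets_after_deletion(10 * FRAME_LEN, 4 * FRAME_LEN + 100, 300)
    print(offsets)
    print(locate_breaks(offsets))
```

In this toy setting the offset is constant before the cut and takes a different constant value after it, so the index returned by `locate_breaks` marks the window in which the grid changes. A real detector would have to estimate each offset from the decoded audio and tolerate estimation errors.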
