COdec Recognition From Decoded Audio

Identifying or verifying the coding of an audio recording in uncompressed format may be useful in many fields, from audio authenticity assessment, in forensic science, to optimizing the use of biometric speaker recognition systems, when the test samples are collected remotely. In this paper, a channel recognition algorithm is presented for use with uncompressed audio, in the absence of noise. We considered the learning capabilities of Gaussian mixture models (GMM) and noticed the slightest variations induced by the transmission channel into the speech signal can be used to make channel recognition decisions. To the best of our knowledge, this is the first holistic approach of detecting adaptive multi-rate (AMR), wide band AMR (AMRWB) and G.729 codecs in an uncompressed audio, although these are some of the most used codecs in global systems for mobile telephony (GSM) and voice over Internet protocol (VoIP).

[1]  J. Berger,et al.  P.563—The ITU-T Standard for Single-Ended Speech Quality Assessment , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[2]  Dragos Burileanu,et al.  On forensic speaker recognition case pre-assessment , 2013, 2013 7th Conference on Speech Technology and Human - Computer Dialogue (SpeD).

[3]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[4]  Jiwu Huang,et al.  Detecting digital audio forgeries by checking frame offsets , 2008, MM&Sec '08.

[5]  Cemal Hanilçi,et al.  Recognition of Brand and Models of Cell-Phones From Recorded Speech Signals , 2012, IEEE Transactions on Information Forensics and Security.

[6]  Rui Yang,et al.  Compression history identification for digital audio signal , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[7]  M. Herrera Martinez Evaluation of Audio Compression Artifacts , 2007 .

[8]  Erkam Uzun,et al.  Methods for identifying traces of compression in audio , 2013, 2013 1st International Conference on Communications, Signal Processing, and their Applications (ICCSPA).

[9]  Daniel Garcia-Romero,et al.  Automatic Speech Codec Identification with Applications to Tampering Detection of Speech Recordings , 2011, INTERSPEECH.