Cell-Phone Identification from Recompressed Audio Recordings

Many audio forensic applications would benefit from the ability to classify audio recordings by characteristics of the originating device, particularly on social media platforms, where an enormous amount of data is posted every day. This paper identifies the source cell-phone of a recording using passive signatures of the recording device, extracted from the recorded audio itself, without relying on any extrinsic security mechanism such as digital watermarking. It uses device-specific information present in both the low-frequency and high-frequency regions of the recorded audio. On MOBIPHONE, the only publicly available dataset in this field, the proposed system achieves a closed-set accuracy of 97.2%, which matches the state-of-the-art accuracy reported for this dataset. On audio recordings that have undergone double compression, as typically happens when a recording is posted on social media, the proposed system outperforms existing methods (4% improvement in average accuracy).
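The abstract does not specify which features are extracted, so the following is only a toy sketch of the general idea of summarizing a frame's low-frequency and high-frequency content separately. The function names, the naive DFT, the single split frequency, and the log-energy summary are all assumptions made for illustration, not the paper's method.

```python
import cmath
import math


def power_spectrum(frame):
    """Naive DFT power spectrum of one real-valued frame (bins 0..N/2)."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2 / n)
    return spectrum


def band_log_energies(frame, sample_rate, split_hz):
    """Return (low, high) log energies below and at-or-above split_hz.

    split_hz is a hypothetical boundary chosen for illustration only.
    """
    spec = power_spectrum(frame)
    n = len(frame)
    low = high = 0.0
    for k, p in enumerate(spec):
        freq = k * sample_rate / n  # center frequency of bin k in Hz
        if freq < split_hz:
            low += p
        else:
            high += p
    eps = 1e-12  # avoids log(0) for an empty band
    return math.log(low + eps), math.log(high + eps)


# Toy usage: two synthetic tones stand in for recorded audio frames.
sample_rate = 8000
n = 64
low_tone = [math.sin(2 * math.pi * 500 * t / sample_rate) for t in range(n)]
high_tone = [math.sin(2 * math.pi * 3000 * t / sample_rate) for t in range(n)]
low_feats = band_log_energies(low_tone, sample_rate, split_hz=2000)
high_feats = band_log_energies(high_tone, sample_rate, split_hz=2000)
```

In a real system, per-frame features of this kind would typically be aggregated over a recording and passed to a classifier trained on the known set of devices; that step is omitted here.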
