Feature-based classification for audio bootlegs detection

In the past few years, the spread of multimedia sharing platforms has led to a dramatic growth in user-generated content available online. However, since media sharing is often poorly regulated, copyright infringement may occur. One classic example is the pirate distribution of audio bootlegs, i.e., concerts illegally recorded using portable devices. To protect copyright and curb the sharing of such illicit material, in this paper we propose an automatic audio bootleg detector. The detector can analyze audio data in bulk, for instance to filter out of a database the tracks recorded by fans during a live performance. To this end, we propose a set of acoustic features, justified by theoretical foundations, to characterize audio bootlegs. We then train a binary classifier that operates on this set of features to discriminate between: i) audio tracks recorded at concerts, clubs, or theaters; ii) legally distributed live performances that have been professionally mixed and edited. To validate our system, we tested it on a dataset of more than 250 audio excerpts covering different musical genres and different kinds of music performances. The results are promising, showing high bootleg detection accuracy.
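
The pipeline described above (per-excerpt acoustic features followed by a binary classifier) can be illustrated with a minimal sketch. The abstract names neither the features nor the classifier, so everything here is an assumption chosen for illustration: the three features (zero-crossing rate, RMS energy, crest factor), the nearest-centroid classifier, the synthetic noisy tones standing in for real excerpts, and all function names. It is not the method evaluated in the paper.

```python
import math
import random


def zero_crossing_rate(x):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(x, x[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(x) - 1)


def rms(x):
    """Root-mean-square level of the excerpt."""
    return math.sqrt(sum(v * v for v in x) / len(x))


def crest_factor(x):
    """Peak level divided by RMS level."""
    return max(abs(v) for v in x) / (rms(x) + 1e-12)


def extract_features(x):
    # Illustrative feature set only; the paper's actual features are not given here.
    return [zero_crossing_rate(x), rms(x), crest_factor(x)]


def synth_excerpt(rng, noise_std, n=2000, sr=8000):
    """Hypothetical stand-in for an audio excerpt: a tone plus Gaussian noise."""
    freq = rng.uniform(200.0, 400.0)
    return [0.5 * math.sin(2 * math.pi * freq * t / sr) + rng.gauss(0.0, noise_std)
            for t in range(n)]


def fit(features, labels):
    """Standardize each feature, then store one centroid per class."""
    dims = range(len(features[0]))
    mean = [sum(f[d] for f in features) / len(features) for d in dims]
    std = [math.sqrt(sum((f[d] - mean[d]) ** 2 for f in features) / len(features)) + 1e-12
           for d in dims]
    scaled = [[(f[d] - mean[d]) / std[d] for d in dims] for f in features]
    centroids = {}
    for label in set(labels):
        rows = [s for s, l in zip(scaled, labels) if l == label]
        centroids[label] = [sum(r[d] for r in rows) / len(rows) for d in dims]
    return mean, std, centroids


def predict(model, feature):
    """Return the label of the closest class centroid in standardized space."""
    mean, std, centroids = model
    z = [(v - m) / s for v, m, s in zip(feature, mean, std)]
    return min(centroids, key=lambda c: sum((a - b) ** 2 for a, b in zip(z, centroids[c])))


def run_demo(seed=0):
    """Train on synthetic data and return accuracy on held-out synthetic data."""
    rng = random.Random(seed)

    def make(n):
        # Label 0: low-noise excerpts (stand-in for professionally produced tracks).
        data = [(extract_features(synth_excerpt(rng, 0.01)), 0) for _ in range(n)]
        # Label 1: high-noise excerpts (stand-in for audience recordings).
        data += [(extract_features(synth_excerpt(rng, 0.30)), 1) for _ in range(n)]
        return data

    train, test = make(20), make(10)
    model = fit([f for f, _ in train], [l for _, l in train])
    correct = sum(predict(model, f) == l for f, l in test)
    return correct / len(test)


if __name__ == "__main__":
    print(f"held-out accuracy: {run_demo():.2f}")
```

In practice, the synthetic excerpts would be replaced by decoded audio samples, and the nearest-centroid rule by whichever binary classifier is preferred; the feature-then-classify structure stays the same.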
