AudioPrint: An efficient audio fingerprint system based on a novel cost-less synchronization scheme

This paper presents the latest improvements to AudioPrint, the IRCAM audio fingerprint system. Cosine filters are introduced in the short-term spectral analysis to compensate for the effect of pitch shifting, and a simple method is proposed for determining frame positions that is robust to audio degradations and incurs almost no additional cost. We then show that both contributions significantly improve the AudioPrint system, with evaluations on both a free, publicly available corpus and a real-world corpus of broadcast radio streams.
