A review of algorithms for audio fingerprinting

An audio fingerprint is a compact, content-based signature that summarizes an audio recording. Audio fingerprinting technologies have recently attracted attention because they allow audio to be monitored independently of its format and without the need for metadata or watermark embedding. Approaches to fingerprinting are usually described with different rationales and terminology depending on the background: pattern matching, multimedia (music) information retrieval, or cryptography (robust hashing). In this paper, we review different techniques, mapping their functional parts to blocks of a unified framework.
