Scalable robust audio fingerprinting using MPEG-7 content description

Much interest has recently been received by systems for audio fingerprinting which enable automatic content-based identification by extracting unique signatures from the signal. Among other aspects, the main requirements for such systems include robustness to a wide range of signal distortions and availability of fast search methods, even for large fingerprint databases. This paper describes the provisions of the MPEG-7 standard for audio fingerprinting which allow for interoperability of fingerprints generated according to the open standardized specification for extraction. In addition, it discusses the ability to generate scalable fingerprints providing different trade-offs between fingerprint compactness, temporal coverage and robustness of recognition, and gives experimental results for a number of system configurations.

[1]  Constantin Papaodysseus,et al.  A New Approach to the Automatic Recognition of Musical Recordings , 2001 .

[2]  Jaap A. Haitsma,et al.  Robust Audio Hashing for Content Identification , 2001 .

[3]  Helmut Neuschmied,et al.  Content-based identification of audio titles on the Internet , 2001, Proceedings First International Conference on WEB Delivering of Music. WEDELMUSIC 2001.

[4]  Jürgen Herre,et al.  Robust matching of audio signals using spectral flatness features , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[5]  Douglas Keislar,et al.  Content-Based Classification, Search, and Retrieval of Audio , 1996, IEEE Multim..

[6]  Jürgen Herre,et al.  Advanced Audio Identification Using MPEG-7 Content Description , 2001 .

[7]  Frank Kurth,et al.  Full-Text Indexing of Very Large Audio Data Bases , 2001 .

[8]  Jürgen Herre,et al.  AudioID: Towards Content-Based Identification of Audio Material , 2001 .

[9]  Cheng Yang MACS: music audio characteristic sequence indexing for similarity retrieval , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).