SONG IDENTIFICATION WITH 2 D FOURIER TRANSFORM SEQUENCES

We approach cover song identification using a novel timeseries representation of audio based on the 2DFT. The audio is represented as a sequence of magnitude 2D Fourier Transforms (2DFT). This representation is robust to key changes, timbral changes, and small local tempo deviations. We look at cross-similarity between these time-series, and extract a distance measure that is invariant to music structure changes. Our approach is state-of-the-art on a recent cover song dataset, and expands on previous work using the 2DFT for music representation and work on live song recognition.

[1]  Thierry Bertin-Mahieux,et al.  Large-scale cover song recognition using hashed chroma landmarks , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[2]  Paul Bendich,et al.  Cover Song Identification with Timbral Shape Sequences , 2015, ISMIR.

[3]  Ton Kalker,et al.  A Highly Robust Audio Fingerprinting System , 2002, ISMIR.

[4]  Daniel P. W. Ellis,et al.  Cover song detection: From high scores to general classification , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[5]  Marc Van Droogenbroeck,et al.  Enhancing Cover Song Identification with Hierarchical Rank Aggregation , 2016, ISMIR.

[6]  Oriol Nieto,et al.  Music segment similarity using 2D-Fourier Magnitude Coefficients , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[7]  Vinícius M. A. de Souza,et al.  Music Shapelets for Fast Cover Song Recognition , 2015, ISMIR.

[8]  Maurizio Omologo,et al.  Large-Scale Cover Song Identification Using Chord Profiles , 2013, ISMIR.

[9]  Rafael C. González,et al.  Local Determination of a Moving Contrast Edge , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Emilia Gómez,et al.  Audio Cover Song Identification and Similarity: Background, Approaches, Evaluation, and Beyond , 2010, Advances in Music Information Retrieval.

[11]  Thierry Bertin-Mahieux,et al.  Large-Scale Cover Song Recognition Using the 2D Fourier Transform Magnitude , 2012, ISMIR.

[12]  Xavier Serra,et al.  Chroma Binary Similarity and Local Alignment Applied to Cover Song Identification , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[13]  Juan Pablo Bello,et al.  Measuring Structural Similarity in Music , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[14]  Daniel P. W. Ellis,et al.  A tempo-insensitive distance measure for cover song identification based on chroma features , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[15]  Judith C. Brown Calculation of a constant Q spectral transform , 1991 .

[16]  Eamonn J. Keogh,et al.  SiMPle: Assessing Music Similarity Using Subsequences Joins , 2016, ISMIR.

[17]  Pedro Cano,et al.  A Review of Audio Fingerprinting , 2005, J. VLSI Signal Process..

[18]  Simon Dixon,et al.  Combining Features for Cover Song Identification , 2015, ISMIR.

[19]  Jonathan Foote,et al.  Automatic audio segmentation using a measure of audio novelty , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[20]  Zafar Rafii,et al.  An audio fingerprinting system for live version identification using image processing techniques , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[21]  Daniel P. W. Ellis,et al.  Identifying `Cover Songs' with Chroma Features and Dynamic Programming Beat Tracking , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[22]  Meinard Müller,et al.  Audio Matching via Chroma-Based Statistical Features , 2005, ISMIR.

[23]  Avery Wang,et al.  An Industrial Strength Audio Search Algorithm , 2003, ISMIR.

[24]  Meinard Müller,et al.  Known Artist Live Song ID: A Hashprint Approach , 2016, ISMIR.

[25]  Judith C. Brown,et al.  An efficient algorithm for the calculation of a constant Q transform , 1992 .