"Copy and scale" method for doing time-localized M.I.R. estimation:: application to beat-tracking

In this work we propose a "copy and scale" method based on the 1-NN paradigm to estimate time-localized parameters and apply it to the problem of beat-tracking. The 1-NN algorithm consists in assigning the information of the closest item of a pre-annotated database to an unknown target. It can be viewed as a "copy and paste" method. The "copy and scale" method we propose consists in "scaling" this information to adapt it to the properties of the unknown target. For this, we first represent the content of an audio signal using a sampled and tempo-normalized complex DFT. This representation is used as the vectors over which the 1-NN search is performed. Along each vector of the 1-NN space, we store the corresponding annotated beat-marker positions in a normalized form. Once the closest vector is found, its tempo is assigned to the unknown item and the normalized beat-markers are scaled to this tempo in order to provide the estimation of the unknown item beat-markers. We perform a preliminary evaluation of this method and show that, with such a simple method, we can achieve results comparable to the ones obtained with sophisticated approaches.

[1]  George Tzanetakis,et al.  Analyzing Afro-Cuban Rhythms using Rotation-Aware Clave Template Matching with Dynamic Programming , 2008, ISMIR.

[2]  Anssi Klapuri,et al.  Music Tempo Estimation With $k$-NN Regression , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  Simon Dixon,et al.  Evaluation of the Audio Beat Tracking System BeatRoot , 2007 .

[4]  Anssi Klapuri,et al.  Sound onset detection by applying psychoacoustic knowledge , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[5]  D. Ellis Beat Tracking by Dynamic Programming , 2007 .

[6]  Masataka Goto,et al.  An Audio-based Real-time Beat Tracking System for Music With or Without Drum-sounds , 2001 .

[7]  Matthew E. P. Davies,et al.  Context-Dependent Beat Tracking of Musical Audio , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  Anssi Klapuri,et al.  Measuring the similarity of Rhythmic Patterns , 2002, ISMIR.

[9]  G. Peeters,et al.  Non-Stationary Analysis/Synthesis using Spectrum Peak Shape Distortion, Phase and Reassignement , 1999 .

[10]  Pedro Cano,et al.  Pulse-dependent analyses of percussive music , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[11]  Peter Grosche,et al.  A Mid-Level Representation for Capturing Dominant Tempo and Pulse Information in Music Recordings , 2009, ISMIR.

[12]  Jean Laroche,et al.  Efficient Tempo and Beat Tracking in Audio Recordings , 2003 .

[13]  Geoffroy Peeters,et al.  Simultaneous Beat and Downbeat-Tracking Using a Probabilistic Framework: Theory and Large-Scale Evaluation , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[14]  Gerhard Widmer,et al.  Evaluating Low-Level Features for Beat Classification and Tracking , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[15]  Ali Taylan Cemgil,et al.  Monte Carlo Methods for Tempo Tracking and Rhythm Quantization , 2011, J. Artif. Intell. Res..

[16]  Gerhard Widmer,et al.  Evaluating Rhythmic descriptors for Musical Genre Classification , 2004 .

[17]  Geoffroy Peeters,et al.  Template-Based Estimation of Time-Varying Tempo , 2007, EURASIP J. Adv. Signal Process..

[18]  Jaakko Astola,et al.  Analysis of the meter of acoustic musical signals , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[19]  Eric D. Scheirer,et al.  Tempo and beat analysis of acoustic musical signals. , 1998, The Journal of the Acoustical Society of America.

[20]  George Tzanetakis,et al.  An experimental comparison of audio tempo induction algorithms , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[21]  Simon Dixon,et al.  Automatic Extraction of Tempo and Beat From Expressive Performances , 2001 .

[22]  Stephen W. Hainsworth,et al.  Beat tracking with particle filtering algorithms , 2003, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684).

[23]  Yannis Stylianou,et al.  Rhythmic similarity of music based on dynamic periodicity warping , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[24]  Geoffroy Peeters Template-based estimation of tempo: using unsupervised or supervised learning to create better spectral templates , 2010 .

[25]  Douglas Eck,et al.  Finding Meter in Music Using An Autocorrelation Phase Matrix and Shannon Entropy , 2005, ISMIR.

[26]  Shingo Uchihashi,et al.  The beat spectrum: a new approach to rhythm analysis , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[27]  Peter Desain,et al.  On tempo tracking: Tempogram Representation and Kalman filtering , 2000, ICMC.

[28]  Gerhard Widmer,et al.  Towards Characterisation of Music via Rhythmic Patterns , 2004, ISMIR.

[29]  Yannis Stylianou,et al.  A scale transform based method for rhythmic similarity of music , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[30]  Simon Dixon,et al.  A Review of Automatic Rhythm Description Systems , 2005, Computer Music Journal.

[31]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[32]  Geoffroy Peeters Spectral and Temporal Periodicity Representations of Rhythm for the Automatic Classification of Music Audio Signal , 2011, IEEE Transactions on Audio, Speech, and Language Processing.