Template-Based Estimation of Time-Varying Tempo

We present a novel approach to automatic estimation of tempo over time. This method aims at detecting tempo at the tactus level for percussive and nonpercussive audio. The front-end of our system is based on a proposed reassigned spectral energy flux for the detection of musical events. The dominant periodicities of this flux are estimated by a proposed combination of discrete Fourier transform and frequency-mapped autocorrelation function. The most likely meter, beat, and tatum over time are then estimated jointly using proposed meter/beat subdivision templates and a Viterbi decoding algorithm. The performances of our system have been evaluated on four different test sets among which three were used during the ISMIR 2004 tempo induction contest. The performances obtained are close to the best results of this contest.

[1]  George Tzanetakis,et al.  An experimental comparison of audio tempo induction algorithms , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[2]  Patrick J. Wolfe,et al.  Time-Frequency Reassignment for Music Analysis , 2001, ICMC.

[3]  Juan Pablo,et al.  Towards the automated analysis of simple polyphonic music : a knowledge-based approach , 2003 .

[4]  Geoffroy Peeters,et al.  Music Pitch Representation by Periodicity Measures Based on Combined Temporal and Spectral Representations , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[5]  Simon Dixon,et al.  Automatic Extraction of Tempo and Beat From Expressive Performances , 2001 .

[6]  Jaakko Astola,et al.  Analysis of the meter of acoustic musical signals , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  Fabien Gouyon,et al.  Determination of the meter of musical audio signals: Seeking recurrences in beat segment descriptors , 2003 .

[8]  Hugues Vinet The Semantic HiFi Project , 2005, ICMC.

[9]  Eric D. Scheirer,et al.  Tempo and beat analysis of acoustic musical signals. , 1998, The Journal of the Acoustical Society of America.

[10]  Xavier Rodet,et al.  Fundamental frequency estimation and tracking using maximum likelihood harmonic matching and HMMs , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[11]  F. Gouyon A computational approach to rhythm description - Audio features for the computation of rhythm periodicity functions and their use in tempo induction and music content processing , 2005 .

[12]  Jean Laroche,et al.  Efficient Tempo and Beat Tracking in Audio Recordings , 2003 .

[13]  Patrick Flandrin,et al.  Time-Frequency/Time-Scale Analysis , 1998 .

[14]  J. Beauchamp,et al.  Fundamental frequency estimation of musical signals using a two‐way mismatch procedure , 1994 .

[15]  Jeff A. Bilmes,et al.  Timing is of the essence : perceptual and computational techniques for representing, learning, and reproducing expressive timing in percussive rhythm , 1993 .

[17]  M. Puckette,et al.  Calculation of a "narrowed" autocorrelation function. , 1989, The Journal of the Acoustical Society of America.

[18]  Judith C. Brown Determination of the meter of musical scores by autocorrelation , 1993 .

[19]  Roger B. Dannenberg,et al.  Tracking Musical Beats in Real Time , 1990, ICMC.

[20]  Anssi Klapuri,et al.  Sound onset detection by applying psychoacoustic knowledge , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[21]  Simon Dixon,et al.  A Review of Automatic Rhythm Description Systems , 2005, Computer Music Journal.

[22]  Gerhard Widmer,et al.  Classification of dance music by periodicity patterns , 2003, ISMIR.

[23]  Anssi Klapuri,et al.  Measuring the similarity of Rhythmic Patterns , 2002, ISMIR.

[24]  Pedro Cano,et al.  Pulse-dependent analyses of percussive music , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[25]  Christian Uhle,et al.  ESTIMATION OF TEMPO, MICRO TIME AND TIME SIGNATURE FROM PERCUSSIVE MUSIC , 2003 .

[26]  Geoffroy Peeters Rhythm Classification Using Spectral Rhythm Patterns , 2005, ISMIR.

[27]  F. Harris On the use of windows for harmonic analysis with the discrete Fourier transform , 1978, Proceedings of the IEEE.

[28]  G. Peeters,et al.  Non-Stationary Analysis/Synthesis using Spectrum Peak Shape Distortion, Phase and Reassignement , 1999 .

[29]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[30]  Juan Pablo Bello,et al.  Towards the automated analysis of simple polyphonic music: A knowledge-based approach (Ph.D. Thesis) , 2003 .

[31]  A. Röbel A NEW APPROACH TO TRANSIENT PROCESSING IN THE PHASE VOCODER , 2003 .

[32]  Van Nostrand,et al.  Error Bounds for Convolutional Codes and an Asymptotically Optimum Decoding Algorithm , 1967 .

[33]  Masataka Goto,et al.  An Audio-based Real-time Beat Tracking System for Music With or Without Drum-sounds , 2001 .

[34]  Geoffroy Peeters,et al.  Modeles et modification du signal sonore adaptes aux caracteristiques locales , 2001 .