Extracting Predominant Local Pulse Information From Music Recordings

The extraction of tempo and beat information from music recordings constitutes a challenging task in particular for non-percussive music with soft note onsets and time-varying tempo. In this paper, we introduce a novel mid-level representation that captures musically meaningful local pulse information even for the case of complex music. Our main idea is to derive for each time position a sinusoidal kernel that best explains the local periodic nature of a previously extracted note onset representation. Then we employ an overlap-add technique accumulating all these kernels over time to obtain a single function that reveals the predominant local pulse (PLP). Our concept introduces a high degree of robustness to noise and distortions resulting from weak and blurry onsets. Furthermore, the resulting PLP curve reveals the local pulse information even in the presence of continuous tempo changes and indicates a kind of confidence in the periodicity estimation. As further contribution, we show how our PLP concept can be used as a flexible tool for enhancing tempo estimation and beat tracking. The practical relevance of our approach is demonstrated by extensive experiments based on music recordings of various genres.

[1]  Anssi Klapuri,et al.  Music Tempo Estimation With $k$-NN Regression , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[2]  Yannis Stylianou,et al.  Beat Tracking using Group Delay Based Onset Detection , 2008, ISMIR.

[3]  Simon Dixon,et al.  Evaluation of the Audio Beat Tracking System BeatRoot , 2007 .

[4]  Anssi Klapuri,et al.  Sound onset detection by applying psychoacoustic knowledge , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[5]  Anssi Klapuri,et al.  Measuring the similarity of Rhythmic Patterns , 2002, ISMIR.

[6]  Pedro Cano,et al.  Pulse-dependent analyses of percussive music , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  Geoffroy Peeters,et al.  Template-Based Estimation of Time-Varying Tempo , 2007, EURASIP J. Adv. Signal Process..

[8]  Jaakko Astola,et al.  Analysis of the meter of acoustic musical signals , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[9]  Daniel P. W. Ellis,et al.  Cross-correlation of beat-synchronous representations for music similarity , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[10]  Eric D. Scheirer,et al.  Tempo and beat analysis of acoustic musical signals. , 1998, The Journal of the Acoustical Society of America.

[11]  Peter Grosche,et al.  A Mid-Level Representation for Capturing Dominant Tempo and Pulse Information in Music Recordings , 2009, ISMIR.

[12]  Matthew E. P. Davies,et al.  Note onset detection using rhythmic structure , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[13]  R. Jackendoff,et al.  A Generative Theory of Tonal Music , 1985 .

[14]  Marco Mattavelli,et al.  Music Onset Detection Based on Resonator Time Frequency Image , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[15]  Craig Stuart Sapp Comparative Analysis of Multiple Musical Performances , 2007, ISMIR.

[16]  Miguel A. Alonso,et al.  Tempo And Beat Estimation Of Musical Signals , 2004, ISMIR.

[17]  Jieping Xu,et al.  Rhythm-Based Segmentation of Popular Chinese Music , 2005, ISMIR.

[18]  Matthew E. P. Davies,et al.  Evaluation of Audio Beat Tracking and Music Tempo Extraction Algorithms , 2007 .

[19]  Roger B. Dannenberg Toward Automated Holistic Beat Tracking, Music Analysis and Understanding , 2005, ISMIR.

[20]  Simon Dixon,et al.  Automatic Extraction of Tempo and Beat From Expressive Performances , 2001 .

[21]  Matthew E. P. Davies,et al.  Context-Dependent Beat Tracking of Musical Audio , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[22]  Meinard Müller,et al.  Information retrieval for music and motion , 2007 .

[23]  Paul Masri,et al.  Imroved Modelling of Attack Transients in Music Analysis-Resynthesis , 1996, ICMC.

[24]  Nick Collins A Comparison of Sound Onset Detection Algorithms with Emphasis on Psychoacoustically Motivated Detection Functions , 2005 .

[25]  S. Dixon,et al.  PERCEPTUAL SMOOTHNESS OF TEMPO IN EXPRESSIVELY PERFORMED MUSIC , 2006 .

[26]  Peter Grosche,et al.  High resolution audio synchronization using chroma onset features , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[27]  Dan Stowell,et al.  Adaptive whitening for Improved Real-Time audio onset Detection , 2007, ICMC.

[28]  Shingo Uchihashi,et al.  The beat spectrum: a new approach to rhythm analysis , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[29]  Peter Desain,et al.  On tempo tracking: Tempogram Representation and Kalman filtering , 2000, ICMC.

[30]  Mark B. Sandler,et al.  A tutorial on onset detection in music signals , 2005, IEEE Transactions on Speech and Audio Processing.

[31]  Masataka Goto,et al.  An Audio-based Real-time Beat Tracking System for Music With or Without Drum-sounds , 2001 .

[32]  Bingjun Zhang,et al.  Multiple-Feature Fusion Based Onset Detection for Solo Singing Voice , 2008, ISMIR.

[33]  Orberto,et al.  Evaluation Methods for Musical Audio Beat Tracking Algorithms , 2009 .

[34]  Jarno Sepp nen TATUM GRID ANALYSIS OF MUSICAL SIGNALS , 2001 .

[35]  R. Parncutt A Perceptual Model of Pulse Salience and Metrical Accent in Musical Rhythms , 1994 .

[36]  Jarno Seppänen,et al.  Joint Beat & Tatum Tracking from Music Signals , 2006, ISMIR.

[37]  Miguel A. Alonso,et al.  Accurate tempo estimation based on harmonic + noise decomposition , 2007, EURASIP J. Adv. Signal Process..

[38]  Geoffroy Peeters,et al.  Simultaneous estimation of chord progression and downbeats from an audio file , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[39]  William A. Sethares,et al.  Rhythm and Transforms , 2007 .

[40]  Daniel P. W. Ellis,et al.  Beat Tracking by Dynamic Programming , 2007 .

[41]  Peter Grosche,et al.  Computing predominant local periodicity information in music recordings , 2009, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[42]  Jeff A. Bilmes,et al.  Techniques to Foster Drum Machine Expressivity , 1993, ICMC.

[43]  Masataka Goto,et al.  RWC Music Database: Popular, Classical and Jazz Music Databases , 2002, ISMIR.