Tempo Detection of Urban Music Using Tatum Grid Non Negative Matrix Factorization

High tempo detection accuracies have been reported for the analysis of percussive, constant-tempo, Western music audio signals. As a consequence, active research in the tempo detection domain has been shifted to yet open tasks like tempo analysis of non-percussive, expressive, or non-western music. Also, tempo detection is included in a large range of music-related software. In DJ software, features like beat-synching or tempo-synchronized sound effects are widely accepted in the DJ community, and their users rely on correct tempo hypothesis as their basis. In this paper, we are evaluating both academic and commercial tempo detection systems on a typical dataset of an urban club music DJ. Based on this evaluation, we identify octave errors as a problem that has not yet been solved. Further, an approach based on non-negative matrix factorization is presented. In its current state it can compete with the state of the art. It further provides a foundation to tackle the octave error issue in future research.

[1]  Joseph T. Wells Timing Is of the Essence , 2001 .

[2]  Sean R Eddy,et al.  What is dynamic programming? , 2004, Nature Biotechnology.

[3]  Jouni Paulus,et al.  Drum transcription with non-negative spectrogram factorisation , 2005, 2005 13th European Signal Processing Conference.

[4]  Matthew E. P. Davies,et al.  Evaluation of Audio Beat Tracking and Music Tempo Extraction Algorithms , 2007 .

[5]  Miguel A. Alonso,et al.  Tempo And Beat Estimation Of Musical Signals , 2004, ISMIR.

[6]  Yoichi Muraoka,et al.  A Real-Time Beat Tracking System for Audio Signals , 1996, ICMC.

[7]  Björn W. Schuller,et al.  Tango or Waltz?: Putting Ballroom Dance Style into Tempo Detection , 2008, EURASIP J. Audio Speech Music. Process..

[8]  Jaakko Astola,et al.  Analysis of the meter of acoustic musical signals , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[9]  Simon Dixon,et al.  Evaluation of the Audio Beat Tracking System BeatRoot , 2007 .

[10]  Daniel P. W. Ellis,et al.  Beat Tracking by Dynamic Programming , 2007 .

[11]  Christian Uhle,et al.  Automatisierte Extraktion rhythmischer Merkmale zur Anwendung in Music-Information Retrieval-Systemen , 2008 .

[12]  Emilia Gómez,et al.  Comparative Evaluation and Combination of Audio Tempo Estimation Approaches , 2011, Semantic Audio.

[13]  D. Moelants,et al.  Ambiguity in Tempo Perception: What Draws Listeners to Different Metrical Levels? , 2006 .

[14]  Andrew J. Viterbi,et al.  Error bounds for convolutional codes and an asymptotically optimum decoding algorithm , 1967, IEEE Trans. Inf. Theory.

[15]  George Tzanetakis,et al.  An experimental comparison of audio tempo induction algorithms , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[16]  Daniel Gärtner Tempo Estimation from Urban Music Using Non-Negative Matrix Factorization , 2011, Semantic Audio.

[17]  D. Ellis Beat Tracking by Dynamic Programming , 2007 .

[18]  Nick Collins Towards a style-specific basis for computational beat tracking , 2006 .

[19]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.