Rhythm and Tempo Analysis Toward Automatic Music Transcription

This paper discusses model-based rhythm and tempo analysis of music data in the MIDI format. The data is assumed to be obtained from a module performing multi-pitch analysis of music acoustic signals inside an automatic transcription system. In performed music, observed note lengths and local tempo fluctuate from the nominal note lengths and long-term tempo. Applying the framework of continuous speech recognition to rhythm recognition, we take a probabilistic top-down approach on the joint estimation of rhythm and tempo from the performed onset events in MIDI data. Short-term rhythm patterns are extracted from existing music samples and form a "rhythm vocabulary." Local tempo is represented by a smooth curve. The entire problem is formulated as an integrated optimization problem to maximize a posterior probability, which can be solved by an iterative algorithm which alternately estimates rhythm and tempo. Evaluation of the algorithm through various experiments is also presented.

[1]  Nell P. McAngusTodd,et al.  The dynamics of dynamics: A model of musical expression , 1992 .

[2]  Hirokazu Kameoka,et al.  Probabilistic Approach to Automatic Music Transcription from Audio Signals , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[3]  Simon Dixon,et al.  Automatic Extraction of Tempo and Beat From Expressive Performances , 2001 .

[4]  H. Kameoka,et al.  HARMONIC-TEMPORAL-STRUCTURED CLUSTERING VIA DETERMINISTIC ANNEALING EM ALGORITHM FOR AUDIO FEATURE EXTRACTION , 2005 .

[5]  Jaakko Astola,et al.  Analysis of the meter of acoustic musical signals , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[6]  B. Repp Diversity and commonality in music performance: an analysis of timing microstructure in Schumann's "Träumerei". , 1992, The Journal of the Acoustical Society of America.

[7]  Tomoshi Otsuki,et al.  Hidden Markov model for automatic transcription of MIDI signals , 2002, 2002 IEEE Workshop on Multimedia Signal Processing..

[8]  Ali Taylan Cemgil,et al.  Monte Carlo Methods for Tempo Tracking and Rhythm Quantization , 2011, J. Artif. Intell. Res..

[9]  David Rosenthal,et al.  Emulation of human rhythm perception , 1992 .

[10]  Biing-Hwang Juang,et al.  The segmental K-means algorithm for estimating parameters of hidden Markov models , 1990, IEEE Trans. Acoust. Speech Signal Process..

[11]  Christopher Raphael,et al.  Automated Rhythm Transcription , 2001, ISMIR.

[12]  Masataka Goto,et al.  A Learning-Based Quantization: Unsupervised Estimation of the Model Parameters , 2003, ICMC.

[13]  Henkjan Honing The Final Ritard: On Music, Motion, and Kinematic Models , 2003, Computer Music Journal.