MUSICAL KEY ESTIMATION OF AUDIO SIGNAL BASED ON HIDDEN MARKOV MODELING OF CHROMA VECTORS

In this paper, we propose a system that automatically estimates the key of a music track using hidden Markov models (HMMs). The front-end of the system performs transient/noise reduction, estimates the tuning, and then represents the track as a sequence of chroma vectors over time. The characteristics of the major and minor modes are learned by training two HMMs on a labeled database. Twenty-four HMMs, one for each key, are then derived from the two trained models. The key of a music track is estimated by computing the likelihood of its chroma sequence given each HMM. The system is evaluated on a database of European baroque, classical and romantic music, with positive results. We compare the results with those obtained using a cognitive-based approach. We also compare the chroma-key profiles learned from the database with the cognitive-based ones.
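The decision step described above can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes diagonal-Gaussian emissions, assumes that the 24 key models are obtained by circularly shifting the chroma dimensions of the two mode models (the abstract only says they are "derived"), and assumes both mode models were trained with the tonic at pitch class 0. All function and variable names (`log_likelihood`, `transpose_model`, `estimate_key`, `mode_models`) are hypothetical.

```python
import numpy as np

KEY_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def log_gauss_diag(x, means, variances):
    """Log density of one chroma frame x (12,) under each state's
    diagonal Gaussian; means and variances have shape (n_states, 12)."""
    return -0.5 * np.sum(
        np.log(2 * np.pi * variances) + (x - means) ** 2 / variances, axis=1
    )


def log_likelihood(chroma, start, trans, means, variances):
    """Forward algorithm in the log domain for a chroma sequence (T, 12)."""
    log_trans = np.log(trans)
    log_alpha = np.log(start) + log_gauss_diag(chroma[0], means, variances)
    for frame in chroma[1:]:
        # scores[i, j]: log prob of being in state i, then moving to state j
        scores = log_alpha[:, None] + log_trans
        peak = scores.max(axis=0)
        log_alpha = peak + np.log(np.exp(scores - peak).sum(axis=0))
        log_alpha = log_alpha + log_gauss_diag(frame, means, variances)
    peak = log_alpha.max()
    return peak + np.log(np.exp(log_alpha - peak).sum())


def transpose_model(means, variances, semitones):
    """Assumed derivation of a key model: rotate the 12 chroma dimensions
    so that the tonic moves from pitch class 0 to pitch class `semitones`."""
    return (np.roll(means, semitones, axis=1),
            np.roll(variances, semitones, axis=1))


def estimate_key(chroma, mode_models):
    """mode_models maps a mode name to (start, trans, means, variances).
    Returns the most likely key label and its log-likelihood."""
    best_key, best_ll = None, -np.inf
    for mode, (start, trans, means, variances) in mode_models.items():
        for shift in range(12):
            m, v = transpose_model(means, variances, shift)
            ll = log_likelihood(chroma, start, trans, m, v)
            if ll > best_ll:
                best_key, best_ll = f"{KEY_NAMES[shift]} {mode}", ll
    return best_key, best_ll
```

Calling `estimate_key(chroma, {"major": major_model, "minor": minor_model})` scores all 24 candidate keys and returns the label with the highest log-likelihood. Sharing the transition matrix across transpositions is part of the same assumption: only the emission parameters are rotated.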
