Template Based Key Finding from audio

A model for template based key finding from audio is presented and two methods that implement this model are compared. Templates are computed from a weighted combination of spectra obtained from sound recordings of monophonic musical notes. Individual weights of notes contributing to the templates are determined by profiles representing tonal hierarchies in Western music. Key determination is based on the correlations between spectral summary information obtained from audio input and the precomputed templates. The first method that implements the template based model utilizes a pure spectral representation and the second uses a chroma based representation. An overall evaluation of the model and comparison between the two methods are shown using a test audio collection. Performance results are presented for different profiles and a variety of analysis durations. Results are encouraging and show that template based models are viable for key finding from audio.

[1]  D. Temperley The Cognition of Basic Musical Structures , 2001 .

[2]  C. Krumhansl,et al.  Tracing the dynamic changes in perceived tonal organization in a spatial representation of musical keys. , 1982 .

[3]  Ching-Hua Chuan,et al.  Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[4]  E. Chew Towards a mathematical model of tonality , 2000 .

[5]  Klaus Obermayer,et al.  Constant Q Profiles for Tracking Modulations in Audio Data Format , 2001, ICMC.

[6]  G. H. Wakefield,et al.  To catch a chorus: using chroma-based representations for audio thumbnailing , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[7]  Emilia Gómez,et al.  Estimating The Tonality Of Polyphonic Audio Files: Cognitive Versus Machine Learning Modelling Strategies , 2004, ISMIR.

[8]  Daniel P. W. Ellis,et al.  Chord Recognition and Segmentation Using EM-trained Hidden Markov Models , 2003 .

[9]  George Tzanetakis,et al.  Polyphonic audio matching and alignment for music retrieval , 2003, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684).

[10]  Steffen Pauws,et al.  Musical key extraction from audio , 2004, ISMIR.

[11]  Ilya Shmulevich,et al.  Localized Key Finding: Algorithms and Applications , 2000 .

[12]  Takuya Fujishima,et al.  Realtime Chord Recognition of Musical Sound: a System Using Common Lisp Music , 1999, ICMC.

[13]  Marc Leman Tonal context by pattern-integration over time , 1991 .

[14]  Jérôme Barthélemy,et al.  Figured Bass and Tonality Recognition , 2001, ISMIR.

[15]  C. Krumhansl Cognitive Foundations of Musical Pitch , 1990 .

[16]  Mohan S. Kankanhalli,et al.  Music Key Detection for Musical Audio , 2005, 11th International Multimedia Modelling Conference.

[17]  Richard Parncutt,et al.  AN IMPROVED MODEL OF TONALITY PERCEPTION INCORPORATING PITCH SALIENCE AND ECHOIC MEMORY , 1993 .

[18]  Özgür Izmirli,et al.  A model for tonal context time course calculation from acoustical input , 1996 .