论文信息 - Musical key extraction from audio

Musical key extraction from audio

The realisation and evaluation of a musical key extraction algorithm that works directly on raw audio data is presented. Its implementation is based on models of human auditory perception and music cognition. It is straightforward and has minimal computing requirements. First, it computes a chromagram from non-overlapping 100 msecs time frames of audio; a chromagram represents the likelihood of the chroma occurrences in the audio. This chromagram is correlated with Krumhansl’s key profiles that represent the perceived stability of each chroma within the context of a particular musical key. The key profile that has maximum correlation with the computed chromagram is taken as the most likely key. An evaluation with 237 CD recordings of classical piano sonatas indicated a classification accuracy of 75.1%. By considering the exact, relative, dominant, sub-dominant and parallel keys as similar keys, the accuracy is even 94.1%.

Steffen Pauws | S. Pauws

[1] Piet G. Vos,et al. A parallel-processing key-finding model , 1996 .

[2] David Temperley,et al. An Algorithm for Harmonic Analysis , 1997 .

[3] Ilya Shmulevich,et al. Localized Key Finding: Algorithms and Applications , 2000 .

[4] Mark Steedman,et al. On Interpreting Bach , 1987 .

[5] S. R. Holtzman. A program for key determination , 1977 .

[6] Chris Chafe,et al. Toward an Intelligent Editor of Digital Audio: Recognition of Musical Constructs , 1982 .

[7] Richard Parncutt,et al. AN IMPROVED MODEL OF TONALITY PERCEPTION INCORPORATING PITCH SALIENCE AND ECHOIC MEMORY , 1993 .

[8] C. Krumhansl. Cognitive Foundations of Musical Pitch , 1990 .

[9] Elaine Chew,et al. The Spiral Array: An Algorithm for Determining Key Boundaries , 2002, ICMAI.

[10] R. G. Crowder,et al. Perception of the Major/Minor Distinction: IV. Emotional Connotations in Young Children , 1990 .

[11] Marc Leman. Schema-based tone center recognition of musical signals , 1994 .