Tonal Description of Polyphonic Audio for Music Content Processing

We present a method to extract a description of the tonal aspects of music from polyphonic audio signals. We define this tonal description using different levels of abstraction, differentiating between low-level signal descriptors and high-level textual labels. We also establish different temporal scales for description, defining some features as being attached to a certain time instant, and other global descriptors as related to a wider segment. The description is validated by estimating the key of a piece. We also propose the description as a tonal representation of the polyphonic audio signal to measure tonal similarity between audio excerpts and to establish the tonal structure of a musical piece.

[1]  Judith C. Brown,et al.  An efficient algorithm for the calculation of a constant Q transform , 1992 .

[2]  Roger B. Dannenberg,et al.  Music Representation Issues,Techniques,and Systems , 1993 .

[3]  Marc Leman,et al.  Musical Audio Mining , 2002 .

[4]  William Rothstein,et al.  Articles on Schenker and Schenkerian Theory in The New Grove Dictionary of Music and Musicians, 2nd Edition Edited by Stanley Sadie Executive Editor John Tyrrell New York: Grove's Dictionaries, 2001 Also Available Online (by Subscription) at , 2001 .

[5]  Jordi Bonada,et al.  Automatic technique in frequency domain for near-lossless time-scale modification of audio , 2000, ICMC.

[6]  Marc Leman,et al.  Tendencies, perspectives, and opportunities of musical audio-mining , 2002 .

[7]  Klaus Obermayer,et al.  Correspondence Analysis for Visualizing Interplay of Pitch Class, Key, and Composer , 2003 .

[8]  Klaus Obermayer,et al.  A new method for tracking modulations in tonal music in audio data format , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[9]  Roger B. Dannenberg,et al.  A Brief Survey of Music Representation Issues, Techniques, and Systems , 1993 .

[10]  C. Krumhansl Cognitive Foundations of Musical Pitch , 1990 .

[11]  Yoichi Muraoka,et al.  Real-time beat tracking for drumless audio signals: Chord change detection for musical decisions , 1999, Speech Commun..

[12]  Xavier Serra,et al.  Musical Sound Modeling with Sinusoids plus Noise , 1997 .

[13]  Takuya Fujishima,et al.  Realtime Chord Recognition of Musical Sound: a System Using Common Lisp Music , 1999, ICMC.

[14]  George Tzanetakis,et al.  Pitch Histograms in Audio and Symbolic Music Information Retrieval , 2003, ISMIR.

[15]  Anssi Klapuri,et al.  Signal Processing Methods for the Automatic Transcription of Music , 2004 .

[16]  Marc Leman,et al.  The Cortical Topography of Tonal Structures Underlying Western Music , 2002, Science.

[17]  Stephen Travis Pope,et al.  Musical Signal Processing , 1997 .

[18]  Daniel P. W. Ellis,et al.  Chord segmentation and recognition using EM-trained hidden markov models , 2003, ISMIR.

[19]  E. Chew Towards a mathematical model of tonality , 2000 .

[20]  David Temperley,et al.  What's Key for Key? The Krumhansl-Schmuckler Key-Finding Algorithm Reconsidered , 1999 .