Music Content Analysis through Models of Audition

The direct application of ideas from music theory and music signal processing has not yet led to successful musical multimedia systems. We present a research framework that addresses the limitations of conventional approaches by questioning their (often tacit) underlying principles. We discuss several case studies from our own research on the extraction of musical rhythm, timbre, harmony, and structure from complex audio signals; these projects have demonstrated the power of an approach based on a realistic view of human listening abilities. Continuing research in this direction is necessary for the construction of robust systems for music content analysis.
