Paper information - Audio Descriptors and Descriptor Schemes in the Context of MPEG-7

Sound content description is one of the aims of the MPEG-7 initiative. Although MPEG-7 focuses on indexing and retrieval of audio, other content-based sound processing applications can be developed once we have a robust set of descriptors, together with structures for relating them to one another and for expressing semantic concerns about sound. Spectral Modeling techniques provide a valuable framework for extracting and organizing sound content descriptions. All our descriptors can be considered low- or mid-level, and we have devised automatic extraction procedures for them. In this paper we present the basic descriptors and introduce a scheme for organizing them (a so-called Descriptor Scheme). We do not cover the high-level description of music (musical forms and styles, roles of characters in a movie, etc.), which is also relevant in MPEG-7.
