论文信息 - Audio Analysis using the Discrete Wavelet Transform

Audio Analysis using the Discrete Wavelet Transform

The Discrete Wavelet Transform (DWT) is a transformation that can be used to analyze the temporal and spectral properties of non-stationary signals like audio. In this paper we describe some applications of the DWT to the problem of extracting information from non-speech audio. More specifically automatic classification of various types of audio using the DWT is described and compared with other traditional feature extractors proposed in the literature. In addition, a technique for detecting the beat attributes of music is presented. Both synthetic and real world stimuli were used to evaluate the performance of the beat detection algorithm. Key-Words: audio analysis, wavelets, classification, beat extraction

[1] George Tzanetakis,et al. Multifeature audio segmentation for browsing and annotation , 1999, Proceedings of the 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. WASPAA'99 (Cat. No.99TH8452).

[2] Richard Kronland-Martinet,et al. Analysis of Sound Patterns through Wavelet transforms , 1987, Int. J. Pattern Recognit. Artif. Intell..

[3] Stéphane Mallat,et al. A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[4] I. Daubechies. Orthonormal bases of compactly supported wavelets , 1988 .

[5] Jonathan Foote,et al. An overview of audio information retrieval , 1999, Multimedia Systems.

[6] Paul Mermelstein,et al. Experiments in syllable-based recognition of continuous speech , 1980, ICASSP.

[7] S. R. Subramanya,et al. Transform-based indexing of audio data for multimedia databases , 1997, Proceedings of IEEE International Conference on Multimedia Computing and Systems.

[8] Eric D. Scheirer,et al. Tempo and beat analysis of acoustic musical signals. , 1998, The Journal of the Acoustical Society of America.

[9] Malcolm Slaney,et al. Construction and evaluation of a robust multifeature speech/music discriminator , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10] George Tzanetakis,et al. MARSYAS: a framework for audio analysis , 1999, Organised Sound.

[11] Douglas Keislar,et al. Content-Based Classification, Search, and Retrieval of Audio , 1996, IEEE Multim..