Audio Signal Classification: An Overview

Audio signal classification consists of extracting physical and perceptual features from a sound, and of using these features to identify into which of a set of classes the sound is most likely to fit. The feature extraction and classification algorithms used can be quite diverse depending on the classification domain of the application. This paper presents an overview of the current state of the audio signal classification research literature.

[1]  Martin Piszczalski A computational model of music transcription , 1986 .

[2]  S. Coren,et al.  In Sensation and perception , 1979 .

[3]  Malcolm Slaney,et al.  Construction and evaluation of a robust multifeature speech/music discriminator , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Curtis Roads,et al.  The Computer Music Tutorial , 1996 .

[5]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[6]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[7]  Erling H. Wold,et al.  Content-Based Search, and Retrieval of Audio , 1996 .

[8]  John Saunders,et al.  Real-time discrimination of broadcast speech/music , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[9]  B. Kedem,et al.  Spectral analysis and discrimination by zero-crossings , 1986, Proceedings of the IEEE.

[10]  Jürgen Schürmann,et al.  Pattern classification , 1996 .

[11]  Douglas Keislar,et al.  Content-Based Classification, Search, and Retrieval of Audio , 1996, IEEE Multim..

[12]  R. Lathe Phd by thesis , 1988, Nature.

[13]  B. Feiten,et al.  Automatic indexing of a sound database using self-organizing neural nets , 1994 .

[14]  T. Denoeux Pattern Classiication , 1996 .