Boosting for content-based audio classification and retrieval: an evaluation

In this paper, we evaluate a recently proposed algorithm in machine learning called AdaBoost for content-based audio classification and retrieval. AdaBoost is a kind of large margin classifiers and is efficient for on-line learning. Our focus is to evaluate its classification and retrieval accuracy as compared with other methods. The Muscle Fish audio database of 409 sounds is used for the evaluation with perceptual and cepstral features.

[1]  Tamas Ungvary,et al.  Organization of Sounds with Neural Nets , 1991, ICMC.

[2]  B. Feiten,et al.  Automatic indexing of a sound database using self-organizing neural nets , 1994 .

[3]  Yoav Freund,et al.  Boosting the margin: A new explanation for the effectiveness of voting methods , 1997, ICML.

[4]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[5]  Paul A. Viola,et al.  Boosting Image Retrieval , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[6]  Jonathan Foote,et al.  An overview of audio information retrieval , 1999, Multimedia Systems.

[7]  Tsuhan Chen,et al.  Audio feature extraction and analysis for scene classification , 1997, Proceedings of First Signal Processing Society Workshop on Multimedia Signal Processing.

[8]  Jonathan Foote,et al.  Content-based retrieval of music and audio , 1997, Other Conferences.

[9]  Douglas Keislar,et al.  Content-Based Classification, Search, and Retrieval of Audio , 1996, IEEE Multim..

[10]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[11]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[12]  B. S. Manjunath,et al.  Texture Features for Browsing and Retrieval of Image Data , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Wolfgang Effelsberg,et al.  Automatic audio content analysis , 1997, MULTIMEDIA '96.

[14]  Stan Z. Li,et al.  Content-based Classification and Retrieval of Audio Using the Nearest Feature Line Method , 2000 .

[15]  W. Andrew Schloss,et al.  Toward an Intelligent Editor of Digital Audio: Signal Processing Methods , 1982 .

[16]  Stan Z. Li,et al.  Content-based audio classification and retrieval using the nearest feature line method , 2000, IEEE Trans. Speech Audio Process..

[17]  Guodong Guo,et al.  Content-Based Audio Classification and Retrieval Using SVM Learning , 2000 .