Automatic music genre classification based on wavelet package transform and best basis algorithm

In this paper, an improved music genre classification method is presented. The proposed method makes use of the wavelet package transform (WPT) and the best basis algorithm (BBA) to accurately classify and increase classification performance. It is well known that WPT can generate a wavelet decomposition that offers a richer signal analysis. In this paper, the music signal is first decomposed into approximation and detail coefficients using WPT with the best basis algorithm to minimize the Shannon entropy and maximize the representation of music signal. This paper uses the Top-Down search strategy with cost function to select the best basis. Then the proposed method could apply support vector machine (SVM) to build a music genre classifier using the mel-frequency cepstral coefficients (MFCC) and log energies extracted from the decomposition coefficients of WPT with the best basis algorithm. Finally one can perform music genre classification with the built music genre classifier. Experiments conducted on three different music datasets have shown that the proposed method can achieve higher classification accuracy than other music genre classification methods with the same experimental setup.

[1]  Kun-Ming Yu,et al.  Automatic Music Genre Classification Based on Modulation Spectral Analysis of Spectral and Cepstral Features , 2009, IEEE Transactions on Multimedia.

[2]  Jeff A. Bilmes,et al.  MVA Processing of Speech Features , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  Tao Li,et al.  Toward intelligent music information retrieval , 2006, IEEE Transactions on Multimedia.

[4]  Lars Kai Hansen,et al.  Temporal Feature Integration for Music Genre Classification , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[5]  Andreas Rauber,et al.  Evaluation of Feature Extractors and Psycho-Acoustic Transformations for Music Genre Classification , 2005, ISMIR.

[6]  Arthur Flexer,et al.  A Closer Look on Artist Filters for Musical Genre Classification , 2007, ISMIR.

[7]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[8]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[9]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[10]  Changshui Zhang,et al.  Content-Based Information Fusion for Semi-Supervised Music Genre Classification , 2008, IEEE Transactions on Multimedia.

[11]  Tao Li,et al.  Factors in automatic musical genre classification of audio signals , 2003, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684).

[12]  Ronald R. Coifman,et al.  Entropy-based algorithms for best basis selection , 1992, IEEE Trans. Inf. Theory.

[13]  Trieu-Kien Truong,et al.  Audio classification and categorization based on wavelets and support vector Machine , 2005, IEEE Transactions on Speech and Audio Processing.

[14]  Stan Davis,et al.  Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Se , 1980 .

[15]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[16]  Chang Dong Yoo,et al.  Music genre classification using novel features and a weighted voting method , 2008, 2008 IEEE International Conference on Multimedia and Expo.

[17]  Ashraf A. Kassim,et al.  Wavelet packet transform basis selection method for set partitioning in hierarchical trees , 2008, J. Electronic Imaging.

[18]  N. Scaringella,et al.  Automatic genre classification of music content: a survey , 2006, IEEE Signal Process. Mag..