Music-genre classification system based on spectro-temporal features and feature selection

An automatic classification system of the music genres is proposed. Based on the timbre features such as mel-frequency cepstral coefficients, the spectro-temporal features are obtained to capture the temporal evolution and variation of the spectral characteristics of the music signal. Mean, variance, minimum, and maximum values of the timbre features are calculated. Modulation spectral flatness, crest, contrast, and valley are estimated for both original spectra and timbre-feature vectors. A support vector machine (SVM) is used as a classifier where an elaborated kernel function is defined. To reduce the computational complexity, an SVM ranker is applied for feature selection. Compared with the best algorithms submitted to the music information retrieval evaluation exchange (MIREX) contests, the proposed method provides higher accuracy at a lower feature dimension for the GTZAN and ISMIR2004 databases.

[1]  Emilia Gómez,et al.  Musical genre classification using melody features extracted from polyphonic music signals , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[2]  Constantine Kotropoulos,et al.  Non-Negative Tensor Factorization Applied to Music Genre Classification , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  Peter Knees,et al.  USING BLOCK-LEVEL FEATURES FOR GENRE CLASSIFICATION , TAG CLASSIFICATION AND MUSIC SIMILARITY ESTIMATION , 2010 .

[4]  Ming-Ju Wu,et al.  MIREX 2011 SUBMISSION - COMBINING VISUAL AND ACOUSTIC FEATURES FOR MUSIC GENRE CLASSIFICATION , 2011 .

[5]  Seok-Pil Lee,et al.  Music Genre Classification System Using Decorrelated Filter Bank , 2011 .

[6]  James R. Glass,et al.  Robust Speaker Recognition in Noisy Conditions , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  Constantine Kotropoulos,et al.  Non-Negative Multilinear Principal Component Analysis of Auditory Temporal Modulations for Music Genre Classification , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[9]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[10]  Chang Dong Yoo,et al.  Music genre classification using novel features and a weighted voting method , 2008, 2008 IEEE International Conference on Multimedia and Expo.

[11]  Kun-Ming Yu,et al.  Automatic Music Genre Classification using Modulation Spectral Contrast Feature , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[12]  Mert Bay,et al.  The Music Information Retrieval Evaluation eXchange: Some Observations and Insights , 2010, Advances in Music Information Retrieval.

[13]  Kichul Kim,et al.  Robust query-by-singing/humming system against background noise environments , 2011, IEEE Transactions on Consumer Electronics.

[14]  J. Stephen Downie,et al.  The music information retrieval evaluation exchange (2005-2007): A window into music information retrieval research , 2008 .

[15]  Jyh-Shing Roger Jang,et al.  Combining Visual and Acoustic Features for Music Genre Classification , 2011, 2011 10th International Conference on Machine Learning and Applications and Workshops.

[16]  Carlos Guadarrama,et al.  Nonlinear Audio Recurrence Analysis with Application to Music Genre Classification , 2010 .

[17]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[18]  Chang D. Yoo,et al.  Music information retrieval using novel features and a weighted voting method , 2009, 2009 IEEE International Symposium on Industrial Electronics.

[19]  Rocha Bruno Genre Classification based on Predominant Melodic Pitch Contours , 2014 .

[20]  Pasi Aalto,et al.  Matrix factorization methods for analysing diffusion battery data , 1991 .

[21]  Lie Lu,et al.  Music type classification by spectral contrast feature , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[22]  Les E. Atlas,et al.  Modulation-scale analysis for content identification , 2004, IEEE Transactions on Signal Processing.

[23]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[24]  Constantine Kotropoulos,et al.  Music Genre Classification: A Multilinear Approach , 2008, ISMIR.

[25]  Kun-Ming Yu,et al.  Automatic Music Genre Classification Based on Modulation Spectral Analysis of Spectral and Cepstral Features , 2009, IEEE Transactions on Multimedia.

[26]  Ming Li,et al.  THINKIT'S SUBMISSIONS FOR MIREX2009 AUDIO MUSIC CLASSIFICATION AND SIMILARITY TASKS , 2009 .

[27]  Zhouyu Fu,et al.  A Survey of Audio-Based Music Classification and Annotation , 2011, IEEE Transactions on Multimedia.

[28]  George Tzanetakis,et al.  MARSYAS SUBMISSIONS TO MIREX 2007 , 2007 .

[29]  Joan Serrà,et al.  Nonlinear audio recurrence analysis with application to genre classification , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[30]  Mark A. Hall,et al.  Correlation-based Feature Selection for Machine Learning , 2003 .

[31]  Xuan Zhu,et al.  An integrated music recommendation system , 2006, IEEE Transactions on Consumer Electronics.

[32]  Moo Young Kim,et al.  Music genre/mood classification using a feature-based modulation spectrum , 2011, International Conference on Mobile IT Convergence.