Perceptual feature based music classification - A DSP perspective for a new type of application

Today, more and more computational power is available not only in desktop computers but also in portable devices such as smart phones or PDAs. At the same time the availability of huge non-volatile storage capacities (flash memory etc.) suggests to maintain huge music databases even in mobile devices. Automated music classification promises to allow keeping a much better overview on huge data bases for the user. Such a classification enables the user to sort the available huge music archives according to different genres which can be either predefined or user defined. It is typically based on a set of perceptual features which are extracted from the music data. Feature extraction and subsequent music classification are very computational intensive tasks. Today, a variety of music features and possible classification algorithms optimized for various application scenarios and achieving different classification qualities are under discussion. In this paper results concerning the computational needs and the achievable classification rates on different processor architectures are presented. The inspected processors include a general purpose P IV dual core processor, heterogeneous digital signal processor architectures like a Nomadik STn8810 (featuring a smart audio accelerator, SAA) as well as an OMAP2420. In order to increase classification performance, different forms of feature selection strategies (heuristic selection, full search and Mann-Whitney-Test) are applied. Furthermore, the potential of a hardware-based acceleration for this class of application is inspected by performing a fine as well as a coarse grain instruction tree analysis. Instruction trees are identified, which could be attractively implemented as custom instructions speeding up this class of applications.

[1]  Uwe Meyer-Baese,et al.  Digital Signal Processing with Field Programmable Gate Arrays , 2001 .

[2]  Hans-Peter Kriegel,et al.  Hierarchical Genre Classification for Large Music Collections , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[3]  Daniel P. W. Ellis,et al.  A Large-Scale Evaluation of Acoustic and Subjective Music-Similarity Measures , 2004, Computer Music Journal.

[4]  François Pachet,et al.  A taxonomy of musical genres , 2000, RIAO.

[5]  David Pye,et al.  Content-based methods for the management of digital music , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[6]  Katharina Morik,et al.  Automatic Feature Extraction for Classifying Audio Data , 2005, Machine Learning.

[7]  Alessandro Lameiras Koerich,et al.  Automatic classification of audio data , 2004, 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583).

[8]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[9]  Qi Tian,et al.  Musical genre classification using support vector machines , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[10]  Yibin Zhang,et al.  A study on content-based music classification , 2003, Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings..

[11]  Changsheng Xu,et al.  Automatic music classification and summarization , 2005, IEEE Transactions on Speech and Audio Processing.

[12]  N. Scaringella,et al.  Automatic genre classification of music content: a survey , 2006, IEEE Signal Process. Mag..

[13]  J. Schmee Applied Statistics—A Handbook of Techniques , 1984 .

[14]  Jarno Seppänen,et al.  Joint Beat & Tatum Tracking from Music Signals , 2006, ISMIR.

[15]  Peter M. W. Knijnenburg,et al.  Code Size Reduction by Compiler Tuning , 2006, SAMOS.