Applying Supervised Classifiers Based on Non-negative Matrix Factorization to Musical Instrument Classification

In this paper, a new approach for automatic audio classification using non-negative matrix factorization (NMF) is presented. Training is performed onto each audio class individually, whilst during the test phase each test recording is projected onto the several training matrices. Experiments demonstrating the efficiency of the proposed approach were performed for musical instrument classification. Several perceptual features as well as MPEG-7 descriptors were measured for 300 sound recordings consisting of 6 different musical instrument classes. Subsets of the feature set were selected using branch-and-bound search, in order to obtain the most discriminating features for classification. Several NMF techniques were utilized, namely the standard NMF method, the local NMF, and the sparse NMF. The experiments demonstrate an almost perfect classification (classification error 1.0%), outperforming the state-of-the-art techniques tested for the aforementioned experiment

[1]  Constantine Kotropoulos,et al.  Comparison of subspace analysis-based and statistical model-based algorithms for musical instrument classification , 2005 .

[2]  Elias Pampalk,et al.  Please Scroll down for Article Journal of New Music Research the Som-enhanced Jukebox: Organization and Visualization of Music Collections Based on Perceptual Models , 2022 .

[3]  J C Brown,et al.  Feature dependence in the automatic identification of musical woodwind instruments. , 2001, The Journal of the Acoustical Society of America.

[4]  Hugo Fastl,et al.  Psychoacoustics: Facts and Models , 1990 .

[5]  David M. J. Tax,et al.  One-class classification , 2001 .

[6]  Wei-Ying Ma,et al.  Mining ratio rules via principal sparse non-negative matrix factorization , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[7]  Thomas Sikora,et al.  Audio classification based on MPEG-7 spectral basis representations , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[8]  Stan Z. Li,et al.  Learning spatially localized, parts-based representation , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[9]  Xavier Rodet,et al.  The importance of cross database evaluation in musical instrument sound classification: A critical approach , 2003, ISMIR.

[10]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[11]  T. Subba Rao,et al.  Classification, Parameter Estimation and State Estimation: An Engineering Approach Using MATLAB , 2004 .

[12]  Elias Pampalk,et al.  Using PsychoAcoustic Models and SOMs to create a Hierarchical Structuring of Music Using PsychoAcoustic Models and Self-Organizing Maps to Create a Hierarchical Structuring of Music by Sound Similarity , 2002 .

[13]  Andreas Rauber,et al.  Automatically Analyzing and Organizing Music Archives , 2001, ECDL.

[14]  Andreas Rauber,et al.  Evaluation of Feature Extractors and Psycho-Acoustic Transformations for Music Genre Classification , 2005, ISMIR.

[15]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[16]  Piotr Synak,et al.  Application of Temporal Descriptors to Musical Instrument Sound Recognition , 2003, Journal of Intelligent Information Systems.