Time Variability-Based Hierarchic Recognition of Multiple Musical Instruments in Recordings

The research reported in this chapter is focused on automatic identification of musical instruments in polyphonic audio recordings. Random forests have been used as a classification tool, pre-trained as binary classifiers to indicate presence or absence of a target instrument. Feature set includes parameters describing frame-based properties of a sound. Moreover, in order to capture the patterns which emerge on the time scale, new temporal parameters are introduced to supply additional temporal information for the timbre recognition. In order to achieve higher estimation rate, we investigated a feature-driven hierarchical classification of musical instruments built using agglomerative clustering strategy. Experiments showed that the performance of classifiers based on this new classification of instruments schema is better than performance of the traditional flat classifiers, which directly estimate the instrument. Also, they outperform the classifiers based on the classical Hornbostel-Sachs schema.

[1]  Jonathan Foote,et al.  An overview of audio information retrieval , 1999, Multimedia Systems.

[2]  Xavier Rodet,et al.  MUSICAL INSTRUMENT IDENTIFICATION IN CONTINUOUS RECORDINGS , 2004 .

[3]  Guy J. Brown,et al.  Application of missing feature theory to the recognition of musical instruments in polyphonic audio , 2003, ISMIR.

[4]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[5]  Zbigniew W. Ras,et al.  Advances in Music Information Retrieval , 2012, Advances in Music Information Retrieval.

[6]  Peter A. Flach,et al.  Evaluation Measures for Multi-class Subgroup Discovery , 2009, ECML/PKDD.

[7]  Bin Cui,et al.  Intelligent Music Information Systems: Tools and Methodologies , 2007 .

[8]  Anssi Klapuri,et al.  Automatic Classification of Pitched Musical Instrument Sounds , 2006 .

[9]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[10]  J C Brown Computer identification of musical instruments using pattern recognition with cepstral coefficients as features. , 1999, The Journal of the Acoustical Society of America.

[11]  Zbigniew W. Ras,et al.  Recognition of Instrument Timbres in Real Polytimbral Audio Recordings , 2010, ECML/PKDD.

[12]  Технология Springer Science+Business Media , 2013 .

[13]  Masataka Goto,et al.  RWC Music Database: Popular, Classical and Jazz Music Databases , 2002, ISMIR.

[14]  Ingo Mierswa Collaborative Use of Features in a Distributed System for the Organization of Music Collections , 2008 .

[15]  Bozena Kostek,et al.  Musical instrument classification and duet analysis employing music information retrieval techniques , 2004, Proceedings of the IEEE.

[16]  Perfecto Herrera-Boyer,et al.  Automatic Classification of Musical Instrument Sounds , 2003 .

[17]  Meinard Müller,et al.  Information retrieval for music and motion , 2007 .

[18]  Anssi Klapuri,et al.  Signal Processing Methods for Music Transcription , 2006 .

[19]  Zbigniew W. Ras,et al.  Maximum Likelihood Study for Sound Pattern Separation and Recognition , 2007, 2007 International Conference on Multimedia and Ubiquitous Engineering (MUE'07).

[20]  D. Niewiadomy,et al.  Implementation of MFCC vector generation in classification context , 2008 .