Information Rate for Fast Time-Domain Instrument Classification

In this paper, we propose a novel feature set for instrument classification which is based on the information rate of the signal in the time domain. The feature is extracted by calculating the Shannon entropy over a sliding short-time energy frame and binning statistical features into a unique feature vector. Experimental results are presented, including a comparison to frequency-domain feature sets. The proposed entropy features are shown to be faster than popular frequency-domain methods while maintaining comparable accuracy in an instrument classification task.

[1]  Moe Pwint,et al.  An Efficient Approach for Classification of Speech and Music , 2008, PCM.

[2]  Antti Eronen,et al.  Comparison of features for musical instrument recognition , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[3]  Edgar Chávez,et al.  A Robust Entropy-Based Audio-Fingerprint , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[4]  Lars Kai Hansen,et al.  On the Relevance of Spectral Features for Instrument Classification , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[5]  Mark B. Sandler,et al.  Classification of audio signals using statistical features on time and wavelet transform domains , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[6]  Stephen Cranefield,et al.  A Study on Feature Analysis for Musical Instrument Classification , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[7]  Nurgun Erdol,et al.  Recovery of missing speech packets using the short-time energy and zero-crossing measurements , 1993, IEEE Trans. Speech Audio Process..

[8]  Biing-Hwang Juang,et al.  Audio signal classification with temporal envelopes , 2011, IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  Patrick Susini,et al.  The Timbre Toolbox: extracting audio descriptors from musical signals. , 2011, The Journal of the Acoustical Society of America.

[10]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[11]  Ramón F. Brena,et al.  Classification of environmental audio signals using statistical time and frequency features , 2014, 2014 International Conference on Electronics, Communications and Computers (CONIELECOMP).

[12]  Perfecto Herrera-Boyer,et al.  Automatic Classification of Musical Instrument Sounds , 2003 .