Blind Music Timbre Source Isolation by Multi- resolution Comparison of Spectrum Signatures

Automatic indexing of music instruments for multi-timbre sounds is challenging, especially when partials from different sources are overlapping with each other. Temporal features, which have been successfully applied in monophonic sound timbre identification, failed to isolate music instrument in multi-timbre objects, since the detection of the start and end position of each music segment unit is very difficult. Spectral features of MPEG7 and other popular features provide economic computation but contain limited information about timbre. Being compared to the spectral features, spectrum signature features have less information loss; therefore may identify sound sources in multitimbre music objects with higher accuracy. However, the high dimensionality of spectrum signature feature set requires intensive computing and causes estimation efficiency problem. To overcome these problems, the authors developed a new multi-resolution system with an iterative spectrum band matching device to provide fast and accurate recognition.

[1]  俊一 甘利,et al.  A. Hyvärinen, J. Karhunen and E. Oja, Independent Component Analysis, Jhon Wiley & Sons, 2001年,504ページ. (根本幾・川勝真喜訳:独立成分分析——信号解析の新しい世界,東京電機大学出版局,2005年,532ページ.) , 2010 .

[2]  Malcolm Slaney,et al.  Construction and evaluation of a robust multifeature speech/music discriminator , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  E. C. Cmm,et al.  on the Recognition of Speech, with , 2008 .

[4]  Özgür Yilmaz,et al.  Blind separation of disjoint orthogonal signals: demixing N sources from 2 mixtures , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[5]  Anssi Klapuri,et al.  Separation of harmonic sound sources using sinusoidal modeling , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[6]  Zbigniew W. Ras,et al.  Music Instrument Estimation in Polyphonic Sound Based on Short-Term Spectrum Match , 2009, Foundations of Computational Intelligence.

[7]  Rémi Gribonval,et al.  Construction d'estimateurs oracles pour la séparation de sources , 2005 .

[8]  Masataka Goto,et al.  Instrument Identification in Polyphonic Music: Feature Weighting to Minimize Influence of Sound Overlaps , 2007, EURASIP J. Adv. Signal Process..

[9]  P. Philippe,et al.  One microphone singing voice separation using source-adapted models , 2005, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005..

[10]  Michael A. Casey,et al.  Separation of Mixed Audio Sources By Independent Subspace Analysis , 2000, ICMC.

[11]  Scott Rickard,et al.  ROBUSTNESS OF PARAMETRIC SOURCE DEMIXING IN ECHOIC ENVIRONMENTS , 2001 .

[12]  Jean-Francois Cardoso,et al.  Blind signal separation: statistical principles , 1998, Proc. IEEE.

[13]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[14]  Erkki Oja,et al.  Independent Component Analysis , 2001 .

[15]  Beth Logan,et al.  Mel Frequency Cepstral Coefficients for Music Modeling , 2000, ISMIR.

[16]  Zbigniew W. Ras,et al.  Analysis of Sound Features for Music Timbre Recognition , 2007, 2007 International Conference on Multimedia and Ubiquitous Engineering (MUE'07).

[17]  Guy J. Brown,et al.  Computational auditory scene analysis , 1994, Comput. Speech Lang..

[18]  Zbigniew W. Ras,et al.  Maximum Likelihood Study for Sound Pattern Separation and Recognition , 2007, 2007 International Conference on Multimedia and Ubiquitous Engineering (MUE'07).

[19]  Piotr Dalka,et al.  Estimation of Musical Sound Separation Algorithm Effectiveness Employing Neural Networks , 2005, Journal of Intelligent Information Systems.

[20]  Jürgen Herre,et al.  MPEG-7 and MPEG-7 audio: An overview , 2001 .