Polyphonic music separation based on the simplified energy splitter

In recent years, many approaches have been developed for separating polyphonic music into independent source signals. Because the mixture lacks information about the original signals, recovering the exact original waveforms is currently practically impossible. All of these approaches therefore aim to reconstruct signals that are at least in some way close to the originals, usually by exploiting common features of harmonic sounds. This paper proposes a system that uses frequency-domain instrument models as prior knowledge to reinsert the information needed for the separation. The system achieves a signal-to-distortion ratio (SDR) of over 18 dB for two simultaneous notes, and this figure degrades slowly as the level of polyphony increases. This makes the approach highly applicable both as a standalone separation tool and as a basis for other signal manipulation methods.
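To make the reported figure concrete, the following is a minimal sketch of how a signal-to-distortion ratio can be computed between an original note and its separated estimate. It assumes the common energy-ratio definition, 10 log10(||s||^2 / ||s - s_hat||^2); the abstract does not state which SDR variant the paper uses, and the function name `sdr_db` and the toy signals are illustrative only.

```python
import math


def sdr_db(reference, estimate):
    """Signal-to-distortion ratio in dB (assumed energy-ratio definition)."""
    if len(reference) != len(estimate):
        raise ValueError("reference and estimate must have the same length")
    # Energy of the original signal
    signal_energy = sum(s * s for s in reference)
    # Energy of the difference between original and separated signal
    distortion_energy = sum((s - e) ** 2 for s, e in zip(reference, estimate))
    if distortion_energy == 0.0:
        return math.inf  # perfect reconstruction
    return 10.0 * math.log10(signal_energy / distortion_energy)


# Toy example: a 440 Hz tone, and an estimate contaminated by a
# 660 Hz component at one tenth of the amplitude (hypothetical data).
sample_rate = 8000
t = [n / sample_rate for n in range(sample_rate)]
original = [math.sin(2 * math.pi * 440.0 * x) for x in t]
separated = [o + 0.1 * math.sin(2 * math.pi * 660.0 * x)
             for o, x in zip(original, t)]

print(round(sdr_db(original, separated), 1))  # prints 20.0
```

Under this definition, an SDR of 18 dB corresponds to a distortion energy of roughly 1.6% of the original signal energy.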
