Recovering Overlapping Partials for Monaural Perfect Harmonic Musical Sound Separation Using Modified Common Amplitude Modulation

Monaural musical sound separation attempts to isolate one or more instrument sources from a single-channel polyphonic mixture. The primary challenge is to accurately separate pitched musical sounds whose partials overlap. In the general case, each source has at least one non-overlapping partial. In some cases, however, at least one source has no non-overlapping partial, which makes the problem considerably harder. To address this problem, we propose a novel method named Modified Common Amplitude Modulation (Modified-CAM), which aims to separate monaural mixtures of musical sounds at perfect harmonic intervals (excluding the unison), in particular the perfect octave. The proposed system can accurately recover both the amplitudes and the phases of the overlapping partials in the perfect octave, which can be regarded as the typical and hardest case. We evaluate the proposed system on the University of Iowa musical instrument sample database, and the experimental results show the system’s superiority.
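
To illustrate why the perfect octave is the hardest case, the following minimal sketch lists which partials of two notes remain non-overlapping for a given interval. It is an illustration only and not part of Modified-CAM. It assumes ideal harmonic sources (partials at exact integer multiples of the fundamental), an exact just-intonation frequency ratio, and an unbounded partial series for the other source. The function name `free_partials` and its parameters are hypothetical.

```python
from fractions import Fraction


def free_partials(ratio, n_partials=8):
    """List the non-overlapping partial indices of two ideal harmonic sources.

    ratio      -- assumed exact frequency ratio f_upper / f_lower (e.g. 2 for an octave)
    n_partials -- how many partials of each source to inspect
    Returns (lower_free, upper_free): partial indices with no coinciding
    partial in the other source.
    """
    ratio = Fraction(ratio)
    # Partial m of the lower note (frequency m * f_lower) overlaps when
    # m / ratio is an integer, i.e. it equals some partial of the upper note.
    lower_free = [m for m in range(1, n_partials + 1)
                  if (m / ratio).denominator != 1]
    # Partial k of the upper note (frequency k * ratio * f_lower) overlaps when
    # k * ratio is an integer, i.e. it equals some partial of the lower note.
    upper_free = [k for k in range(1, n_partials + 1)
                  if (k * ratio).denominator != 1]
    return lower_free, upper_free


if __name__ == "__main__":
    print("octave:", free_partials(2))
    print("fifth: ", free_partials(Fraction(3, 2)))
```

Under these assumptions, the octave leaves the upper note with no non-overlapping partial at all, since each of its partials coincides with an even partial of the lower note, whereas a perfect fifth leaves the odd partials of the upper note free.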
