Separation of speech and music sources from a single-channel mixture using discrete energy separation algorithm

In this paper, we address the problem of monaural source separation of a mixed signal containing speech and piano components. We use Discrete Energy Separation Algorithm (DESA) to estimate frequency-modulating (FM) signal energy. We design a time-varying filter in the time-frequency domain for rejecting the interfering signal. An estimation of the FM signal energy employs instantaneous signal properties that are localized both in time and frequency. We present experimental results which demonstrate the advantages of the proposed method using real audio signals.

[1]  Tuomas Virtanen,et al.  Separation of drums from polyphonic music using non-negative matrix factorization and support vector machine , 2005, 2005 13th European Signal Processing Conference.

[2]  Rémi Gribonval,et al.  Adaptation of Bayesian Models for Single-Channel Source Separation and its Application to Voice/Music Separation in Popular Songs , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  Rémi Gribonval,et al.  Audio source separation with a single sensor , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[4]  Rémi Gribonval,et al.  Oracle estimators for the benchmarking of source separation algorithms , 2007, Signal Process..

[5]  Israel Cohen,et al.  Monaural speech/music source separation using discrete energy separation algorithm , 2010, Signal Process..

[6]  J. F. Kaiser,et al.  On a simple algorithm to calculate the 'energy' of a signal , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[7]  Scott Rickard,et al.  Blind separation of speech mixtures via time-frequency masking , 2004, IEEE Transactions on Signal Processing.

[8]  Laurent Benaroya,et al.  WIENER BASED SOURCE SEPARATION WITH HMM/GMM USING A SINGLE SENSOR , 2003 .

[9]  H. M. Teager,et al.  Evidence for Nonlinear Sound Production Mechanisms in the Vocal Tract , 1990 .

[10]  Petros Maragos,et al.  Energy separation in signal modulations with application to speech analysis , 1993, IEEE Trans. Signal Process..

[11]  R. Gribonval,et al.  Proposals for Performance Measurement in Source Separation , 2003 .

[12]  David G. Stork,et al.  Pattern Classification (2nd ed.) , 1999 .