Coherent modulation spectral filtering for single-channel music source separation

Modulation spectral filtering, if effective and distortion-free, would offer a new tool for signal modification. Previous approaches to modulation spectral filtering, which made use of incoherent detection of real and positive modulating envelopes for each frequency sub-band, have not offered effective and distortion-free signal modification. Based upon a recent observation that the modulating envelopes are potentially complex, coherent detection is instead proposed. Details are provided for accurate carrier estimation, and tests on both synthetic signals and music, show that modulation filtering is indeed distortion-free. The coherent modulation filtering method is applied to single-channel music sound source separation with promising results for music and other signal separation and modification applications.

[1]  Yuji Murahara,et al.  Modulation enhancement of speech as a preprocessing for reverberant chambers with the hearing-impaired , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[2]  O Ghitza,et al.  On the upper cutoff frequency of the auditory critical-band envelope detectors in the context of speech perception. , 2001, The Journal of the Acoustical Society of America.

[3]  Misha Pavel,et al.  Intelligibility of speech with filtered time trajectories of spectral envelopes , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[4]  Steven Greenberg,et al.  The modulation spectrogram: in pursuit of an invariant representation of speech , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  R. Kumaresan,et al.  Model-based approach to envelope and positive instantaneous frequency estimation of signals with speech applications , 1999 .

[6]  Les E. Atlas,et al.  Scalable and progressive audio codec , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[7]  J.P.F. Glas A differential FM detector for low-IF radios , 1999, Gateway to 21st Century Communications Village. VTC 1999-Fall. IEEE VTS 50th Vehicular Technology Conference (Cat. No.99CH36324).

[8]  Qin Li,et al.  Homomorphic modulation spectra , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  R. Plomp,et al.  Effect of temporal envelope smearing on speech reception. , 1994, The Journal of the Acoustical Society of America.