Properties for modulation spectral filtering

A two-dimensional representation, the "modulation spectrum", where the modulation frequency exists jointly with a regular Fourier frequency or other filter channel index, has previously been investigated. Accurate modulation filters offer, for example, new approaches for signal separation and noise reduction. However, a filtering operation on modulation frequency components has yet to be carefully defined. Most previous studies on modulation filtering assumed that the amplitude modulation envelope is real and non-negative, which has recently been shown to be incorrect. Distortions appear when the non-negative envelope assumption fails. Beginning with a more appropriate envelope assumption that allows the envelope to go negative, we propose three properties which modulation filtering systems should satisfy. Any modulation filtering method which satisfies these properties yields distortion-free results. An implementation of modulation filtering, based on a short-time Fourier transform followed by independent coherent demodulation for each frequency channel, is then proposed. Satisfaction of the properties is confirmed and an example result of modulation filtering on a speech signal is illustrated.

[1]  Ramdas Kumaresan,et al.  On decomposing speech into modulated components , 2000, IEEE Trans. Speech Audio Process..

[2]  Brian C. J. Moore Masking in the Human Auditory System , 1996 .

[3]  R. McEachern How the ear really works , 1992, [1992] Proceedings of the IEEE-SP International Symposium on Time-Frequency and Time-Scale Analysis.

[4]  Les E. Atlas,et al.  Modulation frequency and efficient audio coding , 2001, SPIE Optics + Photonics.

[5]  R. Plomp,et al.  Effect of temporal envelope smearing on speech reception. , 1994, The Journal of the Acoustical Society of America.

[6]  Steven Greenberg,et al.  The modulation spectrogram: in pursuit of an invariant representation of speech , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  O Ghitza,et al.  On the upper cutoff frequency of the auditory critical-band envelope detectors in the context of speech perception. , 2001, The Journal of the Acoustical Society of America.

[8]  Qin Li,et al.  Homomorphic modulation spectra , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  Les Atlas,et al.  Over-modulated AM-FM decomposition , 2004, SPIE Optics + Photonics.

[10]  Zachary M. Smith,et al.  Chimaeric sounds reveal dichotomies in auditory perception , 2002, Nature.

[11]  P. Loughlin,et al.  On the amplitude‐ and frequency‐modulation decomposition of signals , 1996 .

[12]  Les E. Atlas,et al.  A non-uniform modulation transform for audio coding with increased time resolution , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[13]  Misha Pavel,et al.  Intelligibility of speech with filtered time trajectories of spectral envelopes , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[14]  Thomas F. Quatieri,et al.  AM-FM separation using auditory-motivated filters , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.