An approach to co-channel talker interference suppression using a sinusoidal model for speech

The technique fits a sinusoidal model to additive vocal speech segments so that the least-mean-squared error between the model and the summed waveforms is obtained. Enhancement is achieved by synthesizing a waveform from the sine waves attributed to the desired speaker. Least-squares estimation is applied to obtain sine-wave amplitudes and phases of both talkers, based on either a priori sine-wave frequencies or a priori fundamental frequency contours. When the frequencies of the two waveforms are closely spaced, the performance is significantly improved by exploring the time evolution of the sinusoidal parameters across multiple analysis frames. The least-squared-error approach is also extended, under restricted conditions, to estimate fundamental frequency contours of both speakers from the summed waveforms. The results obtained, although limited in their scope, provide evidence that the sinusoidal analysis/synthesis model with effective parameter estimation techniques offers a promising approach to the problem of cochannel talker-interference suppression over a range of conditions. >

[1]  Biing-Hwang Juang,et al.  Speech enhancement with harmonic synthesis , 1983, ICASSP.

[2]  R J Stubbs,et al.  Evaluation of two voice-separation algorithms using normal-hearing and hearing-impaired listeners. , 1988, The Journal of the Acoustical Society of America.

[3]  Donald G. Childers,et al.  Co--Channel speech separation , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  T. W. Parsons,et al.  Enhancing/Intelligibility of Speech in Noisy or Multi-Talker Environments. , 1975 .

[5]  M.G. Bellanger,et al.  Digital processing of speech signals , 1980, Proceedings of the IEEE.

[6]  S. Boll,et al.  Techniques for suppression of an interfering talker in co-channel speech , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  B A Hanson,et al.  Processing techniques for intelligibility improvement to speech with co-channel interference , 1983 .

[8]  T. W. Parsons Separation of speech from interfering speech by means of harmonic selection , 1976 .

[9]  Brian A. Hanson,et al.  The harmonic magnitude suppression (EMS) technique for intelligibility enhancement in the presence of interfering speech , 1984, ICASSP.

[10]  Thomas F. Quatieri,et al.  An approach to co-channel talker interference suppression using a sinusoidal model for speech , 1990, IEEE Trans. Acoust. Speech Signal Process..

[11]  Alan V. Oppenheim,et al.  Enhancement of speech by adaptive filtering , 1976, ICASSP.

[12]  Thomas F. Quatieri,et al.  Speech analysis/Synthesis based on a sinusoidal representation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[13]  Thomas F. Quatieri,et al.  Speech transformations based on a sinusoidal representation , 1986, IEEE Trans. Acoust. Speech Signal Process..