Speech enhancement using a constrained iterative sinusoidal model

This paper presents a sinusoidal-model-based algorithm for enhancing speech degraded by additive broad-band noise. To preserve the speech-like characteristics observed in clean speech, smoothness constraints are imposed on the model parameters using a spectral envelope surface (SES) smoothing procedure. The algorithm is evaluated on speech signals degraded by additive white Gaussian noise. Distortion, as measured by objective speech quality scores, is reduced by 34% to 41% over an SNR range of 5 to 20 dB. Objective and subjective evaluations also show considerable improvement over traditional spectral subtraction and Wiener filtering based schemes. Finally, in a subjective AB preference test in which the enhanced signals were coded with the G.729 codec, the proposed scheme was preferred over the traditional enhancement schemes tested for SNRs in the range of 5 to 20 dB.
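The evaluation condition described above, speech degraded by additive white Gaussian noise at a prescribed SNR, can be illustrated with a minimal sketch. This is not the paper's test harness: the function names `add_white_noise` and `measure_snr_db`, the 8 kHz sampling rate, and the synthetic harmonic signal standing in for speech are all assumptions made for illustration.

```python
import numpy as np


def add_white_noise(clean, target_snr_db, rng=None):
    """Return `clean` plus white Gaussian noise scaled to the target SNR (dB)."""
    rng = np.random.default_rng() if rng is None else rng
    clean = np.asarray(clean, dtype=float)
    noise = rng.standard_normal(clean.shape)
    signal_power = np.mean(clean ** 2)
    noise_power = np.mean(noise ** 2)
    # Choose the scale so that 10*log10(signal_power / scaled_noise_power)
    # equals target_snr_db for this particular noise realisation.
    scale = np.sqrt(signal_power / (noise_power * 10.0 ** (target_snr_db / 10.0)))
    return clean + scale * noise


def measure_snr_db(clean, degraded):
    """Global SNR in dB, treating (degraded - clean) as the noise."""
    clean = np.asarray(clean, dtype=float)
    residual = np.asarray(degraded, dtype=float) - clean
    return 10.0 * np.log10(np.mean(clean ** 2) / np.mean(residual ** 2))


# Hypothetical stand-in for a speech frame: a few harmonics of 120 Hz at 8 kHz.
fs = 8000
t = np.arange(fs) / fs
clean = sum(np.sin(2 * np.pi * 120 * k * t) / k for k in range(1, 6))

rng = np.random.default_rng(0)
for target in (5, 10, 15, 20):
    noisy = add_white_noise(clean, target, rng)
    print(target, measure_snr_db(clean, noisy))
```

Scaling by the realised noise power, rather than by the nominal unit variance, makes the measured global SNR match the target for each generated signal; a segmental SNR or another objective quality measure would need a different computation.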
