Single-channel speech enhancement: Using recurrent neuro-fuzzy voice activity detector and spectral subtraction algorithms

This paper investigates the effectiveness of a single-channel speech enhancement system that contains spectral subtraction and voice activity detection algorithm for noise elimination. We first extract features from a noisy signal and use these features as the inputs of a recurrent neuro-fuzzy network for detecting the voice activities of the signal. Based on the detection, we describe the characteristics of the background noise of the speech segments by a minimum frequency energy (MFE) parameter and then apply spectral subtraction algorithms with the parameter to eliminate the noise. Our simulation results show that the proposed enhancement system with a nonlinear spectral subtraction algorithm has superior performance.

[1]  Douglas D. O'Shaughnessy,et al.  Speech communication : human and machine , 1987 .

[2]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[3]  J. Poruba Speech enhancement based on nonlinear spectral subtraction , 2002, Proceedings of the Fourth IEEE International Caracas Conference on Devices, Circuits and Systems (Cat. No.02TH8611).

[4]  Javier Ortega-Garcia,et al.  Overview of speech enhancement techniques for automatic speaker recognition , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[5]  Chin-Teng Lin Single-channel speech enhancement in variable noise-level environment , 2003, IEEE Trans. Syst. Man Cybern. Part A.

[6]  Yariv Ephraim,et al.  A signal subspace approach for speech enhancement , 1995, IEEE Trans. Speech Audio Process..

[7]  L X Wang,et al.  Fuzzy basis functions, universal approximation, and orthogonal least-squares learning , 1992, IEEE Trans. Neural Networks.

[8]  Nathalie Virag,et al.  Single channel speech enhancement based on masking properties of the human auditory system , 1999, IEEE Trans. Speech Audio Process..

[9]  Chin-Teng Lin,et al.  Word boundary detection with mel-scale frequency bank in noisy environment , 2000, IEEE Trans. Speech Audio Process..

[10]  John H. L. Hansen,et al.  Discrete-Time Processing of Speech Signals , 1993 .

[11]  Jeen-Shing Wang,et al.  Self-adaptive recurrent neuro-fuzzy control of an autonomous underwater vehicle , 2002, IEEE Trans. Robotics Autom..