A dual microphone coherence based method for speech enhancement in headsets

The performance of two-microphone coherence based methods degrades if two captured noises are correlated. The Cross Power Spectrum Subtraction (CPSS) is an adapted coherence method for noise correlated environments. In this paper, we propose a new technique for estimation of speech cross power spectrum density and we exploit it in CPSS. The proposed speech enhancement method is evaluated as a speech recognition preprocessing system and as an independent speech enhancement system. The enhancement results show the practical superiority of the proposed method comparing with the previous solutions.

[1]  Gérard Faucon,et al.  Using the coherence function for noise reduction , 1992 .

[2]  Klaus Obermayer,et al.  A cross-spectrum weighting algorithm for speech enhancement and array processing: combining phase-shift information and stationary signal properties. , 2006, The Journal of the Acoustical Society of America.

[3]  Xuefeng Zhang,et al.  A soft decision based noise cross power spectral density estimation for two-microphone speech enhancement systems , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[4]  B.D. Van Veen,et al.  Beamforming: a versatile approach to spatial filtering , 1988, IEEE ASSP Magazine.

[5]  M. Viberg,et al.  Two decades of array signal processing research: the parametric approach , 1996, IEEE Signal Process. Mag..

[6]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[7]  Jont B. Allen,et al.  Multimicrophone signal‐processing technique to remove room reverberation from speech signals , 1977 .

[8]  John H. L. Hansen,et al.  Discrete-Time Processing of Speech Signals , 1993 .

[9]  Régine Le Bouquin-Jeannès,et al.  Enhancement of speech degraded by coherent and incoherent noise using a cross-spectral estimator , 1997, IEEE Trans. Speech Audio Process..

[10]  Régine Le Bouquin-Jeannès,et al.  A Two-Sensor Noise Reduction System: Applications for Hands-Free Car Kit , 2003, EURASIP J. Adv. Signal Process..

[11]  R. G. Leonard,et al.  A database for speaker-independent digit recognition , 1984, ICASSP.

[12]  Ahmad Akbari,et al.  SNR-dependent compression of enhanced Mel sub-band energies for compensation of noise effects on MFCC features , 2007, Pattern Recognit. Lett..