Two-Microphone Noise Reduction Using Spatial Information-Based Spectral Amplitude Estimation

SUMMARY Traditionaltwo-microphonenoisereductionalgorithmstodeal with highly nonstationary directional noises generally use the direc-tion of arrival or phase difference information. The performance of thesealgorithms deteriorate when diffuse noises coexist with nonstationary di-rectional noises in realistic adverse environments. In this paper, we presenta two-channel noise reduction algorithm using a spatial information-basedspeech estimator and a spatial-information-controlled soft-decision noiseestimator to improve the noise reduction performance in realistic non-stationary noisy environments. A target presence probability estimatorbased on Bayes rules using both phase difference and magnitude squaredcoherenceisproposedforsoft-decisionofnoiseestimation,sothattheycanshare complementary advantages when both directional noises and diffusenoises are present. Performances of the proposed two-microphone noisereduction algorithm are evaluated by noise reduction, log-spectral distance(LSD) and word recognition rate (WRR) of a distant-talking ASR systemin a real room’s noisy environment. Experimental results show that theproposed algorithm achieves better noises suppression without further dis-torting the desired signal components over the comparative dual-channelnoise reduction algorithms.

[1]  Jianfeng Chen,et al.  Performance evaluation of adaptive dual microphone systems , 2009, Speech Commun..

[2]  Guangji Shi,et al.  Multi-Microphone Phase-Based Speech Processing , 2005 .

[3]  Israel Cohen,et al.  Analysis of two-channel generalized sidelobe canceller (GSC) with post-filtering , 2003, IEEE Trans. Speech Audio Process..

[4]  Futoshi Asano,et al.  Speech Enhancement Based on Short-Time Spectral Amplitude Estimation with Two-Channel Beamformer , 1996 .

[5]  Michael S. Brandstein,et al.  Microphone Arrays - Signal Processing Techniques and Applications , 2001, Microphone Arrays.

[6]  Yonghong Yan,et al.  A One-Pass Real-Time Decoder Using Memory-Efficient State Network , 2008, IEICE Trans. Inf. Syst..

[7]  L. J. Griffiths,et al.  An alternative approach to linearly constrained adaptive beamforming , 1982 .

[8]  Xuefeng Zhang,et al.  A soft decision based noise cross power spectral density estimation for two-microphone speech enhancement systems , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[9]  Richard M. Stern,et al.  Signal separation for robust speech recognition based on phase difference information obtained in the frequency domain , 2009, INTERSPEECH.

[10]  Nam Soo Kim,et al.  Spectral enhancement based on global soft decision , 2000, IEEE Signal Processing Letters.

[11]  Philipos C. Loizou,et al.  A coherence-based algorithm for noise reduction in dual-microphone applications , 2010, 2010 18th European Signal Processing Conference.

[12]  Min-Seok Choi,et al.  A Two-Channel Minimum Mean-Square Error Log-Spectral Amplitude Estimator for Speech Enhancement , 2008, 2008 Hands-Free Speech Communication and Microphone Arrays.

[13]  Jeongsu Kim,et al.  Dual channel noise reduction method using phase difference-based spectral amplitude estimation , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[14]  Mohsen Rahmani,et al.  Power level difference as a criterion for speech enhancement , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[15]  Régine Le Bouquin-Jeannès,et al.  A Two-Sensor Noise Reduction System: Applications for Hands-Free Car Kit , 2003, EURASIP J. Adv. Signal Process..

[16]  David Malah,et al.  Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..

[17]  Junfeng Li,et al.  A noise reduction system based on hybrid noise estimation technique and post-filtering in arbitrary noise environments , 2006, Speech Commun..

[18]  Régine Le Bouquin-Jeannès,et al.  Study of a voice activity detector and its influence on a noise reduction system , 1995, Speech Commun..

[19]  Régine Le Bouquin-Jeannès,et al.  Enhancement of speech degraded by coherent and incoherent noise using a cross-spectral estimator , 1997, IEEE Trans. Speech Audio Process..

[20]  Jont B. Allen,et al.  Multimicrophone signal‐processing technique to remove room reverberation from speech signals , 1977 .

[21]  Israel Cohen,et al.  Speech enhancement for non-stationary noise environments , 2001, Signal Process..

[22]  Mohsen Rahmani,et al.  An iterative noise cross-PSD estimation for two-microphone speech enhancement , 2009 .

[23]  Gérard Faucon,et al.  Using the coherence function for noise reduction , 1992 .

[24]  Bhaskar D. Rao,et al.  A Two Microphone-Based Approach for Source Localization of Multiple Speech Sources , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[25]  Guangji Shi,et al.  Phased-Based Speech Processing , 2005 .

[26]  I. Cohen,et al.  Noise estimation by minima controlled recursive averaging for robust speech enhancement , 2002, IEEE Signal Processing Letters.

[27]  Philipos C. Loizou,et al.  Speech Enhancement: Theory and Practice , 2007 .

[28]  Parham Aarabi,et al.  Phase-based dual-microphone robust speech enhancement , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[29]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[30]  Ken Yamazaki,et al.  Acoustic source localization using phase difference spectrum images , 2003 .

[31]  Junfeng Li,et al.  A Two-Microphone Noise Reduction Method in Highly Non-stationary Multiple-Noise-Source Environments , 2008, IEICE Trans. Fundam. Electron. Commun. Comput. Sci..

[32]  Israel Cohen,et al.  Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging , 2003, IEEE Trans. Speech Audio Process..

[33]  Ephraim Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .

[34]  E.A.P. Habets,et al.  Dual-Microphone Speech Dereverberation in a Noisy Environment , 2006, 2006 IEEE International Symposium on Signal Processing and Information Technology.

[35]  Jae-Hoon Jeong,et al.  Dominant speech enhancement based on SNR-adaptive soft mask filtering , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.