Paper information - Two-Microphone Noise Reduction Using Spatial Information-Based Spectral Amplitude Estimation


SUMMARY Traditional two-microphone noise reduction algorithms for highly nonstationary directional noises generally use direction-of-arrival or phase difference information. The performance of these algorithms deteriorates when diffuse noises coexist with nonstationary directional noises in realistic adverse environments. In this paper, we present a two-channel noise reduction algorithm using a spatial information-based speech estimator and a spatial-information-controlled soft-decision noise estimator to improve the noise reduction performance in realistic nonstationary noisy environments. A target presence probability estimator based on Bayes' rule using both phase difference and magnitude squared coherence is proposed for soft-decision noise estimation, so that the two cues can share complementary advantages when both directional noises and diffuse noises are present. The performance of the proposed two-microphone noise reduction algorithm is evaluated by noise reduction, log-spectral distance (LSD) and word recognition rate (WRR) of a distant-talking ASR system in a real room's noisy environment. Experimental results show that the proposed algorithm achieves better noise suppression than the comparative dual-channel noise reduction algorithms without further distorting the desired signal components.
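The two spatial cues named in the summary, inter-channel phase difference and magnitude squared coherence (MSC), can be illustrated with a minimal NumPy sketch. This is not the authors' estimator: the function names `stft` and `spatial_cues`, the frame length, the hop size, and the choice to average spectra over all frames (rather than recursively over time) are assumptions made only for illustration.

```python
import numpy as np

def stft(x, frame_len=256, hop=128):
    """Hann-windowed short-time Fourier transform, shape (frames, bins)."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(x) - frame_len) // hop
    frames = np.stack([x[i * hop:i * hop + frame_len] * window
                       for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)

def spatial_cues(x1, x2, frame_len=256, hop=128, eps=1e-12):
    """Per-bin phase difference and MSC between two microphone signals.

    Assumption: auto- and cross-spectra are averaged over all frames,
    which is a simplification chosen for this sketch.
    """
    X1 = stft(x1, frame_len, hop)
    X2 = stft(x2, frame_len, hop)
    s11 = np.mean(np.abs(X1) ** 2, axis=0)   # auto-spectrum, channel 1
    s22 = np.mean(np.abs(X2) ** 2, axis=0)   # auto-spectrum, channel 2
    s12 = np.mean(X1 * np.conj(X2), axis=0)  # cross-spectrum
    phase_diff = np.angle(s12)               # radians, in (-pi, pi]
    msc = np.abs(s12) ** 2 / (s11 * s22 + eps)
    return phase_diff, msc

# Illustrative usage with synthetic signals (hypothetical data).
rng = np.random.default_rng(0)
target = rng.standard_normal(16000)
_, msc_coherent = spatial_cues(target, target)
_, msc_diffuse = spatial_cues(rng.standard_normal(16000),
                              rng.standard_normal(16000))
print(msc_coherent.mean(), msc_diffuse.mean())
```

In this sketch a single coherent source seen identically at both microphones yields an MSC near 1 and a phase difference near 0, while uncorrelated noise at the two microphones yields a low MSC. This is the kind of complementary information the summary says the phase difference and the MSC contribute.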

Yanmeng Guo | Qiang Fu | Yonghong Yan | Kai Li | Junfeng Li
