Microphone Diversity Combining for In-Car Applications

This paper proposes a frequency domain diversity approach for two or more microphone signals, for example, for in-car applications. The microphones should be positioned separately to insure diverse signal conditions and incoherent recording of noise. This enables a better compromise for the microphone position with respect to different speaker sizes and noise sources. This work proposes a two-stage approach. In the first stage, the microphone signals are weighted with respect to their signal-to-noise ratio and then summed similar to maximum ratio combining. The combined signal is then used as a reference for a frequency domain least-mean-squares (LMS) filter for each input signal. The output SNR is significantly improved compared to coherence-based noise reduction systems, even if one microphone is heavily corrupted by noise.

[1]  Richard M. Schwartz,et al.  Enhancement of speech corrupted by acoustic noise , 1979, ICASSP.

[2]  Rainer Martin,et al.  Noise power spectral density estimation based on optimal smoothing and minimum statistics , 2001, IEEE Trans. Speech Audio Process..

[3]  Wonyong Sung,et al.  A statistical model-based voice activity detection , 1999, IEEE Signal Processing Letters.

[4]  Marc Moonen,et al.  Subspace Methods for Multimicrophone Speech Dereverberation , 2003, EURASIP J. Adv. Signal Process..

[5]  Régine Le Bouquin-Jeannès,et al.  Enhancement of speech degraded by coherent and incoherent noise using a cross-spectral estimator , 1997, IEEE Trans. Speech Audio Process..

[6]  Rainer Martin,et al.  Spectral Subtraction Based on Minimum Statistics , 2001 .

[7]  Rainer Martin,et al.  Combined acoustic echo cancellation, dereverberation and noise reduction: a two microphone approach , 1994 .

[8]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[9]  Wonyong Sung,et al.  A voice activity detector employing soft decision based noise spectrum adaptation , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[10]  J. L. Flanagan,et al.  Signal Processing to Reduce Multipath Distortion in Small Rooms , 1970 .

[11]  Peter Vary,et al.  Digital Speech Transmission: Enhancement, Coding and Error Concealment , 2006 .

[12]  E. Hänsler,et al.  Acoustic Echo and Noise Control: A Practical Approach , 2004 .

[13]  Gerhard Schmidt,et al.  Speech and Audio Processing in Adverse Environments , 2008 .

[14]  Bah-Hwee Gwee,et al.  A 32-point FFT based Noise Reduction Algorithm for Single Channel Speech Signals , 2007, 2007 IEEE International Symposium on Circuits and Systems.

[15]  Emanuel A. P. Habets,et al.  Multi-microphone speech dereverberation using LIME and least squares filtering , 2008, 2008 16th European Signal Processing Conference.

[16]  Javier Ramírez,et al.  A new voice activity detector using subband order-statistics filters for robust speech recognition , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[17]  Alexandre Gu A Two-Sensor Noise Reduction System: Applications for Hands-Free Car Kit , 2003 .

[18]  Israel Cohen,et al.  Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging , 2003, IEEE Trans. Speech Audio Process..

[19]  Rainer Martin,et al.  Soft decision combining for dual channel noise reduction , 2006, INTERSPEECH.

[20]  Régine Le Bouquin-Jeannès,et al.  A Two-Sensor Noise Reduction System: Applications for Hands-Free Car Kit , 2003, EURASIP J. Adv. Signal Process..

[21]  Jürgen Freudenberger,et al.  Spectral combining for microphone diversity systems , 2009, 2009 17th European Signal Processing Conference.

[22]  Jürgen Freudenberger,et al.  A two-microphone diversity system and its application for hands-free car kits , 2005, INTERSPEECH.

[23]  Jont B. Allen,et al.  Multimicrophone signal‐processing technique to remove room reverberation from speech signals , 1977 .

[24]  Marc Delcroix,et al.  Dereverberation and Denoising Using Multichannel Linear Prediction , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[25]  Henning Puder Single channel noise reduction using time-frequency dependent voice activity detection , 1999 .

[26]  Benjamin Venditti,et al.  An FLMS based two-microphone speech enhancement system for in-car applications , 2009, 2009 IEEE/SP 15th Workshop on Statistical Signal Processing.

[27]  G. David Forney,et al.  Exponential error bounds for erasure, list, and decision feedback schemes , 1968, IEEE Trans. Inf. Theory.

[28]  Philipos C. Loizou,et al.  A multi-band spectral subtraction method for enhancing speech corrupted by colored noise , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[29]  Rainer Martin,et al.  A Symmetric Two Microphone Speech Enhancement System Theoretical Limits and Application in a Car Environment , 1992, The Digital Signal Processing workshop.

[30]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[31]  A. Gray,et al.  Distance measures for speech processing , 1976 .