Sector-Based Detection for Hands-Free Speech Enhancement in Cars

Adaptation control of beamforming interference cancellation techniques is investigated for in-car speech acquisition. Two efficient adaptation control methods are proposed that avoid target cancellation. The "implicit" method varies the step-size continuously, based on the filtered output signal. The "explicit" method decides in a binary manner whether to adapt or not, based on a novel estimate of target and interference energies. It estimates the average delay-sum power within a volume of space, for the same cost as the classical delay-sum. Experiments on real in-car data validate both methods, including a case with km/h background road noise.

[1]  Ehud Weinstein,et al.  Signal enhancement using beamforming and nonstationarity with applications to speech , 2001, IEEE Trans. Signal Process..

[2]  Jean-Marc Odobez,et al.  Unsupervised Location-Based Segmentation of Multi-Party Speech , 2004 .

[3]  L. J. Griffiths,et al.  An alternative approach to linearly constrained adaptive beamforming , 1982 .

[4]  O. Hoshuyama,et al.  A robust adaptive beamformer for microphone arrays with a blocking matrix using constrained adaptive filters , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[5]  Dirk Van Compernolle,et al.  Switching adaptive filters for enhancing noisy and reverberant speech from microphone array recordings , 1990, ICASSP.

[6]  Satoshi Nakamura,et al.  Joint optimization of LCMV beamforming and acoustic echo cancellation , 2004, 2004 12th European Signal Processing Conference.

[7]  Sam T. Roweis,et al.  Factorial models and refiltering for speech separation and denoising , 2003, INTERSPEECH.

[8]  Yves Grenier,et al.  Test of adaptive beamformers for speech acquisition in cars , 1994 .

[9]  Daniel P. W. Ellis,et al.  Speaker turn segmentation based on between-channel differences , 2004 .

[10]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[11]  T. Moon,et al.  Mathematical Methods and Algorithms for Signal Processing , 1999 .

[12]  Andreas Stolcke,et al.  Can Prosody Aid the Automatic Processing of Multi-Party Meetings? Evidence from Predicting Punctuation, Disfluencies, and Overlapping Speech , 2003 .

[13]  Iain McCowan,et al.  A sector-based approach for localization of multiple speakers with microphone arrays , 2004, SAPA@INTERSPEECH.

[14]  Bertrand Mesot,et al.  A spectrogram model for enhanced source localization and noise-robust ASR , 2005, INTERSPEECH.

[15]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[16]  Sofiène Affes,et al.  A signal subspace tracking algorithm for microphone array processing of speech , 1997, IEEE Trans. Speech Audio Process..

[17]  B.D. Van Veen,et al.  Beamforming: a versatile approach to spatial filtering , 1988, IEEE ASSP Magazine.

[18]  B. Moore An Introduction to the Psychology of Hearing , 1977 .

[19]  Henning Puder,et al.  Step-size control for acoustic echo cancellation filters - an overview , 2000, Signal Process..

[20]  Walter Kellermann,et al.  ROBUST SPATIAL ESTIMATION OF THE SIGNAL-TO-INTERFERENCE RATIO (SIR) FOR NON-STATIONARY MIXTURES , 2003 .

[21]  Julien Bourgeois,et al.  Implicit control of noise canceller for speech enhancement , 2005, INTERSPEECH.

[22]  Julien Bourgeois,et al.  Multichannel Speech Enhancement in Cars: Explicit vs. Implicit Adaptation Control , 2005 .

[23]  E. Owens,et al.  An Introduction to the Psychology of Hearing , 1997 .

[24]  S. Thomas Alexander,et al.  Adaptive Signal Processing , 1986, Texts and Monographs in Computer Science.

[25]  Guillaume Lathoud,et al.  A sector-based, frequency-domain approach to detection and localization of multiple speakers , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[26]  Akihiko Sugiyama,et al.  A real time robust adaptive microphone array controlled by an SNR estimate , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).