Paper information - Sector-Based Detection for Hands-Free Speech Enhancement in Cars

Adaptation control of beamforming interference cancellation techniques is investigated for in-car speech acquisition. Two efficient adaptation control methods are proposed that avoid target cancellation. The "implicit" method varies the step-size continuously, based on the filtered output signal. The "explicit" method decides in a binary manner whether to adapt or not, based on a novel estimate of target and interference energies. It estimates the average delay-sum power within a volume of space, for the same cost as the classical delay-sum. Experiments on real in-car data validate both methods, including a case with km/h background road noise.
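The two control strategies named in the abstract can be illustrated with a generic adaptive interference canceller. The sketch below is not the authors' algorithm: it assumes a single-reference NLMS canceller, a precomputed boolean `adapt_mask` standing in for the explicit target/interference energy decision, and a hypothetical output-power-based rule standing in for the implicit step-size variation. All names (`cancel`, `adapt_mask`, `mu0`) and the synthetic signals are illustrative assumptions.

```python
import numpy as np

def cancel(primary, reference, taps, mu0, adapt_mask=None, eps=1e-8):
    """NLMS interference canceller with two hypothetical adaptation controls.

    adapt_mask given -> "explicit" style: adapt (mu0) or freeze (0) per sample.
    adapt_mask None  -> "implicit" style: step-size shrinks continuously as
                        the smoothed output power grows (assumed rule).
    """
    w = np.zeros(taps)
    out = np.zeros(len(primary))
    p_out = 0.0  # smoothed output power, used only by the implicit variant
    for n in range(taps - 1, len(primary)):
        x = reference[n - taps + 1:n + 1][::-1]  # most recent sample first
        e = primary[n] - w @ x                   # canceller output
        if adapt_mask is not None:
            mu = mu0 if adapt_mask[n] else 0.0   # binary decision
        else:
            p_out = 0.99 * p_out + 0.01 * e * e
            mu = mu0 / (1.0 + p_out / (x @ x / taps + eps))
        w += mu * e * x / (x @ x + eps)          # normalized LMS update
        out[n] = e
    return out, w

# Synthetic test signals (assumptions, not in-car data).
rng = np.random.default_rng(0)
N = 4000
r = rng.standard_normal(N)                       # interference reference
h = np.array([0.5, -0.3, 0.1])                   # assumed interference path
interference = np.convolve(r, h)[:N]
idx = np.arange(N)
target = np.where(idx >= 2000, np.sin(0.05 * idx), 0.0)  # target in 2nd half
primary = target + interference

# Explicit-style control: adapt only while the target is absent.
adapt_mask = idx < 2000
out_explicit, w_explicit = cancel(primary, r, taps=4, mu0=0.5,
                                  adapt_mask=adapt_mask)

# Implicit-style control: no mask, step-size driven by output power.
out_implicit, w_implicit = cancel(primary, r, taps=4, mu0=0.5)
```

In this toy setting, freezing adaptation while the target is active lets the filter learn the interference path during interference-only periods and then pass the target through uncancelled. The implicit-style variant never stops adapting, but its step shrinks as the output power rises.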
