In-car speech and audio processing - some experiments within hArtes project

This paper describes part of the work dealing with in-car speech and audio applications achieved within the European FP6-IST hArtes project. After a brief introduction describing the hArtes project framework together with the targeted scenarios, the in-car hardware platform will be presented, followed by an overview of the different algorithmic solutions. The applications are covering speech and audio processing for enhanced communications through hands-free telephony, and preprocessing for advanced speech processing modules such as speech recognition and speaker authentication. Then a description of the hArtes integration process applied to the proposed algorithmic solutions will be given, followed by some experimental results. Finally some perspectives will be presented for the next steps towards the integration on a dedicated hardware.

[1]  Jacob Benesty,et al.  Generalized multichannel frequency-domain adaptive filtering: efficient realization and application to hands-free speech communication , 2005, Signal Process..

[2]  Jacob Benesty,et al.  New insights into the noise reduction Wiener filter , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  K. U. Simmer,et al.  MULTI-MICROPHONE NOISE REDUCTION BY POST-FILTER AND SUPERDIRECTIVE BEAMFORMER , 1999 .

[4]  Jacob Benesty,et al.  An extended multidelay filter: fast low-delay algorithms for very high-order adaptive systems , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[5]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[6]  A. Wasiljeff,et al.  Adaptive Microphone Arrays for Noise Suppression in the Frequency Domain , 1992 .

[7]  Israel Cohen,et al.  Noise spectrum estimation in adverse environments: improved minima controlled recursive averaging , 2003, IEEE Trans. Speech Audio Process..

[8]  I. Cohen,et al.  Noise estimation by minima controlled recursive averaging for robust speech enhancement , 2002, IEEE Signal Processing Letters.

[9]  Joseph H. DiBiase A High-Accuracy, Low-Latency Technique for Talker Localization in Reverberant Environments Using Microphone Arrays , 2000 .

[10]  A.V. Oppenheim,et al.  Enhancement and bandwidth compression of noisy speech , 1979, Proceedings of the IEEE.

[11]  Jörg Meyer,et al.  Multi-channel speech enhancement in a car environment using Wiener filtering and spectral subtraction , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[12]  Jacob Benesty,et al.  Multichannel Acoustic Echo Cancellation , 2001 .

[13]  Gerhard Doblinger,et al.  Computationally efficient speech enhancement by spectral minima tracking in subbands , 1995, EUROSPEECH.

[14]  Yannick Mahieux,et al.  Analysis of noise reduction and dereverberation techniques based on microphone arrays with postfiltering , 1998, IEEE Trans. Speech Audio Process..

[15]  Ephraim Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .

[16]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[17]  Yi Hu,et al.  A noise estimation algorithm with rapid adaptation for highly nonstationary environments , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[18]  Richard M. Schwartz,et al.  Enhancement of speech corrupted by acoustic noise , 1979, ICASSP.

[19]  Keshab K. Parhi,et al.  Multi-delay block frequency domain adaptive filters with sparse partial subblock update , 2009, 2009 Conference Record of the Forty-Third Asilomar Conference on Signals, Systems and Computers.

[20]  Francesco Piazza,et al.  NU-Tech: Implementing DSP Algorithms in a Plug-in Based Software Platform for Real time Audio Applications , 2005 .

[21]  Eric Moulines,et al.  The generalized multidelay adaptive filter: structure and convergence analysis , 1995, IEEE Trans. Signal Process..

[22]  Hervé Bourlard,et al.  Microphone array post-filter based on noise field coherence , 2003, IEEE Trans. Speech Audio Process..

[23]  J. L. Hall,et al.  Stereophonic acoustic echo cancellation-an overview of the fundamental problem , 1995, IEEE Signal Processing Letters.

[24]  Philipos C. Loizou,et al.  A noise-estimation algorithm for highly non-stationary environments , 2006, Speech Commun..

[25]  Rainer Martin,et al.  Spectral Subtraction Based on Minimum Statistics , 2001 .

[26]  Rainer Martin,et al.  Noise power spectral density estimation based on optimal smoothing and minimum statistics , 2001, IEEE Trans. Speech Audio Process..

[27]  R. Zelinski,et al.  A microphone array with adaptive post-filtering for noise reduction in reverberant rooms , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[28]  David Malah,et al.  Speech enhancement using a minimum mean-square error log-spectral amplitude estimator , 1984, IEEE Trans. Acoust. Speech Signal Process..