A compact multi-sensor headset for hands-free communication

The need for hands-free communication has led to an increased popularity in the use of headsets with mobile phones. Comfort and portability concerns have led to the desire for headsets with a small form factor. Unfortunately, this size constraint typically requires that the microphone be placed farther from the user's mouth, making it highly susceptible to environmental noise. One long term goal of our work is to develop a headset that can achieve the sound capture performance of a close-talking microphone located at the user's mouth, while maintaining the desired compact size. Toward this end, we have designed a headset consisting of three air microphones and a bone-conductive sensor. The speech enhancement is performed in two stages, a fixed beamformer followed by a single-channel adaptive post-filter. Unlike other techniques, the beamformer is calibrated in a purely data-driven manner. The bone sensor provides a robust speech activity detector for use in the post-filtering stage. We present preliminary experimental results using real data collected in multiple environments. The proposed approach results in significant improvements in both speech recognition accuracy and SNR.

[1]  G.W. Elko,et al.  An adaptive close-talking microphone array , 1998, Conference Record of Thirty-Second Asilomar Conference on Signals, Systems and Computers (Cat. No.98CH36284).

[2]  D. E. Davies,et al.  Array signal processing , 1983 .

[3]  Jörg Meyer,et al.  Multi-channel speech enhancement in a car environment using Wiener filtering and spectral subtraction , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Israel Cohen,et al.  Microphone array post-filtering for non-stationary noise suppression , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  Xuedong Huang,et al.  Air- and bone-conductive integrated microphones for robust speech detection and enhancement , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).

[6]  C. Burrus,et al.  Array Signal Processing , 1989 .

[7]  K. B. Rasmussen,et al.  Design of a microphone array for headsets , 2003, 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684).

[8]  G. W. Elko,et al.  An adaptive close-talking microphone array , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[9]  B. Raj,et al.  Speech-recognizer-based filter optimization for microphone array processing , 2003, IEEE Signal Processing Letters.

[10]  Janet M. Baker,et al.  The Design for the Wall Street Journal-based CSR Corpus , 1992, HLT.