Application of a first-order differential microphone for efficient voice activity detection in a car platform

Hands-free interfaces provide a convenient solution for add-on devices in car platforms. However, the level of acoustic disturbance in automotive environments usually prevents satisfactory results. In most cases, noise reduction techniques involving a voice activity detector (VAD) are required. In this paper, a robust microphone array processing technique is proposed for speech detection under the influence of noise and reverberation in an automobile environment. The method applies a simple two-microphone first-order differential microphone to estimate the power spectral density of the background perturbations embedded in the speech signals. Specialized order-statistics filters (OSFs) are then applied to obtain a consistent speech/non-speech decision. The paper also includes a performance evaluation of the algorithm using Aurora3 database recordings. According to our simulation results, the proposed algorithm performs significantly better than standard VADs such as G.729B or AMR, and shows a slight advantage over other reported methods.
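As a rough illustration of the two stages described above, the following sketch combines a delay-and-subtract differential pair with a median filter, the simplest order-statistics filter. It is not the algorithm evaluated in the paper: it works on broadband frame energies rather than a power spectral density estimate, and it assumes an endfire geometry in which the talker is in front of the array, an inter-microphone delay of exactly one sample, and a hand-picked 6 dB threshold. All function names and parameter values are hypothetical.

```python
import math
from statistics import median


def noise_reference(front_mic, rear_mic, delay=1):
    """Delay-and-subtract pair with its null toward the front (assumed talker side).

    A source in front reaches the rear microphone `delay` samples after the
    front one, so rear[n] - front[n - delay] cancels it and keeps other sounds.
    """
    out = []
    for n in range(len(rear_mic)):
        delayed_front = front_mic[n - delay] if n >= delay else 0.0
        out.append(rear_mic[n] - delayed_front)
    return out


def frame_log_energy(x, frame_len):
    """Mean energy in dB of consecutive non-overlapping frames."""
    energies = []
    for start in range(0, len(x) - frame_len + 1, frame_len):
        frame = x[start:start + frame_len]
        power = sum(v * v for v in frame) / frame_len
        energies.append(10.0 * math.log10(power + 1e-12))
    return energies


def median_filter(values, half_width):
    """Sliding median, used here as a basic order-statistics filter."""
    out = []
    for i in range(len(values)):
        lo = max(0, i - half_width)
        hi = min(len(values), i + half_width + 1)
        out.append(median(values[lo:hi]))
    return out


def vad(front_mic, rear_mic, frame_len=160, delay=1,
        half_width=2, threshold_db=6.0):
    """Per-frame speech flags: smoothed signal level vs. smoothed noise level."""
    ref = noise_reference(front_mic, rear_mic, delay)
    e_sig = median_filter(frame_log_energy(front_mic, frame_len), half_width)
    e_noise = median_filter(frame_log_energy(ref, frame_len), half_width)
    return [s - n > threshold_db for s, n in zip(e_sig, e_noise)]
```

Calling `vad(front_mic, rear_mic)` on two equal-length sample lists returns one boolean per frame. The median smoothing suppresses isolated frame-level outliers in both energy tracks, which is the role the abstract assigns to the OSF stage; a real implementation would need fractional delays, per-band processing, and a tuned threshold.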
