Estimation of speech presence probability in the field of microphone array

The subject of this work is the robust estimation of speech presence probability of every spectral component of a speech signal impinging on a linear microphone array. The approach is based on the generalized likelihood ratio test (GLRT) applied to the multichannel framework and far-field, wideband sources. It is shown that under certain distributional assumptions the GLRT provides a framework for speech presence detection by exploiting both the spatial localization and spectral content of the speech signal. The efficiency of the approach and its superiority over a state-of-the-art one-channel speech presence estimation technique is illustrated when additive white Gaussian noise is present in the acoustical field in low signal-to-noise ratio (SNR).

[1]  Nozomu Hamada,et al.  Voice activity detection with array signal processing in the wavelet domain , 2002, 2002 11th European Signal Processing Conference.

[2]  Ephraim Speech enhancement using a minimum mean square error short-time spectral amplitude estimator , 1984 .

[3]  Francesco Beritelli,et al.  A multichannel speech/silence detector based on time delay estimation and fuzzy classification , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[4]  A. G. Jaffer,et al.  Maximum likelihood direction finding of stochastic sources: a separable solution , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[6]  Wonyong Sung,et al.  A statistical model-based voice activity detection , 1999, IEEE Signal Processing Letters.