Many conventional voice activity detection (VAD) methods assume that the speech signal is acquired in high quality. However, speech-based human-machine interfaces are usually deployed in indoor environments where various interferences exist, so VAD performance deteriorates seriously. In this paper, we propose a novel VAD method based on array signal processing in the wavelet domain, which exploits the time, frequency, and spatial information in the speech signal to separate interferences. In the proposed method, the speech signal acquired by a microphone array is first decomposed into appropriate subbands with a wavelet packet transform, and array signal processing is then applied to each subband to realize a VAD system for speech arriving from a particular direction.
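The two-stage pipeline described above (wavelet packet decomposition, then array processing per subband) can be illustrated with a minimal sketch. The abstract does not specify the wavelet, the array processing algorithm, or the decision rule, so everything concrete here is an assumption made for illustration: a Haar wavelet packet, delay-and-sum steering with integer sample delays that are multiples of 2**levels, and a simple power threshold. The function names (`haar_split`, `wavelet_packet`, `delay_and_sum`, `subband_array_vad`) are hypothetical and not taken from the paper.

```python
import math


def haar_split(x):
    """One Haar analysis step: return (low, high) bands at half the rate."""
    s = 1.0 / math.sqrt(2.0)
    n = len(x) - len(x) % 2  # drop a trailing odd sample
    low = [(x[i] + x[i + 1]) * s for i in range(0, n, 2)]
    high = [(x[i] - x[i + 1]) * s for i in range(0, n, 2)]
    return low, high


def wavelet_packet(x, levels):
    """Full Haar wavelet packet tree; returns 2**levels subband signals."""
    bands = [list(x)]
    for _ in range(levels):
        bands = [half for band in bands for half in haar_split(band)]
    return bands


def delay_and_sum(channels, delays):
    """Advance each channel by its integer delay, then average the channels."""
    n = min(len(c) - d for c, d in zip(channels, delays))
    return [
        sum(c[i + d] for c, d in zip(channels, delays)) / len(channels)
        for i in range(n)
    ]


def subband_array_vad(mics, delays, levels=2, threshold=0.1):
    """Return (is_active, power) for sound arriving from the steered direction.

    mics      -- one list of samples per microphone
    delays    -- per-microphone arrival delay in samples for the target
                 direction (assumed: non-negative multiples of 2**levels)
    threshold -- assumed decision threshold on the mean steered power
    """
    step = 2 ** levels
    if any(d < 0 or d % step for d in delays):
        raise ValueError("delays must be non-negative multiples of 2**levels")
    packets = [wavelet_packet(m, levels) for m in mics]
    band_delays = [d // step for d in delays]  # delays at the subband rate
    energy, count = 0.0, 0
    for b in range(step):
        steered = delay_and_sum([p[b] for p in packets], band_delays)
        energy += sum(v * v for v in steered)
        count += len(steered)
    power = energy / max(count, 1)
    return power > threshold, power


if __name__ == "__main__":
    n = 256
    tone = [math.sin(2 * math.pi * t / 16) for t in range(n)]
    late = [math.sin(2 * math.pi * (t - 8) / 16) for t in range(n)]
    print(subband_array_vad([tone, tone], [0, 0]))  # source at steered direction
    print(subband_array_vad([tone, late], [0, 0]))  # source from elsewhere
```

In this toy setup, a tone that reaches both microphones with the steered delays adds coherently and is flagged as active, while the same tone arriving with a half-period offset between microphones cancels after steering and is rejected. A practical system would likely need fractional delays, per-subband weighting, and an adaptive threshold, none of which are covered here.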