Research on voice activity detection in burst and partial duration noisy environment

Voice activity detection aims at detecting speech in noisy environment and is very important for speech recognition. In this paper, two novel methods are proposed for finding out burst and partial duration noisy signals in order to detect real speech from non-stationary noise and to improve the performance of continuous Mandarin speech recognition system. For the burst and low-energy noise adjacent to speech segment, a method using initial and final part of Chinese syllable is applied to detect accurate endpoints of speech based on traditional short-time energy and zero-crossing rates. For the isolated noise with relatively high energy, a short-time energy zero product based method is used. Both of the methods use time-domain features and have low computational complexity. Experimental results show that the system using proposed methods has improved accuracy in voice activity detection.

[1]  Haiying Zhang,et al.  An endpoint detection algorithm based on MFCC and spectral entropy using BP NN , 2010, 2010 2nd International Conference on Signal Processing Systems.

[2]  Masakiyo Fujimoto,et al.  A voice activity detection based on the adaptive integration of multiple speech features and a signal decision scheme , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[3]  Jay G. Wilpon,et al.  Application of hidden Markov models to automatic speech endpoint detection , 1987 .

[4]  S. Casale,et al.  Performance evaluation and comparison of G.729/AMR/fuzzy voice activity detectors , 2002, IEEE Signal Processing Letters.

[5]  M.N.S. Swamy,et al.  An improved voice activity detection using higher order statistics , 2005, IEEE Transactions on Speech and Audio Processing.

[6]  Jean-Claude Junqua,et al.  A study of endpoint detection algorithms in adverse conditions: incidence on a DTW and HMM recognizer , 1991, EUROSPEECH.

[7]  Tuan Van Pham,et al.  Using Artificial Neural Network for Robust Voice Activity Detection Under Adverse Conditions , 2009, 2009 IEEE-RIVF International Conference on Computing and Communication Technologies.

[8]  Alex Acero,et al.  Robust HMM-based endpoint detector , 1993, EUROSPEECH.

[9]  S. Masud,et al.  Support Vector Machine based Voice Activity Detection , 2006, 2006 International Symposium on Intelligent Signal Processing and Communications.