Robust Voice Activity Detection Using the Combination of Short-Term and Long-Term Spectral Patterns

In this paper, we present a robust voice activity detection (VAD) algorithm using the combination of short-term and long-term spectral patterns. We analyze the benefit of short-term and long-term spectral patterns, respectively, when applied to robust VAD. Based on the analysis, we find the combination of short-term and long-term spectral patterns can be used to achieve a higher VAD accuracy than one of them only in noisy environments. We evaluate its performance under four types of noises and six types of signal-to-noise ratio (SNR) conditions. Compared with standard VAD schemes, the evaluation almost demonstrates promising results with the proposed scheme being comparable or favorable over the whole test set for various criterions of the VAD evaluation.

[1]  John H. L. Hansen,et al.  Robust speech activity detection in the presence of noise , 1998, ICSLP.

[2]  Francesco Beritelli,et al.  A robust voice activity detector for wireless communications using soft computing , 1998, IEEE J. Sel. Areas Commun..

[3]  Javier Ramírez,et al.  A new adaptive long-term spectral estimation voice activity detector , 2003, INTERSPEECH.

[4]  Petros Maragos,et al.  Speech event detection using multiband modulation energy , 2005, INTERSPEECH.

[5]  E. Shlomot,et al.  ITU-T Recommendation G.729 Annex B: a silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applications , 1997, IEEE Commun. Mag..

[6]  S.M. Ahadi,et al.  Voice Activity Detection based on Combination of Multiple Features using Linear/Kernel Discriminant Analyses , 2008, 2008 3rd International Conference on Information and Communication Technologies: From Theory to Applications.

[7]  Dongsuk Yook,et al.  Robust Voice Activity Detection Using the Spectral Peaks of Vowel Sounds , 2009 .

[8]  Zdravko Kacic,et al.  A multiconditional robust front-end feature extraction with a noise reduction procedure based on improved spectral subtraction algorithm , 2001, INTERSPEECH.

[9]  Javier Ramírez,et al.  Voice activity detection with noise reduction and long-term spectral divergence estimation , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  Javier Ramírez,et al.  Efficient voice activity detection algorithms using long-term speech information , 2004, Speech Commun..

[11]  Mohammad Hossein Moattar,et al.  A new approach for robust realtime Voice Activity Detection using spectral pattern , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.