Paper information - A New Method Based on HMMs and K-Means Algorithms for Noise-Robust Voice Activity Detector

In this paper, we propose combining left-right hidden Markov models (HMMs) with a k-means threshold on the likelihood ratio test (LRT) to identify the start and end of speech. Rather than using a single model with two states, the method builds two separate models, one for non-speech and one for speech, and each model can include several states. In the experiments, we compare the voice activity detection (VAD) results of a two-state hidden semi-Markov model (HSMM) with those of the proposed algorithm. We also compare the accuracy and robustness of the k-means threshold and the adaptive threshold under background noise at a high signal-to-noise ratio (SNR). The results show that the k-means threshold is more effective than the adaptive threshold and that the proposed method outperforms the two-state HSMM-based VAD, especially in low-SNR environments.
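
The abstract does not spell out how the k-means threshold is derived, so the following is only a minimal sketch of one plausible reading. It assumes that a per-frame log-likelihood ratio between the speech and non-speech models has already been computed, clusters those scores into two groups with 1-D k-means, and takes the midpoint of the two centroids as the decision threshold. The function names `kmeans_threshold` and `detect_speech`, the midpoint rule, and the sample scores are illustrative assumptions, not details taken from the paper.

```python
def kmeans_threshold(scores, iterations=50):
    """Cluster 1-D scores into two groups and return the centroid midpoint.

    Assumption: `scores` holds per-frame log-likelihood ratios
    (speech model vs. non-speech model), computed elsewhere.
    """
    if not scores:
        raise ValueError("scores must be non-empty")
    # Initialise the two centroids at the extremes of the data.
    lo, hi = min(scores), max(scores)
    for _ in range(iterations):
        # In 1-D, nearest-centroid assignment is a comparison with the midpoint.
        mid = (lo + hi) / 2.0
        low = [s for s in scores if s <= mid]
        high = [s for s in scores if s > mid]
        # Keep the old centroid if a cluster happens to be empty.
        new_lo = sum(low) / len(low) if low else lo
        new_hi = sum(high) / len(high) if high else hi
        if new_lo == lo and new_hi == hi:
            break  # centroids stopped moving
        lo, hi = new_lo, new_hi
    return (lo + hi) / 2.0


def detect_speech(scores):
    """Label each frame as speech (True) when its score exceeds the threshold."""
    threshold = kmeans_threshold(scores)
    return [s > threshold for s in scores]


# Hypothetical scores: low values for non-speech frames, high for speech frames.
frame_scores = [-2.0, -1.5, -1.8, 3.0, 3.5, 2.8, -1.9]
labels = detect_speech(frame_scores)
```

Because the threshold is recomputed from the scores themselves, it moves with the noise level instead of being fixed in advance, which is the general motivation for a data-driven threshold. The paper's actual procedure may differ.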

Zheng Pei | Bing Luo | Li Xu | Da Li Hu
