A voice activity detector based on cepstral analysis
暂无分享,去创建一个
This paper proposes a new approach to speech end-point detection based on cepstral analysis. The algorithm is based on explicit (static) modelling of speech and non-speech, and decisions are made on each incoming (overlapped) cepstral frame, according to model similarity scores. The cepstral analysis provides excellent level-independence, meaning that parameter adjustment , decision thresholds etc, are unnecessary. A high degree of robustness to additive noise is demonstrated, even though the models are static. Accurate end-points are recovered with SNR levels of 0dB.
[1] R. Tucker,et al. Voice activity detection using a periodicity measure , 1992 .
[2] L. Rabiner,et al. An algorithm for determining the endpoints of isolated utterances , 1974, The Bell System Technical Journal.
[3] Hyeong-Ho Lee,et al. A Study of On-Off Characteristics of Conversational Speech , 1986, IEEE Trans. Commun..