New cepstrum frequency scale for neural network speaker verification
The influence of cepstrum parameters on text-dependent speaker verification and speech recognition is investigated. Experiments are performed to establish how relevant various resonant frequencies and frequency bands are to speech recognition and to speaker recognition. A Romanian database of eighteen isolated words is used. The filter-bank analysis suggests a new frequency scale, to replace the currently used mel scale, for extracting cepstrum coefficients from the speech signal. The proposed scale gives better speaker verification performance than the mel scale. Speech recognition and speaker verification are both carried out by a neural network system comprising a self-organizing feature map (SOFM) and a multilayer perceptron (MLP).
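For orientation, the mel-scale baseline referred to above can be sketched as follows. This is a minimal illustration, not the method of the paper: the abstract does not give the proposed scale, so only the mel scale is shown, using one commonly quoted formula (2595 log10(1 + f/700)). The function names, the filter count, and the band edges are assumptions chosen for the example. The warping function is passed as a parameter so that a different frequency scale could be substituted when placing the filter-bank center frequencies.

```python
import math


def hz_to_mel(f_hz):
    """Map frequency in Hz to mels (one commonly used formula; an assumption here)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(m):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def center_frequencies(n_filters, f_min, f_max, warp=hz_to_mel, unwarp=mel_to_hz):
    """Place n_filters center frequencies uniformly on the warped axis.

    warp/unwarp are hypothetical hooks: any monotonic frequency scale and its
    inverse can be supplied in place of the mel scale.
    """
    lo, hi = warp(f_min), warp(f_max)
    step = (hi - lo) / (n_filters + 1)
    return [unwarp(lo + step * (i + 1)) for i in range(n_filters)]


# Illustrative values only: 20 filters between 100 Hz and 4000 Hz.
mel_centers = center_frequencies(20, 100.0, 4000.0)

# The same routine with an identity warp gives a linear scale for comparison.
linear_centers = center_frequencies(
    20, 100.0, 4000.0, warp=lambda f: f, unwarp=lambda f: f
)
```

On the mel scale the centers are packed more densely at low frequencies and spread out at high frequencies. A scale tuned for speaker verification would redistribute them according to which bands carry the most speaker-specific information.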