论文信息 - A text-independent speaker recognition method robust against utterance variations

A text-independent speaker recognition method robust against utterance variations

The authors describe a VQ (vector-quantization)-based text-independent speaker recognition method which is robust against utterance variations. Three techniques are introduced to cope with temporal and text-dependent spectral variations. First, either an ergodic hidden Markov model or a voiced/unvoiced decision is used to classify input speech into broad phonetic classes. Second, a new distance measure, the distortion-intersection measure (DIM), is introduced for calculating VQ distortion of input speech compared to speaker-independent codebooks. Third, a normalization method, talker variability normalization (TVN), is introduced. TVN normalizes parameter variation taking both inter- and intra-speaker variability into consideration. The system was tested using utterances of nine speakers recorded over three years. The combination of the three techniques achieves high speaker identification accuracies of 98.5% using only vocal tract information and 99.0% using both vocal tract and pitch information.<<ETX>>

Sadaoki Furui | Tomoko Matsui

[1] Sadaoki Furui,et al. Text-independent speaker recognition using vocal tract and pitch information , 1990, ICSLP.

[2] John S. D. Mason,et al. Automatically focusing on good discriminating speech segments in speaker recognition , 1990, ICSLP.

[3] M. Savic,et al. Variable parameter speaker verification system based on hidden Markov modeling , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[4] Biing-Hwang Juang,et al. A vector quantization approach to speaker recognition , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5] Sadaoki Furui,et al. Research of individuality features in speech waves and automatic speaker recognition techniques , 1986, Speech Commun..