Variable parameter speaker verification system based on hidden Markov modeling

A text-independent speaker verification system based on an adaptive vocal tract model which emulates the vocal tract of the speaker is described. Each speaker is represented by a set of feature vectors derived from speech segments belonging to different classes of phonemes. Linear predictive hidden Markov modeling and maximum-likelihood Viterbi decoding are applied to a speech utterance to obtain different classes of phonemes pronounced by a speaker. It is shown that different classes of phonemes are not equally effective in discriminating between speakers and that verification performance can be considerably improved by separately classifying speech segments representing each broad phonetic category as belonging to an impostor or as belonging to the true speaker. A weighted linear combination of scores for individual categories can be used as the final verification score. The weights are chosen to reflect the effectiveness of particular classes of phonemes in discriminating between speakers and are adjusted to maximize the verification performance.<<ETX>>

[1]  J. Makhoul,et al.  Linear prediction: A tutorial review , 1975, Proceedings of the IEEE.

[2]  M. Savic,et al.  A TMs32020-based real time, text-independent, automatic speaker verification system , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[3]  A.E. Rosenberg,et al.  Automatic speaker verification: A review , 1976, Proceedings of the IEEE.

[4]  L. Rabiner,et al.  An introduction to hidden Markov models , 1986, IEEE ASSP Magazine.

[5]  A. B. Poritz,et al.  Linear predictive hidden Markov models and the speech signal , 1982, ICASSP.

[6]  Jr. G. Forney,et al.  The viterbi algorithm , 1973 .

[7]  B.S. Atal,et al.  Automatic recognition of speakers from their voices , 1976, Proceedings of the IEEE.