Speaker identification based on discriminative vector quantization

A novel discriminative vector quantization method for speaker identification (DVQSI) is proposed, and its parameters selection is discussed. The vector space of speech features is divided into a number of subspaces and the distribution of the inter speaker variation inside the speech feature vector space is considered. Discriminative weighted average distortion instead of equally weighted average distortion is used in speaker identification (SI). The proposed approach can be considered a generalization of the existing vector quantization (VQ) technique and the experimental results confirm the improved SI accuracy

[1]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[2]  Douglas D. O'Shaughnessy,et al.  Generalized mel frequency cepstral coefficients for large-vocabulary speaker-independent continuous-speech recognition , 1999, IEEE Trans. Speech Audio Process..

[3]  Lawrence G. Bahler,et al.  Voice identification using nearest-neighbor distance measure , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  W. B. Mikhael,et al.  Speaker identification employing redundant vector quantisers , 2002 .

[5]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[6]  Biing-Hwang Juang,et al.  A vector quantization approach to speaker recognition , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  D.M. Weber,et al.  A comparison between hidden Markov models and vector quantization for speech independent speaker recognition , 1993, 1993 IEEE South African Symposium on Communications and Signal Processing.

[8]  Jr. J.P. Campbell,et al.  Speaker recognition: a tutorial , 1997, Proc. IEEE.