论文信息 - A discriminative training algorithm for VQ-based speaker identification

A discriminative training algorithm for VQ-based speaker identification

A novel method, referred to as group vector quantization (GVQ), is proposed to train VQ codebooks for closed-set speaker identification. In GVQ training, speaker codebooks are optimized for vector groups rather than for individual vectors. An evaluation experiment has been conducted to compare the codebooks trained by the Linde-Buzo-Grey (LBG), the learning vector quantization (LVQ), and the GVQ algorithms. It is shown that the frame scores from the GVQ trained codebooks are less correlated, therefore, the sentence level speaker identification rate increases more quickly with the length of test sentences.

[1] Aaron E. Rosenberg,et al. Evaluation of a vector quantization talker recognition system in text independent and text dependent modes , 1987 .

[2] Robert M. Gray,et al. An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[3] Günther Palm,et al. A discriminative training algorithm for Gaussian mixture speaker models , 1997, EUROSPEECH.

[4] Joseph Picone,et al. Signal modeling techniques in speech recognition , 1993, Proc. IEEE.

[5] Keinosuke Fukunaga,et al. Introduction to Statistical Pattern Recognition , 1972 .

[6] Douglas A. Reynolds,et al. Robust text-independent speaker identification using Gaussian mixture speaker models , 1995, IEEE Trans. Speech Audio Process..

[7] Teuvo Kohonen,et al. The self-organizing map , 1990 .

[8] Günther Palm,et al. A text-independent speaker identification system based on neural networks , 1994, ICSLP.

[9] Ming-Shih Chen,et al. Speaker Identification Based on a Matrix Quantization Method , 1993, IEEE Trans. Signal Process..

[10] Keinosuke Fukunaga,et al. Chapter 3 – HYPOTHESIS TESTING , 1990 .