论文信息 - Discriminative training for speaker identification based on maximum model distance algorithm

Discriminative training for speaker identification based on maximum model distance algorithm

In this paper we apply the maximum model distance (MMD) training to speaker identification and a new selection strategy of competitive speakers is proposed to it. The traditional ML method only utilizes the utterances for each speaker model, which probably leads to a local optimization solution. By maximizing the dissimilarities among those similar speaker models, MMD could add the discriminative capability into the training procedure and then improve the identification performance. Based on the TIMIT corpus, we designed the word and sentence experiments to evaluate this proposed training approach. The results show that the identification performance can be improved greatly when the training data is limited.

Sam Kwong | Q. Y. Hong

[1] Aaron E. Rosenberg,et al. Speaker identification using minimum classification error training , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[2] Francisco Javier Caminero Gil,et al. Discriminative training of GMM for speaker identification , 1996, ICASSP.

[3] Douglas A. Reynolds,et al. Speaker identification and verification using Gaussian mixture speaker models , 1995, Speech Commun..

[4] B. Juang,et al. A study on minimum error discriminative training for speaker recognition , 1995 .

[5] Sam Kwong,et al. A maximum model distance approach for HMM-based speech recognition , 1998, Pattern Recognit..

[6] Sadaoki Furui,et al. An Overview of Speaker Recognition Technology , 1996 .

[7] Sam Kwong,et al. Improved maximum model distance for HMM training , 1999 .