New background modeling for speaker verification

A new background speaker modelling method is presented in this paper for text-independent speaker verification using Gaussian mixture models. This method does not require speech databases of other speakers to build background speaker models. A background model can be built directly from the same claimed speaker's database and has a smaller number of Gaussian mixtures compared to the claimed speaker model. Experiments performed on the YOHO database showed a better result for speaker verification using the 64-mixture claimed speaker model and 16-mixture background model compared to current background model set methods using five closest background models.

[1]  Chin-Hui Lee,et al.  Speaker verification using normalized log-likelihood score , 1996, IEEE Trans. Speech Audio Process..

[2]  Douglas A. Reynolds,et al.  Speaker identification and verification using Gaussian mixture speaker models , 1995, Speech Commun..

[3]  Chin-Hui Lee,et al.  Background model design for flexible and portable speaker verification systems , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[4]  Douglas A. Reynolds,et al.  Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..

[5]  Corneliu Burileanu,et al.  On Performance Improvement of a Speaker Verification System Using Vector Quantization, Cohorts and Hybrid Cohort-World Models , 2002, Int. J. Speech Technol..

[6]  Jun-ichi Takahashi,et al.  A new cohort normalization using local acoustic information for speaker verification , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[7]  Biing-Hwang Juang,et al.  The use of cohort normalized scores for speaker verification , 1992, ICSLP.

[8]  Dat Tran,et al.  A proposed likelihood transformation for speaker verification , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[9]  Chin-Hui Lee,et al.  A new approach to utterance verification based on neighborhood information in model space , 2003, IEEE Trans. Speech Audio Process..

[10]  Thomas Hain,et al.  The 1997 HTK broadcast news transcription system , 1998 .

[11]  Dat Tran,et al.  Fuzzy normalisation methods for speaker verification , 2000, INTERSPEECH.

[12]  Seiichi Nakagawa,et al.  Speaker verification using frame and utterance level likelihood normalization , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[13]  Lawrence G. Bahler,et al.  Speaker verification using randomized phrase prompting , 1991, Digit. Signal Process..

[14]  Sadaoki Furui,et al.  Concatenated phoneme models for text-variable speaker recognition , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.