CRSS systems for the NIST i-Vector Machine Learning Challenge

This paper describes the systems developed by the Center for Robust Speech Systems (CRSS), Univ. of Texas Dallas, for the National Institute of Standards and Technology (NIST) iVector challenge. Since the emphasis of this challenge is on utilizing unlabeled development data, our system development focuses on: 1) unsupervised clustering methods to estimate development data labels; 2) build efficient classifier without clustering method. Our results indicate substantial improvements obtained from incorporating one or more of the aforementioned techniques.

[1]  Patrick Kenny,et al.  Bayesian Speaker Verification with Heavy-Tailed Priors , 2010, Odyssey.

[2]  John H. L. Hansen,et al.  CRSS systems for 2012 NIST Speaker Recognition Evaluation , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[3]  Douglas A. Reynolds,et al.  Summary and initial results of the 2013-2014 speaker recognition i-vector machine learning challenge , 2014, INTERSPEECH.

[4]  References , 1971 .

[5]  John H. L. Hansen,et al.  An investigation on back-end for speaker recognition in multi-session enrollment , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[6]  Yun Lei,et al.  A noise-robust system for NIST 2012 speaker recognition evaluation , 2013, INTERSPEECH.

[7]  John H. L. Hansen,et al.  I4u submission to NIST SRE 2012: a large-scale collaborative effort for noise-robust speaker verification , 2013, INTERSPEECH.

[8]  Hynek Hermansky,et al.  Developing a speaker identification system for the DARPA RATS project , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[9]  Kevin Walker,et al.  The RATS radio traffic collection system , 2012, Odyssey.

[10]  Patrick Kenny,et al.  Front-End Factor Analysis for Speaker Verification , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[11]  Nuance - Politecnico di torino's 2012 NIST speaker recognition evaluation system , 2013, INTERSPEECH.

[12]  John H. L. Hansen,et al.  Automatic regularization of cross-entropy cost for speaker recognition fusion , 2013, INTERSPEECH.