Using Fishervoice to enhance the performance of I-vector based speaker verification system

The i-vector is a popular feature representation in speaker verification systems. In this paper, we combine the Fishervoice algorithm with the i-vector representation to improve speaker verification performance. Applying the Fishervoice model maps each i-vector into a low-dimensional discriminant subspace, which reduces intra-speaker variability and emphasizes discriminative class-boundary information, leading to better recognition performance. Experiments on the NIST SRE 2008 core test task show that the proposed framework achieves relative reductions of 19.9% in equal error rate (EER) and 8.5% in minimum detection cost function (minDCF) compared with the state-of-the-art PLDA-based method.
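
The idea of projecting onto a discriminant subspace can be illustrated with a small sketch. The code below is not the Fishervoice algorithm, whose steps are not given in this abstract. It uses the classical two-class Fisher linear discriminant as a stand-in, applied to made-up 2-D points that play the role of i-vectors from two speakers. The toy data and all function names are assumptions chosen for illustration only.

```python
# Toy illustration: classical Fisher linear discriminant in 2-D.
# The "i-vectors" are invented points; real i-vectors have hundreds of dimensions.

def mean(vectors):
    """Component-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]

def within_scatter(classes):
    """Sum of the 2x2 scatter matrices of each class around its own mean."""
    s = [[0.0, 0.0], [0.0, 0.0]]
    for vectors in classes:
        m = mean(vectors)
        for v in vectors:
            d = [v[0] - m[0], v[1] - m[1]]
            for i in range(2):
                for j in range(2):
                    s[i][j] += d[i] * d[j]
    return s

def fisher_direction(class_a, class_b):
    """Unit vector proportional to Sw^-1 (mean_a - mean_b)."""
    sw = within_scatter([class_a, class_b])
    det = sw[0][0] * sw[1][1] - sw[0][1] * sw[1][0]
    inv = [[sw[1][1] / det, -sw[0][1] / det],
           [-sw[1][0] / det, sw[0][0] / det]]
    ma, mb = mean(class_a), mean(class_b)
    diff = [ma[0] - mb[0], ma[1] - mb[1]]
    w = [inv[0][0] * diff[0] + inv[0][1] * diff[1],
         inv[1][0] * diff[0] + inv[1][1] * diff[1]]
    norm = (w[0] ** 2 + w[1] ** 2) ** 0.5
    return [w[0] / norm, w[1] / norm]

def project(vectors, w):
    """Project each 2-D vector onto direction w, giving one scalar per vector."""
    return [v[0] * w[0] + v[1] * w[1] for v in vectors]

def fisher_ratio(proj_a, proj_b):
    """Squared distance between class means divided by within-class scatter."""
    ma = sum(proj_a) / len(proj_a)
    mb = sum(proj_b) / len(proj_b)
    within = sum((x - ma) ** 2 for x in proj_a) + sum((x - mb) ** 2 for x in proj_b)
    return (ma - mb) ** 2 / within

# Two hypothetical speakers whose sessions vary strongly along the diagonal.
speaker_a = [(1.0, 2.0), (2.0, 3.1), (3.0, 3.9), (4.0, 5.0)]
speaker_b = [(1.0, 0.0), (2.0, 1.1), (3.0, 1.9), (4.0, 3.0)]

w = fisher_direction(speaker_a, speaker_b)
ratio_raw_axis = fisher_ratio(project(speaker_a, [0.0, 1.0]),
                              project(speaker_b, [0.0, 1.0]))
ratio_fisher = fisher_ratio(project(speaker_a, w), project(speaker_b, w))

print("separation on raw second axis:", ratio_raw_axis)
print("separation on Fisher direction:", ratio_fisher)
```

In this toy case the Fisher direction gives a much larger ratio of between-speaker distance to within-speaker scatter than either raw coordinate does. This is the general effect the abstract attributes to the discriminant subspace: less intra-speaker variability relative to the separation between speakers.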
