An analysis framework of two-level sampling subspace for speaker verification

Using high-dimensional Joint Factor Analysis (JFA) speaker supervectors for Fishervoice-based subspace analysis incurs high computational complexity during model training. To address this problem, we propose a two-level sampling subspace framework. At the first level, a subset of the mean vectors is selected from the JFA speaker supervector to form a low-dimensional feature vector. At the second level, principal component analysis (PCA) is first applied to reduce the dimension of this feature vector. Several classifiers are then constructed on a collection of random subspaces generated by randomly sampling the reduced feature space. Finally, the outputs of all classifiers are fused to obtain the final decision. Experimental results on NIST08 show that the proposed framework reduces the equal error rate (EER) by 13.8% and 7.2% relative to JFA and Fishervoice, respectively. The minimum detection cost function (minDCF) is reduced to 2.19 with the new model.
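The two sampling levels described above can be sketched as follows. This is only an illustrative outline under stated assumptions, not the authors' implementation: the supervectors are synthetic, the function names (`select_mean_vectors`, `fit_pca`, `random_subspaces`, `fused_score`) and all sizes are hypothetical, the subset of mean vectors is chosen at random, and plain cosine scoring with score averaging stands in for the Fishervoice classifiers and the fusion rule, which the abstract does not specify.

```python
import numpy as np

def select_mean_vectors(supervectors, n_components, feat_dim, n_keep, rng):
    """Level 1: keep the mean vectors of n_keep randomly chosen components."""
    chosen = np.sort(rng.choice(n_components, size=n_keep, replace=False))
    blocks = supervectors.reshape(len(supervectors), n_components, feat_dim)
    return blocks[:, chosen, :].reshape(len(supervectors), n_keep * feat_dim)

def fit_pca(X, n_dims):
    """Level 2a: PCA via SVD; returns the data mean and a projection basis."""
    mean = X.mean(axis=0)
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt[:n_dims].T

def random_subspaces(n_dims, subspace_dim, n_subspaces, rng):
    """Level 2b: each subspace is a random subset of the PCA dimensions."""
    return [np.sort(rng.choice(n_dims, size=subspace_dim, replace=False))
            for _ in range(n_subspaces)]

def cosine_score(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12))

def fused_score(enroll, test, subspaces):
    """Score a trial in every subspace and fuse by simple averaging (assumed)."""
    return float(np.mean([cosine_score(enroll[idx], test[idx]) for idx in subspaces]))

# Synthetic stand-in data: 20 speakers x 5 sessions, 16 components x 10 dims.
rng = np.random.default_rng(0)
n_speakers, n_sessions, n_components, feat_dim = 20, 5, 16, 10
speaker_means = rng.normal(size=(n_speakers, n_components * feat_dim))
supervectors = (np.repeat(speaker_means, n_sessions, axis=0)
                + 0.1 * rng.normal(size=(n_speakers * n_sessions, n_components * feat_dim)))

low_dim = select_mean_vectors(supervectors, n_components, feat_dim, n_keep=4, rng=rng)
pca_mean, basis = fit_pca(low_dim, n_dims=15)
projected = (low_dim - pca_mean) @ basis
subspaces = random_subspaces(n_dims=15, subspace_dim=8, n_subspaces=10, rng=rng)

same_speaker = fused_score(projected[0], projected[1], subspaces)
different_speaker = fused_score(projected[0], projected[n_sessions], subspaces)
print(f"same speaker: {same_speaker:.3f}, different speaker: {different_speaker:.3f}")
```

The computational saving comes from the first level: the PCA in this sketch is fitted on 40-dimensional vectors rather than on the full 160-dimensional supervectors, and each classifier then works in an 8-dimensional random subspace.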
