Improved i-vector Speaker Verification Based on WCCN and ZT-norm

For the purpose of improving system performance in high channel variability, an improved i-vector speaker verification algorithm is proposed in this paper. Firstly, i-vectors are obtained from GMM-UBM of registered speakers. And then, the weighted linear discriminant analysis is utilized to play the role of channel compensation and dimensionality reduction in i-vectors. By doing this, more discriminant vectors could be extracted. Immediately following, WCCN and ZT-norm are combined to normalize the scores from cosine distance score classifier for the sake of removing channel disturbance. Finally, cosine distance score classifier of high robustness is generated to find target speaker. Experiment results demonstrate that our proposed i-vector system has better performance.

[1]  Patrick Kenny,et al.  Front-End Factor Analysis for Speaker Verification , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[2]  Ping Tan,et al.  A novel SVM Kernel with GMM super-vector based on bhattacharyya distance clustering plus within class covariance normalization , 2015, 2015 11th International Conference on Natural Computation (ICNC).

[3]  Douglas E. Sturim,et al.  Support vector machines using GMM supervectors for speaker verification , 2006, IEEE Signal Processing Letters.

[4]  David A. van Leeuwen,et al.  Improved speaker recognition when using i-vectors from multiple speech sources , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[5]  Pietro Laface,et al.  On the use of i–vector posterior distributions in Probabilistic Linear Discriminant Analysis , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[6]  William M. Campbell,et al.  Advances in channel compensation for SVM speaker recognition , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[7]  Patrick Kenny,et al.  Joint Factor Analysis Versus Eigenchannels in Speaker Recognition , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  Pla Information Method for speaker recognition based on With-in Class Covariance Normalization and SVM , 2009 .

[9]  Heming Zhao,et al.  Joint Factor Analysis of Channel Mismatch in Whispering Speaker Verification , 2012 .

[10]  Jin Young Kim,et al.  Speaker Verification Using A Modified Adaptive GMM Approach Based On Low Rank Matrix Recovery , 2016 .

[11]  Hagai Aronowitz,et al.  Inter dataset variability compensation for speaker recognition , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[12]  D. Matrouf,et al.  A study on the roles of total variability space and session variability modeling in speaker recognition , 2016, Int. J. Speech Technol..