A speaker recognition algorithm based on factor analysis

Channel interference factor for the identification result is prevalent among the existing speaker recognition algorithms. In order to improve the accuracy of the algorithm, the paper utilizes the technique of latent factor analysis(LFA) to deal with the channel factors in the speaker's Gaussian Mixture Model(GMM). In the endpoint detection phase of speaker recognition, the algorithm introduces the GMM for speech modeling to accurately determine the beginning and ending points of the speech segment, and then establish speaker GMM. The algorithm use factor analysis technique to fit the differences between the speaker characteristics space and the channel space, and removes channel factor in speaker's GMM. And then the algorithm extracts GMM super-vectors as the input of Support Vector Machine(SVM) to obtain recognition results. Experimental results show that the combination of factor analysis and SVM can obtain better recognition rate and ensure the robustness of the recognition algorithm.

[1]  Rong Zheng,et al.  Text-independent speaker identification using GMM-UBM and frame level likelihood normalization , 2004, 2004 International Symposium on Chinese Spoken Language Processing.

[2]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[3]  Chih-Ta Yen,et al.  Enhancing GMM speaker identification by incorporating SVM speaker verification for intelligent web-based speech applications , 2013, Multimedia Tools and Applications.

[4]  Haizhou Li,et al.  An overview of text-independent speaker recognition: From features to supervectors , 2010, Speech Commun..

[5]  Stefan-Adrian Toma,et al.  Automatic speaker verification experiments using HMM , 2010, 2010 8th International Conference on Communications.

[6]  Chai Wutiwiwatchai,et al.  Comparative Study of Continuous Hidden Markov Models (CHMM) and Artificial Neural Network (ANN) on Speaker Identification System , 2001, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[7]  Alessio Brutti,et al.  Geometric contamination for GMM/UBM speaker verification in reverberant environments , 2013, INTERSPEECH.

[8]  K.L. Brown,et al.  Text-independent speaker identification , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[9]  Douglas E. Sturim,et al.  Support vector machines using GMM supervectors for speaker verification , 2006, IEEE Signal Processing Letters.

[10]  K. Sreenivasa Rao,et al.  Importance of Utterance Partitioning in SVM Classifier with GMM Supervectors for Text-Independent Speaker Verification , 2013, MIKE.

[11]  Man-Wai Mak,et al.  A study of voice activity detection techniques for NIST speaker recognition evaluations , 2014, Comput. Speech Lang..

[12]  Lukás Burget,et al.  Analysis of Feature Extraction and Channel Compensation in a GMM Speaker Recognition System , 2007, IEEE Transactions on Audio, Speech, and Language Processing.