A text-independent speaker recognition system based on Probabilistic Principle Component Analysis

To alleviate the problem of severe degradation of speaker recognition performance because of the phoneme variability between training and testing speech data, in the text-independent speaker recognition system. The paper proposed a text-independent (TI) speaker identification method that suppresses the phonetic information by a subspace method, Probabilistic Principle Component Analysis (PPCA) is utilized to construct these subspaces. Firstly, the covariance matrix was obtained from the large training speech feature data, and then the projection matrix was obtained using the EM algorithm. In the proposed method, it is assumed that a subspace with large variance in the speech feature space is a “phoneme-dependent subspace” and a complementary subspace of it is a “phoneme-independent subspace”, the feature vectors of train/test speech data are projected to a phoneme-independent subspace and a new feature vectors are obtained. In GMM-based TI speaker identification experiments, the new feature vectors improves the identification rate by 16.25% and 2.99% respectively, compared with conventional MFCC, PCA-based MFCC. It shows that the new feature vectors of the proposed method can efficiently capture speaker-discriminative information, and suppress the other speech information.

[1]  Christopher M. Bishop,et al.  Mixtures of Probabilistic Principal Component Analyzers , 1999, Neural Computation.

[2]  Wu Zhaohui,et al.  Experimental evaluation of a new speaker identification framework using PCA , 2003, SMC'03 Conference Proceedings. 2003 IEEE International Conference on Systems, Man and Cybernetics. Conference Theme - System Security and Assurance (Cat. No.03CH37483).

[3]  S. Kuroiwa,et al.  Text-independent speaker identification based on feature transformation to phoneme-independent subspace , 2008, 2008 11th IEEE International Conference on Communication Technology.

[4]  Zhaohui Wu,et al.  Experimental evaluation of a new speaker identification framework using PCA , 2003, SMC.

[5]  A. Alcaim,et al.  On the use of PCA in GMM and AR-vector models for text independent speaker verification , 2002, 2002 14th International Conference on Digital Signal Processing Proceedings. DSP 2002 (Cat. No.02TH8628).

[6]  Changwoo Seo,et al.  GMM based on local PCA for speaker identification , 2001 .

[7]  Michael E. Tipping,et al.  Probabilistic Principal Component Analysis , 1999 .

[8]  Masafumi Nishida,et al.  Speaker verification by integrating dynamic and static features using subspace method , 2000, INTERSPEECH.

[9]  James T. Kwok,et al.  Bilinear Probabilistic Principal Component Analysis , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[10]  Masafumi Nishida,et al.  Speaker recognition by separating phonetic space and speaker space , 2001, INTERSPEECH.