Paper information - Joint frame and Gaussian selection for text independent speaker verification

Gaussian selection is a technique applied in the GMM-UBM framework to accelerate score calculation. We have recently introduced a novel Gaussian selection method known as sorted GMM (SGMM). SGMM uses scalar indexing of the universal background model mean vectors to achieve fast search of the top-scoring Gaussians. In the present work we extend this method by using 2-dimensional indexing, which leads to simultaneous frame and Gaussian selection. Our results on the NIST 2002 speaker recognition evaluation corpus indicate that both the 1- and 2-dimensional SGMMs outperform frame decimation and temporal tracking of top-scoring Gaussians by a wide margin (in terms of Gaussian computations relative to GMM-UBM as baseline).
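
The idea of scalar indexing can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which scalar is used as the sorting key or how the search window is chosen, so the key (the sum of a vector's components), the fixed-width window, and all names (`SortedGMM`, `frame_loglik`, `window`) are assumptions made for illustration. The sketch covers only the 1-dimensional case and uses synthetic data; the 2-dimensional indexing and frame selection described above are not shown.

```python
import bisect
import math
import random


def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at x."""
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )


def logsumexp(values):
    """Numerically stable log(sum(exp(v))) for a non-empty list."""
    top = max(values)
    return top + math.log(sum(math.exp(v - top) for v in values))


class SortedGMM:
    """Mixture whose components are stored in order of a scalar key.

    Assumption: the key is the sum of the mean vector's components.
    """

    def __init__(self, weights, means, variances, key=sum):
        self.key = key
        order = sorted(range(len(means)), key=lambda i: key(means[i]))
        self.weights = [weights[i] for i in order]
        self.means = [means[i] for i in order]
        self.variances = [variances[i] for i in order]
        self.keys = [key(m) for m in self.means]

    def frame_loglik(self, x, window=None):
        """Return (log-likelihood, number of Gaussians evaluated).

        With window=None every component is scored (plain GMM-UBM).
        Otherwise only `window` components around the frame's key are
        scored, located by binary search over the sorted keys.
        """
        n = len(self.means)
        if window is None or window >= n:
            lo, hi = 0, n
        else:
            centre = bisect.bisect_left(self.keys, self.key(x))
            lo = max(0, min(centre - window // 2, n - window))
            hi = lo + window
        terms = [
            math.log(self.weights[i])
            + log_gauss_diag(x, self.means[i], self.variances[i])
            for i in range(lo, hi)
        ]
        return logsumexp(terms), hi - lo


# Synthetic "UBM": 64 unit-variance components in 4 dimensions.
rng = random.Random(0)
dim, n_mix = 4, 64
means = [[rng.gauss(0.0, 3.0) for _ in range(dim)] for _ in range(n_mix)]
variances = [[1.0] * dim for _ in range(n_mix)]
weights = [1.0 / n_mix] * n_mix
ubm = SortedGMM(weights, means, variances)

# A frame close to one of the component means.
frame = [m + rng.gauss(0.0, 0.1) for m in means[10]]
full, n_full = ubm.frame_loglik(frame)
approx, n_eval = ubm.frame_loglik(frame, window=8)
print(f"full: {full:.3f} using {n_full} Gaussians")
print(f"windowed: {approx:.3f} using {n_eval} Gaussians")
```

The windowed score sums a subset of the mixture terms, so it can never exceed the full score. How close it comes depends on how well the scalar key separates the components, which is why the choice of key matters in practice.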

Tomi Kinnunen | Rahim Saeidi | Pasi Fränti | Robert D. Rodman | Hamid Reza Sadegh Mohammadi
