Ensemble of Support Vector Machine for Text-Independent Speaker Recognition

Summary In this paper, the ensemble of support vector machines is applied to text-independent speaker recognition, and the bagging-like model and boosting-like model are proposed by adopted the ensemble idea. The purposes of adopting this idea are to deal with the large scale speech data and improve the performance of speaker recognition. The distance-based and probability-based scoring methods are used to score the new utterance. Compared with the conventional vector-based speaker models (Vector Quantization and Gaussian Mixture Model), our method is hyperplan-based. The experiments have been run on the YOHO database, and the results show that our models can get attractive performances.

[1]  Herbert Gish,et al.  Speaker identification via support vector classifiers , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[2]  Xin Dong,et al.  Speaker recognition using continuous density support vector machines , 2001 .

[3]  Gunnar Rätsch,et al.  An Introduction to Boosting and Leveraging , 2002, Machine Learning Summer School.

[4]  Giorgio Valentini,et al.  Ensembles of Learning Machines , 2002, WIRN.

[5]  Samy Bengio,et al.  A Parallel Mixture of SVMs for Very Large Scale Problems , 2001, Neural Computation.

[6]  Roland Auckenthaler,et al.  Score Normalization for Text-Independent Speaker Verification Systems , 2000, Digit. Signal Process..

[7]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[8]  David Haussler,et al.  Exploiting Generative Models in Discriminative Classifiers , 1998, NIPS.

[9]  John Platt,et al.  Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .

[10]  Hyun-Chul Kim,et al.  Constructing support vector machine ensemble , 2003, Pattern Recognit..

[11]  John Platt,et al.  Fast training of svms using sequential minimal optimization , 1998 .

[12]  James T. Kwok Moderating the outputs of support vector machine classifiers , 1999, IEEE Trans. Neural Networks.

[13]  Geoffrey E. Hinton,et al.  Adaptive Mixtures of Local Experts , 1991, Neural Computation.

[14]  William M. Campbell,et al.  Support vector machines for speaker verification and identification , 2000, Neural Networks for Signal Processing X. Proceedings of the 2000 IEEE Signal Processing Society Workshop (Cat. No.00TH8501).

[15]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[16]  Christian Pellegrini,et al.  Local experts combination through density decomposition , 1999, AISTATS.