Paper information - Sequence Kernels for Speaker and Speech Recognition

AURORA 2 small-vocabulary digit string recognition task:
- whole-word models, 16 emitting states with 3 components per state
- clean training data for HMM training, HTK parameterisation
- SVMs trained on a subset of the multi-style data: Set A N2-N4, 10-20 dB SNR
- Set A N1, Set B and Set C: unseen noise conditions
- noise estimated in an ML fashion for each utterance

Mark J. F. Gales | Federico Flego | Martin Layton | C. Longworth

[1] Vladimir Vapnik, et al. Statistical learning theory, 1998.

[2] Mark J. F. Gales, et al. Speech Recognition using SVMs, 2001, NIPS.

[3] Mehryar Mohri, et al. Weighted automata kernels - general framework and algorithms, 2003, INTERSPEECH.

[4] Mark J. F. Gales, et al. Derivative and parametric kernels for speaker verification, 2007, INTERSPEECH.

[5] William M. Campbell, et al. Advances in channel compensation for SVM speaker recognition, 2005, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '05).

[6] Mark J. F. Gales, et al. Augmented Statistical Models for Speech Recognition, 2006, IEEE International Conference on Acoustics, Speech and Signal Processing Proceedings.

[7] Mark J. F. Gales, et al. Acoustic Modelling Using Continuous Rational Kernels, 2005, IEEE Workshop on Machine Learning for Signal Processing.

[8] Kiyoshi Asai, et al. Marginalized kernels for biological sequences, 2002, ISMB.

[9] Liang Lu, et al. Cluster adaptive training weights as features in SVM-based speaker verification, 2007, INTERSPEECH.

[10] Douglas E. Sturim, et al. SVM Based Speaker Verification using a GMM Supervector Kernel and NAP Variability Compensation, 2006, IEEE International Conference on Acoustics, Speech and Signal Processing Proceedings.

[11] Shantanu Chakrabartty, et al. Support vector machines for segmental minimum Bayes risk decoding of continuous speech, 2003, IEEE Workshop on Automatic Speech Recognition and Understanding.

[12] David Haussler, et al. Exploiting Generative Models in Discriminative Classifiers, 1998, NIPS.

[13] Jeff A. Bilmes, et al. Graphical models and automatic speech recognition, 2002.

[14] Andreas Stolcke, et al. Within-class covariance normalization for SVM-based speaker recognition, 2006, INTERSPEECH.

[15] William M. Campbell, et al. Support vector machines for speaker and language recognition, 2006, Comput. Speech Lang.

[16] Nello Cristianini, et al. Classification using String Kernels, 2000.

[17] Steve Renals, et al. Speaker verification using sequence discriminant support vector machines, 2005, IEEE Transactions on Speech and Audio Processing.

[18] Mark J. F. Gales, et al. Maximum likelihood linear transformations for HMM-based speech recognition, 1998, Comput. Speech Lang.

[19] Andreas Stolcke, et al. MLLR transforms as features in speaker recognition, 2005, INTERSPEECH.

[20] Li Deng, et al. HMM adaptation using vector Taylor series for noisy speech recognition, 2000, INTERSPEECH.

[21] M. J. F. Gales, et al. Discriminative Classifiers with Generative Kernels for Noise Robust Speech Recognition, 2008.