Sequence Kernels for Speaker and Speech Recognition

AURORA 2 small vocabulary digit string recognition task– whole-word models, 16 emitting-states with 3 components per state– clean training data for HMM training - HTK parameterisation– SVMs trained on subset of multi-style data - Set A N2-N4, 10-20dB SNR– Set A N1 and Set B and Set C unseen noise conditions– Noise estimated in a ML-fashion for each utterance

[1]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[2]  Mark J. F. Gales,et al.  Speech Recognition using SVMs , 2001, NIPS.

[3]  Mehryar Mohri,et al.  Weighted automata kernels - general framework and algorithms , 2003, INTERSPEECH.

[4]  Mark J. F. Gales,et al.  Derivative and parametric kernels for speaker verification , 2007, INTERSPEECH.

[5]  William M. Campbell,et al.  Advances in channel compensation for SVM speaker recognition , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[6]  Mark J. F. Gales,et al.  Augmented Statistical Models for Speech Recognition , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[7]  Mark J. F. Gales,et al.  Acoustic Modelling Using Continuous Rational Kernels , 2005, 2005 IEEE Workshop on Machine Learning for Signal Processing.

[8]  Kiyoshi Asai,et al.  Marginalized kernels for biological sequences , 2002, ISMB.

[9]  Liang Lu,et al.  Cluster adaptive training weights as features in SVM-based speaker verification , 2007, INTERSPEECH.

[10]  Douglas E. Sturim,et al.  SVM Based Speaker Verification using a GMM Supervector Kernel and NAP Variability Compensation , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[11]  Shantanu Chakrabartty,et al.  Support vector machines for segmental minimum Bayes risk decoding of continuous speech , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).

[12]  David Haussler,et al.  Exploiting Generative Models in Discriminative Classifiers , 1998, NIPS.

[13]  Jeff A. Bilmes,et al.  Graphical models and automatic speech recognition , 2002 .

[14]  Andreas Stolcke,et al.  Within-class covariance normalization for SVM-based speaker recognition , 2006, INTERSPEECH.

[15]  William M. Campbell,et al.  Support vector machines for speaker and language recognition , 2006, Comput. Speech Lang..

[16]  Nello Cristianini,et al.  Classification using String Kernels , 2000 .

[17]  Steve Renals,et al.  Speaker verification using sequence discriminant support vector machines , 2005, IEEE Transactions on Speech and Audio Processing.

[18]  Mark J. F. Gales,et al.  Maximum likelihood linear transformations for HMM-based speech recognition , 1998, Comput. Speech Lang..

[19]  Andreas Stolcke,et al.  MLLR transforms as features in speaker recognition , 2005, INTERSPEECH.

[20]  Li Deng,et al.  HMM adaptation using vector taylor series for noisy speech recognition , 2000, INTERSPEECH.

[21]  M. J. F. Gales,et al.  DISCRIMINATIVE CLASSIFIERS WITH GENERATIVE KERNELS FOR NOISE ROBUST SPEECH RECOGNITION , 2008 .