A study of discriminative feature extraction for i-vector based acoustic sniffing in IVN acoustic model training

Recently, we proposed an i-vector approach to acoustic sniffing for irrelevant variability normalization based acoustic model training in large vocabulary continuous speech recognition (LVCSR). Its effectiveness has been confirmed by experimental results on Switchboard- 1 conversational telephone speech transcription task. In this paper, we study several discriminative feature extraction approaches in i-vector space to improve both recognition accuracy and run-time efficiency. New experimental results are reported on a much larger scale LVCSR task with about 2000 hours training data.

[1]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[2]  David Miller,et al.  The Fisher Corpus: a Resource for the Next Generations of Speech-to-Text , 2004, LREC.

[3]  Shigeru Katagiri,et al.  Discriminative metric design for robust pattern recognition , 1997, IEEE Trans. Signal Process..

[4]  Patrick Kenny,et al.  Front-End Factor Analysis for Speaker Verification , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[5]  Zhi-Jie Yan,et al.  A study of an irrelevant variability normalization based discriminative training approach for LVCSR , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[6]  Zhi-Jie Yan,et al.  A new i-vector approach and its application to irrelevant variability normalization based acoustic model training , 2011, 2011 IEEE International Workshop on Machine Learning for Signal Processing.

[7]  Zhi-Jie Yan,et al.  An i-vector Based Approach to Acoustic Sniffing for Irrelevant Variability Normalization Based Acoustic Model Training and Speech Recognition , 2011, INTERSPEECH.

[8]  John J. Godfrey,et al.  SWITCHBOARD: telephone speech corpus for research and development , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  Qiang Huo,et al.  A study of irrelevant variability normalization based training and unsupervised online adaptation for LVCSR , 2010, INTERSPEECH.

[10]  Teng Gao,et al.  DESIGNING ANMPI-BASED PARALLEL AND DISTRIBUTED MACHINE LEARNING PLATFORM ON LARGE-SCALE HPC CLUSTERS , 2012 .