论文信息 - Complexity of the TDNN Acoustic Model with Respect to the HMM Topology - 字舞流文

Complexity of the TDNN Acoustic Model with Respect to the HMM Topology

Josef V. Psutka | Ales Prazák | Jan Vanek

[1] Lukás Burget,et al. Semi-supervised training of Deep Neural Networks , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.

[2] Li Deng,et al. Inaugural Editorial: Riding the Tidal Wave of Human-Centric Information Processing - Innovate, Outreach, Collaborate, Connect, Expand, and Win , 2012, IEEE Trans. Speech Audio Process..

[3] Geoffrey E. Hinton,et al. Phoneme recognition using time-delay neural networks , 1989, IEEE Trans. Acoust. Speech Signal Process..

[4] Sanjeev Khudanpur,et al. End-to-end Speech Recognition Using Lattice-free MMI , 2018, INTERSPEECH.

[5] Erich Elsen,et al. Deep Speech: Scaling up end-to-end speech recognition , 2014, ArXiv.

[6] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .

[7] Sanjeev Khudanpur,et al. A time delay neural network architecture for efficient modeling of long temporal contexts , 2015, INTERSPEECH.

[8] Dong Yu,et al. Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[9] Kyu J. Han,et al. Deep Learning-Based Telephony Speech Recognition in the Wild , 2017, INTERSPEECH.

[10] Steve Young,et al. The HTK hidden Markov model toolkit: design and philosophy , 1993 .

[11] Daniel Soutner,et al. Web Text Data Mining for Building Large Scale Language Modelling Corpus , 2011, TSD.

[12] Jürgen Schmidhuber,et al. Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks , 2006, ICML.

[13] Yiming Wang,et al. Purely Sequence-Trained Neural Networks for ASR Based on Lattice-Free MMI , 2016, INTERSPEECH.

[14] Lukás Burget,et al. Sequence-discriminative training of deep neural networks , 2013, INTERSPEECH.

[15] Andrew W. Senior,et al. Fast and accurate recurrent neural network acoustic models for speech recognition , 2015, INTERSPEECH.

[16] Lalit R. Bahl,et al. Maximum mutual information estimation of hidden Markov model parameters for speech recognition , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[17] Jan Vanek,et al. Optimized Acoustic Likelihoods Computation for NVIDIA and ATI/AMD Graphics Processors , 2012, IEEE Transactions on Audio, Speech, and Language Processing.