A Convenient and Extensible Offline Chinese Speech Recognition System Based on Convolutional CTC Networks
Wang Yong | Wu Guodong | Gong Shuai | Chang Renkai | Hao Tuo
[1] L. Rabiner,et al. An introduction to hidden Markov models , 1986, IEEE ASSP Magazine.
[2] Yanmin Qian,et al. Very Deep Convolutional Neural Networks for Noise Robust Speech Recognition , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[3] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.
[4] Jonathan G. Fiscus,et al. A post-processing system to yield reduced word error rates: Recognizer Output Voting Error Reduction (ROVER) , 1997, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings.
[5] Dimitri Palaz,et al. Analysis of CNN-based speech recognition system using raw speech as input , 2015, INTERSPEECH.
[6] Alex Bateman,et al. An introduction to hidden Markov models. , 2007, Current protocols in bioinformatics.
[7] Jürgen Schmidhuber,et al. Framewise phoneme classification with bidirectional LSTM and other neural network architectures , 2005, Neural Networks.
[8] Jürgen Schmidhuber,et al. Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks , 2006, ICML.
[9] Robert L. Mercer,et al. Class-Based n-gram Models of Natural Language , 1992, CL.
[10] M. S. Ryan,et al. The Viterbi Algorithm , 2009 .
[11] Chong Wang,et al. Deep Speech 2: End-to-End Speech Recognition in English and Mandarin , 2015, ICML.
[12] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.
[13] Bowen Zhou,et al. A Structured Self-attentive Sentence Embedding , 2017, ICLR.
[14] G. Forney Jr.,et al. The Viterbi algorithm , 1973 .
[15] Lawrence R. Rabiner,et al. A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.
[16] D. Rubin,et al. Maximum likelihood from incomplete data via the EM algorithm plus discussions on the paper , 1977 .
[17] Samy Bengio,et al. Tacotron: Towards End-to-End Speech Synthesis , 2017, INTERSPEECH.