A free Kazakh speech database and a speech recognition baseline
暂无分享,去创建一个
Thomas Fang Zheng | Dong Wang | Zhiyuan Tang | Ying Shi | Askar Hamdullah | Dong Wang | T. Zheng | Zhiyuan Tang | Ying Shi | Askar Hamdullah
[1] James Baker,et al. A historical perspective of speech recognition , 2014, CACM.
[2] Sanjeev Khudanpur,et al. Parallel training of DNNs with Natural Gradient and Parameter Averaging , 2014 .
[3] Martin Krämer. Vowel Harmony and Correspondence Theory , 2003 .
[4] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[5] Andreas Stolcke,et al. SRILM - an extensible language modeling toolkit , 2002, INTERSPEECH.
[6] Dong Yu,et al. Automatic Speech Recognition: A Deep Learning Approach , 2014 .
[7] Carla Lopes,et al. Phone Recognition on the TIMIT Database , 2012 .
[8] Janet M. Baker,et al. The Design for the Wall Street Journal-based CSR Corpus , 1992, HLT.
[9] Yoshua Bengio,et al. End-to-end attention-based large vocabulary speech recognition , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[10] Tara N. Sainath,et al. Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.
[11] John J. Godfrey,et al. SWITCHBOARD: telephone speech corpus for research and development , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[12] Dong Yu,et al. Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition , 2012, IEEE Transactions on Audio, Speech, and Language Processing.
[13] Dong Wang,et al. THCHS-30 : A Free Chinese Speech Corpus , 2015, ArXiv.