A KALDI-DNN-based ASR system for Italian
暂无分享,去创建一个
[1] Piero Cosi,et al. On the development of matched and mismatched Italian children's speech recognition systems , 2009, INTERSPEECH.
[2] Jan Cernocký,et al. Improved feature processing for deep neural networks , 2013, INTERSPEECH.
[3] Sanjeev Khudanpur,et al. Parallel training of DNNs with Natural Gradient and Parameter Averaging , 2014 .
[4] Dong Yu,et al. Conversational Speech Transcription Using Context-Dependent Deep Neural Networks , 2012, ICML.
[5] Thomas Hain,et al. Hypothesis spaces for minimum Bayes risk training in large vocabulary speech recognition , 2006, INTERSPEECH.
[6] Fabio Brugnara,et al. Acoustic variability and automatic recognition of children's speech , 2007, Speech Commun..
[7] Dong Yu,et al. Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition , 2012, IEEE Transactions on Audio, Speech, and Language Processing.
[8] Hsiao-Wuen Hon,et al. An overview of the SPHINX speech recognition system , 1990, IEEE Trans. Acoust. Speech Signal Process..
[9] Paul Lamere,et al. Sphinx-4: a flexible open source framework for speech recognition , 2004 .
[10] Brian Kingsbury,et al. Evaluation of Proposed Modifications to MPE for Large Scale Discriminative Training , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.
[11] Yoshua. Bengio,et al. Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..
[12] Lukás Burget,et al. Sequence-discriminative training of deep neural networks , 2013, INTERSPEECH.
[13] Kiyohiro Shikano,et al. Julius - an open source real-time large vocabulary recognition engine , 2001, INTERSPEECH.
[14] Tara N. Sainath,et al. Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.
[15] Geoffrey E. Hinton. A Practical Guide to Training Restricted Boltzmann Machines , 2012, Neural Networks: Tricks of the Trade.
[16] Georg Heigold,et al. The RWTH aachen university open source speech recognition system , 2009, INTERSPEECH.
[17] Kai Feng,et al. The subspace Gaussian mixture model - A structured model for speech recognition , 2011, Comput. Speech Lang..
[18] Xiaohui Zhang,et al. Improving deep neural network acoustic models using generalized maxout networks , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[19] Piero Cosi,et al. Italian children's speech recognition for advanced interactive literacy tutors , 2005, INTERSPEECH.
[20] Daniel Bolaños. The Bavieca open-source speech recognition toolkit , 2012, 2012 IEEE Spoken Language Technology Workshop (SLT).
[21] Piero Cosi,et al. High performance "general purpose" phonetic recognition for Italian , 2000, INTERSPEECH.
[22] Mark Hasegawa-Johnson,et al. Semi-supervised training of Gaussian mixture models by conditional entropy minimization , 2010, INTERSPEECH.
[23] Steve Young,et al. The HTK book version 3.4 , 2006 .
[24] Fabio Tesser,et al. Comparing open source ASR toolkits on Italian children speech , 2014, WOCCI.
[25] Diego Giuliani,et al. Investigating recognition of children's speech , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..
[26] Piero Cosi. Recent advances in sonic Italian children2s speech recognition for interactive literacy tutors , 2008, WOCCI.
[27] Kadri Hacioglu,et al. Recent improvements in the CU Sonic ASR system for noisy speech: the SPINE task , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..
[28] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .