Boosting attribute and phone estimation accuracies with deep neural networks for detection-based speech recognition
暂无分享,去创建一个
Chin-Hui Lee | Dong Yu | Li Deng | Sabato Marco Siniscalchi | S. M. Siniscalchi | L. Deng | Dong Yu | Chin-Hui Lee
[1] S.E. Levinson,et al. Structural methods in automatic speech recognition , 1985, Proceedings of the IEEE.
[2] Kenneth Ward Church. Phonological parsing in speech recognition , 1987 .
[3] Janet M. Baker,et al. The Design for the Wall Street Journal-based CSR Corpus , 1992, HLT.
[4] Li Deng,et al. Speech recognition using the atomic speech units constructed from overlapping articulatory features , 1994, EUROSPEECH.
[5] Biing-Hwang Juang,et al. Key-phrase detection and verification for flexible speech understanding , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[6] Katrin Kirchhoff,et al. Robust speech recognition using articulatory information , 1998 .
[7] Biing-Hwang Juang,et al. Flexible speech understanding based on combined key-phrase detection and verification , 1998, IEEE Trans. Speech Audio Process..
[8] Li Deng,et al. Articulatory Features and Associated Production Models Statistical Speech Recognition , 1999 .
[9] Qiang Huo,et al. On adaptive decision rules and decision parameter adaptation for automatic speech recognition , 2000, Proceedings of the IEEE.
[10] Simon King,et al. Detection of phonological features in continuous speech using neural networks , 2000, Comput. Speech Lang..
[11] Geoffrey E. Hinton. Training Products of Experts by Minimizing Contrastive Divergence , 2002, Neural Computation.
[12] Karen Livescu,et al. Feature-based pronunciation modeling for automatic speech recognition , 2005 .
[13] Chin-Hui Lee,et al. Towards bottom-up continuous phone recognition , 2007, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU).
[14] Biing-Hwang Juang,et al. An overview on automatic speech attribute transcription (ASAT) , 2007, INTERSPEECH.
[15] Michael Picheny,et al. Articulatory feature detection with Support Vector Machines for integration into ASR and phone recognition , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.
[16] Chin-Hui Lee,et al. A study on integrating acoustic-phonetic information into lattice rescoring for automatic speech recognition , 2009, Speech Commun..
[17] Chin-Hui Lee,et al. Exploring universal attribute characterization of spoken languages for spoken language recognition , 2009, INTERSPEECH.
[18] Dong Yu,et al. Investigation of full-sequence training of deep belief networks for speech recognition , 2010, INTERSPEECH.
[19] Dong Yu,et al. Roles of Pre-Training and Fine-Tuning in Context-Dependent DBN-HMMs for Real-World Speech Recognition , 2010 .
[20] Dong Yu,et al. Improved Bottleneck Features Using Pretrained Deep Neural Networks , 2011, INTERSPEECH.
[21] Chin-Hui Lee,et al. A Bottom-Up Stepwise Knowledge-Integration Approach to Large Vocabulary Continuous Speech Recognition Using Weighted Finite State Machines , 2011, INTERSPEECH.
[22] Dong Yu,et al. Conversational Speech Transcription Using Context-Dependent Deep Neural Networks , 2012, ICML.
[23] Dong Yu,et al. Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition , 2012, IEEE Transactions on Audio, Speech, and Language Processing.
[24] Dau-Cheng Lyu,et al. Experiments on Cross-Language Attribute Detection and Phone Recognition With Minimal Target-Specific Training Data , 2012, IEEE Transactions on Audio, Speech, and Language Processing.
[25] Geoffrey E. Hinton,et al. Acoustic Modeling Using Deep Belief Networks , 2012, IEEE Transactions on Audio, Speech, and Language Processing.