Analysis of the errors produced by the 2004 BBN speech recognition system in the DARPA EARS evaluations
暂无分享,去创建一个
[1] Sherif Abdou,et al. The BBN RT04 English broadcast news transcription system , 2005, INTERSPEECH.
[2] Steven Greenberg,et al. LINGUISTIC DISSECTION OF SWITCHBOARD-CORPUS AUTOMATIC SPEECH RECOGNITION SYSTEMS , 2000 .
[3] Philip C. Woodland,et al. Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models , 1995, Comput. Speech Lang..
[4] Elizabeth Shriberg,et al. Spontaneous speech: how people really talk and why engineers should care , 2005, INTERSPEECH.
[5] Mark J. F. Gales,et al. Maximum likelihood linear transformations for HMM-based speech recognition , 1998, Comput. Speech Lang..
[6] Steven Greenberg,et al. AN INTRODUCTION TO THE DIAGNOSTIC EVALUATION OF SWITCHBOARD-CORPUS AUTOMATIC SPEECH RECOGNITION SYSTEMS , 2000 .
[7] Mari Ostendorf,et al. Improving Information Extraction by Modeling Errors in Speech Recognizer Output , 2001, HLT.
[8] Andreas Stolcke,et al. Word predictability after hesitations: a corpus-based study , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[9] Andreas Stolcke,et al. Human language technology: opportunities and challenges , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..
[10] Richard M. Schwartz,et al. Efficient 2-pass n-best decoder , 1997, EUROSPEECH.
[11] H Hermansky,et al. Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.
[12] Katsutoshi Ohtsuki,et al. Unsupervised vocabulary expansion for automatic transcription of broadcast news , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..
[13] S. Matsoukas,et al. Improved speaker adaptation using speaker dependent feature projections , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).
[14] Sadaoki Furui,et al. Why Is the Recognition of Spontaneous Speech so Hard? , 2005, TSD.
[15] Richard M. Schwartz,et al. The 2004 BBN/LIMSI 20xRT English conversational telephone speech recognition system , 2005, INTERSPEECH.
[16] Richard M. Schwartz,et al. Single-tree method for grammar-directed search , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).
[17] Andreas Stolcke,et al. Statistical language modeling for speech disfluencies , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.
[18] Eric Fosler-Lussier,et al. Fast speakers in large vocabulary continuous speech recognition: analysis & antidotes , 1995, EUROSPEECH.