Advances in transcription of broadcast news and conversational telephone speech within the combined EARS BBN/LIMSI system
Richard M. Schwartz | John Makhoul | Spyridon Matsoukas | Rohit Prasad | Jean-Luc Gauvain | Chia-Lin Kao | Jeff Z. Ma | Thomas Colthurst | Owen Kimball | Lori Lamel | Holger Schwenk | Gilles Adda | Fabrice Lefèvre | Bing Xiang | Long Nguyen
[1] Jonathan G. Fiscus,et al. A post-processing system to yield reduced word error rates: Recognizer Output Voting Error Reduction (ROVER) , 1997, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings.
[2] Lori Lamel,et al. On designing pronunciation lexicons for large vocabulary continuous speech recognition , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[3] Jean-Luc Gauvain,et al. Conversational telephone speech recognition , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).
[4] Richard M. Schwartz,et al. Efficient 2-pass n-best decoder , 1997, EUROSPEECH.
[5] Spyridon Matsoukas,et al. Minimum phoneme error based heteroscedastic linear discriminant analysis for speech recognition , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.
[6] Jean-Luc Gauvain,et al. The LIMSI Broadcast News transcription system , 2002, Speech Commun.
[7] Jean-Luc Gauvain,et al. Partitioning and transcription of broadcast news data , 1998, ICSLP.
[8] Yves Normandin. Optimal splitting of HMM Gaussian mixture components with MMIE training , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.
[9] Daniel Povey,et al. Large scale discriminative training of hidden Markov models for speech recognition , 2002, Comput. Speech Lang.
[10] Jean-Luc Gauvain,et al. Lightly supervised and unsupervised acoustic model training , 2002, Comput. Speech Lang.
[11] Reinhold Häb-Umbach,et al. A study on speaker normalization using vocal tract normalization and speaker adaptive training , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).
[12] Andreas Stolcke,et al. Getting More Mileage from Web Text Sources for Conversational Speech Language Modeling using Class-Dependent Mixtures , 2003, NAACL.
[13] David Miller,et al. From switchboard to fisher: telephone collection protocols, their uses and yields , 2003, INTERSPEECH.
[14] Jean-Luc Gauvain,et al. Lightly supervised acoustic model training using consensus networks , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[15] Jean-Luc Gauvain,et al. Connectionist language modeling for large vocabulary continuous speech recognition , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[16] Yoshua Bengio,et al. A Neural Probabilistic Language Model , 2003, J. Mach. Learn. Res.
[17] Nguyen Thanh Long,et al. The 1999 BBN BYBLOS 10xRT Broadcast News Transcription System , 1997 .
[18] Daniel Povey,et al. Minimum Phone Error and I-smoothing for improved discriminative training , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[19] Bing Xiang,et al. Light supervision in acoustic model training , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[20] Amro El-Jaroudi,et al. Parameter optimization for vocal tract length normalization , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).
[21] John Makhoul,et al. Using quick transcriptions to improve conversational speech models , 2004, INTERSPEECH.
[22] Daniel Povey,et al. Large scale discriminative training for speech recognition , 2000 .
[23] S. Matsoukas,et al. Improved speaker adaptation using speaker dependent feature projections , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).
[24] Herbert Gish,et al. A segmental speech model with applications to word spotting , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[25] Herbert Gish,et al. Speech recognition in multiple languages and domains: the 2003 BBN/LIMSI EARS system , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[26] Thomas Niesler,et al. The 1998 HTK system for transcription of conversational telephone speech , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).
[27] Richard M. Schwartz,et al. A compact model for speaker-adaptive training , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[28] Richard M. Schwartz,et al. Single-tree method for grammar-directed search , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).
[29] Richard M. Schwartz,et al. Towards a robust real-time decoder , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).
[30] Jean-Luc Gauvain,et al. Neural network language models for conversational speech recognition , 2004, INTERSPEECH.
[31] Ramesh A. Gopinath,et al. Maximum likelihood modeling with Gaussian distributions for classification , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).
[32] Mark J. F. Gales,et al. Maximum likelihood linear transformations for HMM-based speech recognition , 1998, Comput. Speech Lang.
[33] Daben Liu,et al. A cross-channel modeling approach for automatic segmentation of conversational telephone speech , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).
[34] Jean-Luc Gauvain,et al. Continuous Speech Recognition at LIMSI , 1992 .
[35] Rohit Prasad,et al. The 2004 BBN/LIMSI 20xRT English Conversational Telephone Speech System , 2004 .
[36] Andreas Stolcke,et al. Finding consensus among words: lattice-based word error minimization , 1999, EUROSPEECH.