论文信息 - Transcribing lectures and seminars

Transcribing lectures and seminars

This paper describes recent research carried out in the context of the FP6 Integrated Project CHIL in developing a system to automatically transcribe lectures and seminars. We made use of widely available corpora to train both the acoustic and language models, since only a small amount of CHIL data were available for system development. For acoustic model training made use of the transcribed portion of the TED corpus of Eurospeech recordings, as well as the ICSI, ISL, and NIST meeting corpora. For language model training, text materials were extracted from a variety of on-line conference proceedings. Word error rates of about 25% are obtained on test data extracted 12 seminars.

[1] Lori Lamel,et al. The translanguage English database (TED) , 1994, ICSLP.

[2] Philip C. Woodland,et al. Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models , 1995, Comput. Speech Lang..

[3] Andreas Stolcke,et al. Finding consensus among words: lattice-based word error minimization , 1999, EUROSPEECH.

[4] Jean-Luc Gauvain,et al. The LIMSI Broadcast News transcription system , 2002, Speech Commun..

[5] Susanne Burger,et al. The ISL meeting corpus: the impact of meeting type on speech style , 2002, INTERSPEECH.

[6] Andreas Stolcke,et al. The ICSI Meeting Corpus , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[7] Martial Michel,et al. The NIST Meeting Room Pilot Corpus , 2004, LREC.

[8] John Makhoul,et al. THE 2004 BBN/LIMSI 10xRT ENGLISH BROADCAST NEWS TRANSCRIPTION SYSTEM , 2004 .

[9] Alexander H. Waibel. CHIL - Computers in the Human Interaction Loop , 2005, MVA.

[10] Climent Nadeu,et al. FIRST EXPERIMENTS OF AUTOMATIC SPEECH ACTIVITY DETECTION, SOURCE LOCALIZATION AND SPEECH RECOGNITION IN THE CHIL PROJECT , 2005 .