Language modeling for broadcast news transcription

This paper addresses the problem of language modeling for the transcription of broadcast news data. Different approaches to language model training were explored and tested in the context of a complete transcription system. Language model efficiency was investigated with respect to the following aspects: mixing of different training material (sources and epochs); the approach used for mixing (interpolation vs. count merging); and the use of class-based language models. The experimental results indicate that judicious selection of the training source and epoch is important, and that, given sufficient broadcast news transcriptions, newspaper and newswire texts are not necessary. Results are given in terms of perplexity and word error rates. The combined improvements in text selection, interpolation, 4-gram and class-based LMs led to a 20% reduction in the perplexity of the LM of the final pass (3-gram class interpolated with a word 4-gram) compared with the 3-gram LM used in the LIMSI Nov’97 BN system.
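To make the distinction between the two mixing approaches concrete, the following is a minimal sketch rather than the system described in the paper. It assumes unigram models with add-one smoothing (the paper uses 3-gram and 4-gram LMs), and the toy corpora, the weight values and all function names are hypothetical. Interpolation combines the probabilities of separately trained models, whereas count merging pools the (optionally weighted) counts before a single model is estimated; both are then compared by perplexity on held-out text.

```python
import math
from collections import Counter

def unigram_probs(counts, vocab, alpha=1.0):
    """Add-alpha smoothed unigram probabilities over a fixed vocabulary."""
    total = sum(counts.values()) + alpha * len(vocab)
    return {w: (counts.get(w, 0) + alpha) / total for w in vocab}

def interpolate(p_a, p_b, lam):
    """Linear interpolation of two models: lam * p_a + (1 - lam) * p_b."""
    return {w: lam * p_a[w] + (1.0 - lam) * p_b[w] for w in p_a}

def merge_counts(c_a, c_b, weight_a=1.0, weight_b=1.0):
    """Count merging: pool weighted counts before estimating one model."""
    merged = Counter()
    for w, c in c_a.items():
        merged[w] += weight_a * c
    for w, c in c_b.items():
        merged[w] += weight_b * c
    return merged

def perplexity(probs, tokens):
    """Perplexity = exp of the negative mean log-probability per token."""
    log_sum = sum(math.log(probs[w]) for w in tokens)
    return math.exp(-log_sum / len(tokens))

# Hypothetical toy data standing in for the two kinds of training source.
bn_text = "the president said the talks will resume".split()
news_text = "the company said profits will rise the report said".split()
heldout = "the president said profits will rise".split()

vocab = sorted(set(bn_text) | set(news_text) | set(heldout))
bn_counts, news_counts = Counter(bn_text), Counter(news_text)

# Interpolation: train one model per source, then mix the probabilities.
interp = interpolate(unigram_probs(bn_counts, vocab),
                     unigram_probs(news_counts, vocab), lam=0.7)

# Count merging: weight and pool the counts, then train a single model.
merged_probs = unigram_probs(
    merge_counts(bn_counts, news_counts, weight_a=2.0, weight_b=1.0), vocab)

ppl_interp = perplexity(interp, heldout)
ppl_merged = perplexity(merged_probs, heldout)
```

In practice the interpolation weight (here `lam`) would be tuned on development data, for example to minimize `perplexity` on a held-out set, rather than fixed by hand as in this sketch.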
