From generic to task-oriented speech recognition : French experience in the NESPOLE! European project

This paper presents CLIPS laboratory activities in speech recognition related to language model adaptation and acoustic model adaptation in the NESPOLE! European project. ASR system needed to be adapted in two ways. The language model had to deal with task specific vocabulary and the acoustic model had to be robust to VoIP (Voice over IP) speech. It was shown that Internet, as a very large source of text, can be a very interesting database for spoken language modelling adaptation. The influence of different VoIP codecs on the performance of our speech recognition engine was investigated and a new strategy was proposed to cope with degradation due to low bitrate coding. The acoustic models of the speech recognition system were trained with transcoded speech. Results have shown that this strategy allows to recover acceptable performance for the NESPOLE! project context.

[1]  Maxine Eskénazi,et al.  BREF, a large vocabulary spoken corpus for French , 1991, EUROSPEECH.

[2]  Carolyn Penstein Rosé,et al.  Recent Advances in JANUS: A Speech Translation System , 1993, TMI.

[3]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[4]  Liang He,et al.  The study on distributed speech recognition system , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[5]  Guy Perennou,et al.  BDLEX lexical data and knowledge base of spoken and written French , 1987, ECST.

[6]  Michèle Jardino,et al.  Language modeling based on automatic word concatenations , 1999, EUROSPEECH.

[7]  Gianni Lazzari Spoken translation: challenges and opportunities , 2000, INTERSPEECH.

[8]  José Rouillard,et al.  Internet Documents: A Rich Source for Spoken Language Modeling , 1999 .

[9]  George Saon,et al.  Data-driven approach to designing compound words for continuous speech recognition , 2001, IEEE Trans. Speech Audio Process..