High-quality Speech Translation for Language Learning

In this paper, we describe a translation framework aimed at achieving high-quality speech translation within restricted conversational domains. To this end, we developed an interlingua-based approach in which a generation-based method is augmented with an example-based method to improve system robustness, even when the input is imperfect because of speech recognition errors. The framework is integrated into a dialogue-based language tutoring system to provide immediate translation assistance to students during the dialogue interaction. We evaluated translation quality in a weather information domain configured for native English speakers practicing Mandarin Chinese. On a test set of 695 spoken queries, we achieved perfect or acceptable translations for 94.3% of the manual transcriptions and for 90.2% of the automatic speech recognition outputs.
