Domain Specific Speech Acts for Spoken Language Translation

We describe a coding scheme for machine translation of spoken taskoriented dialogue. The coding scheme covers two levels of speaker intention − domain independent speech acts and domain dependent domain actions. Our database contains over 14,000 tagged sentences in English, Italian, and German. We argue that domain actions, and not speech acts, are the relevant discourse unit for improving translation quality. We also show that, although domain actions are domain specific, the approach scales up to large domains without an explosion of domain actions and can be coded with high inter-coder reliability across research sites. Furthermore, although the number of domain actions is on the order of ten times the number of speech acts, sparseness is not a problem for the training of classifiers for identifying the domain action. We describe our work on developing high accuracy speech act and domain action classifiers, which is the core of the source language analysis module of our NESPOLE machine translation system.

[1]  Nadia Mana,et al.  Balancing Expressiveness and Simplicity in an Interlingua for Task Based Dialogue , 2002, Speech-to-Speech Translation@ACL.

[2]  Alon Lavie,et al.  An interlingua based on domain actions for machine translation of task-oriented dialogues , 1998, ICSLP.

[3]  Marsal Gavaldà SOUP: A Parser for Real-world Spontaneous Speech , 2000, IWPT.

[4]  Fabio Pianesi,et al.  Enhancing the usability and performance of NESPOLE!: a real-world speech-to-speech translation system , 2002 .

[5]  Alon Lavie,et al.  Parsing Domain Actions with Phrase-Level Grammars and Memory-Based Learners , 2003, IWPT.

[6]  Alon Lavie,et al.  Robust analysis of spoken input combining statistical and knowledge-based information sources , 2001, IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU '01..

[7]  Walter Daelemans,et al.  TiMBL: Tilburg Memory-Based Learner, version 2.0, Reference guide , 1998 .

[8]  Andreas Stolcke,et al.  Dialogue act modeling for automatic tagging and recognition of conversational speech , 2000, CL.

[9]  Carolyn Penstein Rosé,et al.  Minimizing Cumulative Error in Discourse Context , 1996, ECAI Workshop on Dialogue Processing in Spoken Language Systems.

[10]  Norbert Reithinger,et al.  Predicting dialogue acts for a speech-to-speech translation system , 1996 .

[11]  Carolyn Penstein Rosé,et al.  Using Discourse Predictions for Ambiguity Resolution , 1996, COLING.

[12]  Alon Lavie,et al.  Spoken Language Parsing Using Phrase-Level Grammars and Trainable Classifiers , 2002, Speech-to-Speech Translation@ACL.

[13]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[14]  C. Fillmore,et al.  Regularity and Idiomaticity in Grammatical Constructions: The Case of Let Alone , 1988 .

[15]  Norbert Reithinger,et al.  Predicting dialogue acts for a speech-to-speech translation system , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.