Despite joining the C-STAR II consortium in late 1996, the CLIPS ++ group succeeded in building the French parts of a multilingual task-oriented spoken dialogue translation system and took part in multilingual, intercontinental demonstrations held on July 22 1999 by CLIPS (France), CMU (United States), ETRI (South Korea), ATR (Japan), IRST (Italy), and UKA (Germany). The challenge was to reach the minimum quality level adequate for handling specific tasks, which is quite higher than what is sufficient for casual chatting and can be achieved by putting together commercially available components. After presenting the modules and the architecture of our C-STAR II demonstrator, we evaluate the results, both externally and internally. While the reactions to the final demonstrations were very positive, and many said that these prototypes should quickly lead to products, we feel that there is still much room for improving the overall quality in significant ways. In the last part, we focus on future avenues of research to further improve the quality of task-oriented speech translation, in particular by defining a more powerful and orthogonal taskoriented semantic pivot, using the linguistic and dialogic context, and generating information usable by speech synthesis to generate better prosody.
[1]
Christian Boitet Geta.
GETA's MT methodology and its current development towards personal networking communication and spee
,
1997
.
[2]
Alon Lavie,et al.
An interlingua based on domain actions for machine translation of task-oriented dialogues
,
1998,
ICSLP.
[3]
Christian Boitet,et al.
Analysis into a formal task-oriented pivot without clear abstract - semantics is best handled as "usual" translation
,
2000,
INTERSPEECH.
[4]
Toshiyuki Takezawa,et al.
End-to-end evaluation in ATR-MATRIX: speech translation system between English and Japanese
,
1999,
EUROSPEECH.
[5]
Eric Keller,et al.
Motivations for the prosodic predictive chain
,
1998,
SSW.
[6]
José Rouillard,et al.
A network architecture for building applications that use speech recognition and/or synthesis
,
1999,
EUROSPEECH.
[7]
Eric Keller.
Simplification of TTS architecture vs. operational quality
,
1997,
EUROSPEECH.