Translation of conversational speech with JANUS-II

We investigate the possibility of translating continuous spoken conversations in a cross talk environment. This is a task known to be difficult for human translators due to several factors. It is characterized by rapid and even overlapping turn taking, a high degree of coarticulation, and fragmentary language. We describe experiments using both push to talk as well as cross talk recording conditions. Our results indicate that conversational speech recognition and translation is possible, even in a free crosstalk environment. To date, our system has achieved performances of over 80%, acceptable translations on transcribed input, and over 70% acceptable translations on speech input recognized with a 70-80% word accuracy. The system's performance on spontaneous conversations recorded in a cross talk environment is shown to be as good and even slightly superior to the simpler and easier push to talk scenario.

[1]  Wayne H. Ward Extracting information in spontaneous speech , 1994, ICSLP.

[2]  Alon Lavie,et al.  End-to-End Evaluation in JANUS: A Speech-to-speech Translation System , 1996, ECAI Workshop on Dialogue Processing in Spoken Language Systems.

[3]  Masaru Tomita,et al.  An Efficient Augmented-Context-Free Parsing Algorithm , 1987, Comput. Linguistics.

[4]  Alon Lavie,et al.  GLR* – An Efficient Noise-skipping Parsing Algorithm For Context Free Grammars , 1993, IWPT.

[5]  TomitaMasaru An efficient augmented-context-free parsing algorithm , 1987 .

[6]  Finn Dag Buø,et al.  JANUS 93: towards spontaneous speech translation , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[7]  Alon Lavie,et al.  An Integrated Heuristic Scheme for Partial Parse Evaluation , 1994, ACL.