Automated English-Korean Translation for Enhanced Coalition Communications

This article describes our progress on automated, two-way English-Korean translation of text and speech for enhanced military coalition communications. Our goal is to improve multilingual communications by producing accurate translations across a number of languages. We have therefore chosen an interlingua-based approach to machine translation that readily extends to multiple languages. In this approach, a natural-language-understanding system transforms the input into an intermediate meaning representation called a semantic frame, which serves as the basis for generating output in multiple languages. To produce useful, accurate, and effective translation systems in the short term, we have focused on limited military-task domains and have configured our system as a translator's aid, so that a human translator can confirm or edit the machine translation. We have obtained promising results in translating telegraphic military messages in a naval domain, and have successfully extended the system to additional military domains. The system has been demonstrated in a coalition exercise and at Combined Forces Command in the Republic of Korea. These demonstrations showed that the system must be robust enough to handle new inputs, so we have developed a multistage robust translation strategy that includes a part-of-speech tagging technique for handling new words and a fragmentation strategy for handling complex sentences. Our current work emphasizes continued development of these robust translation techniques and extension of the translation system to application domains of interest to users in the military coalition environment in the Republic of Korea.
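The interlingua idea summarized above can be illustrated with a minimal sketch. It is not the system described in this article: the `SemanticFrame` fields, the toy lexicons, and all function names are assumptions invented for illustration, and the analyzer handles only three-word telegraphic messages of the form agent, verb, theme. It shows a single language-neutral frame driving two generators, one producing English subject-verb-object order and one producing Korean subject-object-verb order with case particles.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SemanticFrame:
    """Language-neutral meaning of a simple clause (hypothetical structure)."""
    predicate: str  # concept name, e.g. "detect"
    agent: str      # concept name, e.g. "ship"
    theme: str      # concept name, e.g. "submarine"


# Toy lexicons, invented for this example only.
ENGLISH_VERBS = {"detected": "detect"}
ENGLISH_NOUNS = {"ship": "ship", "submarine": "submarine"}
ENGLISH_PAST = {"detect": "detected"}
KOREAN_NOUNS = {"ship": "함정", "submarine": "잠수함"}
KOREAN_PAST = {"detect": "탐지했다"}


def understand_english(text: str) -> SemanticFrame:
    """Map a telegraphic 'AGENT VERB THEME' message to a semantic frame."""
    words = [w for w in text.lower().split() if w not in {"the", "a", "an"}]
    if len(words) != 3:
        raise ValueError("this sketch only handles AGENT VERB THEME messages")
    agent, verb, theme = words
    return SemanticFrame(
        predicate=ENGLISH_VERBS[verb],
        agent=ENGLISH_NOUNS[agent],
        theme=ENGLISH_NOUNS[theme],
    )


def has_final_consonant(syllable: str) -> bool:
    """True if a precomposed Hangul syllable ends in a consonant."""
    return (ord(syllable) - 0xAC00) % 28 != 0


def generate_korean(frame: SemanticFrame) -> str:
    """Generate subject-object-verb Korean with case particles."""
    agent = KOREAN_NOUNS[frame.agent]
    theme = KOREAN_NOUNS[frame.theme]
    # The particle form depends on whether the noun ends in a consonant.
    subject_particle = "이" if has_final_consonant(agent[-1]) else "가"
    object_particle = "을" if has_final_consonant(theme[-1]) else "를"
    verb = KOREAN_PAST[frame.predicate]
    return f"{agent}{subject_particle} {theme}{object_particle} {verb}"


def generate_english(frame: SemanticFrame) -> str:
    """Generate subject-verb-object English, restoring dropped articles."""
    return f"The {frame.agent} {ENGLISH_PAST[frame.predicate]} the {frame.theme}."


if __name__ == "__main__":
    frame = understand_english("ship detected submarine")
    print(frame)
    print(generate_english(frame))
    print(generate_korean(frame))
```

Because both generators read only the frame, adding a further output language in this sketch would mean adding one lexicon and one generator, without touching the analyzer. That is the extensibility argument the abstract makes for the interlingua approach.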
