Resolving Structural Transfer Ambiguity in Chinese-to-Korean Machine Translation

We propose a new prototype Chinese-to-Korean machine translation system called TOTAL-CK, a prototype transfer based MT system designed for the large-scale practical domain. TOTAL-CK consists of the components of analysis, transfer, and generation. In this paper, we mainly discuss the transfer issues resulting from stylistic structural differences between Chinese and Korean. The dependency grammar formalism is employed for Chinese parsed trees and their equivalent Korean parsed trees. We deal with structural transfer ambiguities that arise when a given source language syntactic pattern is potentially translated by a number of different target language syntactic patterns. Also, structural transfer ambiguities are the major complex transfer problem appearing during dependency transferring, in an SVO pattern that most frequently occurs in Chinese sentences. There are three types of structural transfer ambiguity: transitive transfer, subject-governing transfer and verb-governing transfer. They are discovered by analyzing the ambiguity and the related formalism. Finally, we present a thesaurus-based disambiguation approach for dependency structural transfer, which is adopted in TOTAL-CK. Our approach achieves 94.6% in precision for SVO sentences and 93.5% for overall sentences.

[1]  Gil Chang Kim,et al.  Voice Generation from Conceptual Representation: Syntactic, Semantic, and Pragmitic Aspects , 1988 .

[2]  Chao-Huang Chang,et al.  Resolution of ambiguity in Chinese and its application to machine translation , 1991, Machine Translation.

[3]  Chrysanne Di Marco,et al.  A model of comparative stylistics for machine translation , 1994, Machine Translation.

[4]  Haodong Wu,et al.  Structural Disambiguation Based on Reliable Estimation of Strength of Association , 1998, COLING-ACL.

[5]  Key-Sun Choi,et al.  MATES/CK: A Chinese-to-Korean Machine Translator , 1999 .

[6]  Charles N. Li,et al.  Mandarin Chinese: A Functional Reference Grammar , 1989 .

[7]  Bonnie J. Dorr,et al.  Machine Translation Divergences: A Formal Description and Proposed Solution , 1994, CL.

[8]  Harold L. Somers,et al.  An introduction to machine translation , 1992 .

[9]  Igor Mel’čuk,et al.  Dependency Syntax: Theory and Practice , 1987 .

[10]  Yoshihiko Nitta,et al.  Problems of machine translation systems: Effect of cultural differences on sentence structure , 1986, Future Gener. Comput. Syst..

[11]  Ming Zhou A Block-Based Robust Dependency Parser for Unrestricted Chinese Text , 1999, ACL 2000.

[12]  K Tsou,et al.  The Role of High Frequent Maximal Crossing Ambiguities in Chinese Word Segmentation , 1999 .

[13]  Taijiro Tsutsumi Wide-Range Restructuring of Intermediate Representations in Machine Translation , 1990, Comput. Linguistics.

[14]  MPhil PhD Arturo Trujillo BSc Translation Engines: Techniques for Machine Translation , 1999, Applied Computing.