Extraction of Translation Unit from Chinese-English Parallel Corpora

A growing number of researchers have recognized the potential value of parallel corpora for research on Machine Translation and Machine-Aided Translation. This paper examines how Chinese-English translation units can be extracted from a parallel corpus. An iterative algorithm based on the degree of word association is proposed to identify multiword units in Chinese and English. Chinese-English Translation Equivalent Pairs are then extracted from the parallel corpus. We also compare different statistical association measures.
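As a rough illustration of what a statistical association measure over a parallel corpus can look like, the sketch below scores source/target word pairs by how often they co-occur in aligned sentence pairs. It is not the paper's algorithm: the abstract does not name the measures it compares, so pointwise mutual information and the Dice coefficient are assumed here as two common choices, and the function name `association_scores` and the toy corpus are hypothetical.

```python
import math
from collections import Counter


def association_scores(aligned_pairs):
    """Score (source word, target word) pairs by sentence-level co-occurrence.

    aligned_pairs: list of (source_tokens, target_tokens) tuples, one per
    aligned sentence pair. Both measures are illustrative assumptions.
    """
    n = len(aligned_pairs)
    src_count, tgt_count, pair_count = Counter(), Counter(), Counter()
    for src_tokens, tgt_tokens in aligned_pairs:
        # Count each word at most once per sentence pair.
        src_set, tgt_set = set(src_tokens), set(tgt_tokens)
        src_count.update(src_set)
        tgt_count.update(tgt_set)
        pair_count.update((s, t) for s in src_set for t in tgt_set)

    scores = {}
    for (s, t), c in pair_count.items():
        # Pointwise mutual information: log2 of P(s, t) / (P(s) * P(t)).
        pmi = math.log2(c * n / (src_count[s] * tgt_count[t]))
        # Dice coefficient: 2 * joint count / sum of the marginal counts.
        dice = 2 * c / (src_count[s] + tgt_count[t])
        scores[(s, t)] = {"pmi": pmi, "dice": dice}
    return scores


# Hypothetical toy corpus of pre-segmented, sentence-aligned text.
corpus = [
    (["机器", "翻译"], ["machine", "translation"]),
    (["机器", "学习"], ["machine", "learning"]),
    (["翻译", "单位"], ["translation", "unit"]),
]
scores = association_scores(corpus)
```

On this toy corpus, the pair ("机器", "machine") scores higher than ("机器", "translation") under both measures, which is the kind of signal a translation-equivalent extractor could rank candidates by. Real corpora would need frequency thresholds, since both measures are unreliable for rare words.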

Baobao Chang | Pernilla Danielsson | Wolfgang Teubert
