Extraction of Translation Unit from Chinese-English Parallel Corpora
暂无分享,去创建一个
More and more researchers have recognized the potential value of the parallel corpus in the research on Machine Translation and Machine Aided Translation. This paper examines how Chinese English translation units could be extracted from parallel corpus. An iterative algorithm based on degree of word association is proposed to identify the multiword units for Chinese and English. Then the Chinese-English Translation Equivalent Pairs are extracted from the parallel corpus. We also made comparison between different statistical association measurement in this paper.
[1] D. Maynard. Term recognition using combined knowledge sources , 1999 .
[2] Dan Tufis,et al. Computational bilingual lexicography: automatic extraction of translation dictionaries , 2001 .
[3] Dekai Wu,et al. Learning an English-Chinese Lexicon from a Parallel Corpus , 1994, AMTA.
[4] Kenneth Ward Church,et al. Identifying word correspondence in parallel texts , 1991 .