论文信息 - Aligning a Parallel English-Chinese Corpus Statistically With Lexical Criteria

Aligning a Parallel English-Chinese Corpus Statistically With Lexical Criteria

We describe our experience with automatic alignment of sentences in parallel English-Chinese texts. Our report concerns three related topics: (1) progress on the HKUST English-Chinese Parallel Bilingual Corpus; (2) experiments addressing the applicability of Gale & Church's (1991) length-based statistical method to the task of alignment involving a non-Indo-European language; and (3) an improved statistical method that also incorporates domain-specific lexical cues.

Dekai Wu | Dekai Wu

[1] C. M. Sperberg-McQueen,et al. Guidelines for electronic text encoding and interchange , 1994 .

[2] Stanley F. Chen,et al. Aligning Sentences in Bilingual Corpora Using Lexical Information , 1993, ACL.

[3] Martin Kay,et al. Text-Translation Alignment , 1993, Comput. Linguistics.

[4] Pascale Fung,et al. Statistical Augmentation of a Chinese Machine-Readable Dictionary , 1994, ArXiv.

[5] Robert L. Mercer,et al. Aligning Sentences in Parallel Corpora , 1991, ACL.

[6] Kenneth Ward Church. Char_align: A Program for Aligning Parallel Texts at the Character Level , 1993, ACL.

[7] Kenneth Ward Church,et al. K-vec: A New Approach for Aligning Parallel Texts , 1994, COLING.

[8] Kenneth Ward Church,et al. A Program for Aligning Sentences in Bilingual Corpora , 1993, CL.

[9] Kenneth Ward Church,et al. Robust Bilingual Word Alignment for Machine Aided Translation , 1993, VLC@ACL.