Sequence Intersection Based Phrase Translation Extraction from Bilingual Corpus
Phrase translation extraction is one of the key techniques in Example-Based Machine Translation (EBMT), and its accuracy directly affects the performance of an EBMT system. This paper proposes a phrase translation extraction method based on sequence intersection, in which each sentence is treated as a word sequence. Given a sentence-aligned Chinese-Japanese bilingual corpus, the source sentences containing the phrase are first retrieved. The pairwise intersections of the corresponding target sentences are then taken as the phrase translation. This approach obtains high-quality phrase translations by mining the bilingual corpus directly, without preprocessing steps such as word alignment or parsing and without relying on a dictionary. Experiments show that the method achieves over 80% accuracy for the acquired phrase translations.
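The procedure described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not define "sequence intersection" precisely, so the sketch assumes it means the maximal contiguous token runs shared by two target sentences, and it assumes candidates are ranked by how many sentence pairs share them (ties broken by length). The function names `common_runs`, `contains`, and `extract_translation`, and the pre-tokenized toy corpus, are all hypothetical.

```python
from collections import Counter
from itertools import combinations


def common_runs(a, b):
    """Return all maximal contiguous token runs shared by sequences a and b."""
    runs = set()
    # prev[j] holds the length of the common run ending at a[i-2], b[j-1]
    prev = [0] * (len(b) + 1)
    for i in range(1, len(a) + 1):
        cur = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            if a[i - 1] == b[j - 1]:
                cur[j] = prev[j - 1] + 1
                # Keep the run only if it cannot be extended to the right.
                at_end = i == len(a) or j == len(b)
                if at_end or a[i] != b[j]:
                    runs.add(tuple(a[i - cur[j]:i]))
        prev = cur
    return runs


def contains(sentence, phrase):
    """True if the token list `phrase` occurs contiguously in `sentence`."""
    n = len(phrase)
    return any(sentence[k:k + n] == phrase
               for k in range(len(sentence) - n + 1))


def extract_translation(corpus, phrase):
    """Pick a translation candidate for `phrase` from (source, target) pairs.

    Assumption: the candidate shared by the most target-sentence pairs wins.
    """
    # Step 1: target sides of all pairs whose source contains the phrase.
    targets = [tgt for src, tgt in corpus if contains(src, phrase)]
    # Step 2: intersect every pair of target sentences and count the results.
    votes = Counter()
    for t1, t2 in combinations(targets, 2):
        for run in common_runs(t1, t2):
            votes[run] += 1
    if not votes:
        return None
    best = max(votes, key=lambda r: (votes[r], len(r)))
    return list(best)


# Toy sentence-aligned corpus, already tokenized (illustrative data only).
toy_corpus = [
    (["我", "去", "图书馆"], ["私", "は", "図書館", "に", "行く"]),
    (["图书馆", "很", "大"], ["図書館", "が", "大きい"]),
    (["他", "在", "图书馆", "看书"], ["彼", "は", "図書館", "で", "本", "を", "読む"]),
    (["我", "吃", "饭"], ["私", "は", "ご飯", "を", "食べる"]),
]

print(extract_translation(toy_corpus, ["图书馆"]))  # ['図書館']
```

In this toy example, two of the three target-sentence pairs share only `図書館`, while one pair shares the longer run `は 図書館`, so the vote selects `図書館`. Function words that recur across sentences can pollute the intersections in the same way, which is why some ranking or filtering step is needed on real data; the simple vote used here is only one possible choice.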