Building a Spanish-Portuguese parallel corpus for statistical machine translation
Parallel corpora have long been recognised as valuable resources for building MT applications, but their usefulness has often been limited to translation between language pairs that include English. In this work we describe our efforts to build a parallel corpus for Brazilian Portuguese and European Spanish. The corpus has been aligned at the sentence and word levels and manually inspected for correctness, representing a first step towards the development of translation models for this language pair.