论文信息 - Building a Spanish-Portuguese parallel corpus for statistical machine translation

Building a Spanish-Portuguese parallel corpus for statistical machine translation

Parallel corpora have long been recognised as valuable resources for building MT applications, but their usefulness have often been limited to the translation between language pairs that include English. In this work we describe our efforts to build a parallel corpus for the Brazilian Portuguese and European Spanish languages. The corpus has been aligned at sentence and word levels and manually inspected for correctness, representing a first step towards the development of translation models for this language pair.

Ivandré Paraboni | Thiago Alexandre Salgueiro Pardo | Wilker Aziz

[1] Ivandré Paraboni,et al. An Experiment in Spanish-Portuguese Statistical Machine Translation , 2008, SBIA.

[2] Kepa Sarasola,et al. An open-source shallow-transfer machine translation engine for the Romance languages of Spain , 2005, EAMT.

[3] Hermann Ney,et al. A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.

[4] Helena de Medeiros Caseli. Indução de léxicos bilíngües e regras para a tradução automática , 2007 .