Spoken Language Translation for Polish

Spoken language translation (SLT) is becoming more important in the increasingly globalized world, both from a social and economic point of view. It is one of the major challenges for automatic speech recognition (ASR) and machine translation (MT), driving intense research activities in these areas. While past research in SLT, due to technology limitations, dealt mostly with speech recorded under controlled conditions, today's major challenge is the translation of spoken language as it can be found in real life. Considered application scenarios range from portable translators for tourists, lectures and presentations translation, to broadcast news and shows with live captioning. We would like to present PJIIT's experiences in the SLT gained from the Eu-Bridge 7th framework project and the U-Star consortium activities for the Polish/English language pair. Presented research concentrates on ASR adaptation for Polish (state-of-the-art acoustic models: DBN-BLSTM training, Kaldi: LDA+MLLT+SAT+MMI), language modeling for ASR & MT (text normalization, RNN-based LMs, n-gram model domain interpolation) and statistical translation techniques (hierarchical models, factored translation models, automatic casing and punctuation, comparable and bilingual corpora preparation). While results for the well-defined domains (phrases for travelers, parliament speeches, medical documentation, movie subtitling) are very encouraging, less defined domains (presentation, lectures) still form a challenge. Our progress in the IWSLT TED task (MT only) will be presented, as well as current progress in the Polish ASR.

[1]  Sophia Ananiadou,et al.  Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions , 2007, ACL 2007.

[2]  Philipp Koehn,et al.  Findings of the 2013 Workshop on Statistical Machine Translation , 2013, WMT@ACL.

[3]  Andreas Stolcke,et al.  SRILM - an extensible language modeling toolkit , 2002, INTERSPEECH.

[4]  Daniel Povey,et al.  The Kaldi Speech Recognition Toolkit , 2011 .

[5]  Daniel M. Bikel,et al.  Intricacies of Collins’ Parsing Model , 2004, CL.

[6]  James Baker,et al.  A historical perspective of speech recognition , 2014, CACM.

[7]  Philipp Koehn,et al.  Scalable Modified Kneser-Ney Language Model Estimation , 2013, ACL.

[8]  Philipp Koehn,et al.  Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[9]  Ryszard Gubrynowicz,et al.  User-Centered Design for a Voice Portal , 2009, Aspects of Natural Language Processing.

[10]  Krzysztof Marasek TED Polish-to-English translation system for the IWSLT 2012 , 2012, IWSLT.

[11]  Krzysztof Marasek,et al.  Polish-English speech statistical machine translation systems for the IWSLT 2014 , 2015, IWSLT.

[12]  Krzysztof,et al.  Alignment of the Polish-English Parallel Text for a Statistical Machine "Translation , 2013 .

[13]  Philipp Koehn,et al.  Factored Translation Models , 2007, EMNLP.

[14]  Noah A. Smith,et al.  A Simple, Fast, and Effective Reparameterization of IBM Model 2 , 2013, NAACL.

[15]  Nadir Durrani,et al.  Munich-Edinburgh-Stuttgart Submissions of OSM Systems at WMT13 , 2013, WMT@ACL.

[16]  Krzysztof Marasek,et al.  Real-Time Statistical Speech Translation , 2015, WorldCIST.

[17]  Krzysztof Marasek,et al.  SPEECON – Speech Databases for Consumer Devices: Database Specification and Validation , 2002, LREC.

[18]  Hans Uszkoreit,et al.  The Polish Language in the Digital Age , 2012 .

[19]  Grazyna Demenko,et al.  JURISDIC: Polish Speech Database for Taking Dictation of Legal Texts , 2008, LREC.