Combined Automatic Speech Recognition and Machine Translation in Business Correspondence Domain for English-Croatian

The paper presents combined automatic speech recognition (ASR) of English and machine translation (MT) for English and Croatian and Croatian-English language pairs in the domain of business correspondence. The first part presents results of training the ASR commercial system on English data sets, enriched by error analysis. The second part presents results of machine translation performed by free online tool for English and Croatian and Croatian- English language pairs. Human evaluation in terms of usability is conducted and internal consistency calculated by Cronbach's alpha coefficient, enriched by error analysis. Automatic evaluation is performed by WER (Word Error Rate) and PER (Position-independent word Error Rate) metrics, followed by investigation of Pearson’s correlation with human evaluation.

[1]  Mai Miyabe,et al.  Development of Circulating Support Environment of Multilingual Medical Communication using Parallel Texts for Foreign Patients , 2010 .

[2]  Farzad Ehsani,et al.  Speech Technology in Computer-Assisted Language Learning: Strengths and Limitations of a New CALL Paradigm. , 1998 .

[3]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[4]  W. Marsden I and J , 2012 .

[5]  Hassan Sawaf Automatic Speech Recognition & Hybrid MT for HQ Closed-Captioning & Subtitling for Video Broadcast , 2012, AMTA.

[6]  James H. Martin,et al.  Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition , 2000 .

[7]  Sanja Seljan,et al.  Domain-Specific Evaluation of Croatian Speech Synthesis in CALL , 2013 .

[8]  Ivan Dunđer,et al.  Automatic word-level evaluation and error analysis of formant speech synthesis for Croatian , 2013 .

[9]  Ho-Youl Jung,et al.  A Speech and Character Combined Recognition Engine for Mobile Devices , 2006, EUC.

[10]  Mark J. F. Gales,et al.  Speech Recognition System Combination for Machine Translation , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[11]  Mike Wald Creating accessible educational multimedia through editing automatic speech recognition captioning in real time , 2006, Interact. Technol. Smart Educ..

[12]  10 emerging technologies that will change your world , 2004, IEEE Engineering Management Review.

[13]  Rita Singh,et al.  TONGUES: rapid development of a speech-to-speech translation system , 2002 .

[14]  Kevin Barraclough,et al.  I and i , 2001, BMJ : British Medical Journal.

[15]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[16]  Francoise Beaufays,et al.  Google Search by Voice: A Case Study , 2010 .

[17]  James H. Martin,et al.  Speech and language processing: an introduction to natural language processing , 2000 .

[18]  Alexander I. Rudnicky,et al.  Rapid development of speech-to-speech translation systems , 2002, INTERSPEECH.

[19]  Richard C. Rose,et al.  Integration of ASR and machine translation models in a document translation task , 2007, INTERSPEECH.

[20]  Francisco Casacuberta,et al.  Computer-assisted translation using speech recognition , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[21]  Hermann Ney,et al.  Word Error Rates: Decomposition over POS classes and Applications for Error Analysis , 2007, WMT@ACL.

[22]  Zöe Handley Is text-to-speech synthesis ready for use in computer-assisted language learning? , 2009, Speech Commun..

[23]  Sid-Ahmed Selouani,et al.  Online Collaborative Learning System Using Speech Technology , 2008 .

[24]  P. Alberto,et al.  Literacy , 1983, Language in Society.

[25]  Marie-Josée Hamel,et al.  Establishing a Methodology for Benchmarking Speech Synthesis for Computer-Assisted Language Learning (CALL). , 2005 .

[26]  F. Casacuberta,et al.  Recent efforts in spoken language translation , 2008, IEEE Signal Processing Magazine.

[27]  Simon Judge,et al.  The limitations of speech control: perceptions of provision of speech-driven environmental controls , 2011 .

[28]  Maja Popovic Hjerson: An Open Source Tool for Automatic Error Classification of Machine Translation Output , 2011, Prague Bull. Math. Linguistics.

[29]  Niels Ole Bernsen,et al.  Overview of Evaluation and Usability , 2005 .

[30]  Damir Boras,et al.  Aspects of a Theory and the Present State of Speech Synthesis , 2006 .

[31]  Bent B. Andresen Literacy, assistive technology and e-inclusion , 2007 .

[32]  Juan-Manuel Torres-Moreno Beyond Stemming and Lemmatization: Ultra-stemming to Improve Automatic Text Summarization , 2012, ArXiv.