Multilingual Videos for MOOCs and OER

Massive Open Online Courses (MOOCs) and Open Educational Resources (OER) are rapidly growing, but are not usually offered in multiple languages due to the lack of cost-effective solutions to translate the different objects comprising them and particularly videos. However, current state-of-the-art automatic speech recognition (ASR) and machine translation (MT) techniques have reached a level of maturity which opens the possibility of producing multilingual video subtitles of publishable quality at low cost. This work summarizes authors' experience in exploring this possibility in two real-life case studies: a MOOC platform and a large video lecture repository. Apart from describing the systems, tools and integration components employed for such purpose, a comprehensive evaluation of the results achieved is provided in terms of quality and efficiency. More precisely, it is shown that draft multilingual subtitles produced by domainadapted ASR/MT systems reach a level of accuracy that make them worth post-editing, instead of generating them ex novo, saving approximately 25%–75% of the time. Finally, the results reported on user multilingual data consumption reflect that multilingual subtitles have had a very positive impact in our case studies boosting student enrolment, in the case of the MOOC platform, by 70% relative.

[1]  Robin H. Kay,et al.  Exploring the use of video podcasts in education: A comprehensive review of the literature , 2012, Comput. Hum. Behav..

[2]  Andy Way,et al.  Enhancing Access to Online Education: Quality Machine Translation of MOOC Content , 2016, LREC.

[3]  Philipp Koehn,et al.  Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[4]  Dennis Reidsma,et al.  Designing Focused and Efficient Annotation Tools , 2005 .

[5]  Timothy J. Hazen Automatic alignment and error correction of human generated transcripts for long speech recordings , 2006, INTERSPEECH.

[6]  Tatsuya Kawahara,et al.  Automatic transcription system for meetings of the Japanese national congress , 2009, INTERSPEECH.

[7]  Elaine Toms,et al.  The effect of speech recognition accuracy rates on the usefulness and usability of webcast archives , 2006, CHI.

[8]  M. J. Hunt Figures of merit for assessing connected-word recognisers , 1990, Speech Commun..

[9]  Embry-Riddle Aeronautical,et al.  The Flipped Classroom: A Survey of the Research , 2013 .

[10]  Alfons Juan-Císcar,et al.  Efficiency and usability study of innovative computer-aided transcription strategies for video lecture repositories , 2015, Speech Commun..

[11]  Michel Simard,et al.  Introduction to special issue on post-editing , 2014, Machine Translation.

[12]  J. Paul Robinson,et al.  Using speech recognition for real-time captioning and lecture transcription in the classroom , 2013, IEEE Transactions on Learning Technologies.

[13]  Jianfeng Gao,et al.  Domain Adaptation via Pseudo In-Domain Data Selection , 2011, EMNLP.

[14]  Christoph Meinel,et al.  Browsing within Lecture Videos Based on the Chain Index of Speech Transcription , 2008, IEEE Transactions on Learning Technologies.

[15]  Lucia Specia,et al.  Exploiting Objective Annotations for Minimising Translation Post-editing Effort , 2011, EAMT.

[16]  Matthew G. Snover,et al.  A Study of Translation Edit Rate with Targeted Human Annotation , 2006, AMTA.

[17]  Lucia Specia,et al.  Improving the Confidence of Machine Translation Quality Estimates , 2009, MTSUMMIT.

[18]  Markus Ketterl,et al.  Opencast Matterhorn: A community-driven open source software project for producing, managing, and distributing academic video , 2010, Interact. Technol. Smart Educ..

[19]  François Masselot,et al.  A Productivity Test of Statistical Machine Translation Post-Editing in a Typical Localisation Context , 2010, Prague Bull. Math. Linguistics.

[20]  Mark Liberman,et al.  Transcriber: Development and use of a tool for assisting speech corpora production , 2001, Speech Commun..

[21]  Jason D. Williams,et al.  Crowd-sourcing for difficult transcription of speech , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.

[22]  Alfons Juan-Císcar,et al.  Language model adaptation for video lectures transcription , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.