Evaluation of a live broadcast news subtitling system for portuguese

Broadcast news play an important role in our lives providing access to news, information and entertainment. The existence of subtitles is an important medium for inclusion of people with special needs and also an advantage on noisy and populated environments. In this work we will describe and evaluate a system for subtitling live broadcast news for RTP (Radio Televisao de Portugal) the Portuguese public broadcast company. Developing a fully automatic subtitling system is a huge breakthrough which results from the convergence of different research models and software developments to create a working system. Our online system has 12% word error rate for the displayed subtitles working under real time with an average latency of just 6.5 seconds.

[1]  João Paulo da Silva Neto,et al.  A stream-based audio segmentation, classification and clustering pre-processing system for broadcast news using ANN models , 2005, INTERSPEECH.

[2]  Isabel Trancoso,et al.  A specialized on-the-fly algorithm for lexicon and language model composition , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[3]  Fernando Batista,et al.  Recovering punctuation marks for automatic speech recognition , 2007, INTERSPEECH.

[4]  Ciro Martins,et al.  Broadcast news subtitling system in Portuguese , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[5]  Mark J. F. Gales,et al.  Progress in the CU-HTK broadcast news transcription system , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[6]  Ciro Martins,et al.  Dynamic language modeling for a daily broadcast news transcription system , 2007, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU).

[7]  Patrick Cardinal,et al.  Computer-assisted closed-captioning of live TV broadcasts in French , 2006, INTERSPEECH.

[8]  João Paulo da Silva Neto,et al.  AUDIMUS.MEDIA: A Broadcast News Speech Recognition System for the European Portuguese Language , 2003, PROPOR.