A Prototype System for Selective Dissemination of Broadcast News in European Portuguese

This paper describes ongoing work on selective dissemination of broadcast news. Our pipeline system includes several modules: audio preprocessing, speech recognition, and topic segmentation and indexation. The main goal of this work is to study the impact of earlier errors in the last modules. The impact of audio preprocessing errors is quite small on the speech recognition module, but quite significant in terms of topic segmentation. On the other hand, the impact of speech recognition errors on the topic segmentation and indexation modules is almost negligible. The diagnostic of the errors in these modules is a very important step for the improvement of the prototype of a media watch system described in this paper.

[1]  João Paulo da Silva Neto,et al.  Evaluation of an alert system for selective dissemination of broadcast news , 2003, INTERSPEECH.

[2]  Jean-Luc Gauvain,et al.  THE LIMSI TOPIC TRACKING SYSTEM FOR TDT2002 , 2002 .

[3]  Richard M. Schwartz,et al.  The 2004 BBN 1xRT recognition systems for English broadcast news and conversational telephone speech , 2005, INTERSPEECH.

[4]  João Paulo da Silva Neto,et al.  AUDIMUS.MEDIA: A Broadcast News Speech Recognition System for the European Portuguese Language , 2003, PROPOR.

[5]  Julia Hirschberg,et al.  The Rules Behind Roles: Identifying Speaker Role in Radio Broadcasts , 2000, AAAI/IAAI.

[6]  Isabel Trancoso,et al.  Improving the topic indexation and segmentation modules of a media watch system , 2004, INTERSPEECH.

[7]  Fall 2004 Rich Transcription ( RT-04 F ) Evaluation Plan , .

[8]  Alex Acero,et al.  Speech utterance classification , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[9]  Jean-Luc Gauvain,et al.  Developments in continuous speech dictation using the ARPA WSJ task , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[10]  Sherif Abdou,et al.  The BBN RT04 English broadcast news transcription system , 2005, INTERSPEECH.

[11]  Ciro Martins,et al.  Dynamic Vocabulary Adaptation for a daily and real-time Broadcast News Transcription System , 2006, 2006 IEEE Spoken Language Technology Workshop.

[12]  Fernando Pereira,et al.  Weighted finite-state transducers in speech recognition , 2002, Comput. Speech Lang..

[13]  Ellen M. Voorhees,et al.  The TREC Spoken Document Retrieval Track: A Success Story , 2000, TREC.

[14]  M. A. Siegler,et al.  Automatic Segmentation, Classification and Clustering of Broadcast News Audio , 1997 .

[15]  Adam L. Berger,et al.  A Maximum Entropy Approach to Natural Language Processing , 1996, CL.

[16]  Guillaume Gravier,et al.  The ESTER phase II evaluation campaign for the rich transcription of French broadcast news , 2005, INTERSPEECH.

[17]  João Paulo da Silva Neto,et al.  A stream-based audio segmentation, classification and clustering pre-processing system for broadcast news using ANN models , 2005, INTERSPEECH.

[18]  João Paulo da Silva Neto,et al.  The COST278 broadcast news segmentation and speaker clustering evaluation - overview, methodology, systems, results , 2005, INTERSPEECH.

[19]  Jean-Luc Gauvain,et al.  Tracking topics in broadcast news data , 2003 .

[20]  Jean-Luc Gauvain,et al.  Combining speaker identification and BIC for speaker diarization , 2005, INTERSPEECH.

[21]  Adolfo Guzmán-Arenas,et al.  Document Indexing with a Concept Hierarchy , 2005, Computación y Sistemas.

[22]  Douglas A. Reynolds,et al.  An overview of automatic speaker diarization systems , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[23]  Isabel Trancoso,et al.  A SYSTEM FOR SELECTIVE DISSEMINATION OF MULTIMEDIA INFORMATION RESULTING FROM THE ALERT PROJECT , 2003 .

[24]  S. Chen,et al.  Speaker, Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion , 1998 .

[25]  Gethin Williams,et al.  Knowing What You Don't Know: Roles for Confidence Measures in Automatic Speech Recognition , 1999 .

[26]  Elizabeth Shriberg,et al.  Spontaneous speech: how people really talk and why engineers should care , 2005, INTERSPEECH.

[27]  Isabel Trancoso,et al.  A specialized on-the-fly algorithm for lexicon and language model composition , 2006, IEEE Transactions on Audio, Speech, and Language Processing.