Paper information - Automatic processing of broadcast audio in multiple languages

Automatic processing of broadcast audio in multiple languages

This paper addresses recent progress in LVCSR in multiple languages which has enabled the processing of broadcast audio for information access. At LIMSI, broadcast news transcription systems have been developed for seven languages. Automatic processing to access the content must take into account the specificities of audio data, such as needing to deal with the continuous data stream and an imperfect word transcription, and specificities of the language. Some near-term applications are audio data mining, structurization of audiovisual archives, selective dissemination of information and media monitoring.

Jean-Luc Gauvain | Lori Lamel

[1] Lori Lamel, et al. Investigating text normalization and pronunciation variants for German broadcast transcription, 2000, INTERSPEECH.

[2] George Zavaliagkos, et al. Utilizing untranscribed training data to improve performance, 1998, LREC.

[3] Jean-Luc Gauvain, et al. Lightly supervised and unsupervised acoustic model training, 2002, Comput. Speech Lang.

[4] Alexander H. Waibel, et al. Unsupervised training of a speech recognizer: recent experiments, 1999, EUROSPEECH.

[5] Chabane Djeraba. Content-based multimedia indexing and retrieval, 2002, IEEE MultiMedia.

[6] Jean-Luc Gauvain, et al. Partitioning and transcription of broadcast news data, 1998, ICSLP.

[7] Jean-Luc Gauvain, et al. The LIMSI Topic Tracking System for TDT2002, 2002.

[8] Jean-Luc Gauvain, et al. Broadcast news transcription in Mandarin, 2000, INTERSPEECH.

[9] Ellen M. Voorhees, et al. The TREC Spoken Document Retrieval Track: A Success Story, 2000, TREC.

[10] Tanja Schultz, et al. Language-independent and language-adaptive acoustic modeling for speech recognition, 2001, Speech Commun.

[11] Jean-Luc Gauvain, et al. The LIMSI SDR System for TREC-8, 1999, TREC.

[12] Jean-Luc Gauvain, et al. Fast decoding for indexation of broadcast data, 2000, INTERSPEECH.