论文信息 - THE ROLE OF AUTOMATED SPEECH AND AUDIO ANALYSIS IN SEMANTIC MULTIMEDIA ANNOTATION

THE ROLE OF AUTOMATED SPEECH AND AUDIO ANALYSIS IN SEMANTIC MULTIMEDIA ANNOTATION

This paper overviews the various ways in which automatic speech and audio analysis can be deployed to enhance the semantic annotation of multimedia content, and as a consequence to improve the effectiveness of conceptual access tools. A number of techniques will be presented, including the alignment of text resources, large vocabulary speech recognition, key word spotting and speaker classification. The applicability of techniques will be discussed from a media crossing perspective. The added value will be illustrated by the description of two complementary demonstrators for browsing broadcast news archieves.

Franciska de Jong | Adrianus J. van Hessen | Roeland Ordelman

[1] Wessel Kraaij,et al. Phoneme based spoken document retrieval , 1998 .

[2] Karen Spärck Jones,et al. Automatic content-based retrieval of broadcast news , 1995, MULTIMEDIA '95.

[3] Karen Spärck Jones,et al. Effects of out of vocabulary words in spoken document retrieval (poster session) , 2000, SIGIR '00.

[4] Pedro J. Moreno,et al. A recursive algorithm for the forced alignment of very long audio segments , 1998, ICSLP.

[5] Wessel Kraaij,et al. Unsupervised Event Clustering in Multilingual News Streams , 2002 .

[6] David A. van Leeuwen,et al. Automatic detection of laughter , 2005, INTERSPEECH.

[7] Nelleke Oostdijk,et al. The Spoken Dutch Corpus. Overview and First Evaluation , 2000, LREC.

[8] Alexandre Allauzen,et al. Diachronic vocabulary adaptation for broadcast news transcription , 2005, INTERSPEECH.

[9] Jonathan G. Fiscus,et al. Automatic Language Model Adaptation for Spoken Document Retrieval , 2000, RIAO.

[10] Wessel Kraaij,et al. Content Reduction for Cross-media Browsing , 2005 .

[11] K. Sparck Jones,et al. General query expansion techniques for spoken document retrieval , 1999 .

[12] Paul Over,et al. TRECVID 2003 - an overview , 2003 .

[13] Roeland Ordelman,et al. Dutch speech recognition in multimedia information retrieval , 2003 .