Affordable access to multimedia by exploiting collateral data

In addition to multimedia collections and their metadata, a variety of collateral data sources is often available for (parts of) a collection. Collateral data, that is, secondary information objects that relate to the primary multimedia documents, can be very useful in the automated generation of annotations for multimedia archives, because they reduce both the cost and the effort of annotation and access. They can also be used to enhance result presentation in retrieval engines. To exploit collateral data optimally, methods for automatic indexing as well as changes to the current archiving workflow are proposed.
