Multimedia Search Without Visual Analysis: The Value of Linguistic and Contextual Information

This paper addresses the focus of this special issue by analyzing the potential contribution of linguistic content and other nonimage aspects to the processing of audiovisual data. It summarizes the various ways in which linguistic content analysis contributes to enhancing the semantic annotation of multimedia content, and, as a consequence, to improving the effectiveness of conceptual media access tools. A number of techniques are presented, including the time-alignment of textual resources, audio and speech processing, content reduction and reasoning tools, and the exploitation of surface features

[1]  Jean-Luc Gauvain,et al.  Transcribing broadcast news for audio and video indexing , 2000, CACM.

[2]  Karen Spärck Jones,et al.  Automatic content-based retrieval of broadcast news , 1995, MULTIMEDIA '95.

[3]  Djoerd Hiemstra,et al.  The TIJAH XML information retrieval system , 2006, SIGIR '06.

[4]  Paul Buitelaar,et al.  Feature Representation for Cross-Lingual, Cross-Media Semantic Web Application , 2005, SemAnnot@ISWC.

[5]  Yorick Wilks,et al.  Intelligent Multimedia Indexing and Retrieval through Multi-source Information Extraction and Merging , 2003, IJCAI.

[6]  Kalina Bontcheva,et al.  Extracting Information for Automatic Indexing of Multimedia Material , 2002, LREC.

[7]  R. Manmatha,et al.  Automatic image annotation and retrieval using cross-media relevance models , 2003, SIGIR.

[8]  Paul Over,et al.  TRECVID 2003 - an overview , 2003 .

[9]  Jesús Contreras,et al.  Neptuno: Semantic Web Technologies for a Digital Newspaper Archive , 2004, ESWS.

[10]  David A. Forsyth,et al.  Matching Words and Pictures , 2003, J. Mach. Learn. Res..

[11]  Wessel Kraaij,et al.  Task based evaluation of exploratory search systems , 2006 .

[12]  Marcel Worring,et al.  Building a visual ontology for video retrieval , 2005, MULTIMEDIA '05.

[13]  Thijs Westerveld,et al.  Surface Features in Video Retrieval , 2005, Adaptive Multimedia Retrieval.

[14]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Tat-Seng Chua,et al.  TRECVID 2005 by NUS PRIS , 2005, TRECVID.

[16]  de Franciska Jong,et al.  OLIVE: Speech-Based Video Retrieval , 1998 .

[17]  Roeland Ordelman,et al.  Dutch speech recognition in multimedia information retrieval , 2003 .

[18]  Wei-Ying Ma,et al.  Multimedia information retrieval: what is it, and why isn't anyone using it? , 2005, MIR '05.

[19]  Tomek Strzalkowski Natural Language Information Retrieval , 1995, Inf. Process. Manag..

[20]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[21]  David A. van Leeuwen,et al.  Automatic detection of laughter , 2005, INTERSPEECH.

[22]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[23]  Wessel Kraaij,et al.  Content Reduction for Cross-media Browsing , 2005 .

[24]  Alexander G. Hauptmann Lessons for the Future from a Decade of Informedia Video Analysis Research , 2005, CIVR.

[25]  Alexander G. Hauptmann,et al.  LSCOM Lexicon Definitions and Annotations (Version 1.0) , 2006 .

[26]  Gustavo Carneiro,et al.  A database centric view of semantic image annotation and retrieval , 2005, SIGIR '05.

[27]  Franciska de Jong,et al.  Automated Speech and Audio Analysis for Semantic Access to Multimedia , 2006, SAMT.

[28]  Ryen W. White,et al.  Overview of the CLEF-2005 Cross-Language Speech Retrieval Track , 2005, CLEF.

[29]  Wessel Kraaij,et al.  Unsupervised Event Clustering in Multilingual News Streams , 2002 .

[30]  Alan F. Smeaton,et al.  Designing the User Interface for the Físchlár Digital Video Library , 2006, J. Digit. Inf..

[31]  Atanas Kiryakov,et al.  Semantic annotation, indexing, and retrieval , 2004, J. Web Semant..