A Survey on Multimodal Video Representation for Semantic Retrieval

This paper surveys the approaches to video representation, focusing on semantic analysis for content-based indexing and retrieval. A problem of adaptive representation of digital multimedia is critically assessed and some novel ideas are presented. Furthermore, the concept of video multimodality is reevaluated and redefined in order to introduce modalities such as editing technique or affect to the audience

[1]  Sion Hannuna,et al.  SEGMENTING QUADRUPED GAIT PATTERNS FROM WILDLIFE VIDEO , 2005 .

[2]  Alberto Del Bimbo,et al.  Highlights modeling and detection in sports videos , 2004, Pattern Analysis and Applications.

[3]  Janko Calic,et al.  Automated Visual Recognition of Individual African Penguins , 2004 .

[4]  Riccardo Leonardi,et al.  Indexing audiovisual databases through joint audio and video processing , 1998, Int. J. Imaging Syst. Technol..

[5]  FRANK NACK,et al.  Toward the Automated Editing of Theme Oriented Video Sequences , 1997, Appl. Artif. Intell..

[6]  Brett Adams Where does computational media aesthetics fit , 2003 .

[7]  Chitra Dorai,et al.  Bridging the semantic gap with computational media aesthetics , 2003, IEEE MultiMedia.

[8]  Lev Kuleshov,et al.  Kuleshov on Film: Writings by Lev Kuleshov , 1975 .

[9]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Neill W. Campbell,et al.  Iterative refinement by relevance feedback in content-based digital image retrieval , 1998, MULTIMEDIA '98.

[11]  Riccardo Leonardi,et al.  Semantic Indexing of Multimedia Documents , 2002, IEEE Multim..

[12]  Majid Mirmehdi,et al.  ICBR - Multimedia Management System for Intelligent Content Based Retrieval , 2004, CIVR.

[13]  Ishwar K. Sethi,et al.  Multimedia content processing through cross-modal association , 2003, MULTIMEDIA '03.

[14]  Riccardo Leonardi,et al.  Semantic indexing of soccer audio-visual sequences: a multimodal approach based on controlled Markov chains , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[15]  Joëlle Coutaz,et al.  A design space for multimodal systems: concurrent processing and data fusion , 1993, INTERCHI.

[16]  David A. Forsyth,et al.  Matching Words and Pictures , 2003, J. Mach. Learn. Res..

[17]  F. Saussure,et al.  Course in General Linguistics , 1960 .

[18]  Steffen Staab,et al.  Semantic Annotation of Images and Videos for Multimedia Analysis , 2005, ESWC.

[19]  Thomas S. Huang,et al.  Relevance feedback in image retrieval: A comprehensive review , 2003, Multimedia Systems.

[20]  Svetha Venkatesh,et al.  Media computing : computational media aesthetics , 2002 .

[21]  Marcel Worring,et al.  Multimodal Video Indexing : A Review of the State-ofthe-art , 2001 .

[22]  Simone Santini,et al.  Emergent Semantics through Interaction in Image Databases , 2001, IEEE Trans. Knowl. Data Eng..

[23]  Glorianna Davenport,et al.  Cinematic primitives for multimedia , 1991, IEEE Computer Graphics and Applications.

[24]  T. Burghardt,et al.  Analysing animal behaviour in wildlife videos using face detection and tracking , 2006 .

[25]  Gerard Salton,et al.  The SMART Retrieval System—Experiments in Automatic Document Processing , 1971 .

[26]  Nevenka Dimitrova Context and Memory in Multimedia Content Analysis , 2004, IEEE Multim..

[27]  Gert Cauwenberghs,et al.  Incremental and Decremental Support Vector Machine Learning , 2000, NIPS.

[28]  Christian Metz,et al.  Essais sur la signification au cinéma , 2013 .

[29]  Alan Hanjalic,et al.  Affective video content representation and modeling , 2005, IEEE Transactions on Multimedia.

[30]  Janko Calic,et al.  A rule-based video annotation system , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[31]  Atsuo Yoshitaka,et al.  A Survey on Content-Based Retrieval for Multimedia Databases , 1999, IEEE Trans. Knowl. Data Eng..

[32]  Yiannis Kompatsiaris,et al.  Achieving Integration of Knowledge and Content Technologies: The aceMedia Project , 2004, EWIMT.

[33]  Milind R. Naphade On supervision and statistical learning for semantic multimedia analysis , 2004, J. Vis. Commun. Image Represent..

[34]  Marc Davis,et al.  Media streams: representing video for retrieval and repurposing , 1994, MULTIMEDIA '94.

[35]  Steffen Staab Emergent Semantics , 2002, IEEE Intell. Syst..

[36]  Zhu Liu,et al.  Multimedia content analysis-using both audio and visual clues , 2000, IEEE Signal Process. Mag..

[37]  Janko Calic,et al.  Spatial analysis in key-frame extraction using video segmentation , 2004 .

[38]  S. Eisenstein,et al.  Film Form: Essays in Film Theory , 1949 .

[39]  Janko Calic,et al.  Tracking Animals in Wildlife Videos Using Face Detection , 2004, EWIMT.

[40]  Thomas S. Huang,et al.  Factor graph framework for semantic video indexing , 2002, IEEE Trans. Circuits Syst. Video Technol..

[41]  L. Manovich,et al.  The language of new media , 2001 .