Interactive Indexing and Retrieval of Multimedia Content

The indexing and retrieval of multimedia items is difficult due to the semantic gap between the user's perception of the data and the descriptions we can derive automatically from the data using computer vision, speech recognition, and natural language processing. In this contribution we consider the nature of the semantic gap in more detail and show examples of methods that help in limiting the gap. These methods can be automatic, but in general the indexing and retrieval of multimedia items should be a collaborative process between the system and the user. We show how to employ the user's interaction for limiting the semantic gap.

[1]  Thomas S. Huang,et al.  Constructing table-of-content for videos , 1999, Multimedia Systems.

[2]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Arnold W. M. Smeulders,et al.  Color Invariance , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Bob J. Wielinga,et al.  Ontology-Based Photo Annotation , 2001, IEEE Intell. Syst..

[6]  Marcel Worring,et al.  Granulometric analysis of document images , 2002, Object recognition supported by user interaction for service robots.

[7]  Marcel Worring,et al.  Interactive adaptive movie annotation , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[8]  Marcel Worring,et al.  Genre-based search through biomedical images , 2002, Object recognition supported by user interaction for service robots.

[9]  Marcel Worring,et al.  Multimodal Video Indexing : A Review of the State-ofthe-art , 2001 .