Indexing and retrieval of multimedia objects at different levels of granularity

Intelligent access to multimedia databases for `naive users' should probably be based on query formulation by `intelligent agents'. These agents should `understand' the semantics of the contents, learn user preferences, and deliver to the user a subset of the source contents for further navigation. The goal of such systems should be to enable `zero-command' access to the contents while preserving the user's freedom of choice. Such systems should interpret multimedia contents in terms of multiple audiovisual objects (from whole videos down to individual visual or audio objects), and in terms of actions and scenarios.
