Paper information - Enabling MPEG-7 structural and semantic descriptions in retrieval applications

The MPEG-7 standard supports the description of both the structure and the semantics of multimedia; however, the generation and consumption of MPEG-7 structural and semantic descriptions are outside the scope of the standard. This article presents two research prototype systems that demonstrate the generation and consumption of MPEG-7 structural and semantic descriptions in retrieval applications. The active system for MPEG-4 video object simulation (AMOS) is a video object segmentation and retrieval system that segments, tracks, and models objects in videos (e.g., person, car) as a set of regions with corresponding visual features and spatiotemporal relations. The region-based model provides an effective base for similarity retrieval of video objects. The second system, the Intelligent Multimedia Knowledge Application (IMKA), uses the novel MediaNet framework for representing semantic and perceptual information about the world using multimedia. MediaNet knowledge bases can be constructed automatically from annotated collections of multimedia data and used to enhance the retrieval of multimedia.

Shih-Fu Chang | Di Zhong | Ana B. Benitez
