MPEG-7 based description schemes for multi-level video content classification

MPEG-7 has emerged as the standard for multimedia data content description for efficiently describing multimedia content. In this context, its primary goal is to provide flexible and effective searching and retrieval of multimedia resources. Most of the earlier work on MPEG-7 description schemes (DSs) and descriptors (Ds) focuses on the description of a single multimedia document, whereas MPEG-7 can be further exploited to support more advances implementations under multimedia database systems. Therefore, it is important to reconsider issues related to high level multimedia modeling and representation, in the light of the MPEG-7 perspective. In this paper, we propose a high level multimedia representation and description scheme based on multi-level video modeling and semantic video classification. The proposed multi-level multimedia representation and DSs are expected to support more effective video content indexing and accessing operations. The presented DSs and Ds are further described by using the XML Schema language, which has been adopted as the basis of the Description Definition Language (DDL) of the MPEG-7 standard.

[1]  Ming-Chieh Lee,et al.  Semiautomatic segmentation and tracking of semantic video objects , 1998, IEEE Trans. Circuits Syst. Video Technol..

[2]  Aljoscha Smolic,et al.  A set of visual feature descriptors and their combination in a low-level description scheme , 2000, Signal Process. Image Commun..

[3]  Boon-Lock Yeo,et al.  Extracting story units from long programs for video browsing and navigation , 1996, Proceedings of the Third IEEE International Conference on Multimedia Computing and Systems.

[4]  Qian Huang,et al.  Object-based multimedia content description schemes and applications for MPEG-7 , 2000, Signal processing. Image communication.

[5]  Noel E. O'Connor,et al.  Description schemes for video programs, users and devices , 2000, Signal Process. Image Commun..

[6]  Shih-Fu Chang,et al.  MetaSEEk: a content-based metasearch engine for images , 1997, Electronic Imaging.

[7]  Jianping Fan,et al.  Automatic image segmentation by integrating color-edge extraction and seeded region growing , 2001, IEEE Trans. Image Process..

[8]  Thomas S. Huang,et al.  Constructing table-of-content for videos , 1999, Multimedia Systems.

[9]  Jianping Fan,et al.  Adaptive motion-compensated video coding scheme towards content-based bit rate allocation , 2000, J. Electronic Imaging.

[10]  Thomas S. Huang,et al.  Relevance feedback: a power tool for interactive content-based image retrieval , 1998, IEEE Trans. Circuits Syst. Video Technol..

[11]  Jianping Fan,et al.  Model-Based Video Classification toward Hierarchical Representation, Indexing and Access , 2002, Multimedia Tools and Applications.

[12]  A. Murat Tekalp,et al.  Content-based access to video objects: Temporal Segmentation, visual summarization, and feature extraction , 1998, Signal Process..

[13]  E. Terzi A. Vakali J. Fan M.-S. Hacid,et al.  The MPEG-7 Multimedia Content Description Standard and the XML Schema Language , 2005 .

[14]  Shih-Fu Chang,et al.  Self-describing schemes for interoperable MPEG-7 multimedia content descriptions , 1998, Electronic Imaging.

[15]  Jianping Fan,et al.  Model-based semantic object extraction for content-based video representation and indexing , 2001, IS&T/SPIE Electronic Imaging.

[16]  Takeo Kanade,et al.  Name-It: association of face and name in video , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.