论文信息 - VideoAL: a novel end-to-end MPEG-7 video automatic labeling system

VideoAL: a novel end-to-end MPEG-7 video automatic labeling system

In this paper, we describe a novel end-to-end video automatic labeling system, which accepts MPEG-I sequence inputs and generates MPEG-7 XML metadata files based on the prior established anchor models. Seven modules were developed for the system: shot segmentation, region segmentation, annotation, feature extraction, model learning, classification, and XML rendering. The performance of this system has been tested in the NIST TREC-2002 video concept detection benchmark. The proposed system performs best in the mean average precision out of 18 worldwide participants.

John R. Smith | Ching-Yung Lin | Milind R. Naphade | Apostol Natsev | Belle L. Tseng

[1] Haim H. Permuter,et al. IBM Research TREC 2002 Video Retrieval System , 2002, TREC.

[2] John R. Smith,et al. VideoAnnEx: IBM MPEG-7 Annotation Tool for Multimedia Indexing and Concept Learning , 2003 .

[3] John R. Smith,et al. Modeling semantic concepts to support query by keywords in video , 2002, Proceedings. International Conference on Image Processing.

[4] Milind R. Naphade. Statistical techniques in video data management , 2002, 2002 IEEE Workshop on Multimedia Signal Processing..

[5] Paul Over,et al. The TREC-2002 Video Track Report , 2002, TREC.