Paper information - Kodak consumer video benchmark data set: concept definition and annotation

Semantic indexing of images and videos in the consumer domain has become an important issue for both research and practical applications. In this work we developed Kodak's consumer video benchmark data set, which includes (1) a significant number of videos from actual users, (2) a rich lexicon that accommodates consumers' needs, and (3) the annotation of a subset of concepts over the entire video data set. To the best of our knowledge, this is the first systematic work in the consumer domain aimed at the definition of a large lexicon, the construction of a large benchmark data set, and the annotation of videos in a rigorous fashion. Such an effort will have significant impact by providing a sound foundation for developing and evaluating large-scale learning-based semantic indexing/annotation techniques in the consumer domain.
