Content-based video retrieval: does video's semantic visual feature matter?
暂无分享,去创建一个
A new shot level video browsing method based on semantic visual features (e.g., car, mountain, and fire) is proposed to facilitate content-based retrieval. The video's binary semantic feature vector is utilized to calculate the score of similarity between two shot keyframes. The score is then used to browse the "similar" keyframes in terms of semantic visual features. A pilot user study was conducted to better understand users' behaviors in video retrieval context. Three video retrieval and browsing systems are compared: temporal neighbor, semantic visual feature, and fused browsing system. The initial results indicated that the semantic visual feature browsing was effective and efficient for Visual Centric tasks, but not for Non-visual Centric tasks.
[1] João Magalhães,et al. Video Retrieval Using Search and Browsing , 2004, TRECVID.
[2] Dong Xu,et al. Columbia University TRECVID-2006 Video Search and High-Level Feature Extraction , 2006, TRECVID.
[3] Noel E. O'Connor,et al. Combining textual and visual information processing for interactive video retrieval: SCHEMA's participation in TRECVID 2004 , 2004 .