Semantic home video categorization

Nowadays, a strong need exists for the efficient organization of an increasing amount of home video content. To create an efficient system for the management of home video content, it is required to categorize home video content in a semantic way. So far, a significant amount of research has already been dedicated to semantic video categorization. However, conventional categorization approaches often rely on unnecessary concepts and complicated algorithms that are not suited in the context of home video categorization. To overcome the aforementioned problem, this paper proposes a novel home video categorization method that adopts semantic home photo categorization. To use home photo categorization in the context of home video, we segment video content into shots and extract key frames that represent each shot. To extract the semantics from key frames, we divide each key frame into ten local regions and extract lowlevel features. Based on the low level features extracted for each local region, we can predict the semantics of a particular key frame. To verify the usefulness of the proposed home video categorization method, experiments were performed with home video sequences, labeled by concepts part of the MPEG-7 VCE2 dataset. To verify the usefulness of the proposed home video categorization method, experiments were performed with 70 home video sequences. For the home video sequences used, the proposed system produced a recall of 77% and an accuracy of 78%.

[1]  John R. Smith,et al.  Multimedia semantic indexing using model vectors , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[2]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Yong Man Ro,et al.  Video contents authoring system for efficient consumption on portable multimedia device , 2008, Electronic Imaging.

[4]  N. Nikolaidis,et al.  Video shot detection and condensed representation. a review , 2006, IEEE Signal Processing Magazine.

[5]  B. S. Manjunath,et al.  Introduction to mpeg-7 , 2002 .

[6]  Wei-Hao Lin,et al.  News video classification using SVM-based multimodal classifiers and combination strategies , 2002, MULTIMEDIA '02.

[7]  Yong Man Ro,et al.  Semantic Home Photo Categorization , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[8]  Yong Man Ro,et al.  Automatic Image Categorization using MPEG-7 Description , 2003, IS&T/SPIE Electronic Imaging.

[9]  Yong Man Ro,et al.  Hierarchical rotational invariant similarity measurement for MPEG-7 homogeneous texture descriptor , 2000 .

[10]  Paul Over,et al.  TRECVID 2006 Overview , 2006, TRECVID.

[11]  Sang Heun Shim,et al.  Real-time shot boundary detection for digital video camera using the MPEG-7 descriptor , 2002, IS&T/SPIE Electronic Imaging.

[12]  Mor Naaman,et al.  Why we tag: motivations for annotation in mobile and online media , 2007, CHI.

[13]  Yong Man Ro,et al.  Home Photo Categorization Based on Photographic Region Templates , 2005, AIRS.

[14]  YongMan Ro Category Classification using Multiple MPEG-7 Descriptors , 2002 .

[15]  John R. Smith,et al.  Large-scale concept ontology for multimedia , 2006, IEEE MultiMedia.

[16]  Roberto Manduchi,et al.  A Study on Bayes Feature Fusion for Image Classification , 2003, 2003 Conference on Computer Vision and Pattern Recognition Workshop.

[17]  John Platt,et al.  Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .