Tracking users' capture intention: a novel complementary view for home video content analysis

In this paper, we present a novel view to home video content analysis, which aims at tracking the capture intention of camcorder users. Based on the study of intention mechanism in psychology, a set of domain-specific capture intention concepts are defined. A comprehensive and extensible scheme consisting of video structuring, intention oriented feature analysis, as well as intention unit segmentation and classification is proposed to mine the users' capture intention. Experiments were carried on home video sequences of 90 hours in total, taken by 16 persons in recent 20 years. Both the user study and objective evaluations indicate that our proposed intention-based approach is an effective complement to existing home video content analysis schemes.

[1]  Yong Rui,et al.  Segmenting visual actions based on spatio-temporal motion patterns , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[2]  Lie Lu,et al.  Improve audio representation by using feature structure patterns , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Hyung-Myung Kim,et al.  Efficient camera motion characterization for MPEG video indexing , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[4]  Robert E. Schapire,et al.  Theoretical Views of Boosting and Applications , 1999, ALT.

[5]  William Brown,et al.  Psychology and life , 1934 .

[6]  Shingo Uchihashi,et al.  A semi-automatic approach to home video editing , 2000, UIST '00.

[7]  Tao Mei,et al.  To mine capture intention of camcorder users , 2005, Visual Communications and Image Processing.

[8]  Rainer Lienhart Dynamic video summarization of home video , 1999, Electronic Imaging.

[9]  Peng Wu A semi-automatic approach to detect highlights for home video annotation , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  Michael E. Bratman,et al.  Intention, Plans, and Practical Reason , 1991 .

[11]  Lie Lu,et al.  Optimization-based automated home video editing system , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[12]  Shih-Fu Chang,et al.  Understanding and modeling user interests in consumer videos , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[13]  HongJiang Zhang,et al.  Contrast-based image attention analysis by using fuzzy growing , 2003, MULTIMEDIA '03.