A Rotation Invariant Descriptor for Robust Video Copy Detection

A large amount of videos on the Internet are generated from authorized sources by various kinds of transformations. Many works are proposed for robust description of video, which lead to satisfying matching qualities on Content Based Copy Detection (CBCD) issue. However, the trade-off of efficiency and effectiveness is still a problem among the state-of-the-art CBCD approaches. In this paper, we propose a novel frame-level descriptor for video. Firstly, each selected frame is partitioned into certain rings. Then the Histogram of Oriented Gradient (HOG) and the Relative Mean Intensity (RMI) are calculated as the original features. We finally fuse these two features by summing HOGs with RMIs as the corresponding weights. The proposed descriptor is succinct in concept, compact in structure, robust for rotation like transformations and fast to compute. Experiments on the CIVR’07 Copy Detection Corpus and the Video Transformation Corpus show improved performances both on matching quality and executive time compared to the pervious approaches.

[1]  Chong-Wah Ngo,et al.  Near-duplicate keyframe retrieval with visual keywords and semantic context , 2007, CIVR '07.

[2]  Qingming Huang,et al.  Robust copy detection by mining temporal self-similarities , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[3]  Qingming Huang,et al.  Fast copy detection based on Slice Entropy Scattergraph , 2010, 2010 IEEE International Conference on Multimedia and Expo.

[4]  Li Chen,et al.  Video copy detection: a comparative study , 2007, CIVR '07.

[5]  Hung-Khoon Tan,et al.  Near-Duplicate Keyframe Identification With Interest Point Matching and Pattern Learning , 2007, IEEE Transactions on Multimedia.

[6]  Mei-Chen Yeh,et al.  Video copy detection by fast sequence matching , 2009, CIVR '09.

[7]  Qingming Huang,et al.  Near-duplicate video matching with transformation recognition , 2009, MM '09.

[8]  Hung-Khoon Tan,et al.  Scalable detection of partial near-duplicate videos by visual-temporal consistency , 2009, ACM Multimedia.

[9]  Alberto Del Bimbo,et al.  Video Clip Matching Using MPEG-7 Descriptors and Edit Distance , 2006, CIVR.

[10]  Changick Kim,et al.  Spatiotemporal sequence matching for efficient video copy detection , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Edward Y. Chang,et al.  Enhanced perceptual distance functions and indexing for image replica recognition , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Chong-Wah Ngo,et al.  Efficient Near-Duplicate Keyframe Retrieval with Visual Language Models , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[13]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).