Self-similarity-based partial near-duplicate video retrieval and alignment

Recent studies have addressed partial near-duplicate videos, in which only some segments of two videos are near-duplicates of each other. State-of-the-art search schemes usually segment the input video into clips and perform clip-level near-duplicate retrieval. However, the segmentation results are always poorly aligned, which leads to a difficult “unbalance” problem. In this paper, we introduce a self-similarity-based feature representation called the Self-Similarity Belt (SSBelt), which is derived from the Self-Similarity Matrix (SSM). A distinctive pattern in the SSBelt, called the Interest Corner, is detected and described with a bag-of-words representation. The visual words are then combined into visual shingles and stored in an inverted file index for fast retrieval. Another important task is to accurately align the unbalanced clips, for which we propose the Intensity Mark (IMark) and design a coarse-to-fine near-duplicate video localization scheme. Experimental results show the effectiveness of our approach on both web-based near-duplicate video and unbalanced video datasets, and the near-duplicate alignment capacity of IMark is also shown to be effective.
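As a rough illustration of two building blocks named in the abstract, the sketch below computes a self-similarity matrix over per-frame feature vectors and indexes sequences of visual words as shingles in an inverted file. It is not the authors' implementation: the Euclidean distance, the shingle length `k`, the vote-counting query, and all function names are assumptions made for illustration, and the SSBelt, Interest Corner, and IMark steps are not modeled here.

```python
from collections import defaultdict
from math import sqrt


def self_similarity_matrix(features):
    """Pairwise Euclidean distances between per-frame feature vectors.

    Assumption: the abstract does not state which distance the SSM uses.
    """
    n = len(features)
    ssm = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            d = sqrt(sum((a - b) ** 2 for a, b in zip(features[i], features[j])))
            ssm[i][j] = ssm[j][i] = d  # the matrix is symmetric
    return ssm


def shingles(words, k):
    """All runs of k consecutive visual words (k is a hypothetical parameter)."""
    return [tuple(words[i:i + k]) for i in range(len(words) - k + 1)]


def build_index(clips, k):
    """Inverted file: shingle -> set of clip ids that contain it."""
    index = defaultdict(set)
    for clip_id, words in clips.items():
        for s in shingles(words, k):
            index[s].add(clip_id)
    return index


def query(index, words, k):
    """Rank clips by the number of distinct shingles shared with the query."""
    votes = defaultdict(int)
    for s in set(shingles(words, k)):
        for clip_id in index.get(s, ()):
            votes[clip_id] += 1
    return sorted(votes.items(), key=lambda kv: (-kv[1], kv[0]))


# Toy usage with made-up frame features and visual-word ids.
ssm = self_similarity_matrix([[0, 0], [3, 4], [0, 0]])
index = build_index({"a": [1, 2, 3, 4], "b": [9, 2, 3, 7]}, k=2)
ranking = query(index, [2, 3, 4], k=2)
```

Counting shared shingles is only one simple way to rank candidates; a resemblance measure normalized by clip length would be a natural alternative when clips differ greatly in duration.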
