Pattern-Based Near-Duplicate Video Retrieval and Localization on Web-Scale Videos

With the exponential growth of web multimedia content, the Internet is rife with near-duplicate videos, that is, copies of a video altered by visual or temporal transformations and/or post-production. Two critical issues arise as a result: copyright infringement and redundancy in search results. To address these problems, this paper proposes a spatiotemporal pattern-based approach under a hierarchical filter-and-refine framework for efficient and effective near-duplicate video retrieval and localization. First, non-near-duplicate videos are quickly filtered out using a computationally efficient data structure, termed the pattern-based index tree (PI-tree). Then, an m-pattern-based dynamic programming (mPDP) algorithm is designed to localize near-duplicate segments and to re-rank the retrieved videos. The influence of time-shift misalignment is alleviated by a time-shift m-pattern similarity (TPS) measurement. Comprehensive experiments on five datasets are conducted to verify the effectiveness, efficiency, robustness, and scalability of the proposed approach. The results demonstrate that the proposed approach outperforms state-of-the-art approaches in terms of mean average precision (MAP) and normalized detection cost rate (NDCR) on the testing datasets. Furthermore, the proposed approach achieves high-quality near-duplicate video localization in terms of quality frames (QF) and mean F1.
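For readers unfamiliar with the retrieval metric named above, the following is a minimal sketch of how mean average precision (MAP) is commonly computed over a set of queries. It is the standard textbook definition, not code from the paper; the function names, the video identifiers, and the assumption that each ranked list contains no repeated identifiers are illustrative only.

```python
from typing import Hashable, Sequence, Set, Tuple


def average_precision(ranked: Sequence[Hashable], relevant: Set[Hashable]) -> float:
    """Average precision for one query.

    Sums precision@k at every rank k that holds a relevant (near-duplicate)
    video, then divides by the total number of relevant videos.
    Assumes `ranked` contains no repeated identifiers.
    """
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, video_id in enumerate(ranked, start=1):
        if video_id in relevant:
            hits += 1
            precision_sum += hits / rank  # precision at this rank
    return precision_sum / len(relevant)


def mean_average_precision(
    runs: Sequence[Tuple[Sequence[Hashable], Set[Hashable]]]
) -> float:
    """Mean of the per-query average precision over all (ranking, ground truth) pairs."""
    if not runs:
        return 0.0
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)


if __name__ == "__main__":
    # Hypothetical rankings for two queries; the IDs are made up.
    runs = [
        (["a", "x", "b"], {"a", "b"}),  # AP = (1/1 + 2/3) / 2
        (["y", "c"], {"c"}),            # AP = (1/2) / 1
    ]
    print(round(mean_average_precision(runs), 4))
```

A higher MAP means near-duplicates tend to appear earlier in the ranked lists, which is why re-ranking steps are usually evaluated with it.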
