A novel spatio-temporal registration framework for video copy localization based on multimodal features

Fighting movie piracy requires copy detection followed by the accurate frame alignments of master and copy videos, in order to estimate distortion model and capture location in a theater. Existing research on pirate video registration utilizes only visual features for aligning pirate and master videos, while no effort is made to employ acoustic features. Further, most studies in illegal video registration concentrate on the alignment of watermarked videos, while few attempts are made to address the alignment of non-watermarked sequences. We attempt to solve these issues, by proposing a novel spatio-temporal registration framework that utilizes content-based multimodal features for frame alignments. The proposed scheme includes three stages: first, a video sequence is compactly represented using Speeded Up Robust Features (SURF) and audio spectral signatures; second, sliding window based dynamic time warping (DTW) is employed to compute temporal frame alignments; third, robust SURF descriptors are utilized to generate accurate geometric frame alignments. The results of experiments on three different datasets demonstrate the robustness and efficiency of the proposed method against various video transformations.

[1]  Hui Cheng Temporal registration of video sequences , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[2]  Hui Cheng,et al.  Spatial temporal and histogram video registration for digital watermark detection , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[3]  Bertrand Chupeau,et al.  Temporal Video Registration for Watermark Detection , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[4]  Bertrand Chupeau,et al.  In-theater piracy: finding where the pirate was , 2008, Electronic Imaging.

[5]  Jian Lu,et al.  Video fingerprinting for copy identification: from research to industry applications , 2009, Electronic Imaging.

[6]  Patrick Lambert,et al.  A Simple but Effective Approach to Video Copy Detection , 2010, 2010 Canadian Conference on Computer and Robot Vision.

[7]  Yao Zhao,et al.  Frame Fusion for Video Copy Detection , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[8]  Bertrand Chupeau,et al.  Adaptive video fingerprints for accurate temporal registration , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[9]  Meinard Müller,et al.  Information retrieval for music and motion , 2007 .

[10]  Sang Uk Lee,et al.  Video frame-matching algorithm using dynamic programming , 2009, J. Electronic Imaging.

[11]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[12]  Anssi Klapuri,et al.  Musical instrument recognition using cepstral coefficients and temporal features , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[13]  G. Ram Mohana Reddy,et al.  A Novel Approach to Video Copy Detection Using Audio Fingerprints and PCA , 2011, ANT/MobiWIS.

[14]  Bertrand Chupeau,et al.  Image and video fingerprinting: forensic applications , 2009, Electronic Imaging.

[15]  Bertrand Chupeau,et al.  Automatic estimation and compensation of geometric distortions in video copies , 2007, Electronic Imaging.

[16]  Hui Cheng A review of video registration methods for watermark detection in digital cinema applications , 2004, 2004 IEEE International Symposium on Circuits and Systems (IEEE Cat. No.04CH37512).

[17]  Ning Chen,et al.  A robust hashing algorithm based on SURF for video copy detection , 2012, Comput. Secur..

[18]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[19]  Benoit M. Macq,et al.  Temporal alignment of video sequences for watermarking systems , 2003, IS&T/SPIE Electronic Imaging.

[20]  A. Aydin Alatan,et al.  Content Based Copy Detection with Coarse Audio-Visual Fingerprints , 2009, 2009 Seventh International Workshop on Content-Based Multimedia Indexing.

[21]  Tae Hong Park Introduction to digital signal processing - Computer Musically Speaking , 2009 .

[22]  Michel Barlaud,et al.  Compensation of geometrical deformations for watermark extraction in digital cinema application , 2001, IS&T/SPIE Electronic Imaging.

[23]  B. S. Manjunath,et al.  Efficient and Robust Detection of Duplicate Videos in a Large Database , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[24]  Bertrand Chupeau,et al.  A framework for video forensics based on local and temporal fingerprints , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[25]  Zhijie Zhang,et al.  Video copy detection based on Speeded Up Robust Features and Locality Sensitive Hashing , 2010, 2010 IEEE International Conference on Automation and Logistics.