A suffix array approach to video copy detection in video sharing social networks

To address the multiplicity and copyright issues on file sharing social networks, we propose a fast video copy detection algorithm based on the suffix array data structure. The proposed algorithm consists of two steps. In the first step, we extract robust features that are discriminative yet insensitive to various attacks. Specifically, we develop a compact one-dimensional signature based on the shot-change positions of video files. Unlike image and audio files, video files are usually large, which makes matching two long signature sequences computationally expensive. Thus, in the second step, we adopt an efficient matching technique based on the suffix array data structure. The proposed system performs sequence matching in linear time, whereas the complexity of conventional duplicate video detection algorithms grows at least quadratically with the video length.
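The two steps can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions. The signature is taken to be the sequence of shot lengths (differences between consecutive shot-change frame indices), which is one plausible reading of "based on the shot-change positions". The function names are hypothetical. The suffix array is built by plain sorting for brevity, so this sketch does not run in linear time; linear-time construction algorithms exist and would be needed to reach the stated complexity. The matching step finds the longest run of shot lengths shared by two signatures.

```python
from typing import List, Sequence, Tuple


def shot_length_signature(shot_changes: Sequence[int]) -> List[int]:
    """Assumed signature: lengths of shots between consecutive shot changes."""
    return [b - a for a, b in zip(shot_changes, shot_changes[1:])]


def build_suffix_array(seq: Sequence[int]) -> List[int]:
    """Suffix array by direct sorting (illustrative only, not linear time)."""
    return sorted(range(len(seq)), key=lambda i: seq[i:])


def lcp_array(seq: Sequence[int], sa: Sequence[int]) -> List[int]:
    """Kasai's algorithm: lcp[r] is the common prefix length of the
    suffixes at sa[r - 1] and sa[r]."""
    n = len(seq)
    rank = [0] * n
    for r, i in enumerate(sa):
        rank[i] = r
    lcp = [0] * n
    h = 0
    for i in range(n):
        if rank[i] > 0:
            j = sa[rank[i] - 1]
            while i + h < n and j + h < n and seq[i + h] == seq[j + h]:
                h += 1
            lcp[rank[i]] = h
            if h > 0:
                h -= 1
        else:
            h = 0
    return lcp


def longest_common_run(a: Sequence[int], b: Sequence[int]) -> Tuple[int, int, int]:
    """Return (length, start_in_a, start_in_b) of the longest shared run.

    Assumes all signature values are positive, so -1 is a safe separator.
    """
    joined = list(a) + [-1] + list(b)
    sa = build_suffix_array(joined)
    lcp = lcp_array(joined, sa)
    split = len(a)  # index of the separator in `joined`
    best = (0, -1, -1)
    for r in range(1, len(sa)):
        i, j = sa[r - 1], sa[r]
        if i == split or j == split:
            continue  # skip the suffix that starts at the separator
        # Only adjacent suffixes from different signatures count as a match.
        if (i < split) != (j < split) and lcp[r] > best[0]:
            pos_a, pos_b = (i, j) if i < split else (j, i)
            best = (lcp[r], pos_a, pos_b - split - 1)
    return best


# Hypothetical shot-change frame indices for an original and a suspected copy.
original = shot_length_signature([0, 40, 95, 130, 210, 260, 300])
suspect = shot_length_signature([0, 25, 80, 115, 195, 230])
match = longest_common_run(original, suspect)
```

In this toy input the two signatures share the run of shot lengths 55, 35, 80, so `match` reports a run of length 3 starting at index 1 in both signatures. An exact-match comparison like this would tolerate only attacks that leave shot lengths unchanged; a practical system would need some tolerance for small shifts in detected shot boundaries, which this sketch does not attempt.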
