A probabilistic framework for fusing frame-based searches within a video copy detection system

In the last few years, content-based video copy detection became an important and key tool for solving the tricky problem of video copyright protection. This problem has been heightened with the development of web video exchange platforms. In general, content-based video copy detection relies on the description of the visual content of the video. The video is segmented and a selected subset of frames, called keyframes, is described. Searching for a video is then performed by a set of similarity searches based on keyframes. These searches provide partial results that have to be integrated and fused. In this paper, we focus on this particular and crucial step. The objective is to properly fuse together partial results, to take into account the temporal coherence of the video and to be efficient (i.e. rapid). The solution we propose is based on a probabilistic framework that models the different parameters and inputs of this step and enables to deal with the temporal consistency. It also makes the process more reliable, as imprecision tolerated during the keyframe-based similarity searches has no impact on the overall accuracy. This particularly allows the speeding up of the detection process.

[1]  Olivier Buisson,et al.  Statistical similarity search applied to content-based video copy detection , 2005, 21st International Conference on Data Engineering Workshops (ICDEW'05).

[2]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[3]  Patrick Gros,et al.  Approximate searches: k-neighbors + precision , 2003, CIKM '03.

[4]  Ton Kalker,et al.  Feature Extraction and a Database Strategy for Video Fingerprinting , 2002, VISUAL.

[5]  A. Benveniste,et al.  Desgin and comparative study of some sequential jump detection algorithms for digital signals , 1983 .

[6]  Cordelia Schmid,et al.  A performance evaluation of local descriptors , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Christopher Hunt,et al.  Notes on the OpenSURF Library , 2009 .

[8]  Yueting Zhuang,et al.  Adaptive key frame extraction using unsupervised clustering , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[9]  Patrick Gros,et al.  A Geometrical Key-Frame Selection Method Exploiting Dominant Motion Estimation in Video , 2004, CIVR.

[10]  Changick Kim,et al.  Spatiotemporal sequence matching for efficient video copy detection , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Patrick Pérez,et al.  Color-Based Probabilistic Tracking , 2002, ECCV.

[12]  Laurent Amsaleg,et al.  Scalability of local image descriptors: a comparative study , 2006, MM '06.

[13]  Simon J. Godsill,et al.  On sequential Monte Carlo sampling methods for Bayesian filtering , 2000, Stat. Comput..

[14]  Edith Cohen,et al.  Finding interesting associations without support pruning , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[15]  Patrick Gros,et al.  Robust content-based image searches for copyright protection , 2003, MMDB '03.

[16]  Nasir D. Memon,et al.  Spatio–Temporal Transform Based Video Hashing , 2006, IEEE Transactions on Multimedia.

[17]  Li Chen,et al.  Video copy detection: a comparative study , 2007, CIVR '07.