Content-based video copy detection using nearest-neighbor mapping

We report results on video copy detection using nearest-neighbor (NN) mapping, a technique that has been used successfully in audio copy detection. For the copy detection search, we slide the query video over the test video and count the number of query frames that match the frames in the test segment. The feature we match for each test frame is the frame number of the query frame closest to that test frame. This leads to good matching scores even when the query video is distorted and contains occlusions. We test the NN mapping algorithm, and the video features used to map each test frame to its closest query frame, on the TRECVID 2009 and 2010 content-based copy detection (CBCD) evaluation data. For both tasks, NN mapping for video copy detection gives a minimal normalized detection cost rate (min NDCR) comparable to that achieved with audio copy detection on the same task. On the TRECVID 2011 CBCD evaluation data, we obtained the lowest min NDCR for 26 out of 56 transforms in the actual no-false-alarm case.
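The search described above can be illustrated with a minimal sketch. It is not the system evaluated here: the per-frame feature vectors, the squared Euclidean distance, and the function names `nn_map` and `best_alignment` are all assumptions made for illustration, since the abstract does not specify them. Each test frame is first labeled with the index of its closest query frame; the query is then slid over the test sequence, and at each offset we count the positions where that label equals the query frame index expected at that position.

```python
import numpy as np


def nn_map(test_feats, query_feats):
    """Label each test frame with the index of its closest query frame.

    Assumption: frames are plain feature vectors compared with squared
    Euclidean distance; the actual video features are not given here.
    """
    diff = test_feats[:, None, :] - query_feats[None, :, :]
    dist = (diff ** 2).sum(axis=2)      # shape: (n_test, n_query)
    return dist.argmin(axis=1)          # shape: (n_test,)


def best_alignment(nn, query_len):
    """Slide the query over the test sequence and count matching frames.

    At offset o, test frame o + q matches when nn[o + q] == q.
    Returns (best_offset, best_count).
    """
    expected = np.arange(query_len)
    best_offset, best_count = -1, -1
    for offset in range(len(nn) - query_len + 1):
        count = int((nn[offset:offset + query_len] == expected).sum())
        if count > best_count:
            best_offset, best_count = offset, count
    return best_offset, best_count


# Toy data (hypothetical): a 5-frame query embedded at offset 7
# in a 20-frame test sequence, with a small additive distortion.
query = 10.0 * np.eye(5)
test = np.zeros((20, 5))
test[7:12] = query + 0.1

offset, count = best_alignment(nn_map(test, query), len(query))
print(offset, count)
```

Because the count depends only on which query frame is closest, not on how close it is, a uniform distortion of the copied segment leaves the mapping unchanged in this toy case, which is consistent with the robustness claimed above.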
