An Incremental Clustering based codebook construction in video copy detection

The quality of codebook is the determinant factor in BoW-based copy detection strategies. However, most of the adopted codebook construction algorithms are derived from image retrieval or object recognition, which neglect the robustness in partitioning original features and copy features (especially those with serious transformations) into the same group. To deal with this problem, we have developed an Incremental Clustering algorithm to construct a robust codebook. Unlike many existing algorithms which need loading the entire data into memory, our algorithm process data incrementally and improve the quality by transferring data from a global view. In addition, we propose a codebook evaluation scheme by simulating copy and non-copy pairs. Our experimental results show that our approach attains a high precision in copy pairs, which demonstrates the robustness of our codebook.

[1]  Lei Wang Toward A Discriminative Codebook: Codeword Selection across Multi-resolution , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Michalis Vazirgiannis,et al.  On Clustering Validation Techniques , 2001, Journal of Intelligent Information Systems.

[3]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[4]  Chun Chen,et al.  Discriminative codeword selection for image representation , 2010, ACM Multimedia.

[5]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[6]  Sheng Tang,et al.  Visual words based spatiotemporal sequence matching in video copy detection , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[7]  Rong Jin,et al.  Online visual vocabulary pruning using pairwise constraints , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.