Z-grid-based probabilistic retrieval for scaling up content-based copy detection

Scalability is the key issue in making content-based copy de-tection (CBCD) methods practical for very large image and video databases. Since copies are transformed versions of original documents, CBCD involves some form of retrieval by similarity using as queries the descriptions of potential copies. To enhance the scalability of an existing competitive CBCD method, we introduce here three improvements of this retrieval process: a Z-grid for building the index, uniformity-based sorting and adapted partitioning of the components. Retrieval speed is significantly increased, enabling us to monitor with a single computer one TV channel against a database of 120,000 hours of video.

[1]  Olivier Buisson,et al.  Robust Content-Based Video Copy Identification in a Large Reference Database , 2003, CIVR.

[2]  Yan Ke,et al.  An efficient parts-based near-duplicate and sub-image retrieval system , 2004, MULTIMEDIA '04.

[3]  Edward Y. Chang,et al.  RIME: a replicated image detector for the World Wide Web , 1998, Other Conferences.

[4]  Nozha Boujemaa,et al.  What's beyond query by example? , 2003 .

[5]  Edward J. Delp,et al.  Advances in Digital Video Content Protection , 2005, Proceedings of the IEEE.

[6]  Patrick Gros,et al.  Robust content-based image searches for copyright protection , 2003, MMDB '03.

[7]  Cordelia Schmid,et al.  Local Grayvalue Invariants for Image Retrieval , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Ruud M. Bolle,et al.  Comparison of sequence matching techniques for video copy detection , 2001, IS&T/SPIE Electronic Imaging.

[9]  Andrew Zisserman,et al.  Multi-view Matching for Unordered Image Sets, or "How Do I Organize My Holiday Snaps?" , 2002, ECCV.

[10]  G. Medioni,et al.  Content-based image retrieval: an overview , 2004 .

[11]  Hanan Samet,et al.  Foundations of multidimensional and metric data structures , 2006, Morgan Kaufmann series in data management systems.

[12]  Andreas Henrich,et al.  The LSD/sup h/-tree: an access structure for feature vectors , 1998, Proceedings 14th International Conference on Data Engineering.

[13]  Olivier Buisson,et al.  Discriminant local features selection using efficient density estimation in a large database , 2005, MIR '05.

[14]  Olivier Buisson,et al.  Content-Based Copy Retrieval Using Distortion-Based Probabilistic Similarity Search , 2007, IEEE Transactions on Multimedia.

[15]  Olivier Buisson,et al.  Robust voting algorithm based on labels of behavior for video copy detection , 2006, MM '06.