Large-Scale Video Search with Efficient Temporal Voting Structure

In this work, we propose a fast content-based video querying system for large-scale video search. The proposed system is distinguished from similar works with two major contributions. First contribution is superiority of joint usage of repeated content representation and efficient hashing mechanisms. Repeated content representation is utilized with a simple yet robust feature, which is based on edge energy of frames. Each of the representation is converted into hash code with Hamming Embedding method for further queries. Second contribution is novel queue-based voting scheme that leads to modest memory requirements with gradual memory allocation capability, contrary to complete brute-force temporal voting schemes. This aspect enables us to make queries on large video databases conveniently, even on commodity computers with limited memory capacity. Our results show that the system can respond to video queries on a large video database with fast query times, high recall rate and very low memory and disk requirements.

[1]  Gozde Bozdagi Akar,et al.  Enhanced spatio-temporal video copy detection by combining trajectory and spatial consistency , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[2]  Jiajun Wang,et al.  VCDB: A Large-Scale Database for Partial Copy Detection in Videos , 2014, ECCV.

[3]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[4]  Cordelia Schmid,et al.  Hamming Embedding and Weak Geometric Consistency for Large Scale Image Search , 2008, ECCV.

[5]  Fei Wang,et al.  Real-time large scale near-duplicate web video retrieval , 2010, ACM Multimedia.

[6]  Olivier Buisson,et al.  Scaling content-based video copy detection to very large databases , 2009, Multimedia Tools and Applications.

[7]  Yanqiang Lei,et al.  Video Sequence Matching Based on the Invariance of Color Correlation , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[8]  Bernd Girod,et al.  Stanford I2V: a news video dataset for query-by-image experiments , 2015, MMSys.

[9]  Hung-Khoon Tan,et al.  Scalable detection of partial near-duplicate videos by visual-temporal consistency , 2009, ACM Multimedia.

[10]  Paul Over,et al.  Evaluation campaigns and TRECVid , 2006, MIR '06.

[11]  Cordelia Schmid,et al.  An Image-Based Approach to Video Copy Detection With Spatio-Temporal Post-Filtering , 2010, IEEE Transactions on Multimedia.

[12]  Shumeet Baluja,et al.  Advertisement Detection and Replacement using Acoustic and Visual Repetition , 2006, 2006 IEEE Workshop on Multimedia Signal Processing.

[13]  Jean-Hugues Chenot,et al.  A large-scale audio and video fingerprints-generated database of TV repeated contents , 2014, 2014 12th International Workshop on Content-Based Multimedia Indexing (CBMI).

[14]  Guillaume Gravier,et al.  Efficient Mining of Repetitions in Large-Scale TV Streams with Product Quantization Hashing , 2012, ECCV Workshops.