A robust and low-cost video fingerprint extraction method for copy detection

Video fingerprinting for content-based video identification is a very useful task for the management and monetization of copyrighted content distribution. The main challenges of monitoring and copy detection systems are: a) the effective identification of highly transformed videos (robustness) and b) computational efficiency which may be relevant for some applications. Typically, most video fingerprinting methods focus on robustness leaving aside computational efficiency. However, for real-time applications are necessary low computational cost detection methods, for instance, in illegal content monitoring in video streaming distributions. Therefore, in this paper, we propose a low-cost and effective video fingerprint extraction method based on the combination of content-based features using both acoustic and visual video components. Our method is capable of detecting video copies by using computationally efficient fingerprints while maintaining robustness against the decrease in quality and content preserved distortions, which are frequent but severe attacks.

[1]  Georges Quénot,et al.  TRECVID 2015 - An Overview of the Goals, Tasks, Data, Evaluation Mechanisms and Metrics , 2011, TRECVID.

[2]  Ju Liu,et al.  Robust video hashing based on representative-dispersive frames , 2012, Science China Information Sciences.

[3]  Paul Over,et al.  Content-Based Video Copy Detection Benchmarking at TRECVID , 2014, ACM Trans. Inf. Syst..

[4]  Karol Visinko,et al.  Alice´s Adventures in Wonderland by Lewis Carroll , 2001 .

[5]  Yong Man Ro,et al.  Adaptive weighted fusion with new spatial and temporal fingerprints for improved video copy detection , 2014, Signal Process. Image Commun..

[6]  David J. Robertson,et al.  Face Averages Enhance User Recognition for Smartphone Security , 2015, PloS one.

[7]  Claudia Feregrino Uribe,et al.  A robust audio fingerprinting method using spectrograms saliency maps , 2014, The 9th International Conference for Internet Technology and Secured Transactions (ICITST-2014).

[8]  Pietro Perona,et al.  Graph-Based Visual Saliency , 2006, NIPS.

[9]  Cordelia Schmid,et al.  Actions in context , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Krystian Mikolajczyk,et al.  Evaluation of local detectors and descriptors for fast feature matching , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[11]  Pedro Cano,et al.  A Review of Audio Fingerprinting , 2005, J. VLSI Signal Process..

[12]  Benjamin Bustos,et al.  Competitive content-based video copy detection using global descriptors , 2011, Multimedia Tools and Applications.

[13]  John Langford,et al.  Cover trees for nearest neighbor , 2006, ICML.

[14]  Yongdong Zhang,et al.  A video copy detection algorithm combining local feature's robustness and global feature's speed , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[15]  Bo Li,et al.  Robust video identification approach based on local non-negative matrix factorization , 2015 .

[16]  C.-C. Jay Kuo,et al.  Current Developments and Future Trends in Audio Authentication , 2012, IEEE MultiMedia.

[17]  Gary R. Bradski,et al.  ORB: An efficient alternative to SIFT or SURF , 2011, 2011 International Conference on Computer Vision.

[18]  Sen Zhang,et al.  Suffix Array Construction in External Memory Using D-Critical Substrings , 2014, TOIS.

[19]  Jiemi Zhang,et al.  A content-based video copy detection method with randomly projected binary features , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[20]  Qingwei Gao,et al.  Efficient video copy detection using multi-modality and dynamic path search , 2014, Multimedia Systems.

[21]  Cordelia Schmid,et al.  Actions in context , 2009, CVPR.

[22]  Ju Liu,et al.  Visual Attention Based Temporally Weighting Method for Video Hashing , 2013, IEEE Signal Processing Letters.

[23]  Yanqiang Lei,et al.  Video Sequence Matching Based on the Invariance of Color Correlation , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[24]  Tom Drummond,et al.  Fusing points and lines for high performance tracking , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[25]  Cordelia Schmid,et al.  Evaluation of GIST descriptors for web-scale image search , 2009, CIVR '09.

[26]  Luntian Mou,et al.  A multimodal video copy detection approach with sequential pyramid matching , 2011, 2011 18th IEEE International Conference on Image Processing.

[27]  Husrev T. Sencar,et al.  Content-Based Video Copy Detection - A Survey , 2010, Intelligent Multimedia Analysis for Security Applications.

[28]  Laurent Amsaleg,et al.  Locality sensitive hashing: A comparison of hash function types and querying mechanisms , 2010, Pattern Recognit. Lett..

[29]  Zhicheng Zhao,et al.  A Multi Modal Content-Based Copy Detection Approach , 2012, 2012 Eighth International Conference on Computational Intelligence and Security.

[30]  Paul Over,et al.  Evaluation campaigns and TRECVid , 2006, MIR '06.

[31]  B. Rossion,et al.  Defining face perception areas in the human brain: A large-scale factorial fMRI face localizer analysis , 2012, Brain and Cognition.

[32]  A. B.,et al.  SPEECH COMMUNICATION , 2001 .

[33]  Rabab Kreidieh Ward,et al.  A Robust and Fast Video Copy Detection System Using Content-Based Fingerprinting , 2011, IEEE Transactions on Information Forensics and Security.

[34]  Federica Battisti,et al.  A study on the effects of quality of service parameters on perceived video quality , 2014, 2014 5th European Workshop on Visual Information Processing (EUVIP).

[35]  L. Carroll Character Education Values on Alice's Characterization in The Alice's Adventures in Wonderland by Lewis Carroll , 2019, International Journal of Language and Literature.

[36]  Zhe Wang,et al.  Multi-Probe LSH: Efficient Indexing for High-Dimensional Similarity Search , 2007, VLDB.

[37]  Yusuke Uchida,et al.  Fast and accurate content-based video copy detection using bag-of-global visual features , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[38]  Vincent Lepetit,et al.  BRIEF: Binary Robust Independent Elementary Features , 2010, ECCV.

[39]  Sanket Shinde,et al.  Recent advances in content based video copy detection , 2015, 2015 International Conference on Pervasive Computing (ICPC).

[40]  Charalampos Dimoulas,et al.  Syncing Shared Multimedia through Audiovisual Bimodal Segmentation , 2015, IEEE MultiMedia.