vdFP - Video fingerprinting technologies for media and security applications - D1 : Report on existing technologies

Video fingerprinting is a proven and commercially available technique that can be used to detect copies of video material. It compares digital videos with one another and typically has to cope with variations such as different video codecs, changed resolution, an inserted logo, and mismatched frame rates. Developments in this area have been driven mainly by the media industry, which has an interest in countering the distribution of illegal copies of video material. There are, however, other applications, each with its own characteristics. In this English-language report we give an overview of existing video fingerprinting techniques. In addition to video fingerprinting, we also look at a number of related techniques, such as audio fingerprinting and object recognition in videos. For each technology we describe the state of the art and survey existing commercial solutions.
