DCT-Based Videoprinting on Saliency-Consistent Regions for Detecting Video Copies with Text Insertion

Ideal video fingerprinting should be robust to various practical distortions. Conventional fingerprinting mainly copes with natural distortions (brightness change, resolution reduction, etc.), while always gives poor performance in case of text insertion. One alterative way is to apply a weighting scheme based on the probability of text insertion for feature similarity calculation. However, the weights must be learned with labeled samples. In this paper, we propose a method that first addresses valid regions where the saliency values keep consistent between the query and original frames, namely saliency-consistent regions. Other regions, probably the inserted ones, are discarded. Then a DCT-based hamming distance is calculated on those saliency-consistent regions. Besides, the saliency-based distance is also considered and a further weighted linear distance is evaluated. The proposed algorithm is tested on the MPEG-7 video fingerprint dataset, achieving a false rate of 0.7% in case of text insertion and 0.32% in average for other 8 distortions.

[1]  Ton Kalker,et al.  Feature Extraction and a Database Strategy for Video Fingerprinting , 2002, VISUAL.

[2]  Anindya Sarkar,et al.  Video fingerprinting: features for duplicate and similar video detection and query-based video retrieval , 2008, Electronic Imaging.

[3]  Chang Dong Yoo,et al.  Robust video fingerprinting for content-based video identification , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Suh-Yin Lee,et al.  Recent Advances in Visual Information Systems , 2002, Lecture Notes in Computer Science.

[5]  Nasir D. Memon,et al.  Spatio–Temporal Transform Based Video Hashing , 2006, IEEE Transactions on Multimedia.

[6]  Olivier Buisson,et al.  Robust voting algorithm based on labels of behavior for video copy detection , 2006, MM '06.

[7]  Kota Iwamoto,et al.  Image Signature Robust to Caption Superimposition for Video Sequence Identification , 2006, 2006 International Conference on Image Processing.

[8]  Fred Stentiford,et al.  Video sequence matching based on temporal ordinal measurement , 2008, Pattern Recognit. Lett..

[9]  Changick Kim,et al.  Content-based image copy detection , 2003, Signal Process. Image Commun..

[10]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[11]  Rakesh Mohan,et al.  Video sequence matching , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).