Fusing audio-visual fingerprint to detect TV commercial advertisement

Sixty-four percent of consumers believe that television advertising still has the greatest impact on them. Accurate, real-time identification of TV advertisements is therefore a valuable application for government and advertisement providers. Because a multi-modal method takes full account of both video and audio information, this paper handles composite fingerprinting in a unified framework for advertisement identification. The video fingerprint is produced by the Improved Harris Combining Motion feature, which is based on the differences between adjacent video frames. The audio fingerprint is produced by the proposed FIR-filter-based Fast Audio Fingerprint, which extracts the differences between equivalent bands of adjacent frames. The multi-modal framework then combines the audio and video fingerprints in a weighted manner. Experimental results show that, compared with current methods, both the audio and the video fingerprints offer higher discrimination, stronger robustness, and lower time complexity. Moreover, the multi-modal fingerprint improves on the performance of each single-modality fingerprint.
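
The abstract does not give formulas, so the following is only a minimal sketch of the two ideas it names: a fingerprint built from differences between equivalent bands of adjacent frames, and a weighted combination of the audio and video results. It assumes that each band difference is reduced to one bit by its sign, that fingerprints are compared by bit error rate, and that fusion is a convex combination of the two distances. The function names, the `audio_weight` parameter, and the sample values are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def band_difference_bits(band_energies: Sequence[Sequence[float]]) -> List[List[int]]:
    """Turn per-frame band energies into bits (assumed scheme, not the paper's).

    For each pair of adjacent frames, a band yields 1 if its energy
    increased from the earlier frame to the later one, otherwise 0.
    """
    bits = []
    for prev, curr in zip(band_energies, band_energies[1:]):
        bits.append([1 if c - p > 0 else 0 for p, c in zip(prev, curr)])
    return bits


def bit_error_rate(a: Sequence[Sequence[int]], b: Sequence[Sequence[int]]) -> float:
    """Fraction of mismatching bits between two equally sized fingerprints."""
    flat_a = [bit for row in a for bit in row]
    flat_b = [bit for row in b for bit in row]
    if not flat_a or len(flat_a) != len(flat_b):
        raise ValueError("fingerprints must be non-empty and the same size")
    mismatches = sum(x != y for x, y in zip(flat_a, flat_b))
    return mismatches / len(flat_a)


def fused_distance(audio_dist: float, video_dist: float, audio_weight: float = 0.5) -> float:
    """Weighted fusion (assumed to be a convex combination of the two distances)."""
    if not 0.0 <= audio_weight <= 1.0:
        raise ValueError("audio_weight must lie in [0, 1]")
    return audio_weight * audio_dist + (1.0 - audio_weight) * video_dist


# Hypothetical band energies: 3 frames x 3 bands for a reference and a noisy query.
reference = [[1.0, 4.0, 2.0], [2.0, 3.0, 2.5], [1.5, 3.5, 2.0]]
query = [[1.1, 3.9, 2.0], [2.1, 3.1, 2.6], [1.4, 3.6, 2.2]]

audio_dist = bit_error_rate(band_difference_bits(reference), band_difference_bits(query))
video_dist = 0.2  # placeholder for a distance produced by some video fingerprint
score = fused_distance(audio_dist, video_dist, audio_weight=0.5)
```

In a sketch like this, a query would be declared a match when `score` falls below a threshold chosen on validation data; the weight and threshold would both need tuning and are not specified in the abstract.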
