A novel framework for CBCD using integrated color and acoustic features

Most studies in content-based video copy detection (CBCD) concentrate on visual signatures, while only very few efforts are made to exploit audio features. The audio data, if present, is an essential source of a video; hence, the integration of visual-acoustic fingerprints significantly improves the copy detection performance. Based on this aspect, we propose a new framework, which jointly employs color-based visual features and audio fingerprints for detecting the duplicate videos. The proposed framework incorporates three stages: First, a novel visual fingerprint based on spatio-temporal dominant color features is generated; Second, mel-frequency cepstral coefficients are extracted and compactly represented as acoustic signatures; Third, the resultant multimodal signatures are jointly used for the CBCD task, by employing combination rule and weighting strategies. The results of experiments on TRECVID 2008 and 2009 datasets, demonstrate the improved efficiency of the proposed framework compared to the reference methods against a wide range of video transformations.

[1]  G. Ram Mohana Reddy,et al.  A framework for estimating geometric distortions in video copies based on visual-audio fingerprints , 2013, Signal, Image and Video Processing.

[2]  Zhu Liu,et al.  Multimedia content analysis-using both audio and visual clues , 2000, IEEE Signal Process. Mag..

[3]  John S. Boreczky,et al.  A hidden Markov model framework for video segmentation using audio and image features , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[4]  Xian-Sheng Hua,et al.  Robust video signature based on ordinal measure , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[5]  Kazuyo Tanaka,et al.  Time-space acoustical feature for fast video copy detection , 2010, 2010 IEEE International Workshop on Multimedia Signal Processing.

[6]  Chu-Song Chen,et al.  A Framework for Handling Spatiotemporal Variations in Video Copy Detection , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Craig Gotsman,et al.  Dynamic Color Quantization of Video Sequences , 1995, IEEE Trans. Vis. Comput. Graph..

[8]  Özgür Ulusoy,et al.  Video copy detection using multiple visual cues and MPEG-7 descriptors , 2010, J. Vis. Commun. Image Represent..

[9]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[10]  G. Ram Mohana Reddy,et al.  A Novel Approach to Video Copy Detection Using Audio Fingerprints and PCA , 2011, ANT/MobiWIS.

[11]  Ning Chen,et al.  Audio hash function based on non-negative matrix factorisation of mel-frequency cepstral coefficients , 2011, IET Inf. Secur..

[12]  Tae Hong Park Introduction to digital signal processing - Computer Musically Speaking , 2009 .

[13]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[14]  Nasir D. Memon,et al.  Perceptual Audio Hashing Functions , 2005, EURASIP J. Adv. Signal Process..

[15]  Christopher Hunt,et al.  Notes on the OpenSURF Library , 2009 .

[16]  Hsin-Min Wang,et al.  Time-Series Linear Search for Video Copies Based on Compact Signature Manipulation and Containment Relation Modeling , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  Gary Marchionini,et al.  Open video: A framework for a test collection , 2000, J. Netw. Comput. Appl..

[18]  G. Ram Mohana Reddy,et al.  Efficient Video Copy Detection Using Simple and Effective Extraction of Color Features , 2011, ACC.

[19]  R. Roopalakshmi,et al.  A novel spatio-temporal registration framework for video copy localization based on multimodal features , 2013, Signal Process..

[20]  B. S. Manjunath,et al.  An efficient color representation for image retrieval , 2001, IEEE Trans. Image Process..

[21]  A. Aydin Alatan,et al.  Content Based Copy Detection with Coarse Audio-Visual Fingerprints , 2009, 2009 Seventh International Workshop on Content-Based Multimedia Indexing.

[22]  B. S. Manjunath,et al.  Efficient and Robust Detection of Duplicate Videos in a Large Database , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[23]  Yao Zhao,et al.  Frame Fusion for Video Copy Detection , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[24]  Hsin-Min Wang,et al.  Fast min-hashing indexing and robust spatio-temporal matching for detecting video copies , 2010, TOMCCAP.

[25]  Nuria Oliver,et al.  Multimodal video copy detection applied to social media , 2009, WSM '09.

[26]  Vasudev Bhaskaran,et al.  Spatiotemporal sequence matching for efficient video copy detection , 2005, IEEE Trans. Circuits Syst. Video Technol..

[27]  T. Kashiwagi,et al.  Introduction of frequency image and applications , 2007, SICE Annual Conference 2007.

[28]  B. S. Manjunath,et al.  Introduction to MPEG-7: Multimedia Content Description Interface , 2002 .

[29]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[30]  Wei-Han Chang,et al.  A fast MPEG-7 dominant color extraction with new similarity measure for image retrieval , 2008, J. Vis. Commun. Image Represent..