A framework for estimating geometric distortions in video copies based on visual-audio fingerprints

Spatio-temporal alignments and estimation of distortion model between pirate and master video contents are prerequisites, in order to approximate the illegal capture location in a theater. State-of-the-art techniques are exploiting only visual features of videos for the alignment and distortion model estimation of watermarked sequences, while few efforts are made toward acoustic features and non-watermarked video contents. To solve this, we propose a distortion model estimation framework based on multimodal signatures, which fully integrates several components: Compact representation of a video using visual-audio fingerprints derived from Speeded Up Robust Features and Mel-Frequency Cepstral Coefficients; Segmentation-based bipartite matching scheme to obtain accurate temporal alignments; Stable frame pairs extraction followed by filtering policies to achieve geometric alignments; and distortion model estimation in terms of homographic matrix. Experiments on camcorded datasets demonstrate the promising results of the proposed framework compared to the reference methods.

[1]  Heung-Kyu Lee,et al.  Digital Cinema Watermarking for Estimating the Position of the Pirate , 2010, IEEE Transactions on Multimedia.

[2]  Sang Uk Lee,et al.  Video frame-matching algorithm using dynamic programming , 2009, J. Electronic Imaging.

[3]  Bertrand Chupeau,et al.  Adaptive video fingerprints for accurate temporal registration , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[4]  Hui Cheng Temporal registration of video sequences , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[5]  G. Chartrand Introductory Graph Theory , 1984 .

[6]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[7]  Michel Barlaud,et al.  Compensation of geometrical deformations for watermark extraction in digital cinema application , 2001, IS&T/SPIE Electronic Imaging.

[8]  Bertrand Chupeau,et al.  Temporal Video Registration for Watermark Detection , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[9]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[10]  Ning Chen,et al.  A robust hashing algorithm based on SURF for video copy detection , 2012, Comput. Secur..

[11]  Bertrand Chupeau,et al.  Automatic estimation and compensation of geometric distortions in video copies , 2007, Electronic Imaging.

[12]  Benoit M. Macq,et al.  Temporal alignment of video sequences for watermarking systems , 2003, IS&T/SPIE Electronic Imaging.

[13]  G. Ram Mohana Reddy,et al.  A Novel Approach to Video Copy Detection Using Audio Fingerprints and PCA , 2011, ANT/MobiWIS.

[14]  H. Kuhn The Hungarian method for the assignment problem , 1955 .

[15]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[16]  Fred Stentiford,et al.  Video sequence matching based on temporal ordinal measurement , 2008, Pattern Recognit. Lett..

[17]  R. Roopalakshmi,et al.  A novel spatio-temporal registration framework for video copy localization based on multimodal features , 2013, Signal Process..