A Compressed-domain Robust Descriptor for Near Duplicate Video Copy Detection

This paper introduces a global descriptor from the compressed video domain (H.264) for near duplicate video copy detection tasks. The proposed descriptor uses a spatial-temporal feature structure in an ordinal pattern distribution format. The proposed descriptor is constructed from Intra-Prediction Modes (IPM) of key frames (IDR & I slices) and extracted from the compressed video files, using the MPEG4/AVC (H.264) codec. Intra-prediction is the compression technique used in the key frames of the H.264 codec. As the proposed feature describes pictures globally, this research compares the feature with the two other well-known global image descriptors, ordinal intensity/colour Histograms and ordinal Auto-correlograms, as baselines. Our experiments show how the proposed feature outperforms the baseline features in non-geometric transformations T3, T4 and T5 in effectiveness as well as efficiency. It is due to better representation of the image content and smaller feature vector size. The core competency of the proposed feature is in non-linear brightness and contrast changes (Gamma expansion and compression) in which the intensity/colour Histograms and Auto-correlograms are deficient.

[1]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[2]  Qi Tian,et al.  Fast and robust short video clip search using an index structure , 2004, MIR '04.

[3]  Gang Hua,et al.  IBM Research TRECVID-2010 Video Copy Detection and Multimedia Event Detection System , 2010, TRECVID.

[4]  Mohammed Ghanbari,et al.  Compressed domain texture based visual information retrieval method for I-frame coded pictures , 2010, IEEE Transactions on Consumer Electronics.

[5]  Gilles Boulianne,et al.  CRIM AT TRECVID-2011: Content-Based Copy Detection using Nearest-Neighbor Mapping , 2011, TRECVID.

[6]  Gary J. Sullivan,et al.  Rate-distortion optimization for video compression , 1998, IEEE Signal Process. Mag..

[7]  Mubarak Shah,et al.  University of Central Florida at TRECVID 2008 Content Based Copy Detection and Surveillance Event Detection , 2008, TRECVID.

[8]  William T. Freeman,et al.  Orientation Histograms for Hand Gesture Recognition , 1995 .

[9]  Shree K. Nayar,et al.  Ordinal Measures for Image Correspondence , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Liang-Gee Chen,et al.  Analysis, fast algorithm, and VLSI architecture design for H.264/AVC intra frame coder , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Paul Over,et al.  Evaluation campaigns and TRECVid , 2006, MIR '06.

[12]  Mohammad Shahram Moin,et al.  A New Content-Based Image Retrieval Approach Based on Pattern Orientation Histogram , 2007, MIRAGE.

[13]  Georges Quénot,et al.  TRECVID 2015 - An Overview of the Goals, Tasks, Data, Evaluation Mechanisms and Metrics , 2011, TRECVID.

[14]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).