SIFT feature-preserving bit allocation for H.264/AVC video compression

Compression artifacts in low-quality videos strongly influence the performance of feature matching algorithms. In order to achieve reasonable feature matching performance even for low bit rate video, we propose to allocate the bit budget during compression such that the important features are preserved. Specifically, we present two bit allocation approaches to preserve the strongest SIFT features for H.264 encoded videos. For both approaches, we first categorize the Macroblocks in a Group of Pictures into several groups according to the scale specific characteristics of SIFT features. In our first approach a novel R-D model based on the matching score is applied to allocate the bit budget to these groups. In our second approach, in order to reduce the computational complexity, we analyze the detector characteristics of correctly matched pairs and propose a R-D optimization method based on the repeatability metric. Our experiments show that both approaches achieve better feature preservation when compared to standard video encoding which is optimized for maximum picture quality. The proposed approaches are fully standard compatible and the encoded videos can be decoded by any H.264 decoder.

[1]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[2]  Cordelia Schmid,et al.  Evaluation of Interest Point Detectors , 2000, International Journal of Computer Vision.

[3]  Joint Video Team Draft ITU-T Recommendation and Final draft international standard of joint video specification , 2003 .

[4]  David G. Lowe,et al.  Fast Approximate Nearest Neighbors with Automatic Algorithm Configuration , 2009, VISAPP.

[5]  Cordelia Schmid,et al.  A Comparison of Affine Region Detectors , 2005, International Journal of Computer Vision.

[6]  Bernd Girod,et al.  Compression of image patches for local feature extraction , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[7]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[8]  Wei Tsang Ooi,et al.  Critical video quality for distributed automated video surveillance , 2005, MULTIMEDIA '05.

[9]  Eckehard G. Steinbach,et al.  Preserving SIFT features in JPEG-encoded images , 2011, 2011 18th IEEE International Conference on Image Processing.