A new algorithm for compressing massive region-of-interest location information in videos

Region-of-Interest (ROI) location information in videos is of increasing importance in many applications including user interest analysis and user experience improvement. Although ROI-based coding has been studied by many researchers, most of the works only focus on using the ROI information for improving coding efficiency while the ROI location information itself is seldom coded or transmitted. Though some methods can recover the ROI locations at the decoder by some parameters such as the quantization parameter, they will fail to work when the number of ROIs becomes large or the ROIs become overlapping. In this paper, we propose a new algorithm to compress this massive ROI location information in videos. The proposed algorithm introduces the region position information extracted from the reconstructed frame as the reference to reduce the ROI location data. Furthermore, the temporal correlations among ROIs in neighboring frames are also utilized for compressing the ROI locations. By suitably integrating the extracted region position as well as the temporal correlation, the proposed algorithm can reduce the data by about 20% for videos with 30 ROI regions. Experimental results demonstrate the effectiveness of the proposed algorithm.

[1]  Gary J. Sullivan,et al.  Rate-constrained coder control and comparison of video coding standards , 2003, IEEE Trans. Circuits Syst. Video Technol..

[2]  Radha Poovendran,et al.  Activity Recognition Using a Combination of Category Components and Local Models for Video Surveillance , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  Wei Li,et al.  Region-Based Rate Control for H.264/AVC for Low Bit-Rate Applications , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[5]  Warnakulasuriya Anil Chandana Fernando,et al.  Unequal Error Protection Technique for ROI Based H.264 Video Coding , 2006, 2006 Canadian Conference on Electrical and Computer Engineering.

[6]  Ke Gu,et al.  Robust object tracking with bidirectional corner matching and trajectory smoothness algorithm , 2012, 2012 IEEE 14th International Workshop on Multimedia Signal Processing (MMSP).

[7]  James M. Rehg,et al.  Real-time human detection using contour cues , 2011, 2011 IEEE International Conference on Robotics and Automation.

[8]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[9]  Thomas Sikora,et al.  The MPEG-4 video standard verification model , 1997, IEEE Trans. Circuits Syst. Video Technol..

[10]  M. Brunig,et al.  Face detection and tracking for video coding applications , 2000, Conference Record of the Thirty-Fourth Asilomar Conference on Signals, Systems and Computers (Cat. No.00CH37154).

[11]  Paul A. Viola,et al.  Robust Real-time Object Detection , 2001 .

[12]  Ming-Chieh Chi,et al.  ROI video coding based on H.263+ with robust skin-color detection technique , 2003, IEEE Trans. Consumer Electron..