Effect of Video Transcoding Parameters on Visual Object Tracking for Surveillance Systems

In any video surveillance system, it is very important to provide effective remote viewing to heterogeneous users with various network conditions and viewing device. To meet this adaptability requirement, the video transcoding process is inevitable, which consists of converting a video from one compressed format to another. However, since the transcoding operation is a lossy process, this can effect the performance of video analysis techniques such as visual object tracking. Consequently, in this paper, we evaluate the impact of video transcoding parameters on the performance of visual object tracking algorithms. To address this, first, a large-scale transcoding surveillance video (TSV) dataset is constructed. Then, we design a framework for assessing the performance of trackers. Finally, we evaluate six state-of-the-art trackers on the TSV dataset and analysis their performance with regard of transcoding parameters. Experimental results show that the video transcoding can negatively affect the performance of visual object tracking1.

[1]  Wu-chi Feng,et al.  Understanding the Impact of Compression on Feature Detection and Matching in Computer Vision , 2016, 2016 IEEE International Symposium on Multimedia (ISM).

[2]  Erik Blasch,et al.  Encoding color information for visual tracking: Algorithms and benchmark , 2015, IEEE Transactions on Image Processing.

[3]  Zhenyu He,et al.  The Seventh Visual Object Tracking VOT2019 Challenge Results , 2019, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW).

[4]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[5]  Jianke Zhu,et al.  A Scale Adaptive Kernel Correlation Filter Tracker with Feature Integration , 2014, ECCV Workshops.

[6]  Michael Felsberg,et al.  ECO: Efficient Convolution Operators for Tracking , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Bernard Ghanem,et al.  A Benchmark and Simulator for UAV Tracking , 2016, ECCV.

[8]  M. Shamim Hossain,et al.  Remote display solution for video surveillance in multimedia cloud , 2015, Multimedia Tools and Applications.

[9]  Michail-Alexandros Kourtis,et al.  The Impact of Video Transcoding Parameters on Event Detection for Surveillance Systems , 2013, 2013 IEEE International Symposium on Multimedia.

[10]  Ming-Hsuan Yang,et al.  Object Tracking Benchmark , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Simon Lucey,et al.  Need for Speed: A Benchmark for Higher Frame Rate Object Tracking , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[12]  J. Lilius,et al.  Stream-Based Admission Control and Scheduling for Video Transcoding in Cloud Computing , 2013, 2013 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing.

[13]  M. Anwar Hossain,et al.  Framework for a Cloud-Based Multimedia Surveillance System , 2014, Int. J. Distributed Sens. Networks.

[14]  Anthony Vetro,et al.  Video transcoding architectures and techniques: an overview , 2003, IEEE Signal Process. Mag..

[15]  M. Shamim Hossain,et al.  QoS-adaptive service configuration framework for cloud-assisted video surveillance systems , 2016, Multimedia Tools and Applications.

[16]  Rui Dai,et al.  Object-Detection-Based Video Compression for Wireless Surveillance Systems , 2017, IEEE MultiMedia.

[17]  Bernard Ghanem,et al.  TrackingNet: A Large-Scale Dataset and Benchmark for Object Tracking in the Wild , 2018, ECCV.

[18]  Qing Guo,et al.  Effects of Blur and Deblurring to Visual Object Tracking , 2019, ArXiv.

[19]  Yi Wu,et al.  Online Object Tracking: A Benchmark , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Simon Lucey,et al.  Learning Background-Aware Correlation Filters for Visual Tracking , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[21]  Luca Bertinetto,et al.  Staple: Complementary Learners for Real-Time Tracking , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  Luca Bertinetto,et al.  End-to-End Representation Learning for Correlation Filter Based Tracking , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Francesca De Simone,et al.  Impact of video transcoding artifacts on the subjective quality , 2010, 2010 Second International Workshop on Quality of Multimedia Experience (QoMEX).

[24]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[25]  Azeddine Beghdadi,et al.  Towards the design of smart video-surveillance system , 2018, 2018 NASA/ESA Conference on Adaptive Hardware and Systems (AHS).

[26]  Fan Yang,et al.  LaSOT: A High-Quality Benchmark for Large-Scale Single Object Tracking , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Feng Li,et al.  Learning Spatial-Temporal Regularized Correlation Filters for Visual Tracking , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.