Performance Evaluation of Visual Tracking Algorithms on Video Sequences With Quality Degradation

Recently, there are lots of visual tracking algorithms proposed to improve the performance of object tracking in video sequences with various real conditions, such as severe occlusion, complicated background, fast motion, and so on. In real visual tracking systems, there are various quality degradation occurring during video acquisition, transmission, and processing. However, most existing studies focus on improving the accuracy of visual tracking while ignoring the performance of tracking algorithms on video sequences with certain quality degradation. In this paper, we investigate the performance evaluation of existing visual tracking algorithms on video sequences with quality degradation. A quality-degraded video database for visual tracking (QDVD-VT), including the reference video sequences and their corresponding distorted versions, is constructed as the benchmarking for robustness analysis of visual tracking algorithms. Based on the constructed QDVD-VT, we propose a method for robustness measurement of visual tracking (RMVT) algorithms by accuracy rate and performance stability. The performance of ten existing visual tracking algorithms is evaluated by the proposed RMVT based on the built QDVD-VT. We provide the detailed analysis and discussion on the robustness analysis of different visual tracking algorithms on video sequences with quality degradation from different distortion types. To visualize the robustness of visual tracking algorithms well, we design a robustness pentagon to show the accuracy rate and performance stability of visual tracking algorithms. Our initial investigation shows that it is still challenging for effective object tracking for existing visual tracking algorithms on video sequences with quality degradation. There is much room for the performance improvement of existing tracking algorithms on video sequences with quality degradation in real applications.

[1]  Wei Tsang Ooi,et al.  Video quality for face detection, recognition, and tracking , 2011, TOMCCAP.

[2]  Olga Sorkine-Hornung,et al.  A comparative study of image retargeting , 2010, ACM Trans. Graph..

[3]  Weisi Lin,et al.  Perceptual Quality Assessment of Screen Content Images , 2015, IEEE Transactions on Image Processing.

[4]  Weisi Lin,et al.  Perceptual visual quality metrics: A survey , 2011, J. Vis. Commun. Image Represent..

[5]  Kai Zeng,et al.  Objective Quality Assessment for Image Retargeting Based on Structural Similarity , 2014, IEEE Journal on Emerging and Selected Topics in Circuits and Systems.

[6]  Zhou Wang,et al.  Modern Image Quality Assessment , 2006, Modern Image Quality Assessment.

[7]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[8]  Yi Wu,et al.  Online Object Tracking: A Benchmark , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Jiri Matas,et al.  P-N learning: Bootstrapping binary classifiers by structural constraints , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[10]  Guangming Shi,et al.  Perceptual Quality Metric With Internal Generative Mechanism , 2013, IEEE Transactions on Image Processing.

[11]  Weisi Lin,et al.  A benchmark for robustness analysis of visual tracking algorithms , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[12]  Horst Bischof,et al.  PROST: Parallel robust online simple tracking , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[13]  Simone Calderara,et al.  Visual Tracking: An Experimental Survey , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Zhou Wang,et al.  Subjective quality assessment of Screen Content Images , 2014, 2014 Sixth International Workshop on Quality of Multimedia Experience (QoMEX).

[15]  Kai Zeng,et al.  Perceptual Quality Assessment for Multi-Exposure Image Fusion , 2015, IEEE Transactions on Image Processing.

[16]  Ariel Shamir,et al.  A comparative study of image retargeting , 2010, SIGGRAPH 2010.

[17]  Zhou Wang,et al.  Objective Quality Assessment of Tone-Mapped Images , 2013, IEEE Transactions on Image Processing.

[18]  Michail-Alexandros Kourtis,et al.  The Impact of Video Transcoding Parameters on Event Detection for Surveillance Systems , 2013, 2013 IEEE International Symposium on Multimedia.

[19]  Weisi Lin,et al.  Perceptual Full-Reference Quality Assessment of Stereoscopic Images by Considering Binocular Visual Characteristics , 2013, IEEE Transactions on Image Processing.

[20]  Alan C. Bovik,et al.  Image information and visual quality , 2006, IEEE Trans. Image Process..

[21]  Huchuan Lu,et al.  Robust object tracking via sparsity-based collaborative model , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[22]  Haibin Ling,et al.  Real time robust L1 tracker using accelerated proximal gradient approach , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[23]  Junzhou Huang,et al.  Robust tracking using local sparse appearance model and K-selection , 2011, CVPR 2011.

[24]  Weisi Lin,et al.  Image Quality Assessment Based on Gradient Similarity , 2012, IEEE Transactions on Image Processing.

[25]  Wenjun Zhang,et al.  Subjective and objective quality assessment for images with contrast change , 2013, 2013 IEEE International Conference on Image Processing.

[26]  YeganehHojatollah,et al.  Objective Quality Assessment of Tone-Mapped Images , 2013 .

[27]  Kai Zeng,et al.  Quality Prediction of Asymmetrically Distorted Stereoscopic 3D Images , 2015, IEEE Transactions on Image Processing.

[28]  Zhou Wang,et al.  No-Reference Quality Assessment of Contrast-Distorted Images Based on Natural Scene Statistics , 2015, IEEE Signal Processing Letters.

[29]  Wenjun Zhang,et al.  Using Free Energy Principle For Blind Image Quality Assessment , 2015, IEEE Transactions on Multimedia.

[30]  Laura Sevilla-Lara,et al.  Distribution fields for tracking , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[31]  Weisi Lin,et al.  No-Reference Image Blur Assessment Based on Discrete Orthogonal Moments , 2016, IEEE Transactions on Cybernetics.

[32]  Huchuan Lu,et al.  Visual tracking via adaptive structural local sparse appearance model , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[33]  D. Chandler Seven Challenges in Image Quality Assessment: Past, Present, and Future Research , 2013 .

[34]  Gérard G. Medioni,et al.  Context tracker: Exploring supporters and distracters in unconstrained environments , 2011, CVPR 2011.

[35]  D. Chandler,et al.  Supplement to “ VSNR : A Visual Signal-to-Noise Ratio for Natural Images Based on Near-Threshold and Suprathreshold Vision ” , 2007 .

[36]  Pietro Perona,et al.  Pedestrian Detection: An Evaluation of the State of the Art , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  Rui Caseiro,et al.  Exploiting the Circulant Structure of Tracking-by-Detection with Kernels , 2012, ECCV.

[38]  Yuming Fang Application-specific visual quality assessment: current status and future trends , 2015, ICIMCS '15.

[39]  Narendra Ahuja,et al.  Robust visual tracking via multi-task sparse learning , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[40]  Weisi Lin,et al.  Image Sharpness Assessment by Sparse Representation , 2016, IEEE Transactions on Multimedia.

[41]  Alan C. Bovik,et al.  A survey on 3D quality of experience and 3D quality assessment , 2013, Electronic Imaging.