The Visual Object Tracking VOT 2013 challenge results

Visual tracking has attracted a significant attention in the last few decades. The recent surge in the number of publications on tracking-related problems have made it almost impossible to follow the developments in the field. One aUniversity of Ljubljana, Slovenia bAustrian Institute of Technology, Austria cUniversity of Birmingham, United Kingdom dCzech Technical University in Prague, Czech Republic eAustralian National University fDSTO, Edinburgh, SA, Australia gSharif University of Technology, Tehran, Iran hCenter for Informatics Science, Nile University, Giza, Egypt iParthenope University of Naples, Italy jUniversity of South Australia, Mawson Lakes, SA, Australia kVision and Sensing, ESTeM, University of Canberra, Australia lPanasonic R&D Center, Singapore mCISP, University of Malaya, Malaysia nEng. Design and Math., University of West England, United Kingdom oIzmir Institute of Technology, Turkey pCollege of Computer Science, Zhejiang University, China qNLPR, Institute of Automation, CAS, Beijing, China rChinese Academy of Sciences, China sUniversity of Surrey, United Kingdom tLinköping University, Sweden uRobotic Vision Team, Kingston University, United Kingdom vIHCC, CECS, Australian National University, Australia wNII, JFLI, Hitotsubashi, Japan xNII, Hitotsubashi, Japan of the reasons is that there is a lack of commonly accepted annotated data-sets and standardized evaluation protocols that would allow objective comparison of different tracking methods. To address this issue, the Visual Object Tracking (VOT) workshop was organized in conjunction with ICCV2013. Researchers from academia as well as industry were invited to participate in the first VOT2013 challenge which aimed at single-object visual trackers that do not apply pre-learned models of object appearance (modelfree). Presented here is the VOT2013 benchmark dataset for evaluation of single-object visual trackers as well as the results obtained by the trackers competing in the challenge. In contrast to related attempts in tracker benchmarking, the dataset is labeled per-frame by visual attributes that indicate occlusion, illumination change, motion change, size change and camera motion, offering a more systematic comparison of the trackers. Furthermore, we have designed an automated system for performing and evaluating the experiments. We present the evaluation protocol of the VOT2013 challenge and the results of a comparison of 27 trackers on the benchmark dataset. The dataset, the evaluation tools and the tracker rankings are publicly available from the challenge website1. 1http://votchallenge.net

[1]  Dariu Gavrila,et al.  The Visual Analysis of Human Movement: A Survey , 1999, Comput. Vis. Image Underst..

[2]  David S. Doermann,et al.  Tools and techniques for video performance evaluation , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[3]  Hyeonjoon Moon,et al.  The FERET Evaluation Methodology for Face-Recognition Algorithms , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Thomas B. Moeslund,et al.  A Survey of Computer Vision-Based Human Motion Capture , 2001, Comput. Vis. Image Underst..

[5]  Christopher O. Jaynes,et al.  An Open Development Environment for Evaluation of Video Surveillance Systems , 2002 .

[6]  Dorin Comaniciu,et al.  Kernel-Based Object Tracking , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Jacques Verly,et al.  The State of the Art in Multiple Object Tracking Under Occlusion in Video Sequences , 2003 .

[8]  Tieniu Tan,et al.  A survey on visual surveillance of object motion and behaviors , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[9]  J.M. Ferryman,et al.  PETS Metrics: On-Line Performance Evaluation Service , 2005, 2005 IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance.

[10]  J. Pers,et al.  Multiple interacting targets tracking with application to team sports , 2005, ISPA 2005. Proceedings of the 4th International Symposium on Image and Signal Processing and Analysis, 2005..

[11]  Robert T. Collins,et al.  An Open Source Tracking Testbed and Evaluation Web Site , 2005 .

[12]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[13]  Adrian Hilton,et al.  A survey of advances in vision-based human motion capture and analysis , 2006, Comput. Vis. Image Underst..

[14]  M. Shah,et al.  Object tracking: A survey , 2006, CSUR.

[15]  Ann B. Lee,et al.  Treelets--An adaptive multi-scale basis for sparse unordered data , 2007, 0707.0481.

[16]  Ming-Hsuan Yang,et al.  Incremental Learning for Robust Visual Tracking , 2008, International Journal of Computer Vision.

[17]  Richard Szeliski,et al.  A Database and Evaluation Methodology for Optical Flow , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[18]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[19]  Jing Zhang,et al.  Framework for Performance Evaluation of Face, Text, and Vehicle Detection and Tracking in Video: Data, Metrics, and Protocol , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Junseok Kwon,et al.  Tracking of a non-rigid object via patch-based dynamic appearance modeling and adaptive Basin Hopping Monte Carlo sampling , 2009, CVPR.

[21]  Rama Chellappa,et al.  Online Empirical Evaluation of Tracking Algorithms , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Ales Leonardis,et al.  A Two-Stage Dynamic Model for Visual Tracking , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[23]  Jiri Matas,et al.  P-N learning: Bootstrapping binary classifiers by structural constraints , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[24]  Sebastien C. Wong,et al.  Combining online feature selection with adaptive shape estimation , 2010, 2010 25th International Conference of Image and Vision Computing New Zealand.

[25]  Alexei A. Efros,et al.  Unbiased look at dataset bias , 2011, CVPR 2011.

[26]  Chunhua Shen,et al.  Real-time visual tracking using compressive sensing , 2011, CVPR 2011.

[27]  Ming-Hsuan Yang,et al.  Robust Object Tracking with Online Multiple Instance Learning , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Horst Bischof,et al.  Hough-based tracking of non-rigid objects , 2011, 2011 International Conference on Computer Vision.

[29]  Philip H. S. Torr,et al.  Struck: Structured output tracking with kernels , 2011, 2011 International Conference on Computer Vision.

[30]  Jiri Matas,et al.  Robustifying the Flock of Trackers , 2011 .

[31]  Yuichi Matsumoto,et al.  Shrink boost for selecting multi-LBP histogram features in object detection , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[32]  Bin Shen,et al.  Online robust image alignment via iterative convex optimization , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[33]  Laura Sevilla-Lara,et al.  Distribution fields for tracking , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[34]  Chun Chen,et al.  A Convolutional Treelets Binary Feature Approach to Fast Keypoint Recognition , 2012, ECCV.

[35]  Fatih Murat Porikli,et al.  Changedetection.net: A new change detection benchmark dataset , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[36]  Lei Zhang,et al.  Real-Time Compressive Tracking , 2012, ECCV.

[37]  周鑫,et al.  Tracking-learning-detection (TLD)-based video object tracking method , 2012 .

[38]  Huchuan Lu,et al.  Least Soft-Threshold Squares Tracking , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[39]  Dorothy Ndedi Monekosso,et al.  SwATrack: A Swarm Intelligence-based Abrupt Motion Tracker , 2013, MVA.

[40]  Zhongfei Zhang,et al.  A survey of appearance models in visual object tracking , 2013, ACM Trans. Intell. Syst. Technol..

[41]  Mohamed ElHelw,et al.  Robust Real-Time Tracking with Diverse Ensembles and Random Projections , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[42]  Kaiqi Huang,et al.  An Adaptive Combination of Multiple Features for Robust Tracking in Real Scene , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[43]  Alfredo Petrosino,et al.  MATRIOSKA: A Multi-level Approach to Fast Tracking by Learning , 2013, ICIAP.

[44]  Haibin Ling,et al.  Finding the Best from the Second Bests - Inhibiting Subjective Bias in Evaluation of Visual Tracking Algorithms , 2013, 2013 IEEE International Conference on Computer Vision.

[45]  Jiri Matas,et al.  Long-Term Tracking through Failure Cases , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[46]  Michael Felsberg,et al.  Enhanced Distribution Field Tracking Using Channel Representations , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[47]  Yi Wu,et al.  Online Object Tracking: A Benchmark , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[48]  Ales Leonardis,et al.  Robust Visual Tracking Using an Adaptive Coupled-Layer Visual Model , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[49]  Andrea Cavallaro,et al.  A Protocol for Evaluating Video Trackers Under Real-World Conditions , 2013, IEEE Transactions on Image Processing.

[50]  Ales Leonardis,et al.  An Enhanced Adaptive Coupled-Layer LGTracker++ , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[51]  Xiaoqin Zhang,et al.  Graph Embedding Based Semi-supervised Discriminative Tracker , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[52]  Ales Leonardis,et al.  Is my new tracker really better than yours? , 2014, IEEE Winter Conference on Applications of Computer Vision.

[53]  Hamid R. Rabiee,et al.  Patchwise Joint Sparse Tracking With Occlusion Detection , 2014, IEEE Transactions on Image Processing.