The Visual Object Tracking VOT2016 Challenge Results

The Visual Object Tracking challenge VOT2016 aims at comparing short-term single-object visual trackers that do not apply pre-learned models of object appearance. Results of 70 trackers are presented, many of which were published at major computer vision conferences and in journals in recent years. The number of tested state-of-the-art trackers makes VOT2016 the largest and most challenging benchmark on short-term tracking to date. A short description of each participating tracker is provided in the Appendix. VOT2016 goes beyond its predecessors by (i) introducing a new semi-automatic ground truth bounding box annotation methodology and (ii) extending the evaluation system with the no-reset experiment. The dataset, the evaluation kit, and the results are publicly available at the challenge website (http://votchallenge.net).

Zhenyu He | Henry Medeiros | Michael Felsberg | Bohyung Han | Huchuan Lu | Ming Tang | Dit-Yan Yeung | Qingming Huang | Bernard Ghanem | Erkut Erdem | Hongdong Li | Guna Seetharaman | Hyemin Lee | Wolfgang Hübner | Xin Li | Thomas Mauthner | Horst Possegger | Horst Bischof | Tao Hu | Yifan Wang | Martin Danelljan | Gustav Häger | Luca Bertinetto | Andrea Vedaldi | Yiannis Demiris | Zejian Yuan | Fei Zhao | Junliang Xing | Philip H. S. Torr | Matej Kristan | Noor M. Al-Shakarji | Kannappan Palaniappan | Honggang Qi | Krystian Mikolajczyk | Weiming Hu | Junyu Gao | Siwei Lyu | Vincenzo Santopietro | Abhinav Gupta | Jianke Zhu | Zhan Xu | Lijun Wang | Jochen Lang | Simone Melzi | Changsheng Xu | João F. Henriques | Longyin Wen | Dawei Du | Rustam Stolkin | Tony Pridmore | Aleš Leonardis | Yuankai Qi | Jack Valmadre | Xiaomeng Wang | Filiz Bunyak | Shizeng Yao | Aydin Alatan | Rengarajan Pelapur | Nana Fan | Michael Arens | Fatih Porikli | Payman Moallem | Wenbo Li | Erhan Gundogdu | Tianzhu Zhang | Osman Akin | Michel Valstar | Aykut Erdem | Karel Lebeda | Ondrej Miksik | Matthias Mueller | Daijin Kim | Mahdieh Poostchi | Naiyan Wang | Yang Li | Simon Hadfield | Robert Laganière | Isabela Drummond | Ke Gao | Jingjing Xiao | Siyi Li | Jiyeoup Jeong | Andreas Robinson | Richard Bowden | Pedro Senna | Shengping Zhang | Brais Martinez | Stuart Golodetz | Giorgio Roffo | Deepak Mishra | Bin Liu | Alfredo Petrosino | Alireza Memarmoghadam | Chong Sun | Francesco Battistone | Jae-chan Jeong | Jae-Yeong Lee | Jin Gao | Ji-Wan Kim | Mengdan Zhang | Sunglok Choi | Zhizhen Chi | Dapeng Chen | Lei Qin | Hyeonseob Nam | Xiangyuan Lan | Jiayi Feng | Jongwon Choi | Fahad Khan | José M. Martínez | Mooyeol Baek | Roman Pflugfelder | Anton Varfolomieiev | Shengkun Li | Ryan Walsh | Alan Lukežič | Madan Kumar Rapuru | Sumithra Kakanuru | Jiří Matas | Luka Čehovin | Tomáš Vojíř | Gustavo Fernández | Alvaro Garcia-Martin | Andrés Solís Montero | Andy J. Ma | Chang-Ming Chang | Gao Zhu | Gorthi R. K. Sai Subrahmanyam | Guilherme Bastos | Hyung Jin Chang | Jae-il Cho | Jin Young Choi | Mario Maresca | Muhammad Haris Khan | Pong C. Yuen | Rafael Martin-Nieto | Sebastian B. Krah | Stefan Becker | Zexiong Cai
