Modular tracking framework: A fast library for high precision tracking

This paper presents MTF — a modular, extensible and highly efficient open source framework for registration based tracking targeted at robotics applications. It is implemented entirely in C++ and is designed from the ground up to easily integrate with systems that support any of several major vision and robotics libraries including OpenCV, ROS, ViSP and Eigen. It is also faster and more precise than other existing tracking systems. In order to establish the theoretical basis for its design, a new way to conceptualize registration based trackers is also introduced that decomposes tracking into three sub modules — Search Method, Appearance Model and State Space Model. Along with being a practical solution for fast and high precision tracking, this framework can also serve as a useful research tool by allowing existing and new methods for any of the sub modules to be studied better. Through extensive use of generic programming, the system makes it easy to plug in a new method for any of the sub modules so that it can not only be tested comprehensively with existing methods but also become immediately available for deployment in any project that uses the framework.

[1]  Maxime Meilland,et al.  Improving NCC-Based Direct Visual Tracking , 2012, ECCV.

[2]  Gregory D. Hager,et al.  Efficient Region Tracking With Parametric Models of Geometry and Illumination , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Mennatullah Siam Based Appearance Model with Approximate Nearest Neigbour Search , 2016 .

[4]  Selim Benhimane,et al.  Homography-based 2D Visual Tracking and Servoing , 2007, Int. J. Robotics Res..

[5]  Zhenyu He,et al.  The Visual Object Tracking VOT2016 Challenge Results , 2016, ECCV Workshops.

[6]  Frank Chongwoo Park,et al.  A Geometric Particle Filter for Template-Based Visual Tracking , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Geraldo F. Silveira,et al.  Real-time Visual Tracking under Arbitrary Illumination Changes , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Abhineet Singh,et al.  Unifying Registration Based Tracking: A Case Study with Structural Similarity , 2017, 2017 IEEE Winter Conference on Applications of Computer Vision (WACV).

[9]  Xi Zhang,et al.  RKLT: 8 DOF Real-Time Robust Video Tracking Combing Coarse Ransac Features and Accurate Fast Template Registration , 2015, 2015 12th Conference on Computer and Robot Vision.

[10]  Ming-Hsuan Yang,et al.  Incremental Learning for Robust Visual Tracking , 2008, International Journal of Computer Vision.

[11]  Russell H. Taylor,et al.  Visual tracking using the sum of conditional variance , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[12]  Simone Calderara,et al.  Visual Tracking: An Experimental Survey , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Simon Baker,et al.  Lucas-Kanade 20 Years On: A Unifying Framework , 2004, International Journal of Computer Vision.

[14]  Silvio Savarese,et al.  Learning to Track at 100 FPS with Deep Regression Networks , 2016, ECCV.

[15]  Éric Marchand,et al.  ViSP for visual servoing: a generic software platform with a wide class of robot control skills , 2005, IEEE Robotics & Automation Magazine.

[16]  Simon Baker,et al.  Equivalence and efficiency of image alignment algorithms , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[17]  Martin Jägersand,et al.  Realtime Registration-Based Tracking via Approximate Nearest Neighbour Search , 2013, Robotics: Science and Systems.

[18]  G. Schweighofer Groupwise Geometric and Photometric Direct Image Registration , 2006 .

[19]  Richard Szeliski,et al.  Construction of Panoramic Image Mosaics with Global and Local Alignment , 2001 .

[20]  Morgan Quigley,et al.  ROS: an open-source Robot Operating System , 2009, ICRA 2009.

[21]  J.-Y. Bouguet,et al.  Pyramidal implementation of the lucas kanade feature tracker , 1999 .

[22]  David G. Lowe,et al.  Fast Approximate Nearest Neighbors with Automatic Algorithm Configuration , 2009, VISAPP.

[23]  James M. Conrad,et al.  Survey of popular robotics simulators, frameworks, and toolkits , 2011, 2011 Proceedings of IEEE Southeastcon.

[24]  Gregory D. Hager,et al.  X Vision: A Portable Substrate for Real-Time Vision Applications , 1998, Comput. Vis. Image Underst..

[25]  Éric Marchand,et al.  Accurate real-time tracking using mutual information , 2010, 2010 IEEE International Symposium on Mixed and Augmented Reality.

[26]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[27]  Amaury Dame A unified direct approach for visual servoing and visual tracking using mutual information , 2010 .

[28]  Xi Zhang,et al.  Modular Decomposition and Analysis of Registration Based Trackers , 2016, 2016 13th Conference on Computer and Robot Vision (CRV).

[29]  P. Rousseeuw Least Median of Squares Regression , 1984 .

[30]  Abhineet Singh,et al.  Modular Tracking Framework: A Unified Approach to Registration based Tracking , 2016, ArXiv.