Dictionary Learning and Confidence Map Estimation-Based Tracker for Robot-Assisted Therapy System

In this paper, we propose a new tracker based on dictionary learning and confidence map estimation for a robot-assisted therapy system. We first over-segment the image into superpixel patches, and then employ color and depth cues to estimate the object confidence of each superpixel patch. We build two Bag-of-Word (BoW) models from initial frames to encode foreground/background appearance, and compute object confidence at superpixel level using BoW model in both foreground and background. We further refine target confidence by depth-based statistical features to mitigate noise interference and the uncertainty of visual cues. We derive the global confidence of each target candidate at bag level, and incorporate the confidence estimations to determine the posterior probability of each candidate within the Bayesian framework. Experimental results demonstrate the superior performance of the proposed method, especially in long-term tracking and occlusion handling.

[1]  Arnold W. M. Smeulders,et al.  UvA-DARE (Digital Academic Repository) Siamese Instance Search for Tracking , 2016 .

[2]  Rui Caseiro,et al.  High-Speed Tracking with Kernelized Correlation Filters , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Lei Zhang,et al.  Real-Time Compressive Tracking , 2012, ECCV.

[4]  Silvio Savarese,et al.  Learning to Track at 100 FPS with Deep Regression Networks , 2016, ECCV.

[5]  Jiri Matas,et al.  P-N learning: Bootstrapping binary classifiers by structural constraints , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6]  Youfu Li,et al.  GM-PHD-Based Multi-Target Visual Tracking Using Entropy Distribution and Game Theory , 2014, IEEE Transactions on Industrial Informatics.

[7]  Honghai Liu,et al.  Multiple Perspective Object Tracking via Context-Aware Correlation Filter , 2018, IEEE Access.

[8]  Qingming Huang,et al.  Hedged Deep Tracking , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Zhongfei Zhang,et al.  A survey of appearance models in visual object tracking , 2013, ACM Trans. Intell. Syst. Technol..

[10]  Huchuan Lu,et al.  Robust Superpixel Tracking , 2014, IEEE Transactions on Image Processing.

[11]  Ming-Hsuan Yang,et al.  JOTS: Joint Online Tracking and Segmentation , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Jinhai Xiang,et al.  Robust visual tracking via bag of superpixels , 2015, Multimedia Tools and Applications.

[13]  Michael Felsberg,et al.  Learning Spatially Regularized Correlation Filters for Visual Tracking , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[14]  Shengyong Chen,et al.  Adaptive Compressive Tracking based on Locality Sensitive Histograms , 2017, Pattern Recognit..

[15]  Ming-Hsuan Yang,et al.  Robust Object Tracking with Online Multiple Instance Learning , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[17]  Pascal Fua,et al.  SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Wolfgang Nejdl,et al.  Introduction to the special section on twitter and microblogging services , 2013, TIST.

[19]  Vibhav Vineet,et al.  Struck: Structured Output Tracking with Kernels , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.