Deep learning assisted robust visual tracking with adaptive particle filtering

We propose a novel visual tracking algorithm based on the representations from a pre-trained Convolutional Neural Network (CNN). Our algorithm pre-trains a simplified CNN using a large set of videos with tracking ground truths to obtain a generic target representation. When tracking, Particle Filtering (PF) is combined to the fully-connected layer in the pre-trained CNN. Deep representations and hand-crafted features help to model tracking. To optimize the particles distribution, the velocity and acceleration information aids to calculate dynamic model. Meanwhile, our algorithm updates the tracking model in a lazy manner to avoid shift and expensive computation. As compared to previous methods, our results demonstrate superior performances in existing tracking benchmarks. A simplified deep learning frame is applied into visual tracking.Simple network frame and adaptive particle filter make the tracker more robust especially when the quick movement occurs.This method is validated through subjective and objective metrics.Quantitative comparisons highlight the contributions of FC features and hand-crafted features.

[1]  Ming Tang,et al.  Robust tracking via weakly supervised ranking SVM , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Ehud Rivlin,et al.  Robust Fragments-based Tracking using the Integral Histogram , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[3]  Qingshan Liu,et al.  Robust Visual Tracking via Convolutional Networks Without Training , 2015, IEEE Transactions on Image Processing.

[4]  Horst Bischof,et al.  Semi-supervised On-Line Boosting for Robust Tracking , 2008, ECCV.

[5]  Jenny Benois-Pineau,et al.  A multi-resolution particle filter tracking in a multi-camera environment , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[6]  Ming-Hsuan Yang,et al.  Incremental Learning for Robust Visual Tracking , 2008, International Journal of Computer Vision.

[7]  Nassir Navab,et al.  Rapid selection of reliable templates for visual tracking , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[8]  Laura Sevilla-Lara,et al.  Distribution fields for tracking , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[10]  Rynson W. H. Lau,et al.  Visual Tracking via Locality Sensitive Histograms , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Huchuan Lu,et al.  Visual tracking via adaptive structural local sparse appearance model , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Lei Han,et al.  An object tracking method based on guided filter for night fusion image , 2016 .

[13]  Seunghoon Hong,et al.  Online Tracking by Learning Discriminative Saliency Map with Convolutional Neural Network , 2015, ICML.

[14]  Haibin Ling,et al.  Robust visual tracking using ℓ1 minimization , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[15]  Yi Wu,et al.  Online Object Tracking: A Benchmark , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[16]  Dominique Barba,et al.  Tracking of Objects in Video Scenes with Time Varying Content , 2002, EURASIP J. Adv. Signal Process..

[17]  Yi Li,et al.  DeepTrack: Learning Discriminative Feature Representations by Convolutional Neural Networks for Visual Tracking , 2014, BMVC.

[18]  Josef Kittler,et al.  Audio Assisted Robust Visual Tracking With Adaptive Particle Filtering , 2015, IEEE Transactions on Multimedia.

[19]  Patrick Pérez,et al.  Color-Based Probabilistic Tracking , 2002, ECCV.

[20]  Changsheng Xu,et al.  Structural Sparse Tracking , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Michael Isard,et al.  CONDENSATION—Conditional Density Propagation for Visual Tracking , 1998, International Journal of Computer Vision.

[22]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[23]  Haibin Ling,et al.  Real time robust L1 tracker using accelerated proximal gradient approach , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[24]  Dit-Yan Yeung,et al.  Learning a Deep Compact Image Representation for Visual Tracking , 2013, NIPS.

[25]  Lei Zhang,et al.  Fast Compressive Tracking , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Afshin Dehghan,et al.  Deep Tracking: Visual Tracking Using Deep Convolutional Networks , 2015, ArXiv.

[27]  Huchuan Lu,et al.  Superpixel tracking , 2011, 2011 International Conference on Computer Vision.

[28]  Yifan Zhou,et al.  Multi-object particle filter tracking with automatic event analysis , 2010, ARTEMIS '10.

[29]  Ming-Hsuan Yang,et al.  Robust Object Tracking with Online Multiple Instance Learning , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.