Exploring structure for long-term tracking of multiple objects in sports videos

Abstract In this paper, we propose a novel approach for exploiting structural relations to track multiple objects that may undergo long-term occlusion and abrupt motion. We use a model-free approach that relies only on annotations given in the first frame of the video to track all the objects online, i.e. without knowledge from future frames. We initialize a probabilistic Attributed Relational Graph (ARG) from the first frame, which is incrementally updated along the video. Instead of using the structural information only to evaluate the scene, the proposed approach considers it to generate new tracking hypotheses. In this way, our method is capable of generating relevant object candidates that are used to improve or recover the track of lost objects. The proposed method is evaluated on several videos of table tennis, volleyball, and on the ACASVA dataset. The results show that our approach is very robust, flexible and able to outperform other state-of-the-art methods in sports videos that present structural patterns.

[1]  Konrad Schindler,et al.  Continuous Energy Minimization for Multitarget Tracking , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Lei Zhang,et al.  Real-Time Compressive Tracking , 2012, ECCV.

[3]  Rainer Stiefelhagen,et al.  Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics , 2008, EURASIP J. Image Video Process..

[4]  Yihong Gong,et al.  Multi-target tracking by learning local-to-global trajectory models , 2015, Pattern Recognit..

[5]  Dongbing Gu,et al.  Abrupt motion tracking using a visual saliency embedded particle filter , 2014, Pattern Recognit..

[6]  Yao Lu,et al.  Abrupt motion tracking via adaptive stochastic approximation Monte Carlo sampling , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[7]  Matej Kristan,et al.  Closed-world tracking of multiple interacting targets for indoor-sports applications , 2009, Comput. Vis. Image Underst..

[8]  Jean Ponce,et al.  Learning Graphs to Match , 2013, 2013 IEEE International Conference on Computer Vision.

[9]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[10]  Anderson Rocha,et al.  A multiple camera methodology for automatic localization and tracking of futsal players , 2014, Pattern Recognit. Lett..

[11]  Silvio Savarese,et al.  Learning to Track: Online Multi-object Tracking by Decision Making , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[12]  Junseok Kwon,et al.  Tracking of Abrupt Motion Using Wang-Landau Monte Carlo Estimation , 2008, ECCV.

[13]  Ming-Hsuan Yang,et al.  Object Tracking Benchmark , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  David Windridge,et al.  An evaluation of bags-of-words and spatio-temporal shapes for action recognition , 2011, 2011 IEEE Workshop on Applications of Computer Vision (WACV).

[15]  Changsheng Xu,et al.  Object Tracking by Occlusion Detection via Structured Sparse Learning , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[16]  Yanxi Liu,et al.  Tracking Sports Players with Context-Conditioned Motion Models , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Philip H. S. Torr,et al.  Struck: Structured output tracking with kernels , 2011, ICCV.

[18]  James J. Little,et al.  A Boosted Particle Filter: Multitarget Detection and Tracking , 2004, ECCV.

[19]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[20]  Séverine Dubuisson,et al.  A survey of datasets for visual tracking , 2015, Machine Vision and Applications.

[21]  Philippe C. Cattin,et al.  Tracking the invisible: Learning where the object might be , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[22]  Isabelle Bloch,et al.  Fragments based tracking with adaptive cue integration , 2012, Comput. Vis. Image Underst..

[23]  Wongun Choi,et al.  Near-Online Multi-target Tracking with Aggregated Local Flow Descriptor , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[24]  Michael Isard,et al.  CONDENSATION—Conditional Density Propagation for Visual Tracking , 1998, International Journal of Computer Vision.

[25]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[26]  James J. Little,et al.  Learning to Track and Identify Players from Broadcast Sports Videos , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Roberto Marcondes Cesar Junior,et al.  Attributed Graphs for Tracking Multiple Objects in Structured Sports Videos , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[28]  Gernot A. Fink,et al.  Markov Models for Pattern Recognition: From Theory to Applications , 2007 .

[29]  Francesco Solera,et al.  Learning to Divide and Conquer for Online Multi-target Tracking , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[30]  Vibhav Vineet,et al.  Struck: Structured Output Tracking with Kernels , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Mubarak Shah,et al.  Tracking When the Camera Looks Away , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[32]  Roberto Marcondes Cesar Junior,et al.  On the ternary spatial relation "Between" , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[33]  Bernt Schiele,et al.  Detection and Tracking of Occluded People , 2014, International Journal of Computer Vision.

[34]  Pascal Fua,et al.  Multi-Commodity Network Flow for Tracking Multiple People , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Ricardo M. L. Barros,et al.  Background recovering in outdoor image sequences: An example of soccer players segmentation , 2006, Image Vis. Comput..

[36]  Shihong Lao,et al.  Multiple Player Tracking in Sports Video: A Dual-Mode Two-Way Bayesian Inference Approach With Progressive Observation Modeling , 2011, IEEE Transactions on Image Processing.

[37]  Patrick Pérez,et al.  Color-Based Probabilistic Tracking , 2002, ECCV.

[38]  Lu Zhang,et al.  Preserving Structure in Model-Free Tracking , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.