Automatic player labeling, tracking and field registration and trajectory mapping in broadcast soccer video

In this article, we present a method to perform automatic player trajectories mapping based on player detection, unsupervised labeling, efficient multi-object tracking, and playfield registration in broadcast soccer videos. Player detector determines the players' positions and scales by combining the ability of dominant color based background subtraction and a boosting detector with Haar features. We first learn the dominant color with accumulate color histogram at the beginning of processing, then use the player detector to collect hundreds of player samples, and learn player appearance codebook by unsupervised clustering. In a soccer game, a player can be labeled as one of four categories: two teams, referee or outlier. The learning capability enables the method to be generalized well to different videos without any manual initialization. With the dominant color and player appearance model, we can locate and label each player. After that, we perform multi-object tracking by using Markov Chain Monte Carlo (MCMC) data association to generate player trajectories. Some data driven dynamics are proposed to improve the Markov chain's efficiency, such as label consistency, motion consistency, and track length, etc. Finally, we extract key-points and find the mapping from an image plane to the standard field model, and then map players' position and trajectories to the field. A large quantity of experimental results on FIFA World Cup 2006 videos demonstrate that this method can reach high detection and labeling precision, reliably tracking in scenes of player occlusion, moderate camera motion and pose variation, and yield promising field registration results.

[1]  Antonio Criminisi,et al.  TextonBoost: Joint Appearance, Shape and Context Modeling for Multi-class Object Recognition and Segmentation , 2006, ECCV.

[2]  Qian Yu,et al.  Map-Enhanced Detection and Tracking from a Moving Platform with Local and Global Data Association , 2007, 2007 IEEE Workshop on Motion and Video Computing (WMVC'07).

[3]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[4]  Michael Isard,et al.  BraMBLe: a Bayesian multiple-blob tracker , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[5]  Samy Bengio,et al.  Semi-supervised adapted HMMs for unusual event detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[6]  Peter H. N. de With,et al.  Fast camera calibration for the analysis of sport sequences , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[7]  P.V.C. Hough,et al.  Machine Analysis of Bubble Chamber Pictures , 1959 .

[8]  A. G. Amitha Perera,et al.  Multi-Object Tracking Through Simultaneous Long Occlusions and Split-Merge Conditions , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[9]  Svetha Venkatesh,et al.  Object labelling from human action recognition , 2003, Proceedings of the First IEEE International Conference on Pervasive Computing and Communications, 2003. (PerCom 2003)..

[10]  Jake K. Aggarwal,et al.  Tracking soccer players using broadcast TV images , 2005, IEEE Conference on Advanced Video and Signal Based Surveillance, 2005..

[11]  Wolfgang Effelsberg,et al.  Robust camera calibration for sport videos using court models , 2003, IS&T/SPIE Electronic Imaging.

[12]  Wen Gao,et al.  Mining Information of Attack-Defense Status from Soccer Video Based on Scene Analysis , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[13]  Tao Wang,et al.  Soccer Highlight Detection using Two-Dependence Bayesian Network , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[14]  S. Shankar Sastry,et al.  Markov Chain Monte Carlo Data Association for Multi-Target Tracking , 2009, IEEE Transactions on Automatic Control.

[15]  Hanspeter Bieri,et al.  A Video‐Based 3D‐Reconstruction of Soccer Games , 2000, Comput. Graph. Forum.

[16]  A. G. Amitha Perera,et al.  A unified framework for tracking through occlusions and across sensor gaps , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[17]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[18]  Tim Hesterberg,et al.  Monte Carlo Strategies in Scientific Computing , 2002, Technometrics.

[19]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[20]  Miki Haseyama,et al.  A soccer field tracking method with wire frame model from TV images , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[21]  James J. Little,et al.  A Boosted Particle Filter: Multitarget Detection and Tracking , 2004, ECCV.

[22]  Sang Wook Lee,et al.  Probabilistic Tracking of Soccer Players and Ball , 2004 .

[23]  Stefan Carlsson,et al.  Tracking and Labelling of Interacting Multiple Targets , 2006, ECCV.

[24]  Wen Gao,et al.  A new method to calculate the camera focusing area and player position on playfield in soccer video , 2005, Visual Communications and Image Processing.

[25]  LiuJia,et al.  Automatic player labeling, tracking and field registration and trajectory mapping in broadcast soccer video , 2011 .

[26]  Gérard G. Medioni,et al.  Multiple Target Tracking Using Spatio-Temporal Markov Chain Monte Carlo Data Association , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[27]  S. Shankar Sastry,et al.  Markov Chain Monte Carlo Data Association for Multiple-Target Tracking , 2005, CDC 2005.

[28]  Y. Bar-Shalom Tracking and data association , 1988 .

[29]  Stefan Carlsson,et al.  Multi-Target Tracking - Linking Identities using Bayesian Network Inference , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[30]  Xinguo Yu,et al.  3D reconstruction and enrichment of broadcast soccer video , 2004, MULTIMEDIA '04.

[31]  Jeff A. Bilmes,et al.  A gentle tutorial of the em algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models , 1998 .

[32]  Bernt Schiele,et al.  Pedestrian detection in crowded scenes , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[33]  Yongduek Seo,et al.  Physics-based 3D position analysis of a soccer ball from monocular image sequences , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[34]  Lifeng Sun,et al.  A Three-Level Scheme for Real-Time Ball Tracking , 2007, MCAM.

[35]  Donald Reid An algorithm for tracking multiple targets , 1978 .

[36]  Dong Xu,et al.  Visual Event Recognition in News Video using Kernel Methods with Multi-Level Temporal Alignment , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[37]  Harry Shum,et al.  Automatic extraction of semantic colors in sports video , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[38]  Graham A. Thomas,et al.  Real-time camera tracking using sports pitch markings , 2007, Journal of Real-Time Image Processing.

[39]  Wenlong Li,et al.  Optimization and Parallelization on a Multimeida Application , 2007, 2007 IEEE International Conference on Multimedia and Expo.