An Automated System for Generating Tactical Performance Statistics for Individual Soccer Players From Videos

The world of sports intrinsically involves fast and complex events that are difficult for coaches, trainers and players to analyze, and also for audiences to follow. In fast paced team sports such as soccer, keeping track of all the players and analyzing their performance after every match are very challenging. Current scenarios for identifying the best talents in soccer involve word-of-mouth and coaches/recruiters scouring through hours of manually annotated videos. This is a very expensive and laborious process and also biased by the nature of the recruiters. To alleviate these problems, this paper proposes an automated system that can detect, track, classify the teams of multiple players and identify the player controlling the ball in a video. The system generates three very important tactical statistics for a player: 1) duration of ball possession, 2) number of successful passes and 3) number of successful steals. This is done by training Convolutional Neural Networks (CNNs) to (a) localize and track the players on the field, (b) classify the team of a detected player, (c) identify the player controlling the ball and (d) pooling all the information extracted from (a), (b), and (c) to generate the statistics of players. To overcome the problem that the features learned from specific soccer matches do not necessarily generalize across different soccer matches, the paper proposes minimal amount of match-specific annotation and data augmentation, using a variant of Deep Convolutional Generative Adversarial Networks (DCGAN) to improve the accuracy. Experimental results and ablation studies show that the proposed approach outperforms the state-of-the-art approaches in terms of accuracy and processing speed.

[1]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[2]  Carl H. Smith,et al.  Generalization versus classification , 1992, COLT '92.

[3]  David Haussler,et al.  Exploiting Generative Models in Discriminative Classifiers , 1998, NIPS.

[4]  A. Scoulding,et al.  Passing in the Soccer World Cup 2002. , 2004 .

[5]  A. Redwood-Brown Passing patterns before and after goal scoring in FA Premier League Soccer , 2008 .

[6]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[7]  Jin-Jang Leou,et al.  An Improved Mean Shift Algorithm Based Tracking System for Soccer Game Analysis , 2009 .

[8]  Mohammad Rahmati,et al.  Automatic soccer players tracking in goal scenes by camera motion elimination , 2009, Image Vis. Comput..

[9]  Tiziana D'Orazio,et al.  An Investigation Into the Feasibility of Real-Time Soccer Offside Detection From a Multiple Camera System , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[10]  Jia Liu,et al.  Automatic Player Detection, Labeling and Tracking in Broadcast Soccer Video , 2007, BMVC.

[11]  A. Dellal,et al.  Ball Possession Strategies in Elite Soccer According to the Evolution of the Match-Score: the Influence of Situational Variables , 2010 .

[12]  Harold W. Kuhn,et al.  The Hungarian method for the assignment problem , 1955, 50 Years of Integer Programming.

[13]  James J. Little,et al.  Identifying players in broadcast sports videos using conditional random fields , 2011, CVPR 2011.

[14]  Bir Bhanu,et al.  Distributed Video Sensor Networks , 2011 .

[15]  Bojan Leontijević,et al.  INFLUENCE OF CERTAIN TACTICAL ATTACKING PATTERNS ON THE RESULT ACHIEVED BY THE TEAMS PARTICIPANTS OF THE 2010 FIFA WORLD CUP IN SOUTH AFRICA , 2011 .

[16]  Shihong Lao,et al.  Multiple Player Tracking in Sports Video: A Dual-Mode Two-Way Bayesian Inference Approach With Progressive Observation Modeling , 2011, IEEE Transactions on Image Processing.

[17]  J. Sampaio,et al.  Effects of game location and final outcome on game-related statistics in each zone of the pitch in professional football , 2012 .

[18]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[19]  Shu-Yuan Chen,et al.  Automatic Broadcast Soccer Video Analysis, Player Detection, and Tracking Based on Color Histogram , 2013 .

[20]  Tomoko Ogiwara Pass Appearance Time and pass attempts by teams qualifying for the second stage of FIFA World Cup 2010 in South Africa , 2013 .

[21]  Carsten Griwodz,et al.  Soccer video and player position dataset , 2014, MMSys '14.

[22]  Oluwasanmi Koyejo,et al.  Consistent Binary Classification with Generalized Performance Metrics , 2014, NIPS.

[23]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[24]  Xiaoou Tang,et al.  Pedestrian Attribute Recognition At Far Distance , 2014, ACM Multimedia.

[25]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[26]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[27]  Luca Bertinetto,et al.  Fully-Convolutional Siamese Networks for Object Tracking , 2016, ECCV Workshops.

[28]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[30]  Soumith Chintala,et al.  Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[31]  Kaori,et al.  Pass Appearance Time and Pass Attempts by Teams Qualifying for the Second Stage of FIFA World Cup 2014 in Brazil , 2016 .

[32]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Yuanjie Zheng,et al.  Breast Cancer Multi-classification from Histopathological Images with Structured Deep Learning Model , 2017, Scientific Reports.

[34]  Christophe De Vleeschouwer,et al.  Scene-specific classifier for effective and efficient team sport players detection from a single calibrated camera , 2017, Comput. Vis. Image Underst..

[35]  Hamid Abrishami Moghaddam,et al.  A survey on player tracking in soccer videos , 2017, Comput. Vis. Image Underst..

[36]  Dietrich Paulus,et al.  Simple online and realtime tracking with a deep association metric , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[37]  Yaser Sheikh,et al.  OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  James J. Little,et al.  Classification of Puck Possession Events in Ice Hockey , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[39]  Andrew Zisserman,et al.  Detect to Track and Track to Detect , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[40]  Nuno Vasconcelos,et al.  Cascade R-CNN: Delving Into High Quality Object Detection , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[41]  Xiaoou Tang,et al.  LiteFlowNet: A Lightweight Convolutional Neural Network for Optical Flow Estimation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[42]  Fine-Grained Activity Recognition in Baseball Videos , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[43]  Bir Bhanu,et al.  DeephESC: An Automated System for Generating and Classification of Human Embryonic Stem Cells , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[44]  Tae-Hyun Oh,et al.  Part-Based Player Identification Using Deep Convolutional Representation and Multi-scale Pooling , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[45]  Jianfeng Xu,et al.  Fast and Accurate Object Detection Using Image Cropping/Resizing in Multi-View 4K Sports Videos , 2018, MMSports@MM.

[46]  Marc Van Droogenbroeck,et al.  A Bottom-Up Approach Based on Semantics for the Interpretation of the Main Camera Stream in Soccer Games , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[47]  Bir Bhanu,et al.  Soccer: Who Has the Ball? Generating Visual Analytics and Player Statistics , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[48]  Alireza Behrad,et al.  Event detection in soccer videos using unsupervised learning of Spatio-temporal features based on pooled spatial pyramid model , 2018, Multimedia Tools and Applications.

[49]  Tieniu Tan,et al.  A Light CNN for Deep Face Representation With Noisy Labels , 2015, IEEE Transactions on Information Forensics and Security.

[50]  Bir Bhanu,et al.  Fine-Grained Visual Dribbling Style Analysis for Soccer Videos With Augmented Dribble Energy Image , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[51]  Bir Bhanu,et al.  Pose-Guided R-CNN for Jersey Number Recognition in Sports , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[52]  Rajkumar Theagarajan,et al.  DeephESC 2.0: Deep Generative Multi Adversarial Networks for improving the classification of hESC , 2019, PloS one.

[53]  David A. Clausi,et al.  Temporal Hockey Action Recognition via Pose and Optical Flows , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[54]  Marc Van Droogenbroeck,et al.  ARTHuS: Adaptive Real-Time Human Segmentation in Sports Through Online Distillation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[55]  Christophe De Vleeschouwer,et al.  Associative Embedding for Team Discrimination , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[56]  Ross B. Girshick,et al.  Mask R-CNN , 2017, 1703.06870.

[57]  Yaser Sheikh,et al.  OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.