Soccer Video Structure Analysis by Parallel Feature Fusion Network and Hidden-to-Observable Transferring Markov Model

Automated analysis of broadcast soccer game video is a challenging computer vision problem. Prior to performing high-level analysis (such as event detection), accurate classification of shot views and play–break segmentation are required to analyze the structure of soccer video. A novel deep network called parallel feature fusion network (PFF-Net) combines local and full-scene features to produce accurate shot view classification based on camera zoom and out-of-field status. Then, a novel hidden-to-observable Markov model (H2O-MM) is introduced to determine play/break status of the shots. Testing is performed using a variety of professional broadcast soccer videos. Variations of the PFF-Net are considered and compared with four existing methods where the PFF-Net demonstrates superior performance (92.6%). The H2O-MM has the accuracy of 98.7% for play–break segmentation, which is an improvement over two existing hidden Markov models. The new methods provide improved temporal labeling of broadcast soccer videos, which can be used to further improve overall automated event detection.

[1]  Xueming Qian,et al.  HMM based soccer video event detection using enhanced mid-level semantic , 2011, Multimedia Tools and Applications.

[2]  Mehran Yazdi,et al.  Log-Spectrum based RSTB invariant template matching with modified ICA , 2010, 2010 5th International Symposium on Telecommunications.

[3]  Pat Langley,et al.  Estimating Continuous Distributions in Bayesian Classifiers , 1995, UAI.

[4]  Amjad Rehman,et al.  Features extraction for soccer video semantic analysis: current achievements and remaining issues , 2012, Artificial Intelligence Review.

[5]  Chung-Lin Huang,et al.  Semantic analysis of soccer video using dynamic Bayesian network , 2006, IEEE Transactions on Multimedia.

[6]  Samira Pouyanfar,et al.  Semantic Event Detection Using Ensemble Deep Learning , 2016, 2016 IEEE International Symposium on Multimedia (ISM).

[7]  Jia Liu,et al.  Automatic Player Detection, Labeling and Tracking in Broadcast Soccer Video , 2007, BMVC.

[8]  M. Archana,et al.  Object Detection and Tracking Based on Trajectory in Broadcast Tennis Video , 2015 .

[9]  Yi-Ping Phoebe Chen,et al.  The power of play-break for automatic detection and browsing of self-consumable sport video highlights , 2004, MIR '04.

[10]  Marjan Mernik,et al.  Exploration and exploitation in evolutionary algorithms: A survey , 2013, CSUR.

[11]  Tiziana D'Orazio,et al.  An Investigation Into the Feasibility of Real-Time Soccer Offside Detection From a Multiple Camera System , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[12]  Sudipta Roy,et al.  Video shot boundary detection: A review , 2015, 2015 IEEE International Conference on Electrical, Computer and Communication Technologies (ICECCT).

[13]  Weiping Li,et al.  Review of Deep Learning , 2018, ArXiv.

[14]  David J. Field,et al.  Sparse coding with an overcomplete basis set: A strategy employed by V1? , 1997, Vision Research.

[15]  Mehran Yazdi,et al.  Shot boundary detection with effective prediction of transitions' positions and spans by use of classifiers and adaptive thresholds , 2016, 2016 24th Iranian Conference on Electrical Engineering (ICEE).

[16]  Bhabatosh Chanda,et al.  A Model-Based Shot Boundary Detection Technique Using Frame Transition Parameters , 2012, IEEE Transactions on Multimedia.

[17]  Amir-Masoud Eftekhari-Moghadam,et al.  Multimodal feature extraction and fusion for semantic mining of soccer video: a survey , 2012, Artificial Intelligence Review.

[18]  Hamid Reza Pourreza,et al.  Fast Highlight Detection and Scoring for Broadcast Soccer Video Summarization using On-Demand Feature Extraction and Fuzzy Inference , 2015 .

[19]  Yi-Ping Phoebe Chen,et al.  Knowledge-Discounted Event Detection in Sports Video , 2010, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[20]  Shohreh Kasaei,et al.  Event Detection and Summarization in Soccer Videos Using Bayesian Network and Copula , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[21]  Noel E. O'Connor,et al.  Event detection in field sports video using audio-visual features and a support vector Machine , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[22]  Yong Shi,et al.  Fast Video Shot Boundary Detection Based on SVD and Pattern Matching , 2013, IEEE Transactions on Image Processing.

[23]  Marco Wiering,et al.  Using Deep Convolutional Neural Networks to Predict Goal-scoring Opportunities in Soccer , 2017, ICPRAM.

[24]  Yann LeCun,et al.  Predicting Deeper into the Future of Semantic Segmentation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[25]  A. Murat Tekalp,et al.  Fuzzy framework for unsupervised video content characterization and shot classification , 2001, J. Electronic Imaging.

[26]  N. K. Narayanan,et al.  Key-frame Extraction by Analysis of Histograms of Video Frames Using Statistical Methods , 2015 .

[27]  Shih-Fu Chang,et al.  Structure analysis of soccer video with domain knowledge and hidden Markov models , 2004, Pattern Recognit. Lett..

[28]  Keeseong Cho,et al.  Extraction of visual information in basketball broadcasting video for event segmentation system , 2016, 2016 International Conference on Information and Communication Technology Convergence (ICTC).

[29]  Lionel. Guimaraes,et al.  Shot classification in broadcast soccer video. , 2013 .

[30]  Alberto Del Bimbo,et al.  Taking into Consideration Sports Semantic Annotation of Sports Videos Content-based Multimedia Indexing and Retrieval , 2002 .

[31]  A. Murat Tekalp,et al.  Automatic Soccer Video Analysis and Summarization , 2003, IS&T/SPIE Electronic Imaging.

[32]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[33]  Karsten Müller,et al.  Soccer Jersey Number Recognition Using Convolutional Neural Networks , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[34]  Changsheng Xu,et al.  Sports Video Analysis: Semantics Extraction, Editorial Content Creation and Adaptation , 2009, J. Multim..

[35]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[36]  Yann LeCun,et al.  Convolutional networks and applications in vision , 2010, Proceedings of 2010 IEEE International Symposium on Circuits and Systems.

[37]  Hamid Soltanian-Zadeh,et al.  Counterattack detection in broadcast soccer videos using camera motion estimation , 2015, 2015 The International Symposium on Artificial Intelligence and Signal Processing (AISP).