Event Detection in Soccer Videos through Text-based Localization and Audiovisual Analysis

This paper presents a framework for soccer event detection through joint textual, aural and visual feature analysis. First, textual cues from online sporting resources are used to localize each event in time, which significantly reduces the search space. Then, generic rule-sets imposed on specific audiovisual feature properties are applied within the localized segments to isolate the most compressed view of each event. Experiments conducted on 30 hours of soccer video from various broadcasters show encouraging results for the detection of goals, penalties, yellow cards, red cards and substitutions.
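The text-based localization step can be illustrated with a minimal sketch. It is not the paper's implementation: the keyword table, the `12' ...` minute-stamp format, the `margin_s` padding, and the names `SearchWindow` and `localize_events` are all assumptions made for illustration. The sketch only shows how a minute-by-minute match report could be turned into short search windows that a later audiovisual stage would examine.

```python
import re
from dataclasses import dataclass

# Hypothetical cue vocabulary; the actual textual cues used in the paper
# are not specified in the abstract.
EVENT_KEYWORDS = {
    "goal": ("goal", "scores"),
    "penalty": ("penalty",),
    "yellow card": ("yellow card", "booked"),
    "red card": ("red card", "sent off"),
    "substitution": ("substitution", "replaces"),
}


@dataclass
class SearchWindow:
    event: str    # event type named by the textual cue
    start_s: int  # window start, in seconds of match time
    end_s: int    # window end, in seconds of match time


def localize_events(report_lines, margin_s=60):
    """Map minute-stamped report lines to padded search windows.

    Assumes each relevant line starts with a minute stamp such as "12'".
    Lines without a stamp or without a known keyword are ignored.
    """
    windows = []
    for line in report_lines:
        match = re.match(r"\s*(\d+)'\s*(.*)", line)
        if not match:
            continue
        minute = int(match.group(1))
        text = match.group(2).lower()
        for event, keywords in EVENT_KEYWORDS.items():
            if any(keyword in text for keyword in keywords):
                # A stamp of N' refers to the N-th minute of play, i.e. the
                # interval [(N-1)*60, N*60) seconds, padded on both sides.
                start = max(0, (minute - 1) * 60 - margin_s)
                end = minute * 60 + margin_s
                windows.append(SearchWindow(event, start, end))
                break  # the first matching event type wins for this line
    return windows


report = [
    "12' Goal! Smith scores from close range.",
    "45' Yellow card shown to Jones.",
    "Half time",
    "67' Substitution: Brown replaces Smith.",
]
for window in localize_events(report):
    print(window)
```

Under these assumptions, a full match is reduced to a handful of windows of a few minutes each, and only those windows would be passed on to the audiovisual rule-sets.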
