Semantic Concept Mining Based on Hierarchical Event Detection for Soccer Video Indexing

In this paper, we present a novel automated indexing and semantic labeling for broadcast soccer video sequences. The proposed method automatically extracts silent events from the video and classifies each event sequence into a concept by sequential association mining. The paper makes three new contributions in multimodal sports video indexing and summarization. First, we propose a novel hierarchical framework for soccer (football) video event sequence detection and classification. Unlike most existing video classification approaches, which focus on shot detection followed by shot-clustering for classification, the proposed scheme perform a top-down video scene classification which avoids shot clustering. This improves the classification accuracy and also maintains the temporal order of shots. Second, we compute the association for the events of each excitement clip using a priori mining algorithm. We propose a novel sequential association distance to classify the association of the excitement clip into semantic concepts. For soccer video, we have considered goal scored by team-A, goal scored by team-B, goal saved by team-A, goal saved by team-B as semantic concepts. Third, the extracted excitement clips with semantic concept label helps us to summarize many hours of video to collection of soccer highlights such as goals, saves, corner kicks, etc. We show promising results, with correctly indexed soccer scenes, enabling structural and temporal analysis, such as video retrieval, highlight extraction, and video skimming.

[1]  Shih-Fu Chang,et al.  Algorithms and system for segmentation and structure analysis in soccer video , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[2]  Changsheng Xu,et al.  Robust soccer highlight generation with a novel dominant-speech feature extractor , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[3]  R. Dahyot,et al.  Browsing sports video: trends in sports-related indexing and retrieval work , 2006, IEEE Signal Processing Magazine.

[4]  Joemon M. Jose,et al.  Football Video Segmentation Based on Video Production Strategy , 2005, ECIR.

[5]  Baoxin Li,et al.  A general framework for sports video summarization with its application to soccer , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[6]  Harry Shum,et al.  Generic slow-motion replay detection in sports video , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[7]  Nicola Ancona,et al.  Goal detection in football by using support vector machines for classification , 2001, IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222).

[8]  Lei Wang,et al.  Offense based temporal segmentation for event detection in soccer video , 2004, MIR '04.

[9]  Somnath Sengupta,et al.  A Hierarchical Framework for Generic Sports Video Classification , 2006, ACCV.

[10]  Chong-Wah Ngo,et al.  On clustering and retrieval of video shots through temporal slices analysis , 2002, IEEE Trans. Multim..

[11]  Guna Seetharaman,et al.  Flux Tensor Constrained Geodesic Active Contours with Sensor Fusion for Persistent Object Tracking , 2007, J. Multim..

[12]  A. Murat Tekalp,et al.  Automatic soccer video analysis and summarization , 2003, IEEE Trans. Image Process..

[13]  Baoxin Li,et al.  Automatic detection of replay segments in broadcast sports programs by detection of logos in scene transitions , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[14]  Alberto Del Bimbo,et al.  Semantic annotation of soccer videos: automatic highlights identification , 2003, Comput. Vis. Image Underst..

[15]  Alan Hanjalic,et al.  Adaptive extraction of highlights from a sport video based on excitement modeling , 2005, IEEE Transactions on Multimedia.

[16]  Chng Eng Siong,et al.  Generation of Personalized Music Sports Video Using Multimodal Cues , 2007, IEEE Transactions on Multimedia.

[17]  Noboru Babaguchi,et al.  Personalized abstraction of broadcasted American football video by highlight selection , 2004, IEEE Transactions on Multimedia.

[18]  Hanqing Lu,et al.  Shot Classification in Broadcast Soccer Video , 2008 .

[19]  Regunathan Radhakrishnan,et al.  Audio events detection based highlights extraction from baseball, golf and soccer games in a unified framework , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[20]  Chiou-Ting Hsu,et al.  Fusion of audio and motion information on HMM-based highlight extraction for baseball games , 2006, IEEE Transactions on Multimedia.

[21]  Stefan Carlsson,et al.  Multi-Target Tracking - Linking Identities using Bayesian Network Inference , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[22]  N. Vincent,et al.  3 classes segmentation for analysis of football audio sequences , 2002, 2002 14th International Conference on Digital Signal Processing Proceedings. DSP 2002 (Cat. No.02TH8628).

[23]  Avideh Zakhor,et al.  Applications of Video-Content Analysis and Retrieval , 2002, IEEE Multim..

[24]  Nikolas P. Galatsanos,et al.  Scene Detection in Videos Using Shot Clustering and Symbolic Sequence Segmentation , 2007, 2007 IEEE 9th Workshop on Multimedia Signal Processing.

[25]  Changsheng Xu,et al.  A Novel Framework for Semantic Annotation and Personalized Retrieval of Sports Video , 2008, IEEE Transactions on Multimedia.

[26]  Min Chen,et al.  Video Semantic Event/Concept Detection Using a Subspace-Based Multimedia Data Mining Framework , 2008, IEEE Transactions on Multimedia.

[27]  Qi Tian,et al.  A unified framework for semantic shot classification in sports video , 2005, IEEE Trans. Multim..

[28]  Ming Xu,et al.  Tracking football players with multiple cameras , 2004 .

[29]  M.H. Kolekar,et al.  Semantic Indexing of News Video Sequences: A Multimodal Hierarchical Approach Based on Hidden Markov Model , 2005, TENCON 2005 - 2005 IEEE Region 10 Conference.

[30]  Yan Li,et al.  Evaluating the performance of systems for tracking football players and ball , 2005, IEEE Conference on Advanced Video and Signal Based Surveillance, 2005..

[31]  Jean-Marc Odobez,et al.  Multi-modal audio-visual event recognition for football analysis , 2003, 2003 IEEE XIII Workshop on Neural Networks for Signal Processing (IEEE Cat. No.03TH8718).

[32]  Noel E. O'Connor,et al.  Event detection in field sports video using audio-visual features and a support vector Machine , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[33]  Anoop Gupta,et al.  Automatically extracting highlights for TV Baseball programs , 2000, ACM Multimedia.

[34]  Wen Gao,et al.  Human Behavior Analysis for Highlight Ranking in Broadcast Racket Sports Video , 2007, IEEE Transactions on Multimedia.

[35]  C.-C. Jay Kuo,et al.  Rule-based video classification system for basketball video indexing , 2000, MULTIMEDIA '00.

[36]  Richard J. Qian,et al.  Detecting semantic events in soccer games: towards a complete solution , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[37]  Tianli Yu,et al.  Retrieval of video clips using global motion information , 2001 .

[38]  Somnath Sengupta,et al.  Event-Importance Based Customized and Automatic Cricket Highlight Generation , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[39]  Ying Li,et al.  Multimedia database management systems , 1999, J. Vis. Commun. Image Represent..