Event-Based Sports Videos Classification Using HMM Framework

Sports video classification is an important application of video content analysis. Event detection or recognition in sports video is an important task in semantic understanding of video content. In this paper, we propose a framework based on hidden Markov models (HMM) to represent a video as a sequence of core events that occur in a particular sport. The objective is to observe a subset of (hidden) state sequences to see whether they can be interpreted as the core events of that sport. We propose a method for sports video classification based on the events in each sport category. The proposed method for detection of events is based on a subset of state sequences, unlike the traditional way of computing the likelihood as a sum or maximum overall possible state sequences. The neighboring frames are considered for the computation of event probability at any instant of time. We demonstrate the application of this framework to five sport genre types, namely basketball, cricket, football, tennis, and volleyball.

[1]  L. Rabiner,et al.  An introduction to hidden Markov models , 1986, IEEE ASSP Magazine.

[2]  William J. Christmas,et al.  Tracking the Evolution of a Tennis Match Using Hidden Markov Models , 2004, SSPR/SPR.

[3]  Fei-Fei Li,et al.  Learning latent temporal structure for complex event detection , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  A. Murat Tekalp,et al.  Automatic soccer video analysis and summarization , 2003, IEEE Trans. Image Process..

[5]  Aaron F. Bobick,et al.  Recognition of Visual Activities and Interactions by Stochastic Parsing , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Noel E. O'Connor,et al.  Event detection in field sports video using audio-visual features and a support vector Machine , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[8]  Chung-Hsien Wu,et al.  Error Weighted Semi-Coupled Hidden Markov Model for Audio-Visual Emotion Recognition , 2012, IEEE Transactions on Multimedia.

[9]  Baoxin Li,et al.  Semantic sports video analysis: approaches and new applications , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[10]  Lei Wu,et al.  Enhancing Bag-of-Words Models with Semantics-Preserving Metric Learning , 2011, IEEE MultiMedia.

[11]  Chong-Wah Ngo,et al.  Mining Event Structures from Web Videos , 2011, IEEE MultiMedia.

[12]  Anil C. Kokaram,et al.  Semantic Event Detection in Sports Through Motion Understanding , 2004, CIVR.

[13]  William J. Christmas,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence 1 Layered Data Association Using Graph-theoretic Formulation with Application to Tennis Ball Tracking in Monocular Sequences , 2022 .

[14]  Shiqiang Yang,et al.  An HMM-based framework for video semantic analysis , 2005, IEEE Trans. Circuits Syst. Video Technol..

[15]  Özgür Ulusoy,et al.  Automatic detection of salient objects and spatial relations in videos for a video database system , 2008, Image Vis. Comput..

[16]  David S. Doermann,et al.  Sports video classification using HMMS , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[17]  Mei-Chen Yeh,et al.  Fast Visual Retrieval Using Accelerated Sequence Matching , 2011, IEEE Transactions on Multimedia.

[18]  Ling Shao,et al.  Boosted key-frame selection and correlated pyramidal motion-feature representation for human action recognition , 2013, Pattern Recognit..

[19]  Stefanos D. Kollias,et al.  Adaptive rule-based recognition of events in video sequences , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[20]  Rama Chellappa,et al.  Interpretation of state sequences in HMM for activity representation , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[21]  Gang Hua,et al.  Semantic Model Vectors for Complex Video Event Recognition , 2012, IEEE Transactions on Multimedia.

[22]  Surya Nepal,et al.  Edge-based semantic classification of sports video sequences , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[23]  A. Enis Çetin,et al.  HMM Based Falling Person Detection Using Both Audio and Video , 2005, 2006 IEEE 14th Signal Processing and Communications Applications.

[24]  B. Ugur Toreyin,et al.  Ses ve video işaretlerinde saklı markof modeli tabanlı düşen kişi tespiti , 2006 .

[25]  Chong-Wah Ngo,et al.  Summarizing Rushes Videos by Motion, Object, and Event Understanding , 2012, IEEE Transactions on Multimedia.

[26]  Denis Pellerin,et al.  State Filtering and Change Detection Using TBM Conflict Application to Human Action Recognition in Athletics Videos , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[27]  Alberto Del Bimbo,et al.  Taking into Consideration Sports Semantic Annotation of Sports Videos Content-based Multimedia Indexing and Retrieval , 2002 .

[28]  M. Pawlewski,et al.  Motion-based classification of cartoons , 2001, Proceedings of 2001 International Symposium on Intelligent Multimedia, Video and Speech Processing. ISIMP 2001 (IEEE Cat. No.01EX489).