Semantic Event Detection in Structured Video Using Hybrid HMM/SVM

In this paper, we propose a new algorithm for detecting semantic events in structured video. The method is a hybrid that combines Hidden Markov Models (HMM) with Support Vector Machines (SVM): the HMM models the temporal structure of an event, while the SVM provides high classification accuracy. Compared with an HMM-based method, the proposed method improves both the recall and the precision of semantic event detection.
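The abstract does not say how the HMM and the SVM are coupled. As a minimal sketch of one common coupling, assumed here and not taken from the paper, per-shot class posteriors from a calibrated classifier (for example an SVM with probabilistic outputs) can replace the HMM emission likelihoods, and Viterbi decoding then imposes the temporal structure. The function name `viterbi`, the two-state layout (0 = non-event, 1 = event), and all probability values below are hypothetical.

```python
import math


def viterbi(posteriors, priors, trans, init):
    """Return the most likely state sequence for a run of video shots.

    posteriors[t][s]: P(state s | shot t), assumed to come from a
        calibrated classifier such as an SVM (hypothetical input).
    priors[s]: class prior P(s), used to turn posteriors into scaled
        likelihoods, since P(x | s) is proportional to P(s | x) / P(s).
    trans[i][j]: P(state j at t+1 | state i at t).
    init[s]: P(state s at t = 0).
    All probabilities are assumed to be strictly positive.
    """
    n_states = len(init)

    def emit(t, s):
        # Log of the scaled likelihood for shot t in state s.
        return math.log(posteriors[t][s]) - math.log(priors[s])

    score = [math.log(init[s]) + emit(0, s) for s in range(n_states)]
    back = []  # back[t-1][j] = best predecessor of state j at time t
    for t in range(1, len(posteriors)):
        prev = score
        score, ptr = [], []
        for j in range(n_states):
            best = max(range(n_states),
                       key=lambda i: prev[i] + math.log(trans[i][j]))
            ptr.append(best)
            score.append(prev[best] + math.log(trans[best][j]) + emit(t, j))
        back.append(ptr)

    # Backtrack from the best final state.
    state = max(range(n_states), key=lambda s: score[s])
    path = [state]
    for ptr in reversed(back):
        state = ptr[state]
        path.append(state)
    return path[::-1]


# Hypothetical posteriors for five shots; shot 2 is a noisy dip.
posteriors = [[0.9, 0.1], [0.1, 0.9], [0.6, 0.4], [0.1, 0.9], [0.9, 0.1]]
priors = [0.5, 0.5]
trans = [[0.8, 0.2], [0.2, 0.8]]  # states tend to persist
init = [0.5, 0.5]

per_shot = [max(range(2), key=lambda s: p[s]) for p in posteriors]
decoded = viterbi(posteriors, priors, trans, init)
print(per_shot)  # [0, 1, 0, 1, 0]
print(decoded)   # [0, 1, 1, 1, 0]
```

In this toy input the per-shot decision splits one event into two, while decoding with persistent transitions recovers a single contiguous event. That is the kind of benefit the abstract attributes to combining temporal modeling with a strong classifier, though the paper's actual architecture may differ.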
