A framework for event detection in field-sports video broadcasts based on SVM generated audio-visual feature model. Case-study: soccer video

In this paper we propose a novel audio-visual feature-based framework for event detection in broadcast video of field sports. The system is evaluated in a case study on MPEG-encoded soccer video. Specifically, the evidence gathered by various feature detectors is combined by a learning algorithm, a support vector machine (SVM), which infers the occurrence of an event from a model generated during a training phase on a corpus of 25 hours of content. The system is then evaluated on 25 hours of separate test content. The results show that, for this case study, both high precision and high recall are achievable.
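
The fusion step described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the individual feature detectors, the SVM kernel, or the training procedure, so the three per-shot features (crowd audio level, close-up confidence, motion activity), the toy data, and the linear SVM trained by subgradient descent are all assumptions made for illustration. The sketch shows detector outputs being combined into one feature vector per shot, a model being learned from labelled training shots, and precision and recall being computed for a set of predictions.

```python
def train_linear_svm(X, y, lr=0.1, lam=0.01, epochs=200):
    """Fit weights w and bias b by subgradient descent on the
    L2-regularised hinge loss (a simple linear SVM, assumed here)."""
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            score = sum(wj * xj for wj, xj in zip(w, xi)) + b
            # L2 shrinkage of the weights (the bias is not regularised).
            w = [wj * (1.0 - lr * lam) for wj in w]
            if yi * score < 1.0:
                # Margin violated: shift the hyperplane toward xi's label.
                w = [wj + lr * yi * xj for wj, xj in zip(w, xi)]
                b += lr * yi
    return w, b


def predict(w, b, x):
    """Return +1 (event) or -1 (no event) for one feature vector."""
    score = sum(wj * xj for wj, xj in zip(w, x)) + b
    return 1 if score >= 0.0 else -1


def precision_recall(y_true, y_pred):
    """Precision and recall for the positive (event) class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == -1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == -1)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Hypothetical per-shot detector outputs, each scaled to [0, 1]:
# [crowd audio level, close-up confidence, motion activity]
X_train = [
    [0.90, 0.80, 0.90], [0.80, 0.90, 0.70], [0.85, 0.70, 0.80],  # event shots
    [0.10, 0.20, 0.10], [0.20, 0.10, 0.30], [0.15, 0.30, 0.20],  # non-event shots
]
y_train = [1, 1, 1, -1, -1, -1]

# Training phase: learn the model from labelled shots.
w, b = train_linear_svm(X_train, y_train)

# Test phase: classify unseen shots and score the predictions.
X_test = [[0.95, 0.90, 0.90], [0.10, 0.10, 0.10]]
y_test = [1, -1]
y_pred = [predict(w, b, x) for x in X_test]
precision, recall = precision_recall(y_test, y_pred)
```

A real system would replace the toy vectors with the outputs of actual audio and visual detectors and would likely use an established SVM library; the plain-Python trainer is used here only to keep the sketch self-contained.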
