Automatic Sports Video Analysis using Audio Clues and Context Knowledge

Sports analysis has recently become popular in research and professional applications. This paper presents a scheme for automatic sports video analysis based on audio clues and specific game context knowledge. We propose a simple, two-step racket-hit detection for achieving accurate event classification for tennis video. To implement the mapping between the sample-level feature space and the semantic-level space, we employ heuristic rules based on specific knowledge of the tennis game. Experimental results have shown that the proposed system can reliably detect the racket hit (at about 90%) and identify meaningful events such as rally, scoring, different types of service, and return. Our system can be operated stand-alone or combined with video analysis and then used for effective and automatic extraction of various tennis events and analysis of tactics with high reliability.

[1]  Regunathan Radhakrishnan,et al.  Audio events detection based highlights extraction from baseball, golf and soccer games in a unified framework , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[2]  Hisashi Miyamori,et al.  Video annotation for content-based retrieval using human behavior analysis and domain knowledge , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[3]  Jungong Han,et al.  An automatic analyzer for sports video databases using visual cues and real-world modeling , 2006, 2006 Digest of Technical Papers International Conference on Consumer Electronics.

[4]  Qi Tian,et al.  A fusion scheme of visual and auditory modalities for event detection in sports video , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[5]  Anil K. Jain,et al.  Automatic classification of tennis video for high-level content-based retrieval , 1998, Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database.

[6]  Zhu Liu,et al.  Multimedia content analysis-using both audio and visual clues , 2000, IEEE Signal Process. Mag..

[7]  Anil C. Kokaram,et al.  Joint audio visual retrieval for tennis broadcasts , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[8]  A. Murat Tekalp,et al.  Automatic soccer video analysis and summarization , 2003, IEEE Trans. Image Process..