Multilevel analysis of sports video sequences

We propose a fully automatic and flexible framework for analysis and summarization of tennis broadcast video sequences, using visual features and specific game-context knowledge. Our framework can analyze a tennis video sequence at three levels, which provides a broad range of different analysis results. The proposed framework includes novel pixel-level and object-level tennis video processing algorithms, such as a moving-player detection taking both the color and the court (playing-field) information into account, and a player-position tracking algorithm based on a 3-D camera model. Additionally, we employ scene-level models for detecting events, like service, base-line rally and net-approach, based on a number real-world visual features. The system can summarize three forms of information: (1) all court-view playing frames in a game, (2) the moving trajectory and real-speed of each player, as well as relative position between the player and the court, (3) the semantic event segments in a game. The proposed framework is flexible in choosing the level of analysis that is desired. It is effective because the framework makes use of several visual cues obtained from the real-world domain to model important events like service, thereby increasing the accuracy of the scene-level analysis. The paper presents attractive experimental results highlighting the system efficiency and analysis capabilities.

[1]  Andrew P. Bradley,et al.  Player Tracking and Stroke Recognition in Tennis Video , 2003 .

[2]  Jungong Han,et al.  Automatic tracking method for sports video analysis , 2005 .

[3]  Yves Jean,et al.  Real time tracking for enhanced tennis broadcasts , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[4]  Peter H. N. de With,et al.  Fast camera calibration for the analysis of sport sequences , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[5]  Azriel Rosenfeld,et al.  Tracking Groups of People , 2000, Comput. Vis. Image Underst..

[6]  Alessandro Micarelli,et al.  Automatic Annotation of Tennis Video Sequences , 2002, DAGM-Symposium.

[7]  Anil C. Kokaram,et al.  Classification and representation of semantic content in broadcast tennis videos , 2005, IEEE International Conference on Image Processing 2005.

[8]  Jenny Benois-Pineau,et al.  Real-Time and Distributed AV Content Analysis System for Consumer Electronics Networks , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[9]  Alberto Del Bimbo,et al.  Soccer highlights detection and recognition using HMMs , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[10]  Patrick Gros,et al.  Temporal structure analysis of broadcast tennis video using hidden Markov models , 2003, IS&T/SPIE Electronic Imaging.

[11]  Joseph J. LaViola,et al.  An experiment comparing double exponential smoothing and Kalman filter-based predictive tracking algorithms , 2003, IEEE Virtual Reality, 2003. Proceedings..

[12]  Shih-Fu Chang,et al.  Real-time content-based adaptive streaming of sports videos , 2001, Proceedings IEEE Workshop on Content-Based Access of Image and Video Libraries (CBAIVL 2001).

[13]  Anil K. Jain,et al.  Automatic classification of tennis video for high-level content-based retrieval , 1998, Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database.

[14]  Richard J. Qian,et al.  Detecting semantic events in soccer games: towards a complete solution , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[15]  HongJiang Zhang,et al.  Automatic parsing of TV soccer programs , 1995, Proceedings of the International Conference on Multimedia Computing and Systems.