Structure analysis of sports video using domain models

In this paper, we present an effective framework for scene detection and structure analysis for sports videos, using tennis and baseball as examples. Sports video can be characterized by its predictable temporal syntax, recurrent events with consistent features, and a fixed number of views. Our approach combines domain-specific knowledge, supervised machine learning techniques, and automatic feature analysis at multiple levels. Real time processing performance is achieved by utilizing compressed-domain processing techniques. High accuracy in view recognition is achieved by using compressed-domain global features as prefilters and object-level refined analysis in the latter verification stage. Applications include high-level structure browsing/navigation, highlight generation, and mobile media filtering.

[1]  Qian Huang,et al.  Automated semantic structure reconstruction and representation generation for broadcast news , 1998, Electronic Imaging.

[2]  Shih-Fu Chang,et al.  Long-term moving object segmentation and tracking using spatio-temporal consistency , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[3]  Sanjeev R. Kulkarni,et al.  Automated analysis and annotation of basketball video , 1997, Electronic Imaging.

[4]  Qian Huang,et al.  Multimedia Search and Retrieval , 1999 .

[5]  Anil K. Jain,et al.  Automatic classification of tennis video for high-level content-based retrieval , 1998, Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database.