An integrated baseball digest system using maximum entropy method

In this paper, we propose a novel system that is able to automatically detect and classify highlights from baseball game videos in TV broadcast. The digest system gives complete indexes of a baseball game which cover all of the status changes in a game. We achieve this by seamlessly integrating image, audio and speech clues using a maximum entropy based method. What distinguishes our system from previous ones is that we emphasize on the integration of multimedia features and the acquisition of domain knowledge through machine learning process. Integration of multimedia features is important because with the current state-of-the-art image and audio analysis techniques, most image and audio features we can extract from videos are very low level, and detecting/classifying sports game highlights based on features from single medium are doomed to yield poor performances. Acquiring domain knowledge through learning process is preferred over heuristic rules because machine learning process is more powerful for discovering and expressing domain knowledge. We perform extensive experiments on game videos including various stadiums, teams and broadcasted by different TV stations.

[1]  Anoop Gupta,et al.  Automatically extracting highlights for TV Baseball programs , 2000, ACM Multimedia.

[2]  Wenjun Zeng,et al.  Integrated image and speech analysis for content-based video indexing , 1996, Proceedings of the Third IEEE International Conference on Multimedia Computing and Systems.

[3]  Riccardo Leonardi,et al.  Event recognition in sport programs using low-level motion indices , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[4]  Peter J. L. van Beek,et al.  Detection of slow-motion replay segments in sports video for highlights generation , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[5]  Shih-Fu Chang,et al.  Algorithms and system for segmentation and structure analysis in soccer video , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[6]  Kazuyoshi Yoshino,et al.  Qualitative image analysis of group behaviour , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Noboru Babaguchi,et al.  Towards abstracting sports video by highlights , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[8]  HongJiang Zhang,et al.  Automatic parsing of TV soccer programs , 1995, Proceedings of the International Conference on Multimedia Computing and Systems.

[9]  Shih-Fu Chang,et al.  Automatic selection of visual features and classifiers , 1999, Electronic Imaging.

[10]  Shih-Fu Chang,et al.  Structure analysis of sports video using domain models , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[11]  J. Darroch,et al.  Generalized Iterative Scaling for Log-Linear Models , 1972 .