Affective sports highlight detection

This paper explores a psychological attention approach for sports highlight detection. A multiresolution autoregressive algorithm is proposed to fuse misaligned audio-visual time sequences and estimate an unified attention curve. Game highlights are found by ranking attention intensity; content-based events are filtered out by allocating local attention peaks. The test bed includes six complete football games from World Cup 2002, 2006 and Champion League 2006, and two content suppliers, BBC and ITV. Two evaluations are presented, the comparison on average attention and event attention, and the ranking of goal events. Experiments show this fusion framework is robust on different data collections.

[1]  Nuno Vasconcelos,et al.  Bayesian Video Shot Segmentation , 2000, NIPS.

[2]  Joemon M. Jose,et al.  Football Video Segmentation Based on Video Production Strategy , 2005, ECIR.

[3]  A. Treisman,et al.  Perceiving visually presented objets: recognition, awareness, and modularity , 1998, Current Opinion in Neurobiology.

[4]  Anoop Gupta,et al.  Automatically extracting highlights for TV Baseball programs , 2000, ACM Multimedia.

[5]  Alan Hanjalic,et al.  Adaptive extraction of highlights from a sport video based on excitement modeling , 2005, IEEE Transactions on Multimedia.

[6]  Svetha Venkatesh,et al.  Horror film genre typing and scene labeling via audio analysis , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[7]  A. Willsky Multiresolution Markov models for signal and image processing , 2002, Proc. IEEE.

[8]  Mike Graham,et al.  Extracting information about emotions in films , 2003, ACM Multimedia.

[9]  Loong Fah Cheong,et al.  Affective understanding in film , 2006, IEEE Trans. Circuits Syst. Video Technol..

[10]  Lihao Xu,et al.  Affective video content repression and model , 2005 .

[11]  Mohan S. Kankanhalli,et al.  Goal detection in soccer video using audio/visual keywords , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[12]  K. C. Chou,et al.  Multiscale recursive estimation, data fusion, and regularization , 1994, IEEE Trans. Autom. Control..

[13]  Michael S. Lew,et al.  Principles of Visual Information Retrieval , 2001, Advances in Pattern Recognition.

[14]  Chng Eng Siong,et al.  Automatic replay generation for soccer video broadcasting , 2004, MULTIMEDIA '04.

[15]  J. M. Kittross The measurement of meaning , 1959 .

[16]  Patrick Gros,et al.  HMM based structuring of tennis videos using visual and audio cues , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[17]  Joemon M. Jose,et al.  Attention guided football video content recommendation on mobile devices , 2006, MobiMedia '06.

[18]  Riccardo Leonardi,et al.  Semantic indexing of soccer audio-visual sequences: a multimodal approach based on controlled Markov chains , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[19]  A. Murat Tekalp,et al.  Automatic soccer video analysis and summarization , 2003, IEEE Trans. Image Process..