Audio keywords detection in basketball video

This paper presents an audio keywords detection method for highlight retrieval in basketball video. The keywords contain shoes squeaking sound, speech, cheer, long whistle and short whistle, which correspond to basketball game events. After feature analysis, the Simple Excellent Feature Combination based on Pearson Correlation Coefficient (SEFC-PCC) is used to select efficient features, which contributes to a preferable performance and lower computational complexity. A novel multi-stage SVM classifier is proposed to do the final detection of the five audio keywords. There are 428 audio sequences about 704 seconds used in the validation experiment; it gives a performance evaluation with average detection accuracy of 92%∼99%.

[1]  Xue-wen Chen An improved branch and bound algorithm for feature selection , 2003, Pattern Recognit. Lett..

[2]  Zhongzhe Xiao,et al.  Features extraction and selection for emotional speech classification , 2005, IEEE Conference on Advanced Video and Signal Based Surveillance, 2005..

[3]  Weibei Dou,et al.  Content-based Table Tennis Games Highlight Detection Utilizing Audiovisual Clues , 2007, Fourth International Conference on Image and Graphics (ICIG 2007).

[4]  Noel E. O'Connor,et al.  Event detection in field sports video using audio-visual features and a support vector Machine , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Anoop Gupta,et al.  Automatically extracting highlights for TV Baseball programs , 2000, ACM Multimedia.

[6]  Min Xu,et al.  Multimodal Semantic Analysis and Annotation for Basketball Video , 2006, EURASIP J. Adv. Signal Process..

[7]  Surya Nepal,et al.  Automatic detection of 'Goal' segments in basketball videos , 2001, MULTIMEDIA '01.

[8]  Nello Cristianini,et al.  Large Margin DAGs for Multiclass Classification , 1999, NIPS.

[9]  Hsuan-Tien Lin A Study on Sigmoid Kernels for SVM and the Training of non-PSD Kernels by SMO-type Methods , 2005 .

[10]  Zuoliang Cao,et al.  Omni-directional Vision Localization Based on Particle Filter , 2007, Fourth International Conference on Image and Graphics (ICIG 2007).

[11]  Chih-Jen Lin,et al.  Asymptotic Behaviors of Support Vector Machines with Gaussian Kernel , 2003, Neural Computation.

[12]  C.-C. Jay Kuo,et al.  Rule-based video classification system for basketball video indexing , 2000, MULTIMEDIA '00.

[13]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[14]  Regunathan Radhakrishnan,et al.  Audio events detection based highlights extraction from baseball, golf and soccer games in a unified framework , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[15]  Liu Guangbin,et al.  A novel Matching Pursuit algorithm with adaptive subdictionary , 2008, 2008 9th International Conference on Signal Processing.