Sports Type Determination Based on Keyword Spotting

This paper proposes a method to automatically determine the sports type of a sports game using KWS (keyword spotting) techniques. First, we develop an audio segmentation module as the front end to efficiently extract the announcer’s speech from the complex sports audio stream. Then we apply speech recognition to these speech segments to extract keywords, which serve as the features of each kind of sport. Finally, using the improved KWS results and the specific keywords selected for each kind of sport, classification is performed with a vote ranking strategy. To make KWS robust in our system, we adapt both the acoustic model and the language model. For the acoustic model, supervised adaptation is carried out using MAP (maximum a posteriori) estimation. For the language model, we propose a keyword-frequency-based adaptation. Both adaptations yield significant improvements in KWS performance. By integrating all of these techniques, we achieve a 100% accuracy rate in STD (sports type determination) when tested on 15 games covering seven kinds of sports.
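
The final classification step can be illustrated with a minimal sketch. It assumes that each spotted keyword casts one vote for every sport whose keyword list contains it, and that the sport ranked first by vote count is returned. The keyword lists, the function names `rank_sports` and `determine_sport`, and the unweighted one-vote-per-hit rule are all hypothetical choices made for illustration; the abstract does not specify the keyword sets or how votes are weighted.

```python
from collections import Counter

# Hypothetical keyword lists for illustration only; the keyword sets
# actually selected in the paper are not given in the abstract.
SPORT_KEYWORDS = {
    "basketball": {"rebound", "dunk", "three-pointer"},
    "football": {"goal", "offside", "corner"},
    "tennis": {"ace", "deuce", "serve"},
}


def rank_sports(detected_keywords, sport_keywords=SPORT_KEYWORDS):
    """Return (sport, votes) pairs sorted by vote count, highest first.

    Assumption: every keyword hit counts as exactly one vote for each
    sport whose keyword list contains it.
    """
    # Start every sport at zero so sports with no hits still appear.
    votes = Counter({sport: 0 for sport in sport_keywords})
    for word in detected_keywords:
        for sport, keywords in sport_keywords.items():
            if word in keywords:
                votes[sport] += 1
    return votes.most_common()


def determine_sport(detected_keywords, sport_keywords=SPORT_KEYWORDS):
    """Return the top-ranked sport for a list of spotted keywords."""
    return rank_sports(detected_keywords, sport_keywords)[0][0]


if __name__ == "__main__":
    # Keyword hits as they might come out of the KWS stage for one game.
    hits = ["goal", "serve", "offside", "corner", "goal"]
    print(rank_sports(hits))
    print(determine_sport(hits))
```

In this sketch a repeated keyword counts once per occurrence, so frequent keywords dominate the ranking; a real system might instead weight each vote by the KWS confidence score.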
