Combining Short and Long Term Audio Features for TV Sports Highlight Detection

As a carrier of high-level semantics, the audio signal is increasingly used in content-based multimedia retrieval. In this paper, we investigate highlight detection in TV tennis broadcasts using both short-term and long-term audio features, and we propose two approaches for combining the two kinds of features: decision fusion and a hierarchical classifier. Because the combination brings more information into the decision, the overall performance of the system improves.
