A comparison of global rule induction and HMM approaches on extracting story boundaries in news video

This paper presents a multi-modal two-level framework for news story segmentation designed to cope with large news video corpus such as the data used in TREC video retrieval (TRECVID) evaluations. We divide our system into two levels: shot level that assigns one of the pre-defined semantic tags to each input shot; and story level that performs story segmentation based on the output of the shot level and other temporal features. We demonstrate the generality of our framework by employing two machine-learning approaches at the story level. The first approach employs a statistical method called Hidden Markov Models (HMM) whereas the second uses a rule induction technique. We tested both approaches on ~ 120 hours of news video provided by TRECVID 2003. The results demonstrate that our 2-level machine-learning framework is effective and is adequate to cope with large-scale practical problems.

[1]  Jing Xiao,et al.  A global rule induction approach to information extraction , 2003, Proceedings. 15th IEEE International Conference on Tools with Artificial Intelligence.

[2]  Robert Dale,et al.  Handbook of Natural Language Processing , 2001, Computational Linguistics.

[3]  Ichiro Ide,et al.  Automatic Video Indexing Based on Shot Classification , 1998, AMCP.

[4]  Michael J. Witbrock,et al.  Story segmentation and detection of commercials in broadcast news video , 1998, Proceedings IEEE International Forum on Research and Technology Advances in Digital Libraries -ADL'98-.

[5]  Michael I. Jordan Learning in Graphical Models , 1999, NATO ASI Series.

[6]  Marti A. Hearst Multi-Paragraph Segmentation Expository Text , 1994, ACL.

[7]  Mark T. Maybury,et al.  Broadcast news navigation using story segmentation , 1997, MULTIMEDIA '97.

[8]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[9]  Yiming Yang,et al.  Topic Detection and Tracking Pilot Study Final Report , 1998 .