A Local Temporal Context-Based Approach for TV News Story Segmentation

Users are often interested in retrieving only a particular passage on a topic of interest to them. It is therefore necessary to split videos into shorter segments corresponding to appropriate retrieval units. We propose here a method based on a local temporal context for the segmentation of TV news videos into stories. First, we extract multiple descriptors which are complementary and give good insights about story boundaries. Once extracted, these descriptors are expanded with a local temporal context and combined by an early fusion process. The story boundaries are then predicted using machine learning techniques. We investigate the system by experiments conducted using TRECVID 2003 data and protocol of the story boundary detection task and we show that the extension of multimodal descriptors by a local temporal context approach improves results and our method outperforms the state of the art.

[1]  Jenny Benois-Pineau,et al.  The Argos Campaign: Evaluation of Video Analysis Tools , 2007, 2007 International Workshop on Content-Based Multimedia Indexing.

[2]  Frank Hopfgartner,et al.  Split and Merge Based Story Segmentation in News Videos , 2009, ECIR.

[3]  Jean-Luc Gauvain,et al.  The LIMSI Broadcast News transcription system , 2002, Speech Commun..

[4]  Bernard Mérialdo,et al.  Rushes video summarization and evaluation , 2009, Multimedia Tools and Applications.

[5]  Dominique Fohr,et al.  Speaker diarization using normalized cross likelihood ratio , 2007, INTERSPEECH.

[6]  Chin-Hui Lee,et al.  A Multi-Modal Approach to Story Segmentation for News Video , 2003, World Wide Web.

[7]  Georges Quénot,et al.  CLIPS at TRECVID : Shot Boundary Detection and Feature Detection , 2003, TRECVID.

[8]  Chin-Hui Lee,et al.  A detection-based approach to broadcast news video story segmentation , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[9]  Yiannis Kompatsiaris,et al.  TV Content Analysis: Techniques and Applications , 2011 .

[10]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[11]  John M. Gauch,et al.  Real Time Video Scene Detection and Classification , 1999, Inf. Process. Manag..

[12]  Paul Over,et al.  TRECVID-An Overview , 2003, TRECVID.

[13]  Frank Hopfgartner,et al.  TV News Story Segmentation Based on Semantic Coherence and Content Similarity , 2010, MMM.

[14]  Georges Quénot,et al.  From Text Detection in Videos to Person Identification , 2012, 2012 IEEE International Conference on Multimedia and Expo.

[15]  Shih-Fu Chang,et al.  Story boundary detection in large broadcast news video archives: techniques, experience and trends , 2004, MULTIMEDIA '04.

[16]  Jenny Benois-Pineau,et al.  The ARGOS campaign: Evaluation of video analysis and indexing tools , 2007, Signal Process. Image Commun..

[17]  Tat-Seng Chua,et al.  Story Boundary Detection in News Video using Global Rule Induction Technique , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[18]  Takeo Kanade,et al.  Neural Network-Based Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..