Story segmentation in TV news broadcast

Segmentation of TV news broadcast into semantically meaningful stories is an essential pre-requisite for a wide range of video analytics applications. In this work we have introduced a hybrid approach for news story segmentation based on conditional random fields (CRFs). The story boundary detection problem is converted into a shot classification problem by classifying video shots into either of the four categories. These are start shot, end shot and middle shots of a story or single shot story. To achieve this classification, we have introduced two new features. These are overlay text based semantic similarity and grid-wise edge orientation histogram. The first feature measures the semantic similarity between video shots by linking them through a set of web news articles. We use overlay text with their relevance as weight to link a set of articles with the video shots. The second feature captures the variations in presentation formats. The CRF model effectively combines these two features to model the news stories. Experimental results on approximately 50 hours of news videos demonstrates the efficiency of the proposed features. We were able to achieve an F1 score of 81% with our proposed features.

[1]  Bin Ma,et al.  Broadcast news story segmentation using latent topics on data manifold , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[2]  Frank Hopfgartner,et al.  TV News Story Segmentation Based on Semantic Coherence and Content Similarity , 2010, MMM.

[3]  Shih-Fu Chang,et al.  Story boundary detection in large broadcast news video archives: techniques, experience and trends , 2004, MULTIMEDIA '04.

[4]  Marie-Francine Moens,et al.  News Story Segmentation in Multiple Modalities , 2009, CBMI.

[5]  Takeo Kanade,et al.  Spotting by Association in News Video , 1997 .

[6]  Vincent Claveau,et al.  Topic segmentation of TV-streams by watershed transform and vectorization , 2015, Comput. Speech Lang..

[7]  Marti A. Hearst Multi-Paragraph Segmentation Expository Text , 1994, ACL.

[8]  João Paulo da Silva Neto,et al.  Audio segmentation, classification and clustering in a broadcast news task , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[9]  Lei Xie,et al.  Measuring semantic similarity by contextualword connections in Chinese news story segmentation , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[10]  Alan F. Smeaton,et al.  Dublin City University Video Track Experiments for TREC 2002 , 2001, TREC.

[11]  Hugh E. Williams,et al.  RMIT University at TRECVID 2004 , 2004, TRECVID.

[12]  Bo Xu,et al.  A general Framework of video segmentation to logical unit based on conditional random fields , 2013, ICMR '13.

[13]  Bo Xu,et al.  Multi-modal information fusion for news story segmentation in broadcast video , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[14]  Paul Over,et al.  TRECVID 2004 - An Overview , 2004, TRECVID.

[15]  Yiannis Kompatsiaris,et al.  On the use of audio events for improving video scene segmentation , 2010, 11th International Workshop on Image Analysis for Multimedia Interactive Services WIAMIS 10.

[16]  Jong Wook Kim,et al.  Effectively Detecting Topic Boundaries in a News Video by Using Wikipedia , 2014 .

[17]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[18]  Omar Javed,et al.  University of Central Florida at TRECVID 2004 , 2003, TRECVID.

[19]  Yiannis Demiris,et al.  The Infinite-Order Conditional Random Field Model for Sequential Data Modeling , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[21]  Noel E. O'Connor,et al.  TV news story segmentation, personalisation and recommendation , 2003 .

[22]  R. Smith,et al.  An Overview of the Tesseract OCR Engine , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[23]  Raghvendra Kannao,et al.  Overlay text extraction from TV news broadcast , 2015, 2015 Annual IEEE India Conference (INDICON).

[24]  Walid Mahdi,et al.  Automatic topics segmentation for TV news video using prior knowledge , 2015, Multimedia Tools and Applications.

[25]  Delphine Charlet,et al.  Fusion of speaker and lexical information for topic segmentation: A co-segmentation approach , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[26]  Rong Zheng,et al.  Multiple style exploration for story unit segmentation of broadcast news video , 2013, Multimedia Systems.

[27]  Mubarak Shah,et al.  Story Segmentation in News Videos Using Visual and Text Cues , 2005, CIVR.

[28]  Thomas G. Dietterich Machine Learning for Sequential Data: A Review , 2002, SSPR/SPR.