Beyond Shot Retrieval: Searching for Broadcast News Items Using Language Models of Concepts

Current video search systems commonly return video shots as results. We believe that users may better relate to longer, semantic video units and propose a retrieval framework for news story items, which consist of multiple shots. The framework is divided into two parts: (1) A concept based language model which ranks news items with known occurrences of semantic concepts by the probability that an important concept is produced from the concept distribution of the news item and (2) a probabilistic model of the uncertain presence, or risk, of these concepts. In this paper we use a method to evaluate the performance of story retrieval, based on the TRECVID shot-based retrieval groundtruth. Our experiments on the TRECVID 2005 collection show a significant performance improvement against four standard methods.

[1]  David Carmel,et al.  Spoken document retrieval from call-center conversations , 2006, SIGIR.

[2]  Paul Over,et al.  Evaluation campaigns and TRECVid , 2006, MIR '06.

[3]  Hwee Tou Ng,et al.  A lattice-based approach to query-by-example spoken document retrieval , 2008, SIGIR '08.

[4]  Alex Acero,et al.  Position Specific Posterior Lattices for Indexing Speech , 2005, ACL.

[5]  Alexander J. Smola,et al.  Advances in Large Margin Classifiers , 2000 .

[6]  Jun Wang,et al.  Mean-Variance Analysis: A New Document Ranking Theory in Information Retrieval , 2009, ECIR.

[7]  Christian Petersohn Fraunhofer HHI at TRECVID 2004: Shot Boundary Detection System , 2004, TRECVID.

[8]  Paul Over,et al.  Video shot boundary detection: Seven years of TRECVid activity , 2010, Comput. Vis. Image Underst..

[9]  Hal R. Varian,et al.  Economics and search , 1999, SIGF.

[10]  Javed A. Aslam,et al.  Models for metasearch , 2001, SIGIR '01.

[11]  Rong Yan,et al.  Semantic concept-based query expansion and re-ranking for multimedia retrieval , 2007, ACM Multimedia.

[12]  CHENGXIANG ZHAI,et al.  A study of smoothing methods for language models applied to information retrieval , 2004, TOIS.

[13]  Djoerd Hiemstra,et al.  Using language models for information retrieval , 2001 .

[14]  Marcel Worring,et al.  Concept-Based Video Retrieval , 2009, Found. Trends Inf. Retr..

[15]  Wei-Ying Ma,et al.  Image and Video Retrieval , 2003, Lecture Notes in Computer Science.

[16]  Marcel Worring,et al.  The challenge problem for automated detection of 101 semantic concepts in multimedia , 2006, MM '06.

[17]  Shih-Fu Chang,et al.  Video search reranking via information bottleneck principle , 2006, MM '06.

[18]  Donna K. Harman,et al.  Overview of the Ninth Text REtrieval Conference (TREC-9) , 2000, TREC.

[19]  John Platt,et al.  Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .

[20]  Djoerd Hiemstra,et al.  The Effectiveness of Concept Based Search for Video Retrieval , 2007, LWA.

[21]  Alan F. Smeaton,et al.  Everyday concept detection in visual lifelogs: validation, relationships and trends , 2010, Multimedia Tools and Applications.

[22]  Alan F. Smeaton,et al.  A Comparison of Score, Rank and Probability-Based Fusion Methods for Video Shot Retrieval , 2005, CIVR.

[23]  Djoerd Hiemstra,et al.  Reusing annotation labor for concept selection , 2009, CIVR '09.