A cross-media evolutionary timeline generation framework based on iterative recommendation

Summarization methods such as timelines have greatly helped people to understand all kinds of news events within limited time. However, there are few studies probing into cross-media summarization, for example, generating timelines which contain both texts and images that can reinforce each other. In this paper, we tackle this important and challenging problem by proposing a novel solution. Specifically, we first reveal three requisite characteristics of an ideal image-text timeline. With the idea of recommendation, all these requisites will be modeled respectively, and fused compactly in a unified cross-media framework. Finally, we put all sentences and images into either the schema of referrer or the schema of recommended candidate, and the former recommends the latter. After changing their roles iteratively, we can achieve the optimal timelines which will significantly improve user experience and satisfaction. Experiments on real-world datasets show that the timelines generated by our framework outperform several competitive baselines.

[1]  Yansong Feng,et al.  Topic Models for Image Annotation and Text Illustration , 2010, HLT-NAACL.

[2]  David Evans,et al.  Tracking and summarizing news on a daily basis with Columbia's Newsblaster , 2002 .

[3]  Xian-Sheng Hua,et al.  Interactive browsing via diversified visual summarization for image search results , 2011, Multimedia Systems.

[4]  Dragomir R. Radev,et al.  Centroid-based summarization of multiple documents , 2004, Inf. Process. Manag..

[5]  Edward Y. Chang,et al.  Optimal multimodal fusion for multimedia data analysis , 2004, MULTIMEDIA '04.

[6]  M. Volman,et al.  The Web as an Information Resource in K–12 Education: Strategies for Supporting Students in Searching and Processing Information , 2005 .

[7]  Vikrant Gupta,et al.  A Statistical Approach for Automatic Text Summarization by Extraction , 2011, 2011 International Conference on Communication Systems and Network Technologies.

[8]  Yan Zhang,et al.  Timeline Generation through Evolutionary Trans-Temporal Summarization , 2011, EMNLP.

[9]  Wei-Ying Ma,et al.  VIPS: a Vision-based Page Segmentation Algorithm , 2003 .

[10]  Kentaro Toyama,et al.  Effects of integrating digital visual materials with textbook scans in the classroom , 2009 .

[11]  Xiaojun Wan,et al.  Multi-document summarization using cluster-based link analysis , 2008, SIGIR '08.

[12]  Fuji Ren,et al.  GA, MR, FFNN, PNN and GMM based models for automatic text summarization , 2009, Comput. Speech Lang..

[13]  Marcel Worring,et al.  Personalizing automated image annotation using cross-entropy , 2011, ACM Multimedia.

[14]  Pu-Jen Cheng,et al.  Visualizing timelines: evolutionary summarization via iterative reinforcement between text and image streams , 2012, CIKM.

[15]  Ramiz M. Aliguliyev,et al.  A new sentence similarity measure and sentence based extractive technique for automatic text summarization , 2009, Expert Syst. Appl..

[16]  Yangqiu Song,et al.  ImageHive: Interactive Content-Aware Image Summarization , 2012, IEEE Computer Graphics and Applications.

[17]  Benjamin Gleason,et al.  #Occupy Wall Street , 2013 .

[18]  Tao Jiang,et al.  Discovering Image-Text Associations for Cross-Media Web Information Fusion , 2006, PKDD.

[19]  Thomas Mandl Vague Transformations in Information Retrieval , 1998, ISI.

[20]  Rada Mihalcea,et al.  A Language Independent Algorithm for Single and Multiple Document Summarization , 2005, IJCNLP.

[21]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[22]  Eduard H. Hovy,et al.  Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics , 2003, NAACL.

[23]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[24]  Roger Levy,et al.  A new approach to cross-modal multimedia retrieval , 2010, ACM Multimedia.

[25]  Sreenivas Gollapudi,et al.  Enriching textbooks with images , 2011, CIKM '11.

[26]  Shuicheng Yan,et al.  Efficient large-scale image annotation by probabilistic collaborative multi-label propagation , 2010, ACM Multimedia.

[27]  Jia Chen,et al.  Visual Contextual Advertising: Bringing Textual Advertisements to Images , 2010, AAAI.