Automatic Generation of Social Event Storyboard From Image Click-Through Data

Recent studies have shown that a noticeable percentage of Web search traffic is about social events. While traditional Web sites can only show human-edited events, in this paper, we present a novel system to automatically detect events from search log data and generate storyboards where the events are arranged chronologically. We chose image search log as the resource for event mining, as search logs can directly reflect people’s interests. To discover events from log data, we present a smooth nonnegative matrix factorization framework, which combines the information of query semantics, temporal correlations, search logs, and time continuity. Moreover, we consider the time factor to be an important element, since different events will develop in different time tendencies. In addition, to provide a media-rich and visually appealing storyboard, each event is associated with a set of representative photos arranged along a timeline. These relevant photos are automatically selected from image search results by analyzing image content features. We use celebrities as our test domain, which takes a large percentage of image search traffic. Experiments consisting of Web search traffic on 200 celebrities, for a period of six months, show very encouraging results compared with handcrafted editorial storyboards.

[1]  Shih-Fu Chang,et al.  Image Retrieval: Current Techniques, Promising Directions, and Open Issues , 1999, J. Vis. Commun. Image Represent..

[2]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[3]  Qi Tian,et al.  Multimedia search reranking: A literature survey , 2014, CSUR.

[4]  Sanjeev Arora,et al.  Learning Topic Models -- Going beyond SVD , 2012, 2012 IEEE 53rd Annual Symposium on Foundations of Computer Science.

[5]  Weisi Lin,et al.  Image Quality Assessment Based on Gradient Similarity , 2012, IEEE Transactions on Image Processing.

[6]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[7]  Tie-Yan Liu,et al.  Event detection from evolution of click-through data , 2006, KDD '06.

[8]  Slim Essid,et al.  Smooth Nonnegative Matrix Factorization for Unsupervised Audiovisual Document Structuring , 2013, IEEE Transactions on Multimedia.

[9]  Philip S. Yu,et al.  Time-dependent event hierarchy construction , 2007, KDD '07.

[10]  Yoshihiko Suhara,et al.  Event mining from the Blogosphere using topic words , 2007, ICWSM.

[11]  Dimitrios Gunopulos,et al.  Searching for events in the blogosphere , 2009, WWW '09.

[12]  Song Tan,et al.  CeleBrowser: An example of browsing big data on small device , 2014, ICMR.

[13]  Min-Chun Hu,et al.  Representative photo selection for restaurants in food blogs , 2015, 2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[14]  Wei-Ying Ma,et al.  Probabilistic query expansion using query logs , 2002, WWW '02.

[15]  Hai Leong Chieu,et al.  Query based event extraction along a timeline , 2004, SIGIR '04.

[16]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[17]  Lu Wang,et al.  Clustering query refinements by user intent , 2010, WWW '10.

[18]  Yiming Yang,et al.  Topic Detection and Tracking Pilot Study Final Report , 1998 .

[19]  Steffen Staab,et al.  Smart photo selection: interpret gaze as personal interest , 2014, CHI.

[20]  David B. Dunson,et al.  Probabilistic topic models , 2012, Commun. ACM.

[21]  Kai Zhang,et al.  Mining common topics from multiple asynchronous text streams , 2009, WSDM '09.

[22]  Tao Mei,et al.  Learning to personalize trending image search suggestion , 2014, SIGIR.

[23]  Xiao Zhang,et al.  QsRank: Query-sensitive hash code ranking for efficient ∊-neighbor search , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[24]  Tao Mei,et al.  Towards Cross-Domain Learning for Social Video Popularity Prediction , 2013, IEEE Transactions on Multimedia.

[25]  Amy R. Winebarger,et al.  Automatic Event Detection and Characterization of Solar Events with IRIS, SDO/AIA and Hi-C , 2016 .

[26]  Andrew McCallum,et al.  Topics over time: a non-Markov continuous-time model of topical trends , 2006, KDD '06.

[27]  Bu-Sung Lee,et al.  Event Detection in Twitter , 2011, ICWSM.

[28]  Ramesh C. Jain,et al.  Detecting events from continuous media by intermodal collaboration and knowledge use , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.

[29]  Meng Chang Chen,et al.  Using Incremental PLSI for Threshold-Resilient Online Event Analysis , 2008, IEEE Transactions on Knowledge and Data Engineering.

[30]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[31]  Maarten de Rijke,et al.  People searching for people: analysis of a people search engine log , 2011, SIGIR '11.

[32]  Thomas Hofmann,et al.  Probabilistic latent semantic indexing , 1999, SIGIR '99.

[33]  Shuangyong Song,et al.  Understanding a Celebrity with His Salient Events , 2010, AMT.

[34]  Bin Wang,et al.  A probabilistic model for retrospective news event detection , 2005, SIGIR '05.

[35]  Chao Liu,et al.  A probabilistic approach to spatiotemporal theme pattern mining on weblogs , 2006, WWW '06.

[36]  Claire Cardie,et al.  Timeline generation: tracking individuals on twitter , 2013, WWW.

[37]  Hui Xiong,et al.  Detecting and Tracking Topics and Events from Web Search Logs , 2012, TOIS.

[38]  Wei Chu,et al.  Modeling the impact of short- and long-term behavior on search personalization , 2012, SIGIR '12.

[39]  Yiming Yang,et al.  A study of retrospective and on-line event detection , 1998, SIGIR '98.

[40]  Noriaki Kawamae,et al.  Trend analysis model: trend consists of temporal words, topics, and timestamps , 2011, WSDM '11.