DancingLines: An Analytical Scheme to Depict Cross-Platform Event Popularity

Nowadays, events usually burst and are propagated online through multiple modern media like social networks and search engines. There exists various research discussing the event dissemination trends on individual medium, while few studies focus on event popularity analysis from a cross-platform perspective. Challenges come from the vast diversity of events and media, limited access to aligned datasets across different media and a great deal of noise in the datasets. In this paper, we design DancingLines, an innovative scheme that captures and quantitatively analyzes event popularity between pairwise text media. It contains two models: TF-SW, a semantic-aware popularity quantification model, based on an integrated weight coefficient leveraging Word2Vec and TextRank; and wDTW-CD, a pairwise event popularity time series alignment model matching different event phases adapted from Dynamic Time Warping. We also propose three metrics to interpret event popularity trends between pairwise social platforms. Experimental results on eighteen real-world event datasets from an influential social network and a popular search engine validate the effectiveness and applicability of our scheme. DancingLines is demonstrated to possess broad application potentials for discovering the knowledge of various aspects related to events and different media.

[1]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[2]  Huajiao Li,et al.  Breaking news dissemination in the media via propagation behavior based on complex network theory , 2016 .

[3]  Rada Mihalcea,et al.  TextRank: Bringing Order into Text , 2004, EMNLP.

[4]  Xiting Wang,et al.  Tracking Idea Flows between Social Groups , 2015, AAAI.

[5]  Can Wang,et al.  Labelling Topics in Weibo Using Word Embedding and Graph-Based Method , 2016, 2016 International Conference on Information Systems Engineering (ICISE).

[6]  Hong Cheng,et al.  A Model-Free Approach to Infer the Diffusion Network from Event Cascade , 2016, CIKM.

[7]  Philip S. Yu,et al.  Extracting social events for learning better information diffusion models , 2013, KDD.

[8]  M. Osborne,et al.  Bieber no more : First Story Detection using Twitter and Wikipedia , 2012 .

[9]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[10]  Leandro Nunes de Castro,et al.  A keyword extraction method from twitter messages represented as graphs , 2014, Appl. Math. Comput..

[11]  Xiaoping Zhou,et al.  Cross-Platform Identification of Anonymous Identical Users in Multiple Social Media Networks , 2016, IEEE Transactions on Knowledge and Data Engineering.

[12]  Chris Callison-Burch,et al.  WikiTopics: What is Popular on Wikipedia and Why , 2011 .

[13]  Lei Chen,et al.  Event detection over twitter social media streams , 2013, The VLDB Journal.

[14]  Yan Tang,et al.  ESAP: A Novel Approach for Cross-Platform Event Dissemination Trend Analysis Between Social Network and Search Engine , 2016, WISE.

[15]  Moses Charikar,et al.  Similarity estimation techniques from rounding algorithms , 2002, STOC '02.

[16]  Eamonn Keogh,et al.  On the effect of endpoints on dynamic time warping , 2016 .

[17]  M. Newman Power laws, Pareto distributions and Zipf's law , 2005 .

[18]  Yang Song,et al.  Topical Keyphrase Extraction from Twitter , 2011, ACL.

[19]  Piotr Indyk,et al.  Approximate nearest neighbors: towards removing the curse of dimensionality , 1998, STOC '98.

[20]  Laks V. S. Lakshmanan,et al.  KeySee: supporting keyword search on evolving events in social streams , 2013, KDD.

[21]  Eamonn J. Keogh,et al.  Semi-Supervision Dramatically Improves Time Series Clustering under Dynamic Time Warping , 2016, CIKM.

[22]  Rui Li,et al.  TEDAS: A Twitter-based Event Detection and Analysis System , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[23]  Peter Nijkamp,et al.  Accessibility of Cities in the Digital Economy , 2004, cond-mat/0412004.

[24]  Yong Yu,et al.  ASNets: A Benchmark Dataset of Aligned Social Networks for Cross-Platform User Modeling , 2016, CIKM.

[25]  Salvatore Orlando,et al.  A Study on Microblog and Search Engine User Behaviors: How Twitter Trending Topics Help Predict Google Hot Queries , 2013 .

[26]  Eamonn J. Keogh,et al.  Derivative Dynamic Time Warping , 2001, SDM.

[27]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[28]  Olufemi A. Omitaomu,et al.  Weighted dynamic time warping for time series classification , 2011, Pattern Recognit..

[29]  Gilberto Câmara,et al.  A Time-Weighted Dynamic Time Warping Method for Land-Use and Land-Cover Mapping , 2016, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[30]  Philip S. Yu,et al.  Link Prediction across Aligned Networks with Sparse and Low Rank Matrix Estimation , 2017, 2017 IEEE 33rd International Conference on Data Engineering (ICDE).

[31]  Jiawei Han,et al.  Mining Multi-aspect Reflection of News Events in Twitter: Discovery, Linking and Presentation , 2015, 2015 IEEE International Conference on Data Mining.

[32]  M. Shamim Hossain,et al.  Cross-Platform Emerging Topic Detection and Elaboration from Multimedia Streams , 2015, ACM Trans. Multim. Comput. Commun. Appl..

[33]  Martin Wattenberg,et al.  Stacked Graphs – Geometry & Aesthetics , 2008, IEEE Transactions on Visualization and Computer Graphics.