Embedding and predicting the event at early stage

Social media has become one of the most credible sources for delivering messages, breaking news, as well as events. Predicting the future dynamics of an event at a very early stage is significantly valuable, e.g, helping company anticipate marketing trends before the event becomes mature. However, this prediction is non-trivial because a) social events always stay with “noise” under the same topic and b) the information obtained at its early stage is too sparse and limited to support an accurate prediction. In order to overcome these two problems, in this paper, we design an event early embedding model (EEEM) that can 1) extract social events from noise, 2) find the previous similar events, and 3) predict future dynamics of a new event with very limited information. Specifically, a denoising approach is derived from the knowledge of signal analysis to eliminate social noise and extract events. Moreover, we propose a novel predicting scheme based on locally linear embedding algorithm to construct the volume of a new event from its k nearest neighbors. Compared to previous work only fitting the historical volume dynamics to make a prediction, our predictive model is based on both the volume information and content information of events. Extensive experiments conducted on a large-scale dataset of Twitter data demonstrate the capacity of our model on extract events and the promising performance of prediction by considering both volume information as well as content information. Compared with predicting with only the content or the volume feature, we find the best performance of considering they both with our proposed fusion method.

[1]  Zhiting Hu,et al.  Dynamic User Modeling in Social Media Systems , 2015, TOIS.

[2]  Jure Leskovec,et al.  SEISMIC: A Self-Exciting Point Process Model for Predicting Tweet Popularity , 2015, KDD.

[3]  Heng Tao Shen,et al.  Unsupervised Deep Hashing with Similarity-Adaptive and Discrete Optimization , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Jure Leskovec,et al.  Inferring networks of diffusion and influence , 2010, KDD.

[5]  Hakim Hacid,et al.  A predictive model for the temporal dynamics of information diffusion in online social networks , 2012, WWW.

[6]  Wolfgang Nejdl,et al.  Understanding the diversity of tweets in the time of outbreaks , 2013, WWW.

[7]  Nicu Sebe,et al.  Optimized Graph Learning Using Partial Tags and Multiple Features for Image and Video Annotation , 2016, IEEE Transactions on Image Processing.

[8]  Wei Liu,et al.  Asymmetric Binary Coding for Image Search , 2017, IEEE Transactions on Multimedia.

[9]  Hila Becker,et al.  Learning similarity metrics for event identification in social media , 2010, WSDM '10.

[10]  Luming Zhang,et al.  Multiple Social Network Learning and Its Application in Volunteerism Tendency Prediction , 2015, SIGIR.

[11]  Maia Zaharieva,et al.  Social Event Mining in Large Photo Collections , 2015, ICMR.

[12]  Christian Bauckhage,et al.  Mathematical Models of Fads Explain the Temporal Dynamics of Internet Memes , 2013, ICWSM.

[13]  Heng Tao Shen,et al.  Processing Long Queries Against Short Text , 2017, ACM Trans. Inf. Syst..

[14]  Yang Yang,et al.  Adversarial Cross-Modal Retrieval , 2017, ACM Multimedia.

[15]  Hua Lu,et al.  A unified model for stable and temporal topic detection from social media data , 2013, 2013 IEEE 29th International Conference on Data Engineering (ICDE).

[16]  Meng Wang,et al.  Coherent Semantic-Visual Indexing for Large-Scale Image Retrieval in the Cloud , 2017, IEEE Transactions on Image Processing.

[17]  Jure Leskovec,et al.  Patterns of temporal variation in online media , 2011, WSDM '11.

[18]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[19]  Jiawei Han,et al.  Predicting future popularity trend of events in microblogging platforms , 2012, ASIST.

[20]  Tad Hogg,et al.  Using a model of social dynamics to predict popularity of news , 2010, WWW '10.

[21]  Xuelong Li,et al.  Robust Discrete Spectral Hashing for Large-Scale Image Semantic Indexing , 2015, IEEE Transactions on Big Data.

[22]  Albert-László Barabási,et al.  Modeling and Predicting Popularity Dynamics via Reinforced Poisson Processes , 2014, AAAI.

[23]  Jure Leskovec,et al.  Meme-tracking and the dynamics of the news cycle , 2009, KDD.

[24]  Yang Yang,et al.  Multitask Spectral Clustering by Exploring Intertask Correlation , 2015, IEEE Transactions on Cybernetics.

[25]  Bu-Sung Lee,et al.  Event Detection in Twitter , 2011, ICWSM.

[26]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[27]  Xuelong Li,et al.  Robust Web Image Annotation via Exploring Multi-Facet and Structural Knowledge , 2017, IEEE Transactions on Image Processing.

[28]  Qiang Yang,et al.  Predicting user activity level in social networks , 2013, CIKM.

[29]  Ciro Cattuto,et al.  Dynamical classes of collective attention in twitter , 2011, WWW.

[30]  Zi Huang,et al.  Robust discrete code modeling for supervised hashing , 2018, Pattern Recognit..

[31]  James Caverlee,et al.  Discovering trending phrases on information streams , 2011, CIKM '11.

[32]  Hong Cheng,et al.  Why It Happened: Identifying and Modeling the Reasons of the Happening of Social Events , 2015, KDD.

[33]  Wei-Ying Ma,et al.  Hashtag-Based Sub-Event Discovery Using Mutually Generative LDA in Twitter , 2016, AAAI.

[34]  Zi Huang,et al.  EventEye: Monitoring Evolving Events from Tweet Streams , 2014, ACM Multimedia.

[35]  Albert-László Barabási,et al.  The origin of bursts and heavy tails in human dynamics , 2005, Nature.

[36]  Shichao Zhang,et al.  Robust Joint Graph Sparse Coding for Unsupervised Spectral Feature Selection , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[37]  Zi Huang,et al.  A temporal context-aware model for user behavior modeling in social media systems , 2014, SIGMOD Conference.

[38]  Themis Palpanas,et al.  Dynamics of news events and social media reaction , 2014, KDD.

[39]  Xuelong Li,et al.  Block-Row Sparse Multiview Multilabel Learning for Image Classification , 2016, IEEE Transactions on Cybernetics.

[40]  Heng Tao Shen,et al.  Video Captioning With Attention-Based LSTM and Semantic Consistency , 2017, IEEE Transactions on Multimedia.

[41]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[42]  Zi Huang,et al.  Discrete Nonnegative Spectral Clustering , 2017, IEEE Transactions on Knowledge and Data Engineering.

[43]  Heng Tao Shen,et al.  Hashing with Angular Reconstructive Embeddings , 2018, IEEE Transactions on Image Processing.

[44]  Bernhard Schölkopf,et al.  Uncovering the Temporal Dynamics of Diffusion Networks , 2011, ICML.

[45]  Yiming Yang,et al.  A study of retrospective and on-line event detection , 1998, SIGIR '98.

[46]  Kian-Lee Tan,et al.  Location-Aware Pub/Sub System: When Continuous Moving Queries Meet Dynamic Event Streams , 2015, SIGMOD Conference.

[47]  Prasenjit Mitra,et al.  Temporal and Information Flow Based Event Detection from Social Text Streams , 2007, AAAI.