Deep fusion of multimodal features for social media retweet time prediction

The popularity of various social media platforms (e.g., Twitter, Facebook, Instagram, and Weibo) has led to the generation of millions of micro-blogs each day. Retweet (message forwarding function) is considered to be one of the most effective behavior for information propagation on social networks. The task of retweet behavior prediction has received much attention in recent years, such as modelling the followers’ preference to predict if a tweet from others would be retweeted or not. But one important aspect in retweet behavior prediction is still being overlooked: the followers’ retweet time prediction, which is helpful to understand the popularity of a tweet, the relationships between users, and the influence of users on their followers. However, due to the complex entanglement of multimodal features in social media such as text, social relationships, users’ active time and many others, it is nontrivial to effectively predict the retweet time of followers. In this work, in order to predict the followers’ retweet time on Twitter, we present an end-to-end deep learning model, namely DFMF (Deep Fusion of Multimodal Features), to implicitly learn the latent features and interactions of tweets, social relationships, and the posting time. Specifically, we adopt a word embedding layer to learn the high-level semantics of tweets and a node embedding layer to learn the hidden representations of the complex social relationships. Then, together with the one-hot representation of a tweet’s posting time, the multimodal information is concatenated and fed into fully-connected forward neural networks for implicit cross-modality feature fusion, which is used to predict the retweet time. Finally, we evaluate the proposed method with a real-world Twitter dataset, the experimental results demonstrate that our proposed DFMF is more accurate in predicting the retweet time and can achieve as much as 11.25% performance improvement on the recall accuracy compared to Logistic Regression (LR) and Support Vector Machine (SVM).

[1]  Ting Wang,et al.  Who will retweet me?: finding retweeters in twitter , 2013, SIGIR.

[2]  Xiang Lian,et al.  RSkNN: kNN Search on Road Networks by Incorporating Social Influence , 2016, IEEE Transactions on Knowledge and Data Engineering.

[3]  Miles Efron,et al.  Information search and retrieval in microblogs , 2011, J. Assoc. Inf. Sci. Technol..

[4]  Yang Liu,et al.  Who Influenced You? Predicting Retweet via Social Influence Locality , 2015, ACM Trans. Knowl. Discov. Data.

[5]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[6]  Jianxin Li,et al.  Most Influential Community Search over Large Social Networks , 2017, 2017 IEEE 33rd International Conference on Data Engineering (ICDE).

[7]  Deli Zhao,et al.  Network Representation Learning with Rich Text Information , 2015, IJCAI.

[8]  Guangyan Huang,et al.  Short text similarity measurement using context‐aware weighted biterms , 2020, Concurr. Comput. Pract. Exp..

[9]  Lin Li,et al.  Twitter Analysis for Depression on Social Networks based on Sentiment and Stress , 2019, 2019 6th International Conference on Behavioral, Economic and Socio-Cultural Computing (BESC).

[10]  Xiang Lian,et al.  Keyword Search over Distributed Graphs with Compressed Signature , 2017, IEEE Transactions on Knowledge and Data Engineering.

[11]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[12]  Miles Osborne,et al.  RT to Win! Predicting Message Propagation in Twitter , 2011, ICWSM.

[13]  Felix Naumann,et al.  Analyzing and predicting viral tweets , 2013, WWW.

[14]  Virgílio A. F. Almeida,et al.  Understanding factors that affect response rates in twitter , 2012, HT '12.

[15]  Dit-Yan Yeung,et al.  Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting , 2015, NIPS.

[16]  Jeffrey Nichols,et al.  Who Will Retweet This? Detecting Strangers from Twitter to Retweet Information , 2015, ACM Trans. Intell. Syst. Technol..

[17]  Gleb Gusev,et al.  Prediction of retweet cascade size over time , 2012, CIKM.

[18]  Wanlei Zhou,et al.  Identifying Propagation Sources in Networks: State-of-the-Art and Comparative Studies , 2017, IEEE Communications Surveys & Tutorials.

[19]  Jeffrey Xu Yu,et al.  GCache: Neighborhood-Guided Graph Caching in a Distributed Environment , 2019, IEEE Transactions on Parallel and Distributed Systems.

[20]  Jeffrey Xu Yu,et al.  Target-Aware Holistic Influence Maximization in Spatial Social Networks , 2022, IEEE Transactions on Knowledge and Data Engineering.

[21]  Xin Xin,et al.  When to Make a Topic Popular Again? A Temporal Model for Topic Rehotting Prediction in Online Social Networks , 2018, IEEE Transactions on Signal and Information Processing over Networks.

[22]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[23]  Juan-Zi Li,et al.  Understanding retweeting behaviors in social networks , 2010, CIKM.

[24]  Brian D. Davison,et al.  Predicting popular messages in Twitter , 2011, WWW.

[25]  Xiang Lian,et al.  Constrained Shortest Path Query in a Large Time-Dependent Graph , 2019, Proc. VLDB Endow..

[26]  Andrew Y. Ng,et al.  Parsing Natural Scenes and Natural Language with Recursive Neural Networks , 2011, ICML.

[27]  Yue Liu,et al.  Aggregate Characterization of User Behavior in Twitter and Analysis of the Retweet Graph , 2014, ACM Trans. Internet Techn..

[28]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[29]  Feng Xia,et al.  Community-diversified influence maximization in social networks , 2020, Inf. Syst..

[30]  Renaud Lambiotte,et al.  TiDeH: Time-Dependent Hawkes Process for Predicting Retweet Dynamics , 2016, ICWSM.

[31]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[32]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[33]  Mohand Boughanem,et al.  Featured Tweet Search: Modeling Time and Social Influence for Microblog Retrieval , 2012, 2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology.

[34]  Scott Counts,et al.  Predicting the Speed, Scale, and Range of Information Diffusion in Twitter , 2010, ICWSM.

[35]  Xuanjing Huang,et al.  Hot Topic-Aware Retweet Prediction with Masked Self-attentive Model , 2019, SIGIR.

[36]  Xuanjing Huang,et al.  Retweet Prediction with Attention-based Deep Neural Network , 2016, CIKM.

[37]  Kim Hyun Ki,et al.  Predicting the Lifespan and Retweet Times of Tweets Based on Multiple Feature Analysis , 2014 .

[38]  Shuiqiao Yang,et al.  Discovering Topic Representative Terms for Short Text Clustering , 2019, IEEE Access.

[39]  Ayan Kumar Bhowmick Temporal Pattern of Retweet(s) Help to Maximize Information Diffusion in Twitter , 2020, WSDM.