Graph-based Rumour Detection for Social Media

Social platforms became a major source of rumours. While rumours can have severe real-world implications, their detection is notoriously hard: Content on social platforms is short and lacks semantics; it spreads quickly through a dynamically evolving network; and without considering the context of content, it may be impossible to arrive at a truthful interpretation. Traditional approaches to rumour detection, however, exploit solely a single content modality, e.g., social media posts, which limits their detection accuracy. In this paper, we cope with the aforementioned challenges by means of a multi-modal approach to rumour detection that identifies anomalies in both, the entities (e.g., users, posts, and hashtags) of a social platform and their relations. Based on local anomalies, we show how to detect rumours at the network level, following a graph-based scan approach.

[1]  Matthias Weidlich,et al.  Exploring Data Partitions for What-if Analysis , 2018 .

[2]  Hao Wang,et al.  Adapting to User Interest Drift for POI Recommendation , 2016, IEEE Transactions on Knowledge and Data Engineering.

[3]  Shazia Wasim Sadiq,et al.  Discovering interpretable geo-social communities for user behavior prediction , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[4]  June Luo,et al.  A Spatial Scan Statistic on Trends , 2014 .

[5]  Daniel B. Neill,et al.  Non-parametric scan statistics for event detection and forecasting in heterogeneous social media graphs , 2014, KDD.

[6]  Yizhou Sun,et al.  Mining Heterogeneous Information Networks: Principles and Methodologies , 2012, Mining Heterogeneous Information Networks: Principles and Methodologies.

[7]  Kenny Q. Zhu,et al.  False rumors detection on Sina Weibo by propagation structures , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[8]  Io Taxidou,et al.  Realtime Analysis of Information Diffusion in Social Media , 2013, Proc. VLDB Endow..

[9]  Karl Aberer,et al.  Comparing Events Coverage in Online News and Social Media: The Case of Climate Change , 2015, ICWSM.

[10]  Padhraic Smyth,et al.  Adaptive event detection with time-varying poisson processes , 2006, KDD '06.

[11]  Daniel B. Neill,et al.  Fast subset scan for spatial pattern detection , 2012 .

[12]  Fan Zhang,et al.  OLAK: An Efficient Algorithm to Prevent Unraveling in Social Networks , 2017, Proc. VLDB Endow..

[13]  M. Kulldorff A spatial scan statistic , 1997 .

[14]  Philip S. Yu,et al.  A Survey of Heterogeneous Information Network Analysis , 2015, IEEE Transactions on Knowledge and Data Engineering.

[15]  Huan Liu,et al.  Is the Sample Good Enough? Comparing Data from Twitter's Streaming API with Twitter's Firehose , 2013, ICWSM.

[16]  Jiebo Luo,et al.  SocialSpamGuard: A Data Mining-Based Spam Detection System for Social Media Networks , 2011, Proc. VLDB Endow..

[17]  Vasudeva Varma,et al.  Towards Semantic Retrieval of Hashtags in Microblogs , 2015, WWW.

[18]  Daniel B. Neill,et al.  Dynamic Pattern Detection with Temporal Consistency and Connectivity Constraints , 2013, 2013 IEEE 13th International Conference on Data Mining.

[19]  Barbara Poblete,et al.  Information credibility on twitter , 2011, WWW.

[20]  Bela Stantic,et al.  Bootstrapping Uncertainty in Schema Covering , 2018, ADC.

[21]  Ioana Manolescu,et al.  Computational fact-checking: Problems, state of the art, and perspectives , 2018, WWW 2018.

[22]  Santhosh Kumar,et al.  Temporal Opinion Spam Detection by Multivariate Indicative Signals , 2016, ICWSM.

[23]  Karl Aberer,et al.  An Evaluation of Model-Based Approaches to Sensor Data Compression , 2013, IEEE Transactions on Knowledge and Data Engineering.

[24]  Sen Wang,et al.  Provenance-Based Rumor Detection , 2017, ADC.

[25]  Matthias Weidlich,et al.  What-If Analysis with Conflicting Goals: Recommending Data Ranges for Exploration , 2018, 2018 IEEE 34th International Conference on Data Engineering (ICDE).

[26]  Justin Cheng,et al.  Rumor Cascades , 2014, ICWSM.

[27]  Rose Yu,et al.  GLAD: group anomaly detection in social media analysis , 2014, ACM Trans. Knowl. Discov. Data.

[28]  Yehuda Vardi,et al.  A Hybrid High-Order Markov Chain Model for Computer Intrusion Detection , 2001 .

[29]  Qiaozhu Mei,et al.  Enquiring Minds: Early Detection of Rumors in Social Media from Enquiry Posts , 2015, WWW.

[30]  Karl Aberer,et al.  Minimizing Efforts in Validating Crowd Answers , 2015, SIGMOD Conference.

[31]  Quoc Viet Hung Nguyen,et al.  Probabilistic Schema Covering , 2017 .

[32]  Ioana Manolescu,et al.  Computational fact-checking: a content management perspective , 2018, Proc. VLDB Endow..

[33]  Arkaitz Zubiaga,et al.  Detection and Resolution of Rumours in Social Media , 2017, ACM Comput. Surv..

[34]  Karl Aberer,et al.  An Evaluation of Aggregation Techniques in Crowdsourcing , 2013, WISE.

[35]  Barnabás Póczos,et al.  Hierarchical Probabilistic Models for Group Anomaly Detection , 2011, AISTATS.

[36]  Zi Huang,et al.  Joint Event-Partner Recommendation in Event-Based Social Networks , 2018, 2018 IEEE 34th International Conference on Data Engineering (ICDE).

[37]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[38]  Wei Gao,et al.  Detecting Rumors from Microblogs with Recurrent Neural Networks , 2016, IJCAI.

[39]  Bernhard Schölkopf,et al.  One-Class Support Measure Machines for Group Anomaly Detection , 2013, UAI.

[40]  Wei Gao,et al.  Detect Rumors in Microblog Posts Using Propagation Structure via Kernel Learning , 2017, ACL.

[41]  Charu C. Aggarwal,et al.  Relation Strength-Aware Clustering of Heterogeneous Information Networks with Incomplete Attributes , 2012, Proc. VLDB Endow..

[42]  VARUN CHANDOLA,et al.  Outlier Detection : A Survey , 2007 .

[43]  A. Karr,et al.  Computer Intrusion: Detecting Masquerades , 2001 .

[44]  Yang Wang,et al.  SPTF: A Scalable Probabilistic Tensor Factorization Model for Semantic-Aware Behavior Prediction , 2017, 2017 IEEE International Conference on Data Mining (ICDM).

[45]  Ching-Yung Lin,et al.  A Survey on Social Media Anomaly Detection , 2016, SIGKDD Explor..

[46]  Quoc Viet Hung Nguyen,et al.  User Guidance for Efficient Fact Checking , 2019, Proc. VLDB Endow..

[47]  Fan Yang,et al.  Automatic detection of rumor on Sina Weibo , 2012, MDS '12.

[48]  Christopher Kermorvant,et al.  Dropout Improves Recurrent Neural Networks for Handwriting Recognition , 2013, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[49]  Fredric C. Gey,et al.  The relationship between recall and precision , 1994 .

[50]  Mingzhe Wang,et al.  LINE: Large-scale Information Network Embedding , 2015, WWW.

[51]  Ponnurangam Kumaraguru,et al.  TweetCred: Real-Time Credibility Assessment of Content on Twitter , 2014, SocInfo.

[52]  Fernando Diaz,et al.  CrisisLex: A Lexicon for Collecting and Filtering Microblogged Communications in Crises , 2014, ICWSM.

[53]  Sinan Aral,et al.  The spread of true and false news online , 2018, Science.

[54]  Douglas H. Jones,et al.  Goodness-of-fit test statistics that dominate the Kolmogorov statistics , 1979 .