Exploiting Context for Rumour Detection in Social Media

Tools that are able to detect unverified information posted on social media during a news event can help to avoid the spread of rumours that turn out to be false. In this paper we compare a novel approach using Conditional Random Fields that learns from the sequential dynamics of social media posts with the current state-of-the-art rumour detection system, as well as other baselines. In contrast to existing work, our classifier does not need to observe tweets querying the stance of a post to deem it a rumour but, instead, exploits context learned during the event. Our classifier has improved precision and recall over the state-of-the-art classifier that relies on querying tweets, as well as outperforming our best baseline. Moreover, the results provide evidence for the generalisability of our classifier.

[1]  Misako Takayasu,et al.  Rumor Diffusion and Convergence during the 3.11 Earthquake: A Twitter Case Study , 2015, PloS one.

[2]  R. Procter,et al.  Reading the riots on Twitter: methodological innovation for the analysis of big data , 2013 .

[3]  P. Bordia,et al.  Rumor, Gossip and Urban Legends , 2007 .

[4]  Yongdong Zhang,et al.  News Verification by Exploiting Conflicting Social Viewpoints in Microblogs , 2016, AAAI.

[5]  Wei Gao,et al.  Detecting Rumors from Microblogs with Recurrent Neural Networks , 2016, IJCAI.

[6]  Bernd Carsten Stahl,et al.  Digital Wildfires: Propagation, Verification, Regulation, and Responsible Innovation , 2016, TOIS.

[7]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[8]  Wei Gao,et al.  Detect Rumors Using Time Series of Social Context Information on Microblogging Websites , 2015, CIKM.

[9]  R. Procter,et al.  Reading the riots: what were the police doing on Twitter? , 2013 .

[10]  Kenny Q. Zhu,et al.  False rumors detection on Sina Weibo by propagation structures , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[11]  Arkaitz Zubiaga,et al.  Crowdsourcing the Annotation of Rumourous Conversations in Social Media , 2015, WWW.

[12]  Kalina Bontcheva,et al.  TwitIE: An Open-Source Information Extraction Pipeline for Microblog Text , 2013, RANLP.

[13]  Arkaitz Zubiaga,et al.  Detection and Resolution of Rumours in Social Media , 2017, ACM Comput. Surv..

[14]  Xiaomo Liu,et al.  Real-time Rumor Debunking on Twitter , 2015, CIKM.

[15]  Dragomir R. Radev,et al.  Rumor has it: Identifying Misinformation in Microblogs , 2011, EMNLP.

[16]  Arkaitz Zubiaga,et al.  Microblog Analysis as a Programme of Work , 2015, ArXiv.

[17]  Sven Behnke,et al.  PyStruct: learning structured prediction in python , 2014, J. Mach. Learn. Res..

[18]  Arkaitz Zubiaga,et al.  Analysing How People Orient to and Spread Rumours in Social Media by Looking at Conversational Threads , 2015, PloS one.

[19]  Li Zeng,et al.  #Unconfirmed: Classifying Rumor Stance in Crisis-Related Social Media Messages , 2016, ICWSM.

[20]  Hanan Samet,et al.  TwitterStand: news in tweets , 2009, GIS.

[21]  Rui Lv,et al.  Rumors detection in Chinese via crowd responses , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[22]  Andrew McCallum,et al.  An Introduction to Conditional Random Fields , 2010, Found. Trends Mach. Learn..

[23]  Angela Crandall,et al.  Humanitarianism 2.0 , 2015 .

[24]  Jinquan Zeng,et al.  Rumor Identification in Microblogging Systems Based on Users’ Behavior , 2015, IEEE Transactions on Computational Social Systems.

[25]  Arkaitz Zubiaga,et al.  PHEME : computing veracity : the fourth challenge of big social data , 2014 .

[26]  Jason R. C. Nurse,et al.  Determining the Veracity of Rumours on Twitter , 2016, SocInfo.

[27]  Kalina Bontcheva,et al.  Text Processing with GATE , 2011 .

[28]  Leo Postman,et al.  AN ANALYSIS OF RUMOR , 1946 .

[29]  Arkaitz Zubiaga,et al.  Gaussian Processes for Rumour Stance Classification in Social Media , 2016, ACM Trans. Inf. Syst..

[30]  Mona T. Diab,et al.  Rumor Detection and Classification for Twitter Data , 2015, ArXiv.

[31]  Arkaitz Zubiaga,et al.  Supporting the Use of User Generated Content in Journalistic Practice , 2017, CHI.

[32]  Mona T. Diab,et al.  Rumor Identification and Belief Investigation on Twitter , 2016, WASSA@NAACL-HLT.

[33]  Qiaozhu Mei,et al.  Enquiring Minds: Early Detection of Rumors in Social Media from Enquiry Posts , 2015, WWW.

[34]  Heng Ji,et al.  Curating and contextualizing Twitter stories to assist with social newsgathering , 2013, IUI '13.

[35]  Arkaitz Zubiaga,et al.  Stance Classification in Rumours as a Sequential Task Exploiting the Tree Structure of Social Media Conversations , 2016, COLING.

[36]  Kalina Bontcheva,et al.  Classifying Tweet Level Judgements of Rumours in Social Media , 2015, EMNLP.

[37]  Georgi Georgiev,et al.  An Analysis of Event-Agnostic Features for Rumour Classification in Twitter , 2016, SMN@ICWSM.

[38]  Kate Starbird,et al.  Rumors, False Flags, and Digital Vigilantes: Misinformation on Twitter after the 2013 Boston Marathon Bombing , 2014 .

[39]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[40]  Arkaitz Zubiaga,et al.  SemEval-2017 Task 8: RumourEval: Determining rumour veracity and support for rumours , 2017, *SEMEVAL.