Learning Reporting Dynamics during Breaking News for Rumour Detection in Social Media

Breaking news leads to situations of fast-paced reporting in social media, producing all kinds of updates related to news stories, albeit with the caveat that some of those early updates tend to be rumours, i.e., information with an unverified status at the time of posting. Flagging information that is unverified can be helpful to avoid the spread of information that may turn out to be false. Detection of rumours can also feed a rumour tracking system that ultimately determines their veracity. In this paper we introduce a novel approach to rumour detection that learns from the sequential dynamics of reporting during breaking news in social media to detect rumours in new stories. Using Twitter datasets collected during five breaking news stories, we experiment with Conditional Random Fields as a sequential classifier that leverages context learnt during an event for rumour detection, which we compare with the state-of-the-art rumour detection system as well as other baselines. In contrast to existing work, our classifier does not need to observe tweets querying a piece of information to deem it a rumour, but instead we detect rumours from the tweet alone by exploiting context learnt during the event. Our classifier achieves competitive performance, beating the state-of-the-art classifier that relies on querying tweets with improved precision and recall, as well as outperforming our best baseline with nearly 40% improvement in terms of F1 score. The scale and diversity of our experiments reinforces the generalisability of our classifier.

[1]  Hanan Samet,et al.  TwitterStand: news in tweets , 2009, GIS.

[2]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[3]  P. Donovan How Idle is Idle Talk? One Hundred Years of Rumor Research , 2007 .

[4]  Arkaitz Zubiaga,et al.  PHEME : computing veracity : the fourth challenge of big social data , 2014 .

[5]  Wei Gao,et al.  Detect Rumors Using Time Series of Social Context Information on Microblogging Websites , 2015, CIKM.

[6]  Kalina Bontcheva,et al.  Classifying Tweet Level Judgements of Rumours in Social Media , 2015, EMNLP.

[7]  L. Postman,et al.  The psychology of rumor , 1947 .

[8]  Georgi Georgiev,et al.  An Analysis of Event-Agnostic Features for Rumour Classification in Twitter , 2016, SMN@ICWSM.

[9]  Heng Ji,et al.  Curating and contextualizing Twitter stories to assist with social newsgathering , 2013, IUI '13.

[10]  Leo Postman,et al.  AN ANALYSIS OF RUMOR , 1946 .

[11]  P. Bordia,et al.  Rumor, Gossip and Urban Legends , 2007 .

[12]  Arkaitz Zubiaga,et al.  Crowdsourcing the Annotation of Rumourous Conversations in Social Media , 2015, WWW.

[13]  Bernd Carsten Stahl,et al.  Digital Wildfires: Propagation, Verification, Regulation, and Responsible Innovation , 2016, TOIS.

[14]  Kenny Q. Zhu,et al.  False rumors detection on Sina Weibo by propagation structures , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[15]  R. Procter,et al.  Reading the riots on Twitter: methodological innovation for the analysis of big data , 2013 .

[16]  Kalina Bontcheva,et al.  Text Processing with GATE , 2011 .

[17]  Mona T. Diab,et al.  Rumor Identification and Belief Investigation on Twitter , 2016, WASSA@NAACL-HLT.

[18]  Li Zeng,et al.  #Unconfirmed: Classifying Rumor Stance in Crisis-Related Social Media Messages , 2016, ICWSM.

[19]  Tetsuro Takahashi,et al.  Rumor detection on twitter , 2012, The 6th International Conference on Soft Computing and Intelligent Systems, and The 13th International Symposium on Advanced Intelligence Systems.

[20]  Wei Gao,et al.  Detecting Rumors from Microblogs with Recurrent Neural Networks , 2016, IJCAI.

[21]  Rui Lv,et al.  Rumors detection in Chinese via crowd responses , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[22]  Misako Takayasu,et al.  Rumor Diffusion and Convergence during the 3.11 Earthquake: A Twitter Case Study , 2015, PloS one.

[23]  R. Procter,et al.  Reading the riots: what were the police doing on Twitter? , 2013 .

[24]  Sven Behnke,et al.  PyStruct: learning structured prediction in python , 2014, J. Mach. Learn. Res..

[25]  Hongyan Liu,et al.  Detecting Event Rumors on Sina Weibo Automatically , 2013, APWeb.

[26]  Arkaitz Zubiaga,et al.  Analysing How People Orient to and Spread Rumours in Social Media by Looking at Conversational Threads , 2015, PloS one.

[27]  Andrew McCallum,et al.  An Introduction to Conditional Random Fields , 2010, Found. Trends Mach. Learn..

[28]  Qiaozhu Mei,et al.  Enquiring Minds: Early Detection of Rumors in Social Media from Enquiry Posts , 2015, WWW.

[29]  Yongdong Zhang,et al.  News Verification by Exploiting Conflicting Social Viewpoints in Microblogs , 2016, AAAI.

[30]  Dragomir R. Radev,et al.  Rumor has it: Identifying Misinformation in Microblogs , 2011, EMNLP.

[31]  Angela Crandall,et al.  Humanitarianism 2.0 , 2015 .

[32]  Jinquan Zeng,et al.  Rumor Identification in Microblogging Systems Based on Users’ Behavior , 2015, IEEE Transactions on Computational Social Systems.

[33]  Kalina Bontcheva,et al.  TwitIE: An Open-Source Information Extraction Pipeline for Microblog Text , 2013, RANLP.

[34]  Xiaomo Liu,et al.  Real-time Rumor Debunking on Twitter , 2015, CIKM.

[35]  Kate Starbird,et al.  Rumors, False Flags, and Digital Vigilantes: Misinformation on Twitter after the 2013 Boston Marathon Bombing , 2014 .

[36]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[37]  Eunsoo Seo,et al.  Identifying rumors and their sources in social networks , 2012, Defense + Commercial Sensing.