Rumor Gauge

The spread of malicious or accidental misinformation in social media, especially in time-sensitive situations, such as real-world emergencies, can have harmful effects on individuals and society. In this work, we developed models for automated verification of rumors (unverified information) that propagate through Twitter. To predict the veracity of rumors, we identified salient features of rumors by examining three aspects of information spread: linguistic style used to express rumors, characteristics of people involved in propagating information, and network propagation dynamics. The predicted veracity of a time series of these features extracted from a rumor (a collection of tweets) is generated using Hidden Markov Models. The verification algorithm was trained and tested on 209 rumors representing 938,806 tweets collected from real-world events, including the 2013 Boston Marathon bombings, the 2014 Ferguson unrest, and the 2014 Ebola epidemic, and many other rumors about various real-world events reported on popular websites that document public rumors. The algorithm was able to correctly predict the veracity of 75% of the rumors faster than any other public source, including journalists and law enforcement officials. The ability to track rumors and predict their outcomes may have practical applications for news consumers, financial markets, journalists, and emergency services, and more generally to help minimize the impact of false information on Twitter.

[1]  T. Shibutani Improvised News: A Sociological Study of Rumor , 1966 .

[2]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[3]  B. Efron The jackknife, the bootstrap, and other resampling plans , 1987 .

[4]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[5]  R. L. Rosnow Inside rumor: A personal journey. , 1991 .

[6]  R. L. Rosnow,et al.  Rumor rest stops on the information highway: Transmission patterns in a computer-mediated rumor chain , 1998 .

[7]  Robin Cowan,et al.  Network Structure and the Diffusion of Knowledge , 2004 .

[8]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[9]  Alessandro Vespignani,et al.  Epidemic spreading in scale-free networks. , 2000, Physical review letters.

[10]  M. Newman Spread of epidemic disease on networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  Sunil J Rao,et al.  Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis , 2003 .

[12]  J. Pennebaker,et al.  Psychological aspects of natural language. use: our words, our selves. , 2003, Annual review of psychology.

[13]  Hugo Liu,et al.  ConceptNet — A Practical Commonsense Reasoning Tool-Kit , 2004 .

[14]  Donald F. Towsley,et al.  The effect of network topology on the spread of epidemics , 2005, Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies..

[15]  D. Watts,et al.  Influentials, Networks, and Public Opinion Formation , 2007 .

[16]  Timothy W. Finin,et al.  Why we twitter: understanding microblogging usage and communities , 2007, WebKDD/SNA-KDD '07.

[17]  Hui Ding,et al.  Querying and mining of time series data: experimental comparison of representations and distance measures , 2008, Proc. VLDB Endow..

[18]  Bertrand De Longueville,et al.  "OMG, from here, I can see the flames!": a use case of mining location based social networks to acquire spatio-temporal data on forest fires , 2009, LBSN '09.

[19]  Hanan Samet,et al.  TwitterStand: news in tweets , 2009, GIS.

[20]  P. Earle,et al.  OMG Earthquake! Can Twitter Improve Earthquake Response? , 2009 .

[21]  Leysia Palen,et al.  Twitter adoption and use in mass convergence and emergency events , 2009 .

[22]  Jure Leskovec,et al.  Meme-tracking and the dynamics of the news cycle , 2009, KDD.

[23]  Kirill Kireyev Applications of Topics Models to Analysis of Disaster-Related Twitter Data , 2009 .

[24]  Nello Cristianini,et al.  Flu Detector - Tracking Epidemics on Twitter , 2010, ECML/PKDD.

[25]  Barbara Poblete,et al.  Twitter under crisis: can we trust what we RT? , 2010, SOMA '10.

[26]  Max Kaufmann Syntactic Normalization of Twitter Messages , 2010 .

[27]  Yutaka Matsuo,et al.  Earthquake shakes Twitter users: real-time event detection by social sensors , 2010, WWW '10.

[28]  Leysia Palen,et al.  Microblogging during two natural hazards events: what twitter may contribute to situational awareness , 2010, CHI.

[29]  Damon Centola,et al.  The Spread of Behavior in an Online Social Network Experiment , 2010, Science.

[30]  Mor Naaman,et al.  Is it really about me?: message content in social awareness streams , 2010, CSCW '10.

[31]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[32]  Leysia Palen,et al.  Chatter on the red: what hazards threat reveals about the social life of microblogged information , 2010, CSCW '10.

[33]  Jacob Ratkiewicz,et al.  Detecting and Tracking the Spread of Astroturf Memes in Microblog Streams , 2010, ArXiv.

[34]  Wilma Stassen Your news in 140 characters: exploring the role of social media in journalism , 2011 .

[35]  E. Steyerberg,et al.  [Regression modeling strategies]. , 2011, Revista espanola de cardiologia.

[36]  Barbara Poblete,et al.  Information credibility on twitter , 2011, WWW.

[37]  Jacob Ratkiewicz,et al.  Detecting and Tracking Political Abuse in Social Media , 2011, ICWSM.

[38]  Devavrat Shah,et al.  Rumors in a Network: Who's the Culprit? , 2009, IEEE Transactions on Information Theory.

[39]  Duncan J. Watts,et al.  Everyone's an influencer: quantifying influence on twitter , 2011, WSDM '11.

[40]  Jacob Ratkiewicz,et al.  Truthy: mapping the spread of astroturf in microblog streams , 2010, WWW.

[41]  Dragomir R. Radev,et al.  Rumor has it: Identifying Misinformation in Microblogs , 2011, EMNLP.

[42]  P. M. Greco When the tail wags the dog. , 2011, American journal of orthodontics and dentofacial orthopedics : official publication of the American Association of Orthodontists, its constituent societies, and the American Board of Orthodontics.

[43]  Sinan Aral,et al.  Identifying Influential and Susceptible Members of Social Networks , 2012, Science.

[44]  Christos Faloutsos,et al.  Rise and fall patterns of information diffusion: model and implications , 2012, KDD.

[45]  Daniel G. Goldstein,et al.  The structure of online diffusion networks , 2012, EC '12.

[46]  Venkatesan Guruswami,et al.  CopyCatch: stopping group attacks by spotting lockstep behavior in social networks , 2013, WWW.

[47]  Kyomin Jung,et al.  Prominent Features of Rumor Propagation in Online Social Media , 2013, 2013 IEEE 13th International Conference on Data Mining.

[48]  Justin Cheng,et al.  Rumor Cascades , 2014, ICWSM.

[49]  Christos Faloutsos,et al.  CatchSync: catching synchronized behavior in large directed graphs , 2014, KDD.

[50]  Gerardo Iñiguez,et al.  Complex contagion process in spreading of online innovation , 2014, Journal of The Royal Society Interface.

[51]  Danqi Chen,et al.  A Fast and Accurate Dependency Parser using Neural Networks , 2014, EMNLP.

[52]  Noah A. Smith,et al.  A Dependency Parser for Tweets , 2014, EMNLP.

[53]  John Yen,et al.  Finding influential users of online health communities: a new metric based on sentiment influence. , 2014, Journal of the American Medical Informatics Association : JAMIA.

[54]  Chang-Tien Lu,et al.  Misinformation Propagation in the Age of Twitter , 2014, Computer.

[55]  Soroush Vosoughi,et al.  A Human-Machine Collaborative System for Identifying Rumors on Twitter , 2015, 2015 IEEE International Conference on Data Mining Workshop (ICDMW).

[56]  Soroush Vosoughi,et al.  Automatic detection and verification of rumors on Twitter , 2015 .

[57]  Qiaozhu Mei,et al.  Enquiring Minds: Early Detection of Rumors in Social Media from Enquiry Posts , 2015, WWW.

[58]  Soroush Vosoughi,et al.  Enhanced Twitter Sentiment Classification Using Contextual Information , 2015, WASSA@EMNLP.

[59]  Christos Faloutsos,et al.  Spotting Suspicious Behaviors in Multimodal Data: A General Metric and Algorithms , 2016, IEEE Transactions on Knowledge and Data Engineering.

[60]  Soroush Vosoughi,et al.  Tweet Acts: A Speech Act Classifier for Twitter , 2016, ICWSM.

[61]  Jin Yang,et al.  Automatic rumors identification on Sina Weibo , 2016, 2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD).

[62]  Yi Li,et al.  In a World That Counts: Clustering and Detecting Fake Social Engagement at Scale , 2015, WWW.

[63]  Soroush Vosoughi,et al.  A Semi-Automatic Method for Efficient Detection of Stories on Social Media , 2016, ICWSM.

[64]  Christos Faloutsos,et al.  Suspicious Behavior Detection: Current Trends and Future Directions , 2016, IEEE Intelligent Systems.