Fake and Spam Messages: Detecting Misinformation During Natural Disasters on Social Media

During natural disasters or crises, users on social media tend to easily believe contents of postings related to the events, and retweet the postings with hoping them to be reached to many other users. Unfortunately, there are malicious users who understand the tendency and post misinformation such as spam and fake messages with expecting wider propagation. To resolve the problem, in this paper we conduct a case study of 2013 Moore Tornado and Hurricane Sandy. Concretely, we (i) understand behaviors of these malicious users, (ii) analyze properties of spam, fake and legitimate messages, (iii) propose flat and hierarchical classification approaches, and (iv) detect both fake and spam messages with even distinguishing between them. Our experimental results show that our proposed approaches identify spam and fake messages with 96.43% accuracy and 0.961 F-measure.

[1]  Kyumin Lee,et al.  You are where you tweet: a content-based approach to geo-locating twitter users , 2010, CIKM.

[2]  Jun Hu,et al.  Detecting and characterizing social spam campaigns , 2010, CCS '10.

[3]  Barbara Poblete,et al.  Information credibility on twitter , 2011, WWW.

[4]  Yejin Choi,et al.  Distributional Footprints of Deceptive Product Reviews , 2012, ICWSM.

[5]  Alex Hai Wang,et al.  Don't follow me: Spam detection in Twitter , 2010, 2010 International Conference on Security and Cryptography (SECRYPT).

[6]  Junhui Wang,et al.  Detecting group review spam , 2011, WWW.

[7]  Christos Faloutsos,et al.  Retweeting Activity on Twitter: Signs of Deception , 2015, PAKDD.

[8]  Jacob Ratkiewicz,et al.  Detecting and Tracking Political Abuse in Social Media , 2011, ICWSM.

[9]  Kyumin Lee,et al.  Seven Months with the Devils: A Long-Term Study of Content Polluters on Twitter , 2011, ICWSM.

[10]  P. Kumaraguru,et al.  $1.00 per RT #BostonMarathon #PrayForBoston: Analyzing fake content on Twitter , 2013, 2013 APWG eCrime Researchers Summit.

[11]  Fan Yang,et al.  Automatic detection of rumor on Sina Weibo , 2012, MDS '12.

[12]  Claire Cardie,et al.  Estimating the prevalence of deception in online review communities , 2012, WWW.

[13]  M. Chuah,et al.  Spam Detection on Twitter Using Traditional Classifiers , 2011, ATC.

[14]  Anupam Joshi,et al.  Faking Sandy: characterizing and identifying fake images on Twitter during Hurricane Sandy , 2013, WWW.

[15]  Gang Wang,et al.  Crowds on Wall Street: Extracting Value from Collaborative Investing Platforms , 2015, CSCW.

[16]  Gianluca Stringhini,et al.  Detecting spammers on social networks , 2010, ACSAC '10.

[17]  Lei Zhang,et al.  A Survey of Opinion Mining and Sentiment Analysis , 2012, Mining Text Data.

[18]  Vern Paxson,et al.  @spam: the underground on 140 characters or less , 2010, CCS '10.

[19]  Jon M. Kleinberg,et al.  WWW 2009 MADRID! Track: Data Mining / Session: Opinions How Opinions are Received by Online Communities: A Case Study on Amazon.com Helpfulness Votes , 2022 .

[20]  Danah Boyd,et al.  Detecting Spam in a Twitter Network , 2009, First Monday.

[21]  Kyumin Lee,et al.  Uncovering social spammers: social honeypots + machine learning , 2010, SIGIR.

[22]  Chao Wu,et al.  Information Credibility on Twitter in Emergency Situation , 2012, PAISI.

[23]  Niloy Ganguly,et al.  Spammers' networks within online social networks: a case-study on Twitter , 2011, WWW.

[24]  Kyumin Lee,et al.  Campaign extraction from social media , 2013, ACM Trans. Intell. Syst. Technol..

[25]  Philip S. Yu,et al.  Review spam detection via temporal pattern discovery , 2012, KDD.

[26]  Ponnurangam Kumaraguru,et al.  TweetCred: A Real-time Web-based System for Assessing Credibility of Content on Twitter , 2014, ArXiv.