It's always April fools' day!: On the difficulty of social network misinformation classification via propagation features

Given the huge impact that Online Social Networks (OSN) had in the way people get informed and form their opinion, they became an attractive playground for malicious entities that want to spread misinformation, and leverage their effect. In fact, misinformation easily spreads on OSN, and this is a huge threat for modern society, possibly influencing also the outcome of elections, or even putting people's life at risk (e.g., spreading "anti-vaccines" misinformation). Therefore, it is of paramount importance for our society to have some sort of "validation" on information spreading through OSN. The need for a wide-scale validation would greatly benefit from automatic tools. In this paper, we show that it is difficult to carry out an automatic classification of misinformation considering only structural properties of content propagation cascades. We focus on structural properties, because they would be inherently difficult to be manipulated, with the the aim of circumventing classification systems. To support our claim, we carry out an extensive evaluation on Facebook posts belonging to conspiracy theories (representative of misinformation), and scientific news (representative of fact-checked content). Our findings show that conspiracy content reverberates in a way which is hard to distinguish from scientific content: for the classification mechanism we investigated, classification F-score never exceeds 0.7.

[1]  Agata Fronczak,et al.  Average path length in random networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[2]  Michela Del Vicario,et al.  Viral Misinformation: The Role of Homophily and Polarization , 2014, WWW.

[3]  M E J Newman Assortative mixing in networks. , 2002, Physical review letters.

[4]  Walter Quattrociocchi,et al.  Echo Chambers on Facebook , 2016 .

[5]  Jon M. Kleinberg,et al.  Characterizing and curating conversation threads: expansion, focus, volume, re-entry , 2013, WSDM.

[6]  Jeffrey T. Hancock,et al.  Experimental evidence of massive-scale emotional contagion through social networks , 2014, Proceedings of the National Academy of Sciences.

[7]  Kyomin Jung,et al.  Prominent Features of Rumor Propagation in Online Social Media , 2013, 2013 IEEE 13th International Conference on Data Mining.

[8]  Arun Sundararajan,et al.  Distinguishing influence-based contagion from homophily-driven diffusion in dynamic networks , 2009, Proceedings of the National Academy of Sciences.

[9]  Mohammad Al Hasan,et al.  Link prediction using supervised learning , 2006 .

[10]  M. McPherson,et al.  Birds of a Feather: Homophily in Social Networks , 2001 .

[11]  Jacob Cohen,et al.  Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit. , 1968 .

[12]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[13]  Yamir Moreno,et al.  Dynamics of rumor spreading in complex networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[14]  Guido Caldarelli,et al.  Science vs Conspiracy: Collective Narratives in the Age of Misinformation , 2014, PloS one.

[15]  C. Sunstein The Law of Group Polarization , 1999, How Change Happens.

[16]  Guido Caldarelli,et al.  Opinion dynamics on interacting networks: media competition and social influence , 2014, Scientific Reports.

[17]  G. Caldarelli,et al.  The spreading of misinformation online , 2016, Proceedings of the National Academy of Sciences.

[18]  Yamir Moreno,et al.  Emergence of Influential Spreaders in Modified Rumor Models , 2012, Journal of Statistical Physics.

[19]  P. Holland,et al.  Transitivity in Structural Models of Small Groups , 1971 .

[20]  Matthew J. Salganik,et al.  Experimental Study of Inequality and Unpredictability in an Artificial Cultural Market , 2006, Science.

[21]  Guido Caldarelli,et al.  Debunking in a world of tribes , 2015, PloS one.

[22]  Guido Caldarelli,et al.  Emotional Dynamics in the Age of Misinformation , 2015, PloS one.

[23]  Eunsoo Seo,et al.  Identifying rumors and their sources in social networks , 2012, Defense + Commercial Sensing.

[24]  Bryan Greetham,et al.  The First Draft , 2014 .

[25]  Damon Centola,et al.  The Spread of Behavior in an Online Social Network Experiment , 2010, Science.

[26]  Yamir Moreno,et al.  Contact-based Social Contagion in Multiplex Networks , 2013, Physical review. E, Statistical, nonlinear, and soft matter physics.

[27]  Lars Backstrom,et al.  Structural diversity in social contagion , 2012, Proceedings of the National Academy of Sciences.

[28]  Lada A. Adamic,et al.  The role of social networks in information diffusion , 2012, WWW.

[29]  Brian D. Davison,et al.  Predicting popular messages in Twitter , 2011, WWW.

[30]  Mahmoud Fouz,et al.  Why rumors spread so quickly in social networks , 2012, Commun. ACM.

[31]  Yamir Moreno,et al.  Locating privileged spreaders on an online social network. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[32]  Jon Kleinberg,et al.  Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on twitter , 2011, WWW.

[33]  Justin Cheng,et al.  Rumor Cascades , 2014, ICWSM.

[34]  Rediet Abebe Can Cascades be Predicted? , 2014 .

[35]  Jenna Wiens,et al.  Patient Risk Stratification for Hospital-Associated C. diff as a Time-Series Classification Task , 2012, NIPS.

[36]  Jure Leskovec,et al.  Do Cascades Recur? , 2016, WWW.

[37]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[38]  Nicholas A. Christakis,et al.  Cooperative behavior cascades in human social networks , 2009, Proceedings of the National Academy of Sciences.

[39]  Ee-Peng Lim,et al.  Finding Bursty Topics from Microblogs , 2012, ACL.

[40]  M. Macy,et al.  Complex Contagions and the Weakness of Long Ties1 , 2007, American Journal of Sociology.

[41]  A. Izenman Linear Discriminant Analysis , 2013 .

[42]  Rossano Schifanella,et al.  Friendship prediction and homophily in social media , 2012, TWEB.