Rumour detection and monitoring in open source intelligence: understanding publishing behaviours as a prerequisite

In the context of information warfare, rumour detection has become a central issue. From classical media-related campaign, to propaganda and indoctrination that lie at the core of terrorism, rumour is a mean widely used and thus a threat that must be identified as soon as possible, and in the best-case scenario, anticipated and curbed. The emergence of a new informational environment due to the adoption of the Internet as a massive information diffusion medium has led to a situation suitable to the creation and propagation of rumours. Indeed, the Web gives everyone not only the possibility to observe information flows but also the opportunity to influence and create them. In order to tackle the issue of rumour detection, one has to understand the mechanisms underlying their propagation. In this perspective, we believe that it is essential to identify and understand the publishing behaviours of the sources. Therefore, we focus in this paper on the identification of groups of sources with similar publishing characteristics. We propose to tackle this problematic by using clustering methods on data extracted from Web sources. The resulting clusters obtained from the clustering are then interpreted as groups of Websites behaving similarly and used to characterize publishing behaviours.

[1]  Peter Sheridan Dodds,et al.  Universal behavior in a generalized model of contagion. , 2004, Physical review letters.

[2]  Mark J. Embrechts,et al.  On the Use of the Adjusted Rand Index as a Metric for Evaluating Supervised Classification , 2009, ICANN.

[3]  J. Pieterse International Encyclopedia of Communication , 2008 .

[4]  Yamir Moreno,et al.  Theory of Rumour Spreading in Complex Social Networks , 2007, ArXiv.

[5]  Serge Galam,et al.  Modelling rumors: the no plane Pentagon French hoax case , 2002, cond-mat/0211571.

[6]  Subhash C. Bagui,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2005, Technometrics.

[7]  Devavrat Shah,et al.  Rumors in a Network: Who's the Culprit? , 2009, IEEE Transactions on Information Theory.

[8]  Ludmila I. Kuncheva,et al.  Evaluation of Stability of k-Means Cluster Ensembles with Respect to Random Initialization , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  R. H. Knapp,et al.  A PSYCHOLOGY OF RUMOR , 1944 .

[10]  Jure Leskovec,et al.  Meme-tracking and the dynamics of the news cycle , 2009, KDD.

[11]  François Nel,et al.  Detecting Anomalies in Open Source Information Diffusion , 2009 .

[12]  M. Newman Spread of epidemic disease on networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[13]  D. Zanette Dynamics of rumor propagation on small-world networks. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.