Statistical analysis of risk assessment factors and metrics to evaluate radicalisation in Twitter

Abstract Nowadays, Social Networks have become an essential communication tools producing a large amount of information about their users and their interactions, which can be analysed with Data Mining methods. In the last years, Social Networks are being used to radicalise people. In this paper, we study the performance of a set of indicators and their respective metrics, devoted to assess the risk of radicalisation of a precise individual on three different datasets. Keyword-based metrics, even though depending on the written language, performs well when measuring frustration, perception of discrimination as well as declaration of negative and positive ideas about Western society and Jihadism, respectively. However, metrics based on frequent habits such as writing ellipses are not well enough to characterise a user in risk of radicalisation. The paper presents a detailed description of both, the set of indicators used to assess the radicalisation in Social Networks and the set of datasets used to evaluate them. Finally, an experimental study over these datasets are carried out to evaluate the performance of the metrics considered.

[1]  Daniel J. Brass,et al.  Network Analysis in the Social Sciences , 2009, Science.

[2]  S. Borgatti,et al.  On Social Network Analysis in a Supply Chain Context , 2009 .

[3]  Pablo Gamallo,et al.  Yet Another Suite of Multilingual NLP Tools , 2015, SLATE.

[4]  John Scott What is social network analysis , 2010 .

[5]  Pooja Wadhwa,et al.  Tracking on-line radicalization using investigative data mining , 2013, 2013 National Conference on Communications (NCC).

[6]  Juan Julián Merelo Guervós,et al.  Where is evolutionary computation going? A temporal analysis of the EC community , 2007, Genetic Programming and Evolvable Machines.

[7]  Ashish Sureka,et al.  Using KNN and SVM Based One-Class Classifier for Detecting Online Radicalization on Twitter , 2015, ICDCIT.

[8]  Djamal Benslimane,et al.  Measuring the Radicalisation Risk in Social Networks , 2017, IEEE Access.

[9]  Robin Thompson,et al.  Radicalization and the Use of Social Media , 2011 .

[10]  David Camacho,et al.  Adaptive k-Means Algorithm for Overlapped Graph Clustering , 2012, Int. J. Neural Syst..

[11]  David Camacho,et al.  Using the Clustering Coefficient to Guide a Genetic-Based Communities Finding Algorithm , 2011, IDEAL.

[12]  D. Pressman,et al.  Internet Use and Violent Extremism: A Cyber-VERA Risk Assessment Protocol , 2016 .

[13]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[14]  Alessandro Flammini,et al.  Predicting online extremism, content adopters, and interaction reciprocity , 2016, SocInfo.

[15]  Lisa Kaati,et al.  Detecting Multipliers of Jihadism on Twitter , 2015, 2015 IEEE International Conference on Data Mining Workshop (ICDMW).

[16]  Lisa Kaati,et al.  Detecting Jihadist Messages on Twitter , 2015, 2015 European Intelligence and Security Informatics Conference.

[17]  Mahmoud Barhamgi,et al.  An Initial Study on Radicalization Risk Factors: Towards an Assessment Software Tool , 2017, 2017 28th International Workshop on Database and Expert Systems Applications (DEXA).

[18]  N. Christakis,et al.  Formation of raiding parties for intergroup violence is mediated by social network structure , 2016, Proceedings of the National Academy of Sciences.

[19]  Satu Elisa Schaeffer,et al.  Graph Clustering , 2017, Encyclopedia of Machine Learning and Data Mining.

[20]  Carlos Cotta,et al.  An analysis of the structure and evolution of the scientific collaboration network of computer intelligence in games , 2014 .

[21]  D. Pressman,et al.  Calibrating risk for violent political extremists and terrorists: the VERA 2 structured assessment , 2012 .

[22]  Jason J. Jung,et al.  ACO-based clustering for Ego Network analysis , 2017, Future Gener. Comput. Syst..

[23]  Colin Phillips,et al.  The psycholinguistics of ellipsis , 2014 .

[24]  Michalis Vazirgiannis,et al.  Clustering and Community Detection in Directed Networks: A Survey , 2013, ArXiv.

[25]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[26]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[27]  Derek Greene,et al.  Online social media in the Syria conflict: Encompassing the extremes and the in-betweens , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[28]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[29]  Olivier Pietquin,et al.  MultiVec: a Multilingual and Multilevel Representation Learning Toolkit for NLP , 2016, LREC.

[30]  Kayo Fujimoto,et al.  Adolescent affiliations and adiposity: a social network analysis of friendships and obesity. , 2009, The Journal of adolescent health : official publication of the Society for Adolescent Medicine.

[31]  Ashish Sureka,et al.  Solutions to Detect and Analyze Online Radicalization : A Survey , 2013, ArXiv.

[32]  N. Guarino,et al.  Formal Ontology in Information Systems : Proceedings of the First International Conference(FOIS'98), June 6-8, Trento, Italy , 1998 .

[33]  A. Barabasi,et al.  Evolution of the social network of scientific collaborations , 2001, cond-mat/0104162.