AI techniques to analyse a social network on text, user and group level: application on Galaxy2

Online Social Networks have taken a huge place in the informational space, and are often used for advertising, e-reputation, propaganda, or even manipulation, either by individuals, companies or states. As the quantity of information makes the human exploitation difficult, solutions to support the decision makers can only come from the use of AI techniques to extract intelligence from posted messages, to qualify the user behaviours, and to identify the social structure. In this article, we illustrate how to exploit such techniques on a very peculiar social network, named Galaxy2, hidden in the Dark Web. We propose an analysis of 1000 days of activity using NLP techniques to find the most interesting topics and to discover key actors. We then proceed with a MLbased profiling of the user behaviours. Finally, we introduce influence and cohesion scores for groups of users, which help their characterisation and evaluation.

[1]  Sean F. Everton Disrupting Dark Networks , 2012 .

[2]  Jean-Loup Guillaume,et al.  Static community detection algorithms for evolving networks , 2010, 8th International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks.

[3]  Martin Rosvall,et al.  Maps of Information Flow Reveal Community Structure In Complex Networks , 2007 .

[4]  Timothy W. Finin,et al.  Why we twitter: understanding microblogging usage and communities , 2007, WebKDD/SNA-KDD '07.

[5]  Kwan Hui Lim,et al.  An interaction-based approach to detecting highly interactive Twitter communities using tweeting links , 2016, Web Intell..

[6]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[7]  Kenneth E. Shirley,et al.  LDAvis: A method for visualizing and interpreting topics , 2014 .

[8]  Laurent Vercouter,et al.  Measures for topical cohesion of user communities on Twitter , 2017, WI.

[9]  Alexander Lazovik,et al.  Mining Twitter in the Cloud: A Case Study , 2010, 2010 IEEE 3rd International Conference on Cloud Computing.

[10]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[11]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[12]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[13]  Jure Leskovec,et al.  Defining and Evaluating Network Communities Based on Ground-Truth , 2012, ICDM.

[14]  Danai Koutra,et al.  RolX: structural role extraction & mining in large graphs , 2012, KDD.

[15]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[16]  Geoffrey L. Cohen,et al.  Party over policy: The dominating impact of group influence on political beliefs. , 2003, Journal of personality and social psychology.

[17]  Haewoon Kwak,et al.  Finding influentials based on the temporal order of information adoption in twitter , 2010, WWW '10.

[18]  David A. Bader,et al.  Massive Social Network Analysis: Mining Twitter for Social Good , 2010, 2010 39th International Conference on Parallel Processing.

[19]  S. Milgram,et al.  GROUP PRESSURE AND ACTION AGAINST A PERSON. , 1964, Journal of abnormal psychology.

[20]  Nemanja Spasojevic,et al.  Klout score: Measuring influence across multiple social networks , 2015, 2015 IEEE International Conference on Big Data (Big Data).

[21]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[22]  Rémy Cazabet,et al.  Simulate to Detect: A Multi-agent System for Community Detection , 2011, 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology.

[23]  Eric Gilbert,et al.  VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text , 2014, ICWSM.

[24]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[25]  D. Boyd,et al.  Dynamic Debates: An Analysis of Group Polarization Over Time on Twitter , 2010 .

[26]  Hanan Samet,et al.  TwitterStand: news in tweets , 2009, GIS.

[27]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[28]  Fujio Toriumi,et al.  Real-time Tweet Classification in Disaster Situation , 2016, WWW.

[29]  Heng Ji,et al.  The Nature of Communications and Emerging Communities on Twitter Following the 2013 Syria Sarin Gas Attacks , 2014, 2014 IEEE Joint Intelligence and Security Informatics Conference.