Community extraction and visualization in social networks applied to Twitter

Abstract Nowadays, social network analysis attracts more interest from the scientific community. However, it becomes trickier to analyse the generated data by the social networks due to their complexity, which hides the underlying patterns. In this work we propose an approach for social media analysis, especially for Twitter’s network. Our approach relies on two complementary steps: (i) a community identification based on a new community detection algorithm called Tribase , and (ii) an interactive community visualization, which provides gradual knowledge acquisition using our visualization tool, called NLCOMS . In order to assess the proposed approach, we have tested it on real-world data of the ANR Info-RSN project. This project is related to information propagation and community detection in Twitter’s network, more precisely on a collection of tweets dealing with media articles. The results show that our approach allows us to visually reveal the community structure and the related characteristics.

[1]  Jean-Daniel Fekete,et al.  MatrixExplorer: a Dual-Representation System to Explore Social Networks , 2006, IEEE Transactions on Visualization and Computer Graphics.

[2]  Philippe Castagliola,et al.  A Comparison of the Readability of Graphs Using Node-Link and Matrix-Based Representations , 2004 .

[3]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Matthieu Latapy,et al.  Theory and Practice of Triangle Problems in Very Large (Sparse (Power-Law)) Graphs , 2006, ArXiv.

[5]  Jure Leskovec,et al.  Community Detection in Networks with Node Attributes , 2013, 2013 IEEE 13th International Conference on Data Mining.

[6]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994 .

[7]  Ben Shneiderman,et al.  The eyes have it: a task by data type taxonomy for information visualizations , 1996, Proceedings 1996 IEEE Symposium on Visual Languages.

[8]  Arjan Kuijper,et al.  Visual Analysis of Large Graphs: State‐of‐the‐Art and Future Research Challenges , 2011, Eurographics.

[9]  Jinhu Lu,et al.  Complex Systems and Networks : Dynamics, Controls and Applications , 2015 .

[10]  Jure Leskovec,et al.  Empirical comparison of algorithms for network community detection , 2010, WWW '10.

[11]  M. Newman Analysis of weighted networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[12]  Andrea Lancichinetti,et al.  Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[13]  Guillaume Chelius,et al.  Triangles to Capture Social Cohesion , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[14]  Emden R. Gansner,et al.  Graphviz and Dynagraph – Static and Dynamic Graph Drawing Tools , 2003 .

[15]  Roberto Tamassia,et al.  Handbook on Graph Drawing and Visualization , 2013 .

[16]  Jean-Daniel Fekete,et al.  Task taxonomy for graph visualization , 2006, BELIV '06.

[17]  Przemyslaw Kazienko,et al.  An Introduction to Community Detection in Multi-layered Social Network , 2011, WSKS.

[18]  S. Fortunato,et al.  Resolution limit in community detection , 2006, Proceedings of the National Academy of Sciences.

[19]  Benoît Otjacques,et al.  Representation of graphs on a matrix layout , 2005, Ninth International Conference on Information Visualisation (IV'05).

[20]  Andrew E. Johnson,et al.  Visualizing the Evolution of Community Structures in Dynamic Social Networks , 2011, Comput. Graph. Forum.

[21]  Randy Goebel,et al.  Meerkat: Community Mining with Dynamic Social Networks , 2010, 2010 IEEE International Conference on Data Mining Workshops.

[22]  Ivan Herman,et al.  Graph Visualization and Navigation in Information Visualization: A Survey , 2000, IEEE Trans. Vis. Comput. Graph..

[23]  Helen C. Purchase,et al.  Metrics for Graph Drawing Aesthetics , 2002, J. Vis. Lang. Comput..

[24]  Gábor Csárdi,et al.  The igraph software package for complex network research , 2006 .

[25]  Stephen G. Kobourov,et al.  Force-Directed Drawing Algorithms , 2013, Handbook of Graph Drawing and Visualization.

[26]  Mathieu Bastian,et al.  Gephi: An Open Source Software for Exploring and Manipulating Networks , 2009, ICWSM.

[27]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[28]  Jacques Bertin,et al.  Semiologie graphique : les diagrammes les réseaux, les cartes , 1969 .

[29]  P. Ronhovde,et al.  Multiresolution community detection for megascale networks by information-based replica correlations. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[30]  Jean-Daniel Fekete,et al.  NodeTrix: a Hybrid Visualization of Social Networks , 2007, IEEE Transactions on Visualization and Computer Graphics.

[31]  Christian Staudt,et al.  NetworKit: An Interactive Tool Suite for High-Performance Network Analysis , 2014, ArXiv.

[32]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[33]  Nam P. Nguyen,et al.  Adaptive algorithms for detecting community structure in dynamic social networks , 2011, 2011 Proceedings IEEE INFOCOM.

[34]  Réka Albert,et al.  Near linear time algorithm to detect community structures in large-scale networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[35]  Santo Fortunato,et al.  Limits of modularity maximization in community detection , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[36]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[37]  Benoît Otjacques,et al.  Branch-and-bound algorithm for the maximum triangle packing problem , 2015, Comput. Ind. Eng..

[38]  Andrea Lancichinetti,et al.  Community detection algorithms: a comparative analysis: invited presentation, extended abstract , 2009, VALUETOOLS.

[39]  Camille Roth,et al.  Natural Scales in Geographical Patterns , 2017, Scientific Reports.

[40]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[41]  Matthieu Latapy,et al.  Computing Communities in Large Networks Using Random Walks , 2004, J. Graph Algorithms Appl..

[42]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[43]  L. Freeman,et al.  The Development of Social Network Analysis: A Study in the Sociology of Science , 2005 .

[44]  Tore Opsahl,et al.  Clustering in weighted networks , 2009, Soc. Networks.

[45]  Tamara G. Kolda,et al.  Using Triangles to Improve Community Detection in Directed Networks , 2014, ArXiv.

[46]  Zhi-Zhong Chen,et al.  An improved randomized approximation algorithm for maximum triangle packing , 2009, Discret. Appl. Math..

[47]  Jörg Lässig,et al.  DenGraph-HO: Density-based Hierarchical Community Detection for Explorative Visual Network Analysis , 2011, SGAI Conf..

[48]  Jure Leskovec,et al.  Mining of Massive Datasets, 2nd Ed , 2014 .

[49]  Ben Shneiderman,et al.  Analyzing Social Media Networks with NodeXL: Insights from a Connected World , 2010 .

[50]  Carl T. Bergstrom,et al.  The map equation , 2009, 0906.1405.

[51]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[52]  Claudio Castellano,et al.  Defining and identifying communities in networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[53]  F. Radicchi,et al.  Benchmark graphs for testing community detection algorithms. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.