Topic-based social network analysis for virtual communities of interests in the Dark Web

Studying extremist groups and their interactions is crucial for maintaining homeland security and peace. Tools such as social network analysis and text mining have improved our understanding of these groups and support the development of counter-terrorism applications. This work addresses the problem of extracting topic-based key members from a community, using a method that combines text mining and social network analysis techniques. First, latent Dirichlet allocation is applied to build two topic-based social networks from online forums: one oriented towards the point of view of the thread creator, and the other oriented towards the repliers of the overall forum. Then, using different network analysis measures, topic-based key members are evaluated against a benchmark social network built from a plain representation of the network of posts. Experiments were successfully performed on an English-language forum available in the Dark Web portal.
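
The two-network construction described above can be illustrated with a minimal sketch. It is not the authors' implementation: the toy posts, the per-post topic weights (which in practice would be produced by latent Dirichlet allocation), the threshold, the edge definitions, and the helper names `build_networks` and `in_degree_centrality` are all assumptions made for illustration. A reply whose weight for the chosen topic passes the threshold adds an edge to the thread creator in the first network and to earlier repliers of the same thread in the second; members are then ranked by normalized in-degree centrality.

```python
from collections import defaultdict

# Hypothetical toy forum: (thread id, author, per-topic weights).
# In practice the weights would come from an LDA model; here they are made up.
POSTS = [
    ("t1", "alice", [0.9, 0.1]),  # alice opens thread t1
    ("t1", "bob",   [0.8, 0.2]),
    ("t1", "carol", [0.7, 0.3]),
    ("t2", "bob",   [0.2, 0.8]),  # bob opens thread t2
    ("t2", "alice", [0.1, 0.9]),
]


def build_networks(posts, topic, threshold=0.5):
    """Build two weighted, directed edge maps for one topic (assumed scheme).

    creator_net: replier -> thread creator
    replier_net: replier -> earlier repliers in the same thread
    Only replies whose weight for `topic` is at least `threshold` add edges.
    """
    creator_net = defaultdict(float)
    replier_net = defaultdict(float)
    authors_by_thread = {}  # thread id -> authors in posting order
    for thread, author, weights in posts:
        authors = authors_by_thread.setdefault(thread, [])
        if authors and weights[topic] >= threshold:
            creator = authors[0]
            if author != creator:
                creator_net[(author, creator)] += weights[topic]
            for earlier in authors[1:]:  # skip the creator at index 0
                if earlier != author:
                    replier_net[(author, earlier)] += weights[topic]
        authors.append(author)
    return dict(creator_net), dict(replier_net)


def in_degree_centrality(edges):
    """Normalized in-degree: distinct incoming neighbours / (n - 1)."""
    nodes = {node for edge in edges for node in edge}
    if len(nodes) < 2:
        return {node: 0.0 for node in nodes}
    incoming = defaultdict(set)
    for source, target in edges:
        incoming[target].add(source)
    return {node: len(incoming[node]) / (len(nodes) - 1) for node in nodes}


creator_net, replier_net = build_networks(POSTS, topic=0)
ranking = sorted(in_degree_centrality(creator_net).items(),
                 key=lambda item: (-item[1], item[0]))
print(ranking)
```

On this toy data, `alice` ranks first for topic 0 in the creator-oriented network because both on-topic replies point to her thread. Other centrality measures could be substituted for in-degree without changing how the networks are built.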
