Identifying Dark Web clusters with temporal coherence analysis

Extremists are actively utilizing social media as propaganda to promote their ideologies. Online forums are ideal platforms to draw attention from worldwide Internet users to the timely issues and some opinions in these discussions can be threatening the public safety. It is of great interest for the intelligence to identify clusters on these forums and capture the topics of discussions and their development. Previous work in cluster identification focused on social networks constructed by the direct interactions between users utilizing link analysis techniques. However, the direct interactions between users may only capture one potential relationship between forum users. Users who share common interests may not necessarily interact with each other directly. On the other hand, they may be active in similar events simultaneously. In this paper, we propose a temporal coherence analysis approach to identify clusters of users from the Dark Web data. Users are represented as vectors of activeness and clusters are extracted with the support of temporal coherence analysis. We tested our proposed methods on both synthetic dataset and real world dataset. Using the real-world Dark Web dataset, three clusters were identified and each cluster was also associated with a specific theme. It shows that a cluster of users participating in a theme of discussion can be discovered without using any content analysis but only using temporal analysis.

[1]  Philip S. Yu,et al.  GraphScope: parameter-free mining of large time-evolving graphs , 2007, KDD '07.

[2]  John Scott Social Network Analysis , 1988 .

[3]  Christopher C. Yang,et al.  Following the Social Media: Aspect Evolution of Online Discussion , 2011, SBP.

[4]  Tanya Y. Berger-Wolf,et al.  A framework for community identification in dynamic social networks , 2007, KDD '07.

[5]  Philip S. Yu,et al.  Online Analysis of Community Evolution in Data Streams , 2005, SDM.

[6]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Ravi Kumar,et al.  Dynamics of conversations , 2010, KDD.

[8]  The National Academy of Sciences , 1928, Science.

[9]  Mark Newman,et al.  Detecting community structure in networks , 2004 .

[10]  Tobun Dorbin Ng,et al.  Web opinions analysis with scalable distance-based clustering , 2009, 2009 IEEE International Conference on Intelligence and Security Informatics.

[11]  Christopher C. Yang,et al.  An analysis of user influence ranking algorithms on Dark Web forums , 2010, ISI-KDD '10.

[12]  Christopher C. Yang,et al.  Identifing influential users in an online healthcare social network , 2010, 2010 IEEE International Conference on Intelligence and Security Informatics.