Ranking Radically Influential Web Forum Users

The growing popularity of online social media is leading to its widespread use among the online community for various purposes. In the recent past, it has been found that the web is also being used as a tool by radical or extremist groups and users to practice several kinds of mischievous acts with concealed agendas and promote ideologies in a sophisticated manner. Some of the web forums are predominantly being used for open discussions on critical issues influenced by radical thoughts. The influential users dominate and influence the newly joined innocent users through their radical thoughts. This paper presents an application of collocation theory to identify radically influential users in web forums. The radicalness of a user is captured by a measure based on the degree of match of the commented posts with a threat list. Eleven different collocation metrics are formulated to identify the association among users, and they are finally embedded in a customized PageRank algorithm to generate a ranked list of radically influential users. The experiments are conducted on a standard data set provided for a challenge at ISI-KDD'12 workshop to find radical and infectious threads, members, postings, ideas, and ideologies. Experimental results show that our proposed method outperforms the existing UserRank algorithm. We also found that the collocation theory is more effective to deal with such ranking problem than the textual and temporal similarity-based measures studied earlier.

[1]  Hsinchun Chen,et al.  Affect Analysis of Web Forums and Blogs Using Correlation Ensembles , 2008, IEEE Transactions on Knowledge and Data Engineering.

[2]  Muhammad Abulaish,et al.  Modeling a Web Forum Ecosystem into an Enriched Social Graph , 2012, MSM/MUSE.

[3]  M. Gardell,et al.  The Emergence of a Euro-American Radical Right , 2000 .

[4]  Masahiro Kimura,et al.  Extracting Influential Nodes for Information Diffusion on a Social Network , 2007, AAAI.

[5]  Matthew Richardson,et al.  Mining the network value of customers , 2001, KDD '01.

[6]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[7]  Yung-Ming Li,et al.  Identifying influential reviewers for word-of-mouth marketing , 2010, Electron. Commer. Res. Appl..

[8]  Sebastián A. Ríos,et al.  Topic-based social network analysis for virtual communities of interests in the Dark Web , 2010, KDD 2010.

[9]  D. Green,et al.  Studying Hate Crime with the Internet: What Makes Racists Advocate Racial Violence? , 2002 .

[10]  Hsinchun Chen,et al.  Analyzing terror campaigns on the internet: Technical sophistication, content richness, and Web interactivity , 2007, Int. J. Hum. Comput. Stud..

[11]  Muhammad Abulaish,et al.  Identifying cliques in dark web forums - An agglomerative clustering approach , 2012, 2012 IEEE International Conference on Intelligence and Security Informatics.

[12]  D. B. Skillicorn Applying interestingness measures to Ansar forum texts , 2010, ISI-KDD '10.

[13]  Mark S. Ackerman,et al.  Expertise networks in online communities: structure and algorithms , 2007, WWW '07.

[14]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[15]  Hsinchun Chen,et al.  The Dark Web Portal Project: Collecting and Analyzing the Presence of Terrorist Groups on the Web , 2005, ISI.

[16]  Chris Volinsky,et al.  Network-Based Marketing: Identifying Likely Adopters Via Consumer Networks , 2006, math/0606278.

[17]  Matthew Richardson,et al.  Mining knowledge-sharing sites for viral marketing , 2002, KDD.

[18]  Ronald Fagin,et al.  Comparing top k lists , 2003, SODA '03.

[19]  Martin Theobald,et al.  From Focused Crawling to Expert Information: an Application Framework for Web Exploration and Portal Generation , 2003, VLDB.

[20]  Christopher C. Yang,et al.  An analysis of user influence ranking algorithms on Dark Web forums , 2010, ISI-KDD '10.

[21]  Hsinchun Chen,et al.  Uncovering the dark Web: A case study of Jihad on the Web , 2008, J. Assoc. Inf. Sci. Technol..

[22]  Steve Kramer,et al.  Anomaly detection in extremist web forums using a dynamical systems approach , 2010, ISI-KDD '10.

[23]  Ellen M. Voorhees,et al.  The TREC-8 Question Answering Track Report , 1999, TREC.

[24]  Michael Trusov,et al.  Determining Influential Users in Internet Social Networks , 2010 .

[25]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[26]  Christopher C. Yang,et al.  Identifing influential users in an online healthcare social network , 2010, 2010 IEEE International Conference on Intelligence and Security Informatics.

[27]  Hsinchun Chen,et al.  A Framework for Exploring Gray Web Forums: Analysis of Forum-Based Communities in Taiwan , 2006, ISI.

[28]  Philip S. Yu,et al.  Identifying the influential bloggers in a community , 2008, WSDM '08.

[29]  Hsinchun Chen,et al.  A hybrid approach to Web forum interactional coherence analysis , 2008, J. Assoc. Inf. Sci. Technol..

[30]  Éva Tardos,et al.  Influential Nodes in a Diffusion Model for Social Networks , 2005, ICALP.

[31]  Tim Oates,et al.  Modeling the Spread of Influence on the Blogosphere , 2006 .

[32]  Kristina Lerman,et al.  Predicting Influential Users in Online Social Networks , 2010, ArXiv.

[33]  Krishna P. Gummadi,et al.  Measuring User Influence in Twitter: The Million Follower Fallacy , 2010, ICWSM.

[34]  Hsinchun Chen,et al.  A focused crawler for Dark Web forums , 2010, J. Assoc. Inf. Sci. Technol..

[35]  Hsinchun Chen,et al.  A multi-region empirical study on the internet presence of global extremist organizations , 2011, Inf. Syst. Frontiers.

[36]  Sebastián A. Ríos,et al.  Topic-based social network analysis for virtual communities of interests in the Dark Web , 2010, ISI-KDD '10.

[37]  Armelle Brun,et al.  Detecting Leaders in Behavioral Networks , 2010, 2010 International Conference on Advances in Social Networks Analysis and Mining.