Detecting Radical Text over Online Media using Deep Learning

Social Media has influenced the way people socially connect, interact and opinionize. The growth in technology has enhanced communication and dissemination of information. Unfortunately,many terror groups like jihadist communities have started consolidating a virtual community online for various purposes such as recruitment, online donations, targeting youth online and spread of extremist ideologies. Everyday a large number of articles, tweets, posts, posters, blogs, comments, views and news are posted online without a check which in turn imposes a threat to the security of any nation. However, different agencies are working on getting down this radical content from various online social media platforms. The aim of our paper is to utilise deep learning algorithm in detection of radicalization contrary to the existing works based on machine learning algorithms. An LSTM based feed forward neural network is employed to detect radical content. We collected total 61601 records from various online sources constituting news, articles and blogs. These records are annotated by domain experts into three categories: Radical(R), Non-Radical (NR) and Irrelevant(I) which are further applied to LSTM based network to classify radical content. A precision of 85.9% has been achieved with the proposed approach

[1]  Harith Alani,et al.  Contextual Semantics for Radicalisation Detection on Twitter , 2018, SW4SG@ISWC.

[2]  Vivek Venkatesh,et al.  Exposure to Extremist Online Content Could Lead to Violent Radicalization:A Systematic Review of Empirical Evidence , 2018, International Journal of Developmental Science.

[3]  Juan Manuel Rodriguez,et al.  Textual Aggression Detection through Deep Learning , 2018, TRAC@COLING 2018.

[4]  Mahmoud Barhamgi,et al.  Social networks data analysis with semantics: application to the radicalization problem , 2018, Journal of Ambient Intelligence and Humanized Computing.

[5]  Zellig S. Harris,et al.  Distributional Structure , 1954 .

[6]  Javier Del Ser,et al.  On the Design and Tuning of Machine Learning Models for Language Toxicity Classification in Online Platforms , 2018, IDC.

[7]  Peng Zhou,et al.  Text Classification Improved by Integrating Bidirectional LSTM with Two-dimensional Max Pooling , 2016, COLING.

[8]  Zhiyuan Liu,et al.  A C-LSTM Neural Network for Text Classification , 2015, ArXiv.

[9]  Njagi Dennis Gitari,et al.  A Lexicon-based Approach for Hate Speech Detection , 2015, MUE 2015.

[10]  Zellig S. Harris,et al.  Distributional Structure , 1954 .

[11]  Erkki Sutinen,et al.  Automatic Detection of Antisocial Behaviour in Texts , 2014, Informatica.

[12]  Ashish Sureka,et al.  Using KNN and SVM Based One-Class Classifier for Detecting Online Radicalization on Twitter , 2015, ICDCIT.

[13]  Taghi M. Khoshgoftaar,et al.  Deep learning applications and challenges in big data analytics , 2015, Journal of Big Data.

[14]  Arie W. Kruglanski,et al.  Terrorism, radicalization, and de-radicalization , 2016 .

[15]  Uffe Kock Wiil,et al.  Detecting Social Polarization and Radicalization , 2011 .

[16]  Timothy W. Finin,et al.  Why we twitter: understanding microblogging usage and communities , 2007, WebKDD/SNA-KDD '07.

[17]  Bo Li,et al.  Research on online supporting community of extreme organization by AI-SNA based method , 2017, 2017 8th IEEE International Conference on Software Engineering and Service Science (ICSESS).

[18]  Ritesh Kumar,et al.  Aggression-annotated Corpus of Hindi-English Code-mixed Data , 2018, LREC.

[19]  Theodoros Rekatsinas,et al.  Deep Learning for Entity Matching: A Design Space Exploration , 2018, SIGMOD Conference.

[20]  Harith Alani,et al.  Understanding the Roots of Radicalisation on Twitter , 2018, WebSci.

[21]  Jun Zhao,et al.  How to Generate a Good Word Embedding , 2015, IEEE Intelligent Systems.

[22]  Arjun Mukherjee,et al.  Mining contentions from discussions and debates , 2012, KDD.