论文信息 - Detecting Radical Text over Online Media using Deep Learning

Detecting Radical Text over Online Media using Deep Learning

Social Media has influenced the way people socially connect, interact and opinionize. The growth in technology has enhanced communication and dissemination of information. Unfortunately,many terror groups like jihadist communities have started consolidating a virtual community online for various purposes such as recruitment, online donations, targeting youth online and spread of extremist ideologies. Everyday a large number of articles, tweets, posts, posters, blogs, comments, views and news are posted online without a check which in turn imposes a threat to the security of any nation. However, different agencies are working on getting down this radical content from various online social media platforms. The aim of our paper is to utilise deep learning algorithm in detection of radicalization contrary to the existing works based on machine learning algorithms. An LSTM based feed forward neural network is employed to detect radical content. We collected total 61601 records from various online sources constituting news, articles and blogs. These records are annotated by domain experts into three categories: Radical(R), Non-Radical (NR) and Irrelevant(I) which are further applied to LSTM based network to classify radical content. A precision of 85.9% has been achieved with the proposed approach

Divya Bansal | Jaspal Kaur Saini | Armaan Kaur

[1] Harith Alani,et al. Contextual Semantics for Radicalisation Detection on Twitter , 2018, SW4SG@ISWC.

[2] Vivek Venkatesh,et al. Exposure to Extremist Online Content Could Lead to Violent Radicalization:A Systematic Review of Empirical Evidence , 2018, International Journal of Developmental Science.

[3] Juan Manuel Rodriguez,et al. Textual Aggression Detection through Deep Learning , 2018, TRAC@COLING 2018.

[4] Mahmoud Barhamgi,et al. Social networks data analysis with semantics: application to the radicalization problem , 2018, Journal of Ambient Intelligence and Humanized Computing.

[5] Zellig S. Harris,et al. Distributional Structure , 1954 .

[6] Javier Del Ser,et al. On the Design and Tuning of Machine Learning Models for Language Toxicity Classification in Online Platforms , 2018, IDC.

[7] Peng Zhou,et al. Text Classification Improved by Integrating Bidirectional LSTM with Two-dimensional Max Pooling , 2016, COLING.

[8] Zhiyuan Liu,et al. A C-LSTM Neural Network for Text Classification , 2015, ArXiv.

[9] Njagi Dennis Gitari,et al. A Lexicon-based Approach for Hate Speech Detection , 2015, MUE 2015.

[10] Zellig S. Harris,et al. Distributional Structure , 1954 .

[11] Erkki Sutinen,et al. Automatic Detection of Antisocial Behaviour in Texts , 2014, Informatica.