论文信息 - Detecting and Monitoring Hate Speech in Twitter

Detecting and Monitoring Hate Speech in Twitter

Social Media are sensors in the real world that can be used to measure the pulse of societies. However, the massive and unfiltered feed of messages posted in social media is a phenomenon that nowadays raises social alarms, especially when these messages contain hate speech targeted to a specific individual or group. In this context, governments and non-governmental organizations (NGOs) are concerned about the possible negative impact that these messages can have on individuals or on the society. In this paper, we present HaterNet, an intelligent system currently being used by the Spanish National Office Against Hate Crimes of the Spanish State Secretariat for Security that identifies and monitors the evolution of hate speech in Twitter. The contributions of this research are many-fold: (1) It introduces the first intelligent system that monitors and visualizes, using social network analysis techniques, hate speech in Social Media. (2) It introduces a novel public dataset on hate speech in Spanish consisting of 6000 expert-labeled tweets. (3) It compares several classification approaches based on different document representation strategies and text classification models. (4) The best approach consists of a combination of a LTSM+MLP neural network that takes as input the tweet’s word, emoji, and expression tokens’ embeddings enriched by the tf-idf, and obtains an area under the curve (AUC) of 0.828 on our dataset, outperforming previous methods presented in the literature.

[1] J. R. Landis,et al. The measurement of observer agreement for categorical data. , 1977, Biometrics.

[2] Mihai Surdeanu,et al. The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[3] Andrew B. Whinston,et al. Designing a social-broadcasting-based business intelligence system , 2011, TMIS.

[4] H. Varian,et al. Predicting the Present with Google Trends , 2009 .

[5] Munmun De Choudhury,et al. Analyzing the Dynamics of Communication in Online Social Networks , 2010, Handbook of Social Network Technologies.

[6] Zeerak Waseem,et al. Are You a Racist or Am I Seeing Things? Annotator Influence on Hate Speech Detection on Twitter , 2016, NLP+CSS@EMNLP.

[7] Bernard J. Jansen,et al. Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for Identifying and Classifying Hate in Online News Media , 2018, ICWSM.

[8] Paolo Rosso,et al. Detecting Deceptive Opinions: Intra and Cross-Domain Classification Using an Efficient Representation , 2017, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[9] Leo Breiman,et al. Random Forests , 2001, Machine Learning.

[10] Geoffrey E. Hinton,et al. Visualizing Data using t-SNE , 2008 .

[11] M. Kaminski. The right to explanation, explained , 2018, Research Handbook on Information Law and Governance.

[12] Vivek Narayanan,et al. Fast and Accurate Sentiment Classification Using an Enhanced Naive Bayes Model , 2013, IDEAL.

[13] Johan Bollen,et al. Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[14] Felice Dell'Orletta,et al. Hate Me, Hate Me Not: Hate Speech Detection on Facebook , 2017, ITASEC.

[15] Dirk Neumann,et al. Crime Mapping through Geo-Spatial Social Media Activity , 2014, ICIS.

[16] Michael I. Jordan,et al. Machine learning: Trends, perspectives, and prospects , 2015, Science.

[17] Björn Gambäck,et al. Using Convolutional Neural Networks to Classify Hate-Speech , 2017, ALW@ACL.

[18] A. Downs. Up and Down with Ecology--The Issue Attention Cycle , 1972 .

[19] Kai Wu,et al. Social Media as Sensor in Real World: Geolocate User with Microblog , 2014, NLPCC.

[20] Fabrício Benevenuto,et al. Analyzing the Targets of Hate in Online Social Media , 2016, ICWSM.

[21] Panagiotis Takis Metaxas,et al. How (Not) to Predict Elections , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[22] Harith Alani,et al. Evaluation Datasets for Twitter Sentiment Analysis: A survey and a new dataset, the STS-Gold , 2013, ESSEM@AI*IA.

[23] Helmut Schmidt,et al. Probabilistic part-of-speech tagging using decision trees , 1994 .

[24] Wenpu Xing,et al. Weighted PageRank algorithm , 2004, Proceedings. Second Annual Conference on Communication Networks and Services Research, 2004..

[25] Efthimios Tambouris,et al. Understanding the Predictive Power of Social Media This is a pre-print version of the following article : , 2013 .

[26] Eric P. Xing,et al. Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , 2014, ACL 2014.

[27] Matthew S. Gerber,et al. Predicting crime using Twitter and kernel density estimation , 2014, Decis. Support Syst..

[28] Sérgio Nunes,et al. A Survey on Automatic Detection of Hate Speech in Text , 2018, ACM Comput. Surv..

[29] David Robinson,et al. Detecting Hate Speech on Twitter Using a Convolution-GRU Based Deep Neural Network , 2018, ESWC.

[30] Walter Daelemans,et al. Pattern for Python , 2012, J. Mach. Learn. Res..

[31] Haojie Zhu,et al. A Spatio-Temporal Kernel Density Estimation Framework for Predictive Crime Hotspot Mapping and Evaluation , 2018, Applied Geography.

[32] Ingmar Weber,et al. Automated Hate Speech Detection and the Problem of Offensive Language , 2017, ICWSM.

[33] Xinyu Chen,et al. Crime prediction using Twitter sentiment and weather , 2015, 2015 Systems and Information Engineering Design Symposium.

[34] Connie St Louis,et al. Can Twitter predict disease outbreaks? , 2012, BMJ : British Medical Journal.

[35] Huan Liu,et al. Feature Selection and Classification - A Probabilistic Wrapper Approach , 1996, IEA/AIE.

[36] Spencer Ch. The Utility of Hotspot Mapping for Predicting Spatial Patterns of Crime , 2008 .