Kek, Cucks, and God Emperor Trump: A Measurement Study of 4chan's Politically Incorrect Forum and Its Effects on the Web

The discussion-board site 4chan has been part of the Internet's dark underbelly since its inception, and recent political events have put it increasingly in the spotlight. In particular, /pol/, the "Politically Incorrect" board, has been a central figure in the outlandish 2016 US election season, as it has often been linked to the alt-right movement and its rhetoric of hate and racism. However, 4chan remains relatively unstudied by the scientific community: little is known about its user base, the content it generates, and how it affects other parts of the Web. In this paper, we start addressing this gap by analyzing /pol/ along several axes, using a dataset of over 8M posts we collected over two and a half months. First, we perform a general characterization, showing that /pol/ users are well distributed around the world and that 4chan's unique features encourage fresh discussions. We also analyze content, finding, for instance, that YouTube links and hate speech are predominant on /pol/. Overall, our analysis not only provides the first measurement study of /pol/, but also insight into online harassment and hate speech trends in social media.

[1]  Jure Leskovec,et al.  Antisocial Behavior in Online Discussion Communities , 2015, ICWSM.

[2]  Krishna P. Gummadi,et al.  The Many Shades of Anonymity: Characterizing Anonymous Social Media Content , 2021, ICWSM.

[3]  Denis Gordeev Automatic verbal aggression detection for Russian and American imageboards , 2016, ArXiv.

[4]  Shivakant Mishra,et al.  Analyzing Negative User Behavior in a Semi-anonymous Social Network , 2014, ArXiv.

[5]  Michael S. Bernstein,et al.  4chan and /b/: An Analysis of Anonymity and Ephemerality in a Large Online Community , 2011, ICWSM.

[6]  Jing Zhou,et al.  Hate Speech Detection with Comment Embeddings , 2015, WWW.

[7]  C. D. Kemp,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[8]  B. Lewis,et al.  Ethical research standards in a world of big data , 2014, F1000Research.

[9]  Bernard W. Silverman,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[10]  Phyllis B. Gerstenfeld,et al.  Hate Online: A Content Analysis of Extremist Internet Sites , 2003 .

[11]  Rodmonga Potapova,et al.  Determination of the Internet Anonymity Influence on the Level of Aggression and Usage of Obscene Lexis , 2015, ArXiv.

[12]  Navneet Kaur,et al.  Opinion mining and sentiment analysis , 2016, 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom).

[13]  Denis Gordeev,et al.  Detecting State of Aggression in Sentences Using CNN , 2016, SPECOM.

[14]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[15]  Joel R. Tetreault,et al.  Abusive Language Detection in Online User Content , 2016, WWW.

[16]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[17]  Haewoon Kwak,et al.  STFU NOOB!: predicting crowdsourced decisions on toxic behavior in online games , 2014, WWW.

[18]  Aleksandra Korolova,et al.  Cloak and Swagger: Understanding Data Sensitivity through the Lens of User Anonymity , 2014, 2014 IEEE Symposium on Security and Privacy.

[19]  Filippo Menczer,et al.  Clustering memes in social media , 2013, 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013).