Algorithmic content moderation: Technical and political challenges in the automation of platform governance

As government pressure on major technology companies builds, both firms and legislators are searching for technical solutions to difficult platform governance puzzles such as hate speech and misinformation. Automated hash-matching and predictive machine learning tools – what we define here as algorithmic moderation systems – are increasingly being deployed by major user-generated content platforms such as Facebook, YouTube and Twitter to conduct content moderation at scale. This article provides an accessible technical primer on how algorithmic moderation works; examines some of the existing automated tools used by major platforms to handle copyright infringement, terrorism and toxic speech; and identifies key political and ethical issues for these systems as reliance on them grows. Recent events suggest that algorithmic moderation has become necessary to manage growing public expectations for increased platform responsibility, safety and security on the global stage; however, as we demonstrate, these systems remain opaque, unaccountable and poorly understood. Despite the potential promise of algorithms or ‘AI’, we show that even ‘well optimized’ moderation systems could exacerbate, rather than relieve, many existing problems with content policy as enacted by platforms, for three main reasons: automated moderation threatens to (a) further increase opacity, making a famously non-transparent set of practices even more difficult to understand or audit; (b) further complicate outstanding issues of fairness and justice in large-scale sociotechnical systems; and (c) re-obscure the fundamentally political nature of speech decisions being executed at scale.
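To make the two families of tools named above concrete, the following minimal Python sketch pairs an exact-hash lookup against a hypothetical database of known prohibited content with a simple threshold on a stand-in classifier score. Everything in it is illustrative: the hash list, the threshold, the toy scoring function and the enforcement responses are assumptions for exposition, not any platform's actual pipeline, and production systems typically rely on perceptual rather than cryptographic hashes so that fingerprints survive re-encoding or cropping.

# Minimal illustrative sketch of the two tool families described above.
# (1) Hash matching: an uploaded file is fingerprinted and compared against a
#     database of fingerprints of previously identified prohibited content.
# (2) Prediction: a trained classifier scores new, previously unseen content,
#     and the score is thresholded into an enforcement decision.
# The database contents, threshold and scoring function are hypothetical.

import hashlib
from typing import Callable

# Hypothetical hash database of known prohibited files (standing in for shared
# industry hash lists). An exact SHA-256 hash is used only to keep the sketch
# self-contained; real deployments use perceptual hashes.
KNOWN_PROHIBITED_HASHES = {
    "9f86d081884c7d659a2feaa0c55ad015a3bf4f1b2b0b822cd15d6c15b0f00a08",  # placeholder entry
}

def fingerprint(content: bytes) -> str:
    """Return a SHA-256 fingerprint of the uploaded bytes."""
    return hashlib.sha256(content).hexdigest()

def hash_match(content: bytes) -> bool:
    """Flag content whose fingerprint appears in the shared database."""
    return fingerprint(content) in KNOWN_PROHIBITED_HASHES

def predictive_flag(text: str, score_fn: Callable[[str], float], threshold: float = 0.8) -> bool:
    """Flag text when a classifier-style score crosses an arbitrary threshold.

    `score_fn` stands in for a trained model (e.g. a toxicity classifier).
    """
    return score_fn(text) >= threshold

def moderate(upload_bytes: bytes, upload_text: str, score_fn: Callable[[str], float]) -> str:
    """Combine both checks into a single, highly simplified decision."""
    if hash_match(upload_bytes):
        return "remove: matched known prohibited content"
    if predictive_flag(upload_text, score_fn):
        return "queue for human review: high model score"
    return "allow"

if __name__ == "__main__":
    # A toy scoring function standing in for a real classifier.
    toy_score = lambda text: 0.9 if "attack" in text.lower() else 0.1
    print(moderate(b"example upload", "ordinary comment", toy_score))   # allow
    print(moderate(b"example upload", "a personal attack", toy_score))  # queued for review

The questions the article raises sit largely outside such a sketch: who curates the hash database, how the threshold is set, and what recourse users have when either branch misfires – which is precisely where the concerns about opacity, fairness and the political nature of these decisions arise.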
