Citation Needed: A Taxonomy and Algorithmic Assessment of Wikipedia's Verifiability

Wikipedia is playing an increasingly central role on the web, and the policies its contributors follow when sourcing and fact-checking content affect millions of readers. Among these core guiding principles, verifiability policies have a particularly important role. Verifiability requires that information included in a Wikipedia article be corroborated against reliable secondary sources. Because of the manual labor needed to curate Wikipedia at scale, however, its contents do not always comply evenly with these policies. Citations (i.e., references to external sources) may not conform to verifiability requirements or may be missing altogether, potentially weakening the reliability of specific topic areas of the free encyclopedia. In this paper, we aim to provide an empirical characterization of how and why Wikipedia cites external sources to comply with its own verifiability guidelines. First, we construct a taxonomy of reasons why inline citations are required, by collecting labeled data from editors of multiple Wikipedia language editions. We then crowdsource a large-scale dataset of Wikipedia sentences annotated with categories derived from this taxonomy. Finally, we design algorithmic models to determine whether a statement requires a citation and to predict the citation reason. We evaluate the accuracy of these models across different classes of Wikipedia articles of varying quality, and on external datasets of claims annotated for fact-checking purposes.
