Understanding Characteristics of Biased Sentences in News Articles

Providing balanced and good quality news articles to readers is an important challenge in news recommendation. Often, readers tend to select and read articles which confirm their social environment and their political beliefs. This issue is also known as filter bubble. As a remedy, initial approaches towards automatically detecting bias in news articles have been developed. Obtaining a suitable ground truth for such a task is however difficult. In this paper, we describe ground truth dataset created with the help of crowd-sourcing for fostering research on bias detection and removal from news content. We then analyze the characteristics of the user annotations, in particular concerning bias-inducing words. Our results indicate that determining bias-induced words is subjective to certain degree and that a high agreement on all bias-inducing words of all readers is hard to obtain. We also study the discriminative characteristics of biased content and find that linguistic features, such as negative words, tend to be indicative for bias.

[1]  Elisha Elovic,et al.  Testing and Comparing Computational Approaches for Identifying the Language of Framing in Political News , 2015, NAACL.

[2]  Daniel Jurafsky,et al.  Linguistic Models for Analyzing and Detecting Biased Language , 2013, ACL.

[3]  M. Allen,et al.  Media bias in presidential elections: a meta‐analysis , 2000 .

[4]  Tim Groseclose,et al.  A Measure of Media Bias , 2005 .

[5]  Stefanie Nowak,et al.  How reliable are annotations via crowdsourcing: a study about inter-annotator agreement for multi-label image annotation , 2010, MIR '10.

[6]  Benoît Sagot,et al.  Annotating omission in statement pairs , 2017, LAW@ACL.

[7]  J. Crowcroft,et al.  Visualizing Media Bias through Twitter , 2012, Proceedings of the International AAAI Conference on Web and Social Media.

[8]  Masatoshi Yoshikawa,et al.  News Bias Analysis Based on Stakeholder Mining , 2011, IEICE Trans. Inf. Syst..

[9]  Arkaitz Zubiaga,et al.  Crowdsourcing the Annotation of Rumourous Conversations in Social Media , 2015, WWW.

[10]  Steven Brakman,et al.  The impact of newspapers on consumer confidence: does spin bias exist? , 2004, SSRN Electronic Journal.

[11]  Seungwoo Kang,et al.  NewsCube: delivering multiple aspects of news to mitigate media bias , 2009, CHI.

[12]  W. Bennett News, the politics of illusion , 1983 .

[13]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[14]  Dan Klein,et al.  Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.

[15]  Norman Meuschke,et al.  Matrix-Based News Aggregation: Exploring Different News Perspectives , 2017, 2017 ACM/IEEE Joint Conference on Digital Libraries (JCDL).

[16]  Jesse M. Shapiro,et al.  Media Bias and Reputation , 2005, Journal of Political Economy.