Assessing Media Bias in Cross-Linguistic and Cross-National Populations

Media bias is a worldwide concern. Although automated methods exist for the analysis of various forms of media bias, language is still an important barrier toward spotting worldwide differences in reporting. In this paper, we propose a methodology based on word embeddings, lexicon translation, and document similarity to assess media bias in news articles published in different idioms. We model media bias under the perspective of subjective language use, i.e., the more subjective the content of a news article is, the more biased it is. Our core assumption is that news articles reporting the same events, but written in different languages, should have similar levels of subjectivity; otherwise, we may have spotted biased text. Our method consists of using translated versions of subjectivity lexicons that were originally constructed for measuring subjectivity in the Brazilian Portuguese language. We evaluate our approach on two labeled data sets to show that our method is valid and apply our methodology to analyze recent and largely resounded topics, such as the Venezuela crisis and Syrian war, on four distinct idioms: Portuguese, German, English, and Spanish.

[1]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[2]  Daniel A. Keim,et al.  Visual Analysis of Explicit Opinion and News Bias in German Soccer Articles , 2012, EuroVA@EuroVis.

[3]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[4]  Matt J. Kusner,et al.  From Word Embeddings To Document Distances , 2015, ICML.

[5]  Bruno Pouliquen,et al.  Sentiment Analysis in the News , 2010, LREC.

[6]  Qinmin Hu,et al.  Mining Temporal Discriminant Frames via Joint Matrix Factorization: A Case Study of Illegal Immigration in the U.S. News Media , 2018, KSEM.

[7]  Leandro Balby Marinho,et al.  Media Bias Characterization in Brazilian Presidential Elections , 2019, SIdEWayS@HT.

[8]  Bela Gipp,et al.  Automated identification of media bias in news articles: an interdisciplinary literature review , 2018, International Journal on Digital Libraries.

[9]  David Lazer,et al.  More Voices Than Ever? Quantifying Media Bias in Networks , 2011, ICWSM.

[10]  Hans Uszkoreit,et al.  TQ-AutoTest – An Automated Test Suite for (Machine) Translation Quality , 2018, LREC.

[11]  S. Greenstein,et al.  Collective Intelligence and Neutral Point of View: The Case of Wikipedia , 2012 .

[12]  D. Murphey Bias: A CBS Insider Exposes How the Media Distort the News , 2002 .

[13]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[14]  Multilingual Subjectivity Detection Using Deep Multiple Kernel Learning , 2015 .

[15]  R. Benson Journalism: Normative Theories , 2008 .

[16]  Arvind Narayanan,et al.  Semantics derived automatically from language corpora contain human-like biases , 2016, Science.

[17]  L. d’Haenens,et al.  Refugees in the news: Comparing Belgian and Swedish newspaper coverage of the European refugee situation during summer 2015 , 2018, Communications.

[18]  Ralf Krestel,et al.  Identifying Media Bias by Analyzing Reported Speech , 2017, 2017 IEEE International Conference on Data Mining (ICDM).

[19]  Paolo Gastaldo,et al.  Bayesian network based extreme learning machine for subjectivity detection , 2017, J. Frankl. Inst..

[20]  Rada Mihalcea,et al.  Learning Multilingual Subjective Language via Cross-Lingual Projections , 2007, ACL.

[21]  Robert M. Entman,et al.  Framing: Toward Clarification of a Fractured Paradigm , 1993 .

[22]  R. Entman Framing Bias: Media in the Distribution of Power , 2007 .

[23]  Rada Mihalcea,et al.  Porting Multilingual Subjectivity Resources across Languages , 2013, IEEE Transactions on Affective Computing.

[24]  Evelin Amorim,et al.  Automated Essay Scoring in the Presence of Biased Ratings , 2018, NAACL.

[25]  Introducing subjectivities in language variation and change , 2005 .

[26]  Noah A. Smith,et al.  Analyzing Framing through the Casts of Characters in the News , 2016, EMNLP.

[27]  K. Hawkins Responding to radical populism: Chavismo in Venezuela , 2016 .

[28]  Andreas Hotho,et al.  Media Bias in German Online Newspapers , 2015, HT.

[29]  Daniel Jurafsky,et al.  Linguistic Models for Analyzing and Detecting Biased Language , 2013, ACL.

[30]  P. Mundim O viés da cobertura política da imprensa nas eleições presidenciais brasileiras de 2002, 2006 e 2010 , 2018 .

[31]  R. Nickerson Confirmation Bias: A Ubiquitous Phenomenon in Many Guises , 1998 .

[32]  Arie Verhagen,et al.  Constructions of intersubjectivity , 2005 .

[33]  W. Schramm,et al.  Four Theories of the Press: The Authoritarian, Libertarian, Social Responsibility, and Soviet Communist Concepts of What the Press Should Be and Do , 1963 .

[34]  Mounia Lalmas,et al.  Social media news communities: gatekeeping, coverage, and statement bias , 2013, CIKM.