论文信息 - Neural Based Statement Classification for Biased Language - 字舞流文

Neural Based Statement Classification for Biased Language

Biased language commonly occurs around topics which are of controversial nature, thus, stirring disagreement between the different involved parties of a discussion. This is due to the fact that for language and its use, specifically, the understanding and use of phrases, the stances are cohesive within the particular groups. However, such cohesiveness does not hold across groups. In collaborative environments or environments where impartial language is desired (e.g. Wikipedia, news media), statements and the language therein should represent equally the involved parties and be neutrally phrased. Biased language is introduced through the presence of inflammatory words or phrases, or statements that may be incorrect or one-sided, thus violating such consensus. In this work, we focus on the specific case of phrasing bias, which may be introduced through specific inflammatory words or phrases in a statement. For this purpose, we propose an approach that relies on a recurrent neural networks in order to capture the inter-dependencies between words in a phrase that introduced bias. We perform a thorough experimental evaluation, where we show the advantages of a neural based approach over competitors that rely on word lexicons and other hand-crafted features in detecting biased language. We are able to distinguish biased statements with a precision of P=0.917, thus significantly outperforming baseline models with an improvement of over 30%. Finally, we release the largest corpus of statements annotated for biased language.

Besnik Fetahu | Christoph Hube | B. Fetahu | C. Hube

[1] Susan C. Herring,et al. Cultural bias in Wikipedia content on famous persons , 2011, J. Assoc. Inf. Sci. Technol..

[2] David Lazer,et al. A Frame of Mind: Using Statistical Models for Detection of Framing and Agenda Setting Campaigns , 2015, ACL.

[3] Karl Aberer,et al. Selection Bias in News Coverage: Learning it, Fighting it , 2018, WWW.

[4] Douglas Biber,et al. Variation across speech and writing: Methodology , 1988 .

[5] Aristides Gionis,et al. Joint Non-negative Matrix Factorization for Learning Ideological Leaning on Twi er , 2017 .

[6] Diyi Yang,et al. Hierarchical Attention Networks for Document Classification , 2016, NAACL.

[7] Elisha Elovic,et al. Testing and Comparing Computational Approaches for Identifying the Language of Framing in Political News , 2015, NAACL.

[8] David García,et al. It's a Man's Wikipedia? Assessing Gender Inequality in an Online Encyclopedia , 2015, ICWSM.

[10] Noah A. Smith,et al. Shedding (a Thousand Points of) Light on Biased Language , 2010, Mturk@HLT-NAACL.

[11] Philip Resnik,et al. Political Ideology Detection Using Recursive Neural Networks , 2014, ACL.

[12] Avishek Anand,et al. How much is Wikipedia Lagging Behind News? , 2015, WebSci.

[13] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[14] Christopher D. Manning,et al. Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.

[15] James W. Pennebaker,et al. Linguistic Inquiry and Word Count (LIWC2007) , 2007 .

[16] S. Greenstein,et al. Is Wikipedia Biased , 2012 .

[17] Eunsol Choi,et al. Truth of Varying Shades: Analyzing Language in Fake News and Political Fact-Checking , 2017, EMNLP.

[18] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[19] Suzanne Romaine,et al. Language in Society: An Introduction to Sociolinguistics , 1997 .

[20] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[21] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[22] Wolfgang Nejdl,et al. Finding News Citations for Wikipedia , 2016, CIKM.

[23] Ralf D. Brown,et al. THE PRONOUNS OF POWER AND SOLIDARITY , 1968 .

[24] Brian Martin,et al. Persistent Bias on Wikipedia , 2018 .

[25] Daniel Jurafsky,et al. Linguistic Models for Analyzing and Detecting Biased Language , 2013, ACL.

[26] Besnik Fetahu,et al. Detecting Biased Statements in Wikipedia , 2018, WWW.