Darling or Babygirl ? Investigating Stylistic Bias in Sentiment Analysis

Sentiment analysis is increasingly used for a range of applications from customer service to opinion mining. Stylistic bias arises when text generated by different groups of people expressing the same underlying content receive disparate treatment. Using three lexical alignment techniques, we find that standard sentiment models produce significantly different sentiment scores for word pairs that mainly differ stylistically. We suggest a simple align and substitute method to automatically generate examples of potentially undesirable biases in black-box models in order to better facilitate identification and mitigation of differential treatment based on stylistic variation.

[1]  Guillaume Lample,et al.  Word Translation Without Parallel Data , 2017, ICLR.

[2]  Dirk Hovy,et al.  Learning a POS tagger for AAVE-like language , 2016, NAACL.

[3]  D. Lawton Social Class Differences in Language Development: A Study of Some Samples of Written Work , 1963 .

[4]  Phil Blunsom,et al.  A Convolutional Neural Network for Modelling Sentences , 2014, ACL.

[5]  Regina Barzilay,et al.  Style Transfer from Non-Parallel Text by Cross-Alignment , 2017, NIPS.

[6]  Eric Gilbert,et al.  VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text , 2014, ICWSM.

[7]  Jieyu Zhao,et al.  Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints , 2017, EMNLP.

[8]  Alexander M. Rush,et al.  Adversarially Regularized Autoencoders for Generating Discrete Structures , 2017, ArXiv.

[9]  Ilya Sutskever,et al.  Learning to Generate Reviews and Discovering Sentiment , 2017, ArXiv.

[10]  Dirk Hovy,et al.  Cross-lingual syntactic variation over age and gender , 2015, CoNLL.

[11]  Tomas Mikolov,et al.  Enriching Word Vectors with Subword Information , 2016, TACL.

[12]  Shlomo Argamon,et al.  Effects of Age and Gender on Blogging , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[13]  Adam Tauman Kalai,et al.  Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings , 2016, NIPS.

[14]  Brendan T. O'Connor,et al.  Demographic Dialectal Variation in Social Media: A Case Study of African-American English , 2016, EMNLP.

[15]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.