论文信息 - Word Embeddings (Also) Encode Human Personality Stereotypes - 字舞流文

Word Embeddings (Also) Encode Human Personality Stereotypes

Word representations trained on text reproduce human implicit bias related to gender, race and age. Methods have been developed to remove such bias. Here, we present results that show that human stereotypes exist even for much more nuanced judgments such as personality, for a variety of person identities beyond the typically legally protected attributes and that these are similarly captured in word representations. Specifically, we collected human judgments about a person’s Big Five personality traits formed solely from information about the occupation, nationality or a common noun description of a hypothetical person. Analysis of the data reveals a large number of statistically significant stereotypes in people. We then demonstrate the bias captured in lexical representations is statistically significantly correlated with the documented human bias. Our results, showing bias for a large set of person descriptors for such nuanced traits put in doubt the feasibility of broadly and fairly applying debiasing methods and call for the development of new methods for auditing language technology systems and resources.

Norman I. Badler | Ani Nenkova | Oshin Agarwal | Funda Durupinar | N. Badler | Funda Durupinar | A. Nenkova | Oshin Agarwal

[1] Roy Schwartz,et al. Symmetric Pattern Based Word Embeddings for Improved Word Similarity Prediction , 2015, CoNLL.

[2] Jieyu Zhao,et al. Gender Bias in Coreference Resolution: Evaluation and Debiasing Methods , 2018, NAACL.

[3] Fabio Pianesi,et al. Workshop on Computational Personality Recognition: Shared Task , 2013, Proceedings of the International AAAI Conference on Web and Social Media.

[4] Yoav Goldberg,et al. Lipstick on a Pig: Debiasing Methods Cover up Systematic Gender Biases in Word Embeddings But do not Remove Them , 2019, NAACL-HLT.

[5] D. Kahneman. Thinking, Fast and Slow , 2011 .

[6] L. R. Goldberg. THE DEVELOPMENT OF MARKERS FOR THE BIG-FIVE FACTOR STRUCTURE , 1992 .

[7] T. Graepel,et al. Private traits and attributes are predictable from digital records of human behavior , 2013, Proceedings of the National Academy of Sciences.

[8] Adam Tauman Kalai,et al. Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings , 2016, NIPS.

[9] Stereotype threat , 2020, The Lancet.

[10] Sudeep Bhatia,et al. Vector Space Semantic Models Predict Subjective Probability Judgments for Real-World Events , 2016, CogSci.

[11] S. Srivastava,et al. The Big Five Trait taxonomy: History, measurement, and theoretical perspectives. , 1999 .

[12] Erik Cambria,et al. AFFECTIVE COMPUTI G AND SENTIMENT ANALYSIS Deep Learning-Based Document Modeling for Personality Detection from Text , 2017 .

[13] Heng Ji,et al. Gender and Animacy Knowledge Discovery from Web-Scale N-Grams for Unsupervised Person Mention Detection , 2009, PACLIC.

[14] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[15] C. Barbaranelli,et al. National Character Does Not Reflect Mean Personality Trait Levels in 49 Cultures , 2005, Science.

[16] Dekang Lin,et al. Bootstrapping Path-Based Pronoun Resolution , 2006, ACL.

[17] S. Gosling,et al. A very brief measure of the Big-Five personality domains , 2003 .

[18] Arvind Narayanan,et al. Semantics derived automatically from language corpora contain human-like biases , 2016, Science.

[19] Jieyu Zhao,et al. Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints , 2017, EMNLP.

[20] Philip Resnik,et al. Using Topic Modeling to Improve Prediction of Neuroticism and Depression in College Students , 2013, EMNLP.

[21] A. Greenwald,et al. Measuring individual differences in implicit cognition: the implicit association test. , 1998, Journal of personality and social psychology.

[22] Marilyn A. Walker,et al. PERSONAGE: Personality Generation for Dialogue , 2007, ACL.

[23] Melissa C. Thomas-Hunt,et al. Condoning stereotyping? How awareness of stereotyping prevalence impacts expression of stereotypes. , 2015, The Journal of applied psychology.

[24] Rachel Rudinger,et al. Gender Bias in Coreference Resolution , 2018, NAACL.

[25] S. Spencer,et al. Stereotype Threat. , 2016, Annual review of psychology.