Perturbation Sensitivity Analysis to Detect Unintended Model Biases
暂无分享,去创建一个
Margaret Mitchell | Ben Hutchinson | Vinodkumar Prabhakaran | B. Hutchinson | Vinodkumar Prabhakaran | Margaret Mitchell
[1] Daniel Jurafsky,et al. Understanding Neural Networks through Representation Erasure , 2016, ArXiv.
[2] Alexandra Chouldechova,et al. The Frontiers of Fairness in Machine Learning , 2018, ArXiv.
[3] Carlos Eduardo Scheidegger,et al. Certifying and Removing Disparate Impact , 2014, KDD.
[4] Saif Mohammad,et al. Examining Gender and Race Bias in Two Hundred Sentiment Analysis Systems , 2018, *SEMEVAL.
[5] Jason Baldridge,et al. Mind the GAP: A Balanced Corpus of Gendered Ambiguous Pronouns , 2018, TACL.
[6] Adam Tauman Kalai,et al. Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings , 2016, NIPS.
[7] Yulia Tsvetkov,et al. RtGender: A Corpus for Studying Differential Responses to Gender , 2018, LREC.
[8] Toniann Pitassi,et al. Fairness through awareness , 2011, ITCS '12.
[9] Anne Marie Piper,et al. Addressing Age-Related Bias in Sentiment Analysis , 2018, CHI.
[10] Solon Barocas,et al. Prediction-Based Decisions and Fairness: A Catalogue of Choices, Assumptions, and Definitions , 2018, 1811.07867.
[11] Arvind Narayanan,et al. Semantics derived automatically from language corpora contain human-like biases , 2016, Science.
[12] Chandler May,et al. On Measuring Social Biases in Sentence Encoders , 2019, NAACL.
[13] Alan W Black,et al. Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings , 2019, NAACL.
[14] Ankur Taly,et al. Counterfactual Fairness in Text Classification through Robustness , 2018, AIES.
[15] Lucy Vasserman,et al. Nuanced Metrics for Measuring Unintended Bias with Real Data for Text Classification , 2019, WWW.
[16] Danah Boyd,et al. Fairness and Abstraction in Sociotechnical Systems , 2019, FAT.
[17] Alexandra Chouldechova,et al. Bias in Bios: A Case Study of Semantic Representation Bias in a High-Stakes Setting , 2019, FAT.
[18] Brendan T. O'Connor,et al. Racial Disparity in Natural Language Processing: A Case Study of Social Media African-American English , 2017, ArXiv.
[19] Lucy Vasserman,et al. Measuring and Mitigating Unintended Bias in Text Classification , 2018, AIES.
[20] Yoav Goldberg,et al. Lipstick on a Pig: Debiasing Methods Cover up Systematic Gender Biases in Word Embeddings But do not Remove Them , 2019, NAACL-HLT.