暂无分享,去创建一个
Xiang Ren | Aida Mostafazadeh Davani | Brendan Kennedy | Morteza Dehghani | Mohammad Atari | Ali Omrani | Xiang Ren | Morteza Dehghani | Brendan Kennedy | M. Atari | Ali Omrani
[1] Kristina Lerman,et al. A Survey on Bias and Fairness in Machine Learning , 2019, ACM Comput. Surv..
[2] Jieyu Zhao,et al. Gender Bias in Coreference Resolution: Evaluation and Debiasing Methods , 2018, NAACL.
[3] Ona de Gibert,et al. Hate Speech Dataset from a White Supremacy Forum , 2018, ALW.
[4] Nathan Srebro,et al. Equality of Opportunity in Supervised Learning , 2016, NIPS.
[5] Matt J. Kusner,et al. Counterfactual Fairness , 2017, NIPS.
[6] Amy J. C. Cuddy,et al. A model of (often mixed) stereotype content: competence and warmth respectively follow from perceived status and competition. , 2002, Journal of personality and social psychology.
[7] Frederick Liu,et al. Incorporating Priors with Feature Attribution on Text Classification , 2019, ACL.
[8] Yoav Goldberg,et al. Adversarial Removal of Demographic Attributes from Text Data , 2018, EMNLP.
[9] Pascale Fung,et al. Reducing Gender Bias in Abusive Language Detection , 2018, EMNLP.
[10] Lucy Vasserman,et al. Measuring and Mitigating Unintended Bias in Text Classification , 2018, AIES.
[11] Ankur Taly,et al. Counterfactual Fairness in Text Classification through Robustness , 2018, AIES.
[12] Steven Bird,et al. NLTK: The Natural Language Toolkit , 2002, ACL 2006.
[13] Steven Bird,et al. NLTK: The Natural Language Toolkit , 2002, ACL.
[14] Blake Lemoine,et al. Mitigating Unwanted Biases with Adversarial Learning , 2018, AIES.
[15] Ilya Sutskever,et al. Language Models are Unsupervised Multitask Learners , 2019 .
[16] Julia Hirschberg,et al. Detecting Hate Speech on the World Wide Web , 2012 .
[17] Kuldip K. Paliwal,et al. Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..
[18] Harini Kannan,et al. Adversarial Logit Pairing , 2018, NIPS 2018.
[19] Xiang Ren,et al. Contextualizing Hate Speech Classifiers with Post-hoc Explanation , 2020, ACL.
[20] Michael Wiegand,et al. Detection of Abusive Language: the Problem of Biased Datasets , 2019, NAACL.
[21] Toniann Pitassi,et al. Learning Adversarially Fair and Transferable Representations , 2018, ICML.
[22] Siva Reddy,et al. StereoSet: Measuring stereotypical bias in pretrained language models , 2020, ACL.