BiasHeal: On-the-Fly Black-Box Healing of Bias in Sentiment Analysis Systems

Although Sentiment Analysis (SA) is widely applied in many domains, existing research has revealed that the unfairness in SA systems can be harmful to the welfare of less privileged people. Several works propose pre-processing and in-processing methods to eliminate bias in SA systems, but little attention is paid to utilizing post-processing methods to heal bias. Postprocessing methods are particularly important for systems that use third-party SA services. Systems that use such services have no access to the SA engine or its training data and thus cannot apply pre-processing nor in-processing methods. Therefore, this paper proposes a black-box post-processing method to make an SA system heal bias and construct fair results when bias is detected. We propose and investigate six self-healing strategies. Our evaluation results on two datasets show that the best strategy can construct fair results and improve accuracy on the two datasets by 2.76% and 2.85%, respectively. To the best of our knowledge, our work is the first self-healing method that can be deployed to ensure SA fairness without requiring access to the SA engine or its training data.

[1]  Sameer Singh,et al.  Beyond Accuracy: Behavioral Testing of NLP Models with CheckList , 2020, ACL.

[2]  Ferdian Thung,et al.  BiasFinder: Metamorphic Test Generation to Uncover Bias for Sentiment Analysis Systems , 2021, IEEE Transactions on Software Engineering.

[3]  Bo Pang,et al.  Seeing Stars: Exploiting Class Relationships for Sentiment Categorization with Respect to Rating Scales , 2005, ACL.

[4]  Rachel K. E. Bellamy,et al.  AI Fairness 360: An Extensible Toolkit for Detecting, Understanding, and Mitigating Unwanted Algorithmic Bias , 2018, ArXiv.

[5]  Tim Menzies,et al.  Fairway: a way to build fair ML software , 2020, ESEC/SIGSOFT FSE.

[6]  M. Cannarsa Ethics Guidelines for Trustworthy AI , 2021, The Cambridge Handbook of Lawyering in the Digital Age.

[7]  Blake Lemoine,et al.  Mitigating Unwanted Biases with Adversarial Learning , 2018, AIES.

[8]  Mark Harman,et al.  "Ignorance and Prejudice" in Software Fairness , 2021, 2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE).

[9]  Christopher Potts,et al.  Learning Word Vectors for Sentiment Analysis , 2011, ACL.

[10]  Jun Sakuma,et al.  Fairness-Aware Classifier with Prejudice Remover Regularizer , 2012, ECML/PKDD.

[11]  Tony Russell-Rose,et al.  Extracting sentiment from healthcare survey data: An evaluation of sentiment analysis tools , 2015, 2015 Science and Information Conference (SAI).

[12]  Jin Song Dong,et al.  White-box Fairness Testing through Adversarial Sampling , 2020, 2020 IEEE/ACM 42nd International Conference on Software Engineering (ICSE).

[13]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[14]  Ting Zhang,et al.  Sentiment Analysis for Software Engineering: How Far Can Pre-trained Transformer Models Go? , 2020, 2020 IEEE International Conference on Software Maintenance and Evolution (ICSME).

[15]  Yuriy Brun,et al.  Themis: automatically testing software for discrimination , 2018, ESEC/SIGSOFT FSE.

[16]  David Lo,et al.  BiasRV: uncovering biased sentiment predictions at runtime , 2021, ESEC/SIGSOFT FSE.

[17]  Saif Mohammad,et al.  Examining Gender and Race Bias in Two Hundred Sentiment Analysis Systems , 2018, *SEMEVAL.