How Fair Can We Go: Detecting the Boundaries of Fairness Optimization in Information Retrieval

The presence of bias in today's IR systems has raised concerns on the social responsibilities of IR. Fairness has become an increasingly important factor when building systems for information searching and content recommendations. Fairness in IR is often considered as an optimization problem where the system aims to optimize the utility, subject to a set of fairness constraints, or optimize fairness while guaranteeing a lower bound on the utility, or jointly optimize for both utility and fairness to achieve an overall satisfaction. While various optimization algorithms have been proposed along with theoretical analysis, in real world applications, the performance of different optimization algorithms often heavily depend on the data. Therefore, it is consequential to ask what is the solution space characterized by the data, what effect does introducing fairness bring to the system, and can we identify this solution space to help us trade-off different optimization policies and guide us to pick suitable algorithms and/or make adjustments on data? In this work, we propose a framework that offers a novel perspective into the optimization with fairness constraints problems. Our framework can effectively and efficiently estimate the solution space and answer such questions. It also has the advantage of simplicity, explainability, and reliability. Specifically, we derive theoretical expressions to identify the fairness and relevance bounds for data of different distributions, and apply them to both synthetic and real world datasets. We present a series of use cases to demonstrate how our framework is applied to facilitate various analyses and decision making.

[1]  Ivar Bråten,et al.  Trust and mistrust when students read multiple information sources about climate change , 2011 .

[2]  Christo Wilson,et al.  Investigating the Impact of Gender on Rank in Resume Search Engines , 2018, CHI.

[3]  Hany Farid,et al.  The accuracy, fairness, and limits of predicting recidivism , 2018, Science Advances.

[4]  Nisheeth K. Vishnoi,et al.  An Algorithmic Framework to Control Bias in Bandit-based Personalization , 2018, ArXiv.

[5]  Fernando Diaz,et al.  Towards a Fair Marketplace: Counterfactual Evaluation of the trade-off between Relevance, Fairness & Satisfaction in Recommendation Systems , 2018, CIKM.

[6]  Ricardo Baeza-Yates,et al.  FA*IR: A Fair Top-k Ranking Algorithm , 2017, CIKM.

[7]  Aaron Roth,et al.  Fairness in Learning: Classic and Contextual Bandits , 2016, NIPS.

[8]  Ronald E. Robertson,et al.  The search engine manipulation effect (SEME) and its possible impact on the outcomes of elections , 2015, Proceedings of the National Academy of Sciences.

[9]  Krishna P. Gummadi,et al.  Equity of Attention: Amortizing Individual Fairness in Rankings , 2018, SIGIR.

[10]  Krishna P. Gummadi,et al.  Quantifying Search Bias: Investigating Sources of Bias for Political Searches in Social Media , 2017, CSCW.

[11]  Nisheeth K. Vishnoi,et al.  Ranking with Fairness Constraints , 2017, ICALP.

[12]  Alamir Novin,et al.  Making Sense of Conflicting Science Information: Exploring Bias in the Search Engine Result Page , 2017, CHIIR.

[13]  Gianluca Demartini,et al.  Investigating User Perception of Gender Bias in Image Search: The Role of Sexism , 2018, SIGIR.

[14]  Emre Kıcıman,et al.  Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries , 2018, Front. Big Data.

[15]  Aaron Roth,et al.  Meritocratic Fairness for Cross-Population Selection , 2017, ICML.

[16]  Fernando Diaz,et al.  Auditing Search Engines for Differential Satisfaction Across Demographics , 2017, WWW.

[17]  Julian Unkel,et al.  Ranking versus reputation: perception and effects of search result credibility , 2017, Behav. Inf. Technol..

[18]  Thorsten Joachims,et al.  Fairness of Exposure in Rankings , 2018, KDD.

[19]  James Grimmelmann Some Skepticism About Search Neutrality , 2011 .

[20]  Emine Yilmaz,et al.  Research Frontiers in Information Retrieval Report from the Third Strategic Workshop on Information Retrieval in Lorne (SWIRL 2018) , 2018 .

[21]  Abbe Mowshowitz,et al.  Bias on the web , 2002, CACM.

[22]  Yi Zhang,et al.  Bayesian graphical models for adaptive filtering , 2005, SIGF.

[23]  Krishna P. Gummadi,et al.  Analyzing Biases in Perception of Truth in News Stories and Their Implications for Fact Checking , 2018, IEEE Transactions on Computational Social Systems.

[24]  Yvonne Kammerer,et al.  Chapter 10 How Search Engine Users Evaluate and Select Web Search Results: The Impact of the Search Engine Interface on Credibility Assessments , 2012 .

[25]  Fernando Diaz,et al.  Research Frontiers in Information Retrieval: Report from the Third Strategic Workshop on Information Retrieval in Lorne (SWIRL 2018) , 2018, SIGF.

[26]  Milad Shokouhi,et al.  Anchoring and Adjustment in Relevance Estimation , 2015, SIGIR.

[27]  Andrew D. Selbst,et al.  Big Data's Disparate Impact , 2016 .

[28]  Paul D. Clough,et al.  Competent Men and Warm Women: Gender Stereotypes and Backlash in Image Search Results , 2017, CHI.

[29]  Sean A. Munson,et al.  Unequal Representation and Gender Stereotypes in Image Search Results for Occupations , 2015, CHI.