论文信息 - Evolution and impact of bias in human and machine learning algorithm interaction

Evolution and impact of bias in human and machine learning algorithm interaction

Traditionally, machine learning algorithms relied on reliable labels from experts to build predictions. More recently however, algorithms have been receiving data from the general population in the form of labeling, annotations, etc. The result is that algorithms are subject to bias that is born from ingesting unchecked information, such as biased samples and biased labels. Furthermore, people and algorithms are increasingly engaged in interactive processes wherein neither the human nor the algorithms receive unbiased data. Algorithms can also make biased predictions, leading to what is now known as algorithmic bias. On the other hand, human’s reaction to the output of machine learning methods with algorithmic bias worsen the situations by making decision based on biased information, which will probably be consumed by algorithms later. Some recent research has focused on the ethical and moral implication of machine learning algorithmic bias on society. However, most research has so far treated algorithmic bias as a static factor, which fails to capture the dynamic and iterative properties of bias. We argue that algorithmic bias interacts with humans in an iterative manner, which has a long-term effect on algorithms’ performance. For this purpose, we present an iterated-learning framework that is inspired from human language evolution to study the interaction between machine learning algorithms and humans. Our goal is to study two sources of bias that interact: the process by which people select information to label (human action); and the process by which an algorithm selects the subset of information to present to people (iterated algorithmic bias mode). We investigate three forms of iterated algorithmic bias (personalization filter, active learning, and random) and how they affect the performance of machine learning algorithms by formulating research questions about the impact of each type of bias. Based on statistical analyses of the results of several controlled experiments, we found that the three different iterated bias modes, as well as initial training data class imbalance and human action, do affect the models learned by machine learning algorithms. We also found that iterated filter bias, which is prominent in personalized user interfaces, can lead to more inequality in estimated relevance and to a limited human ability to discover relevant data. Our findings indicate that the relevance blind spot (items from the testing set whose predicted relevance probability is less than 0.5 and who thus risk being hidden from humans) amounted to 4% of all relevant items when using a content-based filter that predicts relevant items. A similar simulation using a real-life rating data set found that the same filter resulted in a blind spot size of 75% of the relevant testing set.

[1] Patrick Shafto,et al. Reasoning in teaching and misleading situations , 2011, CogSci.

[2] Filip Radlinski,et al. Evaluating the accuracy of implicit feedback from clicks and query reformulations in Web search , 2007, TOIS.

[3] John C. S. Lui,et al. Modeling the Assimilation-Contrast Effects in Online Product Rating Systems: Debiasing and Recommendations , 2017, ACM Conference on Recommender Systems.

[4] Doina Caragea,et al. Exploring high-dimensional classification boundaries , 2005 .

[5] Olfa Nasraoui,et al. A cross-modal warm-up solution for the cold-start problem in collaborative filtering recommender systems , 2014, WebSci '14.

[6] Rodney X. Sturdivant,et al. Applied Logistic Regression: Hosmer/Applied Logistic Regression , 2005 .

[7] Ryen W. White. Beliefs and biases in web search , 2013, SIGIR.

[8] Patrick Shafto,et al. Technological Workforce and Its Impact on Algorithmic Justice in Politics , 2019, Customer Needs and Solutions.

[9] Ruslan Salakhutdinov,et al. Probabilistic Matrix Factorization , 2007, NIPS.

[10] Yoav Shoham,et al. Fab: content-based, collaborative recommendation , 1997, CACM.

[11] Karthik Ramani,et al. Deconvolving Feedback Loops in Recommender Systems , 2016, NIPS.

[12] Catherine Tucker,et al. Algorithmic bias? An empirical study into apparent gender-based discrimination in the display of STEM career ads , 2019 .

[13] David A. Cohn,et al. Active Learning with Statistical Models , 1996, NIPS.

[14] Sophie Ahrens,et al. Recommender Systems , 2012 .

[15] Vittorio Castelli,et al. On the exponential value of labeled samples , 1995, Pattern Recognit. Lett..

[16] Edward A. Fox,et al. Research Contributions , 2014 .

[17] Simon Kirby,et al. Iterated Learning: A Framework for the Emergence of Language , 2003, Artificial Life.

[18] Mohamed Jemni,et al. Automatic Personalization in E-Learning Based on Recommendation Systems: An Overview , 2012 .

[19] Olfa Nasraoui,et al. Complete This Puzzle: A Connectionist Approach to Accurate Web Recommendations Based on a Committee of Predictors , 2004, WebKDD.

[20] Michael J. Pazzani,et al. Content-Based Recommendation Systems , 2007, The Adaptive Web.

[21] A. Gopnik,et al. Children’s imitation of causal action sequences is influenced by statistical and pedagogical evidence , 2011, Cognition.

[22] James Allan,et al. Automatic Query Expansion Using SMART: TREC 3 , 1994, TREC.

[23] Sean M. McNee,et al. Improving recommendation lists through topic diversification , 2005, WWW '05.

[24] Corinna Cortes,et al. Support-Vector Networks , 1995, Machine Learning.

[25] Mohamed Jemni,et al. Automatic Recommendations for E-Learning Personalization Based on Web Usage Mining Techniques and Information Retrieval , 2008, 2008 Eighth IEEE International Conference on Advanced Learning Technologies.

[26] Noah D. Goodman,et al. Teaching Games : Statistical Sampling Assumptions for Learning in Pedagogical Situations , 2008 .

[27] Simon Kirby,et al. Cumulative cultural evolution in the laboratory: An experimental approach to the origins of structure in human language , 2008, Proceedings of the National Academy of Sciences.

[28] Pattie Maes,et al. Evolving agents for personalized information filtering , 1993, Proceedings of 9th IEEE Conference on Artificial Intelligence for Applications.

[29] Dietmar Jannach,et al. Biases in Automated Music Playlist Generation: A Comparison of Next-Track Recommending Techniques , 2016, UMAP.

[30] Bianca Zadrozny,et al. Learning and evaluating classifiers under sample selection bias , 2004, ICML.

[31] C. J. van Rijsbergen,et al. Probabilistic models of information retrieval based on measuring the divergence from randomness , 2002, TOIS.

[32] David Danks,et al. Algorithmic Bias in Autonomous Systems , 2017, IJCAI.

[33] T. Griffiths,et al. Iterated learning: Intergenerational knowledge transmission reveals inductive biases , 2007, Psychonomic bulletin & review.

[34] Gediminas Adomavicius,et al. Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[35] Pattie Maes,et al. Agents that reduce work and information overload , 1994, CACM.

[36] Mariarosaria Taddeo,et al. Recommender systems and their ethical challenges , 2020, AI & SOCIETY.

[37] A. M. Madni,et al. Recommender systems in e-commerce , 2014, 2014 World Automation Congress (WAC).

[38] Karen Spärck Jones. Some thoughts on classification for retrieval , 1970, J. Documentation.

[39] Barbara E. Engelhardt,et al. How algorithmic confounding in recommendation systems increases homogeneity and decreases utility , 2017, RecSys.

[40] Olfa Nasraoui,et al. Human-Recommender Systems: From Benchmark Data to Benchmark Cognitive Models , 2016, RecSys.