Evolution and impact of bias in human and machine learning algorithm interaction

Traditionally, machine learning algorithms relied on reliable labels from experts to build predictions. More recently however, algorithms have been receiving data from the general population in the form of labeling, annotations, etc. The result is that algorithms are subject to bias that is born from ingesting unchecked information, such as biased samples and biased labels. Furthermore, people and algorithms are increasingly engaged in interactive processes wherein neither the human nor the algorithms receive unbiased data. Algorithms can also make biased predictions, leading to what is now known as algorithmic bias. On the other hand, human’s reaction to the output of machine learning methods with algorithmic bias worsen the situations by making decision based on biased information, which will probably be consumed by algorithms later. Some recent research has focused on the ethical and moral implication of machine learning algorithmic bias on society. However, most research has so far treated algorithmic bias as a static factor, which fails to capture the dynamic and iterative properties of bias. We argue that algorithmic bias interacts with humans in an iterative manner, which has a long-term effect on algorithms’ performance. For this purpose, we present an iterated-learning framework that is inspired from human language evolution to study the interaction between machine learning algorithms and humans. Our goal is to study two sources of bias that interact: the process by which people select information to label (human action); and the process by which an algorithm selects the subset of information to present to people (iterated algorithmic bias mode). We investigate three forms of iterated algorithmic bias (personalization filter, active learning, and random) and how they affect the performance of machine learning algorithms by formulating research questions about the impact of each type of bias. Based on statistical analyses of the results of several controlled experiments, we found that the three different iterated bias modes, as well as initial training data class imbalance and human action, do affect the models learned by machine learning algorithms. We also found that iterated filter bias, which is prominent in personalized user interfaces, can lead to more inequality in estimated relevance and to a limited human ability to discover relevant data. Our findings indicate that the relevance blind spot (items from the testing set whose predicted relevance probability is less than 0.5 and who thus risk being hidden from humans) amounted to 4% of all relevant items when using a content-based filter that predicts relevant items. A similar simulation using a real-life rating data set found that the same filter resulted in a blind spot size of 75% of the relevant testing set.

[1]  Patrick Shafto,et al.  Reasoning in teaching and misleading situations , 2011, CogSci.

[2]  Filip Radlinski,et al.  Evaluating the accuracy of implicit feedback from clicks and query reformulations in Web search , 2007, TOIS.

[3]  John C. S. Lui,et al.  Modeling the Assimilation-Contrast Effects in Online Product Rating Systems: Debiasing and Recommendations , 2017, ACM Conference on Recommender Systems.

[4]  Doina Caragea,et al.  Exploring high-dimensional classification boundaries , 2005 .

[5]  Olfa Nasraoui,et al.  A cross-modal warm-up solution for the cold-start problem in collaborative filtering recommender systems , 2014, WebSci '14.

[6]  Rodney X. Sturdivant,et al.  Applied Logistic Regression: Hosmer/Applied Logistic Regression , 2005 .

[7]  Ryen W. White Beliefs and biases in web search , 2013, SIGIR.

[8]  Patrick Shafto,et al.  Technological Workforce and Its Impact on Algorithmic Justice in Politics , 2019, Customer Needs and Solutions.

[9]  Ruslan Salakhutdinov,et al.  Probabilistic Matrix Factorization , 2007, NIPS.

[10]  Yoav Shoham,et al.  Fab: content-based, collaborative recommendation , 1997, CACM.

[11]  Karthik Ramani,et al.  Deconvolving Feedback Loops in Recommender Systems , 2016, NIPS.

[12]  Catherine Tucker,et al.  Algorithmic bias? An empirical study into apparent gender-based discrimination in the display of STEM career ads , 2019 .

[13]  David A. Cohn,et al.  Active Learning with Statistical Models , 1996, NIPS.

[14]  Sophie Ahrens,et al.  Recommender Systems , 2012 .

[15]  Vittorio Castelli,et al.  On the exponential value of labeled samples , 1995, Pattern Recognit. Lett..

[16]  Edward A. Fox,et al.  Research Contributions , 2014 .

[17]  Simon Kirby,et al.  Iterated Learning: A Framework for the Emergence of Language , 2003, Artificial Life.

[18]  Mohamed Jemni,et al.  Automatic Personalization in E-Learning Based on Recommendation Systems: An Overview , 2012 .

[19]  Olfa Nasraoui,et al.  Complete This Puzzle: A Connectionist Approach to Accurate Web Recommendations Based on a Committee of Predictors , 2004, WebKDD.

[20]  Michael J. Pazzani,et al.  Content-Based Recommendation Systems , 2007, The Adaptive Web.

[21]  A. Gopnik,et al.  Children’s imitation of causal action sequences is influenced by statistical and pedagogical evidence , 2011, Cognition.

[22]  James Allan,et al.  Automatic Query Expansion Using SMART: TREC 3 , 1994, TREC.

[23]  Sean M. McNee,et al.  Improving recommendation lists through topic diversification , 2005, WWW '05.

[24]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[25]  Mohamed Jemni,et al.  Automatic Recommendations for E-Learning Personalization Based on Web Usage Mining Techniques and Information Retrieval , 2008, 2008 Eighth IEEE International Conference on Advanced Learning Technologies.

[26]  Noah D. Goodman,et al.  Teaching Games : Statistical Sampling Assumptions for Learning in Pedagogical Situations , 2008 .

[27]  Simon Kirby,et al.  Cumulative cultural evolution in the laboratory: An experimental approach to the origins of structure in human language , 2008, Proceedings of the National Academy of Sciences.

[28]  Pattie Maes,et al.  Evolving agents for personalized information filtering , 1993, Proceedings of 9th IEEE Conference on Artificial Intelligence for Applications.

[29]  Dietmar Jannach,et al.  Biases in Automated Music Playlist Generation: A Comparison of Next-Track Recommending Techniques , 2016, UMAP.

[30]  Bianca Zadrozny,et al.  Learning and evaluating classifiers under sample selection bias , 2004, ICML.

[31]  C. J. van Rijsbergen,et al.  Probabilistic models of information retrieval based on measuring the divergence from randomness , 2002, TOIS.

[32]  David Danks,et al.  Algorithmic Bias in Autonomous Systems , 2017, IJCAI.

[33]  T. Griffiths,et al.  Iterated learning: Intergenerational knowledge transmission reveals inductive biases , 2007, Psychonomic bulletin & review.

[34]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[35]  Pattie Maes,et al.  Agents that reduce work and information overload , 1994, CACM.

[36]  Mariarosaria Taddeo,et al.  Recommender systems and their ethical challenges , 2020, AI & SOCIETY.

[37]  A. M. Madni,et al.  Recommender systems in e-commerce , 2014, 2014 World Automation Congress (WAC).

[38]  Karen Spärck Jones Some thoughts on classification for retrieval , 1970, J. Documentation.

[39]  Barbara E. Engelhardt,et al.  How algorithmic confounding in recommendation systems increases homogeneity and decreases utility , 2017, RecSys.

[40]  Olfa Nasraoui,et al.  Human-Recommender Systems: From Benchmark Data to Benchmark Cognitive Models , 2016, RecSys.

[41]  David W. Hosmer,et al.  Applied Logistic Regression , 1991 .

[42]  M. Graffar [Modern epidemiology]. , 1971, Bruxelles medical.

[43]  P. Lachenbruch Statistical Power Analysis for the Behavioral Sciences (2nd ed.) , 1989 .

[44]  R. Nosofsky American Psychological Association, Inc. Choice, Similarity, and the Context Theory of Classification , 2022 .

[45]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[46]  Thomas L. Griffiths,et al.  A Bayesian View of Language Evolution by Iterated Learning - eScholarship , 2005 .

[47]  J. Klayman,et al.  Confirmation, Disconfirmation, and Informa-tion in Hypothesis Testing , 1987 .

[48]  F. Maxwell Harper,et al.  The MovieLens Datasets: History and Context , 2016, TIIS.

[49]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[50]  Abbe Mowshowitz,et al.  Bias on the web , 2002, CACM.

[51]  Sahibsingh A. Dudani The Distance-Weighted k-Nearest-Neighbor Rule , 1976, IEEE Transactions on Systems, Man, and Cybernetics.

[52]  Simon Kirby,et al.  Innateness and culture in the evolution of language , 2006, Proceedings of the National Academy of Sciences.

[53]  Olfa Nasraoui,et al.  PrCP: Pre-recommendation Counter-Polarization. , 2018 .

[54]  Patrick Shafto,et al.  Unifying pedagogical reasoning and epistemic trust. , 2012, Advances in child development and behavior.

[55]  John Langford,et al.  Cost-sensitive learning by cost-proportionate example weighting , 2003, Third IEEE International Conference on Data Mining.

[56]  T. Griffiths,et al.  Iterated learning and the cultural ratchet , 2009 .

[57]  Zhiting Hu,et al.  Dynamic User Modeling in Social Media Systems , 2015, TOIS.

[58]  Christopher C. Yang Search Engines Information Retrieval in Practice , 2010, J. Assoc. Inf. Sci. Technol..

[59]  Jacob Cohen Statistical Power Analysis for the Behavioral Sciences , 1969, The SAGE Encyclopedia of Research Design.

[60]  Krishna P. Gummadi,et al.  Incremental Fairness in Two-Sided Market Platforms: On Smoothly Updating Recommendations , 2019, AAAI 2020.

[61]  Adam Tauman Kalai,et al.  Decoupled Classifiers for Group-Fair and Efficient Machine Learning , 2017, FAT.

[62]  Jiadong Ren,et al.  Social recommendation model based on user interaction in complex social networks , 2019, PloS one.

[63]  Emre Kıcıman,et al.  Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries , 2018, Front. Big Data.

[64]  Patrick Shafto,et al.  Learning to trust and trusting to learn: a theoretical framework , 2015, Trends in Cognitive Sciences.

[65]  Beata Beigman Klebanov,et al.  Learning with Annotation Noise , 2009, ACL.

[66]  Mark Crovella,et al.  Closed-Loop Opinion Formation , 2017, WebSci.

[67]  H. B. Mann,et al.  On a Test of Whether one of Two Random Variables is Stochastically Larger than the Other , 1947 .

[68]  A. Tversky Elimination by aspects: A theory of choice. , 1972 .

[69]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[70]  Baxter S. Eaves,et al.  Epistemic trust: modeling children's reasoning about others' knowledge and intent. , 2012, Developmental science.

[71]  Suresh Venkatasubramanian,et al.  A comparative study of fairness-enhancing interventions in machine learning , 2018, FAT.

[72]  Burr Settles,et al.  Active Learning Literature Survey , 2009 .

[73]  Olfa Nasraoui,et al.  Detecting polarization in ratings: An automated pipeline and a preliminary quantification on several benchmark data sets , 2017, 2017 IEEE International Conference on Big Data (Big Data).

[74]  Bradley N. Miller,et al.  GroupLens: applying collaborative filtering to Usenet news , 1997, CACM.

[75]  Xing Xie,et al.  GeoMF++: Scalable Location Recommendation via Joint Geographical Modeling and Matrix Factorization , 2018, TOIS.

[76]  Charles Elkan,et al.  The Foundations of Cost-Sensitive Learning , 2001, IJCAI.

[77]  Olfa Nasraoui,et al.  Iterated Algorithmic Bias in the Interactive Machine Learning Process of Information Filtering , 2018, KDIR.

[78]  Douglas L. Medin,et al.  Context theory of classification learning. , 1978 .

[79]  Kristina Lerman,et al.  Leveraging Position Bias to Improve Peer Recommendation , 2014, PloS one.

[80]  Noah D. Goodman,et al.  The double-edged sword of pedagogy: Instruction limits spontaneous exploration and discovery , 2011, Cognition.

[81]  Michael R. Lyu,et al.  Improving Recommender Systems by Incorporating Social Contextual Information , 2011, TOIS.

[82]  J. J. Rocchio,et al.  Relevance feedback in information retrieval , 1971 .

[83]  Ricardo Baeza-Yates,et al.  Data and algorithmic bias in the web , 2016, WebSci.

[84]  Noah D. Goodman,et al.  A rational account of pedagogical reasoning: Teaching by, and learning from, examples , 2014, Cognitive Psychology.

[85]  Douglas S. McNair,et al.  Preventing Disparities: Bayesian and Frequentist Methods for Assessing Fairness in Machine-Learning Decision-Support Models , 2018 .

[86]  Fan Meng,et al.  Recommendation Algorithm based on Link Prediction and Domain Knowledge in Retail Transactions , 2014, ITQM.

[87]  Amy Perfors,et al.  Language Evolution Can Be Shaped by the Structure of the World , 2014, Cogn. Sci..

[88]  Franco Turini,et al.  Discrimination-aware data mining , 2008, KDD.

[89]  Olfa Nasraoui,et al.  Transparency in Fair Machine Learning: the Case of Explainable Recommender Systems , 2018, Human and Machine Learning.

[90]  Jöran Beel,et al.  Position Bias in Recommender Systems for Digital Libraries , 2018, iConference.

[91]  Dino Pedreschi,et al.  Algorithmic bias amplifies opinion fragmentation and polarization: A bounded confidence model , 2018, PloS one.

[92]  Solon Barocas,et al.  D ATA M INING AND THE D ISCOURSE ON D ISCRIMINATION , 2014 .

[93]  Patrick Shafto,et al.  Explaining Choice Behavior: The Intentional Selection Assumption , 2015, CogSci.

[94]  Kate M. Miltner,et al.  Big Data| Critiquing Big Data: Politics, Ethics, Epistemology | Special Section Introduction , 2014 .

[95]  Michael J. Pazzani,et al.  Learning and Revising User Profiles: The Identification of Interesting Web Sites , 1997, Machine Learning.

[96]  Alexander Dekhtyar,et al.  Information Retrieval , 2018, Lecture Notes in Computer Science.

[97]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[98]  Thorsten Joachims,et al.  Unbiased Learning-to-Rank with Biased Feedback , 2016, WSDM.

[99]  Francesco Bonchi,et al.  Algorithmic Bias: From Discrimination Discovery to Fairness-aware Data Mining , 2016, KDD.

[100]  W. Bruce Croft,et al.  Search Engines - Information Retrieval in Practice , 2009 .

[101]  Mehrnoush Shamsfard,et al.  Matrix Factorization with Explicit Trust and Distrust Side Information for Improved Social Recommendation , 2014, TOIS.

[102]  Olfa Nasraoui,et al.  Mining search engine query logs for query recommendation , 2006, WWW '06.

[103]  S. Kirby,et al.  Iterated learning and the evolution of language , 2014, Current Opinion in Neurobiology.

[104]  S. Robertson The probability ranking principle in IR , 1997 .

[105]  Sen Wang,et al.  Operation rule derivation of hydropower reservoir by k-means clustering method and extreme learning machine based on particle swarm optimization , 2019, Journal of Hydrology.

[106]  Pedro M. Domingos,et al.  On the Optimality of the Simple Bayesian Classifier under Zero-One Loss , 1997, Machine Learning.

[107]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[108]  Zi Huang,et al.  Joint Modeling of User Check-in Behaviors for Real-time Point-of-Interest Recommendation , 2016, ACM Trans. Inf. Syst..

[109]  Patrick Shafto,et al.  Chapter Four - Choice from among Intentionally Selected Options , 2015 .

[110]  R. Duncan Luce,et al.  Individual Choice Behavior: A Theoretical Analysis , 1979 .

[111]  Kenny Smith,et al.  Iterated learning in populations of Bayesian agents , 2009 .

[112]  Pattie Maes,et al.  Social information filtering: algorithms for automating “word of mouth” , 1995, CHI '95.

[113]  Erez Shmueli,et al.  Algorithmic Fairness , 2020, ArXiv.

[114]  Anna N. Rafferty,et al.  Convergence Bounds for Language Evolution by Iterated Learning , 2009 .

[115]  Shai Shalev-Shwartz,et al.  Online learning: theory, algorithms and applications (למידה מקוונת.) , 2007 .

[116]  George Karypis,et al.  Item-based top-N recommendation algorithms , 2004, TOIS.

[117]  R. Luce,et al.  Individual Choice Behavior: A Theoretical Analysis. , 1960 .

[118]  Thorsten Joachims,et al.  Recommendations as Treatments: Debiasing Learning and Evaluation , 2016, ICML.

[119]  Peter Bailey,et al.  Incorporating User Expectations and Behavior into the Measurement of Search Effectiveness , 2017, ACM Trans. Inf. Syst..

[120]  F ROSENBLATT,et al.  The perceptron: a probabilistic model for information storage and organization in the brain. , 1958, Psychological review.

[121]  Solon Barocas,et al.  Ten simple rules for responsible big data research , 2017, PLoS Comput. Biol..

[122]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[123]  Ali Farhadi,et al.  Deep Classifiers from Image Tags in the Wild , 2015, MMCommons '15.

[124]  J. Heckman Sample Selection Bias as a Specification Error (with an Application to the Estimation of Labor Supply Functions) , 1977 .

[125]  Ana-Andreea Stoica,et al.  Algorithmic Glass Ceiling in Social Networks: The effects of social recommendations on network diversity , 2018, WWW.

[126]  Peretz Shoval,et al.  Information Filtering: Overview of Issues, Research and Systems , 2001, User Modeling and User-Adapted Interaction.

[127]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[128]  Olfa Nasraoui,et al.  A Hybrid Recommender System Guided by Semantic User Profiles for Search in the E-learning Domain , 2010 .

[129]  Stephanie M. Stalinski,et al.  Journal of Experimental Psychology: Learning, Memory, and Cognition , 2012 .

[130]  Engin Bozdag,et al.  Bias in algorithmic filtering and personalization , 2013, Ethics and Information Technology.

[131]  Douglas B. Terry,et al.  Using collaborative filtering to weave an information tapestry , 1992, CACM.

[132]  Olfa Nasraoui,et al.  Human-Algorithm Interaction Biases in the Big Data Cycle: A Markov Chain Iterated Learning Framework , 2016, ArXiv.

[133]  Megan Garcia,et al.  Racist in the Machine: The Disturbing Implications of Algorithmic Bias , 2016 .

[134]  Greg Linden,et al.  Amazon . com Recommendations Item-to-Item Collaborative Filtering , 2001 .

[135]  W. Kruskal,et al.  Use of Ranks in One-Criterion Variance Analysis , 1952 .

[136]  Miroslav Dudík,et al.  Correcting sample selection bias in maximum entropy density estimation , 2005, NIPS.

[137]  Boi Faltings,et al.  Non-Discriminatory Machine Learning through Convex Fairness Criteria , 2018, AAAI.

[138]  Adam Tauman Kalai,et al.  Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings , 2016, NIPS.

[139]  Naresh Manwani,et al.  Noise Tolerance Under Risk Minimization , 2011, IEEE Transactions on Cybernetics.

[140]  Andrew D. Selbst,et al.  Big Data's Disparate Impact , 2016 .

[141]  S. Ramkumar A Web Usage Mining Framework for Mining Evolving User Profiles in Dynamic Web Sites , 2014 .

[142]  Chuntian Cheng,et al.  Annual Streamflow Time Series Prediction Using Extreme Learning Machine Based on Gravitational Search Algorithm and Variational Mode Decomposition , 2020 .

[143]  Michael E. Tipping,et al.  Probabilistic Principal Component Analysis , 1999 .

[144]  R. Lowry,et al.  Concepts and Applications of Inferential Statistics , 2014 .

[145]  Alfred Kobsa,et al.  The Adaptive Web, Methods and Strategies of Web Personalization , 2007, The Adaptive Web.

[146]  S. Shapiro,et al.  An Analysis of Variance Test for Normality (Complete Samples) , 1965 .

[147]  William W. Cohen,et al.  Recommendation as Classification: Using Social and Content-Based Information in Recommendation , 1998, AAAI/IAAI.

[148]  Gao Cong,et al.  Who, Where, When, and What , 2015, ACM Trans. Inf. Syst..

[149]  J. Busemeyer,et al.  Extending the Bounds of Rationality: Evidence and Theories of Preferential Choice , 2006 .

[150]  James Bennett,et al.  The Netflix Prize , 2007 .

[151]  Mustansar Ali Ghazanfar,et al.  Modeling user rating preference behavior to improve the performance of the collaborative filtering based recommender systems , 2019, PloS one.

[152]  Jialie Shen,et al.  On Effective Location-Aware Music Recommendation , 2016, ACM Trans. Inf. Syst..

[153]  David M. Blei,et al.  Modeling User Exposure in Recommendation , 2015, WWW.

[154]  Mark Steyvers,et al.  Topics in semantic representation. , 2007, Psychological review.

[155]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.