Emergent Instabilities in Algorithmic Feedback Loops

Algorithms that aid human tasks, such as recommendation systems, are ubiquitous. They appear in everything from social media to streaming videos to online shopping. However, the feedback loop between people and algorithms is poorly understood and can amplify cognitive and social biases (algorithmic confounding), leading to unexpected outcomes. In this work, we explore algorithmic confounding in collaborative filtering-based recommendation algorithms through teacher-student learning simulations. Namely, a student collaborative filtering-based model, trained on simulated choices, is used by the recommendation algorithm to recommend items to agents. Agents might choose some of these items, according to an underlying teacher model, with new choices then fed back into the student model as new training data (approximating online machine learning). These simulations demonstrate how algorithmic confounding produces erroneous recommendations which in turn lead to instability, i.e., wide variations in an item’s popularity between each simulation realization. We use the simulations to demonstrate a novel approach to training collaborative filtering models that can create more stable and accurate recommendations. Our methodology is general enough that it can be extended to other socio-technical systems in order to better quantify and improve the stability of algorithms. These results highlight the need to account for emergent behaviors from interactions between people and algorithms.

[1]  Anil A. Bharath,et al.  Deep Reinforcement Learning: A Brief Survey , 2017, IEEE Signal Processing Magazine.

[2]  Yehuda Koren,et al.  Lessons from the Netflix prize challenge , 2007, SKDD.

[3]  Jon M. Kleinberg,et al.  Inherent Trade-Offs in the Fair Determination of Risk Scores , 2016, ITCS.

[4]  Kristina Lerman,et al.  The myopia of crowds: Cognitive load and collective evaluation of answers on Stack Exchange , 2016, PloS one.

[5]  Mehrbakhsh Nilashi,et al.  Collaborative filtering recommender systems , 2013 .

[6]  Saeed Shiry Ghidary,et al.  Usage-based web recommendations: a reinforcement learning approach , 2007, RecSys '07.

[7]  Florent Krzakala,et al.  Constrained low-rank matrix estimation: phase transitions, approximate message passing and applications , 2017, ArXiv.

[8]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[9]  Barbara E. Engelhardt,et al.  How algorithmic confounding in recommendation systems increases homogeneity and decreases utility , 2017, RecSys.

[10]  Matthew J. Salganik,et al.  Experimental Study of Inequality and Unpredictability in an Artificial Cultural Market , 2006, Science.

[11]  Martin Trnecka,et al.  Model order selection for approximate Boolean matrix factorization problem , 2021, Knowl. Based Syst..

[12]  Guido Caldarelli,et al.  Users Polarization on Facebook and Youtube , 2016, PloS one.

[13]  Aaron Roth,et al.  Fairness in Learning: Classic and Contextual Bandits , 2016, NIPS.

[14]  Suresh Venkatasubramanian,et al.  Runaway Feedback Loops in Predictive Policing , 2017, FAT.

[15]  Yongfeng Zhang,et al.  Understanding Echo Chambers in E-commerce Recommender Systems , 2020, SIGIR.

[16]  Yong Li,et al.  E-learning Recommendation System , 2008, 2008 International Conference on Computer Science and Software Engineering.

[17]  Tim Weninger,et al.  GuessTheKarma: A Game to Assess Social Rating Systems , 2018, Proc. ACM Hum. Comput. Interact..

[18]  M. Mehdi Afsar,et al.  Reinforcement learning based recommender systems: A survey , 2021, ArXiv.

[19]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[20]  Bamshad Mobasher,et al.  Feedback Loop and Bias Amplification in Recommender Systems , 2020, CIKM.

[21]  Giuseppe Sansonetti,et al.  Context-aware movie recommendation based on signal processing and machine learning , 2011, CAMRa '11.

[22]  Qiang Yang,et al.  Partially Observable Markov Decision Process for Recommender Systems , 2016, ArXiv.

[23]  Tat-Seng Chua,et al.  Neural Collaborative Filtering , 2017, WWW.

[24]  Donghyun Kim,et al.  Convolutional Matrix Factorization for Document Context-Aware Recommendation , 2016, RecSys.

[25]  Alexander D'Amour,et al.  Fairness is not static: deeper understanding of long term fairness via simulation studies , 2020, FAT*.

[26]  Karthik Ramani,et al.  Deconvolving Feedback Loops in Recommender Systems , 2016, NIPS.

[27]  Dino Pedreschi,et al.  Algorithmic bias amplifies opinion fragmentation and polarization: A bounded confidence model , 2018, PloS one.

[28]  Tad Hogg,et al.  Origins of Algorithmic Instabilities in Crowdsourced Ranking , 2020, Proc. ACM Hum. Comput. Interact..

[29]  Kristina Lerman,et al.  Leveraging Position Bias to Improve Peer Recommendation , 2014, PloS one.

[30]  D. Perry,et al.  How Does He Saw Me ? A Recommendation Engine for Picking Heroes in Dota 2 , 2013 .

[31]  Surya Ganguli,et al.  An analytic theory of generalization dynamics and transfer learning in deep linear networks , 2018, ICLR.

[32]  Paulo S. C. Alencar,et al.  The use of machine learning algorithms in recommender systems: A systematic review , 2015, Expert Syst. Appl..

[33]  Aaron Roth,et al.  Fairness in Reinforcement Learning , 2016, ICML.

[34]  Nathan Srebro,et al.  Implicit Regularization in Matrix Factorization , 2017, 2018 Information Theory and Applications Workshop (ITA).