Human-Algorithm Interaction Biases in the Big Data Cycle: A Markov Chain Iterated Learning Framework

Early supervised machine learning algorithms have relied on reliable expert labels to build predictive models. However, the gates of data generation have recently been opened to a wider base of users who started participating increasingly with casual labeling, rating, annotating, etc. The increased online presence and participation of humans has led not only to a democratization of unchecked inputs to algorithms, but also to a wide democratization of the "consumption" of machine learning algorithms' outputs by general users. Hence, these algorithms, many of which are becoming essential building blocks of recommender systems and other information filters, started interacting with users at unprecedented rates. The result is machine learning algorithms that consume more and more data that is unchecked, or at the very least, not fitting conventional assumptions made by various machine learning algorithms. These include biased samples, biased labels, diverging training and testing sets, and cyclical interaction between algorithms, humans, information consumed by humans, and data consumed by algorithms. Yet, the continuous interaction between humans and algorithms is rarely taken into account in machine learning algorithm design and analysis. In this paper, we present a preliminary theoretical model and analysis of the mutual interaction between humans and algorithms, based on an iterated learning framework that is inspired from the study of human language evolution. We also define the concepts of human and algorithm blind spots and outline machine learning approaches to mend iterated bias through two novel notions: antidotes and reactive learning.

[1]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[2]  Pattie Maes,et al.  Agents that reduce work and information overload , 1994, CACM.

[3]  Corey J Bohil,et al.  Observational versus feedback training in rule-based and information-integration category learning , 2002, Memory & cognition.

[4]  Mike Fitzpatrick Choice , 2004, The Lancet.

[5]  Yoav Shoham,et al.  Fab: content-based, collaborative recommendation , 1997, CACM.

[6]  David A. Cohn,et al.  Active Learning with Statistical Models , 1996, NIPS.

[7]  James Allan,et al.  Automatic Query Expansion Using SMART: TREC 3 , 1994, TREC.

[8]  William W. Cohen,et al.  Recommendation as Classification: Using Social and Content-Based Information in Recommendation , 1998, AAAI/IAAI.

[9]  Patrick Shafto,et al.  Explaining Choice Behavior: The Intentional Selection Assumption , 2015, CogSci.

[10]  R. Duncan Luce,et al.  Individual Choice Behavior , 1959 .

[11]  Greg Linden,et al.  Amazon . com Recommendations Item-to-Item Collaborative Filtering , 2001 .

[12]  Michael J. Pazzani,et al.  Learning and Revising User Profiles: The Identification of Interesting Web Sites , 1997, Machine Learning.

[13]  Patrick Shafto,et al.  Learning to trust and trusting to learn: a theoretical framework , 2015, Trends in Cognitive Sciences.

[14]  Ji Zhu,et al.  A Method for Inferring Label Sampling Mechanisms in Semi-Supervised Learning , 2004, NIPS.

[15]  Michael C. Frank,et al.  Learning From Others , 2012, Perspectives on psychological science : a journal of the Association for Psychological Science.

[16]  Michael J. Pazzani,et al.  Adaptive News Access , 2007, The Adaptive Web.

[17]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[18]  Olfa Nasraoui,et al.  Complete This Puzzle: A Connectionist Approach to Accurate Web Recommendations Based on a Committee of Predictors , 2004, WebKDD.

[19]  Pattie Maes,et al.  Evolving agents for personalized information filtering , 1993, Proceedings of 9th IEEE Conference on Artificial Intelligence for Applications.

[20]  Thomas L. Griffiths,et al.  A Bayesian View of Language Evolution by Iterated Learning - eScholarship , 2005 .

[21]  Eli Pariser,et al.  The Filter Bubble: What the Internet Is Hiding from You , 2011 .

[22]  J. Stevenson The cultural origins of human cognition , 2001 .

[23]  J. Klayman,et al.  Confirmation, Disconfirmation, and Informa-tion in Hypothesis Testing , 1987 .

[24]  Karen Spärck Jones Some thoughts on classification for retrieval , 1970, J. Documentation.

[25]  J. Kruschke,et al.  ALCOVE: an exemplar-based connectionist model of category learning. , 1992, Psychological review.

[26]  R. Luce,et al.  Individual Choice Behavior: A Theoretical Analysis. , 1960 .

[27]  Wai-Tat Fu,et al.  Beyond the filter bubble: interactive effects of perceived threat and topic involvement on selective exposure to information , 2013, CHI.

[28]  Sophie Ahrens,et al.  Recommender Systems , 2012 .

[30]  D. Markant,et al.  Is it better to select or to receive? Learning via active and passive hypothesis testing. , 2014, Journal of experimental psychology. General.

[31]  Kristin L. Sainani,et al.  Dealing with missing data , 2002 .

[32]  Douglas B. Terry,et al.  Using collaborative filtering to weave an information tapestry , 1992, CACM.

[33]  Y. Vardi Empirical Distributions in Selection Bias Models , 1985 .

[34]  Mohamed Jemni,et al.  Automatic Recommendations for E-Learning Personalization Based on Web Usage Mining Techniques and Information Retrieval , 2008, 2008 Eighth IEEE International Conference on Advanced Learning Technologies.

[35]  Gregory Ashby,et al.  A neuropsychological theory of multiple systems in category learning. , 1998, Psychological review.

[36]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[37]  Matthias Seeger,et al.  Learning from Labeled and Unlabeled Data , 2010, Encyclopedia of Machine Learning.

[38]  Fabio A. González,et al.  Performance of Recommendation Systems in Dynamic Streaming Environments , 2007, SDM.

[39]  T. Griffiths,et al.  Iterated learning and the cultural ratchet , 2009 .

[40]  M. Posner,et al.  On the genesis of abstract ideas. , 1968, Journal of experimental psychology.

[41]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[42]  Burr Settles,et al.  Active Learning Literature Survey , 2009 .

[43]  Olfa Nasraoui,et al.  A Hybrid Recommender System Guided by Semantic User Profiles for Search in the E-learning Domain , 2010 .

[44]  Pattie Maes,et al.  Social information filtering: algorithms for automating “word of mouth” , 1995, CHI '95.

[45]  L. Thurstone A law of comparative judgment. , 1994 .

[46]  Patricia M. Berretty,et al.  On the dominance of unidimensional rules in unsupervised categorization , 1999, Perception & psychophysics.

[47]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[48]  R. Shepard,et al.  Learning and memorization of classifications. , 1961 .

[49]  Patrick Shafto,et al.  FOUR Choice from among Intentionally Selected Options , 2015 .

[50]  Craig Boutilier,et al.  Active Collaborative Filtering , 2002, UAI.

[51]  Alexander Zien,et al.  Semi-Supervised Text Classification Using EM , 2006 .

[52]  Olfa Nasraoui,et al.  A cross-modal warm-up solution for the cold-start problem in collaborative filtering recommender systems , 2014, WebSci '14.

[53]  Ruslan Salakhutdinov,et al.  Probabilistic Matrix Factorization , 2007, NIPS.

[54]  Sean M. McNee,et al.  Getting to know you: learning new user preferences in recommender systems , 2002, IUI '02.

[55]  A. Tversky,et al.  Choice in Context: Tradeoff Contrast and Extremeness Aversion , 1992 .

[56]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[57]  Richard S. Zemel,et al.  Collaborative prediction and ranking with non-random missing data , 2009, RecSys '09.

[58]  Xiaojin Zhu,et al.  Introduction to Semi-Supervised Learning , 2009, Synthesis Lectures on Artificial Intelligence and Machine Learning.

[59]  Ricardo Melo,et al.  The Design of Horacle : Inducing Serendipity on the Web , 2013 .

[60]  William M. Smith,et al.  A Study of Thinking , 1956 .

[61]  J. Yellott The relationship between Luce's Choice Axiom, Thurstone's Theory of Comparative Judgment, and the double exponential distribution , 1977 .

[62]  J. Busemeyer,et al.  Extending the Bounds of Rationality: Evidence and Theories of Preferential Choice , 2006 .

[63]  Yi Yang,et al.  Multi-Class Active Learning by Uncertainty Sampling with Diversity Maximization , 2015, International Journal of Computer Vision.

[64]  Olfa Nasraoui,et al.  Mining search engine query logs for query recommendation , 2006, WWW '06.

[65]  Edward A. Fox,et al.  Research Contributions , 2014 .

[66]  R. Nosofsky American Psychological Association, Inc. Choice, Similarity, and the Context Theory of Classification , 2022 .

[67]  Michael J. Pazzani,et al.  Content-Based Recommendation Systems , 2007, The Adaptive Web.

[68]  Sean A. Munson,et al.  Presenting diverse political opinions: how and how much , 2010, CHI.

[69]  S. Robertson The probability ranking principle in IR , 1997 .

[70]  A. Gopnik,et al.  Children’s imitation of causal action sequences is influenced by statistical and pedagogical evidence , 2011, Cognition.

[71]  Kinshuk,et al.  Intelligent and Adaptive Learning Systems: Technology Enhanced Support for Learners and Teachers , 2011 .

[72]  Jaime G. Carbonell,et al.  Proactive learning: cost-sensitive active learning with multiple imperfect oracles , 2008, CIKM '08.

[73]  D. Rubin INFERENCE AND MISSING DATA , 1975 .

[74]  Simon Kirby,et al.  Spontaneous evolution of linguistic structure-an iterated learning model of the emergence of regularity and irregularity , 2001, IEEE Trans. Evol. Comput..

[75]  Vittorio Castelli,et al.  On the exponential value of labeled samples , 1995, Pattern Recognit. Lett..

[76]  Dietram A. Scheufele,et al.  Science, New Media, and the Public , 2013, Science.

[77]  Mohamed Jemni,et al.  Automatic Personalization in E-Learning Based on Recommendation Systems: An Overview , 2012 .

[78]  Bradley N. Miller,et al.  GroupLens: applying collaborative filtering to Usenet news , 1997, CACM.

[79]  A. Tversky Elimination by aspects: A theory of choice. , 1972 .

[80]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[81]  Baxter S. Eaves,et al.  Epistemic trust: modeling children's reasoning about others' knowledge and intent. , 2012, Developmental science.

[82]  정승원,et al.  Confirmation , 1979, Journal of General Internal Medicine.

[83]  Douglas L. Medin,et al.  Context theory of classification learning. , 1978 .

[84]  J. J. Rocchio,et al.  Relevance feedback in information retrieval , 1971 .

[85]  Noah D. Goodman,et al.  A rational account of pedagogical reasoning: Teaching by, and learning from, examples , 2014, Cognitive Psychology.

[86]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[87]  S. Ramkumar A Web Usage Mining Framework for Mining Evolving User Profiles in Dynamic Web Sites , 2014 .

[88]  David A. Cohn,et al.  Improving generalization with active learning , 1994, Machine Learning.

[89]  Raymond J. Mooney,et al.  Diverse ensembles for active learning , 2004, ICML.

[90]  Olfa Nasraoui,et al.  A recommender system based on the collaborative behavior of bird flocks , 2010, 6th International Conference on Collaborative Computing: Networking, Applications and Worksharing (CollaborateCom 2010).

[91]  D. McFadden Quantal Choice Analysis: A Survey , 1976 .

[92]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[93]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[94]  Simon Kirby,et al.  Innateness and culture in the evolution of language , 2006, Proceedings of the National Academy of Sciences.

[95]  D. Doyle,et al.  New media , 2000, Canadian Journal of Anaesthesia-journal Canadien D Anesthesie.

[96]  Alexander Dekhtyar,et al.  Information Retrieval , 2018, Lecture Notes in Computer Science.

[97]  Michael R. Lyu,et al.  Effective missing data prediction for collaborative filtering , 2007, SIGIR.

[98]  John Riedl,et al.  Recommender systems in e-commerce , 1999, EC '99.

[99]  Sahibsingh A. Dudani The Distance-Weighted k-Nearest-Neighbor Rule , 1976, IEEE Transactions on Systems, Man, and Cybernetics.

[100]  Patrick Shafto,et al.  Unifying pedagogical reasoning and epistemic trust. , 2012, Advances in child development and behavior.

[101]  Patrick Shafto,et al.  Chapter Four - Choice from among Intentionally Selected Options , 2015 .

[102]  Tom M. Mitchell,et al.  Semi-Supervised Text Classification Using EM , 2006, Semi-Supervised Learning.

[103]  Peretz Shoval,et al.  Information Filtering: Overview of Issues, Research and Systems , 2001, User Modeling and User-Adapted Interaction.

[104]  R. Kothari,et al.  Learning from labeled and unlabeled data , 2002, Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No.02CH37290).

[105]  Xiaojin Zhu,et al.  --1 CONTENTS , 2006 .

[106]  Mark Steyvers,et al.  Topics in semantic representation. , 2007, Psychological review.

[107]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[108]  Olfa Nasraoui,et al.  A Semi-supervised Learning Framework to Cluster Mixed Data Types , 2012, KDIR.

[109]  Xiaojin Zhu,et al.  Semi-Supervised Learning , 2010, Encyclopedia of Machine Learning.