Social Influence Bias in Recommender Systems : A Methodology for Learning , Analyzing , and Mitigating Bias in Ratings

To facilitate browsing and selection, almost all recommender systems display an aggregate statistic (the average/mean or median rating value) for each item. This value has potential to influence a participant’s individual rating for an item due to what is known in the survey and psychology literature as Social Influence Bias; the tendency for individuals to conform to what they perceive as the “norm” in a community. As a result, ratings can be closer to the average and less diverse than they would be otherwise. We propose a methodology to 1) learn, 2) analyze, and 3) mitigate the effect of social influence bias in recommender systems. In the Learning phase, a baseline dataset is established with an initial set of participants by allowing them to rate items twice: before seeing the median rating, and again after seeing it. In the Analysis phase, a new non-parametric significance test based on the Wilcoxon statistic can quantify the extent of social influence bias in this data. If this bias is significant, we propose a Mitigation phase where mathematical models are constructed from this data using polynomial regression and the Bayesian Information Criterion (BIC) and then inverted to produce a filter that can reduce the effect of social influence bias. As a case study, we apply this methodology to the California Report Card (CRC), a new recommender system that encourages political engagement. After the Learning phase collected 9390 ratings, the non-parametric test in the Analysis phase rejected the null hypothesis, identifying significant social influence bias: ratings after display of the median were on average 19.3% closer to the median value. In the Mitigating phase, the learned polynomial models were able to predict changed ratings with a normalized RMSE of 12.8% and reduce bias by 76.3%. Results suggest that social influence bias can be significant in recommender systems and that this bias can be substantially reduced with machine learning. The CRC, our data, and experimental code can be found at: http://californiareportcard.org/data/

[1]  Haiyi Zhu,et al.  To Switch or Not To Switch , 2012, CHI.

[2]  R. Bond,et al.  Culture and conformity: A meta-analysis of studies using Asch's (1952b, 1956) line judgment task. , 1996 .

[3]  S. Moscovici,et al.  Social Influence, Conformity Bias, and the Study of Active Minorities , 1972 .

[4]  Chun Liu,et al.  Social Influence Bias : A Randomized Experiment , 2014 .

[5]  J. Alborsa,et al.  New learning network paradigms: Communities of objectives, crowdsourcing, wikis and open source , 2008, Int. J. Inf. Manag..

[6]  Jon M. Kleinberg,et al.  WWW 2009 MADRID! Track: Data Mining / Session: Opinions How Opinions are Received by Online Communities: A Case Study on Amazon.com Helpfulness Votes , 2022 .

[7]  Edward P. Markowski,et al.  Conditions for the Effectiveness of a Preliminary Test of Variance , 1990 .

[8]  S. Asch Studies of independence and conformity: I. A minority of one against a unanimous majority. , 1956 .

[9]  S. Asch Opinions and Social Pressure , 1955, Nature.

[10]  David R. Anderson,et al.  Model selection and multimodel inference : a practical information-theoretic approach , 2003 .

[11]  Kenneth Y. Goldberg,et al.  Eigentaste: A Constant Time Collaborative Filtering Algorithm , 2001, Information Retrieval.

[12]  P. DeMarzo,et al.  Persuasion Bias, Social Influence, and Uni-Dimensional Opinions , 2001 .

[13]  R. E. Burnkrant,et al.  Informational and Normative Social Influence in Buyer Behavior , 1975 .

[14]  Siamak Faridani Using canonical correlation analysis for generalized sentiment analysis, product recommendation and search , 2011, RecSys '11.

[15]  S. Bikhchandani,et al.  Herd Behavior in Financial Markets: A Review , 2000, SSRN Electronic Journal.

[16]  Jeffrey K. MacKie-Mason,et al.  Incentive-Centered Design for User-Contributed Content , 2012 .

[17]  A. Banerjee,et al.  A Simple Model of Herd Behavior , 1992 .

[18]  Ephrat Bitton,et al.  A spatial model for collaborative filtering of comments in an online discussion forum , 2009, RecSys '09.

[19]  W. Wood Attitude change: persuasion and social influence. , 2000, Annual review of psychology.

[20]  Kenneth Y. Goldberg,et al.  Opinion space: a scalable tool for browsing online comments , 2010, CHI.

[21]  Jen-Hung Huang,et al.  Herding in online product choice , 2006 .

[22]  U. Dholakia,et al.  Auction or agent (or both)? A study of moderators of the herding bias in digital auctions , 2002 .

[23]  Haiyi Zhu,et al.  To Switch or Not To Switch , 2014 .

[24]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[25]  D. Helbing,et al.  How social influence can undermine the wisdom of crowd effect , 2011, Proceedings of the National Academy of Sciences.

[26]  Kieron O'Hara,et al.  Transparency, open data and trust in government: shaping the infosphere , 2012, WebSci '12.

[27]  Kenneth Y. Goldberg,et al.  Eigentaste 5.0: constant-time adaptability in a recommender system using item clustering , 2007, RecSys '07.