Do News Consumers Want Explanations for Personalized News Rankings

This paper states the case for the principle of minimal necessary data: If two recommender algorithms achieve the same effectiveness, the better algorithm is the one that requires less user data. Applying this principle involves carrying out training data requirements analysis, which we argue should be adopted as best practice for the development and evaluation of recommender algorithms. We take the position that responsible recommendation is recommendation that serves the people whose data it uses. To minimize the imposition on users’ privacy, it is important that a recommender system does not collect or store more user information than it absolutely needs. Further, algorithms using minimal necessary data reduce training time and address the cold start problem. To illustrate the trade-off between training data volume and accuracy, we carry out a set of classic recommender system experiments. We conclude that consistently applying training data requirements analysis would represent a relatively small change in researchers’ current practices, but a large step towards more responsible recommender systems. ACM Reference format: Martha Larson1,2, Alessandro Zito3, Babak Loni2, Paolo Cremonesi3. 2017. Towards Minimal Necessary Data: The Case for Analyzing Training Data Requirements of Recommender Algorithms. In Proceedings of Workshop on Responsible Recommendation at ACM RecSys’17, Como, Italy, 31 August 2017 (FATREC’17), 6 pages. https://doi.org/10.18122/B2VX12

[1]  Fernando Diaz,et al.  Auditing Search Engines for Differential Satisfaction Across Demographics , 2017, WWW.

[2]  Nikos Mamoulis,et al.  Fairness in Package-to-Group Recommendations , 2017, WWW.

[3]  Ankur Taly,et al.  Axiomatic Attribution for Deep Networks , 2017, ICML.

[4]  David García,et al.  Bias in Online Freelance Marketplaces: Evidence from TaskRabbit and Fiverr , 2017, CSCW.

[5]  Krishna P. Gummadi,et al.  Fairness Beyond Disparate Treatment & Disparate Impact: Learning Classification without Disparate Mistreatment , 2016, WWW.

[6]  Alexandra Chouldechova,et al.  Fair prediction with disparate impact: A study of bias in recidivism prediction instruments , 2016, Big Data.

[7]  Pasquale Lops,et al.  ExpLOD: A Framework for Explaining Recommendations based on the Linked Open Data Cloud , 2016, RecSys.

[8]  Juho Hamari,et al.  The sharing economy: Why people participate in collaborative consumption , 2016, J. Assoc. Inf. Sci. Technol..

[9]  Adam Tauman Kalai,et al.  Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings , 2016, NIPS.

[10]  Yong-Yeol Ahn,et al.  Twitter's Glass Ceiling: The Effect of Perceived Gender on Online Visibility , 2016, ICWSM.

[11]  Michael Luca Reviews, Reputation, and Revenue: The Case of Yelp.Com , 2016 .

[12]  Carlos Guestrin,et al.  "Why Should I Trust You?": Explaining the Predictions of Any Classifier , 2016, ArXiv.

[13]  Tamar Kricheli-Katz,et al.  How many cents on the dollar? Women and men in product markets , 2016, Science Advances.

[14]  Nicholas Diakopoulos,et al.  Algorithmic Accountability , 2015 .

[15]  Steven M. Bellovin,et al.  "I don't have a photograph, but you can have my footprints.": Revealing the Demographics of Location Data , 2015, ICWSM.

[16]  Loren G. Terveen,et al.  Avoiding the South Side and the Suburbs: The Geography of Mobile Crowdsourcing Markets , 2015, CSCW.

[17]  Andreas Krause,et al.  Incentivizing Users for Balancing Bike Sharing Systems , 2015, AAAI.

[18]  Karrie Karahalios,et al.  Auditing Algorithms : Research Methods for Detecting Discrimination on Internet Platforms , 2014 .

[19]  J. Byers,et al.  The Rise of the Sharing Economy: Estimating the Impact of Airbnb on the Hotel Industry , 2017 .

[20]  Sujoy Roy,et al.  Personalized news recommendation based on implicit feedback , 2013, NRS '13.

[21]  Latanya Sweeney,et al.  Discrimination in online ad delivery , 2013, CACM.

[22]  Yin Zhang,et al.  Solving a low-rank factorization model for matrix completion by a nonlinear successive over-relaxation algorithm , 2012, Mathematical Programming Computation.

[23]  Daniele Quercia,et al.  Auralist: introducing serendipity into music recommendation , 2012, WSDM '12.

[24]  Toniann Pitassi,et al.  Fairness through awareness , 2011, ITCS '12.

[25]  Sune Lehmann,et al.  Understanding the Demographics of Twitter Users , 2011, ICWSM.

[26]  Michael Karlsson,et al.  The immediacy of online news, the visibility of journalistic processes and a restructuring of journalistic authority , 2011 .

[27]  Judith Masthoff,et al.  Designing and Evaluating Explanations for Recommender Systems , 2011, Recommender Systems Handbook.

[28]  Yehuda Koren,et al.  Advances in Collaborative Filtering , 2011, Recommender Systems Handbook.

[29]  Jiahui Liu,et al.  Personalized news recommendation based on click behavior , 2010, IUI '10.

[30]  Motoaki Kawanabe,et al.  How to Explain Individual Classification Decisions , 2009, J. Mach. Learn. Res..

[31]  Michael Werman,et al.  Fast and robust Earth Mover's Distances , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[32]  John Riedl,et al.  Tagsplanations: explaining recommendations using tags , 2009, IUI.

[33]  Michael Werman,et al.  A Linear Time Histogram Metric for Improved SIFT Matching , 2008, ECCV.

[34]  Li Chen,et al.  Trust-inspiring explanation interfaces for recommender systems , 2007, Knowl. Based Syst..

[35]  Judith Masthoff,et al.  A Survey of Explanations in Recommender Systems , 2007, 2007 IEEE 23rd International Conference on Data Engineering Workshop.

[36]  J. Singer CONTESTED AUTONOMY , 2007 .

[37]  Dubravko Radic,et al.  Are Customers Always Right? Fees to Protect Firms from Customer Opportunistic Behavior , 2007 .

[38]  Raymond J. Mooney,et al.  Explaining Recommendations: Satisfaction vs. Promotion , 2005 .

[39]  Ying Wang,et al.  Average rate updating mechanism in proportional fair scheduler for HDR , 2004, GLOBECOM.

[40]  K. Margaritis,et al.  Analysis of Recommender Systems’ Algorithms , 2003 .

[41]  Raymond J. Mooney,et al.  Content-boosted collaborative filtering for improved recommendations , 2002, AAAI/IAAI.

[42]  John Riedl,et al.  Explaining collaborative filtering recommendations , 2000, CSCW '00.

[43]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .