Introducing serendipity in recommender systems through collaborative methods

Widely used recommendation systems are mainly accuracy-oriented since they are based on item-based ratings and useror item-based similarity measures. Such accuracy-based engines do not consider factors such as proliferation of varied user interests and the desire for changes. This results in a muted user experience that is generated from a constrained and narrow feature set. Recommender systems should therefore consider other important metrics outside of accuracy such as coverage, novelty, serendipity, unexpectedness and usefulness. The main focus of this thesis is to both incorporate serendipity into a recommendation engine and improve its quality using the widely used collaborative filtering method. Serendipity is defined as finding something good or useful while not specifically searching for it. The design of recommendation engines that considers serendipity is a relatively new and an open research problem. This is largely due to a certain degree of ambiguity in balancing the level of unexpectedness and usefulness of items. In this thesis, a new hybrid algorithm that combines a standard user-based collaborative filtering method, and item attributes has been proposed to improve the quality of serendipity over those that use item ratings alone. The algorithm was implemented using Python in conjunction with the scientific computing package NumPy. Furthermore, the code has been validated using a well-accepted and widely used open source software namely, Apache Mahout, that provides support for recommender system application development. The new method has been tested on the 100K MovieLens dataset from the GroupLens Research Center that consists of 100,000 preferences for 1,682 movies rated by 943 customers. The new algorithm is shown to be capable of identifying a significant fraction of movies that are less serendipitous but which might not have been identified otherwise, thereby improving the quality of predictions.

[1]  Greg Linden,et al.  Amazon . com Recommendations Item-to-Item Collaborative Filtering , 2001 .

[2]  Ryohei Orihara,et al.  Metrics for Evaluating the Serendipity of Recommendation Lists , 2007, JSAI.

[3]  John Riedl,et al.  An algorithmic framework for performing collaborative filtering , 1999, SIGIR '99.

[4]  John Riedl,et al.  Analysis of recommendation algorithms for e-commerce , 2000, EC '00.

[5]  David M. Pennock,et al.  Categories and Subject Descriptors , 2001 .

[6]  Mouzhi Ge,et al.  Beyond accuracy: evaluating recommender systems by coverage and serendipity , 2010, RecSys '10.

[7]  Sean Owen,et al.  Collaborative Filtering with Apache Mahout , 2012 .

[8]  Taghi M. Khoshgoftaar,et al.  A Survey of Collaborative Filtering Techniques , 2009, Adv. Artif. Intell..

[9]  Sean M. McNee,et al.  Improving recommendation lists through topic diversification , 2005, WWW '05.

[10]  Mehrbakhsh Nilashi,et al.  Collaborative filtering recommender systems , 2013 .

[11]  Chunming Rong,et al.  Using Mahout for Clustering Wikipedia's Latest Articles: A Comparison between K-means and Fuzzy C-means in the Cloud , 2011, 2011 IEEE Third International Conference on Cloud Computing Technology and Science.

[12]  Susan T. Dumais,et al.  From x-rays to silly putty via Uranus: serendipity and its role in web search , 2009, CHI.

[13]  Chhavi Rana New dimensions of temporal serendipity and temporal novelty inrecommender system , 2013 .

[14]  Sung-Bong Yang,et al.  Using Attributes to Improve Prediction Quality in Collaborative Filtering , 2004, EC-Web.

[15]  Panagiotis Adamopoulos,et al.  On Unexpectedness in Recommender Systems: Or How to Expect the Unexpected , 2011, DiveRS@RecSys.

[16]  Zoran Obradovic,et al.  Collaborative Filtering Using a Regression-Based Approach , 2003, Knowledge and Information Systems.

[17]  Kishor Sadafale,et al.  An online recommendation system for e-commerce based on apache mahout framework , 2013, SIGMIS-CPR '13.

[18]  Michael J. Pazzani,et al.  Collaborative Filtering with the Simple Bayesian Classifier , 2000, PRICAI.

[19]  Sean M. McNee,et al.  Being accurate is not enough: how accuracy metrics have hurt recommender systems , 2006, CHI Extended Abstracts.

[20]  Shogo Nishida,et al.  Discovery-oriented collaborative filtering for improving user satisfaction , 2009, IUI.

[21]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[22]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[23]  Daniel Lemire,et al.  Slope One Predictors for Online Rating-Based Collaborative Filtering , 2007, SDM.

[24]  George Karypis,et al.  Evaluation of Item-Based Top-N Recommendation Algorithms , 2001, CIKM '01.

[25]  Panagiotis Adamopoulos,et al.  On Unexpectedness in Recommender Systems , 2013, ACM Trans. Intell. Syst. Technol..

[26]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[27]  John Riedl,et al.  Recommender Systems for Large-scale E-Commerce : Scalable Neighborhood Formation Using Clustering , 2002 .

[28]  Shuhang Guo,et al.  Analysis and Evaluation of Similarity Metrics in Collaborative Filtering Recommender System , 2014 .

[29]  David C. Wilson,et al.  Case Study Evaluation of Mahout as a Recommender Platform , 2012, RUE@RecSys.

[30]  Noriaki Kawamae,et al.  Serendipitous recommendations via innovators , 2010, SIGIR.

[31]  Guy Shani,et al.  Evaluating Recommendation Systems , 2011, Recommender Systems Handbook.

[32]  Pasquale Lops,et al.  Can a Recommender System Induce Serendipitous Encounters , 2010 .