Exploring Statistical Language Models for Recommender Systems

Even though there exist multiple approaches to build recommendation algorithms, algebraic techniques based on vector and matrix representations are predominant in the field. Notwithstanding the fact that these algebraic Collaborative Filtering methods have been demonstrated to be very effective in the rating prediction task, they do not generally provide good results in the top-N recommendation task. In this research, we return to the roots of recommender systems and we explore the relationship between Information Filtering and Information Retrieval. We think that probabilistic methods taken from the latter field such as statistical Language Models can be a more effective and formal way for generating personalised ranks of recommendations. We compare our improvements against several algebraic and probabilistic state-of-the-art algorithms and pave the way to future and promising research directions.

[1]  Jun Wang,et al.  A User-Item Relevance Model for Log-Based Collaborative Filtering , 2006, ECIR.

[2]  Yehuda Koren,et al.  Advances in Collaborative Filtering , 2011, Recommender Systems Handbook.

[3]  ChengXiang Zhai,et al.  Statistical Language Models for Information Retrieval , 2008, NAACL.

[4]  Jun Wang,et al.  Language Models of Collaborative Filtering , 2009, AIRS.

[5]  Bracha Shapira,et al.  Recommender Systems Handbook , 2015, Springer US.

[6]  Tie-Yan Liu,et al.  A Theoretical Analysis of NDCG Type Ranking Measures , 2013, COLT.

[7]  Guy Shani,et al.  Evaluating Recommendation Systems , 2011, Recommender Systems Handbook.

[8]  Yi-Cheng Zhang,et al.  Solving the apparent diversity-accuracy dilemma of recommender systems , 2008, Proceedings of the National Academy of Sciences.

[9]  Alejandro Bellogín,et al.  Relevance-based language modelling for recommender systems , 2013, Inf. Process. Manag..

[10]  W. Bruce Croft,et al.  Relevance-Based Language Models , 2001, SIGIR '01.

[11]  Alejandro Bellogín,et al.  Precision-oriented evaluation of recommender systems: an algorithmic comparison , 2011, RecSys '11.

[12]  Jun Wang,et al.  Noname manuscript No. (will be inserted by the editor) Bridging Memory-Based Collaborative Filtering and Text Retrieval , 2022 .

[13]  Yuanzhi Li,et al.  A Theoretical Analysis of NDCG Ranking Measures , 2013 .

[14]  Philipp Koehn,et al.  Synthesis Lectures on Human Language Technologies , 2016 .

[15]  Alvaro Barreiro,et al.  A Study of Smoothing Methods for Relevance-Based Language Modelling of Recommender Systems , 2015, ECIR.

[16]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[17]  Peretz Shoval,et al.  Information Filtering: Overview of Issues, Research and Systems , 2001, User Modeling and User-Adapted Interaction.

[18]  Marti A. Hearst Chapter 2 of the second edition of Modern Information Retrieval Renamed Modern Information Retrieval : The Concepts and Technology behind Search , 2011 .

[19]  Nicholas J. Belkin,et al.  Information filtering and information retrieval: two sides of the same coin? , 1992, CACM.

[20]  Nicola Barbieri,et al.  An Analysis of Probabilistic Methods for Top-N Recommendation in Collaborative Filtering , 2011, ECML/PKDD.

[21]  George Karypis,et al.  A Comprehensive Survey of Neighborhood-based Recommendation Methods , 2011, Recommender Systems Handbook.

[22]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[23]  Roberto Turrin,et al.  Performance of recommender algorithms on top-n recommendation tasks , 2010, RecSys '10.