Axiomatic Analysis of Language Modelling of Recommender Systems

Language Models constitute an effective framework for text retrieval tasks. Recently, it has been extended to various collaborative filtering tasks. In particular, relevance-based language models can be used for generating highly accurate recommendations using a memory-based approach. On the other hand, the query likelihood model has proven to be a successful strategy for neighbourhood computation. Since relevance-based language models rely on user neighbourhoods for producing recommendations, we propose to use the query likelihood model for computing those neighbourhoods instead of cosine similarity. The combination of both techniques results in a formal probabilistic recommender system which has not been used before in collaborative filtering. A thorough evaluation on three datasets shows that the query likelihood model provides better results than cosine similarity. To understand this improvement, we devise two properties that a good neighbourhood algorithm should satisfy. Our axiomatic analysis shows ...

[1]  W. Bruce Croft,et al.  Relevance-Based Language Models , 2001, SIGIR '01.

[2]  Jun Wang,et al.  Noname manuscript No. (will be inserted by the editor) Bridging Memory-Based Collaborative Filtering and Text Retrieval , 2022 .

[3]  Yi-Cheng Zhang,et al.  Solving the apparent diversity-accuracy dilemma of recommender systems , 2008, Proceedings of the National Academy of Sciences.

[4]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[5]  Stephen E. Robertson,et al.  Understanding inverse document frequency: on theoretical arguments for IDF , 2004, J. Documentation.

[6]  David J. C. MacKay,et al.  A hierarchical Dirichlet language model , 1995, Natural Language Engineering.

[7]  Alvaro Barreiro,et al.  Language Models for Collaborative Filtering Neighbourhoods , 2016, ECIR.

[8]  CHENGXIANG ZHAI,et al.  A study of smoothing methods for language models applied to information retrieval , 2004, TOIS.

[9]  Frederick Jelinek,et al.  Interpolated estimation of Markov source parameters from sparse data , 1980 .

[10]  Tao Tao,et al.  A formal study of information retrieval heuristics , 2004, SIGIR '04.

[11]  W. Bruce Croft,et al.  A language modeling approach to information retrieval , 1998, SIGIR '98.

[12]  Hermann Ney,et al.  On structuring probabilistic dependences in stochastic language modelling , 1994, Comput. Speech Lang..

[13]  Neil J. Hurley,et al.  Novelty and Diversity in Top-N Recommendation -- Analysis and Evaluation , 2011, TOIT.

[14]  Roberto Turrin,et al.  Performance of recommender algorithms on top-n recommendation tasks , 2010, RecSys '10.

[15]  Bracha Shapira,et al.  Recommender Systems Handbook , 2015, Springer US.

[16]  Saul Vargas,et al.  Novelty and Diversity in Recommender Systems , 2015, Recommender Systems Handbook.

[17]  Jaana Kekäläinen,et al.  Cumulated gain-based evaluation of IR techniques , 2002, TOIS.

[18]  Daniel Valcarce,et al.  Exploring Statistical Language Models for Recommender Systems , 2015, RecSys.

[19]  Karen Sparck Jones A statistical interpretation of term specificity and its application in retrieval , 1972 .

[20]  Alvaro Barreiro,et al.  A Study of Smoothing Methods for Relevance-Based Language Modelling of Recommender Systems , 2015, ECIR.

[21]  Nicholas J. Belkin,et al.  Information filtering and information retrieval: two sides of the same coin? , 1992, CACM.

[22]  Alvaro Barreiro,et al.  A Study of Priors for Relevance-Based Language Modelling of Recommender Systems , 2015, RecSys.

[23]  Pasquale Lops,et al.  Semantics-aware Content-based Recommender Systems , 2014, CBRecSys@RecSys.

[24]  Javier Parapar,et al.  Additive Smoothing for Relevance-Based Language Modelling of Recommender Systems , 2016, CERI.

[25]  ChengXiang Zhai,et al.  Axiomatic Analysis of Smoothing Methods in Language Models for Pseudo-Relevance Feedback , 2015, ICTIR.

[26]  Ricardo Baeza-Yates,et al.  Modern Information Retrieval - the concepts and technology behind search, Second edition , 2011 .

[27]  Alejandro Bellogín,et al.  Relevance-based language modelling for recommender systems , 2013, Inf. Process. Manag..

[28]  George Karypis,et al.  A Comprehensive Survey of Neighborhood-based Recommendation Methods , 2011, Recommender Systems Handbook.