Language Models for Collaborative Filtering Neighbourhoods

Language Models are state-of-the-art methods in Information Retrieval. Their sound statistical foundation and high effectiveness in several retrieval tasks are key to their current success. In this paper, we explore how to apply these models to deal with the task of computing user or item neighbourhoods in a collaborative filtering scenario. Our experiments showed that this approach is superior to other neighbourhood strategies and also very efficient. Our proposal, in conjunction with a simple neighbourhood-based recommender, showed a great performance compared to state-of-the-art methods (NNCosNgbr and PureSVD) while its computational complexity is low.

[1]  George Karypis,et al.  Item-based top-N recommendation algorithms , 2004, TOIS.

[2]  George Karypis,et al.  A Comprehensive Survey of Neighborhood-based Recommendation Methods , 2011, Recommender Systems Handbook.

[3]  Daniel Valcarce,et al.  Exploring Statistical Language Models for Recommender Systems , 2015, RecSys.

[4]  Yehuda Koren,et al.  Factorization meets the neighborhood: a multifaceted collaborative filtering model , 2008, KDD.

[5]  Bracha Shapira,et al.  Recommender Systems Handbook , 2015, Springer US.

[6]  Alvaro Barreiro,et al.  A Study of Smoothing Methods for Relevance-Based Language Modelling of Recommender Systems , 2015, ECIR.

[7]  Jun Wang,et al.  A User-Item Relevance Model for Log-Based Collaborative Filtering , 2006, ECIR.

[8]  Alejandro Bellogín,et al.  Precision-oriented evaluation of recommender systems: an algorithmic comparison , 2011, RecSys '11.

[9]  Roberto Turrin,et al.  Performance of recommender algorithms on top-n recommendation tasks , 2010, RecSys '10.

[10]  Yuanzhi Li,et al.  A Theoretical Analysis of NDCG Ranking Measures , 2013 .

[11]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[12]  Alejandro Bellogín,et al.  Probabilistic collaborative filtering with negative cross entropy , 2013, RecSys.

[13]  Jun Wang,et al.  Noname manuscript No. (will be inserted by the editor) Bridging Memory-Based Collaborative Filtering and Text Retrieval , 2022 .

[14]  Alejandro Bellogín,et al.  Relevance-based language modelling for recommender systems , 2013, Inf. Process. Manag..

[15]  Leif Azzopardi,et al.  Assessing multivariate Bernoulli models for information retrieval , 2008, TOIS.

[16]  ChengXiang Zhai,et al.  Statistical Language Models for Information Retrieval , 2008, NAACL.

[17]  Tie-Yan Liu,et al.  A Theoretical Analysis of NDCG Type Ranking Measures , 2013, COLT.

[18]  Yi-Cheng Zhang,et al.  Solving the apparent diversity-accuracy dilemma of recommender systems , 2008, Proceedings of the National Academy of Sciences.

[19]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[20]  Alvaro Barreiro,et al.  A Study of Priors for Relevance-Based Language Modelling of Recommender Systems , 2015, RecSys.

[21]  Yehuda Koren,et al.  Advances in Collaborative Filtering , 2011, Recommender Systems Handbook.

[22]  CHENGXIANG ZHAI,et al.  A study of smoothing methods for language models applied to information retrieval , 2004, TOIS.

[23]  Eric Gaussier Statistical Language Models for Information Retrieval ChengXiang Zhai University of Illinois at Urbana Champaign Morgan & Claypool (Synthesis Lectures on Human Language Technologies, edited by Graeme Hirst), volume 1, 2008; xiii+125 pp, Princeton, NJ; paperbound, ISBN 978-1-59829-590-0, $40.00; eboo , 2010, Computational Linguistics.

[24]  W. Bruce Croft,et al.  A language modeling approach to information retrieval , 1998, SIGIR '98.

[25]  Leif Azzopardi,et al.  An analysis on document length retrieval trends in language modeling smoothing , 2008, Information Retrieval.

[26]  Kartik Hosanagar,et al.  Blockbuster Culture's Next Rise or Fall: The Impact of Recommender Systems on Sales Diversity , 2007, Manag. Sci..