Item Weighting Techniques for Collaborative Filtering

Collaborative Filtering (CF) recommender systems generate rating predictions for a target user by exploiting the ratings of similar users. Therefore, the computation of user-to-user similarity is an important element in CF; it is used in the neighborhood formation and rating prediction steps. In this paper we investigate the role of item weighting techniques. An item weight provides a measure of the importance of an item for predicting the rating of another item and it is computed as a correlation coefficient between the two items’ rating vectors. In this paper we analyze a wide range of item weighting schemas. Moreover, we introduce an item filtering approach, based on item weighting, that works by discarding in the user-touser similarity computation the items with the smallest weights.We assume that the items with smallest weights are the least useful for generating the prediction. We have evaluated the proposed methods using two datasets (MovieLens and Yahoo!) and identified the conditions for their best application in CF.

[1]  Gediminas Adomavicius,et al.  Incorporating contextual information in recommender systems using a multidimensional approach , 2005, TOIS.

[2]  Michael J. Pazzani,et al.  Learning Collaborative Information Filters , 1998, ICML.

[3]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[4]  John Riedl,et al.  An algorithmic framework for performing collaborative filtering , 1999, SIGIR '99.

[5]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[6]  Carlo Strapparava,et al.  Adaptive Hypermedia and Adaptive Web-Based Systems, 5th International Conference, AH 2008, Hannover, Germany, July 29 - August 1, 2008. Proceedings , 2008, AH.

[7]  Alfred Kobsa,et al.  The Adaptive Web, Methods and Strategies of Web Personalization , 2007, The Adaptive Web.

[8]  David W. Aha,et al.  Feature Weighting for Lazy Learning Algorithms , 1998 .

[9]  Hiroshi Motoda,et al.  Feature Extraction, Construction and Selection: A Data Mining Perspective , 1998 .

[10]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[11]  Cristina Conati,et al.  User Modeling 2007, 11th International Conference, UM 2007, Corfu, Greece, June 25-29, 2007, Proceedings , 2007, User Modeling.

[12]  David W. Aha,et al.  A Review and Empirical Evaluation of Feature Weighting Methods for a Class of Lazy Learning Algorithms , 1997, Artificial Intelligence Review.

[13]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.

[14]  Filip Radlinski,et al.  Query chains: learning to rank from implicit feedback , 2005, KDD '05.

[15]  Luo Si,et al.  An automatic weighting scheme for collaborative filtering , 2004, SIGIR '04.

[16]  Hans-Peter Kriegel,et al.  Feature Weighting and Instance Selection for Collaborative Filtering: An Information-Theoretic Approach* , 2003, Knowledge and Information Systems.

[17]  John Riedl,et al.  Application of Dimensionality Reduction in Recommender Systems , 2000 .

[18]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[19]  Tsvi Kuflik,et al.  Cross-Domain Mediation in Collaborative Filtering , 2007, User Modeling.

[20]  Francesco Ricci,et al.  Dynamic Item Weighting and Selection for Collaborative Filtering , 2007 .

[21]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[22]  Francesco Ricci,et al.  Locally Adaptive Neighborhood Selection for Collaborative Filtering Recommendations , 2008, AH.

[23]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[24]  P. Langley Selection of Relevant Features in Machine Learning , 1994 .

[25]  John Riedl,et al.  Application of Dimensionality Reduction in Recommender System - A Case Study , 2000 .

[26]  Tao Qin,et al.  Feature selection for ranking , 2007, SIGIR.