A new approach for combining content-based and collaborative filters

With the development of e-commerce and the proliferation of easily accessible information, recommender systems have become a popular technique to prune large information spaces so that users are directed toward those items that best meet their needs and preferences. A variety of techniques have been proposed for performing recommendations, including content-based and collaborative techniques. Content-based filtering selects information based on semantic content, whereas collaborative filtering combines the opinions of other users to make a prediction for a target user. In this paper, we describe a new filtering approach that combines the content-based filter and collaborative filter to capitalize on their respective strengths, and thereby achieves a good performance. We present a series of recommendations on the selection of the appropriate factors and also look into different techniques for calculating user-user similarities based on the integrated information extracted from user profiles and user ratings. Finally, we experimentally evaluate our approach and compare it with classic filters, the result of which demonstrate the effectiveness of our approach.

[1]  William Goffman,et al.  Inefficiency of the use of Boolean functions for information retrieval systems , 1961, Commun. ACM.

[2]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[3]  Stephen E. Robertson,et al.  Relevance weighting of search terms , 1976, J. Am. Soc. Inf. Sci..

[4]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[5]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[6]  Peter G. Anick,et al.  A direct manipulation interface for boolean information retrieval via natural language query , 1989, SIGIR '90.

[7]  Ross Wilkinson,et al.  Using the cosine measure in a neural network for document retrieval , 1991, SIGIR '91.

[8]  Tetsuya Morita,et al.  A fuzzy document retrieval system using the keyword connection matrix and a learning method , 1991 .

[9]  Douglas B. Terry,et al.  Using collaborative filtering to weave an information tapestry , 1992, CACM.

[10]  Douglas B. Terry,et al.  A tour through Tapestry , 1993, COCS '93.

[11]  John Riedl,et al.  GroupLens: an open architecture for collaborative filtering of netnews , 1994, CSCW '94.

[12]  Myoung-Ho Kim,et al.  Ranking Documents in Thesaurus-Based Boolean Retrieval Systems , 1994, Inf. Process. Manag..

[13]  Pattie Maes,et al.  Social information filtering: algorithms for automating “word of mouth” , 1995, CHI '95.

[14]  Gary Marchionini,et al.  A Conceptual Framework for Text Filtering , 1996 .

[15]  Larry Stead,et al.  Group Asynchronous Browsing on the World Wide Web , 1995, World Wide Web J..

[16]  Yoav Shoham,et al.  Fab: content-based, collaborative recommendation , 1997, CACM.

[17]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.

[18]  Paul S. Bradley,et al.  Refining Initial Points for K-Means Clustering , 1998, ICML.

[19]  Ahmad M. Ahmad Wasfi Collecting user access patterns for building user profiles and collaborative filtering , 1998, IUI '99.

[20]  William W. Cohen,et al.  Recommendation as Classification: Using Social and Content-Based Information in Recommendation , 1998, AAAI/IAAI.

[21]  Naohiro Ishii,et al.  Content-based Collaborative Information Filtering: Actively Learning to Classify and Recommend Documents , 1998, CIA.

[22]  John Riedl,et al.  Combining Collaborative Filtering with Personal Agents for Better Recommendations , 1999, AAAI/IAAI.

[23]  Mark Claypool,et al.  Combining Content-Based and Collaborative Filters in an Online Newspaper , 1999, SIGIR 1999.

[24]  John Riedl,et al.  An algorithmic framework for performing collaborative filtering , 1999, SIGIR '99.

[25]  Jonathan L. Herlocker,et al.  Clustering items for collaborative filtering , 1999 .

[26]  Kenneth Y. Goldberg,et al.  Jester 2.0 (poster abstract): evaluation of an new linear time collaborative filtering algorithm , 1999, SIGIR '99.

[27]  Vijay V. Raghavan,et al.  Adaptive Concept-based Retrieval Using a Neural Network∗ , 2000 .

[28]  James C. French,et al.  Flycasting: using collaborative filtering to generate a playlist for online radio , 2001, Proceedings First International Conference on WEB Delivering of Music. WEDELMUSIC 2001.

[29]  David M. Pennock,et al.  Probabilistic Models for Unified Collaborative and Content-Based Recommendation in Sparse-Data Environments , 2001, UAI.

[30]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[31]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..