Mining Typical Preferences of Collaborative User Groups

Collaborative filtering systems have the problems of sparsity– providing a recommendation by correlation between only two customers’ preferences, and being unable to recommend a unique item owing to the recommendation based on preference rather than on the content of the item. The native feature space consists of unique words with single dimension when it occurs in documents as items, which can be tens or hundreds of thousands of words for even a moderate-sized text collection. This is prohibitively high for many learning algorithms. Since the feature extraction method using association word mining does not use the profile, it needs not update the profile, and it automatically generates noun phrases by using confidence and support of the Apriori algorithm without calculating the probability for index. However, in case that the feature extraction method is based on a set of association words, it makes an error of judging different documents identically. This paper proposes an association word mining method with weighted word, which reflects not only the preference rating of items but also information on the items. The proposed method is capable of creating the profile of the collaborative users, in which users are grouped according to the vector space model and Kmeans algorithm. Thus, the new method eliminates the existing collaborative filtering system’s problems of sparsity and of recommendations based on the degree of correlation of user preferences. Entropy is used in order to address the said system’s shortcoming whereby items are recommended according to the degree of correlation of the two most similar users within a group. Thus, the typical preference of the group is extracted. Since user preferences cannot be automatically regarded as accurate data, users within the group who have entropies beyond the threshold are selected as typical users. After this selection, the typical preference can be extracted by assigning typical user preferences in the form of weights. By using the typical preference of the group, the method also reduces the time required for retrieving the most similar users within the group.

[1]  Jung-Hyun Lee,et al.  Feature Selection Using Association Word Mining for Classification , 2001, DEXA.

[2]  G. McLachlan,et al.  The EM algorithm and extensions , 1996 .

[3]  N. Ishii,et al.  Formal Models for Learning of User Preferences , a Preliminary Report , 1999 .

[4]  Wee Sun Lee Collaborative Learning and Recommender Systems , 2001, ICML.

[5]  John Riedl,et al.  Analysis of recommendation algorithms for e-commerce , 2000, EC '00.

[6]  Michael J. Pazzani,et al.  Learning Collaborative Information Filters , 1998, ICML.

[7]  William W. Cohen,et al.  Recommendation as Classification: Using Social and Content-Based Information in Recommendation , 1998, AAAI/IAAI.

[8]  Michael J. Pazzani,et al.  Learning and Revising User Profiles: The Identification of Interesting Web Sites , 1997, Machine Learning.

[9]  Dean P. Foster,et al.  Clustering Methods for Collaborative Filtering , 1998, AAAI 1998.

[10]  Ian Soboroff. Charles Nicholas Combining Content and Collaboration in Text Filtering , 1999 .

[11]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[12]  Sanjay Ranka,et al.  An effic ient k-means clustering algorithm , 1997 .

[13]  Wee Sun Lee Collaborative Learning for Recommender Systems , 2001 .

[14]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.

[15]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[16]  Bernard Mérialdo,et al.  Using category-based collaborative filtering in the Active WebMuseum , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[17]  Sung-Bong Yang,et al.  An Improved Recommendation Algorithm in Collaborative Filtering , 2002, EC-Web.

[18]  John Riedl,et al.  Application of Dimensionality Reduction in Recommender System - A Case Study , 2000 .

[19]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[20]  Young-Seok Lee,et al.  Cluster Feature Selection using Entropy Weighting and SVD , 2002 .