A novel collaborative filtering framework based on Fuzzy C-Means clustering using FP-Tree

Collaborative filtering (CF) has been a comprehensive approach in recommendation system. But data are always sparse; any given user has seen or buys only a small fraction of all items. This becomes the bottleneck of CF. Cluster-based smoothing technique for nature language processing is successful to estimate probability of the unseen term by using the topic (cluster) of the term belongs to, which motivate us to examine the sparsity problem on collaborative filtering. There are many clustering algorithms. Many K-Means-like algorithms which have low time and space complexity have been widely used. But the E-Commerce data are always mixed-type, like numerical, nominal, set etc. The K-Means-like algorithm is not suit to handle these mixed-type. In this paper, we use FP-Tree successfully to handle nominal and set type in mixed-type data. We chose an improved Fuzzy C-Means algorithm which leverages a FP-Tree approach to cluster the items. As a result, we provide higher accuracy as well as increased efficiency in recommendations. The experiments show that improved FCM algorithm clusters items efficiently and raises the efficiency of recommendations.

[1]  Israel Spiegler,et al.  Investigating diversity of clustering methods: An empirical comparison , 2007, Data Knowl. Eng..

[2]  Abhinandan Das,et al.  Google news personalization: scalable online collaborative filtering , 2007, WWW '07.

[3]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD 2000.

[4]  Robert L. Mercer,et al.  Class-Based n-gram Models of Natural Language , 1992, CL.

[5]  J. C. Dunn,et al.  A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters , 1973 .

[6]  Qiang Yang,et al.  Scalable collaborative filtering using cluster-based smoothing , 2005, SIGIR '05.

[7]  Zhengxin Chen,et al.  An iterative initial-points refinement algorithm for categorical data clustering , 2002, Pattern Recognit. Lett..

[8]  Taghi M. Khoshgoftaar,et al.  A Survey of Collaborative Filtering Techniques , 2009, Adv. Artif. Intell..

[9]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[10]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[11]  Sun Ji,et al.  Clustering Algorithms Research , 2008 .

[12]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.

[13]  Thomas Hofmann,et al.  Latent Class Models for Collaborative Filtering , 1999, IJCAI.

[14]  Ji-Gui Sun,et al.  Clustering Algorithms Research , 2008 .

[15]  Dean P. Foster,et al.  Clustering Methods for Collaborative Filtering , 1998, AAAI 1998.

[16]  Arnd Kohrs,et al.  Clustering for collaborative filtering applications , 1999 .