Component-wise robust linear fuzzy clustering for collaborative filtering

Abstract Automated collaborative filtering is a popular technique for reducing information overload and the task is to predict missing values in a data matrix. Extraction of local linear models is a useful technique for predicting the missing values. Linear models featuring local structures of the high-dimensional incomplete data set are estimated by a modified linear fuzzy clustering algorithm. Fuzzy c -varieties (FCV) is a linear fuzzy clustering algorithm that estimates local principal component vectors as the vectors spanning prototypes of clusters. Least squares techniques, however, often fail to account for “outliers”, which are common in real applications. In this paper, a technique for making the FCV algorithm robust to intra-sample outliers is proposed. The objective function based on the lower rank approximation of the data matrix is minimized by a robust M-estimation algorithm that is similar to FCM-type iterative procedures. In numerical experiments, the diagnostic power of the filtering system is shown to be improved by predicting missing values using robust local linear models.

[1]  John Riedl,et al.  GroupLens: an open architecture for collaborative filtering of netnews , 1994, CSCW '94.

[2]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[3]  Junzo Watada,et al.  Fuzzy Principal Component Analysis and Its Application , 1997 .

[4]  Pattie Maes,et al.  Social information filtering: algorithms for automating “word of mouth” , 1995, CHI '95.

[5]  Patrick Brézillon,et al.  Lecture Notes in Artificial Intelligence , 1999 .

[6]  Bradley N. Miller,et al.  GroupLens: applying collaborative filtering to Usenet news , 1997, CACM.

[7]  J A Swets,et al.  Measuring the accuracy of diagnostic systems. , 1988, Science.

[8]  James M. Keller,et al.  A possibilistic approach to clustering , 1993, IEEE Trans. Fuzzy Syst..

[9]  Stuart Geman,et al.  Statistical methods for tomographic image reconstruction , 1987 .

[10]  Michael J. Black,et al.  Robust Principal Component Analysis for Computer Vision , 2001, ICCV.

[11]  Hidetomo Ichihashi,et al.  Linear fuzzy clustering techniques with missing values and their application to local principal component analysis , 2004, IEEE Transactions on Fuzzy Systems.

[12]  Hidetomo Ichihashi,et al.  Collaborative Filtering Using Principal Component Analysis and Fuzzy Clustering , 2001, Web Intelligence.

[13]  Rajesh N. Davé,et al.  Robust clustering methods: a unified view , 1997, IEEE Trans. Fuzzy Syst..

[14]  Sadaaki Miyamoto,et al.  Fuzzy c-means as a regularization and maximum entropy approach , 1997 .

[15]  Rajesh N. Davé,et al.  Characterization and detection of noise in clustering , 1991, Pattern Recognit. Lett..

[16]  Michael J. Black,et al.  A Framework for Robust Subspace Learning , 2003, International Journal of Computer Vision.

[17]  P. Holland,et al.  Robust regression using iteratively reweighted least-squares , 1977 .

[18]  J. Bezdek,et al.  DETECTION AND CHARACTERIZATION OF CLUSTER SUBSTRUCTURE I. LINEAR STRUCTURE: FUZZY c-LINES* , 1981 .

[19]  H. Ichihashi,et al.  Simultaneous approach to principal component analysis and fuzzy clustering with missing values , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[20]  John Riedl,et al.  An algorithmic framework for performing collaborative filtering , 1999, SIGIR '99.