Paper Information - A Formal Statistical Approach to Collaborative Filtering

A Formal Statistical Approach to Collaborative Filtering

Grouping people into clusters based on the items they have purchased allows accurate recommendations of new items for purchase: if you and I have liked many of the same movies, then I will probably enjoy other movies that you like. Recommending items based on similarity of interest (a.k.a. collaborative filtering) is attractive for many domains (books, CDs, movies, etc.) but does not always work well. Because data are always sparse (any given person has seen only a small fraction of all movies), much more accurate predictions can be made by grouping people into clusters with similar taste in movies and grouping movies into clusters that tend to be liked by the same people. Finding optimal clusters is tricky because the movie groups should be used to help determine the people groups and vice versa. We present a formal statistical model of collaborative filtering and compare different algorithms for estimating the model parameters, including variations of K-means clustering and Gibbs sampling. This formal model is easily extended to handle clustering of objects with multiple attributes.
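
The mutual dependence described in the abstract (movie groups help determine people groups and vice versa) can be illustrated with a small alternating procedure. The sketch below is not the authors' algorithm. It assumes binary like/no-like data stored as a dense matrix (so an unseen movie is treated as "not liked"), one Bernoulli like-probability per (person class, movie class) block, add-one smoothing, and hard K-means-style reassignment. All function and parameter names are hypothetical.

```python
import math


def estimate_block_probs(likes, person_class, movie_class, K, L, alpha=1.0):
    """Smoothed estimate of P(like | person class k, movie class l)."""
    liked = [[0.0] * L for _ in range(K)]
    total = [[0.0] * L for _ in range(K)]
    for i, row in enumerate(likes):
        for j, x in enumerate(row):
            liked[person_class[i]][movie_class[j]] += x
            total[person_class[i]][movie_class[j]] += 1
    # alpha > 0 keeps every probability strictly between 0 and 1
    return [[(liked[k][l] + alpha) / (total[k][l] + 2 * alpha) for l in range(L)]
            for k in range(K)]


def reassign(rows, other_class, probs):
    """Hard-assign each row to the class with the highest Bernoulli log-likelihood.

    probs[c][d] is the like-probability for row class c and other-side class d.
    """
    result = []
    for row in rows:
        scores = []
        for class_probs in probs:
            s = 0.0
            for j, x in enumerate(row):
                p = class_probs[other_class[j]]
                s += math.log(p) if x else math.log(1.0 - p)
            scores.append(s)
        result.append(max(range(len(probs)), key=scores.__getitem__))
    return result


def transpose(m):
    return [list(col) for col in zip(*m)]


def cocluster(likes, person_class, movie_class, K, L, max_iter=20):
    """Alternate between regrouping people and regrouping movies until stable."""
    person_class, movie_class = list(person_class), list(movie_class)
    for _ in range(max_iter):
        # People are regrouped using the current movie groups ...
        probs = estimate_block_probs(likes, person_class, movie_class, K, L)
        new_people = reassign(likes, movie_class, probs)
        # ... and movies are regrouped using the updated people groups.
        probs = estimate_block_probs(likes, new_people, movie_class, K, L)
        new_movies = reassign(transpose(likes), new_people, transpose(probs))
        if new_people == person_class and new_movies == movie_class:
            break
        person_class, movie_class = new_people, new_movies
    probs = estimate_block_probs(likes, person_class, movie_class, K, L)
    return person_class, movie_class, probs


# Toy data: people 0-1 like movies 0-1, people 2-3 like movies 2-3.
likes = [[1, 1, 0, 0],
         [1, 1, 0, 0],
         [0, 0, 1, 1],
         [0, 0, 1, 1]]
# Start from a deliberately imperfect grouping (one person and one movie misplaced).
people, movies, probs = cocluster(likes, [0, 0, 0, 1], [0, 0, 0, 1], K=2, L=2)
print(people, movies)                          # [0, 0, 1, 1] [0, 0, 1, 1]
print(round(probs[people[0]][movies[2]], 3))   # 0.167
```

The last line reads off a prediction: the estimated chance that person 0 likes movie 2 is the block probability for their two classes. Like K-means, a hard-assignment scheme of this kind can settle in a poor local optimum from an unlucky start, which is one reason to compare it against sampling-based estimation such as Gibbs sampling.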

Dean P. Foster | Lyle H. Ungar
