Complex-network theoretic clustering for identifying groups of similar listeners in p2p systems

This article presents an approach to automatically create virtual communities of users with similar music preferences in a distributed system. Our goal is to create personalized music channels for these communities using the content shared by its members in peer-to-peer networks for each community. To extract these communities a complex network theoretic approach is chosen. A fully connected graph of users is created using epidemic protocols. We show that the created graph sufficiently converges to a graph created with a centralized algorithm after a small number of protocol iterations. To find suitable techniques for creating user communities, we analyze graphs created from real-world recommender datasets and identify specific properties of these datasets. Based on these properties, different graph-based community-extraction techniques are chosen and evaluated. We select a technique that exploits identified properties to create clusters of music listeners. The suitability of this technique is validated using a music dataset and two large movie datasets. On a graph of 6,040 peers, the selected technique assigns at least 85% of the peers to optimal communities, and obtains a mean classification error of less than 0.05% over the remaining peers that are not assigned to the best community.

[1]  John Riedl,et al.  GroupLens: an open architecture for collaborative filtering of netnews , 1994, CSCW '94.

[2]  A. Rbnyi ON THE EVOLUTION OF RANDOM GRAPHS , 2001 .

[3]  Duncan J. Watts,et al.  Six Degrees: The Science of a Connected Age , 2003 .

[4]  Johan A. Pouwelse,et al.  Personalization on a peer-to-peer television system , 2006, Multimedia tools and applications.

[5]  Fabio Vignoli,et al.  Virtual Communities for Creating Shared Music Channels , 2007, ISMIR.

[6]  P. Erdos,et al.  On the evolution of random graphs , 1984 .

[7]  C. Lee Giles,et al.  Self-Organization and Identification of Web Communities , 2002, Computer.

[8]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[9]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[10]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[11]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[12]  Márk Jelasity,et al.  Large-Scale Newscast Computing on the Internet , 2002 .

[13]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.

[14]  B. Bollobás The evolution of random graphs , 1984 .

[15]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[16]  L. D. Costa Hub-Based Community Finding , 2004, cond-mat/0405022.