Szemerédi-type clustering of peer-to-peer streaming system

In this work we made a preliminary clustering analysis of an experimental peer-to-peer system, tested in a small scale experiments within PlanetLab. The application was an Internet-TV like streaming system based on Chord architecture. Our clustering is inspired by the Szemeredi's Regularity Lemma (SzRL). Such approach was already demonstrated in biology and appeared to be a powerful tool. Szemeredi's result suggests that the nodes of a large enough graph can be partitioned in few clusters in such a way that link distribution between most of the pairs look like random. Our main goal is to study what can this type of clustering tell us about p2p systems using our experimental system as source of data. We searched clusterings of Szemeredi-type by using max likelihood as guidance. Our graph is directed and weighted. The link direction indicates a client-server relation and the value is the proportion of all chunks obtained from such a link during the whole experiment. We think that the preliminary results are interesting. Most of the cluster pairs have very distinguished patterns of link distribution, indicating that such a novel approach has potential in classifying peers effectively. The values of weights between clusters and their distribution show some apparent patterns. We end up with 9 cluster pairs. Contributions: practical implementations of streaming system by V. P. and analysis by H. R.

[1]  E. Szemerédi Regular Partitions of Graphs , 1975 .

[2]  Vojtech Rödl,et al.  The algorithmic aspects of the regularity lemma , 1992, Proceedings., 33rd Annual Symposium on Foundations of Computer Science.

[3]  M. Simonovits,et al.  Szemeredi''s Regularity Lemma and its applications in graph theory , 1995 .

[4]  Geoffrey E. Hinton,et al.  A View of the Em Algorithm that Justifies Incremental, Sparse, and other Variants , 1998, Learning in Graphical Models.

[5]  Matthieu Latapy,et al.  Computing Communities in Large Networks Using Random Walks , 2004, J. Graph Algorithms Appl..

[6]  Matthieu Latapy,et al.  Computing Communities in Large Networks Using Random Walks , 2004, J. Graph Algorithms Appl..

[7]  Eddie Kohler,et al.  Clustering and sharing incentives in BitTorrent systems , 2006, SIGMETRICS '07.

[8]  Marcello Pelillo,et al.  Szemerédi's Regularity Lemma and Its Applications to Pairwise Clustering and Segmentation , 2007, EMMCVPR.

[9]  A. Binzenhofer,et al.  Relying on randomness PlanetLab experiments with distributed file-sharing protocols , 2007, 2007 Next Generation Internet Networks.

[10]  Laurent Massoulié,et al.  Epidemic live streaming: optimal performance trade-offs , 2008, SIGMETRICS '08.

[11]  Gábor E. Tusnády,et al.  Reconstructing Cortical Networks: Case of Directed Graphs with High Level of Reciprocity , 2008 .

[12]  Asaf Shapira,et al.  Approximate Hypergraph Partitioning and Applications , 2010, SIAM J. Comput..

[13]  Ilkka Norros,et al.  Peer-to-peer streaming experiments based on Chord overlay , 2011 .

[14]  Alex D. Scott,et al.  Szemerédi's Regularity Lemma for Matrices and Sparse Graphs , 2010, Combinatorics, Probability and Computing.