A cluster based approach for network distance embedding

Several coordinate bases embedding schemes have been proposed for scalable estimation of network distance (round trip time) among Internet hosts. These schemes may be broadly categorized into Landmark and distributed peer-to-peer based. While Landmark based approaches suffer from scalability due to the large amount of measurement loads, distributed schemes suffer from stability and accuracy issues in the presence of node churns. In this paper, we propose CSHE, a cluster based statistical approach for the network distance embedding that combines the stability of Landmark scheme and the scaling property of distributed approach. CSHE groups the nodes into a set of clusters where a new node embeds itself into the co-ordinate space by computing its distance against a set of nodes that are randomly chosen from each cluster. Using real measurement traces, we evaluate the accuracy and robustness of CSHE. We find that the accuracy of CSHE is comparable to the best known accurate embedding (GNP based embedding) and does not suffer with node churns.

[1]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[2]  Jon Crowcroft,et al.  On the accuracy of embeddings for internet coordinate systems , 2005, IMC '05.

[3]  Emin Gün Sirer,et al.  Meridian: a lightweight network location service without virtual coordinates , 2005, SIGCOMM '05.

[4]  D. Saha,et al.  On Suitability of Euclidean Embedding for Host-Based Network Coordinate Systems , 2010, IEEE/ACM Transactions on Networking.

[5]  Margo I. Seltzer,et al.  Network Coordinates in the Wild , 2007, NSDI.

[6]  Eng Keong Lua,et al.  Internet Routing Policies and Round-Trip-Times , 2005, PAM.

[7]  Edith Cohen,et al.  Replication strategies in unstructured peer-to-peer networks , 2002, SIGCOMM.

[8]  Ravi Jain,et al.  An Experimental Study of the Skype Peer-to-Peer VoIP System , 2005, IPTPS.

[9]  Zhi-Li Zhang,et al.  On suitability of Euclidean embedding of internet hosts , 2006, SIGMETRICS '06/Performance '06.

[10]  Miguel Castro,et al.  PIC: practical Internet coordinates for distance estimation , 2004, 24th International Conference on Distributed Computing Systems, 2004. Proceedings..

[11]  Mark Crovella,et al.  Virtual landmarks for the internet , 2003, IMC '03.

[12]  Hui Zhang,et al.  Predicting Internet network distance with coordinates-based approaches , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.

[13]  Hyuk Lim,et al.  Constructing Internet coordinate system based on delay measurement , 2003, IEEE/ACM Transactions on Networking.

[14]  Robert Tappan Morris,et al.  Vivaldi: a decentralized network coordinate system , 2004, SIGCOMM '04.

[15]  Yuval Shavitt,et al.  Big-Bang simulation for embedding network distances in Euclidean space , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[16]  Krishna P. Gummadi,et al.  King: estimating latency between arbitrary internet end hosts , 2002, IMW '02.

[17]  Ang Li,et al.  Myth: An Accurate and Scalable Network Coordinate System under High Node Churn Rate , 2007, 2007 15th IEEE International Conference on Networks.