Non-Euclidean Internet Coordinates Embedding

For many applications it is desirable to be able to estimate latency in a decentralised network when it is not practical to explicitly measure it. It has previously been shown that latency can be approximated by assigning hosts coordinates in some geometric space such that the Euclidean distance between two hosts in this space is equivalent to latency, a method known as a Network Coordinate (NC) system. This is commonly achieved by a large scale distributed optimisation which seeks to minimise the error between latency and Euclidean distance. In this work we challenge the assumption of Euclidean space as a satisfactory model for embedding Internet-like networks, due to the curved nature of network distances. We present a novel distributed optimisation methodology: Non-Euclidean Internet Coordinates Embedding (NICE). NICE uses a polynomial regression model to explicitly learn the most effective distance function for latency estimation within a geometric space, in addition to a distributed non linear dimensionality reduction method. Dimensionality reduction is achieved via a variant of Landmark Multi Dimensional Scaling (LMDS) and a distributed optimisation algorithm. This allows the distributed system to create a set of coordinates for each of the participating hosts that can be used to accurately estimate latency. The system is implemented within the Java based PeerSim network simulator using both real and artificially generated input topologies and then compared to two of the most widely implemented NC systems: GNP and Vivaldi. By experimental simulation we show that NICE is significantly more accurate than either method while still remaining robust in the face of real network conditions.

[1]  Márk Jelasity,et al.  PeerSim: A scalable P2P simulator , 2009, 2009 IEEE Ninth International Conference on Peer-to-Peer Computing.

[2]  Yuval Shavitt,et al.  Hyperbolic embedding of internet graph for distance estimation and overlay construction , 2008, TNET.

[3]  M. J. Box,et al.  Non-linear optimization techniques; , 1969 .

[4]  David E. Culler,et al.  PlanetLab: an overlay testbed for broad-coverage services , 2003, CCRV.

[5]  David R. Karger,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM '01.

[6]  BERNARD M. WAXMAN,et al.  Routing of multipoint connections , 1988, IEEE J. Sel. Areas Commun..

[7]  Mark Crovella,et al.  Virtual landmarks for the internet , 2003, IMC '03.

[8]  Jin Li,et al.  On peer-to-peer (P2P) content delivery , 2008, Peer-to-Peer Netw. Appl..

[9]  Krishna P. Gummadi,et al.  King: estimating latency between arbitrary internet end hosts , 2002, IMW '02.

[10]  Moritz Steiner,et al.  Where Is My Peer? Evaluation of the Vivaldi Network Coordinate System in Azureus , 2009, Networking.

[11]  George Pavlou,et al.  Near-optimal service facility location in dynamic communication networks , 2005, IEEE Communications Letters.

[12]  John Kubiatowicz,et al.  Handling churn in a DHT , 2004 .

[13]  Xiao Wang,et al.  Phoenix: A Weight-Based Network Coordinate System Using Matrix Factorization , 2011, IEEE Transactions on Network and Service Management.

[14]  George Pavlou,et al.  An adaptive clustering approach for the management of dynamic systems , 2005, IEEE Journal on Selected Areas in Communications.

[15]  Hui Zhang,et al.  Predicting Internet network distance with coordinates-based approaches , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.

[16]  Hyuk Lim,et al.  Constructing Internet coordinate system based on delay measurement , 2003, IEEE/ACM Transactions on Networking.

[17]  Dan R. K. Ports,et al.  Census: Location-Aware Membership Management for Large-Scale Distributed Systems , 2009, USENIX Annual Technical Conference.

[18]  John Platt,et al.  FastMap, MetricMap, and Landmark MDS are all Nystrom Algorithms , 2005, AISTATS.

[19]  Ibrahim Matta,et al.  BRITE: an approach to universal topology generation , 2001, MASCOTS 2001, Proceedings Ninth International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems.

[20]  Giuseppe Di Fatta,et al.  Virtual landmarking for locality aware peer IDs , 2011 .

[21]  Jon Crowcroft,et al.  On the accuracy of embeddings for internet coordinate systems , 2005, IMC '05.

[22]  Playing Vivaldi in Hyperbolic Space , 2006 .

[23]  Miguel Castro,et al.  PIC: practical Internet coordinates for distance estimation , 2004, 24th International Conference on Distributed Computing Systems, 2004. Proceedings..

[24]  Mark Handley,et al.  Topologically-aware overlay construction and server selection , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.

[25]  Fatos Xhafa,et al.  Utilizing Next Generation Emerging Technologies for Enabling Collective Computational Intelligence in Disaster Management , 2011, Next Generation Data Technologies for Collective Computational Intelligence.

[26]  Robert Tappan Morris,et al.  Vivaldi: a decentralized network coordinate system , 2004, SIGCOMM '04.

[27]  Hui Zhang,et al.  A Network Positioning System for the Internet , 2004, USENIX Annual Technical Conference, General Track.

[28]  Margo I. Seltzer,et al.  Network-Aware Operator Placement for Stream-Processing Systems , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[29]  Jennifer Rexford,et al.  BGP routing policies in ISP networks , 2005, IEEE Network.

[30]  Joshua B. Tenenbaum,et al.  Sparse multidimensional scaling using land-mark points , 2004 .

[31]  Pierre Geurts,et al.  Network Distance Prediction Based on Decentralized Matrix Factorization , 2010, Networking.

[32]  Michal Szymaniak,et al.  Practical large-scale latency estimation , 2008, Comput. Networks.

[33]  Jonathan M. Smith,et al.  IDES: An Internet Distance Estimation Service for Large Networks , 2006, IEEE Journal on Selected Areas in Communications.

[34]  V. Mcgee Multidimensional Scaling Of N Sets Of Similarity Measures: A Nonmetric Individual Differences Approach. , 1968, Multivariate behavioral research.

[35]  Jon Crowcroft,et al.  Lighthouses for Scalable Distributed Location , 2003, IPTPS.

[36]  Bobby Bhattacharjee,et al.  Symbiotic Relationships in Internet Routing Overlays , 2009, NSDI.