Tarantula: Towards an Accurate Network Coordinate System by Handling Major Portion of TIVs

Network Coordinate (NC) systems provide an efficient and scalable mechanism to estimate latencies among hosts. However, many popular algorithms like Vivaldi suffer greatly from the existence of Triangle Inequality Violations (TIVs). Two-layer systems like Pharos and hierarchical Vivaldi have been proposed to remedy the impact of TIVs. They divide the whole space into several location-based clusters and run NC systems on both global layer and local layer. However, the two-layer model is only able to optimize the intra-cluster links relating to a limited portion of TIV triangles. In this paper, we propose a new NC system, Tarantula, which divides the space in a novel way. By categorizing the TIVs into three classes, we show that Tarantula handles a much larger portion of existing TIVs than two-layer systems. Moreover, we present two techniques to further strengthen the Tarantula system: 1) relate the updating step size in the Vivaldi algorithm used in Tarantula to ground-truth latency so as to improve the prediction for short links; 2) propose Dynamic Cluster Optimization to dynamically adjust clustering of hosts. Our experimental results show that Tarantula outperforms Pharos and Vivaldi significantly in terms of estimation accuracy. When implementing different NC systems in the application of server selection and detour finding, Tarantula again performs the best.

[1]  Sonia Fahmy,et al.  Impact of the Inaccuracy of Distance Prediction Algorithms on Internet Applications - an Analytical and Comparative Study , 2006, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications.

[2]  Dan Li,et al.  Taming the triangle inequality violations with network coordinate system on real internet , 2010, ReARCH '10.

[3]  Yuval Shavitt,et al.  Big-bang simulation for embedding network distances in Euclidean space , 2004, IEEE/ACM Transactions on Networking.

[4]  Bobby Bhattacharjee,et al.  Symbiotic Relationships in Internet Routing Overlays , 2009, NSDI.

[5]  Patrick Wendell,et al.  DONAR: decentralized server selection for cloud services , 2010, SIGCOMM '10.

[6]  Zhi-Li Zhang,et al.  On suitability of Euclidean embedding of internet hosts , 2006, SIGMETRICS '06/Performance '06.

[7]  Xiao Wang,et al.  Phoenix: A Weight-Based Network Coordinate System Using Matrix Factorization , 2011, IEEE Transactions on Network and Service Management.

[8]  Hui Zhang,et al.  Predicting Internet network distance with coordinates-based approaches , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.

[9]  Yang Chen,et al.  Pharos: accurate and decentralised network coordinate system , 2009, IET Commun..

[10]  Jon Crowcroft,et al.  On the accuracy of embeddings for internet coordinate systems , 2005, IMC '05.

[11]  Margo I. Seltzer,et al.  Network Coordinates in the Wild , 2007, NSDI.

[12]  Jacob R. Lorch,et al.  Matchmaking for online games and other latency-sensitive P2P systems , 2009, SIGCOMM '09.

[13]  Miguel Castro,et al.  PIC: practical Internet coordinates for distance estimation , 2004, 24th International Conference on Distributed Computing Systems, 2004. Proceedings..

[14]  Olivier Bonaventure,et al.  Scalable Route Selection for IPv6 Multihomed Sites , 2005, NETWORKING.

[15]  Mohamed Ali Kâafar,et al.  Towards a Two-Tier Internet Coordinate System to Mitigate the Impact of Triangle Inequality Violations , 2008, Networking.

[16]  Robert Tappan Morris,et al.  Vivaldi: a decentralized network coordinate system , 2004, SIGCOMM '04.

[17]  Benoit Donnet,et al.  A Survey on Network Coordinates Systems, Design, and Security , 2010, IEEE Communications Surveys & Tutorials.