The design space of probing algorithms for network-performance measurement

We present a framework for the design and analysis of probing methods to monitor network performance, an important technique for collecting measurements in tasks such as fault detection. We use this framework to study the interaction among numerous, possibly conflicting, optimization goals in the design of a probing algorithm. We present a rigorous definition of a probing-algorithm design problem that can apply broadly to network-measurement scenarios. We also present several metrics relevant to the analysis of probing algorithms, including probing frequency and network coverage, communication and computational overhead, and the amount of algorithm state required. We show inherent tradeoffs among optimization goals and give hardness results for achieving some combinations of optimization goals. We also consider the possibility of developing approximation algorithms for achieving some of the goals and describe a randomized approach as an alternative, evaluating it using our framework. Our work aids future development of low-overhead probing techniques and introduces principles from IP-based networking to theoretically grounded approaches for concurrent path-selection problems.

[1]  Thomas Erlebach,et al.  Approximation Algorithms for Edge-Disjoint Paths and Unsplittable Flow , 2006, Efficient Approximation and Online Algorithms.

[2]  Rajeev Motwani,et al.  Randomized algorithms , 1996, CSUR.

[3]  Nick G. Duffield,et al.  Network Tomography of Binary Network Performance Characteristics , 2006, IEEE Transactions on Information Theory.

[4]  Rajeev Rastogi,et al.  Robust Monitoring of Link Delays and Faults in IP Networks , 2003, IEEE/ACM Transactions on Networking.

[5]  Ojas Parekh,et al.  Path Hitting in Acyclic Graphs , 2007, Algorithmica.

[6]  Paul Barford,et al.  Network Performance Anomaly Detection and Localization , 2009, IEEE INFOCOM 2009.

[7]  R. Rastogi,et al.  Robust Monitoring of Link Delays and Faults , 2006 .

[8]  Renata Teixeira,et al.  NetDiagnoser: troubleshooting network unreachabilities using end-to-end probes and routing data , 2007, CoNEXT '07.

[9]  Ian Holyer,et al.  The NP-Completeness of Edge-Coloring , 1981, SIAM J. Comput..

[10]  Robert Nowak,et al.  Internet tomography , 2002, IEEE Signal Process. Mag..

[11]  Azer Bestavros,et al.  On the marginal utility of network topology measurements , 2001, IMW '01.

[12]  Aric Hagberg,et al.  Exploring Network Structure, Dynamics, and Function using NetworkX , 2008, Proceedings of the Python in Science Conference.

[13]  Gordon T. Wilfong,et al.  The stable paths problem and interdomain routing , 2002, TNET.

[14]  Thomas Sauerwald,et al.  The Weighted Coupon Collector's Problem and Applications , 2009, COCOON.

[15]  Yin Zhang,et al.  NetQuest: A Flexible Framework for Large-Scale Network Measurement , 2009, IEEE/ACM Transactions on Networking.

[16]  Myungjin Lee,et al.  Enabling Flow-level Latency Measurements across Routers in Data Centers , 2011, Hot-ICE.

[17]  Subhash Khot Hardness results for approximate hypergraph coloring , 2002, STOC '02.

[18]  Ulrik Brandes,et al.  Efficient generation of large random networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[19]  Renata Teixeira,et al.  Minimizing Probing Cost for Detecting Interface Failures: Algorithms and Scalability Analysis , 2009, IEEE INFOCOM 2009.

[20]  Peter Slavík A Tight Analysis of the Greedy Algorithm for Set Cover , 1997, J. Algorithms.

[21]  Paul Barford,et al.  Multiobjective Monitoring for SLA Compliance , 2010, IEEE/ACM Transactions on Networking.

[22]  Yu Gu,et al.  Disjoint-Path Facility Location: Theory and Practice , 2011, ALENEX.

[23]  M. Halldórsson,et al.  Strong Colorings of Hypergraphs , 2004, WAOA.

[24]  Rajeev Motwani,et al.  Randomized Algorithms , 1995, SIGA.

[25]  Matthew Roughan,et al.  The Internet Topology Zoo , 2011, IEEE Journal on Selected Areas in Communications.