Selecting Measurement Paths for Efficient Network Monitoring and Diagnosis under Operational Constraints

End-to-end network diagnostics tools for identifying degraded links along an Internet path have been studied extensively, but few are found on the design and development of deployable solutions based on these approaches for actual large-scale networks. This paper presents a scalable measurement system that utilizes many users' off-the-shelf PCs as measurement end-point nodes (called gbeaconsh) to actively monitor network performance along many end-to-end paths and to conduct network diagnosis to infer degraded links when end-to-end performance degradation is detected. The network paths of end-to-end performance measurement and network diagnosis among beacons are dynamically and adaptively selected taking operational constraints such as the network resources along the path and the machine resource of the beacons into consideration. The system is evaluated through large-scale simulations and small-scale real-world experiments using a prototype implementation, and its considerable potential for practical network operations is demonstrated.

[1]  Rajeev Rastogi,et al.  Robust Monitoring of Link Delays and Faults in IP Networks , 2003, IEEE/ACM Transactions on Networking.

[2]  Yin Zhang,et al.  NetQuest: A Flexible Framework for Large-Scale Network Measurement , 2009, IEEE/ACM Transactions on Networking.

[3]  Randy H. Katz,et al.  An algebraic approach to practical and scalable overlay network monitoring , 2004, SIGCOMM '04.

[4]  Yao Zhao,et al.  Towards Efficient Large-Scale VPN Monitoring and Diagnosis under Operational Constraints , 2009, IEEE INFOCOM 2009.

[5]  Christian Huitema,et al.  STUN - Simple Traversal of User Datagram Protocol (UDP) Through Network Address Translators (NATs) , 2003, RFC.

[6]  Jia Wang,et al.  Locating internet bottlenecks: algorithms, measurements, and implications , 2004, SIGCOMM '04.

[7]  Matthew Roughan,et al.  IP forwarding anomalies and improving their detection using multiple data sources , 2004, NetT '04.

[8]  Kostas G. Anagnostakis,et al.  cing: measuring network-internal delays using only existing infrastructure , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[9]  Anees Shaikh,et al.  An empirical evaluation of wide-area internet bottlenecks , 2003, SIGMETRICS '03.

[10]  Katerina J. Argyraki,et al.  Netscope: Practical Network Loss Tomography , 2010, 2010 Proceedings IEEE INFOCOM.

[11]  Mark Crovella,et al.  Efficient monitoring of end-to-end network properties , 2004, Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies..