Experiences and Results from a New High Performance Network and Application Monitoring Toolkit

Grid Computing capabilities are increasingly needed for scientific research. Groups such as Globus and the Particle Physics Data Grid are developing tools to meet these needs. An additional challenge is the evaluation and fine-tuning of these applications, as well as support for long term monitoring, performance analysis, and troubleshooting. In September 2001, SLAC started the development of a toolkit for studying the available bandwidth as measured by various network sensing tools and comparing that with the bandwidth achievable by various bulk data transfer applications. This study has provided experience in the challenges of deploying and using the sensor tools and transfer applications, as well as information for fine tuning the applications and analyzing their performance. The results presented in this paper include the deployment challenges, techniques for optimizing the duration of measurements, the impacts of throughput on CPU utilization, optimizing windows and parallel streams, the impact on other users, comparisons of various throughput measurement techniques, patterns of throughput behaviors, forecasting, and comparisons of active and passive measurements. We finish up with possible avenues for future development.

[1]  Sally Floyd,et al.  HighSpeed TCP for Large Congestion Windows , 2003, RFC.

[2]  Andrew Hanushevsky,et al.  Peer-to-Peer Computing for Secure High Performance Data Copying , 2002 .

[3]  Daniel J. Barrett,et al.  SSH, The Secure Shell: The Definitive Guide , 2001 .

[4]  W. Matthews Passive and active monitoring on a high performance research network , 2001 .

[5]  Les Cottrell,et al.  The PingER project: active Internet performance monitoring for the HENP community , 2000, IEEE Commun. Mag..

[6]  Richard Wolski,et al.  Synchronizing network probes to avoid measurement intrusiveness with the Network Weather Service , 2000, Proceedings the Ninth International Symposium on High-Performance Distributed Computing.

[7]  C. A. Logg Passive Performance Monitoring and Traffic Characteristics on the SLAC Internet Border , 2002 .

[8]  Parameswaran Ramanathan,et al.  What do packet dispersion techniques measure? , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[9]  Vern Paxson,et al.  Experiences with NIMI , 2002, Proceedings 2002 Symposium on Applications and the Internet (SAINT) Workshops.

[10]  ArtemTrunov,et al.  Peer—to—Peer Computing for secure High Performance Data Copying , 2001 .

[11]  Manish Jain,et al.  Pathload: A Measurement Tool for End-to-End Available Bandwidth , 2002 .

[12]  Brian Tierney,et al.  TCP Tuning Guide for Distributed Application on Wide Area Networks , 2001, login Usenix Mag..