The MAGNeT Toolkit: Design, Implementation and Evaluation

The current trend in constructing high-performance computing systems is to connect a large number of machines via a fast interconnect or a large-scale network such as the Internet. This approach relies on the performance of the interconnect (or Internet) to enable fast, large-scale distributed computing. A detailed understanding of the communication traffic is required in order to optimize the operation of the entire system.Network researchers traditionally monitor traffic in the network to gain the insight necessary to optimize network operations. Recent work suggests additional insight can be obtained by also monitoring traffic at the application level.The Monitor for Application-Generated Network Traffic toolkit (MAGNeT) we describe here monitors application traffic patterns in production systems, thus enabling more highly optimized networks and interconnects for the next generation of high-performance computing systems.

[1]  Wu-chun Feng,et al.  MAGNeT: monitor for application-generated network traffic , 2001, Proceedings Tenth International Conference on Computer Communications and Networks (Cat. No.01EX495).

[2]  William E. Johnston,et al.  The NetLogger methodology for high performance distributed systems performance analysis , 1998, Proceedings. The Seventh International Symposium on High Performance Distributed Computing (Cat. No.98TB100244).

[3]  Jeffrey Semke PSC TCP Kernel Monitor , 2000 .

[4]  Wu-chun Feng,et al.  Capturing Network Traffic with a MAGNeT , 2001, Annual Linux Showcase & Conference.

[5]  Wu-chun Feng,et al.  A case for TCP Vegas in high-performance computational grids , 2001, Proceedings 10th IEEE International Symposium on High Performance Distributed Computing.

[6]  Thomas R. Gross,et al.  Bandwidth monitoring for network-aware applications , 2001, Proceedings 10th IEEE International Symposium on High Performance Distributed Computing.

[7]  Wu-chun Feng,et al.  Dynamic Right-Sizing: TCP Flow-Control Adaptation , 2001 .

[8]  Wu-chun Feng,et al.  On the burstiness of the TCP congestion-control mechanism in a distributed computing system , 2000, Proceedings 20th IEEE International Conference on Distributed Computing Systems.

[9]  Wu-chun Feng,et al.  The adverse impact of the TCP congestion-control mechanism in heterogeneous computing systems , 2000, Proceedings 2000 International Conference on Parallel Processing.

[10]  Wu-chun Feng,et al.  The Failure of TCP in High-Performance Computational Grids , 2000, ACM/IEEE SC 2000 Conference (SC'00).