The Design and Demonstration of the UltraLight Testbed

In this paper we present the motivation, design, and a recent demonstration of the UltraLight testbed at SC|05. The goal of the UltraLight testbed is to help meet the data-intensive computing challenges of the next generation of particle physics experiments with a comprehensive, network-focused approach. UltraLight adopts a new approach to networking: instead of treating the network traditionally, as a static, unchanging, and unmanaged set of inter-computer links, we are developing and using it as a dynamic, configurable, and closely monitored resource that is managed end to end. To achieve this goal, we are constructing a next-generation global system able to meet the data processing, distribution, access, and analysis needs of the particle physics community. We first present early results from the project's various working areas. We then describe our experience with the network architecture, kernel setup, application tuning, and configuration used during the Bandwidth Challenge event at SC|05. During this challenge, we achieved a record-breaking aggregate data rate in excess of 150 Gbps while moving physics datasets between many Grid computing sites.
