Large-scale clusters built out of commercial components face similar scalability obstacles as the massively parallel processors (MPP) of the 1980s. This is especially true when they are used for scientific computing. Their networks are the descendants of the MPP networks, but the communication software in use has been designed for wide-area networks with client/server applications in mind. We present a communication protocol which has been designed specifically for large-scale clusters with a scientific application workload. The protocol takes advantage of the low error rate and high performance of these networks. It is adapted to the peculiarities of these MPP-like networks and the communication characteristics of scientific applications. This paper only presents the protocol itself and the ideas behind it. We refer the reader to other publications for more information about scalability, performance, and usage of the protocol presented here.
[1]
Shawn David Ostermann.
Reliable message transport for network communication
,
1994
.
[2]
Rolf Riesen,et al.
Experience in offloading protocol processing to a programmable NIC
,
2002,
Proceedings. IEEE International Conference on Cluster Computing.
[3]
Rolf Riesen,et al.
Portals 3.0: protocol building blocks for low overhead communication
,
2002,
Proceedings 16th International Parallel and Distributed Processing Symposium.
[4]
Ron Brightwell,et al.
The Portals 3.0 Message Passing Interface Revision 1.0
,
1999
.
[5]
Charles L. Seitz,et al.
Myrinet: A Gigabit-per-Second Local Area Network
,
1995,
IEEE Micro.