Enabling a PC Cluster for High-Performance Computing 1

Due to their excellent cost/performance ratio, clusters of PCs can be attractive high-performance computing (HPC) platforms. Yet, their limited communication performance over standard LANs is still prohibitive for parallel applications. The project “Shared Memory in a LAN-like Environment“ (SMiLE) at LRR-TUM adopts Scalable Coherent Interface (SCI) interconnect technology to build, and provide software for, a PC cluster which, with hardware-based distributed shared memory (DSM) and high-performance communication characteristics, is regarded as well suited for HPC. The paper describes the key features of the enabling technology, SCI. It then discusses the developments and important results of the SMiLE project so far: the development and initial performance of a PCI/SCI interface card, and the design and initial performance results of low-latency communication layers, Active Messages and a sockets emulation library.

[1]  Samuel J. Leffler,et al.  The design and implementation of the 4.3 BSD Unix operating system , 1991, Addison-Wesley series in computer science.

[2]  Jack Dongarra,et al.  A User''s Guide to PVM Parallel Virtual Machine , 1991 .

[3]  Seth Copen Goldstein,et al.  Active messages: a mechanism for integrating communication and computation , 1998, ISCA '98.

[4]  Jack Dongarra,et al.  PVM: Parallel virtual machine: a users' guide and tutorial for networked parallel computing , 1995 .

[5]  Thorsten von Eicken,et al.  Low-Latency Communication Over ATM Networks Using Active Messages , 1995, IEEE Micro.

[6]  Chris J. Scheiman,et al.  Implementing Active Messages and Split-C for SCI Clusters and Some Architectural Implications , 1996 .

[7]  Armando P. Stettner The design and implementation of the 4.3BSD UNIX operating system , 1988 .

[8]  David B. Gustavson The Scalable Coherent Interface and related standards projects , 1992, IEEE Micro.

[9]  Thorsten von Eicken,et al.  Low-latency communication over ATM networks using active messages , 1994, Symposium Record Hot Interconnects II.

[10]  Corporate IEEE Standard for Scalable Coherent Interface, Science: IEEE Std. 1596-1992 , 1993 .

[11]  Knut Omang,et al.  Performance of Low-Cost UltraSparc Multiprocessors connected by SCI , 1996 .

[12]  Qiang Li,et al.  Local-Area MultiProcessor: the scalable coherent interface , 1994, Optics East.

[13]  Seth Copen Goldstein,et al.  Active messages: a mechanism for integrating communication and computation , 1998, ISCA '98.

[14]  Thorsten von Eicken,et al.  U-Net: a user-level network interface for parallel and distributed computing , 1995, SOSP.