The lightweight protocol CLIC: performance of an MPI implementation on CLIC

The CLIC is a lightweight protocol that has recently been proposed for efficient communication in clusters using the Linux Operating System. Besides optimizing communication performance, by reducing the latencies and increasing the bandwidth figures even for short messages, the proposed communication layer also meets other requirements such as multiprogramming, portability, protection against corrupted programs, reliable message delivery, direct access to the network for all applications, etc. In this way, instead of removing the operating system kernel from the critical path and creating a user-level network interface, the operating system support has been optimized to provide reliable and efficient network software, avoiding the TCP/IP protocol stack. The LAM-MPI communication layer has been implemented on top of the proposed protocol and the communication performance has been tested in a cluster of PCs with Linux OS and interconnected with Fast Ethernet.

[1]  Giovanni Chiola,et al.  GAMMA: A low-cost network of workstations based on active messages , 1997, PDP.

[2]  Kees Verstoep,et al.  Performance of a High-Level Parallel Language on a High-Speed Network , 1997, J. Parallel Distributed Comput..

[3]  Henri E. Bal,et al.  User-Level Network Interface Protocols , 1998, Computer.

[4]  Thomas L. Sterling,et al.  BEOWULF: A Parallel Workstation for Scientific Computation , 1995, ICPP.

[5]  Bernard Tourancheau,et al.  BIP: A New Protocol Designed for High Performance Networking on Myrinet , 1998, IPPS/SPDP Workshops.

[6]  Rajkumar Buyya,et al.  2001 IEEE International Conference on Cluster Computing , 2001 .

[7]  Andrew A. Chien,et al.  Software overhead in messaging layers: where does the time go? , 1994, ASPLOS VI.

[8]  Antonio F. Díaz,et al.  CLIC: Fast Communication on Linux Clusters , 2000, CLUSTER.

[9]  Thorsten von Eicken,et al.  U-Net: a user-level network interface for parallel and distributed computing , 1995, SOSP.

[10]  Jehoshua Bruck,et al.  Efficient message passing interface (MPI) for parallel computing on clusters of workstations , 1995, SPAA '95.

[11]  David A. Patterson,et al.  Logp quantified: the case for low-overhead local area networks , 1995 .

[12]  David E. Culler,et al.  Virtual network transport protocols for Myrinet , 1998, IEEE Micro.

[13]  Andrew A. ChienJanuary Fast Messages ( FM ) : E cient , Portable Communication for Workstation Clusters and Massively-Parallel Processors , 1997 .

[14]  Jack Dongarra,et al.  MPI: The Complete Reference , 1996 .

[15]  Guang R. Gao,et al.  Polling Watchdog: Combining Polling and Interrupts for Efficient Message Handling , 1996, International Symposium on Computer Architecture.

[16]  Giovanni Chiola,et al.  Efficient parallel processing on low-cost clusters with GAMMA active ports , 2000, Parallel Comput..

[17]  Thorsten von Eicken,et al.  Low-Latency Communication over Fast Ethernet , 1996, Euro-Par, Vol. I.

[18]  Gianni Conte,et al.  An MPI Implementation on the Top of the Virtual Interface Architecture , 1999, PVM/MPI.

[19]  Kees Verstoep,et al.  Efficient reliable multicast on Myrinet , 1996, Proceedings of the 1996 ICPP Workshop on Challenges for Parallel Processing.

[20]  Charles L. Seitz,et al.  Myrinet: A Gigabit-per-Second Local Area Network , 1995, IEEE Micro.

[21]  Hiroshi Tezuka,et al.  Pin-down cache: a virtual memory management technique for zero-copy communication , 1998, Proceedings of the First Merged International Parallel Processing Symposium and Symposium on Parallel and Distributed Processing.

[22]  David E. Culler,et al.  A case for NOW (networks of workstation) , 1995, PODC '95.