Telegraphos: high-performance networking for parallel processing on workstation clusters

Networks of workstations and high-performance microcomputers have rarely been used to run high-performance applications such as multimedia, simulations, and scientific and engineering applications because, although they offer significant aggregate computing power, they lack support for efficient message-passing and shared-memory communication. In this paper we present Telegraphos, a distributed system that provides efficient shared-memory support on top of a workstation cluster. We focus on the Telegraphos network interface, which provides a variety of shared-memory operations, such as remote reads, remote writes, and remote atomic operations, all launched from user level without any intervention by the operating system. Telegraphos I, the first Telegraphos prototype, has been implemented. Emphasis was put on rapid prototyping, so the technology used was conservative: FPGAs, SRAMs, and TTL buffers. Telegraphos II is the single-chip version of the Telegraphos architecture; its switch has been implemented and its network interface is being debugged.
