SP2 System Architecture

Scalable parallel systems are increasingly being used today to address existing and emerging application areas that require performance levels significantly beyond what symmetric multiprocessors are capable of providing. These areas include traditional technical computing applications, commercial computing applications such as decision support and transaction processing, and emerging areas such as “grand challenge” applications, digital libraries, and video production and distribution. The IBM SP2™ is a general-purpose scalable parallel system designed to address a wide range of these applications. This paper gives an overview of the architecture and structure of SP2, discusses the rationale for the significant system design decisions that were made, indicates the extent to which key objectives were met, and identifies future system challenges and advanced technology development areas.

[1]  Dror G. Feitelson,et al.  Parallel File Systems for the IBM SP Computers , 1995, IBM Syst. J..

[2]  Tse-yun Feng,et al.  A Survey of Interconnection Networks , 1981, Computer.

[3]  Philip S. Yu,et al.  Performance Comparison of IO Shipping and Database Call Shipping: Schemes in Multisystem Partitioned Databases , 1989, Perform. Evaluation.

[4]  Vijay K. Naik,et al.  A Scalable Implementation of the NAS Parallel Benchmark BT on Distributed Memory Systems , 1995, IBM Syst. J..

[5]  David H. Bailey,et al.  The Nas Parallel Benchmarks , 1991, Int. J. High Perform. Comput. Appl..

[6]  Dennis G. Shea,et al.  The SP2 High-Performance Switch , 1995, IBM Syst. J..

[7]  J. T. Robinson,et al.  On coupling multi-systems through data sharing , 1987, Proceedings of the IEEE.

[8]  William J. Dally,et al.  The message-driven processor: a multicomputer processing node with efficient mechanisms , 1992, IEEE Micro.

[9]  Jim Gray,et al.  Benchmark Handbook: For Database and Transaction Processing Systems , 1992 .

[10]  Michael Stonebraker,et al.  The Case for Shared Nothing , 1985, HPTS.

[11]  Marek Sherman Distributed transaction processing with Encina , 1993, [1993] Proceedings of the Second International Conference on Parallel and Distributed Information Systems.

[12]  Donald Yeung,et al.  Sparcle: an evolutionary processor design for large-scale multiprocessors , 1993, IEEE Micro.

[13]  Philip S. Yu,et al.  On multisystem coupling through function request shipping , 1986, IEEE Transactions on Software Engineering.

[14]  Bowen Alpern,et al.  High-Performance Parallel Implementations of the NAS Kernel Benchmarks on the IBM SP2 , 1995, IBM Syst. J..

[15]  David H. Bailey,et al.  NAS parallel benchmark results , 1993, IEEE Parallel & Distributed Technology: Systems & Applications.

[16]  Kazuo Goto,et al.  The DCS: a new approach to multisystem data-sharing , 1984, AFIPS '84.

[17]  Steven W. White,et al.  POWER2: Next generation of the RISC System/6000 family , 1994, IBM J. Res. Dev..

[18]  Nancy P. Kronenberg,et al.  VAXcluster: a closely-coupled distributed system , 1986, TOCS.

[19]  Robert W. Horst Massively parallel systems you can trust , 1994, Proceedings of COMPCON '94.

[20]  David Clay,et al.  Informix parallel data query (PDQ) , 1993, [1993] Proceedings of the Second International Conference on Parallel and Distributed Information Systems.

[21]  Marc Snir,et al.  The Communication Software and Parallel Environment of the IBM SP2 , 1995, IBM Syst. J..

[22]  Anoop Gupta,et al.  The Stanford Dash multiprocessor , 1992, Computer.

[23]  Jehoshua Bruck,et al.  The IBM External User Interface for Scalable Parallel Systems , 1994, Parallel Comput..

[24]  James R. Larus,et al.  Tempest and typhoon: user-level shared memory , 1994, ISCA '94.

[25]  Guy L. Steele,et al.  The High Performance Fortran Handbook , 1993 .

[26]  Jack J. Dongarra,et al.  The PVM Concurrent Computing System: Evolution, Experiences, and Trends , 1994, Parallel Comput..

[27]  David E. Culler,et al.  A case for NOW (networks of workstation) , 1995, PODC '95.