Reducing the variance of point to point transfers in the IBM 9076 parallel computer

Commodity workstations have adopted standard UNIX-like environments so that scientists can efficiently develop and port applications across systems. UNIX-based environments, such as IBM's AIX, furnish such an operating environment while providing efficient uniprocessor utilization for user code execution. When these machines are interconnected with a low-latency (user-space) communication mechanism, large variances in point-to-point communication times are typically found for identical parallel programs. We contend that a large part of this variance is introduced by operating system support functions that can delay point-to-point user-space communications. We measure this effect experimentally by monitoring the change in the time taken to circulate a token through parallel processors connected in a virtual ring configuration. This paper proposes some solutions and experimentally validates their ability to reduce point-to-point message-passing variance on IBM 9076 (SP1) machines.
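
The token-ring measurement described above can be illustrated with a minimal sketch. It is not the authors' benchmark: threads and in-memory queues stand in for the SP1 processors and the user-space switch, and the names `ring_lap_times` and `summarize` are hypothetical. The sketch only shows the general idea of timing repeated laps of a token and examining the spread of those times.

```python
import queue
import statistics
import threading
import time


def ring_lap_times(n_nodes=4, n_laps=200):
    """Circulate a token around a virtual ring and time each full lap.

    Assumption: threads and queues replace real processors and the
    interconnect, so absolute numbers say nothing about an SP1.
    """
    if n_nodes < 2:
        raise ValueError("a ring needs at least two nodes")

    # inboxes[i] is the receive queue of node i.
    inboxes = [queue.Queue() for _ in range(n_nodes)]

    def forwarder(rank):
        # Nodes 1..n-1 pass whatever they receive to the next node.
        nxt = (rank + 1) % n_nodes
        while True:
            token = inboxes[rank].get()
            inboxes[nxt].put(token)
            if token is None:  # shutdown marker, forwarded once then exit
                return

    workers = [
        threading.Thread(target=forwarder, args=(rank,))
        for rank in range(1, n_nodes)
    ]
    for w in workers:
        w.start()

    # Node 0 injects the token and timestamps its return.
    lap_times = []
    for lap in range(n_laps):
        start = time.perf_counter()
        inboxes[1].put(lap)
        returned = inboxes[0].get()
        lap_times.append(time.perf_counter() - start)
        if returned != lap:
            raise RuntimeError("token was lost or reordered")

    # Send the shutdown marker around the ring and wait for the workers.
    inboxes[1].put(None)
    inboxes[0].get()
    for w in workers:
        w.join()
    return lap_times


def summarize(lap_times):
    """Return simple spread statistics for a list of lap times (seconds)."""
    return {
        "mean": statistics.mean(lap_times),
        "stdev": statistics.stdev(lap_times),
        "min": min(lap_times),
        "max": max(lap_times),
    }


if __name__ == "__main__":
    stats = summarize(ring_lap_times())
    for name, value in stats.items():
        print(f"{name}: {value * 1e6:.1f} us")
```

In a sketch like this, a large gap between the minimum and maximum lap time relative to the mean would be the kind of variance the abstract attributes to operating system activity; here it would instead reflect the host's thread scheduler.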
