Memory ushering in a scalable computing cluster

Abstract Scalable computing clusters (SCC) are becoming an alternative to mainframes and MPPs for the execution of high performance, demanding applications in multi-user, time-sharing environments. In order to better utilize the multiple resources of such systems, it is necessary to develop means for cluster wide resource allocation and sharing, that will make an SCC easy to program and use. This paper presents the details of a memory ushering algorithm among the nodes of an SCC. This algorithm allows a node which has exhausted its main memory to use available memory in other nodes. The paper first presents results of simulations of several algorithms for process placement to nodes. It then describes the memory ushering algorithm of the MOSIX multicomputer operating system for an SCC and its performance.

[1]  Jack Dongarra,et al.  PVM: Parallel virtual machine: a users' guide and tutorial for networked parallel computing , 1995 .

[2]  Mor Harchol-Balter,et al.  Exploiting process lifetime distributions for dynamic load balancing , 1996, SIGMETRICS '96.

[3]  Ali R. Hurson,et al.  Scheduling and Load Balancing in Parallel and Distributed Systems , 1995 .

[4]  Mark Nuttall,et al.  A brief survey of systems providing process or object migration facilities , 1994, OPSR.

[5]  Fred Douglis,et al.  Transparent process migration: Design alternatives and the sprite implementation , 1991, Softw. Pract. Exp..

[6]  David E. Culler,et al.  A case for NOW (networks of workstation) , 1995, PODC '95.

[7]  Michael Hobbs,et al.  Performance comparison of process migration with remote process creation mechanisms in RHODOS , 1996, Proceedings of 16th International Conference on Distributed Computing Systems.

[8]  Philip Joyce,et al.  Performance Comparison of Process Migration with Remote Process Creation and Execution in RHODOS. , 1996, IEEE International Conference on Distributed Computing Systems.

[9]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[10]  Amnon Barak,et al.  The MOSIX Distributed Operating System: Load Balancing for UNIX , 1993 .

[11]  Charles L. Seitz,et al.  Myrinet: A Gigabit-per-Second Local Area Network , 1995, IEEE Micro.

[12]  Mark Nuttall Survey of systems providing process or object migration , 1994 .