Multi-User System Management on SCI Clusters

The growing maturity of hardware and software components has tempted researchers to build very large SCI clusters with several hundred processors that are operated as high-performance compute servers in multi-user mode.

[1]  Axel Keller,et al.  RSD — Resource and Service Description , 1998 .

[2]  Reinhard Grebe,et al.  Parallele Datenverarbeitung mit dem Transputer, 3. Transputer-Anwender-Treffen TAT '91, Aachen, 17.-18. September 1991 , 1990, Transputer-Anwender-Treffen.

[3]  Nigel P. Topham,et al.  Performance of the decoupled ACRI-1 architecture: the perfect club , 1995, HPCN Europe.

[4]  Miron Livny,et al.  Condor-a hunter of idle workstations , 1988, [1988] Proceedings. The 8th International Conference on Distributed.

[5]  David Abramson,et al.  Nimrod: a tool for performing parametrised simulations using distributed workstations , 1995, Proceedings of the Fourth IEEE International Symposium on High Performance Distributed Computing.

[6]  Axel Keller,et al.  CCS resource management in networked HPC systems , 1998, Proceedings Seventh Heterogeneous Computing Workshop (HCW'98).

[7]  Francine Berman,et al.  Application-Level Scheduling on Distributed Heterogeneous Networks , 1996, Proceedings of the 1996 ACM/IEEE Conference on Supercomputing.

[8]  Miron Livny,et al.  A worldwide flock of Condors: Load sharing among workstation clusters , 1996, Future Gener. Comput. Syst..

[9]  Kurt Kremer,et al.  A Distributed Computing Center Software for the Efficient Use of Parallel Computer Systems , 1994, HPCN.

[10]  Andrew S. Grimshaw,et al.  Metasystems: An Approach Combining Parallel Processing and Heterogeneous Distributed Computing Systems , 1994, J. Parallel Distributed Comput..

[11]  Jörn Gehring,et al.  Architecture-Independent Request-Scheduling with Tight Waiting-Time Estimations , 1996, JSSPP.

[12]  Friedhelm Ramme,et al.  A General Purpose Resource Description Language , 1991, Transputer-Anwender-Treffen.

[13]  F. Tandiary,et al.  Batrun: utilizing idle workstations for large scale computing , 1996, IEEE Parallel Distributed Technol. Syst. Appl..

[14]  Charles L. Seitz,et al.  Myrinet: A Gigabit-per-Second Local Area Network , 1995, IEEE Micro.

[15]  Geoffrey C. Fox,et al.  Cluster Computing Review , 1995 .