A performance comparison of Linux and a lightweight kernel

In this paper, we compare running the Linux operating system on the compute nodes of ASCI Red hardware to running a specialized, highly-optimized lightweight kernel (LWK) operating system. We have ported Linux to the compute and service nodes of the ASCI Red supercomputer, and have run several benchmarks. We present performance and scalability results for Linux compared with the LWK environment. To our knowledge, this is the first direct comparison on identical hardware of Linux and an operating system designed specifically for large-scale supercomputers. In addition to presenting these results, we discuss the limitations of both operating systems, in terms of the empirical evidence as well as other important factors.

[1]  David L. Black,et al.  An OSF/1 UNIX for Massively Parallel Multicomputers , 1993, USENIX Winter.

[2]  Ronald Mraz,et al.  Reducing the variance of point to point transfers in the IBM 9076 parallel computer , 1994, Proceedings of Supercomputing '94.

[3]  Rolf Riesen,et al.  SUNMOS for the Intel Paragon - a brief user`s guide , 1994 .

[4]  Subhash Saini,et al.  Applications performance under OSF/1 AD and SUNMOS on Intel Paragon XP/S-15 , 1994, Proceedings of Supercomputing '94.

[5]  Thomas L. Sterling,et al.  BEOWULF: A Parallel Workstation for Scientific Computation , 1995, ICPP.

[6]  J. M. McGlaun,et al.  CTH: A software family for multi-dimensional shock physics analysis , 1995 .

[7]  David Scott,et al.  A TeraFLOP supercomputer in 1996: the ASCI TFLOP system , 1996, Proceedings of International Conference on Parallel Processing.

[8]  David S. Greenberg,et al.  Massively parallel computing using commodity components , 2000, Parallel Comput..

[9]  Dror G. Feitelson,et al.  Comparing Windows NT, Linux, and QNX as the basis for cluster systems , 2001, Concurr. Comput. Pract. Exp..

[10]  David F. Heidel,et al.  An Overview of the BlueGene/L Supercomputer , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[11]  Alan L. Cox,et al.  Practical, transparent operating system support for superpages , 2002, OPSR.

[12]  Steve Neuner Scaling Linux to new heights: the SGI Altix 3000 system , 2003 .

[13]  Rolf Riesen,et al.  On the appropriateness of commodity operating systems for large-scale, balanced computing systems , 2003, Proceedings International Parallel and Distributed Processing Symposium.

[14]  Proceedings. IEEE International Conference on Cluster Computing , 2003, 2003 Proceedings IEEE International Conference on Cluster Computing.

[15]  Scott Pakin,et al.  Identifying and Eliminating the Performance Variability on the ASCI Q Machine , 2003 .

[16]  Peter Druschel,et al.  Transparent operating system support for superpages , 2004 .

[17]  R. V. D. Wijngaart NAS Parallel Benchmarks Version 2.4 , 2022 .