PACS-CS: a large-scale bandwidth-aware PC cluster for scientific computation

We have been developing a large-scale PC cluster named PACS-CS (Parallel Array Computer System for Computational Sciences) at the Center for Computational Sciences, University of Tsukuba, for a wide variety of computational science applications such as computational physics, computational materials science, and computational biology. We consider memory access bandwidth to be the most important issue for a computation node, so each node is equipped with a single CPU, unlike ordinary high-end PC clusters. The interconnection network for parallel processing is configured as a multi-dimensional hyper-crossbar network based on trunking of Gigabit Ethernet, to support large-scale scientific computation with physical space modeling. Based on this concept, we are developing an original motherboard that implements a single-CPU node with 8 Gigabit Ethernet ports and fits in half of a 19-inch rack-mountable 1U platform. In a preliminary performance evaluation, we confirmed that the computation part of a practical Lattice QCD code will be able to achieve 30% of peak performance, and that a bandwidth of up to 600 Mbyte/s will be achieved for unidirectional neighboring communication. PACS-CS will start operation in July 2006 with 2560 CPUs and a peak performance of 14.3 Tflops.
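The headline figures above imply a few per-node numbers that are easy to check. The following is a minimal sketch of that arithmetic, not part of the PACS-CS design itself. The function names are hypothetical, and the 125 Mbyte/s raw rate per Gigabit Ethernet link (1 Gbit/s divided by 8, ignoring protocol overhead) is an assumption that the abstract does not state.

```python
import math

# Figures quoted in the abstract.
TOTAL_PEAK_TFLOPS = 14.3      # system peak performance
NUM_CPUS = 2560               # one CPU per node
QCD_EFFICIENCY = 0.30         # fraction of peak for the Lattice QCD kernel
NEIGHBOR_BW_MBYTE_S = 600.0   # one-direction neighbor bandwidth

# Assumption (not from the abstract): raw one-direction rate of a single
# Gigabit Ethernet link, 1 Gbit/s = 125 Mbyte/s, protocol overhead ignored.
GBE_RAW_MBYTE_S = 1000.0 / 8.0


def per_cpu_peak_gflops(total_tflops, num_cpus):
    """Peak Gflops per CPU implied by the system-wide peak."""
    return total_tflops * 1000.0 / num_cpus


def min_trunked_links(target_mbyte_s, per_link_mbyte_s):
    """Smallest number of trunked links whose raw rate covers the target."""
    return math.ceil(target_mbyte_s / per_link_mbyte_s)


if __name__ == "__main__":
    peak = per_cpu_peak_gflops(TOTAL_PEAK_TFLOPS, NUM_CPUS)
    sustained = peak * QCD_EFFICIENCY
    links = min_trunked_links(NEIGHBOR_BW_MBYTE_S, GBE_RAW_MBYTE_S)
    print(f"peak per CPU:      {peak:.2f} Gflops")
    print(f"30% of peak:       {sustained:.2f} Gflops per CPU")
    print(f"links for 600 MB/s: at least {links}")
```

Under these assumptions, each CPU has a peak of roughly 5.6 Gflops, the Lattice QCD kernel would sustain roughly 1.7 Gflops per CPU, and 600 Mbyte/s in one direction needs at least five trunked Gigabit Ethernet links. How the 8 ports of a node are actually divided among the dimensions of the hyper-crossbar is not given in the abstract.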
