Estimating the Optimal Configuration of a Multi-Core Cluster: A Preliminary Study

While multi-core processors became crucial elements of high-performance computing, the clusters of multi-core processors are still difficult target for optimization, since the communication time differs significantly in intra-node communication and inter-node communication.  It is thus very important to select the optimal configuration of multi-core clusters; i.e., to find the optimal number of processes and to find the optimal allocation of processes to nodes. This study first elucidates the multi-core specific issues by examining the effect of process allocation in a multi-core cluster, using divergent and convergent allocations of four benchmark programs. Then, the estimation of optimal configuration is attempted for divergent and convergent allocations, using execution-time estimation models for single-core clusters.  Though the estimation errors were less than 20\% in most cases, further improvement is expected by incorporating the enhancements for multi-core systems into estimation models.

[1]  Wenguang Chen,et al.  MPIPP: an automatic profile-guided parallel process placement toolset for SMP clusters and multiclusters , 2006, ICS '06.

[2]  Peter C. J. Graham,et al.  On the Programming Impact ofMulti-Core,Multi-Processor Nodes inMPI Clusters , 2007, 21st International Symposium on High Performance Computing Systems and Applications (HPCS'07).

[3]  Dhabaleswar K. Panda,et al.  Designing High Performance and Scalable MPI Intra-node Communication Support for Clusters , 2006, 2006 IEEE International Conference on Cluster Computing.

[4]  Sadaf R. Alam,et al.  Characterization of Scientific Workloads on Systems with Multi-Core Processors , 2006, 2006 IEEE International Symposium on Workload Characterization.

[5]  Shuichi Ichikawa,et al.  An execution-time estimation model for heterogeneous clusters , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[6]  Shuichi Ichikawa,et al.  Optimizing process allocation of parallel programs for heterogeneous clusters , 2009, Concurr. Comput. Pract. Exp..

[7]  Mario A. R. Dantas,et al.  An Experimental Study on How to Build Efficient Multi-core Clusters for High Performance Computing , 2008, 2008 11th IEEE International Conference on Computational Science and Engineering.

[8]  Martin Cuma,et al.  Using benchmarking to determine efficient usage of nodes in a cluster , 2007 .

[9]  Hossein Pourreza,et al.  On the Programming Impact of Multi-Core, Multi-Processor Nodes in MPI Clusters , 2007 .

[10]  Dhabaleswar K. Panda,et al.  Understanding the Impact of Multi-Core Architecture in Cluster Computing: A Case Study with Intel Dual-Core System , 2007, Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid '07).

[11]  Shuichi Ichikawa,et al.  Optimizing the configuration of a heterogeneous cluster with multiprocessing and execution-time estimation , 2005, Parallel Comput..