Performance versus Cost of a Parallel Conjugate Gradient Method in Cloud and Commodity Clusters

Summary Cloud computing is an emerging technology to run HPC applications using computing resources on a pay per use basis. The CG method is a linear solver which is used in many engineering and scientific applications, and is computationally demanding. We implement different approaches of a parallel CG method and compare their performance on different types of platforms: an HPC-optimized cluster, a built heterogeneous cluster, and Amazon cloud. We evaluate the performance of two approaches: broadcast and a ring-based communicationcomputation overlap, and compare to that of National Aeronautics and Space Administration Advanced Supercomputing CG parallel benchmark. We present an evaluation of the performance vis-a-vis cost tradeoff. The results show that, cloud instances suffer from network performance issues, which is revealed by the low performance of the CG method for small problem sizes. An HPC cloud instance type performs the best with relatively less cost than HPC-optimized commodity cluster and other more virtualized cluster instance types, for big problem sizes, while scaling well with increasing problem size. It gives better performance for overlap-based CG method; the performance increases and the cost decreases. Given the emergence of Cloud Computing, the results in this paper analyze performance and cost issues when Clouds are used for CG-based scientific and engineering applications.

[1]  David H. Bailey,et al.  The Nas Parallel Benchmarks , 1991, Int. J. High Perform. Comput. Appl..

[2]  Miron Livny,et al.  The cost of doing science on the cloud: The Montage example , 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.

[3]  Rajkumar Buyya,et al.  High-Performance Cloud Computing: A View of Scientific Applications , 2009, 2009 10th International Symposium on Pervasive Systems, Algorithms, and Networks.

[4]  Katarzyna Keahey Cloud Computing for Science , 2009, SSDBM.

[5]  Randy H. Katz,et al.  Above the Clouds: A Berkeley View of Cloud Computing , 2009 .

[6]  Renato Figueiredo,et al.  Science Clouds: Early Experiences in Cloud Computing for Scientific Applications , 2008 .

[7]  John Shalf,et al.  Performance Analysis of High Performance Computing Applications on the Amazon Web Services Cloud , 2010, 2010 IEEE Second International Conference on Cloud Computing Technology and Science.

[8]  Constantinos Evangelinos,et al.  Cloud Computing for parallel Scientific HPC Applications: Feasibility of Running Coupled Atmosphere- , 2008 .

[9]  Scott Hazelhurst,et al.  Scientific computing using virtual high-performance computing: a case study using the Amazon elastic computing cloud , 2008, SAICSIT '08.

[10]  J.W. Manke,et al.  Parallel computing in aerospace , 2001, Parallel Comput..

[11]  Jack Dongarra,et al.  MPI: The Complete Reference , 1996 .

[12]  Hee-Dae Kwon Efficient parallel implementations of finite element methods based on the conjugate gradient method , 2003, Appl. Math. Comput..

[13]  Ian Foster,et al.  Designing and building parallel programs , 1994 .

[14]  Khaled Shuaib,et al.  Empirical Study for Communication Cost of Parallel Conjugate Gradient on a Star-Based Network , 2010, 2010 Fourth Asia International Conference on Mathematical/Analytical Modelling and Computer Simulation.

[15]  Rajkumar Buyya,et al.  Market-Oriented Cloud Computing: Vision, Hype, and Reality for Delivering IT Services as Computing Utilities , 2008, 2008 10th IEEE International Conference on High Performance Computing and Communications.

[16]  Rajkumar Buyya,et al.  Article in Press Future Generation Computer Systems ( ) – Future Generation Computer Systems Cloud Computing and Emerging It Platforms: Vision, Hype, and Reality for Delivering Computing as the 5th Utility , 2022 .

[17]  Abhishek Gupta,et al.  Evaluation of HPC Applications on Cloud , 2011, 2011 Sixth Open Cirrus Summit.

[18]  L. Ismail Communication issues in parallel Conjugate Gradient method using a star-based network , 2010, 2010 International Conference on Computer Applications and Industrial Electronics.

[19]  Larry S.K. Fung,et al.  From Mega Cell to Giga Cell Reservoir Simulation , 2008 .