A GPU Accelerated High Performance Cloud Computing Infrastructure for Grid Computing Based Virtual Environmental Laboratory

Numerical models play a main role in the earth sciences, filling in the gap between experimental and theoretical approach. Nowadays, the computational approach is widely recognized as the complement to the scientific analysis. Meanwhile, the huge amount of observed/modelled data, and the need to store, process, and refine them, often makes the use of high performance parallel computing the only effective solution to ensure the effective usability of numerical applications, as in the field of atmospheric /oceanographic science, where the development of the Earth Simulator supercomputer [65] is just the edge. Grid Computing [38] is a key technology in all the computational sciences, allowing the use of inhomogeneous and geographically spread computational resources, shared across a virtual laboratory. Moreover, this technology offers several invaluable tools in ensuring security, performance, and availability of the applications. A large amount of simulation models have been successfully developed in the past, but a lot of them are poorly engineered and have been designed following a monolithic programming approach, unsuitable for a distributed computing environment or to be accelerated by GPGPUs [53]. The use of the grid computing technologies is often limited to computer science specialists, because of the complexity of grid itself and of its middleware. Another source of complexity resides on the use of coupled models, as, for example, in the case of atmosphere/seawave/ocean dynamics. The grid enabling approach could be hampered by the grid software and hardware infrastructure complexity. In this context, the build-up of a grid-aware virtual laboratory for environmental applications is a topical challenge for computer scientists. The term “e-Science” is usually referred to computationally enhanced science. With the rise of cloud computing technology and on-demand resource allocation, the meaning of eScience could straightforwardly change to elastic-Science. The aim of our virtual laboratory is to bridge the gap between the technology push of the high performance cloud computing and the pull of a wide range of scientific experimental applications. It provides generic functionalities supporting a wide class of specific e-Science application environments and

[1]  Yong Zhao,et al.  Cloud Computing and Grid Computing 360-Degree Compared , 2008, GCE 2008.

[2]  Giulio Giunta,et al.  pPOM: A nested, scalable, parallel and Fortran 90 implementation of the Princeton Ocean Model , 2007, Environ. Model. Softw..

[3]  Randy H. Katz,et al.  A view of cloud computing , 2010, CACM.

[4]  Ian T. Foster,et al.  Using Hybrid Grid/Cloud Computing Technologies for Environmental Data Elastic Storage, Processing, and Provisioning , 2010, Handbook of Cloud Computing.

[5]  Ian T. Foster,et al.  Grid Services for Distributed System Integration , 2002, Computer.

[6]  Francisco Javier García Blas,et al.  A scalable MPI implementation of an ad-hoc parallel I / O system , 2009 .

[7]  Vaidy S. Sunderam,et al.  PVM: A Framework for Parallel Distributed Computing , 1990, Concurr. Pract. Exp..

[8]  Rudolf Eigenmann,et al.  Executing MPI programs on virtual machines in an Internet sharing system , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.

[9]  Raffaele Montella,et al.  Modeling and computational issues for air/water quality problems: a grid computing approach , 2005 .

[10]  Karsten Schwan,et al.  High performance and scalable I/O virtualization via self-virtualized devices , 2007, HPDC '07.

[11]  Robert Latham,et al.  Scalable I/O forwarding framework for high-performance computing systems , 2009, 2009 IEEE International Conference on Cluster Computing and Workshops.

[12]  A. Blumberg,et al.  A Description of a Three‐Dimensional Coastal Ocean Circulation Model , 2013 .

[13]  Vanish Talwar,et al.  GViM: GPU-accelerated virtual machines , 2009, HPCVirt '09.

[14]  Borja Sotomayor,et al.  Combining batch execution and leasing using virtual machines , 2008, HPDC '08.

[15]  Raffaele Montella,et al.  A globus toolkit 4 based instrument service for environmental data acquisition and distribution , 2008, UPGRADE '08.

[16]  Randy H. Katz,et al.  Above the Clouds: A Berkeley View of Cloud Computing , 2009 .

[17]  David K. McAllister,et al.  Fast Matrix Multiplies Using Graphics Hardware , 2001, ACM/IEEE SC 2001 Conference (SC'01).

[18]  Pat Hanrahan,et al.  Brook for GPUs: stream computing on graphics hardware , 2004, ACM Trans. Graph..

[19]  Jesús Carretero,et al.  Design and Evaluation of Multiple-Level Data Staging for Blue Gene Systems , 2011, IEEE Transactions on Parallel and Distributed Systems.

[20]  N. Booij,et al.  THE "SWAN" WAVE MODEL FOR SHALLOW WATER , 1997 .

[21]  Lakshmi Sobhana Kalli,et al.  Market-Oriented Cloud Computing : Vision , Hype , and Reality for Delivering IT Services as Computing , 2013 .

[22]  P. Mell,et al.  The NIST Definition of Cloud Computing , 2011 .

[23]  Constantinos Evangelinos,et al.  Cloud Computing for parallel Scientific HPC Applications: Feasibility of Running Coupled Atmosphere- , 2008 .

[24]  ともやん KVM (Kernel-based Virtual Machine) - 仮想化 , 2009 .

[25]  R. Vautard,et al.  Aerosol modeling with CHIMERE—preliminary evaluation at the continental scale , 2004 .

[26]  Ravi S. Nanjundiah,et al.  Parallel implementation, validation, and performance of MM5 , 1994 .

[27]  Chris Rose,et al.  A Break in the Clouds: Towards a Cloud Definition , 2011 .

[28]  Suchuan Dong,et al.  Cross-site computations on the TeraGrid , 2005, Computing in Science & Engineering.

[29]  Hendrik L. Tolman,et al.  A Third-Generation Model for Wind Waves on Slowly Varying, Unsteady, and Inhomogeneous Depths and Currents , 1991 .

[30]  Giulio Giunta,et al.  A GPGPU Transparent Virtualization Component for High Performance Computing Clouds , 2010, Euro-Par.

[31]  Rajkumar Buyya,et al.  A taxonomy of scientific workflow systems for grid computing , 2005, SGMD.

[32]  Jesús Carretero,et al.  AHPIOS: An MPI-Based Ad Hoc Parallel I/O System , 2008, 2008 14th IEEE International Conference on Parallel and Distributed Systems.

[33]  Jian Wang,et al.  XenLoop: a transparent high performance inter-VM network loopback , 2008, HPDC '08.

[34]  Ian T. Foster,et al.  Globus Toolkit Version 4: Software for Service-Oriented Systems , 2005, Journal of Computer Science and Technology.

[35]  Ian T. Foster,et al.  MPICH-G2: A Grid-enabled implementation of the Message Passing Interface , 2002, J. Parallel Distributed Comput..

[36]  Rajkumar Buyya,et al.  High-Performance Cloud Computing: A View of Scientific Applications , 2009, 2009 10th International Symposium on Pervasive Systems, Algorithms, and Networks.

[37]  Tetsuya Sato The earth simulator: Roles and impacts , 2004 .

[38]  Ming Zhou,et al.  A Xen-Based Data Sharing & Access Controlling Method , 2009, 2009 Third International Symposium on Intelligent Information Technology Application.

[39]  Rajkumar Buyya,et al.  Market-Oriented Cloud Computing: Vision, Hype, and Reality for Delivering IT Services as Computing Utilities , 2008, 2008 10th IEEE International Conference on High Performance Computing and Communications.

[40]  Lizhe Wang,et al.  Scientific Cloud Computing: Early Definition and Experience , 2008, 2008 10th IEEE International Conference on High Performance Computing and Communications.

[41]  Ian Foster,et al.  The Grid: A New Infrastructure for 21st Century Science , 2002 .

[42]  Douglas Thain,et al.  Distributed computing in practice: the Condor experience , 2005, Concurr. Pract. Exp..

[43]  Jian Wang,et al.  XenLoop: a transparent high performance inter-vm network loopback , 2008, HPDC.

[44]  A. Moore,et al.  Near real-time ocean circulation assimilation and prediction in the Intra-Americas Sea with ROMS , 2009 .

[45]  Giulio Giunta,et al.  A Grid Computing Based Virtual Laboratory for Environmental Simulations , 2007, PPAM.

[46]  Giulio Giunta,et al.  Multidimensional Environmental Data Resource Brokering on Computational Grids and Scientific Clouds , 2010, Handbook of Cloud Computing.

[47]  Daniela di Serafino,et al.  An Operational Mesoscale Air Quality Model for the Campania Region , 2000 .

[48]  G. Bruce Berriman,et al.  On the Use of Cloud Computing for Scientific Workflows , 2008, 2008 IEEE Fourth International Conference on eScience.

[49]  Giulio Giunta,et al.  A Grid Computing Based Virtual Laboratory for Environmental Simulations , 2006, Euro-Par.

[50]  William Gropp MPI at Exascale: Challenges for Data Structures and Algorithms , 2009, PVM/MPI.

[51]  Ian T. Foster,et al.  The anatomy of the grid: enabling scalable virtual organizations , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.

[52]  Richard Wolski,et al.  The Eucalyptus Open-Source Cloud-Computing System , 2009, 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid.