LLSuperCloud: Sharing HPC systems for diverse rapid prototyping

The supercomputing and enterprise computing arenas come from very different lineages. However, the advent of commodity computing servers has brought the two arenas closer than they have ever been. Within enterprise computing, commodity computing servers have resulted in the development of a wide range of new cloud capabilities: elastic computing, virtualization, and data hosting. Similarly, the supercomputing community has developed new capabilities in heterogeneous, massively parallel hardware and software. Merging the benefits of enterprise clouds and supercomputing has been a challenging goal. Significant effort has been expended in trying to deploy supercomputing capabilities on cloud computing systems. These efforts have resulted in unreliable, low-performance solutions that require enormous expertise to maintain. LLSuperCloud provides a novel solution to the problem of merging enterprise cloud and supercomputing technology. More specifically, LLSuperCloud reverses the traditional paradigm of attempting to deploy supercomputing capabilities on a cloud and instead deploys cloud capabilities on a supercomputer. The result is a system that can handle heterogeneous, massively parallel workloads while also providing high-performance elastic computing, virtualization, and databases. The benefits of LLSuperCloud are highlighted using a mixed workload of C MPI, parallel MATLAB, Java, databases, and virtualized Web services.
