An allocation and provisioning model of science cloud for high throughput computing applications

Recent cloud computing enables numerous scientists to earn advantages by serving on-demand and elastic resources whenever they desire computing resources. This science cloud paradigm has been actively developed and investigated to satisfy requirements of the scientists such as performance, feasibility and so on. However, effective allocation and provisioning virtual machines on clouds are still considered as a challenging issue in scientists using high throughput computing, since it determines whether they can earn benefits from economy of scale in clouds or not. Moreover, allocating the "right" provisioned cloud resources on an optimal data center is very important as performance can vary widely depending on where and under what circumstances it actually runs. In these reasons, it is required that an appropriate and suitable model for science cloud to support increasing scientists and computations. In this paper, we present an allocation and provisioning model of science cloud, especially for high throughput computing applications. In this model, we utilize job traces where statistical method is applied to pick the most influential features for improving application performance. With the feature, the system determines where VM is deployed (allocation) and which instance type is proper (provisioning). An adaptive evaluation step which is subsequent to the job execution enables our model to adapt to dynamical computing environments. We show performance achievements as comparing the proposed model with other policies through experiments. Finally, we expect that improvement on performance as well as reduction of cost from resource consumption through our model.

[1]  Hui Li,et al.  Efficient response time predictions by exploiting application and resource state similarities , 2005, The 6th IEEE/ACM International Workshop on Grid Computing, 2005..

[2]  Seoyoung Kim,et al.  Application-specific Cloud Provisioning Model Using Job Profiles Analysis , 2012, 2012 IEEE 14th International Conference on High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems.

[3]  Prashant J. Shenoy,et al.  Resource overbooking and application profiling in a shared Internet hosting platform , 2009, TOIT.

[4]  Gang Wang,et al.  Appliance-Based Autonomic Provisioning Framework for Virtualized Outsourcing Data Center , 2007, Fourth International Conference on Autonomic Computing (ICAC'07).

[5]  Soonwook Hwang,et al.  Poster: HTCaaS: A Large-Scale High-Throughput Computing by Leveraging Grids, Supercomputers and Cloud , 2012, 2012 SC Companion: High Performance Computing, Networking Storage and Analysis.

[6]  David S. Goodsell,et al.  Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function , 1998, J. Comput. Chem..

[7]  Yi Liang,et al.  In Cloud, Can Scientific Communities Benefit from the Economies of Scale? , 2010, IEEE Transactions on Parallel and Distributed Systems.

[8]  Ian T. Jolliffe,et al.  Principal Component Analysis , 2002, International Encyclopedia of Statistical Science.