YML-PC: A Reference Architecture Based on Workflow for Building Scientific Private Clouds

Cloud computing platforms such as Amazon EC2 provide customers flexible, on-demand resources at low cost. However, while the existing offerings are useful for providing basic computation and storage resources, they fail to consider factors such as security, custom, and policy. So, many enterprises and research institutes would not like to utilize those public Clouds. According to investigations on real requirements from scientific computing users in China, the project YML-PC has been started to build private Clouds and hybrid Clouds for them. In this paper, we will focus on the first step of YML-PC to present a reference architecture based on the workflow framework YML for building scientific private Clouds. Then, some key technologies such as trust model, data persistence, and schedule mechanisms in YML-PC are discussed. Finally, some experiments are carried out to testify that the solution presented in this paper is more efficient.

[1]  Randy H. Katz,et al.  Above the Clouds: A Berkeley View of Cloud Computing , 2009 .

[2]  Rajkumar Buyya,et al.  Evaluating the cost-benefit of using cloud computing to extend the capacity of clusters , 2009, HPDC '09.

[3]  Ling Shang,et al.  Solution of Large Scale Matrix Inversion on Cluster and Grid , 2008, 2008 Seventh International Conference on Grid and Cooperative Computing.

[4]  Hai Jin,et al.  Evaluating MapReduce on Virtual Machines: The Hadoop Case , 2009, CloudCom.

[5]  Antonio Puliafito,et al.  Cloud@Home: Bridging the Gap between Volunteer and Cloud Computing , 2009, ICIC.

[6]  Serge G. Petiton,et al.  A peer to peer computing framework: design and performance evaluation of YML , 2004, Third International Symposium on Parallel and Distributed Computing/Third International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Networks.

[7]  Zhijian Wang,et al.  TM-DG: a trust model based on computer users' daily behavior for desktop grid platform , 2007, CompFrame '07.

[8]  Lizhe Wang,et al.  Scientific Cloud Computing: Early Definition and Experience , 2008, 2008 10th IEEE International Conference on High Performance Computing and Communications.

[9]  David P. Anderson,et al.  Exploiting non-dedicated resources for cloud computing , 2010, 2010 IEEE Network Operations and Management Symposium - NOMS 2010.

[10]  Franck Cappello,et al.  Cost-benefit analysis of Cloud Computing versus desktop grids , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.

[11]  Yong Zhao,et al.  Cloud Computing and Grid Computing 360-Degree Compared , 2008, GCE 2008.

[12]  Serge G. Petiton,et al.  Federation and Abstraction of Heterogeneous Global Computing Platforms with the YML Framework , 2009, 2009 International Conference on Complex, Intelligent and Software Intensive Systems.

[13]  Ling Shang,et al.  A New Parallel Paradigm for Block-Based Gauss-Jordan Algorithm , 2009, 2009 Eighth International Conference on Grid and Cooperative Computing.

[14]  Zhe Zhang,et al.  Improving Data Availability for Better Access Performance: A Study on Caching Scientific Data on Distributed Desktop Workstations , 2009, Journal of Grid Computing.

[15]  Simson L. Garfinkel,et al.  An Evaluation of Amazon's Grid Computing Services: EC2, S3, and SQS , 2007 .

[16]  Rajkumar Buyya,et al.  High-Performance Cloud Computing: A View of Scientific Applications , 2009, 2009 10th International Symposium on Pervasive Systems, Algorithms, and Networks.

[17]  Franck Cappello,et al.  Grid'5000: a large scale and highly reconfigurable grid experimental testbed , 2005, The 6th IEEE/ACM International Workshop on Grid Computing, 2005..

[18]  Luís Moura Silva,et al.  Evaluating the performance and intrusiveness of virtual machines for desktop grid computing , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.

[19]  Philippe Smets The transferable belief model and other interpretations of Dempster-Shafer's model , 1990, UAI.

[20]  Mitsuhisa Sato,et al.  OmniRPC: a grid RPC system for parallel programming in cluster and grid environment , 2003, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings..

[21]  Jean-Marc Vincent,et al.  Mining for statistical models of availability in large-scale distributed systems: An empirical study of SETI@home , 2009, 2009 IEEE International Symposium on Modeling, Analysis & Simulation of Computer and Telecommunication Systems.

[22]  Eddy Caron,et al.  Cloud Computing Resource Management through a Grid Middleware: A Case Study with DIET and Eucalyptus , 2009, 2009 IEEE International Conference on Cloud Computing.

[23]  Gilles Fedak,et al.  XtremWeb: Building an Experimental Platform for Global Computing , 2000, GRID.

[24]  Geoffrey C. Fox,et al.  Using clouds to provide grids with higher levels of abstraction and explicit support for usage modes , 2009 .

[25]  Kenichi Hagihara,et al.  Computing Low Latency Batches with Unreliable Workers in Volunteer Computing Environments , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.

[26]  Gilles Fedak,et al.  The Computational and Storage Potential of Volunteer Computing , 2006, Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGRID'06).

[27]  Serge G. Petiton,et al.  Workflow Global Computing with YML , 2006, 2006 7th IEEE/ACM International Conference on Grid Computing.