Enabling e-Science Applications on the Cloud with COMPSs

COMP Superscalar (COMPSs) is a programming framework that provides an easy-to-use programming model and a runtime to ease the development of applications for distributed environments. Thanks to its modular architecture COMPSs can use a wide range of computational infrastructures providing a uniform interface for job submission and file transfer operations through adapters for different middlewares. In the context of the VENUS-C project the COMPSs framework has been extended through the development of a programming model enactment service that allows researcher to transparently port and execute scientific applications in the Cloud. This paper presents the implementation of a bioinformatics workflow (using BLAST as core program), the porting to the COMPSs framework and its deployment on the VENUS-C platform. The proposed approach has been evaluated on a Cloud testbed using virtual machines managed by EMOTIVE Cloud and compared to a similar approach on the Azure platform and to other implementations on HPC infrastructures.

[1]  Jie Li,et al.  Bridging the Gap between Desktop and the Cloud for eScience Applications , 2010, 2010 IEEE 3rd International Conference on Cloud Computing.

[2]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[3]  Wu-chun Feng,et al.  The design, implementation, and evaluation of mpiBLAST , 2003 .

[4]  Jordi Torres Elastic Management of Tasks in Virtualized Environments , 2009 .

[5]  José A. B. Fortes,et al.  CloudBLAST: Combining MapReduce and Virtualization on Distributed Resources for Bioinformatics Applications , 2008, 2008 IEEE Fourth International Conference on eScience.

[6]  砂田 憲吾,et al.  Bridging the gap between , 2009 .

[7]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[8]  Eliza Varney Distributed Management Task Force, Inc , 2010 .

[9]  Rosa M. Badia,et al.  COMP Superscalar: Bringing GRID Superscalar and GCM Together , 2008, 2008 Eighth IEEE International Symposium on Cluster Computing and the Grid (CCGRID).