Design and Evaluation of Opal2: A Toolkit for Scientific Software as a Service

Grid computing provides mechanisms for making large-scale computing environments available to a broad user community. With the advent of cloud computing, two related models have become increasingly popular: Software as a Service (SaaS), in which vendors provide key software products as services over the internet that users access to perform complex tasks, and Service as Software (SaS), in which customizable and repeatable services are packaged as software products that dynamically meet the demands of individual users. Both models are highly applicable to scientific software and its users. Opal2 is a toolkit for wrapping scientific applications as Web services on Grid and cloud computing resources. It gives scientific application developers a mechanism to expose the functionality of their codes via simple Web service APIs, abstracting away the details of the back-end infrastructure. Services may be combined via customized workflows for specific research areas and distributed as virtual machine images. In this paper, we describe the overall philosophy and architecture of the Opal2 framework, including its new plug-in architecture and data handling capabilities. We analyze its performance in typical cluster and Grid settings, and within virtual machines in a cloud computing environment, using Amazon's Elastic Compute Cloud (EC2).
