Providing Dynamic Virtualized Access to Grid Resources via the Web 2.0 Paradigm

Grid systems provide mechanisms for single sign-on, and uniform APIs for job submission and data transfer, in order to allow the coupling of distributed resources in a seamless manner. However, new users face a daunting barrier of entry due to the high cost of deployment and maintenance. They are often required to learn complex concepts relative to grid infrastructures (credential management, scheduling systems, data staging, etc). To most scientific users, running their applications with minimal changes and yet getting results faster is highly desirable, without having to know much about how the resources are used. Hence, a higher level of abstraction must be provided for the underlying infrastructure to be used effectively. For this purpose, as part of our prior work, we have developed the Opal toolkit for exposing applications on grid resources as simple Web services [1]. Opal provides a basic set of APIs that allows users to execute their deployed applications through graphical user interfaces, or via programmatic means. In this paper, we present our ongoing work to extend the Opal toolkit that enables the creation of an end-to-end infrastructure to dynamically leverage scientific applications on grid resources. In particular, we focus on the following two key extensions. We have developed a command-line syntax description language that can be published via the Opal interface. This enables the creation of dynamic Web forms for job submission, and may even be used for creating familiar application-specific user interfaces in other types of Problem Solving Environments (PSE). Additionally, we have extended the Community Scheduler Framework (CSF4) Meta-scheduler to support the concept of applications as first class resources. This enables Opal to submit jobs to different grid resources where an application is deployed without affecting how user applications interact with Opal. We believe that these extensions enable the Opal toolkit to effectively leverage multiple grid resources, and provide access to the users in a transparent Web 2.0 fashion.

[1]  William E. Allcock,et al.  The Globus Striped GridFTP Framework and Server , 2005, ACM/IEEE SC 2005 Conference (SC'05).

[2]  Xiaohui Wei,et al.  My WorkSphere: Integrative work environment for grid-unaware biomedical researchers and applications , 2006 .

[3]  Donald F. Ferguson,et al.  The WS-Resource Framework , 2004 .

[4]  Francisco Curbera,et al.  Web Services Business Process Execution Language Version 2.0 , 2007 .

[5]  T. Oinn,et al.  Soaplab - a unified Sesame door to analysis tools , 2003 .

[6]  Simon J. Cox Proceedings of the UK e-science All Hands Meeting , 2007 .

[7]  Jennifer M. Schopf,et al.  Performance analysis of the Globus Toolkit Monitoring and Discovery Service, MDS2 , 2004, IEEE International Conference on Performance, Computing, and Communications, 2004.

[8]  A. Kropp,et al.  Web Services for Remote Portlets Specification , 2003 .

[9]  Putchong Uthayopas,et al.  SCMS: An Integrated Cluster Management Tool for Beowulf Cluster System , 2000, PDPTA.

[10]  Xiaohui Wei,et al.  GDIA: A Scalable Grid Infrastructure for Data Intensive Applications , 2006, 2006 International Conference on Hybrid Information Technology.

[11]  Karan Bhatia,et al.  SOAs for Scientific Applications: Experiences and Challenges , 2007, Third IEEE International Conference on e-Science and Grid Computing (e-Science 2007).

[12]  Xiaohui Wei,et al.  CSF4: A WSRF Compliant Meta-Scheduler , 2006, GCA.

[13]  Wilfred W. Li,et al.  MEME: discovering and analyzing DNA and protein sequence motifs , 2006, Nucleic Acids Res..

[14]  Nathan A. Baker,et al.  National Biomedical Computation Resource ( NBCR ) : Developing End-to-End Cyberinfrastructure for Multiscale Modeling in Biomedical Research , 2006 .

[15]  Kim K. Baldridge,et al.  Opal: SimpleWeb Services Wrappers for Scientific Applications , 2006, 2006 IEEE International Conference on Web Services (ICWS'06).

[16]  David E. Culler,et al.  The ganglia distributed monitoring system: design, implementation, and experience , 2004, Parallel Comput..

[17]  Tim O'Reilly,et al.  What is Web 2.0: Design Patterns and Business Models for the Next Generation of Software , 2007 .

[18]  Catherine Letondal,et al.  A Web interface generator for molecular biology programs in Unix , 2001, Bioinform..

[19]  Ian T. Foster,et al.  Globus Toolkit Version 4: Software for Service-Oriented Systems , 2005, Journal of Computer Science and Technology.

[20]  Sandeep Chandra,et al.  GAMA: grid account management architecture , 2005, First International Conference on e-Science and Grid Computing (e-Science'05).