SOAs for scientific applications: Experiences and challenges

Over the past several years, with the advent of the Open Grid Services Architecture (OGSA) [I. Foster, C. Kesselman, J. Nick, S. Tuecke, Grid Services for Distributed System Integration, Computer 35 (6) (2002)] and the Web Services Resource Framework (WSRF) [K. Czajkowski, et al., WS-resource framework. http://www-106.ibm.com/developerworks/library/ws-resource/ws-wsrf.pdf, 2004. [25]], Service-oriented Architectures (SOA) and Web service technologies have been embraced in the field of scientific and Grid computing. These new principles promise to help make scientific infrastructures simpler to use, more cost effective to implement, and easier to maintain. However, understanding how to leverage these developments to actually design and build a system remains more of an art than a science. In this paper, we present some positions learned through experience, that provide guidance in leveraging SOA technologies to build scientific infrastructures. In addition, we present the technical challenges that need to be addressed in building an SOA, and as a case study, we present the SOA that we have designed for the National Biomedical Computation Resource (NBCR) [The National Biomedical Computation Resource (NBCR). http://nbcr.net/] community. We discuss how we have addressed these technical challenges, and present the overall architecture, the individual software toolkits developed, the client interfaces, and the usage scenarios. We hope that our experiences prove to be useful in building similar infrastructures for other scientific applications.

[1]  Anupriya Ankolekar,et al.  Automated discovery, interaction and composition of Semantic Web services , 2003, J. Web Semant..

[2]  Larry Rudolph,et al.  Job Scheduling Strategies for Parallel Processing: IPPS/SPDP'98 Workshop, Orlando, Florida, USA, March 30, 1998 Proceedings , 1998 .

[3]  Kim K. Baldridge,et al.  An end-to-end Web services-based infrastructure for biomedical applications , 2005, The 6th IEEE/ACM International Workshop on Grid Computing, 2005..

[4]  Ian T. Foster,et al.  A security architecture for computational grids , 1998, CCS '98.

[5]  Wendy Hall,et al.  The Semantic Web Revisited , 2006, IEEE Intelligent Systems.

[6]  Ákos Frohner,et al.  From gridmap-file to VOMS: managing authorization in a Grid environment , 2005, Future Gener. Comput. Syst..

[7]  Roy Fielding,et al.  Architectural Styles and the Design of Network-based Software Architectures"; Doctoral dissertation , 2000 .

[8]  Kohei Ichikawa,et al.  OPAL OP: AN EXTENSIBLE GRID-ENABLING WRAPPING TOOL FOR LEGACY APPLICATIONS , 2007, Grid 2007.

[9]  Daniel Atkins,et al.  Revolutionizing Science and Engineering Through Cyberinfrastructure: Report of the National Science Foundation Blue-Ribbon Advisory Panel on Cyberinfrastructure , 2003 .

[10]  Eduardo Huedo,et al.  A modular meta-scheduling architecture for interfacing with pre-WS and WS Grid resource management services , 2007, Future Gener. Comput. Syst..

[11]  Ian T. Foster,et al.  A community authorization service for group collaboration , 2002, Proceedings Third International Workshop on Policies for Distributed Systems and Networks.

[12]  Warren Smith,et al.  A Resource Management Architecture for Metacomputing Systems , 1998, JSSPP.

[13]  Ian T. Foster,et al.  Globus: a Metacomputing Infrastructure Toolkit , 1997, Int. J. High Perform. Comput. Appl..

[14]  Ian T. Foster,et al.  The Anatomy of the Grid: Enabling Scalable Virtual Organizations , 2001, Int. J. High Perform. Comput. Appl..

[15]  William E. Allcock,et al.  The Globus Striped GridFTP Framework and Server , 2005, ACM/IEEE SC 2005 Conference (SC'05).

[16]  Ian T. Foster,et al.  Grid Services for Distributed System Integration , 2002, Computer.

[17]  Bertram Ludäscher,et al.  Kepler: an extensible system for design and execution of scientific workflows , 2004 .

[18]  Kim K. Baldridge,et al.  Opal: SimpleWeb Services Wrappers for Scientific Applications , 2006, 2006 IEEE International Conference on Web Services (ICWS'06).

[19]  Kim K. Baldridge,et al.  GEMSTONE: GRID ENABLED MOLECULAR SCIENCE THROUGH ONLINE NETWORKED ENVIRONMENTS , 2006 .

[20]  Xiaohui Wei,et al.  Providing Dynamic Virtualized Access to Grid Resources via the Web 2.0 Paradigm , 2007, GCE 2007.

[21]  Joel H. Saltz,et al.  caGrid 1.0: A Grid Enterprise Architecture for Cancer Research , 2007, AMIA.

[22]  Philip M. Papadopoulos,et al.  NPACI Rocks: tools and techniques for easily deploying manageable Linux clusters , 2003, Concurr. Comput. Pract. Exp..

[23]  Xiaohui Wei,et al.  CSF4: A WSRF Compliant Meta-Scheduler , 2006, GCA.

[24]  I. Foster,et al.  Service-Oriented Science , 2005, Science.

[25]  Sandeep Chandra,et al.  GAMA: grid account management architecture , 2005, First International Conference on e-Science and Grid Computing (e-Science'05).

[26]  Kim K. Baldridge,et al.  Incorporation of middleware and grid technologies to enhance usability in Computational Chemistry applications , 2004, Future Gener. Comput. Syst..