CHANDRA, SANDEEP. Service Based Support for Scientific Workflows (Under the direction of Dr. Mladen A. Vouk) A Problem Solving Environment (PSE) is a computer-based system that provides all the computational facilities necessary to solve a target class of problems, that is, it efficiently supports a specific set of scientific workflows. A special class of PSE’s are those that rely on networks. Network-based PSEs are collections of distributed applications, interfaces, libraries, databases, programs, tools, clients, and intelligent agents, which facilitate user interaction and cooperative execution of the components charged with the solution tasks. Thus, the need for effective and efficient communication among the PSE components is obvious. This has resulted in a proliferation of communication building blocks, or middleware, for distributed scientific computing and problem solving. The most recent, and quite promising, option is the availability of network-based services, sometimes called Web Services. This work is concerned with evaluation of the feasibility, usability and effectiveness of service-based support for scientific workflows. A successful open source proof of concept architecture and framework was developed and assessed using a Bioinformatics scientific problem solving workflow. Access to the data, computations, and user interfaces is based on the services architecture and standards such as XML for data descriptions, WSDL for service descriptions, SOAP for service delivery, and UDDI for service registration and brokering. This service-oriented approach facilitates integration of disparate data acquisition and analysis applications that participate in complex scientific problem solving processes. The developed framework, called Scientific Data Management Service Workflow System, or SDMSWS, was found to be sufficiently flexible and versatile that it was possible to effect appliance-like composition and use of standard-conforming workflow services. An open-source solution, such as SDMSWS, enables services from different organizations, platforms and domains to interoperate seamlessly through standard interoperability protocols. This is contrasted with similar systems that implement proprietary service solutions. In order to compare SDMSWS with other systems that support scientific problem solving workflows, it was necessary to develop a set of measures. They include measures related to end-user issues, workflow issues, services issues, networking issues, and a variety of more general issues. The selection and the definition of the metrics, and the rational behind them, are discussed. The comparative analysis reveals that its open source nature has some distinct advantages. For example, the framework minimizes the customization and integration of new component workflow services, and it shows that, unlike commercial solutions, it is relatively independent of the service domains and can incorporate any network-based service that conforms to standard Web Services description and protocols. It is the finding of this work that complex distributed scientific workflows can be, and should be, supported using open-source service-based solutions. However, current standards, for workflow description, interchange and execution were found wanting and further work will be needed before one can depend on them in practical scientific workflow environments. SERVICE BASED SUPPORT FOR SCIENTIFIC WORKFLOWS
[1]
Matjaz B. Juric,et al.
Business process execution language for web services
,
2004
.
[2]
E. Gallopoulos,et al.
Computer as thinker/doer: problem-solving environments for computational science
,
1994,
IEEE Computational Science and Engineering.
[3]
Ken Brodlie,et al.
GRASPARC-A problem solving environment integrating computation and visualization
,
1993,
Proceedings Visualization '93.
[4]
S. Krishnan,et al.
2 XLANG : Web Services for Business Process Design
,
2002
.
[5]
Daniel C. Stanzione,et al.
Coven-a framework for high performance problem solving environments
,
2002,
Proceedings 11th IEEE International Symposium on High Performance Distributed Computing.
[6]
Gregor von Laszewski,et al.
Designing Grid-based problem solving environments and portals
,
2001,
Proceedings of the 34th Annual Hawaii International Conference on System Sciences.
[7]
Steven Tuecke,et al.
The Physiology of the Grid An Open Grid Services Architecture for Distributed Systems Integration
,
2002
.
[8]
John R. Rice,et al.
Enabling Technologies for Computational Science: Frameworks, Middleware and Environments
,
2012
.
[9]
James R. Rice,et al.
From Scientific Software Libraries to Problem Solving Environments John R. Rice
,
1996
.
[10]
Jonathan Pevsner,et al.
Basic Local Alignment Search Tool (BLAST)
,
2005
.
[11]
L. Stein.
Creating a bioinformatics nation
,
2002,
Nature.
[12]
Ian Horrocks,et al.
The Semantic Web: The Roles of XML and RDF
,
2000,
IEEE Internet Comput..
[13]
Thomas E. Potok,et al.
VIPAR: Advanced Information Agents discovering knowledge in an open and changing environment
,
2003
.
[14]
Nagiza F. Samatova,et al.
An ontology-based HTML to XML conversion using intelligent agents
,
2002,
Proceedings of the 35th Annual Hawaii International Conference on System Sciences.
[15]
Dan Brickley,et al.
Resource Description Framework (RDF) Model and Syntax Specification
,
2002
.
[16]
CremonesiPaolo.
Parallel, distributed and network-based processing
,
2006
.
[17]
Hiroshi Ashida.
[What is bioinformatics?].
,
2002,
Rinsho byori. The Japanese journal of clinical pathology.
[18]
C. M. Sperberg-McQueen,et al.
Extensible Markup Language (XML)
,
1997,
World Wide Web J..
[19]
Harry G. Perros,et al.
Performance of Network-Based Problem-Solving Environments
,
2000
.
[20]
Ian T. Foster,et al.
The anatomy of the grid: enabling scalable virtual organizations
,
2001,
Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.
[21]
Ranieri Baraglia,et al.
Meta: A Web-Based Metacomputing Environment to Build a Computational Chemistry Problem Solving Environment
,
2002,
PDP.
[22]
Salah E. Elmaghraby,et al.
Activity networks: Project planning and control by network models
,
1977
.
[23]
Mladen A. Vouk,et al.
A Lightweight Software Bus for Prototyping Problem Solving Environments
,
1996
.