Reliable Scientific Service Compositions

Distributed service oriented architectures (SOAs) are increasingly used by users, who are insufficiently skilled in the art of distributed system programming. A good example are computational scientists who build large-scale distributed systems using service-oriented Grid computing infrastructures. Computational scientists use these infrastructure to build scientific applications, which are composed from basic Web services into larger orchestrations using workflow languages, such as the Business Process Execution Language. For these users reliability of the infrastructure is of significant importance and that has to be provided in the presence of hardware or operational failures. The primitives available to achieve such reliability currently leave much to be desired by users who do not necessarily have a strong education in distributed system construction. We characterise scientific service compositions and the environment they operate in by introducing the notion of global scientific BPEL workflows. We outline the threats to the reliability of such workflows and discuss the limited support that available specifications and mechanisms provide to achieve reliability. Furthermore, we propose a line of research to address the identified issues by investigating autonomic mechanisms that assist computational scientists in building, executing and maintaining reliable workflows.

[1]  Robbert van Renesse,et al.  Astrolabe: A robust and scalable technology for distributed system monitoring, management, and data mining , 2003, TOCS.

[2]  Robbert van Renesse,et al.  Navigating in the Storm: Using Astrolabe to Adaptively Configure Web Services and Their Clients , 2006, Cluster Computing.

[3]  Antony I. T. Rowstron,et al.  The IceCube approach to the reconciliation of divergent replicas , 2001, PODC '01.

[4]  Pascal Lorenz,et al.  Self-adaptability and man-in-the-loop: a dilemma in autonomic computing systems , 2004 .

[5]  Luca Cardelli Wide Area Computation , 2000, JISBD.

[6]  Bharat K. Bhargava,et al.  Ensuring relaxed atomicity for flexible transactions in multidatabase systems , 1994, SIGMOD '94.

[7]  Liang Chen,et al.  Sedna: A BPEL-Based Environment for Visual Scientific Workflow Modeling , 2007, Workflows for e-Science, Scientific Workflows for Grids.

[8]  Gustavo Alonso,et al.  Exception Handling in Workflow Management Systems , 2000, IEEE Trans. Software Eng..

[9]  Nancy A. Lynch,et al.  Impossibility of distributed consensus with one faulty process , 1983, PODS '83.

[10]  Julian Jang,et al.  Compensation is Not Enough , 2003 .

[11]  John Darlington,et al.  A Standards Based Approach to Job Submission Through Web Services , 2004 .

[12]  Gail E. Kaiser,et al.  Concurrency control in advanced database applications , 1991, CSUR.

[13]  Özgür Ulusoy,et al.  A Transaction Model for Multidatabase Systems , 1996, Euro-Par, Vol. II.

[14]  Robbert van Renesse,et al.  Adding high availability and autonomic behavior to Web services , 2004, Proceedings. 26th International Conference on Software Engineering.

[15]  Liang Chen,et al.  Grid Service Orchestration Using the Business Process Execution Language (BPEL) , 2005, Journal of Grid Computing.

[16]  Julian Jang,et al.  Compensation is not enough [fault-handling and compensation mechanism] , 2003, Seventh IEEE International Enterprise Distributed Object Computing Conference, 2003. Proceedings..

[17]  Stuart Anderson,et al.  Making autonomic computing systems accountable: the problem of human computer interaction , 2003, 14th International Workshop on Database and Expert Systems Applications, 2003. Proceedings..

[18]  Shyan-Shu Shieh,et al.  An experimental study of model predictive control based on artificial neural networks , 2003 .

[19]  Michael J. Butler,et al.  Precise Modelling of Compensating Business Transactions and its Application to BPEL , 2005, J. Univers. Comput. Sci..

[20]  Werner Vogels Tracking Service Availability in Long Running Business Activities , 2003, ICSOC.