Fault detection, prevention and recovery techniques in current Grid Workflow Systems

The workflow paradigm has been highly successful for creating Grid applications. Despite its popularity, the systems that execute workflow applications in Grid environments still cannot deliver the quality, robustness and reliability that their users require. To understand the current state of the art and the reasons behind these shortcomings, we sent a detailed questionnaire to the developers of many of the major Grid workflow systems. This paper presents the results of the questionnaire evaluation, outlines future directions, and aims to guide research towards the open issues identified in the adoption of fault tolerance techniques.