Risk-aware Migrations For Prepossessing SLAs

SLAs were developed in order to guarantee the customer's desired quality of service. To prepossess SLAs even in the case of system failures, migrating the job to an alternative resource is a well-known fault-tolerance mechanism. In this paper we start to consider migrations in a risk-aware concept. We plan to introduce risk assessment and management technologies into the grid fabric in order to ensure prepossessing SLAs. The most benefits are seen in a risk-aware scheduling and initiating precautionary fault-tolerance mechanism. This paper focusses on precautionary migrations which should prevent an SLA violation. A motivating scenario presents the variety of required actions in a system with high workload for several migration alternatives. The important aspects of jobs and resources are explained. Furthermore, we present a measurement to estimate the effects of migrating to an alternative resource. This will be one decision criteria in the migration process. Future work will complete the risk-aware scheduling of migrations

[1]  Klara Nahrstedt,et al.  A distributed resource management architecture that supports advance reservations and co-allocation , 1999, 1999 Seventh International Workshop on Quality of Service. IWQoS'99. (Cat. No.98EX354).

[2]  Christopher J. Alberts,et al.  Continuous Risk Management Guidebook. , 1996 .

[3]  Ian T. Foster,et al.  SNAP: A Protocol for Negotiating Service Level Agreements and Coordinating Resource Management in Distributed Systems , 2002, JSSPP.

[4]  Asit Dan,et al.  Web services agreement specification (ws-agreement) , 2004 .

[5]  Matthias Hovestadt Fault Tolerance Mechanisms for SLA-aware Resource Management , 2005, 11th International Conference on Parallel and Distributed Systems (ICPADS'05).

[6]  Akhil Sahai,et al.  Specifying and monitoring guarantees in commercial grids through SLA , 2003, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings..