Autonomic QoS-Aware resource management in grid computing using online performance models

As Grid Computing increasingly enters the commercial domain, performance and Quality of Service (QoS) issues are becoming a major concern. The inherent complexity, heterogeneity and dynamics of Grid computing environments pose some challenges in managing their capacity to ensure that QoS requirements are continuously met. In this paper, an approach to autonomic QoS-aware resource management in Grid computing based on online performance models is proposed. The paper presents a novel methodology for designing autonomic QoS-aware resource managers that have the capability to predict the performance of the Grid components they manage and allocate resources in such a way that service level agreements are honored. The goal is to make the Grid middleware self-configurable and adaptable to changes in the system environment and workload. The approach is subjected to an extensive experimental evaluation in the context of a real-world Grid environment and its effectiveness, practicality and performance are demonstrated.

[1]  Falko Bause,et al.  "QN + PN = QPN" - Combining Queueing Networks and Petri Nets , 1993 .

[2]  Peter Buchholz,et al.  Queueing Petri Nets with Product Form Solution , 1998, Perform. Evaluation.

[3]  Klara Nahrstedt,et al.  A distributed resource management architecture that supports advance reservations and co-allocation , 1999, 1999 Seventh International Workshop on Quality of Service. IWQoS'99. (Cat. No.98EX354).

[4]  Carla E. Brodley,et al.  Predictive application-performance modeling in a computational grid environment , 1999, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469).

[5]  Ian Foster,et al.  A quality of service architecture that combines resource reservation and application adaptation , 2000, 2000 Eighth International Workshop on Quality of Service. IWQoS 2000 (Cat. No.00EX400).

[6]  Francisco Curbera,et al.  Web services description language (wsdl) version 1. 2 , 2001 .

[7]  Ian T. Foster,et al.  The anatomy of the grid: enabling scalable virtual organizations , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.

[8]  G. Laszewski,et al.  A QoS Guided Scheduling Algorithm for Grid Computing * , 2002 .

[9]  C. Evertsson Performance by design , 2002 .

[10]  Ian T. Foster,et al.  Grid Services for Distributed System Integration , 2002, Computer.

[11]  Ming Wu,et al.  Grid Harvest Service: a system for long-term, application-level task scheduling , 2003, Proceedings International Parallel and Distributed Processing Symposium.

[12]  Karim Djemame,et al.  Adaptive grid resource brokering , 2003, 2003 Proceedings IEEE International Conference on Cluster Computing.

[13]  R. V. van Nieuwpoort,et al.  The Grid 2: Blueprint for a New Computing Infrastructure , 2003 .

[14]  Daniel A. Menascé,et al.  A framework for resource allocation in grid computing , 2004, The IEEE Computer Society's 12th Annual International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunications Systems, 2004. (MASCOTS 2004). Proceedings..

[15]  Daniel A. Menascé,et al.  A framework for QoS-aware software components , 2004, WOSP '04.

[16]  Klara Nahrstedt,et al.  QoS and Contention-Aware Multi-Resource Reservation , 2004, Cluster Computing.

[17]  Kaizar Amin,et al.  Analysis and Provision of QoS for Distributed Grid Applications , 2004, Journal of Grid Computing.

[18]  D. Walker,et al.  A model for quality-of-service provision in service oriented architectures , 2005 .

[19]  Daniel A. Menascé,et al.  Resource Allocation for Autonomic Data Centers using Analytic Performance Models , 2005, Second International Conference on Autonomic Computing (ICAC'05).

[20]  Daniel A. Menascé,et al.  On the Use of Online Analytic Performance Models, in Self-Managing and Self-Organizing Computer Systems , 2005, Self-star Properties in Complex Information Systems.

[21]  Asser N. Tantawi,et al.  Performance management for cluster-based web services , 2005, IEEE Journal on Selected Areas in Communications.

[22]  Rajarshi Das,et al.  Utility-Function-Driven Resource Allocation in Autonomic Systems , 2005, Second International Conference on Autonomic Computing (ICAC'05).

[23]  Asser N. Tantawi,et al.  Modeling Differentiated Services of Multi-Tier Web Applications , 2006, 14th IEEE International Symposium on Modeling, Analysis, and Simulation.

[24]  Samuel Kounev,et al.  Performance Modeling and Evaluation of Distributed Component-Based Systems Using Queueing Petri Nets , 2006, IEEE Transactions on Software Engineering.

[25]  Samuel Kounev,et al.  SimQPN - A tool and methodology for analyzing queueing Petri net models by means of simulation , 2006, Perform. Evaluation.

[26]  Jeffrey S. Chase,et al.  Learning Application Models for Utility Resource Planning , 2006, 2006 IEEE International Conference on Autonomic Computing.

[27]  Ian T. Foster,et al.  Globus Toolkit Version 4: Software for Service-Oriented Systems , 2005, Journal of Computer Science and Technology.

[28]  Rajarshi Das,et al.  A Hybrid Reinforcement Learning Approach to Autonomic Resource Allocation , 2006, 2006 IEEE International Conference on Autonomic Computing.

[29]  Constantin Adam,et al.  A middleware design for large-scale clusters offering multiple services , 2006, IEEE Transactions on Network and Service Management.

[30]  Jordi Torres,et al.  Should the grid middleware look to self-managing capabilities? , 2007, Eighth International Symposium on Autonomous Decentralized Systems (ISADS'07).