On-Line Monitoring of Service-Level Agreements in the Grid

Monitoring of Service Level Agreements is a crucial phase of SLA management. In the most challenging case, monitoring of SLA fulfillment is required in (near) real-time and needs to combine performance data regarding multiple distributed services and resources. Currently existing Grid monitoring and information services do not provide adequate on-line monitoring capabilities to fulfill this case. We present an application of Complex Event Processing principles and technologies for on-line SLA monitoring in the Grid. The capabilities of the presented SLA monitoring framework include (1) on-demand definition of SLA metrics using a high-level query language; (2) real-time calculation of the defined SLA metrics; (3) advanced query capabilities which allow for defining high-level complex metrics derived from basic metrics. SLA monitoring of data-intensive grid jobs serves as a case study to demonstrate the capabilities of the approach.

[1]  Alessandra Gorla,et al.  Achieving Cost-Effective Software Reliability Through Self-Healing , 2010, Comput. Informatics.

[2]  Wolfgang Emmerich,et al.  Efficient online monitoring of web-service SLAs , 2008, SIGSOFT '08/FSE-16.

[3]  Akhil Sahai,et al.  Specifying and monitoring guarantees in commercial grids through SLA , 2003, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings..

[4]  Schahram Dustdar,et al.  Comprehensive QoS monitoring of Web services and event-based SLA violation detection , 2009, MWSOC '09.

[5]  Hong Linh Truong,et al.  Towards a Framework for Monitoring and Analyzing QoS Metrics of Grid Services , 2006, 2006 Second IEEE International Conference on e-Science and Grid Computing (e-Science'06).

[6]  Helen Wright,et al.  Steering and visualization: Enabling technologies for computational science , 2010, Future Gener. Comput. Syst..

[7]  Bernd Freisleben,et al.  A Streaming Intrusion Detection System for Grid Computing Environments , 2009, 2009 11th IEEE International Conference on High Performance Computing and Communications.

[8]  Marian Bubak,et al.  Perspectives on grid computing , 2010, Future Gener. Comput. Syst..

[9]  Thomas Fahringer,et al.  SCALEA-G: A unified monitoring and performance analysis system for the grid , 2004 .

[10]  Marian Bubak,et al.  Processing moldable tasks on the grid: Late job binding with lightweight user-level overlay , 2011, Future Gener. Comput. Syst..

[11]  Dimosthenis Kyriazis,et al.  Real-time reconfiguration for guaranteeing QoS provisioning levels in Grid environments , 2009, Future Gener. Comput. Syst..

[12]  Sotirios Chatzis,et al.  Managing service level agreement contracts in OGSA-based Grids , 2008, Future Gener. Comput. Syst..

[13]  Bartosz Balis,et al.  Real-time Grid monitoring based on complex event processing , 2011, Future Gener. Comput. Syst..