Predicting Violations of QoS Requirements in Distributed Systems

A Quality of Service (QoS) requirement refers to a non-functional requirement such as performance. A QoS management system allocates and schedules computing resources. A dynamic QoS management system is one that dynamically allocates resources to an application during its lifetime. This is usually done when the application has a QoS requirement that is not being satisfied at run-time. Ideally, the QoS management system is able to predict when the QoS requirement will be violated before it is violated. This paper describes an approach to prediction and hows shows how this was applied to a case study.

[1]  Malgorzata Steinder,et al.  The present and future of event correlation: A need for end-to-end service fault localization , 2001 .

[2]  Peter W. Glynn,et al.  Internet service performance failure detection , 1998, PERV.

[3]  Joseph L. Hellerstein,et al.  Predictive models for proactive network management: application to a production Web server , 2000, NOMS 2000. 2000 IEEE/IFIP Network Operations and Management Symposium 'The Networked Planet: Management Beyond 2000' (Cat. No.00CB37074).

[4]  Marina Thottan,et al.  Fault prediction at the network layer using intelligent agents , 1999, Integrated Network Management VI. Distributed Management for the Networked Millennium. Proceedings of the Sixth IFIP/IEEE International Symposium on Integrated Network Management. (Cat. No.99EX302).

[5]  Frank Feather,et al.  A case study of Ethernet anomalies in a distributed computing environment , 1990 .

[6]  Michael B. Jones,et al.  CPU reservations and time constraints: efficient, predictable scheduling of independent activities , 1997, SOSP.

[7]  C.S. Hood,et al.  Probabilistic network fault detection , 1996, Proceedings of GLOBECOM'96. 1996 IEEE Global Telecommunications Conference.

[8]  Shawn Ostermann,et al.  Detecting network intrusions via a statistical analysis of network packet characteristics , 2001, Proceedings of the 33rd Southeastern Symposium on System Theory (Cat. No.01EX460).

[9]  Michael Anthony Bauer,et al.  Distributed Resource Management to Support Distributed Application-Specific Quality of Service , 2001, MMNS.