A scalable monitoring approach for service level agreements validation

In order to detect violations of end-to-end service level agreements (SLA) and to isolate trouble links and nodes based on a unified framework, managers of a service provider network need to gather quality of service (QoS) measurements from multiple nodes in the network. For a network carrying over thousands of flows with end-to-end SLAs, the information exchanged between network nodes and a central network management system (NMS) could be substantial. Moreover in situations where only, a small number of flows violate their respective SLAs, simple polling mechanisms can lead to huge unnecessary overhead in identifying these ill-behaved flows. We propose an algorithm called (ARM) (Aggregation and Refinement based Monitoring) to reduce the amount of information exchange. (ARM) uses a histogram-based dynamic QoS data aggregation/refinement technique at each network node and a reasoning engine at the NMS to minimized the amount of data exchange between network nodes and NMS. (ARM) not only reduces unnecessary reporting through selective refinement, it also performs well across a wide range of traffic loads. Our simulation results show that (ARM) is at least an order of magnitude more efficient than a simple polling scheme. It also outperforms two centralized highly optimized schemes that cannot be implemented in practice.

[1]  Danny Raz,et al.  Minimizing the monitoring cost in network management , 1999, Integrated Network Management VI. Distributed Management for the Networked Millennium. Proceedings of the Sixth IFIP/IEEE International Symposium on Integrated Network Management. (Cat. No.99EX302).

[2]  Keith McCloghrie,et al.  Protocol Operations for version 2 of the Simple Network Management Protocol (SNMPv2) , 1993, RFC.

[3]  Jeffrey D. Case,et al.  Simple Network Management Protocol (SNMP) , 1989, RFC.

[4]  Debasis Mitra,et al.  Design of generalized processor sharing schedulers which statistically multiplex heterogeneous QoS classes , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[5]  Steven Waldbusser Remote Network Monitoring Management Information Base Version 2 using SMIv2 , 1997, RFC.

[6]  Vern Paxson,et al.  End-to-end Internet packet dynamics , 1997, SIGCOMM '97.

[7]  Steven Waldbusser Remote Network Monitoring Management Information Base , 1991, RFC.

[8]  Thomas M. Chen,et al.  INQIRE: a software approach to monitoring QoS in ATM networks , 1998 .

[9]  Bernd J. Krämer,et al.  Agent Based Management of Distributed Systems with Variabel Polling Frequency Policies , 1997, Integrated Network Management.

[10]  Jeffrey D. Case,et al.  Simple Network Management Protocol (SNMP) , 1990, RFC.

[11]  Kenneth D. White Definitions of Managed Objects for Service Level Agreements Performance Monitoring , 2000, RFC.

[12]  Vern Paxson,et al.  Framework for IP Performance Metrics , 1998, RFC.

[13]  Aurel A. Lazar,et al.  Objective-Driven Monitoring for Broadband Networks , 1996, IEEE Trans. Knowl. Data Eng..

[14]  David Tse,et al.  A time-scale decomposition approach to measurement-based admission control , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[15]  Abhay Parekh,et al.  A generalized processor sharing approach to flow control in integrated services networks: the multiple node case , 1994, TNET.

[16]  Keizo Sugiyama,et al.  Dynamic polling scheme based on time variation of network management information values , 1999, Integrated Network Management VI. Distributed Management for the Networked Millennium. Proceedings of the Sixth IFIP/IEEE International Symposium on Integrated Network Management. (Cat. No.99EX302).

[17]  Gordon B. Davis,et al.  IFIP , 1997, DATB.