Accurate and efficient SLA compliance monitoring

Service level agreements (SLAs) define performance guarantees made by service providers, e.g, in terms of packet loss, delay, delay variation, and network availability. In this paper, we describe a new active measurement methodology to accurately monitor whether measured network path characteristics are in compliance with performance targets specified in SLAs. Specifically, (1) we describe a new methodology for estimating packet loss rate that significantly improves accuracy over existing approaches; (2) we introduce a new methodology for measuring mean delay along a path that improves accuracy over existing methodologies, and propose a method for obtaining confidence intervals on quantiles of the empirical delay distribution without making any assumption about the true distribution of delay; (3) we introduce a new methodology for measuring delay variation that is more robust than prior techniques; and (4) we extend existing work in network performance tomography to infer lower bounds on the quantiles of a distribution of performance measures along an unmeasured path given measurements from a subset of paths. We unify active measurements for these metrics in a discrete time-based tool called SLAM. The unified probe stream from SLAM consumes lower overall bandwidth than if individual streams are used to measure path properties. We demonstrate the accuracy and convergence properties of SLAM in a controlled laboratory environment using a range of background traffic scenarios and in one- and two-hop settings, and examine its accuracy improvements over existing standard techniques.

[1]  Donald F. Towsley,et al.  Multicast-based inference of network-internal loss characteristics , 1999, IEEE Trans. Inf. Theory.

[2]  Albert Greenberg,et al.  Operations and Management of IP Networks: What Researchers Should Know , 2005 .

[3]  Xin Wang,et al.  A scalable monitoring approach for service level agreements validation , 2000, Proceedings 2000 International Conference on Network Protocols.

[4]  Arne A. Nilsson,et al.  On service level agreements for IP networks , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.

[5]  Matthew J. Zekauskas,et al.  A One-way Packet Loss Metric for IPPM , 1999, RFC.

[6]  Robert Nowak,et al.  Unicast Network Tomography using EM Algorithms , 2000 .

[7]  Jean-Chrysostome Bolot,et al.  End-to-end packet delay and loss behavior in the internet , 1993, SIGCOMM '93.

[8]  Vern Paxson,et al.  Measurements and analysis of end-to-end Internet dynamics , 1997 .

[9]  Vern Paxson,et al.  Framework for IP Performance Metrics , 1998, RFC.

[10]  Paul Barford,et al.  A Framework for Multi-Objective SLA Compliance Monitoring , 2007, IEEE INFOCOM 2007 - 26th IEEE International Conference on Computer Communications.

[11]  Bernd Girod,et al.  Adaptive playout scheduling and loss concealment for voice communication over IP networks , 2003, IEEE Trans. Multim..

[12]  Robert G. Cole,et al.  Voice over IP performance monitoring , 2001, CCRV.

[13]  Tanja Zseby,et al.  Deployment of Sampling Methods for SLA Validation with Non-Intrusive Measurements , 2002 .

[14]  Philip F. Chimento,et al.  IP Packet Delay Variation Metric for IP Performance Metrics (IPPM) , 2002, RFC.

[15]  Darryl Veitch,et al.  A Precision Infrastructure for Active Probing , 2001 .

[16]  Randy H. Katz,et al.  An algebraic approach to practical and scalable overlay network monitoring , 2004, SIGCOMM 2004.

[17]  Paul Barford,et al.  A geometric approach to improving active packet loss measurement , 2008, IEEE/ACM Trans. Netw..

[18]  Donald F. Towsley,et al.  Multicast-based inference of network-internal delay distributions , 2002, TNET.

[19]  Randy H. Katz,et al.  An algebraic approach to practical and scalable overlay network monitoring , 2004, SIGCOMM '04.

[20]  Al Morton,et al.  Standardized active measurements on a tier 1 IP backbone , 2003, IEEE Commun. Mag..

[21]  Vern Paxson,et al.  IPPM Metrics for Measuring Connectivity , 1999, RFC.

[22]  Matthew J. Zekauskas,et al.  A One-way Delay Metric for IPPM , 1999, RFC.

[23]  Paul Barford,et al.  Self-configuring network traffic generation , 2004, IMC '04.

[24]  Zhi-Li Zhang,et al.  Practical delay monitoring for ISPs , 2005, CoNEXT '05.

[25]  Donald F. Towsley,et al.  Measurement and modelling of the temporal dependence in packet loss , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[26]  Paul Barford,et al.  Scalable network path emulation , 2005, 13th IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems.

[27]  Paul Barford,et al.  Improving accuracy in end-to-end packet loss measurement , 2005, SIGCOMM '05.

[28]  Yin Zhang,et al.  On the constancy of internet path properties , 2001, IMW '01.

[29]  SommersJoel,et al.  Improving accuracy in end-to-end packet loss measurement , 2005 .

[30]  Masaki Aida,et al.  A scalable and lightweight QoS monitoring technique combining passive and active approaches , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[31]  Matthew Roughan,et al.  Fundamental bounds on the accuracy of network performance measurements , 2005, SIGMETRICS '05.

[32]  Randy H. Katz,et al.  Tomography-based overlay network monitoring , 2003, IMC '03.

[33]  J. Sommers,et al.  A Geometric Approach to Improving Active Packet Loss Measurement , 2008, IEEE/ACM Transactions on Networking.

[34]  Henning Schulzrinne,et al.  RTP: A Transport Protocol for Real-Time Applications , 1996, RFC.

[35]  Donald F. Towsley,et al.  Inferring link loss using striped unicast probes , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[36]  Paul Barford,et al.  Comparing probe-and router-based packet-loss measurement , 2004, IEEE Internet Computing.

[37]  D. Veitch,et al.  A User Friendly TSC Clock , 2006 .

[38]  Nick G. Duffield,et al.  Network Tomography of Binary Network Performance Characteristics , 2006, IEEE Transactions on Information Theory.

[39]  Dogu Arifler,et al.  Network tomography based on flow level measurements , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.