SLAC has been studying end-to-end WAN bandwidth availability and achievability for 2.5 years via IEPM-BW [1]. IEPM-BW performs network intensive tests every 90 minutes. Based on that experience we have also developed a light weight available bandwidth (ABwE [2]) measurement tool that can make a measurement within a second. We are now extending this to a WAN measurement and detection system (IEPM-LITE) aimed at more quickly detecting and troubleshooting network performance problems and also to be more friendly on lower performance paths. IEPM-LITE uses ping, forward traceroutes, and ABwE sensors to monitor, in close to real-time, Round Trip Times (RTT), changes in available bandwidth and routes to and from target hosts. This paper discusses the experiences, techniques and algorithms used to detect and report on significant traceroute and bandwidth changes. The ultimate aim is to develop a lightweight WAN network performance monitoring system that can detect, in near real time, significant changes and generate alerts.
[1]
Hans-Werner Braun,et al.
The NLANR network analysis infrastructure
,
2000,
IEEE Commun. Mag..
[2]
C. A. Logg.
Experiences and Results from a New High Performance Network and Application Monitoring Toolkit
,
2003
.
[3]
Richard A. Davis,et al.
Introduction to time series and forecasting
,
1998
.
[4]
Christophe Diot,et al.
Diagnosing network-wide traffic anomalies
,
2004,
SIGCOMM.
[5]
Connie Logg,et al.
Correlating Internet Performance Changes and Route Changes to Assist in Trouble-Shooting from an End-User Perspective
,
2004,
PAM.
[6]
Jiri Navratil,et al.
ABwE :A Practical Approach to Available Bandwidth Estimation
,
2002
.
[7]
Mark Crovella,et al.
Diagnosing network-wide traffic anomalies
,
2004,
SIGCOMM '04.