Analysis of Anomalies on a Virtualized Network Testbed

To clarify useful parameters for avoiding unstable conditions in network experiments on a virtualized testbed, we used PlanetLab as the virtualized testbed and measured network throughput using a combination of probe and data transfers. Although PlanetLab has been widely used as a testbed for overlay networks, distributed systems, and network measurement, it is provided as a virtualized environment to users. A set of these environments on different nodes is called as ‘slice’, and multiple slices run simultaneously on each node. We found that network throughput was occasionally decreased even though the network condition was stable. The cause of the throughput decrease was an unintended large packet spacing. The unintended large packet spacing is an anomaly. Although the cause of the anomaly is known to be unstable CPU scheduling latency, no clear conditions for anomaly avoidance had previously been given. We investigated throughput measurement with resource monitoring to clarify anomaly avoidance conditions. When the CPUs at a node are shared by many slices, slices are frequently scheduled off the CPUs, and the anomaly occurs. If network throughput is decreased by the anomaly, the measurement results should be discarded.

[1]  David E. Culler,et al.  Operating Systems Support for Planetary-Scale Network Services , 2004, NSDI.

[2]  Peter A. Dinda,et al.  Characterizing and Predicting TCP Throughput on the Wide Area Network , 2005, 25th IEEE International Conference on Distributed Computing Systems (ICDCS'05).

[3]  Rocco Aversa,et al.  Web Services Resilience Evaluation using LDS Load dependent Server Models , 2010, J. Commun..

[4]  Larry L. Peterson,et al.  Using PlanetLab for network research: myths, realities, and best practices , 2005, OPSR.

[5]  Larry L. Peterson,et al.  Experiences building PlanetLab , 2006, OSDI '06.

[6]  Richard Wolski,et al.  The network weather service: a distributed resource performance forecasting service for metacomputing , 1999, Future Gener. Comput. Syst..

[7]  Richard Wolski,et al.  Multivariate Resource Performance Forecasting in the Network Weather Service , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[8]  Paul Barford,et al.  An active measurement system for shared environments , 2007, IMC '07.

[9]  Jennifer M. Schopf,et al.  Predicting sporadic grid data transfers , 2002, Proceedings 11th IEEE International Symposium on High Performance Distributed Computing.

[10]  Brian D. Noble,et al.  The end-to-end performance effects of parallel TCP sockets on a lossy wide-area network , 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.