Memory balancing for large-scale network simulation in power -law networks

Kim, HyoJeong. Ph.D., Purdue University, December 2008. Memory Balancing for Large-scale Network Simulation in Power-law Networks. Major Professor: Kihong Park. Large-scale network simulation has grown in importance due to a rapid increase in Internet size and the availability of Internet measurement topologies with applications to computer networks and network security. A key obstacle to large-scale network simulation over PC clusters is the memory balancing problem, where a memoryoverloaded machine can slow down a distributed simulation due to disk I/O overhead. Network partitioning methods for parallel and distributed simulation are insufficiently equipped to handle new challenges brought on by memory balancing due to their focus on CPU and communication balancing. This dissertation studies memory balancing for large-scale network simulation in power-law networks over PC clusters. First, we design and implement a measurement subsystem for dynamically tracking memory consumption in DaSSFNet, a distributed network simulator. Accurate monitoring of memory consumption is difficult due to complex protocol interaction through which message related events are created and destroyed inside and outside a simulation kernel. Second, we achieve efficient memory cost monitoring by tackling the problem of estimating peak memory consumption of a group of simulated network nodes in power-law topologies during network partitioning. In contrast to CPU balancing where the processing cost of a group of nodes is proportional to their sum, in memory balancing this closure property need not hold. Power-law connectivity injects additional complications due to skews in resource consumption across network nodes. Third, we show that the maximum memory cost metric outperforms the total cost metric for memory balancing under multilevel rexv cursive partitioning but the opposite holds for CPU balancing. We show that the trade-off can be overcome through joint memory-CPU balancing—in general not feasible due to constraint conflicts—which is enabled by network simulation having a tendency to induce correlation between memory and CPU costs. Fourth, we evaluate memory balancing in the presence of virtual memory (VM) management which admits larger problem instances to be run over limited physical memory. VM introduces complex memory management dependencies that make understanding and evaluating simulation performance difficult. We provide a performance evaluation framework wherein the impact of memory thrashing in distributed network simulation is incorporated which admits quantitative performance comparison and diagnosis. Fifth, we show that improved memory balancing under the maximum cost metric in the presence of VM manifests as faster completion time compared to the total cost metric despite the CPU balancing advantage of the latter. In the cases where the CPU balancing advantage of the total cost metric is strong, we show that joint memory-CPU balancing can achieve the best of both worlds. We carry out performance evaluation using benchmark applications with varying traffic characteristics: BGP routing, worm propagation under local and global scanning, and distributed client/server system. We use a testbed of 32 Intel x86 machines running a measurement-enhanced DaSSFNet over Linux.

[1]  Walter Willinger,et al.  On the Self-Similar Nature of Ethernet Traffic ( extended version ) , 1995 .

[2]  John S. Heidemann,et al.  Minimizing routing state for light-weight network simulation , 2001, MASCOTS 2001, Proceedings Ninth International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems.

[3]  Wei-Min Yao,et al.  Downscaling Network Scenarios with Denial of Service (DoS) Attacks , 2008, 2008 IEEE Sarnoff Symposium.

[4]  Stephen John Turner,et al.  Load balancing for conservative simulation on shared memory multiprocessor systems , 2000, Proceedings Fourteenth Workshop on Parallel and Distributed Simulation.

[5]  Ramesh Govindan,et al.  Performance preserving network downscaling , 2005, 38th Annual Simulation Symposium.

[6]  Pei Cao,et al.  Adaptive page replacement based on memory reference behavior , 1997, SIGMETRICS '97.

[7]  Roger Wattenhofer,et al.  The impact of Internet policy and topology on delayed routing convergence , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[8]  Walter Willinger,et al.  A first-principles approach to understanding the internet's router-level topology , 2004, SIGCOMM '04.

[9]  Jelena Mirkovic,et al.  Distributed worm simulation with a realistic Internet model , 2005, Workshop on Principles of Advanced and Distributed Simulation (PADS'05).

[10]  L. Iftode,et al.  Memory servers for multicomputers , 1993, Digest of Papers. Compcon Spring.

[11]  Li Xiao,et al.  Improving distributed workload performance by sharing both CPU and memory resources , 2000, Proceedings 20th IEEE International Conference on Distributed Computing Systems.

[12]  George Karypis,et al.  Multilevel k-way Partitioning Scheme for Irregular Graphs , 1998, J. Parallel Distributed Comput..

[13]  Farnam Jahanian,et al.  Internet routing instability , 1997, SIGCOMM '97.

[14]  Mostafa H. Ammar,et al.  Enabling large-scale multicast simulation by reducing memory requirements , 2003, Seventeenth Workshop on Parallel and Distributed Simulation, 2003. (PADS 2003). Proceedings..

[15]  George F. Riley,et al.  The Georgia Tech Network Simulator , 2003, MoMeTools '03.

[16]  Amin Vahdat,et al.  Toward scaling network emulation using topology partitioning , 2003, 11th IEEE/ACM International Symposium on Modeling, Analysis and Simulation of Computer Telecommunications Systems, 2003. MASCOTS 2003..

[17]  David M. Nicol,et al.  Composite Synchronization in Parallel Discrete-Event Simulation , 2002, IEEE Trans. Parallel Distributed Syst..

[18]  Shahid H. Bokhari,et al.  Efficient Algorithms for a Class of Partitioning Problems , 1995, IEEE Trans. Parallel Distributed Syst..

[19]  David M. Nicol,et al.  Learning not to share , 2001, Workshop on Parallel and Distributed Simulation.

[20]  Samson Lee,et al.  Performance benchmark of a parallel and distributed network simulator , 2005, Workshop on Principles of Advanced and Distributed Simulation (PADS'05).

[21]  M. Newman,et al.  The structure of scientific collaboration networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Mostafa H. Ammar,et al.  BencHMAP: benchmark-based, hardware and model-aware partitioning for parallel and distributed network simulation , 2004, The IEEE Computer Society's 12th Annual International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunications Systems, 2004. (MASCOTS 2004). Proceedings..

[23]  Martin G. Everett,et al.  JOSTLE: Partitioning of Unstructured Meshes for Massively Parallel Machines , 1995 .

[24]  Richard T. Mills,et al.  Dynamic adaptation to cpu and memory load in scientific applications , 2004 .

[25]  Azer Bestavros,et al.  Self-similarity in World Wide Web traffic: evidence and possible causes , 1996, SIGMETRICS '96.

[26]  Hirozumi Yamaguchi,et al.  Message from Program Co-chairs , 2003, PADS.

[27]  Alan M. Frieze,et al.  Random graphs , 2006, SODA '06.

[28]  Xenofontas A. Dimitropoulos,et al.  Large-scale simulation models of BGP , 2004, The IEEE Computer Society's 12th Annual International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunications Systems, 2004. (MASCOTS 2004). Proceedings..

[29]  Of references. , 1966, JAMA.

[30]  Andrei Z. Broder,et al.  Graph structure in the Web , 2000, Comput. Networks.

[31]  J. Ramanujam,et al.  Cluster partitioning approaches to mapping parallel programs onto a hypercube , 1987, Parallel Comput..

[32]  George F. Riley,et al.  Performance of routing protocols in very large-scale mobile wireless ad hoc networks , 2005, 13th IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems.

[33]  Qi He,et al.  Mapping peer behavior to packet-level details: a framework for packet-level simulation of peer-to-peer systems , 2003, 11th IEEE/ACM International Symposium on Modeling, Analysis and Simulation of Computer Telecommunications Systems, 2003. MASCOTS 2003..

[34]  Heejo Lee,et al.  On the effectiveness of route-based packet filtering for distributed DoS attack prevention in power-law internets , 2001, SIGCOMM '01.

[35]  Michalis Faloutsos,et al.  On power-law relationships of the Internet topology , 1999, SIGCOMM '99.

[36]  Heejo Lee,et al.  On the effectiveness of route-based packet filtering for distributed DoS attack prevention in power-law internets , 2001, SIGCOMM 2001.

[37]  Hao Wu,et al.  Large-scale network simulation: how big? how fast? , 2003, 11th IEEE/ACM International Symposium on Modeling, Analysis and Simulation of Computer Telecommunications Systems, 2003. MASCOTS 2003..

[38]  Abhijit Bose,et al.  Delayed Internet routing convergence , 2000, SIGCOMM.

[39]  Alfred J. Lotka,et al.  The frequency distribution of scientific productivity , 1926 .

[40]  Hiroyuki Ohsaki,et al.  Quasi-Dynamic Network Model Partition Method for Accelerating Parallel Network Simulation , 2006, 14th IEEE International Symposium on Modeling, Analysis, and Simulation.

[41]  Jim Griffioen,et al.  A New Design for Distributed Systems: The Remote Memory Model , 1990, USENIX Summer.

[42]  Bruce Hendrickson,et al.  An empirical study of static load balancing algorithms , 1994, Proceedings of IEEE Scalable High Performance Computing Conference.

[43]  Yu Liu,et al.  Genesis: A scalable distributed system for large-scale parallel network simulation , 2006, Comput. Networks.

[44]  Ralf Diekmann,et al.  PARTY - A Software Library for Graph Partitioning , 1997 .

[45]  Andy Oram,et al.  Understanding the Linux Kernel, Second Edition , 2002 .

[46]  Vipin Kumar,et al.  A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs , 1998, SIAM J. Sci. Comput..

[47]  Li Xiao,et al.  Dynamic Cluster Resource Allocations for Jobs with Known and Unknown Memory Demands , 2002, IEEE Trans. Parallel Distributed Syst..

[48]  Shashi Phoha,et al.  Path Preserving Scale Down for Validation of Internet Inter-Domain Routing Protocols , 2006, Proceedings of the 2006 Winter Simulation Conference.

[49]  Vern Paxson,et al.  End-to-end routing behavior in the Internet , 1996, TNET.

[50]  George Karypis,et al.  Multilevel algorithms for partitioning power-law graphs , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.

[51]  Daniel Pierre Bovet,et al.  Understanding the Linux Kernel , 2000 .

[52]  P. L'Ecuyer,et al.  Panel: strategic directions in simulation research , 1999, WSC'99. 1999 Winter Simulation Conference Proceedings. 'Simulation - A Bridge to the Future' (Cat. No.99CH37038).

[53]  Peter J. Denning,et al.  Working Sets Past and Present , 1980, IEEE Transactions on Software Engineering.

[54]  Bruce Hendrickson,et al.  The Chaco user`s guide. Version 1.0 , 1993 .

[55]  Xenofontas A. Dimitropoulos,et al.  Efficient large-scale BGP simulations , 2006, Comput. Networks.

[56]  Marek Chrobak,et al.  Reducing Large Internet Topologies for Faster Simulations , 2005, NETWORKING.

[57]  Kathryn S. McKinley,et al.  Hoard: a scalable memory allocator for multithreaded applications , 2000, SIGP.

[58]  R. Albert,et al.  The large-scale organization of metabolic networks , 2000, Nature.