Accuracy study and improvement of network simulation in the SimGrid framework

Distributed computing is a very broad and active research area comprising fields such as cluster computing, computational grids, desktop grids and peer-to-peer (P2P) systems. Studies in this area generally resort to simulations, which enable reproducible results and make it possible to explore wide ranges of platform and application scenarios. In this context, network simulation is certainly the most critical part. Many packet-level network simulators are available and enable high-accuracy simulation, but they lead to prohibitively long simulation times. Therefore, many simulation frameworks have been developed that simulate networks at higher levels, thus enabling fast simulation at the cost of accuracy. One such framework, SimGrid, uses a flow-level approach that approximates the behavior of TCP networks, including TCP's bandwidth sharing properties. A preliminary study in [11] assessed this accuracy loss by comparing SimGrid to popular packet-level simulators and identified the regimes in which SimGrid's accuracy is comparable to theirs. In this article we revisit that study, reproduce its experiments, and provide a deeper analysis that enables us to greatly improve SimGrid's range of validity.

[1] Jack Dongarra, et al. MPI: The Complete Reference, 1996.

[2] Ian T. Foster, et al. GangSim: a simulator for grid scheduling studies, 2005, CCGrid 2005. IEEE International Symposium on Cluster Computing and the Grid, 2005.

[3] J.H. Cowie, et al. Modeling the global Internet, 1999, Comput. Sci. Eng.

[4] Henri Casanova, et al. A Network Model for Simulation of Grid Application, 2002.

[5] Anukool Lakhina, et al. BRITE: Universal Topology Generation from a User's Perspective, 2001.

[6] David M. Nicol, et al. Towards Realistic Million-Node Internet Simulation, 1999, PDPTA.

[7] Sally Floyd, et al. Promoting the use of end-to-end congestion control in the Internet, 1999, TNET.

[8] Satoshi Matsuoka, et al. Overview of a performance evaluation system for global computing scheduling algorithms, 1999, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469).

[9] Laurent Massoulié, et al. Bandwidth sharing: objectives and algorithms, 2002, TNET.

[10] Péter Urbán, et al. Neko: a single environment to simulate and prototype distributed algorithms, 2001, Proceedings 15th International Conference on Information Networking.

[11] Donald F. Towsley, et al. A study of networks simulation efficiency: fluid simulation vs. packet-level simulation, 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[12] George F. Riley, et al. The Georgia Tech Network Simulator, 2003, MoMeTools '03.

[13] Voratas Kachitvichyanukul, et al. Dynamic scheduling II: fast simulation model for grid scheduling using HyperSim, 2003, WSC '03.

[14] Rajkumar Buyya, et al. On incorporating differentiated levels of network service into GridSim, 2007, Future Gener. Comput. Syst.

[15] Dah-Ming Chiu, et al. Some observations on fairness of bandwidth sharing, 2000, Proceedings ISCC 2000. Fifth IEEE Symposium on Computers and Communications.

[16] Bernard M. Waxman, et al. Routing of multipoint connections, 1988, IEEE J. Sel. Areas Commun.

[17] Andrew A. Chien, et al. The MicroGrid: a Scientific Tool for Modeling Computational Grids, 2000, ACM/IEEE SC 2000 Conference (SC'00).

[18] Henri Casanova, et al. SimGrid: A Generic Framework for Large-Scale Distributed Experiments, 2008, Tenth International Conference on Computer Modeling and Simulation (uksim 2008).

[19] Henri Casanova, et al. Speed and accuracy of network simulation in the SimGrid framework, 2007, Valuetools 2007.

[20] Steven H. Low, et al. A duality model of TCP and queue management algorithms, 2003, TNET.

[21] Xin Liu, et al. Traffic-based Load Balance for Scalable Network Emulation, 2003, ACM/IEEE SC 2003 Conference (SC'03).

[22] P. Uthayopas, et al. Fast simulation model for grid scheduling using HyperSim, 2003, Proceedings of the 2003 Winter Simulation Conference, 2003.

[23] Paul Millar, et al. OptorSim: a Simulation Tool for Scheduling and Replica Optimisation in Data Grids, 2005.

[24] Dimitri P. Bertsekas, et al. Data Networks, 1986.

[25] Donald F. Towsley, et al. Modeling TCP throughput: a simple model and its empirical validation, 1998, SIGCOMM '98.

[26] Matthew Mathis, et al. The macroscopic behavior of the TCP congestion avoidance algorithm, 1997, CCRV.