Self-adaptive Fault-Tolerance of HLA-Based Simulations in the Grid Environment

The objects of a HLA-based simulation can access model services to update their attributes. However, the grid server may be overloaded and refuse the model service to handle objects accesses. Because these objects have been accessed this model service during last simulation loop and their medium state are stored in this server, this may terminate the simulation. A fault-tolerance mechanism must be introduced into simulations. But the traditional fault-tolerance methods cannot meet the above needs because the transmission latency between a federate and the RTI in grid environment varies from several hundred milliseconds to several seconds. By adding model service URLs to the OMT and expanding the HLA services and model services with some interfaces, this paper proposes a self-adaptive fault-tolerance mechanism of simulations according to the characteristics of federates accessing model services. Benchmark experiments indicate that the expanded HLA/RTI can make simulations self-adaptively run in the grid environment.

[1]  Marian Bubak,et al.  Towards a grid management system for HLA-based interactive simulations , 2003, Proceedings Seventh IEEE International Symposium on Distributed Simulation and Real-Time Applications.

[2]  Joefon Jann,et al.  Dynamic reconfiguration: Basic building blocks for autonomic computing on IBM pSeries servers , 2003, IBM Syst. J..

[3]  Stephen John Turner,et al.  Federate migration in HLA-based simulation , 2005, Future Gener. Comput. Syst..

[4]  Stephen John Turner,et al.  Service provisioning for HLA-based distributed simulation on the grid , 2005, Workshop on Principles of Advanced and Distributed Simulation (PADS'05).

[5]  Sudharshan S. Vazhkudai Enabling the co-allocation of grid data transfers , 2003, Proceedings. First Latin American Web Congress.

[6]  Jeffrey O. Kephart,et al.  The Vision of Autonomic Computing , 2003, Computer.

[7]  Stephen John Turner,et al.  Federate Migration in HLA-Based Simulation , 2004, International Conference on Computational Science.

[8]  Qing-Hua Li,et al.  Resource co-allocation via agent-based coalition formation in computational grids , 2003, Proceedings of the 2003 International Conference on Machine Learning and Cybernetics (IEEE Cat. No.03EX693).

[9]  Richard Wolski,et al.  Analyzing Market-Based Resource Allocation Strategies for the Computational Grid , 2001, Int. J. High Perform. Comput. Appl..

[10]  Paul Avery,et al.  Policy based scheduling for simple quality of service in grid computing , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..