Adaptive checkpointing in dynamic grids for uncertain job durations
暂无分享,去创建一个
Filip De Turck | Piet Demeester | Bart Dhoedt | Peter A. Vanrolleghem | Maria Chtepen | Filip H. A. Claeys
[1] Filip De Turck,et al. Adaptive Task Checkpointing and Replication: Toward Efficient Fault-Tolerant Grids , 2009, IEEE Transactions on Parallel and Distributed Systems.
[2] Zhiling Lan,et al. Adaptive Fault Management of Parallel Applications for High-Performance Computing , 2008, IEEE Transactions on Computers.
[3] Ramendra K. Sahoo,et al. Evaluating cooperative checkpointing for supercomputing systems , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.
[4] Larry Rudolph,et al. Cooperative checkpointing: a robust approach to large-scale systems reliability , 2006, ICS '06.
[5] Lefteris Angelis,et al. Performance and effectiveness trade‐off for checkpointing in fault‐tolerant distributed systems , 2007, Concurr. Comput. Pract. Exp..
[6] Jehoshua Bruck,et al. An on-line algorithm for checkpoint placement , 1996, Proceedings of ISSRE '96: 7th International Symposium on Software Reliability Engineering.
[7] Hong Chen,et al. Optimizing Adaptive Checkpointing Schemes for Grid Workflow Systems , 2006, 2006 Fifth International Conference on Grid and Cooperative Computing Workshops.