A Framework for High Availability Based on a Single System Image
暂无分享,去创建一个
[1] J. Duell. The design and implementation of Berkeley Lab's linux checkpoint/restart , 2005 .
[2] Christine Morin,et al. Containers: a sound basis for a true single system image , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.
[3] Christine Morin,et al. A Case for Single System Image Cluster Operating Systems: The Kerrighed Approach , 2003, Parallel Process. Lett..
[4] L. Alvisi,et al. A Survey of Rollback-Recovery Protocols , 2002 .
[5] Leslie Lamport,et al. Distributed snapshots: determining global states of distributed systems , 1985, TOCS.
[6] Christine Morin,et al. A Survey of Recoverable Distributed Shared Memory Systems , 1995 .
[7] Christine Morin,et al. Ghost Process: a Sound Basis to Implement Process Duplication, Migration and Checkpoint/Restart in Linux Clusters , 2005, The 4th International Symposium on Parallel and Distributed Computing (ISPDC'05).
[8] Christine Morin,et al. Kerrighed: A Single System Image Cluster Operating System for High Performance Computing , 2003, Euro-Par.
[9] G. Tortone,et al. OpenMosix approach to build scalable HPC farms with an easy management infrastructure , 2003 .
[10] Andreas Speck. Software Engineering (1) , 2006 .
[11] Brian Randell,et al. System structure for software fault tolerance , 1975, IEEE Transactions on Software Engineering.
[12] Jason Duell,et al. The design and implementation of Berkeley Lab's linuxcheckpoint/restart , 2005 .
[13] Eduardo Pinheiro,et al. Truly-Transparent Checkpointing of Parallel Applications , 1998 .
[14] David L. Russell,et al. State Restoration in Systems of Communicating Processes , 1980, IEEE Transactions on Software Engineering.
[15] Christine Morin,et al. Towards an efficient single system image cluster operating system , 2002, Fifth International Conference on Algorithms and Architectures for Parallel Processing, 2002. Proceedings..
[16] Taesoon Park,et al. Checkpointing and rollback-recovery in distributed systems , 1989 .
[17] Christine Morin,et al. OpenMosix, OpenSSI and Kerrighed: a comparative study , 2005, CCGrid 2005. IEEE International Symposium on Cluster Computing and the Grid, 2005..
[18] Stephen L. Scott,et al. HA-OSCAR: the birth of highly available OSCAR , 2003 .