Total order broadcast for fault tolerant exascale systems
暂无分享,去创建一个
Jonathan Appavoo | Orran Krieger | Dan Schatzberg | James Cadden | J. Appavoo | O. Krieger | Dan Schatzberg | James Cadden
[1] Robert Griesemer,et al. Paxos made live: an engineering perspective , 2007, PODC '07.
[2] Richard L. Graham,et al. Preserving Collective Performance across Process Failure for a Fault Tolerant MPI , 2011, 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum.
[3] Greg Bronevetsky,et al. Run-Through Stabilization: An MPI Proposal for Process Fault Tolerance , 2011, EuroMPI.
[4] Darius Buntinas. Scalable Distributed Consensus to Support MPI Fault Tolerance , 2011, EuroMPI.
[5] Michael Isard,et al. Autopilot: automatic data center management , 2007, OPSR.
[6] Dilma Da Silva,et al. Enabling autonomic behavior in systems software with hot swapping , 2003, IBM Syst. J..
[7] Brian F. Cooper. Spanner: Google's globally-distributed database , 2013, SYSTOR '13.
[8] Sape J. Mullender,et al. Distributed systems (2nd Ed.) , 1993 .
[9] Thomas Hérault,et al. An evaluation of User-Level Failure Mitigation support in MPI , 2012, Computing.
[10] Sam Toueg,et al. Unreliable failure detectors for reliable distributed systems , 1996, JACM.
[11] Dilma Da Silva,et al. K42: building a complete operating system , 2006, EuroSys.
[12] George Bosilca,et al. Algorithmic Based Fault Tolerance Applied to High Performance Computing , 2008, ArXiv.
[13] Yawei Li,et al. Megastore: Providing Scalable, Highly Available Storage for Interactive Services , 2011, CIDR.
[14] Leslie Lamport,et al. The part-time parliament , 1998, TOCS.
[15] George Bosilca,et al. Algorithm-based fault tolerance applied to high performance computing , 2009, J. Parallel Distributed Comput..
[16] Benjamin Reed,et al. A simple totally ordered broadcast protocol , 2008, LADIS '08.
[17] Mahadev Konar,et al. ZooKeeper: Wait-free Coordination for Internet-scale Systems , 2010, USENIX ATC.
[18] Fred B. Schneider,et al. Replication management using the state-machine approach , 1993 .