A Systematic Approach to System State Restoration during Storage Controller Micro-Recovery
暂无分享,去创建一个
[1] Carl Staelin,et al. The HP AutoRAID hierarchical storage system , 1995, SOSP.
[2] Ling Liu,et al. Enhancing Storage System Availability on Multi-Core Architectures with Recovery-Conscious Scheduling , 2008, FAST.
[3] Yennun Huang,et al. Software rejuvenation: analysis, module and applications , 1995, Twenty-Fifth International Symposium on Fault-Tolerant Computing. Digest of Papers.
[4] Laura L. Pullum,et al. Software Fault Tolerance Techniques and Implementation , 2001 .
[5] Brian Randell,et al. System structure for software fault tolerance , 1975, IEEE Transactions on Software Engineering.
[6] Srikanth Kandula,et al. Flashback: A Lightweight Extension for Rollback and Deterministic Replay for Software Debugging , 2004, USENIX Annual Technical Conference, General Track.
[7] Kishor S. Trivedi. Probability and Statistics with Reliability, Queuing, and Computer Science Applications , 1984 .
[8] Daniel M. Roy,et al. Enhancing Server Availability and Security Through Failure-Oblivious Computing , 2004, OSDI.
[9] George Candea,et al. Microreboot - A Technique for Cheap Recovery , 2004, OSDI.
[10] Angelos D. Keromytis,et al. Using Rescue Points to Navigate Software Recovery (Short Paper) , 2007 .
[11] Yuanyuan Zhou,et al. Rx: treating bugs as allergies---a safe method to survive software failures , 2005, SOSP '05.
[12] Hamid Pirahesh,et al. ARIES: a transaction recovery method supporting fine-granularity locking and partial rollbacks using write-ahead logging , 1998 .
[13] Randy H. Katz,et al. A case for redundant arrays of inexpensive disks (RAID) , 1988, SIGMOD '88.
[14] Koen De Bosschere,et al. RecPlay: a fully integrated practical record/replay system , 1999, TOCS.
[15] D. S. Scott. Making smart investments to reduce unplanned down-time , 1999 .
[16] L. Alvisi,et al. A Survey of Rollback-Recovery Protocols , 2002 .
[17] Leslie Lamport,et al. Time, clocks, and the ordering of events in a distributed system , 1978, CACM.
[18] Jason Nieh,et al. Transparent Checkpoint-Restart of Multiple Processes on Commodity Operating Systems , 2007, USENIX Annual Technical Conference.
[19] Jim Gray,et al. Why Do Computers Stop and What Can Be Done About It? , 1986, Symposium on Reliability in Distributed Software and Database Systems.
[20] Bjarne Stroustrup,et al. The Design and Evolution of C , 1994 .
[21] Angelos D. Keromytis,et al. Using Rescue Points to Navigate Software Recovery , 2007, 2007 IEEE Symposium on Security and Privacy (SP '07).
[22] Andreas Reuter,et al. Transaction Processing: Concepts and Techniques , 1992 .
[23] Mark Russinovich,et al. Replay for concurrent non-deterministic shared-memory applications , 1996, PLDI '96.
[24] E. N. Elnozahy,et al. Checkpointing for peta-scale systems: a look into the future of practical rollback-recovery , 2004, IEEE Transactions on Dependable and Secure Computing.
[25] Andrea C. Arpaci-Dusseau,et al. IRON file systems , 2005, SOSP '05.