Challenges in managing dependable data systems

Recent work shows how to automatically design storage systems that meet performance and dependability requirements by appropriately selecting and configuring storage devices, and creating snapshot, remote mirror, and traditional backup copies. Although this work represents a solid foundation, users demand an even higher level of functionality: the ability to cost-effectively manage data according to application-centric (or better, business process-centric) performance, dependability and manageability requirements, as these requirements evolve over the data's lifetime. In this paper, we outline several research challenges in managing dependable data systems, including capturing users' high-level goals; translating them into storage-level requirements; and designing, deploying, and analyzing the resulting data systems.

[1]  Gregory R. Ganger,et al.  Ursa minor: versatile cluster-based storage , 2005, FAST'05.

[2]  John Wilkes,et al.  Traveling to Rome: QoS Specifications for Automated Storage System Management , 2001, IWQoS.

[3]  Dirk Beyer,et al.  On the road to recovery: restoring data after disasters , 2006, EuroSys '06.

[4]  Richard Mortier,et al.  Using Magpie for Request Extraction and Workload Modelling , 2004, OSDI.

[5]  Darrell D. E. Long,et al.  Deep Store: an archival storage system architecture , 2005, 21st International Conference on Data Engineering (ICDE'05).

[6]  Marcos K. Aguilera,et al.  Performance debugging for distributed systems of black boxes , 2003, SOSP '03.

[7]  Stefan Savage,et al.  Total Recall: System Support for Automated Availability Management , 2004, NSDI.

[8]  Joseph Hall,et al.  An Experimental Study of Data Migration Algorithms , 2001, WAE.

[9]  Pradeep K. Khosla,et al.  Selecting the Right Data Distribution Scheme for a Survivable Storage System (CMU-CS-01-120) , 2001 .

[10]  Jeffrey S. Chase,et al.  Correlating Instrumentation Data to System States: A Building Block for Automated Diagnosis and Control , 2004, OSDI.

[11]  Dirk Beyer,et al.  Designing for Disasters , 2004, FAST.

[12]  Don Petravick,et al.  Storage Resource Manager , 2004 .

[13]  Dirk Beyer,et al.  Lessons and challenges in automating data dependability , 2004, EW 11.

[14]  Eric Anderson,et al.  Proceedings of the Fast 2002 Conference on File and Storage Technologies Hippodrome: Running Circles around Storage Administration , 2022 .

[15]  Mary Baker,et al.  A fresh look at the reliability of long-term digital storage , 2005, EuroSys.

[16]  Margo I. Seltzer,et al.  Provenance-Aware Storage Systems , 2006, USENIX ATC, General Track.

[17]  Norman C. Hutchinson,et al.  Deciding when to forget in the Elephant file system , 1999, SOSP.

[18]  Eric Anderson,et al.  Selecting RAID Levels for Disk Arrays , 2002, FAST.

[19]  Anastasia Ailamaki,et al.  Continuous resource monitoring for self-predicting DBMS , 2005, 13th IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems.

[20]  Arif Merchant,et al.  Minerva: An automated resource provisioning tool for large-scale storage systems , 2001, TOCS.

[21]  Kimberly Keeton,et al.  A framework for evaluating storage system dependability , 2004, International Conference on Dependable Systems and Networks, 2004.

[22]  Jerome H. Saltzer,et al.  End-to-end arguments in system design , 1984, TOCS.