Disk drives are now available with capacities on the order of hundreds of gigabytes. What has not become available is an easy way to manage storage. With installed machines located across the enterprise, the backup, management of application installation, and maintenance of systems have become a nightmare. An increasing trend in the storage industry is to virtualize storage resources, maintaining a central repository that can be accessed across the network. We have designed a network block storage device, Peabody, that exposes virtual disks. These virtual disks provide mechanisms to: recover any previous state of their sectors and share backend storage to improve cache utilization and reduce the total amount of storage needed. Peabody is exposed as an iSCSI target, and is mountable by any iSCSI compatible initiator Using our implementation of Peabody, we show that for our workloads, up to 84% of disk sectors written, contain identical content to previously written sectors, motivating the need for content-based coalescing. The overhead for writing in a simple implementation is only 20 percent of the total write speed. This paper describes our early experiences with the Peabody implementation. We quantify how rapidly storage is consumed, examine optimizations, such as content-based coalescing and describe how recovery is currently implemented. We conclude with future plans based on these measurements.
[1]
Wilson C. Hsieh,et al.
The logical disk: a new approach to improving file systems
,
1994,
SOSP '93.
[2]
Ben Y. Zhao,et al.
OceanStore: an architecture for global-scale persistent storage
,
2000,
SIGP.
[3]
David Teigland,et al.
Volume Managers in Linux
,
2001,
USENIX Annual Technical Conference, FREENIX Track.
[4]
Chandramohan A. Thekkath,et al.
Petal: distributed virtual disks
,
1996,
ASPLOS VII.
[5]
Eric Anderson,et al.
A backup appliance composed of high-capacity disk drives
,
2001,
Proceedings Eighth Workshop on Hot Topics in Operating Systems.
[6]
David Mazières,et al.
A low-bandwidth network file system
,
2001,
SOSP.
[7]
Norman C. Hutchinson,et al.
Deciding when to forget in the Elephant file system
,
1999,
SOSP.
[8]
Craig A. N. Soules,et al.
Metadata Efficiency in a Comprehensive Versioning File System (CMU-CS-02-145)
,
2002
.
[9]
John Wilkes,et al.
UNIX Disk Access Patterns
,
1993,
USENIX Winter.
[10]
Sean Quinlan,et al.
Venti: A New Approach to Archival Storage
,
2002,
FAST.
[11]
A. Broder.
Some applications of Rabin’s fingerprinting method
,
1993
.
[12]
Marvin Theimer,et al.
Feasibility of a serverless distributed file system deployed on an existing set of desktop PCs
,
2000,
SIGMETRICS '00.