Efficient, distributed data placement strategies for storage area networks (extended abstract)

In the last couple of years a dramatic growth of enterprise data storage capacity can be observed. As a result, new strategies have been sought that allow servers and storage being centralized to better manage the explosion of data and the overall cost of ownership. Nowadays, a common approach is to combine storage devices into a dedicated network that is connected to LANs and/or servers. Such networks are usually called storage area networks (SAN). A very important aspect for these networks is scalability. If a SAN undergoes changes (for instance, due to insertions or removals of disks), it may be necessary to replace data in order to allow an efficient use of the system. To keep the influence of data replacements on the performance of the SAN small, this should be done as efficiently as possible. In this paper, we investigate the problem of evenly distributing and efficiently locating data in dynamically changing SANs. We consider two scenarios: (1) all disks have the same capacity, and (2) the capacities of the disks are allowed to be arbitrary. For both scenarios, we present placement strategies capable of locating blocks efficiently and that are able to quickly adjust the data placement to insertions or removals of disks or data blocks. Furthermore, we study how the performance of our placement strategies changes if we allow to waste a certain amount of capacity of the disks.

[1]  Eli Upfal,et al.  How to share memory in a distributed system , 1984, JACM.

[2]  David R. Karger,et al.  Consistent hashing and random trees: distributed caching protocols for relieving hot spots on the World Wide Web , 1997, STOC '97.

[3]  Fouad A. Tobagi,et al.  Streaming RAID: a disk array management system for video files , 1993, MULTIMEDIA '93.

[4]  Yitzhak Birk Random RAIDs with selective exploitation of redundancy for high performance video servers , 1997, Proceedings of 7th International Workshop on Network and Operating System Support for Digital Audio and Video (NOSSDAV '97).

[5]  Juan Alemany,et al.  Random Striping for News on Demand Servers Random Striping for News on Demand Servers , 1997 .

[6]  Garth A. Gibson,et al.  Parity declustering for continuous operation in redundant disk arrays , 1992, ASPLOS V.

[7]  Friedhelm Meyer auf der Heide,et al.  Efficient PRAM simulation on a distributed memory machine , 1992, STOC '92.

[8]  Jan H. M. Korst Random duplicated assignment: an alternative to striping in video servers , 1997, MULTIMEDIA '97.

[9]  Jose Renato Santos,et al.  A parallel disk storage system for real-time multimedia applications , 1998, Int. J. Intell. Syst..

[10]  Shahram Ghandeharizadeh,et al.  Staggered striping in multimedia information systems , 1994, SIGMOD '94.

[11]  Eric J. Schwabe,et al.  Flexible Usage of Redundancy in Disk Arrays , 1999, Theory of Computing Systems.

[12]  Richard R. Muntz,et al.  Fault tolerant design of multimedia servers , 1995, SIGMOD '95.

[13]  H KatzRandy,et al.  A case for redundant arrays of inexpensive disks (RAID) , 1988 .

[14]  M. Blaum,et al.  EVENODD: an optimal scheme for tolerating double disk failures in RAID architectures , 1994, Proceedings of 21 International Symposium on Computer Architecture.

[15]  Randy H. Katz,et al.  A case for redundant arrays of inexpensive disks (RAID) , 1988, SIGMOD '88.

[16]  Kurt Mehlhorn,et al.  Randomized and deterministic simulations of PRAMs by par-allel machines with restricted granularity , 1994 .

[17]  Berthier A. Ribeiro-Neto,et al.  Comparing random data allocation and data striping in multimedia servers , 2000, SIGMETRICS '00.

[18]  Jose Renato Santos,et al.  Performance analysis of the RIO multimedia storage system with heterogeneous disk configurations , 1998, MULTIMEDIA '98.

[19]  Randy H. Katz,et al.  Choosing the best storage system for video service , 1995, MULTIMEDIA '95.