Optimal response time retrieval of replicated data (extended abstract)

This work deals with the problem of finding efficient access plans for retrieving a set of pages from a multi-disk system with replicated data. We present two results: (a) We solve the problem of finding an optimal access path by transforming it into a network flow problem. We also indicate how our method may be employed in dynamic environments where some (or all) of the disks have a preexisting load, are heterogeneous, or reside on different servers. (b) We present a lower bound for the worst-case response time of a request under all replication schemes, and discuss the replication scheme that achieves this lower bound. We then use simulation to show that this replication scheme can also greatly reduce the average-case response time.
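To make result (a) concrete, the following is a minimal sketch of how a retrieval request can be treated as a flow problem. It is not the construction from the paper itself. It assumes identical disks with no preexisting load and unit read time per page, so the response time is the largest number of pages read from any one disk. The names `assign_pages`, `optimal_plan`, `replicas`, and `cap` are hypothetical. For a trial bound `cap`, the sketch looks for a flow from pages to disks in which every page sends one unit to a disk holding a copy and no disk receives more than `cap` units. It finds this flow with simple augmenting paths and tries bounds in increasing order.

```python
from math import ceil
from typing import Dict, List, Optional, Set, Tuple


def assign_pages(replicas: Dict[int, List[int]], num_disks: int,
                 cap: int) -> Optional[Dict[int, int]]:
    """Assign every page to one disk holding a copy, at most `cap` pages per disk.

    Equivalent to a max-flow check on the network
    source -> page (capacity 1) -> replica disk (capacity 1) -> sink (capacity cap).
    Returns a page -> disk mapping, or None if no such assignment exists.
    """
    assigned: List[List[int]] = [[] for _ in range(num_disks)]

    def augment(page: int, visited: Set[int]) -> bool:
        # Search for an augmenting path that places `page` on some disk.
        for disk in replicas[page]:
            if disk in visited:
                continue
            visited.add(disk)
            if len(assigned[disk]) < cap:
                assigned[disk].append(page)
                return True
            # Disk is full: try to move one of its pages to another replica.
            for i, other in enumerate(assigned[disk]):
                if augment(other, visited):
                    assigned[disk][i] = page
                    return True
        return False

    for page in replicas:
        if not augment(page, set()):
            return None
    return {p: d for d, pages in enumerate(assigned) for p in pages}


def optimal_plan(replicas: Dict[int, List[int]],
                 num_disks: int) -> Tuple[int, Dict[int, int]]:
    """Return the smallest feasible per-disk bound and a plan that meets it."""
    n = len(replicas)
    if n == 0:
        return 0, {}
    # No plan can beat ceil(n / num_disks); no plan needs more than n.
    for cap in range(ceil(n / num_disks), n + 1):
        plan = assign_pages(replicas, num_disks, cap)
        if plan is not None:
            return cap, plan
    raise ValueError("some requested page has no copy on any disk")


# Hypothetical request: four pages, three disks; page 3 exists only on disk 0.
request = {0: [0, 1], 1: [0, 1], 2: [0, 2], 3: [0]}
response_time, plan = optimal_plan(request, num_disks=3)
```

Under the same assumptions, a preexisting load could be modeled by lowering the capacity of a disk's edge to the sink by the number of pages already queued on it. Heterogeneous disks would need per-disk capacities derived from the trial response time.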
