Data replication in Mariposa

The Mariposa distributed data manager uses an economic model for managing the allocation of both storage objects and queries to servers. We present extensions to the economic model which support replica management, as well as our mechanisms for propagating updates among replicas. We show how our replica control mechanism can be used to provide consistent, although potentially stale, views of data across many machines without expensive per-transaction synchronization. We present a rule-based conflict resolution mechanism, which can be used to enhance traditional time-stamp serialization. We discuss the effects of our replica system on query processing for both read-only and read-write queries. We further demonstrate how the replication model and mechanisms naturally support name service in Mariposa.

[1]  Wesley W. Chu,et al.  Optimal File Allocation in a Multiple Computer System , 1969, IEEE Transactions on Computers.

[2]  花田 収悦,et al.  Optimal File Allocation in a Multiple Computer System , 1970 .

[3]  Hector Garcia-Molina,et al.  Read-only transactions in a distributed database , 1982, TODS.

[4]  Lawrence W. Dowdy,et al.  Comparative Models of the File Assignment Problem , 1982, CSUR.

[5]  Flaviu Cristian,et al.  An efficient, fault-tolerant protocol for replicated data management , 1985, PODS '85.

[6]  Michael Stonebraker,et al.  The Design of the POSTGRES Storage System , 1988, VLDB.

[7]  Rafael Alonso,et al.  Data caching issues in an information retrieval system , 1990, TODS.

[8]  Michael Stonebraker,et al.  On rules, procedure, caching and views in data base systems , 1990, SIGMOD '90.

[9]  M. Stonebraker,et al.  On Rules, Procedures, Caching and Views in Data Base Systems , 1994, SIGMOD Conference.

[10]  Arthur J. Bernstein,et al.  Bounded ignorance in replicated systems , 1991, PODS.

[11]  Calton Pu,et al.  Replica control in distributed systems: as asynchronous approach , 1991, SIGMOD '91.

[12]  Sushil Jajodia,et al.  An algorithm for dynamic data distribution , 1992, [1992 Proceedings] Second Workshop on the Management of Replicated Data.

[13]  Calton Pu,et al.  A Structural Classification of Integrated Replica Control Mechanisms , 1992 .

[14]  Divyakant Agrawal,et al.  Modular Synchronization in Distributed, Multiversion Databases: Version Control and Concurrency Control , 1993, IEEE Trans. Knowl. Data Eng..

[15]  Stanley B. Zdonik,et al.  An Efficient Scheme for Dynamic Data Replication , 1993 .

[16]  C. Mohan,et al.  Two-phase commit optimizations and tradeoffs in the commercial environment , 1993, Proceedings of IEEE 9th International Conference on Data Engineering.

[17]  Michael Stonebraker,et al.  Mariposa: a new architecture for distributed data , 1994, Proceedings of 1994 IEEE 10th International Conference on Data Engineering.

[18]  Michael Stonebraker,et al.  An economic paradigm for query processing and data migration in Mariposa , 1994, Proceedings of 3rd International Conference on Parallel and Distributed Information Systems.

[19]  Marvin Theimer,et al.  Session guarantees for weakly consistent replicated data , 1994, Proceedings of 3rd International Conference on Parallel and Distributed Information Systems.