DTR: Distributed Transaction Routing in a Large Scale Network

Grid systems provide access to huge storage and computing resources at large scale. Although they were dedicated mainly to scientific computing for years, grids are now considered a viable solution for hosting data-intensive applications. To this end, databases are replicated over the grid to achieve high availability and, through parallelism, fast transaction processing. However, achieving both fast and consistent data access on such architectures is challenging in several respects. In particular, centralized control is ruled out because it is vulnerable and inefficient at large scale. In this article, we propose a novel solution for the distributed control of transaction routing in a large-scale network. We combine a cluster-oriented routing solution with a fully distributed approach that uses a large-scale distributed directory to handle routing metadata. Moreover, we demonstrate the feasibility of our implementation through experimentation: results show linear scale-up, and transaction routing is fast enough to make our solution suitable for update-intensive applications such as worldwide online booking.
