A scalable generic transaction model scenario for distributed NoSQL databases

With the development of cloud computing and internet; e-Commerce, e-Business and corporate world revenue are increasing with high rate. These areas not only require scalable and consistent databases but also require inter database transaction support. In this paper, we present, a scalable three-tier architecture along with a distributed middle-ware protocol to support atomic transactions across heterogeneous NoSQL databases. Our methodology does not compromise on any assumption on the accuracy of failure modalities. Hence, it is suitable for a class of heterogeneous distributed systems. To achieve such a target, our architectural model exploits an innovative methodology to achieve distributed atomic transactions. We simulate this architectural setup with different latency tests under different environments to produce reliable impact and correctness. This study proposes a three-tier architecture to facilitate atomic transactions among heterogeneous NoSQL distributed database environment as HBase.The first tier called client tier, deals with user requests. The client tier wraps transaction request and deliver it to a corresponding global transaction manager (GTM), which is assigned by a manager node.The second tier called middle tier, contains a zookeeper, manager nodes, and labour nodes. Zookeeper assigns an active manager node to the client and the active manager node assigns a labour node to process the transaction.The third tier called distributed tier manages multiple heterogeneous NoSQL databases participating with different transactions.Experimental results show that the proposed architecture is scalable and fault tolerated.

[1]  Ali Ghodsi,et al.  Highly Available Transactions: Virtues and Limitations , 2013, Proc. VLDB Endow..

[2]  Simeon C. Ntafos,et al.  The zookeeper route problem , 1992, Inf. Sci..

[3]  Michael Stonebraker,et al.  H-store: a high-performance, distributed main memory transaction processing system , 2008, Proc. VLDB Endow..

[4]  Michael Stonebraker,et al.  10 rules for scalable performance in 'simple operation' datastores , 2011, Commun. ACM.

[5]  Frank Dabek,et al.  Large-scale Incremental Processing Using Distributed Transactions and Notifications , 2010, OSDI.

[6]  Michael J. Cahill Serializable isolation for snapshot databases , 2009, TODS.

[7]  Daniel J. Abadi,et al.  Calvin: fast distributed transactions for partitioned database systems , 2012, SIGMOD Conference.

[8]  Michael Stonebraker,et al.  The End of an Architectural Era (It's Time for a Complete Rewrite) , 2007, VLDB.

[9]  M. N. Vora,et al.  Hadoop-HBase for large-scale data , 2011, Proceedings of 2011 International Conference on Computer Science and Network Technology.

[10]  Daniel J. Abadi,et al.  Low overhead concurrency control for partitioned main memory databases , 2010, SIGMOD Conference.

[11]  Esther Pacitti,et al.  Preventive Multi-master Replication in a Cluster of Autonomous Databases , 2003, Euro-Par.

[12]  Alan Fekete,et al.  Serializable snapshot isolation for replicated databases in high-update scenarios , 2011, Proc. VLDB Endow..

[13]  Ramesh Dharavath Atomicity and Snapshot Isolation on Column Oriented Databases , 2013 .

[14]  Wilson C. Hsieh,et al.  Bigtable: A Distributed Storage System for Structured Data , 2006, TOCS.

[15]  Arthur T. Whitney,et al.  High volume transaction processing without currency control, two phase commit, SQLor C++ , 1997 .

[16]  Tim Hawkins,et al.  The Definitive Guide to MongoDB: The NoSQL Database for Cloud and Desktop Computing , 2010 .

[17]  Bipin C. Desai,et al.  A two-phase commit protocol and its performance , 1996, Proceedings of 7th International Conference and Workshop on Database and Expert Systems Applications: DEXA 96.

[18]  Jim Gray,et al.  A critique of ANSI SQL isolation levels , 1995, SIGMOD '95.

[19]  Sameh Elnikety,et al.  One-copy serializability with snapshot isolation under the hood , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[20]  D. Ramesh,et al.  Design of a transaction recovery instance based on bi-directional ring election algorithm for crashed coordinator in distributed database systems , 2012, 2012 World Congress on Information and Communication Technologies.

[21]  Patrick E. O'Neil,et al.  Precisely Serializable Snapshot Isolation (PSSI) , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[22]  Dan Suciu,et al.  Data on the Web: From Relations to Semistructured Data and XML , 1999 .

[23]  J. Manyika Big data: The next frontier for innovation, competition, and productivity , 2011 .

[24]  Hans De Sterck,et al.  Supporting multi-row distributed transactions with global snapshot isolation using bare-bones HBase , 2010, 2010 11th IEEE/ACM International Conference on Grid Computing.

[25]  Michael Stonebraker Stonebraker on NoSQL and enterprises , 2011, CACM.

[26]  Guan Le,et al.  Survey on NoSQL database , 2011, 2011 6th International Conference on Pervasive Computing and Applications.

[27]  Patrick O'Neil,et al.  Precisely serializable snapshot isolation , 2011 .

[28]  Rick Cattell,et al.  Scalable SQL and NoSQL data stores , 2011, SGMD.

[29]  Rachid Guerraoui,et al.  Implementing e-Transactions with asynchronous replication , 2000, Proceeding International Conference on Dependable Systems and Networks. DSN 2000.

[30]  Rachid Guerraoui,et al.  e-Transactions: End-to-End Reliability for Three-Tier Architectures , 2002, IEEE Trans. Software Eng..

[31]  Chiranjeev Kumar,et al.  Accuracy of Atomic Transaction Scenario for Heterogeneous Distributed Column-Oriented Databases , 2013, ICACNI.

[32]  S. Sudarshan,et al.  Automating the Detection of Snapshot Isolation Anomalies , 2007, VLDB.

[33]  Nick Koudas,et al.  The design of a query monitoring system , 2009, TODS.

[34]  Werner Vogels,et al.  Dynamo: amazon's highly available key-value store , 2007, SOSP.

[35]  C. Kumar,et al.  Implementation of atomicity and snapshot isolation for multi-row transactions on column oriented distributed databases using RDBMS , 2012, 2012 International Conference on Communications, Devices and Intelligent Systems (CODIS).

[36]  Prashant Malik,et al.  Cassandra: a decentralized structured storage system , 2010, OPSR.

[37]  Dennis Shasha,et al.  Making snapshot isolation serializable , 2005, TODS.

[38]  J W Ballard,et al.  Data on the web? , 1995, Science.