Integration and Virtualization of Relational SQL and NoSQL Systems Including MySQL and MongoDB

NoSQL databases are growing in popularity for Big Data applications in web analytics and supporting large web sites due to their high availability and scalability. Since each NoSQL system has its own API and does not typically support standards such as SQL and JDBC, integrating these systems with other enterprise and reporting software requires extra effort. In this work, we present a generic standards-based architecture that allows NoSQL systems, with specific focus on MongoDB, to be queried using SQL and seamlessly interact with any software supporting JDBC. A virtualization system is built on top of the NoSQL sources that translates SQL queries into the source-specific APIs. The virtualization architecture allows users to query and join data from both NoSQL and relational SQL systems in a single SQL query. Experimental results demonstrate that the virtualization layer adds minimal overhead in translating SQL to NoSQL APIs, and the virtualization system can efficiently perform joins across sources.

[1]  Caetano Sauer,et al.  XQuery processing over NoSQL stores , 2013, Grundlagen von Datenbanken.

[2]  Alon Y. Halevy,et al.  Principles of Data Integration , 2012 .

[3]  Michael Stonebraker,et al.  SQL databases v. NoSQL databases , 2010, CACM.

[4]  Won Kim,et al.  Introduction to Object-Oriented Databases , 1991, Computer systems.

[5]  RahmErhard,et al.  A survey of approaches to automatic schema matching , 2001, VLDB 2001.

[6]  Ramon Lawrence,et al.  Dynamic Database Integration in a JDBC Driver , 2005, ICEIS.

[7]  Rick Cattell,et al.  Scalable SQL and NoSQL data stores , 2011, SGMD.

[8]  F. E. A Relational Model of Data Large Shared Data Banks , 2000 .

[9]  Hakan Hacigümüs,et al.  Partiqle: an elastic SQL engine over key-value stores , 2012, SIGMOD Conference.

[10]  Vanja Josifovski,et al.  SQL/MED: a status report , 2002, SGMD.

[11]  Yun Chi,et al.  CloudDB: One Size Fits All Revived , 2010, 2010 6th World Congress on Services.

[12]  George H. L. Fletcher,et al.  On Bridging Relational and Document-Centric Data Stores , 2013, BNCOD.

[13]  Ronaldo dos Santos Mello,et al.  SimpleSQL: A Relational Layer for SimpleDB , 2012, ADBIS.

[14]  Erhard Rahm,et al.  Generic schema matching, ten years later , 2011, Proc. VLDB Endow..

[15]  José Pereira,et al.  An Effective Scalable SQL Engine for NoSQL Databases , 2013, DAIS.

[16]  David J. DeWitt,et al.  Can the Elephants Handle the NoSQL Onslaught? , 2012, Proc. VLDB Endow..

[17]  Scott Boag,et al.  XQuery 1.0 : An XML Query Language , 2007 .

[18]  C. M. Sperberg-McQueen,et al.  Extensible markup language , 1997 .

[19]  C. M. Sperberg-McQueen,et al.  Extensible Markup Language (XML) , 1997, World Wide Web J..

[20]  Paolo Atzeni,et al.  Uniform Access to Non-relational Database Systems: The SOS Platform , 2012, CAiSE.

[21]  Erhard Rahm,et al.  A survey of approaches to automatic schema matching , 2001, The VLDB Journal.