Model Driven Extraction of NoSQL Databases Schema: Case of MongoDB

Big Data have received a great deal of attention in recent years. Not only the amount of data is on a completely different level than before but also, we have different type of data including factors such as format, structure, and sources. This has definitely changed the tools we need to handle Big Data, giving rise to NoSQL systems. While NoSQL systems have proven their efficiency to handle Big Data, it’s still an unsolved problem how the extraction of a NoSQL database model could be done. This paper proposes an automatic approach for extracting a physical model starting from a document-oriented NoSQL database, including links between different collections. In order to demonstrate the practical applicability of our work, we have realized it in a tool using the Eclipse Modeling Framework environment.

[1]  Jesús García Molina,et al.  Inferring Versioned Schemas from NoSQL Databases and Its Applications , 2015, ER.

[2]  Matteo Golfarelli,et al.  Schema profiling of document-oriented databases , 2018, Inf. Syst..

[3]  Jacky Akoka,et al.  Model driven reverse engineering of NoSQL property graph databases: The case of Neo4j , 2017, 2017 IEEE International Conference on Big Data (Big Data).

[4]  Dario Colazzo,et al.  Schema Inference for Massive JSON Datasets , 2017, EDBT.

[5]  Carlyna Bondiombouy Query Processing in Cloud Multistore Systems , 2015 .

[6]  C. L. Philip Chen,et al.  Data-intensive applications, challenges, techniques and technologies: A survey on Big Data , 2014, Inf. Sci..

[7]  Mark Rouncefield,et al.  Model-driven engineering practices in industry , 2011, 2011 33rd International Conference on Software Engineering (ICSE).

[8]  Dario Colazzo,et al.  Parametric schema inference for massive JSON datasets , 2019, The VLDB Journal.

[9]  Takaaki Goto,et al.  A Framework to Convert NoSQL to Relational Model , 2018, ACIT 2018.

[10]  Guy Harrison,et al.  Next Generation Databases , 2015, Apress.

[11]  Fatma Abdelhédi,et al.  MDA-Based Approach for NoSQL Databases Modelling , 2017, DaWaK.

[12]  Frank Budinsky,et al.  Eclipse modeling framework : a developer's guide , 2004 .

[13]  Guan Le,et al.  Survey on NoSQL database , 2011, 2011 6th International Conference on Pervasive Computing and Applications.

[14]  Meike Klettke,et al.  Schema Extraction and Structural Outlier Detection for JSON-based NoSQL Data Stores , 2015, BTW.