Extraction process of conceptual model from a document-oriented NoSQL database

NoSQL systems are used to manage massive databases that verify 3V: Volume, Variety and Velocity. Generally, these systems are known by the characteristic "schema less" which means that we can create a database without defining the data schema beforehand. This property offers more flexibility and speed by allowing the evolution of the data model during the exploitation of the base. However, to formulate queries on the database, the user needs a precise knowledge of data model. In this article, we propose a process for the automatic extraction of the conceptual model of a document-oriented NoSQL database. To do this, we use the Model Driven Architecture (MDA) architecture that provides a formal framework for automatic model transformation. From a NoSQL database, we propose a set of transformation rules with QVT to generate the conceptual model in the form of a UML class diagram. An experimentation of the extraction process was carried out on an application in the medical field..

[1]  Meike Klettke,et al.  Schema Extraction and Structural Outlier Detection for JSON-based NoSQL Data Stores , 2015, BTW.

[2]  Frank Budinsky,et al.  Eclipse modeling framework : a developer's guide , 2004 .

[3]  Guan Le,et al.  Survey on NoSQL database , 2011, 2011 6th International Conference on Pervasive Computing and Applications.

[4]  Jacky Akoka,et al.  Model driven reverse engineering of NoSQL property graph databases: The case of Neo4j , 2017, 2017 IEEE International Conference on Big Data (Big Data).

[5]  Amal Ait Brahim,et al.  MDA Process to Extract the Data Model from Document-oriented NoSQL Database , 2019, ICEIS.

[6]  Mark Rouncefield,et al.  Model-driven engineering practices in industry , 2011, 2011 33rd International Conference on Software Engineering (ICSE).

[7]  Carlyna Bondiombouy Query Processing in Cloud Multistore Systems , 2015 .

[8]  C. L. Philip Chen,et al.  Data-intensive applications, challenges, techniques and technologies: A survey on Big Data , 2014, Inf. Sci..

[9]  Dario Colazzo,et al.  Parametric schema inference for massive JSON datasets , 2019, The VLDB Journal.

[10]  Takaaki Goto,et al.  A Framework to Convert NoSQL to Relational Model , 2018, ACIT 2018.

[11]  Fatma Abdelhédi,et al.  Formalizing the Mapping of UML Conceptual Schemas to Column-Oriented Databases , 2018, Int. J. Data Warehous. Min..

[12]  Dario Colazzo,et al.  Schema Inference for Massive JSON Datasets , 2017, EDBT.

[13]  Guy Harrison,et al.  Next Generation Databases , 2015, Apress.

[14]  Jesús García Molina,et al.  Inferring Versioned Schemas from NoSQL Databases and Its Applications , 2015, ER.

[15]  Matteo Golfarelli,et al.  Schema profiling of document-oriented databases , 2018, Inf. Syst..