Towards a Systematic Approach to Graph Data Modeling: Scenario-based Design and Experiences

Graph database is recently being adopted by data analytic systems as an appealing alternative to relational database for the management of large-scale inherent graph-like data. A great challenge of leveraging graph database technologies is to model a problem domain into graph. However, in the absence of considering application requirements or goals, current graph data modeling approaches seem to be invalid. This paper presents an exploration of a systematic approach for graph data modeling—SuMo. Starting from real world scenarios, requirements are transformed into a domain model, which acts as an intermediate model in SuMo, captures modeling features of that domain. SuMo defines a set of rules for the subsequent transformation of this domain model to produce a graph model. We applied SuMo to the modeling of a data-intensive analytic system using real datasets as an illustrating example to clarify our main idea. SuMo is empirically evaluated in terms of query performance, the experimental results indicate promising feasibility and efficiency. The major contribution of our work is a preliminary graph data modeling approach based on scenarios. Keywords-graph data modeling; data analytic system; scenariobased modeling; model-driven design

[1]  Roberto De Virgilio,et al.  Converting relational to graph databases , 2013, GRADES.

[2]  Roberto De Virgilio,et al.  Model-Driven Design of Graph Databases , 2014, ER.

[3]  Jim Webber,et al.  Graph Databases: New Opportunities for Connected Data , 2013 .

[4]  Susanna-Assunta Sansone,et al.  Bio-GraphIIn: a graph-based, integrative and semantically-enabled repository for life science experimental data , 2013 .

[5]  Renzo Angles,et al.  A Comparison of Current Graph Database Models , 2012, 2012 IEEE 28th International Conference on Data Engineering Workshops.

[6]  Dewi W. Wardani,et al.  Semantic mapping relational to graph model , 2014, 2014 International Conference on Computer, Control, Informatics and Its Applications (IC3INA).

[7]  Joydeep Ghosh,et al.  Graph databases for large-scale healthcare systems: A framework for efficient data management and data services , 2014, 2014 IEEE 30th International Conference on Data Engineering Workshops.

[8]  Ciro Cattuto,et al.  Time-varying social networks in a graph database: a Neo4j use case , 2013, GRADES.

[9]  Josef Küng,et al.  Semantic Mapping Relational to a Directed Property Hypergraph Model , 2015, 2015 IEEE International Conference on Computer and Information Technology; Ubiquitous Computing and Communications; Dependable, Autonomic and Secure Computing; Pervasive Intelligence and Computing.