Graph-based Data Integration in EUDAT Data Infrastructure

European Data Infrastructure (EUDAT) is a distributed research infrastructure offering generic data management services to the research communities. The services deal with different phases of the data life cycle, some of them are tailored to account for special needs of the individual communities or replicated to increase the availability and resilience. All that leads to scattering of the large and heterogeneous data across service landscape limiting discoverability, openness, and data reuse. In this paper, we show how graph database technology can be leveraged to integrated the data across service boundaries. Such an integration will facilitate better cooperation among the researchers, improve searching and increase the openness of the infrastructure. We report on our work in progress, to show how better user experience and enhancement of the services can be achieved by using graph algorithms. Keywords–Data Integration; Graph Databases; Designing for Open Data; Linked Data.

[1]  Tim Berners-Lee,et al.  Linked data , 2020, Semantic Web for the Working Ontologist.

[2]  Yves Raimond,et al.  RDF 1.1 Primer , 2014 .

[3]  Jesse Weaver,et al.  Facebook Linked Data via the Graph API , 2013, Semantic Web.

[4]  Jim Webber,et al.  A programmatic introduction to Neo4j , 2018, SPLASH '12.

[5]  Robert Wilensky,et al.  A framework for distributed digital object services , 2006, International Journal on Digital Libraries.

[6]  Renzo Angles,et al.  A Comparison of Current Graph Database Models , 2012, 2012 IEEE 28th International Conference on Data Engineering Workshops.

[7]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[8]  Venkat N. Gudivada,et al.  Data Management Issues in Big Data Applications , 2015, Big Data 2015.

[9]  Eric J. Evans,et al.  Domain-driven design , 2003 .

[10]  Peter Wittenburg,et al.  Big Data in Science and the EUDAT Project , 2014, 2014 Annual SRII Global Conference.