Thinking the Incorporation of LOD in Semantic Cubes as a Strategic Decision

With the advent of Linked Open Data (LOD) initiatives, organizations have seen the opportunity of augmenting their internal data cube systems with these external data. While IT actors manage technical issues of internal and external sources, a new actor has emerged “the Chief Data Officer” (CDO), the role of which is to align and prioritize data activities with key organizational priorities and goals. Existing literature managing the incorporation of LOD in internal Data cubes mainly focus on technical aspects of the LOD source and ignores the CDO role in this strategy. In this paper, we claim that technical actions should be conducted by the managerial level, which is reflected through the goals of the organization data cube and their related Key Performance Indicators (KPIs). For doing this, we first propose a metamodel aligning the three models: the data-flow model, the goal model and the KPI model. Then, we propose a process for specifying KPIs into Sparql language, the standard language for querying LOD sources. Experiments are conducted to measure the impact of the decision of integrating external LOD sources at KPI/goal level and on the technical data level. A case tool dedicated to the CDO is implemented to conduct the proposed approach.

[1]  John Mylopoulos,et al.  Conceptual Modeling for Indicator Selection , 2017, Conceptual Modeling Perspectives.

[2]  John Mylopoulos,et al.  Monitoring strategic goals in data warehouses with awareness requirements , 2012, SAC '12.

[3]  Lorena Etcheverry,et al.  Querying Semantic Web Data Cubes , 2016, AMW.

[4]  Torben Bach Pedersen,et al.  Towards a Programmable Semantic Extract-Transform-Load Framework for Semantic Data Warehouses , 2015, DOLAP.

[5]  Lorena Etcheverry,et al.  Modeling and Querying Data Warehouses on the Semantic Web Using QB4OLAP , 2014, DaWaK.

[6]  Kuo-Ming Chao,et al.  OLAP for Multidimensional Semantic Web Databases , 2014, BIRTE.

[7]  John Mylopoulos,et al.  Reasoning with Key Performance Indicators , 2011, PoEM.

[8]  Matteo Golfarelli,et al.  QETL: An approach to on-demand ETL from non-owned data sources , 2017, Data Knowl. Eng..

[9]  John Mylopoulos,et al.  Specification and derivation of key performance indicators for business analytics: A semantic approach , 2017, Data Knowl. Eng..

[10]  Carlos Ordonez,et al.  Value-driven Approach for Designing Extended Data Warehouses , 2019, DOLAP.

[11]  Cristina Dutra de Aguiar Ciferri,et al.  Cube Algebra: A Generic User-Centric Model and Query Language for OLAP Cubes , 2013, Int. J. Data Warehous. Min..

[12]  Alberto Abelló,et al.  Interactive multidimensional modeling of linked data for exploratory OLAP , 2018, Inf. Syst..

[13]  Ladjel Bellatreche,et al.  Managing Data Warehouse Traceability: A Life-Cycle Driven Approach , 2015, CAiSE.

[14]  Alberto Abelló,et al.  Towards Exploratory OLAP on Linked Data , 2016, SEBD.

[15]  Alberto Abelló,et al.  Research in data warehouse modeling and design: dead or alive? , 2006, DOLAP '06.

[16]  Olivier Teste,et al.  OLAP Manipulations on RDF Data following a Constellation Model , 2013, SemStats@ISWC.

[17]  John Mylopoulos,et al.  Strategic business modeling: representation and reasoning , 2014, Software & Systems Modeling.