On modeling linked open statistical data

Abstract A major part of Open Data concerns statistics such as economic and social indicators. Statistical data are structured in a multidimensional manner creating data cubes. Recently, National Statistical Institutes and public authorities adopted the Linked Data paradigm to publish their statistical data on the Web. Many vocabularies have been created to enable modeling data cubes as RDF graphs, and thus creating Linked Open Statistical Data (LOSD). However, the creation of LOSD remains a demanding task mainly because of modeling challenges related either to the conceptual definition of the cube, or to the way of modeling cubes as linked data. The aim of this paper is to identify and clarify (a) modeling challenges related to the creation of LOSD and (b) approaches to address them. Towards this end, nine LOSD experts were involved in an interactive feedback collection and consensus-building process that was based on Delphi method. We anticipate that the results of this paper will contribute towards the formulation of best practices for creating LOSD, and thus facilitate combining and analyzing statistical data from diverse sources on the Web.

[1]  Michael Schrefl,et al.  From Federated Databases to a Federated Data Warehouse System , 2008, Proceedings of the 41st Annual Hawaii International Conference on System Sciences (HICSS 2008).

[2]  Torben Bach Pedersen,et al.  Dimensional enrichment of statistical linked open data , 2016, J. Web Semant..

[3]  Suzanne D. Pawlowski,et al.  The Delphi method as a research tool: an example, design considerations and applications , 2004, Inf. Manag..

[4]  Murray Turoff,et al.  Delphi: A brief look backward and forward , 2011 .

[5]  Gottfried Vossen,et al.  Towards Self-Service Business Intelligence , 2013 .

[6]  Anindya Datta,et al.  The cube data model: a conceptual model and algebra for on-line analytical processing in data warehouses , 1999, Decis. Support Syst..

[7]  Valentina Janev,et al.  Exploratory spatio-temporal analysis of linked statistical data , 2016, J. Web Semant..

[8]  Efthimios Tambouris,et al.  Challenges on Developing Tools for Exploiting Linked Open Data Cubes , 2015, SemStats@ISWC.

[9]  Martin Necaský,et al.  Publication and usage of official Czech pension statistics Linked Open Data , 2017, J. Web Semant..

[10]  Marijn Janssen,et al.  Big and Open Linked Data (BOLD) in government: A challenge to transparency and privacy? , 2015, Gov. Inf. Q..

[11]  Sören Auer,et al.  A systematic review of open government data initiatives , 2015, Gov. Inf. Q..

[12]  Rodney L. Custer,et al.  The Modified Delphi Technique - A Rotational Modification , 1999 .

[13]  Efthimios Tambouris,et al.  Linked Open Cube Analytics Systems: Potential and Challenges , 2016, IEEE Intelligent Systems.

[14]  Michael Hausenblas,et al.  Exploiting Linked Data to Build Web Applications , 2009, IEEE Internet Computing.

[15]  Jane Hunter,et al.  An ontological approach to dynamic fine-grained Urban Indicators , 2017, ICCS.

[16]  Efthimios Tambouris,et al.  A classification scheme for open government data: towards linking decentralised data , 2011, Int. J. Web Eng. Technol..

[17]  Sören Auer,et al.  Linked SDMX Data: Path to high fidelity Statistical Linked Data , 2015, Semantic Web.

[18]  Efthimios Tambouris,et al.  ICT tools for creating, expanding and exploiting statistical linked Open Data , 2017 .

[19]  Efthimios Tambouris,et al.  Exploiting Linked Statistical Data in Public Administration: The Case of the Greek Ministry of Administrative Reconstruction , 2017, AMCIS.

[20]  Chia-Chien Hsu,et al.  The Delphi Technique: Making Sense of Consensus , 2007 .

[21]  Frank S. C. Tseng,et al.  Integrating heterogeneous data warehouses using XML technologies , 2005, J. Inf. Sci..

[22]  Efthimios Tambouris,et al.  Open Statistics: The Rise of a New Era for Open Data? , 2016, EGOV.

[23]  Luca Cabibbo,et al.  A Logical Approach to Multidimensional Databases , 1998, EDBT.

[24]  Lynn M. Jamieson,et al.  Delivery Methodology of the Delphi: A Comparison of Two Approaches , 2001 .

[25]  Andreas Harth,et al.  Enriching integrated statistical open city data by combining equational knowledge and missing value imputation , 2017, J. Web Semant..

[26]  Marijn Janssen,et al.  Open data policies, their implementation and impact: A framework for comparison , 2014, Gov. Inf. Q..