Provenance in Open Data Entity-Centric Aggregation

Recently an increasing number of open data catalogs appear on the Web [1]. These catalogs contain data that represents real world entities and their attributes. Data can be imported from several catalogs to build web services; hence there is a need to trace the source of each entity and attribute value in a way that handles also the possible conflicts between attribute values coming from overlapping sources [2]. For open data, source tracing requires capturing both the provenance [3] of the attribute values and the identity links [4] between entities. Moreover, resolving the conflicts manually becomes harder with the increasing size of data.

[1]  Juan Ignacio Pane Fernandez Distributed Identity Management , 2012 .

[2]  Katrin Braunschweig,et al.  The State of Open Data Limits of Current Open Data Platforms , 2012 .

[3]  Fausto Giunchiglia,et al.  Managing Language Diversity Across Cultures: the English-Mongolian Case Study , 2013 .

[4]  Marian Bubak,et al.  Generating Scientific Documentation for Computational Experiments Using Provenance , 2014, IPAW.

[5]  Bettina Berendt,et al.  Crowdsourcing data citation graphs using provenance , 2014 .

[6]  Vasa Curcin,et al.  ProvAbs: model, policy, and tooling for abstracting PROV graphs , 2014, IPAW.

[7]  Carole A. Goble,et al.  LabelFlow: Exploiting Workflow Provenance to Surface Scientific Data Provenance , 2014, IPAW.

[8]  Jun Zhao,et al.  Towards Query Generation for PROV-O Data , 2013 .

[9]  Paul T. Groth,et al.  Provenance: An Introduction to PROV , 2013, Provenance.

[10]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[11]  James Cheney,et al.  An Analytical Survey of Provenance Sanitization , 2014, IPAW.

[12]  Deborah L. McGuinness,et al.  When owl: sameAs Isn't the Same: An Analysis of Identity in Linked Data , 2010, SEMWEB.

[13]  Fausto Giunchiglia,et al.  An experiment in managing language diversity across cultures , 2014 .

[14]  Marta Mattoso,et al.  Applying Provenance to Protect Attribution in Distributed Computational Scientific Experiments , 2014, IPAW.

[15]  Ivan Tankoyeu,et al.  Open Government Data: Fostering Innovation , 2014 .

[16]  Paul T. Groth,et al.  Looking Inside the Black-Box: Capturing Data Provenance Using Dynamic Instrumentation , 2014, IPAW.

[17]  Paul T. Groth,et al.  PROV-O-Viz - Understanding the Role of Activities in Provenance , 2014, IPAW.

[18]  Ian T. Foster,et al.  Auditing and Maintaining Provenance in Software Packages , 2014, IPAW.

[19]  Luc Moreau,et al.  Provenance for Online Decision Making , 2014, IPAW.

[20]  Fausto Giunchiglia,et al.  Approaching Regular Polysemy in WordNet , 2013 .

[21]  Patrick J. Hayes,et al.  When owl: sameAs isn't the Same: An Analysis of Identity Links on the Semantic Web , 2010, LDOW.

[22]  Fausto Giunchiglia,et al.  GeoWordNet: A Resource for Geo-spatial Applications , 2010, ESWC.

[23]  C. Tilmes Formal Provenance Representation of the Data and Information Supporting the National Climate Assessment , 2014 .

[24]  Paolo Missier,et al.  ProvGen: generating synthetic PROV graphs with predictable structure , 2014, IPAW.

[25]  Rik Van de Walle,et al.  A Lightweight Provenance Pingback and Query Service for Web Publications , 2014, IPAW.

[26]  Farshad Hakimpour,et al.  Resolving Semantic Heterogeneity in Schema Integration: an Ontology Based Approach , 2001 .

[27]  Alasdair J. G. Gray Dataset Descriptions for Linked Data Systems , 2014, IEEE Internet Computing.

[28]  Devarshi Ghoshal,et al.  Regenerating and Quantifying Quality of Benchmarking Data Using Static and Dynamic Provenance , 2014, IPAW.

[30]  Marta Mattoso,et al.  Experiences in using provenance to optimize the parallel execution of scientific workflows steered by users , 2014 .

[31]  Elena Console,et al.  Data Fusion , 2009, Encyclopedia of Database Systems.

[32]  Wolfgang Lehner,et al.  Identifying and weighting integration hypotheses on open data platforms , 2012, WOD.

[33]  Deborah L. McGuinness,et al.  Walking into the Future with PROV Pingback: An Application to OPeNDAP Using Prizms , 2014, IPAW.